OPUS 4 | 410 Linguistik

410 Linguistik

410 Linguistik (142)
411 Schriftsysteme (2)
412 Etymologie (4)
413 Wörterbücher (19)
414 Phonologie, Phonetik (8)
415 Grammatik (8)
417 Dialektologie, historische Linguistik (3)
418 Standardsprache; Angewandte Linguistik (16)
419 Gebärdensprachen (2)

22 search hits

1 to 10

Sort by

Gesprochene Sprache in DaF-Lernwörterbüchern (2016)

Meliss, Meike

Re-designing Online Terminology Resources for German Grammar. Project Report (2016)

Suchowolec, Karolina ; Lang, Christian ; Schneider, Roman

The compilation of terminological vocabularies plays a central role in the organization and retrieval of scientific texts. Both simple keyword lists as well as sophisticated modellings of relationships between terminological concepts can make a most valuable contribution to the analysis, classification, and finding of appropriate digital documents, either on the Web or within local repositories. This seems especially true for long-established scientific fields with various theoretical and historical branches, such as linguistics, where the use of terminology within documents from different origins is sometimes far from being consistent. In this short paper, we report on the early stages of a project that aims at the re-design of an existing domain-specific KOS for grammatical content grammis. In particular, we deal with the terminological part of grammis and present the state-of-the-art of this online resource as well as the key re-design principles. Further, we propose questions regarding ramifications of the Linked Open Data and Semantic Web approaches for our re-design decisions.

Automatic Classification by Topic Domain for Meta Data Generation, Web Corpus Evaluation, and Corpus Comparison (2016)

Schäfer, Roland ; Bildhauer, Felix

In this paper, we describe preliminary results from an ongoing experiment wherein we classify two large unstructured text corpora—a web corpus and a newspaper corpus—by topic domain (or subject area). Our primary goal is to develop a method that allows for the reliable annotation of large crawled web corpora with meta data required by many corpus linguists. We are especially interested in designing an annotation scheme whose categories are both intuitively interpretable by linguists and firmly rooted in the distribution of lexical material in the documents. Since we use data from a web corpus and a more traditional corpus, we also contribute to the important field of corpus comparison and corpus evaluation. Technically, we use (unsupervised) topic modeling to automatically induce topic distributions over gold standard corpora that were manually annotated for 13 coarse-grained topic domains. In a second step, we apply supervised machine learning to learn the manually annotated topic domains using the previously induced topics as features. We achieve around 70% accuracy in 10-fold cross validations. An analysis of the errors clearly indicates, however, that a revised classification scheme and larger gold standard corpora will likely lead to a substantial increase in accuracy.

Rechtssemantik und Rechtspragmatik Konflikte zwischen nationalen und internationalen Gerichten aus rechtslinguistischer Perspektive am Beispiel des Falls Görgülü (2016)

Luth, Janine

Lexikalische Vielfalt und Varianz aus kontrastiver Perspektive. Überlegungen zu einem Produktionswörterbuch aus der Sicht des Deutschen und Spanischen (2016)

Meliss, Meike

DRuKoLA – towards contrastive German-Romanian research based on comparable corpora (2016)

Cosma, Ruxandra ; Cristea, Dan ; Kupietz, Marc ; Tufiş, Dan ; Witt, Andreas

This paper introduces the recently started DRuKoLA-project that aims at providing mechanisms to flexibly draw virtual comparable corpora from the German Reference Corpus DeReKo and the Reference Corpus of Contemporary Romanian Language CoRoLa in order to use these virtual corpora as empirical basis for contrastive linguistic research.

Konflikt und Konfliktbewältigung im Spiegel der Sprache, oder: Plädoyer für die Suche nach einem linguistischen Beitrag zur Befriedung Europas (2016)

Vogel, Friedemann ; Luth, Janine ; Ptashnyk, Stefaniya

Kontrastivnye issledovanija grammatičeskich kategorij. (2016)

Trawiński, Beata

Der Aufsatz knüpft an die Diskussion zur Verwendung von formalen grammatischen Kategorien im Sprachvergleich an (vgl. insbesondere Haspelmath 2007, 2010a, b und Newmeyer 2007, 2010). Es wird dabei nicht danach gefragt, ob sprachübergreifende grammatische Kategorien (oder genauer gesagt Kategorienausprägungen) existieren oder nicht bzw. ob einzelsprachliche grammatische Kategorien im Sprachvergleich sinnvoll einsetzbar sind, sondern wie ähnlich bzw. unterschiedlich einzelsprachliche Kategorien bzw. Kategorisierungen sind. Das Ziel ist damit, eine Methode zur Messung des Äquivalenzgrades von grammatischen Kategorien in verschiedenen Sprachen zu präsentieren; dies wird am Beispiel des IMPERATIVS im Deutschen, Englischen, Polnischen und Tschechischen illustriert.

‘Patientenautonomie’ und ‘Lebensschutz’. Eine empirische Studie zu agonalen Zentren im Rechtsdiskurs über Sterbehilfe (2016)

Felder, Ekkehard ; Luth, Janine ; Vogel, Friedemann

On the basis of a law text corpus which consists of judicial decisions and jurisprudential papers on so-called assisted suicide from 1977 to 2011, agonal centres are determined within the paradigm of corpus-based pragma-semiotic text analysis. Agonal centres are defined as action-guiding concepts that are in conflict with each other concerning the general acceptance of event interpretations, options for actions, claims of validity, contextual knowledge and values. These action-guiding concepts are derived with the help of quantitative and qualitative methods. Discourse linguistic interpretations are thus rendered more objective with the help of semi-automatic methods; furthermore, specific discourse features of the discourse and approaches to interpretation can be derived from (un)expected linguistic significances of occurrence, distribution, frequency etc. at the linguistic surface. Finally, these agonal centres specific to the language of law are compared to agonal centres which are determined on the basis of a media corpus on the same issue. This provides a comparative insight into the constitution of a seemingly identical fact in everyday and special language, which demonstrates the sociopolitical relevance of analysing the constitution of reality as instructed by language.

Corpus-driven description of multi-word patterns (2016)

Steyer, Kathrin

This paper presents our model of ‘MultiWord Patterns’ (MWPs). MWPs are defined as recurrent frozen schemes with fixed lexical components and productive slots that have a holistic – but not necessarily idiomatic – meaning and/or function, sometimes only on an abstract level. These patterns can only be reconstructed with corpus-driven, iterative (qualitative-quantitative) methods. This methodology includes complex phrase searches, collocation analysis that not only detects significant word pairs, but also significant syntagmatic cotext patterns and slot analysis with our UWV Tool. This tool allows us to bundle KWICs in order to detect the nature of lexical fillers for and to visualize MWP hierarchies.

1 to 10

Open Access

410 Linguistik

Refine

Author

Year of publication

Document Type

Language

Has Fulltext

Is part of the Bibliography

Keywords

Publicationstate

Reviewstate

Publisher

22 search hits