Refine
Year of publication
Document Type
- Article (5)
- Part of a Book (4)
- Conference Proceeding (3)
- Book (1)
- Other (1)
Language
- German (10)
- English (3)
- Portuguese (1)
Has Fulltext
- yes (14)
Keywords
- Komposition <Wortbildung> (14) (remove)
Publicationstate
- Veröffentlichungsversion (14) (remove)
Reviewstate
- (Verlags)-Lektorat (6)
- Peer-Review (5)
Publisher
In diesem Beitrag werden Komposita mit den relationalen Zweitgliedern Gatte und Gattin aus genderlinguistischer Perspektive untersucht, basierend auf manuell annotiertem zeitungssprachlichen Korpusmaterial. Frauen werden im analysierten Korpus ca. 12-mal häufiger in ihrer ehelichen Rolle versprachlicht als Männer. Statistische Analysen zeigen, dass sie dabei systematisch in ein possessives Verhältnis zum Ehemann gesetzt werden (Arztgattin = Gattin eines Arztes), während Ehemänner in den untersuchten Komposita tendenziell doppelt individualisiert werden (Arztgatte = Gatte, der Arzt ist). Neben den Zweitgliedern geben auch die Genera der beiden Konstituenten Aufschluss über die kodierte Bedeutungsrelation: Genusgleichheit (Kanzlergatte) führt zu einer qualifizierenden, Genusdivergenz (Kanzleringatte) zu einer possessiven Lesart. Die Analyse belegt außerdem die Existenz movierter Kompositumserstglieder – diese sind sogar die häufigste Form zur Benennung weiblicher Personen im Erstglied. Trotzdem herrscht bei der Bezugnahme auf Frauen eine größere Formenvarianz als bei Männern, welche fast ausschließlich mit maskulinen Erstgliedern versprachlicht werden. Damit zeigt die Studie, wie genderlinguistische Perspektiven auch im Bereich der Wortbildung einen neuen Analysezugang bilden.
KoMuX, der Kompositamuster-Explorer, (www.owid.de/plus/komux) ist eine Webanwendung, die es ermöglicht, mehr als 50.000 nominale Komposita des Deutschen gezielt nach abstrakten oder lexikalisch-teilspezifizierten Mustern zu durchsuchen. Unterschiedliche Visualisierungen helfen dabei, Strukturen und Zusammenhänge innerhalb der Ergebnismenge zu erfassen.
The automatic recognition of idioms poses a challenging problem for NLP applications. Whereas native speakers can intuitively handle multiword expressions whose compositional meanings are hard to trace back to individual word semantics, there is still ample scope for improvement regarding computational approaches. We assume that idiomatic constructions can be characterized by gradual intensities of semantic non-compositionality, formal fixedness, and unusual usage context, and introduce a number of measures for these characteristics, comprising count-based and predictive collocation measures together with measures of context (un)similarity. We evaluate our approach on a manually labelled gold standard, derived from a corpus of German pop lyrics. To this end, we apply a Random Forest classifier to analyze the individual contribution of features for automatically detecting idioms, and study the trade-off between recall and precision. Finally, we evaluate the classifier on an independent dataset of idioms extracted from a list of Wikipedia idioms, achieving state-of-the art accuracy.
Corona- und andere Partys
(2020)
Both compounds and multi-word expressions are complex lexical units, made up of at least two constituents. The most basic difference is that the former are morphological objects and the latter result from syntactic processes. However, the exact demarcation between compounds and multi-word expressions differs greatly from language to language and is often a matter of debate in and across languages. Similarly debated is whether and how these two different kinds of units complement or compete with each other.
The volume presents an overview of compounds and multi-word expressions in a variety of European languages. Central questions that are discussed for each language concern the formal distinction between compounds and multi-word expressions, their formation and their status in lexicon and grammar.
The volume contains chapters on German, English, Dutch, French, Italian, Spanish, Greek, Russian, Polish, Finnish, and Hungarian as well as a contrastive overview with a focus on German. It brings together insights from word-formation theory, phraseology and theory of grammar and aims to contribute to the understanding of the lexicon, both from a language-specific and cross-linguistic perspective.
The present paper deals with grammaticalization as a comprehensive model of erosive processes in the history of natural languages, exemplified in German and Brazilian Portuguese. Grammaticalization is conceived of as the reduction of pragmatic versatility, semantic concreteness, syntactic liberty and phonetic substance of linguistic elements. It is subdivided into the processes of lexicalization, which transforms polylexematic into monolexematic elements, and deslexicalization, which reduces lexematic to sublexematic elements. In the middle of these processes stands the lexicon, which is seen as the central stock of linguistic elements. Within the lexicon, the process of grammaticalization continues, from lexical word classes through intermediate classes to grammatical word classes. The lower boundary of the lexicon is a critical threshold, down to which the process of grammaticalization is compensated for by linguistic recycling that leads lexematic elements back into the linguistic circuit, through the formation of new polylexematic units. Beyond this threshold, however, no recycling is possible any more, so that elements which have once lost their lexical character are condemned to disappear in the long run. The different stages of grammaticalization are introduced and illustrated by means of concrete examples, first from Brazilian Portuguese and afterwards from German.