Refine
Year of publication
Document Type
- Article (18)
- Part of a Book (18)
- Conference Proceeding (9)
- Book (6)
- Review (3)
- Doctoral Thesis (1)
- Other (1)
Has Fulltext
- yes (56)
Keywords
- Kontrastive Linguistik (56) (remove)
Publicationstate
- Veröffentlichungsversion (56) (remove)
Reviewstate
Publisher
This conference booklet provides information about 10th International Contrastive Linguistics Conference (ICLC-10) that took place in Mannheim, Germany, from 18 to 21 July 2023. It contains
– a description of the conference aims,
– details on the conference venue,
– information on committees,
– the conference program,
– the abstracts of the keynotes, oral and poster presentations, and
– an author index.
One of the fundamental questions about human language is whether all languages are equally complex. Here, we approach this question from an information-theoretic perspective. We present a large scale quantitative cross-linguistic analysis of written language by training a language model on more than 6500 different documents as represented in 41 multilingual text collections consisting of ~ 3.5 billion words or ~ 9.0 billion characters and covering 2069 different languages that are spoken as a native language by more than 90% of the world population. We statistically infer the entropy of each language model as an index of what we call average prediction complexity. We compare complexity rankings across corpora and show that a language that tends to be more complex than another language in one corpus also tends to be more complex in another corpus. In addition, we show that speaker population size predicts entropy. We argue that both results constitute evidence against the equi-complexity hypothesis from an information-theoretic perspective.
It is well known that the distribution of lexical and grammatical patterns is size- and register-sensitive (Biber 1986, and later publications). This fact alone presents a challenge to many corpus-oriented linguistic studies focusing on a single language. When it comes to cross-linguistic studies using corpora, the challenge becomes even greater due to the lack of high-quality multilingual corpora (Kupietz et al. 2020; Kupietz/Trawiński 2022), which are comparable with respect to the size and the register. That was the motivation for the creation of the European Reference Corpus EuReCo, an initiative started in 2013 at the Leibniz Institute for the German Language (IDS) together with several European partners (Kupietz et al. 2020). EuReCo is an emerging federated corpus, with large virtual comparable corpora across various languages and with an infrastructure supporting contrastive research. The core of the infrastructure is KorAP (Diewald et al. 2016), a scalable open-source platform supporting the analysis and visualisation of properties of texts annotated by multiple and potentially conflicting information layers, and supporting several corpus query languages. Until recently, EuReCo consisted of three monolingual subparts: the German Reference Corpus DeReKo (Kupietz et al. 2018), the Reference Corpus of Contemporary Romanian Language (Barbu Mititelu/Tufiş/Irimia 2018), and the Hungarian National Corpus (Váradi 2002). The goal of the present submission is twofold. On the one hand, it reports about the new component of EuReCo: a sample of the National Corpus of Polish (Przepiórkowski et al. 2010). On the other hand, it presents the results of a new pilot study using the newly extended EuReCo. This pilot study investigates selected Polish collocations involving light verbs and their prepositional / nominal complements (Fig. 1) and extends the collocation analyses of German, Romanian and Hungarian (Fig. 2) discussed in Kupietz/Trawiński (2022).
Comparaison de deux marqueurs d’affirmation dans des séquences de co-construction: voilà et genau
(2016)
This contribution investigates the German response particle genau and the French response particle voilà within collaborative turn sequences in videotaped ordinary conversations. Adopting a conversation analytic approach to cross-linguistic comparison, I will show that the basic epistemic value of both particles allows them to be used in similar sequential environments. When a co-participant formulates a candidate conclusion in environments where it can be easily inferred from previous talk, first speakers may confirm the adequacy of the pre-emptive completion by voilà or genau. These particles may then also be followed by self- or other-repeats. The analyses aim to illustrate that participants rely on a variety of practices in order to positively assess a pre-emptive completion, and to refute a supposed binary opposition of refusal vs. acceptance in the receipt slot.
Both compounds and multi-word expressions are complex lexical units, made up of at least two constituents. The most basic difference is that the former are morphological objects and the latter result from syntactic processes. However, the exact demarcation between compounds and multi-word expressions differs greatly from language to language and is often a matter of debate in and across languages. Similarly debated is whether and how these two different kinds of units complement or compete with each other.
The volume presents an overview of compounds and multi-word expressions in a variety of European languages. Central questions that are discussed for each language concern the formal distinction between compounds and multi-word expressions, their formation and their status in lexicon and grammar.
The volume contains chapters on German, English, Dutch, French, Italian, Spanish, Greek, Russian, Polish, Finnish, and Hungarian as well as a contrastive overview with a focus on German. It brings together insights from word-formation theory, phraseology and theory of grammar and aims to contribute to the understanding of the lexicon, both from a language-specific and cross-linguistic perspective.
Das Deutsche ist eine der am besten erforschten Sprachen der Welt; weniger bekannt ist, welche Gemeinsamkeiten es mit den europäischen Nachbarsprachen teilt und wo seine Besonderheiten liegen.
Die insgesamt acht Kapitel des Buches stellen prägnant und anhand von anschaulichen Beispielen Wortschatz und Grammatik des Deutschen vor. Dabei verhilft ein Vergleich mit den Optionen etwa im Englischen, Französischen, Polnischen, Ungarischen oder anderen europäischen Sprachen zu einem verschärften Blick. Ausgangspunkt ist dabei ein kurzer Abriss der Facetten von Sprache allgemein sowie die Herleitung der grundlegenden Sprachfunktionen aus einer handlungsbezogenen Perspektive. Die folgenden Kapitel stehen unter Motti wie: „Das Verb – Zeiten, Modi, Szenarios und Inszenierungen“, „Der nominale Bereich – die vielerlei Arten, Gegenstände zu konstruieren“ oder „Der Text – wenn wir kohärent und dabei narrativ oder argumentativ werden“. Das letzte Kapitel trägt den Titel: „Das Deutsche – auf dem Weg zu einem Sprachporträt“.
Das Buch soll Sprachinteressierten auch ohne linguistische Fachkenntnisse einen neuen Zugang zu unserer Muttersprache erschließen und die Sensibilität für die sprachliche Verbundenheit auf unserem Kontinent trotz aller Vielfalt stärken.
- Grammatik anschaulich und konkret
- Innovativer Blick auf das Deutsche im Kreis europäischer Sprachen
- Kurzweilige Einführung für Sprachinteressierte auch ohne linguistische Fachkenntnisse