News from EuReCo: Annotations, Applications, and LLM Assistance
- Contrastive corpus linguistics is inherently more resource-intensive than single-language research because it requires at least two corpora that are not only sufficiently representative with respect to the research question and intended language domain but also sufficiently similar to each other. While parallel or translation corpora exist for many languages and domains (cf. Čermák/Rosen 2012) and meet the similarity requirement, their linguistic utility is often compromised by translation effects such as shining-through, over-normalization, and simplification (e.g., Teich 2003; Granger et al. 2003). Comparable corpora are a more effective alternative for capturing authentic cross-linguistic patterns; however, locating or creating such corpora for specific language constellations and domains can be costly and labor-intensive.
| Author: | Beata Trawiński, Marc Kupietz, Nils Diewald |
|---|---|
| URN: | urn:nbn:de:bsz:mh39-134731 |
| URL: | https://korpus.cz/events/iclc11/abstracts/524 |
| Parent Title (German): | ICLC-11 Praha 2025. The 11th International Contrastive Linguistics Conference. Prague, 17–19 September 2025 |
| Publisher: | Filozofická Fakulta Univerzita Karlova |
| Place of publication: | Karlova |
| Document Type: | Conference Proceeding |
| Language: | English |
| Year of first Publication: | 2025 |
| Date of Publication (online): | 2025/10/02 |
| Publishing Institution: | Leibniz-Institut für Deutsche Sprache (IDS) |
| Publication state: | Published version |
| Review state: | Peer-reviewed |
| Tag: | Light Verb Constructions; Comparable Corpora; European Languages; Tools |
| GND Keyword: | Großes Sprachmodell; Kontrastive Linguistik; Korpus <Linguistik> |
| Page Number: | 3 |
| DDC classes: | 400 Language / 400 Language, Linguistics |
| Open Access?: | yes |
| Linguistics-Classification: | Corpus linguistics |
| Program areas: | Grammar |
| Program areas: | Digital Linguistics |
| Licence (German): | Protected by copyright |


