Vergleichbare Korpora für multilinguale kontrastive Studien. Herausforderungen und Desiderata
- This contribution aims to show the necessity of working in the development of multilingual corpora and appropriate tools for multilingual contrastive studies. We take the corpus of the lexicographical project COMBIDIGILEX as example to show, how difficultit is to build a suitable data basis to study and compare linguistic phenomena in German, Spanish and Portuguese. Despite the availability of big reference corpora for the three languages (at least for written language), it is not able to obtain a comparable data basis from, because the mentioned corpora are created according to different requirements and they are also powered by disparate information systems and analyse tools. To break the status quo, we plead for increasing research infrastructures by means of compatible language technology and sharing data.
Author: | Meike MelissORCiDGND, Vanessa González RibaoORCiDGND |
---|---|
URN: | urn:nbn:de:bsz:mh39-111511 |
URL: | https://euralex2022.ids-mannheim.de/wp-content/uploads/2022/07/Proceedings_11.07.2022.pdf |
DOI: | https://doi.org/10.14618/ids-pub-11151 |
ISBN: | 978-3-937241-87-6 |
Parent Title (English): | Dictionaries and Society. Proceedings of the XX EURALEX International Congress, 12-16 July 2022, Mannheim, Germany |
Publisher: | IDS-Verlag |
Place of publication: | Mannheim |
Editor: | Annette Klosa-Kückelhaus, Stefan Engelberg, Christine Möhrs, Petra Storjohann |
Document Type: | Part of a Book |
Language: | German |
Year of first Publication: | 2022 |
Date of Publication (online): | 2022/07/21 |
Publishing Institution: | Leibniz-Institut für Deutsche Sprache (IDS) |
Publicationstate: | Veröffentlichungsversion |
Reviewstate: | Peer-Review |
GND Keyword: | Automatische Sprachverarbeitung; Kontrastive Linguistik; Korpus <Linguistik> |
Psyndex Keyword: | Corpus linguistics; comparative corpora; contrastive multilingual linguistics; language technologies |
First Page: | 253 |
Last Page: | 261 |
DDC classes: | 400 Sprache / 400 Sprache, Linguistik |
Open Access?: | ja |
Linguistics-Classification: | Korpuslinguistik |
Linguistics-Classification: | Lexikografie |
Conferences, Workshops: | Dictionaries and Society. Proceedings of the XX EURALEX International Congress, 12-16 July 2022, Mannheim, Germany |
Licence (German): | Creative Commons - CC BY-SA - Namensnennung - Weitergabe unter gleichen Bedingungen 4.0 International |