Trendi - a monitor corpus of Slovene
- In this paper we present Trendi, a monitor corpus of written Slovene, which has been compiled recently as part of the SLED (Monitor corpus and related resources) project. The methodology and the contents of the corpus are presented, as well as the findings of the survey that aimed to identify the needs of potential users related to topical language use. The Trendi corpus currently contains news articles and other web content from 110 different sources, with the texts being collected and linguistically annotated on a daily basis. The corpus complements Gigafida 2.0, a 1.13-billion-word reference corpus of standard written Slovene. Also discussed are the ways in which the corpus will be integrated into various lexicographic projects, helping not only in the identification of neologisms but also in monitoring changes in already identified language phenomena.
Author: | Iztok Kosem |
---|---|
URN: | urn:nbn:de:bsz:mh39-111808 |
URL: | https://euralex2022.ids-mannheim.de/wp-content/uploads/2022/07/Proceedings_11.07.2022.pdf |
DOI: | https://doi.org/10.14618/ids-pub-11180 |
ISBN: | 978-3-937241-87-6 |
Parent Title (English): | Dictionaries and Society. Proceedings of the XX EURALEX International Congress, 12-16 July 2022, Mannheim, Germany |
Publisher: | IDS-Verlag |
Place of publication: | Mannheim |
Editor: | Annette Klosa-Kückelhaus, Stefan Engelberg, Christine Möhrs, Petra Storjohann |
Document Type: | Part of a Book |
Language: | English |
Year of first Publication: | 2022 |
Date of Publication (online): | 2022/08/16 |
Publishing Institution: | Leibniz-Institut für Deutsche Sprache (IDS) |
Publicationstate: | Veröffentlichungsversion |
Reviewstate: | Peer-Review |
Tag: | Monitor corpus; Slovene; language use; lexicography; neologisms; newsfeed; trends |
GND Keyword: | Korpus <Linguistik>; Lexikographie; Neologismus; Online-Medien; Slowenisch; Sprachgebrauch; geschriebene Sprache |
First Page: | 230 |
Last Page: | 239 |
DDC classes: | 400 Sprache / 420 Englisch |
Open Access?: | ja |
Conferences, Workshops: | Dictionaries and Society. Proceedings of the XX EURALEX International Congress, 12-16 July 2022, Mannheim, Germany |
Licence (German): | ![]() |