Volltext-Downloads (blau) und Frontdoor-Views (grau)

Trendi - a monitor corpus of Slovene

  • In this paper we present Trendi, a monitor corpus of written Slovene, which has been compiled recently as part of the SLED (Monitor corpus and related resources) project. The methodology and the contents of the corpus are presented, as well as the findings of the survey that aimed to identify the needs of potential users related to topical language use. The Trendi corpus currently contains news articles and other web content from 110 different sources, with the texts being collected and linguistically annotated on a daily basis. The corpus complements Gigafida 2.0, a 1.13-billion-word reference corpus of standard written Slovene. Also discussed are the ways in which the corpus will be integrated into various lexicographic projects, helping not only in the identification of neologisms but also in monitoring changes in already identified language phenomena.

Export metadata

Additional Services

Search Google Scholar


Author:Iztok Kosem
Parent Title (English):Dictionaries and Society. Proceedings of the XX EURALEX International Congress, 12-16 July 2022, Mannheim, Germany
Place of publication:Mannheim
Editor:Annette Klosa-Kückelhaus, Stefan Engelberg, Christine Möhrs, Petra Storjohann
Document Type:Part of a Book
Year of first Publication:2022
Date of Publication (online):2022/08/16
Publishing Institution:Leibniz-Institut für Deutsche Sprache (IDS)
Tag:Monitor corpus; Slovene; language use; lexicography; neologisms; newsfeed; trends
GND Keyword:Korpus <Linguistik>; Lexikographie; Neologismus; Online-Medien; Slowenisch; Sprachgebrauch; geschriebene Sprache
First Page:230
Last Page:239
DDC classes:400 Sprache / 420 Englisch
Open Access?:ja
Conferences, Workshops:Dictionaries and Society. Proceedings of the XX EURALEX International Congress, 12-16 July 2022, Mannheim, Germany
Licence (German):License LogoCreative Commons - CC BY-SA - Namensnennung - Weitergabe unter gleichen Bedingungen 4.0 International