Standards in CLARIN

  • This chapter looks at a fragment of the ongoing work of the CLARIN Standards Committee (CSC) on producing a shared set of recommendations on standards, formats, and related best practices supported by the CLARIN infrastructure and its participating centres. What might at first glance seem a straightforward goal has over the years proven rather complex, reflecting the robustness and heterogeneity of the emerging distributed digital research infrastructure and the various disciplines and research traditions of the language-based humanities that it serves and represents. Part of the chapter therefore reviews the various initiatives and proposals that strove to produce helpful standards-related guidance. The focus then turns to a subtask initiated in late 2019, with its scope narrowed to one of the core activities and responsibilities of CLARIN backbone centres, namely the provision of data deposition services. Centres are obligated to publish their recommendations concerning the repertoire of data formats that are best suited to their research profiles. We look at how the individual centres have met this requirement and suggest that having centres maintain their information in the Standards Information System (SIS) is the way to improve on the current state of affairs.

Author: Piotr Bański, Hanna Hedeland
Parent Title (English): CLARIN. The Infrastructure for language resources
Series (Serial Number): Digital Linguistics (1)
Publisher: de Gruyter
Place of publication: Berlin/Boston
Editor: Darja Fišer, Andreas Witt
Document Type: Part of a Book
Year of first Publication: 2022
Date of Publication (online): 2022/10/17
Publishing Institution: Leibniz-Institut für Deutsche Sprache (IDS)
Tag: CSC; SIS; data deposition; formats; standards
GND Keyword: Data collection (Datenerfassung); Data format (Datenformat); Recommender system (Empfehlungssystem); Research infrastructure (Forschungsinfrastruktur); Standardization (Standardisierung)
First Page: 307
Last Page: 339
DDC classes: 400 Language / 400 Language, Linguistics
Open Access?: yes
Leibniz-Classification: Language, Linguistics
Program areas: S2: Research coordination and infrastructures
Licence (English): Creative Commons - Attribution 4.0 International