Volltext-Downloads (blau) und Frontdoor-Views (grau)

Towards an optimum degree of order in the field of language resources

  • This chapter explores the role of standards and best practices in the creation, sharing, and sustainable use of language resources. As linguistic data becomes increasingly complex and varied, ranging from continuous corpora to multimodal annotations, the need for interoperability and long-term accessibility has grown. This chapter shows how encoding practices, metadata, and annotation frameworks contribute to the transparency, reproducibility, and reusability of language data. Emphasis is placed on the importance of consensus-driven standards, such as those developed by ISO (International Organization for Standardization), and the practical implementation of best practices within academic, archival, and computational contexts. By detailing the responsibilities and perspectives of data providers, analysts, and hosts, this chapter offers a comprehensive guide to achieving an optimum degree of order in the language resource ecosystem.

Export metadata

Additional Services

Search Google Scholar

Statistics

frontdoor_oas
Metadaten
Author:Piotr BańskiORCiDGND, Ulrich HeidORCiDGND, Laura HerzbergORCiDGND
URN:urn:nbn:de:bsz:mh39-134762
DOI:https://doi.org/10.1515/9783112208212
ISBN:978-3-11-220821-2
ISSN:2751-1286
Parent Title (English):Harmonizing language data: Standards for linguistic resources
Series (Serial Number):Digital Linguistics (4)
Publisher:De Gruyter
Place of publication:Berlin/Boston
Editor:Piotr BańskiORCiDGND, Ulrich HeidORCiDGND, Laura HerzbergORCiDGND
Document Type:Part of a Book
Language:English
Year of first Publication:2025
Date of Publication (online):2025/10/02
Publishing Institution:Leibniz-Institut für Deutsche Sprache (IDS)
Publicationstate:Veröffentlichungsversion
Reviewstate:(Verlags)-Lektorat
Tag:Data interoperability; FAIR principles; Language resources; Research data management; Standards and best practices
GND Keyword:Computerlinguistik; Digital Humanities; FAIR data principles; Sprachdaten
First Page:1
Last Page:16
DDC classes:400 Sprache / 400 Sprache, Linguistik
Open Access?:ja
Linguistics-Classification:Computerlinguistik
Program areas:Grammatik
Program areas:Digitale Sprachwissenschaft
Licence (English):License LogoCreative Commons - Attribution 4.0 International