Volltext-Downloads (blau) und Frontdoor-Views (grau)

The TEI-based ISO standard “Transcription of Spoken Language” as an exchange format within CLARIN and beyond

  • This paper describes the TEI-based ISO standard 2462:2016 “Transcription of spoken language” and other formats used within CLARIN for spoken language resources. It assesses the current state of support for the standard and the interoperability between these formats and with relevant tools and services. The main idea behind the paper is that a digital infrastructure providing language resources and services to researchers should also allow the combined use of resources and/or services from different contexts. This requires syntactic and semantic interoperability. We propose a solution based on the ISO/TEI format and describe the necessary steps for this format to work as an exchange format with basic semantic interoperability for spoken language resources across the CLARIN infrastructure and beyond.

Export metadata

Additional Services

Search Google Scholar


Author:Hanna HedelandORCiD, Thomas SchmidtORCiD
Parent Title (English):Proceedings of CLARIN Annual Conference 2021. 27 – 29 September 2021, Virtual Edition
Place of publication:Utrecht
Editor:Monica Monachini, Maria Eskevich
Document Type:Conference Proceeding
Year of first Publication:2021
Date of Publication (online):2021/10/14
Publishing Institution:Leibniz-Institut für Deutsche Sprache (IDS)
GND Keyword:Computerlinguistik; Datenmanagement; ISO-Norm; Korpus <Linguistik>; Mündliche Kommunikation; Text Encoding Initiative; Transkription
First Page:100
Last Page:104
DDC classes:400 Sprache / 400 Sprache, Linguistik
Open Access?:ja
Leibniz-Classification:Sprache, Linguistik
Program areas:P2: Mündliche Korpora
Licence (English):License LogoCreative Commons - Attribution 4.0 International