The TEI-based ISO Standard ‘Transcription of spoken language’ as an Exchange Format within CLARIN and beyond
- This paper describes the TEI-based ISO standard 24624:2016 ‘Transcription of spoken language’ and other formats used within CLARIN for spoken language resources. It assesses the current state of support for the standard and the interoperability between these formats and with relevant tools and services. The main idea behind the paper is that a digital infrastructure providing language resources and services to researchers should also allow the combined use of resources and/or services from different contexts. This requires syntactic and semantic interoperability. We propose a solution based on the ISO/TEI format and describe the necessary steps for this format to work as an exchange format with basic semantic interoperability for spoken language resources across the CLARIN infrastructure and beyond.
Author: | Hanna Hedeland, Thomas Schmidt |
---|---|
URN: | urn:nbn:de:bsz:mh39-111343 |
DOI: | https://doi.org/10.3384/9789179294441 |
ISBN: | 978-91-7929-444-1 |
ISSN: | 1650-3740 |
Parent Title (English): | Selected Papers from the CLARIN Annual Conference 2021. Virtual Event, 2021, 27–29 September |
Series (Serial Number): | Linköping Electronic Conference Proceedings (189) |
Publisher: | Linköping University Electronic Press |
Place of publication: | Linköping |
Editor: | Monica Monachini, Maria Eskevich |
Document Type: | Conference Proceeding |
Language: | English |
Year of first Publication: | 2022 |
Date of Publication (online): | 2022/07/18 |
Publishing Institution: | Leibniz-Institut für Deutsche Sprache (IDS) |
Publication state: | Published version |
Review state: | Peer-reviewed |
Tag: | FAIR data; ISO/TEI; interoperability; spoken language; transcription |
GND Keyword: | Annotation; CLARIN; Data management; Research data; Spoken language; Corpus (linguistics); Language translation |
First Page: | 34 |
Last Page: | 45 |
DDC classes: | 400 Language / 420 English |
Open Access?: | yes |
Leibniz-Classification: | Language, Linguistics |
Linguistics-Classification: | Computational linguistics |
Linguistics-Classification: | Corpus linguistics |
Program areas: | P2: Spoken corpora |