Towards comprehensive definitions of data quality for audiovisual annotated language resources
- Though digital infrastructures such as CLARIN have been successfully established and now provide large collections of digital resources, the lack of widely accepted standards for data quality and documentation still makes re-use of research data a difficult endeavour, especially for more complex resource types. The article gives a detailed overview over relevant characteristics of audiovisual annotated language resources and reviews possible approaches to data quality in terms of their suitability for the current context. Conclusively, various strategies are suggested in order to arrive at comprehensive and adequate definitions of data quality for this specific resource type and possibly for digital language resources in general.
Author: | Hanna HedelandORCiD |
---|---|
URN: | urn:nbn:de:bsz:mh39-105189 |
DOI: | https://doi.org/10.3384/ecp18011 |
ISBN: | 978-91-7929-609-4 |
ISSN: | 1650-3740 |
Parent Title (English): | Selected Papers from the CLARIN Annual Conference 2020. Virtual Event, 2020, 5-7 October |
Series (Serial Number): | Linköping Electronic Conference Proceedings (180) |
Publisher: | Linköping University Electronic Press |
Place of publication: | Linköping |
Editor: | Costanza Navarretta, Maria Eskevich |
Document Type: | Conference Proceeding |
Language: | English |
Year of first Publication: | 2021 |
Date of Publication (online): | 2021/07/23 |
Publicationstate: | Veröffentlichungsversion |
Reviewstate: | Peer-Review |
Tag: | CLARIN; TEI; audiovisual data; data quality; spoken corpora |
GND Keyword: | Audiovisuelles Material; Datenmanagement; Datenqualität; Forschungsdaten; Korpus <Linguistik>; Metadaten; Text Encoding Initiative |
First Page: | 93 |
Last Page: | 103 |
Note: | A previous version of this article was published in: "Proceedings of CLARIN Annual Conference 2020. 05 – 07 October 2020, Online Edition", see http://nbn-resolving.de/urn:nbn:de:bsz:mh39-100760. |
DDC classes: | 400 Sprache / 400 Sprache, Linguistik |
Open Access?: | ja |
Leibniz-Classification: | Sprache, Linguistik |
Linguistics-Classification: | Computerlinguistik |
Linguistics-Classification: | Korpuslinguistik |
Program areas: | P2: Mündliche Korpora |
Licence (English): | ![]() |