Metadata for time aligned corpora
- For a detailed description of time aligned corpora, for example spoken language corpora and multimodal corpora, specific metadata categories are necessary, extending the scope of traditional metadata categories. We argue that it is necessary to allow metadata on all levels of annotation, i.e. on a general level for catalogues, on the session level for each recording, on the annotation level for multi tier score annotation, even on the level of individual annotation segments. We use existing standards where they allow this distinction and introduce metadata categories for the layer level.
Author: | Thorsten TrippelORCiDGND |
---|---|
URN: | urn:nbn:de:bsz:mh39-126710 |
ISBN: | 2-9517408-1-6 |
Parent Title (English): | Proceedings of the Workshop: A Registry of Linguistic Data Categories within an Integrated Language Repository Area, Fourth International Conference on Language Resources and Evaluation (LREC 2004) |
Publisher: | European Language Resources Association |
Place of publication: | Luxemburg |
Editor: | Maria Lino, Maria Xavier, Fátima Ferreira, Rute Costa, Raquel Silva |
Document Type: | Conference Proceeding |
Language: | English |
Year of first Publication: | 2004 |
Date of Publication (online): | 2024/05/13 |
Publishing Institution: | Leibniz-Institut für Deutsche Sprache (IDS) |
Publicationstate: | Veröffentlichungsversion |
GND Keyword: | Annotation; Gesprochene Sprache; Korpus <Linguistik>; Metadaten; Multimodales System |
Page Number: | 7 |
DDC classes: | 400 Sprache / 400 Sprache, Linguistik |
Open Access?: | ja |
Linguistics-Classification: | Korpuslinguistik |
Licence (German): | ![]() |