Volltext-Downloads (blau) und Frontdoor-Views (grau)

Metadata for time aligned corpora

  • For a detailed description of time aligned corpora, for example spoken language corpora and multimodal corpora, specific metadata categories are necessary, extending the scope of traditional metadata categories. We argue that it is necessary to allow metadata on all levels of annotation, i.e. on a general level for catalogues, on the session level for each recording, on the annotation level for multi tier score annotation, even on the level of individual annotation segments. We use existing standards where they allow this distinction and introduce metadata categories for the layer level.

Download full text files

Export metadata

Additional Services

Search Google Scholar


Author:Thorsten TrippelORCiDGND
Parent Title (English):Proceedings of the Workshop: A Registry of Linguistic Data Categories within an Integrated Language Repository Area, Fourth International Conference on Language Resources and Evaluation (LREC 2004)
Publisher:European Language Resources Association
Place of publication:Luxemburg
Editor:Maria Lino, Maria Xavier, Fátima Ferreira, Rute Costa, Raquel Silva
Document Type:Conference Proceeding
Year of first Publication:2004
Date of Publication (online):2024/05/13
Publishing Institution:Leibniz-Institut für Deutsche Sprache (IDS)
GND Keyword:Annotation; Gesprochene Sprache; Korpus <Linguistik>; Metadaten; Multimodales System
Page Number:7
DDC classes:400 Sprache / 400 Sprache, Linguistik
Open Access?:ja
Licence (German):License LogoUrheberrechtlich geschützt