Devil’s advocate on metadata in science
This paper takes a devil’s advocate position to highlight the benefits of creating metadata for linguistic resources. It gives an overview of the required metadata infrastructure and shows that this infrastructure has by now been developed by various projects and can therefore be deployed by those who work with and archive linguistic resources. Possible objections to metadata creation are raised, starting with user requirements and backgrounds, the contribution to researchers’ academic merit, and standardisation. These objections are answered with existing technologies and procedures, with reference to the Component Metadata Infrastructure (CMDI). CMDI provides an infrastructure and methods for adapting metadata to the requirements of specific classes of resources, using central registries for data categories and metadata schemas. These registries allow metadata schemas to be defined per resource type while reusing groups of data categories that other schemas also use. The paper closes with rules of best practice for the creation of metadata.
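The reuse idea described in the abstract can be illustrated with a minimal Python sketch. It is not CMDI's actual XML format or registry interface: the class names, the data category identifiers (`dc-title` and so on), and the two example resource types are hypothetical, chosen only to show how two schemas for different resource types might share one group of data categories.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class DataCategory:
    identifier: str  # hypothetical registry key, not a real registry entry
    name: str


@dataclass(frozen=True)
class Component:
    """A reusable group of data categories."""
    name: str
    categories: tuple


@dataclass
class Profile:
    """A metadata schema for one resource type, built from components."""
    resource_type: str
    components: list = field(default_factory=list)

    def category_names(self):
        # Flatten the components into the list of fields the schema offers.
        return [cat.name for comp in self.components for cat in comp.categories]


# Hypothetical entries in a central data category registry.
title = DataCategory("dc-title", "title")
creator = DataCategory("dc-creator", "creator")
language = DataCategory("dc-language", "language")
modality = DataCategory("dc-modality", "modality")
tagset = DataCategory("dc-tagset", "tagset")

# Components group data categories; general_info is shared by both profiles.
general_info = Component("GeneralInfo", (title, creator, language))
recording = Component("Recording", (modality,))
annotation = Component("Annotation", (tagset,))

speech_corpus = Profile("speech corpus", [general_info, recording])
treebank = Profile("treebank", [general_info, annotation])

# Components that both schemas reuse.
shared = [c for c in speech_corpus.components if c in treebank.components]
```

In this sketch each resource type gets its own schema, yet the general descriptive fields are defined once and reused, which is the property the abstract attributes to the registries.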
Author: | Christina Hoppermann, Thorsten Trippel, Claus Zinn |
---|---|
URN: | urn:nbn:de:bsz:mh39-108728 |
URL: | http://exmaralda.org/gscl2011/downloads/AZM96.pdf |
ISSN: | 0176-599X |
Parent Title (English): | Multilingual Resources and Multilingual Applications. Proceedings of the Conference of the German Society for Computational Linguistics and Language Technology (GSCL) 2011. |
Series (Serial Number): | Arbeiten zur Mehrsprachigkeit : Folge B, Sonderforschungsbereich 538 (96) |
Publisher: | Universität Hamburg - Sonderforschungsbereich 538 |
Place of publication: | Hamburg |
Editor: | Hanna Hedeland, Thomas Schmidt, Kai Wörner |
Document Type: | Conference Proceeding |
Language: | English |
Year of first Publication: | 2011 |
Date of Publication (online): | 2022/01/19 |
Publishing Institution: | Leibniz-Institut für Deutsche Sprache (IDS) |
Publication state: | Published version |
Review state: | Peer-reviewed |
Tag: | Component Metadata Infrastructure (CMDI); infrastructure; metadata; sustainable archives |
GND Keyword: | Datenmanagement; Forschung; Infrastruktur; Metadaten; Normung |
First Page: | 105 |
Last Page: | 109 |
DDC classes: | 400 Language / 400 Language, Linguistics |
Open Access?: | yes |
Linguistics-Classification: | Computational linguistics |
Licence (German): | Urheberrechtlich geschützt (protected by copyright) |