The ISOcat registry reloaded
- The linguistics community is building a metadata-based infrastructure for the description of its research data and tools. At its core is the ISOcat registry, a collaborative platform to hold a (to be standardized) set of data categories (i.e., field descriptors). Descriptors have definitions in natural language and little explicit interrelations. With the registry growing to many hundred entries, authored by many, it is becoming increasingly apparent that the rather informal definitions and their glossary-like design make it hard for users to grasp, exploit and manage the registry’s content. In this paper, we take a large subset of the ISOcat term set and reconstruct from it a tree structure following the footsteps of schema.org. Our ontological re-engineering yields a representation that gives users a hierarchical view of linguistic, metadata-related terminology. The new representation adds to the precision of all definitions by making explicit information which is only implicitly given in the ISOcat registry. It also helps uncovering and addressing potential inconsistencies in term definitions as well as gaps and redundancies in the overall ISOcat term set. The new representation can serve as a complement to the existing ISOcat model, providing additional support for authors and users in browsing, (re-)using, maintaining, and further extending the community’s terminological metadata repertoire.
Author: | Claus ZinnORCiDGND, Christina HoppermannORCiD, Thorsten TrippelORCiDGND |
---|---|
URN: | urn:nbn:de:bsz:mh39-108624 |
DOI: | https://doi.org/10.1007/978-3-642-30284-8_26 |
ISBN: | 978-3-642-30284-8 |
ISSN: | 1611-3349 |
Parent Title (English): | The Semantic Web: Research and Applications. 9th Extended Semantic Web Conference, ESWC 2012, Heraklion, Crete, Greece, May 27-31, 2012. Proceedings. |
Series (Serial Number): | Lecture Notes in Computer Science (7295) |
Publisher: | Springer |
Place of publication: | Berlin/Heidelberg |
Editor: | Elena Simperl, Philipp Cimiano, Axel Polleres, Oscar Corcho, Valentina Presutti |
Document Type: | Conference Proceeding |
Language: | English |
Year of first Publication: | 2012 |
Date of Publication (online): | 2022/01/11 |
Publishing Institution: | Leibniz-Institut für Deutsche Sprache (IDS) |
Publicationstate: | Zweitveröffentlichung |
Publicationstate: | Postprint |
Reviewstate: | Peer-Review |
Tag: | ISOcat registry; concept scheme; concept system; conceptual domain; data category; relation registry |
GND Keyword: | Forschungsdaten; Infrastruktur; Metadaten; Natürliche Sprache; Terminologie |
First Page: | 285 |
Last Page: | 299 |
DDC classes: | 400 Sprache / 400 Sprache, Linguistik |
Open Access?: | ja |
Linguistics-Classification: | Computerlinguistik |
Licence (German): | Urheberrechtlich geschützt |