Shifting complexity from text to data model. Adding machine-oriented features to a human-oriented terminology resource.
Grammis is a web-based information system on German grammar, hosted by the Institute for the German Language (IDS). It is human-oriented and features different theoretical perspectives on grammar. Currently, the terminology component of grammis is being redesigned for this theoretical diversity to play a more prominent role in the data model. This also opens opportunities for implementing some machine-oriented features. In this paper, we present the redesign of both data model and knowledge base. We explore how the addition of machine-oriented features to the data model impacts the knowledge base; in particular, how this addition shifts some of the textual complexity into the data model. We show that our resource can easily be ported to a SKOS-XL representation, which makes it available for data science, knowledge-based NLP applications, and LOD in the context of digital humanities.
Author: | Karolina Suchowolec, Christian Lang, Roman Schneider, Horst Schwinn |
---|---|
URN: | urn:nbn:de:bsz:mh39-62898 |
DOI: | https://doi.org/10.1007/978-3-319-59888-8 |
ISBN: | 978-3-319-59887-1 |
ISSN: | 0302-9743 |
Parent Title (English): | Language, data and knowledge. First International Conference, LDK 2017. Galway, Ireland, June 19-20, 2017. Proceedings |
Series (Serial Number): | Lecture notes in artificial intelligence (10318) |
Publisher: | Springer |
Place of publication: | Cham, Switzerland |
Editor: | Jorge Gracia, Francis Bond, John P. McCrae, Paul Buitelaar, Christian Chiarcos, Sebastian Hellmann |
Document Type: | Part of a Book |
Language: | English |
Year of first Publication: | 2017 |
Date of Publication (online): | 2017/07/26 |
Publication state: | Preprint |
Tag: | terminology management; grammatical terminology |
GND Keyword: | grammar; information management; terminology |
First Page: | 203 |
Last Page: | 212 |
Note: | Due to copyright reasons, the full text of the article is not freely accessible. |
DDC classes: | 400 Language / 400 Language, Linguistics / 400 Language |
Open Access?: | no |
Program areas: | Grammar |
Program areas: | Digital Linguistics |