Volltext-Downloads (blau) und Frontdoor-Views (grau)

Dealing with multiple annotations

  • The present chapter focuses on corpora that feature multiple layers of annotation and looks at the phenomenon from an application-based perspective, couched in the real-life context of DeReKo (the largest linguistically motivated collection of contemporary written German) and of the analysis engine KorAP that makes it possible to analyse texts with reference to multiple annotations. The authors review the basic concepts needed for talking about multiple annotations, present the most notorious obstacle in dealing with them in an XML-based format, show the basic ways of tackling it that have been suggested in the literature, and conclude with a presentation of KorAP-XML, an internal format designed for the purpose of easy enrichment of DeReKo texts with many diverse annotation layers.

Export metadata

Additional Services

Search Google Scholar

Statistics

frontdoor_oas
Metadaten
Author:Piotr BańskiORCiDGND, Nils DiewaldORCiDGND
URN:urn:nbn:de:bsz:mh39-135895
DOI:https://doi.org/10.1515/9783112208212-008
ISBN:978-3-11-220821-2
ISSN:2751-1286
Parent Title (English):Harmonizing language data. Standards for linguistic resources
Series (Serial Number):Digital Linguistics (4)
Publisher:de Gruyter
Place of publication:Berlin/Boston
Editor:Piotr BańskiORCiDGND, Ulrich HeidORCiDGND, Laura HerzbergORCiDGND
Document Type:Part of a Book
Language:English
Year of first Publication:2025
Date of Publication (online):2025/12/09
Publishing Institution:Leibniz-Institut für Deutsche Sprache (IDS)
Publicationstate:Veröffentlichungsversion
Reviewstate:(Verlags)-Lektorat
Tag:DeReKo; KorAP; multiple annotations; overlapping annotations; parallel processing; specialised format
GND Keyword:Annotation; Datenmodell; Deutsch; Korpus <Linguistik>; Metadaten
First Page:169
Last Page:200
DDC classes:400 Sprache / 400 Sprache, Linguistik
Open Access?:ja
Linguistics-Classification:Computerlinguistik
Linguistics-Classification:Korpuslinguistik
Program areas:Grammatik
Program areas:Digitale Sprachwissenschaft
Licence (English):License LogoCreative Commons - Attribution-ShareAlike 4.0 International