Dealing with multiple annotations
- The present chapter focuses on corpora that feature multiple layers of annotation and looks at the phenomenon from an application-based perspective, couched in the real-life context of DeReKo (the largest linguistically motivated collection of contemporary written German) and of the analysis engine KorAP that makes it possible to analyse texts with reference to multiple annotations. The authors review the basic concepts needed for talking about multiple annotations, present the most notorious obstacle in dealing with them in an XML-based format, show the basic ways of tackling it that have been suggested in the literature, and conclude with a presentation of KorAP-XML, an internal format designed for the purpose of easy enrichment of DeReKo texts with many diverse annotation layers.
| Author: | Piotr BańskiORCiDGND, Nils DiewaldORCiDGND |
|---|---|
| URN: | urn:nbn:de:bsz:mh39-135895 |
| DOI: | https://doi.org/10.1515/9783112208212-008 |
| ISBN: | 978-3-11-220821-2 |
| ISSN: | 2751-1286 |
| Parent Title (English): | Harmonizing language data. Standards for linguistic resources |
| Series (Serial Number): | Digital Linguistics (4) |
| Publisher: | de Gruyter |
| Place of publication: | Berlin/Boston |
| Editor: | Piotr BańskiORCiDGND, Ulrich HeidORCiDGND, Laura HerzbergORCiDGND |
| Document Type: | Part of a Book |
| Language: | English |
| Year of first Publication: | 2025 |
| Date of Publication (online): | 2025/12/09 |
| Publishing Institution: | Leibniz-Institut für Deutsche Sprache (IDS) |
| Publicationstate: | Veröffentlichungsversion |
| Reviewstate: | (Verlags)-Lektorat |
| Tag: | DeReKo; KorAP; multiple annotations; overlapping annotations; parallel processing; specialised format |
| GND Keyword: | Annotation; Datenmodell; Deutsch; Korpus <Linguistik>; Metadaten |
| First Page: | 169 |
| Last Page: | 200 |
| DDC classes: | 400 Sprache / 400 Sprache, Linguistik |
| Open Access?: | ja |
| Linguistics-Classification: | Computerlinguistik |
| Linguistics-Classification: | Korpuslinguistik |
| Program areas: | Grammatik |
| Program areas: | Digitale Sprachwissenschaft |
| Licence (English): | Creative Commons - Attribution-ShareAlike 4.0 International |


