Twenty-two historical encyclopedias encoded in TEI: a new resource for the Digital Humanities
- This paper accompanies the corpus publication of EncycNet, a novel XML/TEI annotated corpus of 22 historical German encyclopedias from the early 18th to early 20th century. We describe the creation and annotation of the corpus, including the rationale for its development, suggested methodology for TEI annotation, possible use cases and future work. While many well-developed annotation standards for lexical resources exist, none can adequately model the encyclopedias at hand, and we therefore suggest how the TEI Lex-0 standard may be modified with additional guidelines for the annotation of historical encyclopedias. As the digitization and annotation of historical encyclopedias are settling on TEI as the de facto standard, our methodology may inform similar projects.
Author: | Thora Hagen, Erik Ketzan, Fotis Jannidis, Andreas Witt |
---|---|
URN: | urn:nbn:de:bsz:mh39-102512 |
URL: | https://www.aclweb.org/anthology/2020.latechclfl-1.13/ |
ISBN: | 978-1-952148-34-7 |
Parent Title (English): | Proceedings of the 4th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature. December 12, 2020, Barcelona, Spain (Online) |
Publisher: | Association for Computational Linguistics |
Place of publication: | Stroudsburg, PA |
Editor: | Stefania DeGaetano, Anna Kazantseva, Nils Reiter, Stan Szpakowicz |
Document Type: | Part of a Book |
Language: | English |
Year of first Publication: | 2020 |
Date of Publication (online): | 2020/12/21 |
Publicationstate: | Published version |
Reviewstate: | Peer-Review |
GND Keyword: | Annotation; Digital Humanities; Enzyklopädie; Historische Lexikografie; Korpus <Linguistik> |
First Page: | 112 |
Last Page: | 120 |
DDC classes: | 400 Language / 400 Language, Linguistics |
Open Access?: | yes |
Leibniz-Classification: | Language, Linguistics |
Linguistics-Classification: | Computational linguistics |
Linguistics-Classification: | Corpus linguistics |
Program areas: | S2: Research Coordination and Infrastructures |