Tools for historical corpus research, and a corpus of Latin
- We present LatinlSE, a Latin corpus for the Sketch Engine. LatinlSE consists of Latin works comprising a total of 13 million words, covering the time span from the 2nd Century BC to the 21st century AD. LatinlSE is provided with rich metadata mark-up, including author, title, genre, era, date and century, as well as book, section, paragraph and line of verses. We have automatically annotated LatinlSE with lemma and part-of-speech information, enabling users to search the corpus with a number of criteria, ranging from lemma, part-of speech, context, to subcorpora defined chronologically or by genre. We also illustrate word sketches, one-page summaries of a word’s corpus based collocational behaviour. Our future plan is to produce word sketches for Latin words by adding richer morphological and syntactic annotation to the corpus.
Author: | Barbara McGillivrayORCiDGND, Adam Kilgarriff |
---|---|
URN: | urn:nbn:de:bsz:mh39-128994 |
ISBN: | 978-3-8233-6760-4 |
Parent Title (English): | New methods in historical corpora |
Series (Serial Number): | Korpuslinguistik und interdisziplinäre Perspektiven auf Sprache | Corpus Linguistics and Interdisciplinary Perspectives on Language | CLIP (3) |
Publisher: | Narr |
Place of publication: | Tübingen |
Editor: | Paul Bennett, Martin Durrell, Silke Scheible, Richard J. Whitt |
Document Type: | Part of a Book |
Language: | English |
Year of first Publication: | 2013 |
Date of Publication (online): | 2024/11/07 |
Publishing Institution: | Leibniz-Institut für Deutsche Sprache (IDS) |
Publicationstate: | Zweitveröffentlichung |
Reviewstate: | (Verlags)-Lektorat |
GND Keyword: | Computerlinguistik; Historische Sprachwissenschaft; Korpus <Linguistik>; Latein |
First Page: | 247 |
Last Page: | 256 |
DDC classes: | 400 Sprache / 400 Sprache, Linguistik |
Open Access?: | ja |
BDSL-Classification: | Grammatik |
Leibniz-Classification: | Sprache, Linguistik |
Linguistics-Classification: | Korpuslinguistik |
Licence (German): | ![]() |