Analyzing formulaic patterns in historical corpora
This paper draws attention to a linguistic phenomenon that, at the current stage of research, can be analysed only inadequately with the help of an electronic text corpus. It thereby adds a new aspect to the discussion about historical corpora by asking how they should be designed in order to be useful for linguistic research on so-called formulaic patterns. The novelty of the question becomes apparent from the fact that no such historical corpora exist at present. In section 1, we define the term formulaic pattern, because a clear understanding of this phenomenon is a prerequisite for collaborative research on it by historians of language, corpus linguists and computational linguists. Section 2 gives a brief outline of the state of the art in research on modern formulaic language within the framework of corpus and computational linguistics. Section 3 shows that some well-known problems in this area are exacerbated when historical texts are analysed. Section 4 presents a possible solution that has been implemented by the HiFoS Research Group at the University of Trier (Germany). Joint research efforts planned with the UKP Lab at TU Darmstadt (section 5) demonstrate that the restrictions posed by historical formulaic patterns are challenges to be overcome rather than insurmountable obstacles.
Author: | Claudine Moulin, Iryna Gurevych, Natalia Filatkina, Richard Eckart de Castilho |
---|---|
URN: | urn:nbn:de:bsz:mh39-125542 |
ISBN: | 978-3-8233-6922-6 |
Parent Title (English): | Historical corpora. Challenges and perspectives |
Series (Serial Number): | Korpuslinguistik und interdisziplinäre Perspektiven auf Sprache / Corpus Linguistics and Interdisciplinary Perspectives on Language / CLIP (5) |
Publisher: | Narr |
Place of publication: | Tübingen |
Editor: | Jost Gippert, Ralf Gehrke |
Document Type: | Part of a Book |
Language: | English |
Year of first Publication: | 2015 |
Date of Publication (online): | 2024/03/06 |
Publishing Institution: | Leibniz-Institut für Deutsche Sprache (IDS) |
Publication state: | Secondary publication |
Review state: | Publisher's editorial review |
GND Keyword: | Historical linguistics (Historische Sprachwissenschaft); Corpus, linguistics (Korpus <Linguistik>) |
First Page: | 51 |
Last Page: | 63 |
DDC classes: | 400 Language / 400 Language, Linguistics |
Open Access?: | yes |
BDSL-Classification: | Grammar |
Leibniz-Classification: | Language, Linguistics |
Linguistics-Classification: | Grammar research |
Linguistics-Classification: | Corpus linguistics |
Licence: | Protected by copyright (Urheberrechtlich geschützt) |