Korpuslinguistik
Refine
Year of publication
Document Type
- Conference Proceeding (161) (remove)
Keywords
- Korpus <Linguistik> (144)
- Deutsch (33)
- Annotation (28)
- Gesprochene Sprache (21)
- Forschungsdaten (17)
- corpus linguistics (13)
- Computerlinguistik (12)
- Corpus linguistics (12)
- Datenmanagement (11)
- Corpus technology (10)
Publicationstate
- Veröffentlichungsversion (127)
- Zweitveröffentlichung (14)
- Postprint (3)
Reviewstate
- Peer-Review (85)
- (Verlags)-Lektorat (41)
- Peer-review (2)
- Review-Status-unbekannt (1)
Publisher
- European Language Resources Association (ELRA) (22)
- European Language Resources Association (17)
- Institut für Deutsche Sprache (17)
- Leibniz-Institut für Deutsche Sprache (9)
- CLARIN (8)
- Linköping University Electronic Press (8)
- Nisaba (4)
- University of Birmingham (4)
- Association for Computational Linguistics (3)
- Extreme Markup Languages Conference (3)
Newspapers became extremely popular in Germany during the 18th and 19th century, and thus increasingly influential for modern German. However, due to the lack of digitized historical newspaper corpora for German, this influence could not be analyzed systematically. In this paper, we introduce the Mannheim Corpus of Digital Newspapers and Magazines, which in its current release comprises 21 newspapers and magazines from the 18th and 19th century. With over 4.1 Mio tokens in about 650 volumes it currently constitutes the largest historical corpus dedicated to newspapers in German. We briefly discuss the prospect of the corpus for analyzing the evolution of news as a genre in its own right and the influence of contextual parameters such as region and register on the language of news. We then focus on one historically influential aspect of newspapers – their role in disseminating foreign words in German. Our preliminary quantitative results indeed indicate that newspapers use foreign words significantly more frequently than other genres, in particular belles lettres.