This introductory tutorial describes a strictly corpus-driven approach to uncovering evidence of aspects of the use of lexical items. These aspects include ‘(lexical) meaning’ in a very broad sense and involve different dimensions; they are established in, and emerge from, the respective discourses. Data-driven mathematical-statistical methods with minimal (linguistic) premises are used to summarize a word’s usage spectrum as a collocation profile. Self-organizing methods are then applied to visualize the complex similarity structure spanned by these profiles. The visualizations point to the typical aspects of a word’s use and to the common and distinctive aspects of any two words.
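The idea of a collocation profile and of comparing two profiles can be illustrated with a minimal sketch. It is not the method of the tutorial itself: it assumes raw co-occurrence counts within a fixed token window instead of a statistical association measure, and it uses cosine similarity where the tutorial applies self-organizing methods. The function names, the window size, and the toy sentence are hypothetical.

```python
from collections import Counter
import math


def collocation_profile(tokens, target, window=2):
    """Count the words found within `window` tokens of each occurrence of `target`.

    Simplifying assumption: raw counts stand in for a statistical
    association measure.
    """
    profile = Counter()
    for i, tok in enumerate(tokens):
        if tok != target:
            continue
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i:
                profile[tokens[j]] += 1
    return profile


def cosine(p, q):
    """Cosine similarity between two count profiles (0.0 if either is empty)."""
    dot = sum(p[w] * q[w] for w in p.keys() & q.keys())
    norm_p = math.sqrt(sum(v * v for v in p.values()))
    norm_q = math.sqrt(sum(v * v for v in q.values()))
    if norm_p == 0 or norm_q == 0:
        return 0.0
    return dot / (norm_p * norm_q)


# Toy data, invented for illustration only.
tokens = "the cat drinks milk and the dog drinks water while the cat sleeps".split()
cat = collocation_profile(tokens, "cat")
dog = collocation_profile(tokens, "dog")
similarity = cosine(cat, dog)
```

Words shared by both profiles (here "the" and "drinks") raise the similarity, while words found in only one profile ("milk", "water") mark what distinguishes the two usages.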
Valenz und Kookkurrenz
(2015)
This paper describes DeReKo (Deutsches Referenzkorpus), the Archive of General Reference Corpora of Contemporary Written German at the Institut für Deutsche Sprache (IDS) in Mannheim, and the rationale behind its development. We discuss its design, its legal background, how to access it, available metadata, linguistic annotation layers, underlying standards, ongoing developments, and aspects of using the archive for empirical linguistic research. The focus of the paper is on the advantages of DeReKo’s design as a primordial sample from which virtual corpora can be drawn for the specific purposes of individual studies. Both concepts, primordial sample and virtual corpus, are explained and illustrated in detail. Furthermore, we describe in more detail how DeReKo deals with the fact that all its texts are subject to third parties’ intellectual property rights, and how it deals with the issue of replicability, which is particularly challenging given DeReKo’s dynamic growth and the possibility of constructing from it an open number of virtual corpora.
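The relation between a primordial sample and a virtual corpus can be pictured with a small sketch. It does not reflect DeReKo's actual data model or access tools: the `Text` record, its metadata fields, the sample entries, and the fingerprint helper are all hypothetical, and recording a hash of the selected text IDs is only one conceivable way to make a selection checkable later.

```python
from dataclasses import dataclass
import hashlib


@dataclass(frozen=True)
class Text:
    # Hypothetical metadata fields, not DeReKo's real schema.
    text_id: str
    year: int
    genre: str


def virtual_corpus(archive, predicate):
    """Select the texts of the archive that satisfy a study-specific predicate."""
    return [t for t in archive if predicate(t)]


def fingerprint(corpus):
    """Hash the sorted text IDs so the same selection can be recognized later."""
    ids = "\n".join(sorted(t.text_id for t in corpus))
    return hashlib.sha256(ids.encode("utf-8")).hexdigest()


# Invented archive entries, for illustration only.
archive = [
    Text("A01", 1998, "newspaper"),
    Text("A02", 2003, "newspaper"),
    Text("B01", 2005, "fiction"),
    Text("A03", 2008, "newspaper"),
]

# One study draws newspaper texts from 2000 to 2009; another could draw
# a different virtual corpus from the same archive.
news_2000s = virtual_corpus(
    archive, lambda t: t.genre == "newspaper" and 2000 <= t.year <= 2009
)
selection_id = fingerprint(news_2000s)
```

Because the archive keeps growing, re-running the same predicate later may return more texts; a stored fingerprint of the original selection makes such drift detectable.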