Refine
Year of publication
- 2010 (3) (remove)
Document Type
- Article (1)
- Part of a Book (1)
- Conference Proceeding (1)
Has Fulltext
- yes (3)
Is part of the Bibliography
- no (3)
Keywords
- Deutsch (1)
- Forschungsmethode (1)
- Kompositum (1)
- Korpus <Linguistik> (1)
- Korpuslinguistik (1)
- Wortbildung (1)
Publicationstate
Publisher
Das IDS, insbesondere der Programmbereich Korpuslinguistik, bekommt häufig Anfragen zum Wortbestand der deutschen Sprache, sei es, welche Wörter besonders häufig sind, sei es, nach (Listen von) Wörtern mit bestimmten Eigenschaften. Zu dem Themenschwerpunkt „häufigkeitsbasierte Wortlisten“ wurde unter dem Schlagwort DeReWo eine Plattform eingerichtet, auf der Erkenntnisse und Ergebnisse zu diesem Bereich erarbeitet und veröffentlicht werden (<www.ids-mannheim.de/kl/projekte/methode/derewo.html>). Die Frage nach dem „längsten Wort der deutschen Sprache“ hat zwar gewisse Berührungspunkte zu diesem Schwerpunkt, sie hebt sich aber doch ein wenig ab. Deshalb soll sie an dieser Stelle in Form eines fiktiven Gesprächs thematisiert werden (auch wenn eine konkrete Anfrage für eine Kindersendung den Anlass geliefert hat).
Empirical synchronic language studies generally seek to investigate language phenomena for one point in time, even though this point in time is often not stated explicitly. Until today, surprisingly little research has addressed the implications of this time-dependency of synchronic research on the composition and analysis of data that are suitable for conducting such studies. Existing solutions and practices tend to be too general to meet the needs of all kinds of research questions. In this theoretical paper that is targeted at both corpus creators and corpus users, we propose to take a decidedly synchronic perspective on the relevant language data. Such a perspective may be realised either in terms of sampling criteria or in terms of analytical methods applied to the data. As a general approach for both realisations, we introduce and explore the FReD strategy (Frequency Relevance Decay) which models the relevance of language events from a synchronic perspective. This general strategy represents a whole family of synchronic perspectives that may be customised to meet the requirements imposed by the specific research questions and language domain under investigation.
Taking a usage-based perspective, lexical-semantic relations and other aspects of lexical meaning are characterised as emerging from language use. At the same time, they shape language use and therefore become manifest in corpus data. This paper discusses how this mutual influence can be taken into account in the study of these relations. An empirically driven methodology is proposed that is, as an initial step, based on self-organising clustering of comprehensive collocation profiles. Several examples demonstrate how this methodology may guide linguists in explicating implicit knowledge of complex semantic structures. Although these example analyses are conducted for written German, the overall methodology is language-independent.