Refine
Year of publication
- 2016 (8) (remove)
Document Type
- Part of a Book (5)
- Article (3)
Has Fulltext
- yes (8)
Keywords
- Computerunterstützte Lexikographie (5)
- Internet (3)
- Wörterbuch (3)
- Autokorrelation (1)
- Benutzerforschung (1)
- Benutzungsforschung (1)
- Datenstruktur (1)
- Google Ngram (1)
- Logfileanalyse (1)
- Pearson Korrelation (1)
Publicationstate
Reviewstate
- Peer-Review (3)
Publisher
- De Gruyter (5)
- Buro van die Wat (1)
- Oxford Univ. Press (1)
Wiegand’s opus magnum „Wörterbuchforschung“ ends with a chapter on the state and the relevant taslcs for research into dictionary use in the middle of the 1990s. This article aims at reflecting the taste and the relevance of dictionary usage research 20 years later. I will argue that the fundamentally changed lexicographic landscape makes it necessary to shift the focus of research. In my view, the most important aim of research into dictionary use can no longer be limited to improving dictionaries. Research into dictionary use should also raise more awareness for user- orientation in general and should provide methodological reflection to enlighten the increasingly important usage statistics for online dictionaries. Another goal should be to look behind the scenes of collaborative dictionaries in order to provide background data to classify their relevance in relation to dictionaries elaborated by lexicographic experts. The crisis of lexicography makes it also necessary to broaden our view and concentrate on situations in which linguistic questions arise. In this context, we could examine in which of these situations the consultation of lexicographic data helps. In summary, the aim of research into dictionary use is to identify the fields where sound lexicographic work is really helpful for potential users.
Datenmodellierung
(2016)
Wiktionary is increasingly gaining influence in a wide variety of linguistic fields such as NLP and lexicography, and has great potential to become a serious competitor for publisher-based and academic dictionaries. However, little is known about the "crowd" that is responsible for the content of Wiktionary. In this article, we want to shed some light on selected questions concerning large-scale cooperative work in online dictionaries. To this end, we use quantitative analyses of the complete edit history files of the English and German Wiktionary language editions. Concerning the distribution of revisions over users, we show that — compared to the overall user base — only very few authors are responsible for the vast majority of revisions in the two Wiktionary editions. In the next step, we compare this distribution to the distribution of revisions over all the articles. The articles are subsequently analysed in terms of rigour and diversity, typical revision patterns through time, and novelty (the time since the last revision). We close with an examination of the relationship between corpus frequencies of headwords in articles, the number of article visits, and the number of revisions made to articles.
In order to demonstrate why it is important to correctly account for the (serial dependent) structure of temporal data, we document an apparently spectacular relationship between population size and lexical diversity: for five out of seven investigated languages, there is a strong relationship between population size and lexical diversity of the primary language in this country. We show that this relationship is the result of a misspecified model that does not consider the temporal aspect of the data by presenting a similar but nonsensical relationship between the global annual mean sea level and lexical diversity. Given the fact that in the recent past, several studies were published that present surprising links between different economic, cultural, political and (socio-)demographical variables on the one hand and cultural or linguistic characteristics on the other hand, but seem to suffer from exactly this problem, we explain the cause of the misspecification and show that it has profound consequences. We demonstrate how simple transformation of the time series can often solve problems of this type and argue that the evaluation of the plausibility of a relationship is important in this context. We hope that our paper will help both researchers and reviewers to understand why it is important to use special models for the analysis of data with a natural temporal ordering.
We present an empirical study addressing the question whether, and to which extent, lexicographic writing aids improve text revision results. German university students were asked to optimise two German texts using (1) no aids at all, (2) highlighted problems, or (3) highlighted problems accompanied by lexicographic resources that could be used to solve the specific problems. We found that participants from the third group corrected the largest number of problems and introduced the fewest semantic distortions during revision. Also, they reached the highest overall score and were most efficient (as measured in points per time). The second group with highlighted problems lies between the two other groups in almost every measure we analysed. We discuss these findings in the scope of intelligent writing environments, the effectiveness of writing aids in practical usage situations and teaching dictionary skills.