
How many people constitute a crowd and what do they do? Quantitative analyses of revisions in the English and German Wiktionary editions

Wiktionary is increasingly gaining influence in a wide variety of linguistic fields such as NLP and lexicography, and has great potential to become a serious competitor for publisher-based and academic dictionaries. However, little is known about the "crowd" that is responsible for the content of Wiktionary. In this article, we want to shed some light on selected questions concerning large-scale cooperative work in online dictionaries. To this end, we use quantitative analyses of the complete edit history files of the English and German Wiktionary language editions. Concerning the distribution of revisions over users, we show that, compared to the overall user base, only very few authors are responsible for the vast majority of revisions in the two Wiktionary editions. In the next step, we compare this distribution to the distribution of revisions over all the articles. The articles are subsequently analysed in terms of rigour and diversity, typical revision patterns through time, and novelty (the time since the last revision). We close with an examination of the relationship between corpus frequencies of headwords in articles, the number of article visits, and the number of revisions made to articles.

Metadata
Author:Sascha Wolfer, Carolin Müller-Spitzer
URN:urn:nbn:de:bsz:mh39-55918
DOI:https://doi.org/10.5788/26-1-1346
ISSN:2224-0039 (online)
ISSN:1684-4904 (print)
Parent Title (German):Lexikos
Publisher:Buro van die WAT
Place of publication:Stellenbosch
Document Type:Article
Language:English
Year of first Publication:2016
Date of Publication (online):2016/11/17
Contributing Corporation:African Association for Lexicography (AFRILEX)
Publication state:Published version
Review state:Peer review
Tag:online dictionary; revision; user-generated content; wiktionary; wisdom of the crowd
GND Keyword:Computer-assisted lexicography; Internet; Quality control; Dictionary
Volume:26
First Page:347
Last Page:371
Dewey Decimal Classification:400 Language
Leibniz-Classification:Language, Linguistics
Linguistics-Classification:Lexicography
Open Access?:Yes
Licence (German):German copyright law (UrhG) applies