Refine
Year of publication
- 2016 (19) (remove)
Document Type
- Part of a Book (11)
- Conference Proceeding (4)
- Book (3)
- Article (1)
Keywords
- Computerunterstützte Lexikographie (19) (remove)
Publicationstate
Reviewstate
- (Verlags)-Lektorat (8)
- Peer-Review (1)
Wiktionary is increasingly gaining influence in a wide variety of linguistic fields such as NLP and lexicography, and has great potential to become a serious competitor for publisher-based and academic dictionaries. However, little is known about the "crowd" that is responsible for the content of Wiktionary. In this article, we want to shed some light on selected questions concerning large-scale cooperative work in online dictionaries. To this end, we use quantitative analyses of the complete edit history files of the English and German Wiktionary language editions. Concerning the distribution of revisions over users, we show that — compared to the overall user base — only very few authors are responsible for the vast majority of revisions in the two Wiktionary editions. In the next step, we compare this distribution to the distribution of revisions over all the articles. The articles are subsequently analysed in terms of rigour and diversity, typical revision patterns through time, and novelty (the time since the last revision). We close with an examination of the relationship between corpus frequencies of headwords in articles, the number of article visits, and the number of revisions made to articles.
The wdlpOst dictionary writing system to be presented in this paper has been developed for the specific purposes of a lexicographical project on German loanwords in the East Slavic languages Russian, Belarusian, and Ukrainian. The project’s main objectives are (i) to document those loanwords for which a cognate lexical borrowing from German is known in Polish and (ii) to establish possible borrowing pathways for these lexical items. In the first phase of the project, the collaborative client/server architecture of the wdlpOst system has been used for excerpting detailed lexicographical information from a large range of historical and contemporary East Slavic dictionaries, taking the entries in a large dictionary of German loanwords in Polish as a common frame of reference. For the project’s second phase, the wdlpOst system provides innovative tooling for compiling entries of the East Slavic loanwords. Most importantly, the numerous word sense definitions for a set of cognate loanwords, as excerpted from different lexicographical sources, are mapped onto a system of newly defined cross-language word senses; in a similar vein, the phonemic and graphemic variation in the loanwords and their derivatives is captured through a tool that abstracts from dictionary-specific idiosyncrasies.
Lexicography of Language Contact: An Internet Dictionary of Words of German Origin in Tok Pisin
(2016)
The paper presents an ongoing project in the domain of lexicography of language contact, namely, the “Internet Dictionary of Words of German Origin in Tok Pisin”. The German influence onto the lexicon of the main pidgin language of Papua New Guinea has its roots in the German colonial empire, where Tok Pisin played an important role as a lingua franca in the colony of German New Guinea. Tok Pisin also served as an intermediate language for many borrowing processes; that is, German loans entered many languages in the South Pacific via Tok Pisin. The Internet Dictionary of Words of German Origin in Tok Pisin is based on all available lexicographical sources from the early 20th century up to now. These sources are systematically evaluated within our project; the results will be documented in the dictionary. The microstructure of the dictionary will be presented with respect to its major features: documentation of sources, examples for word usage, audio files, and lexicographic comment.
The Online Bibliography of Electronic Lexicography (OBELEXmeta) is a bibliographic database which is developed for researchers working in the field of dictionary research. The platform is hosted at the Institute for the German Language (Institut für Deutsche Sprache, IDS) in Mannheim. The poster presentation aims at presenting the current status of the ongoing project.
Der lexikografische Prozess
(2016)
Der lexikografische Prozess ist bislang fast ausschließlich bezogen auf gedruckte Wörterbücher untersucht worden. Bei Internetwörterbüchern (und insbesondere bei solchen, die sich im Aufbau befinden) gestaltet sich dieser Prozess ganz anders: Hier ist kein Nacheinander der einzelnen Herstellungsphasen zu beschreiben, sondern ein permanentes Neben- und Ineinander einzelner Arbeits- schritte. In diesem Zusammenhang stellt sich somit eine ganze Reihe von Fragen, z. B. danach, wie Bearbeitungsteilwortschätze auszuwählen sind oder welchen Einfluss die neuen Möglichkeiten der Datengewinnung aus elektronischen Textkorpora auf den le- xikografischen Prozess haben, welche Software zur Unterstützung lexikografischer Prozesse eingesetzt werden kann oder wie sich all diese Änderungen auf die Nachschlagenden auswirken.
Datenmodellierung
(2016)
Das Internet ist mittlerweile die wichtigste Plattform zur Publikation lexikografischer Inhalte. Im vorliegenden Band führen Vertreterinnen und Vertreter des von der Deutschen Forschungsgemeinschaft geförderten Netzwerks Internetlexikografie erstmals in die zentralen Arbeitsfelder der Internetlexikografie ein und stellen den aktuellen Stand der wissenschaftlichen Forschung und lexikografischen Praxis vor.
Das Kapitel widmet sich den grundsätzlichen technischen Rahmenbedingungen für die heutige Internetlexikografie. Zum einen skizzieren die Autoren, was „hinter“ den auf einem Monitor sichtbaren Benutzeroberflächen geschieht, wenn eine Nutzerin online auf ein Wörterbuch zugreift, und wie diese Prozesse zu Dokumentationszwecken in I.ogdateien protokolliert werden können. Zum anderen diskutieren sie, wie die Identität und dauerhafte Verfügbarkeit von Inhalten angesichts der ständig möglichen Aktualisierbarkeit von Online-Angeboten sichergestellt werden können.