Refining and Exploiting the Structural Markup of the eWDG

In this paper, we describe a semi-automated approach to refining the dictionary-entry structure of the digital version (eWDG) of the Wörterbuch der deutschen Gegenwartssprache (WDG, en: Dictionary of Present-day German), a dictionary compiled and published between 1952 and 1977 by the Deutsche Akademie der Wissenschaften that comprises six volumes with over 4,500 pages containing more than 120,000 headwords. We discuss the benefits of such a refinement in the context of the dictionary project Digitales Wörterbuch der deutschen Sprache (DWDS, en: Digital Dictionary of the German Language). In the current phase of the DWDS project, we aim to integrate multiple dictionary and corpus resources for the German language into a digital lexical system (DLS). In this context, we plan to expand the current DWDS interface with several special-purpose components, which are adaptive in the sense that they offer specialized data views and search mechanisms for different dictionary functions (e.g. text comprehension, text production) and different user groups (e.g. journalists, translators, linguistic researchers, computational linguists). One prerequisite for generating such data views is selective access to the lexical items in the article structure of the dictionaries under study. For this purpose, the representation of the eWDG has to be refined. The focus of this paper is on the semi-automated approach used to transform the eWDG into a refined version in which the main structural units can be explicitly accessed. We show how this refinement opens new and flexible ways of visualizing and querying the lexicographic content of the refined version in the context of the DLS project.

Metadata
Author: Thomas Schmidt, Alexander Geyken, Angelika Storrer
URN: urn:nbn:de:bsz:mh39-22582
URL: http://euralex.org/category/publications/euralex-2008/
ISBN: 978-84-96742-67-3
Parent Title (English): Proceedings of the XIII EURALEX International Congress (Euralex 2008). Barcelona, Spain. 15-19 July, 2008
Series (Serial Number): Sèrie Activitats (20)
Publisher: Institut Universitari de Lingüística Aplicada, Universitat Pompeu Fabra
Place of publication: Barcelona
Editor: Elisenda Bernal, Janet Ann DeCesaris
Document Type: Conference Proceeding
Language: English
Year of first Publication: 2008
Date of Publication (online): 2014/05/07
Tag: Digitales Wörterbuch der deutschen Sprache (DWDS); Wörterbuch der deutschen Gegenwartssprache (WDG)
GND Keyword: Computer-assisted lexicography (Computerunterstützte Lexikographie)
First Page: 469
Last Page: 481
Dewey Decimal Classification: 400 Language / 410 Linguistics / 413 Dictionaries
Linguistics-Classification: Computational linguistics
Linguistics-Classification: Lexicography
Open Access?: Yes
Licence (English): Creative Commons - Attribution-NonCommercial-ShareAlike 3.0 Unported