Refine
Year of publication
Document Type
- Part of a Book (28)
- Article (17)
- Conference Proceeding (16)
- Book (1)
Keywords
- Lehnwort (20)
- Computerunterstützte Lexikographie (15)
- Deutsch (14)
- Sprachstatistik (7)
- Wörterbuch (7)
- Linguist (6)
- Biografie (4)
- Korpus <Linguistik> (4)
- Online-Wörterbuch (4)
- Russisch (4)
Publicationstate
- Veröffentlichungsversion (35)
- Postprint (5)
- Zweitveröffentlichung (2)
Reviewstate
- Peer-Review (15)
- (Verlags)-Lektorat (12)
- Verlags-Lektorat (12)
- Peer-review (1)
Publisher
- Niemeyer (6)
- De Gruyter (4)
- IDS-Verlag (3)
- Lexical Computing CZ s.r.o. (3)
- Sagner (3)
- Institut für Deutsche Sprache (2)
- Trojina, Institute for Applied Slovene Studies (2)
- Zenodo (2)
- Accademia della Crusca (1)
- Democritus University of Thrace (1)
We start by trying to answer a question that has already been asked by de Schryver et al. (2006): Do dictionary users (frequently) look up words that are frequent in a corpus. Contrary to their results, our results that are based on the analysis of log files from two different online dictionaries indicate that users indeed look up frequent words frequently. When combining frequency information from the Mannheim German Reference Corpus and information about the number of visits in the Digital Dictionary of the German Language as well as the German language edition of Wiktionary, a clear connection between corpus and look-up frequencies can be observed. In a follow-up study, we show that another important factor for the look-up frequency of a word is its temporal social relevance. To make this effect visible, we propose a de-trending method where we control both frequency effects and overall look-up trends.
Am 12. Mai 1965 nahmen der Staat Israel und die Bundesrepublik Deutschland offiziell diplomatische Beziehungen auf. Damit kam über 15 Jahre nach der Konstitution der beiden Länder und 20 Jahre nach dem Ende der Shoah ein komplexer Prozess der langsamen politischen Annäherung zu einem keineswegs selbstverständlichen Abschluss. Das fünfzigjährige Jubiläum dieses Ereignisses im Jahr 2015 war weltweit, vor allem aber in Israel und Deutschland, Anlass für zahlreiche Veranstaltungen, über die eine offizielle bilaterale Webseite <www.de50il.org/> (Stand: 6.11.2017) Auskunft gibt. Im Rahmen des Jubiläums wurde am 30. September 2015 in einer feierlichen Abendveranstaltung im Jüdischen Museum Berlin offiziell das „Wörterbuch deutscher Lehnwörter im Hebräischen“ von Uriel Adiv in einer ersten Fassung im „Lehnwortportal Deutsch“ des IDS freigeschaltet. Eine von Koautor Jakob Mendel erheblich überarbeitete und verbesserte zweite Version ging im Mai 2017 online. Der vorliegende Beitrag möchte einige Hintergründe zum deutschen Lehnwortschatz im modernen Hebräischen darstellen sowie die Entstehungsgeschichte des Werks und seinen Platz in der lehnwortlexikografischen Publikationsplattform „Lehnwortportal Deutsch“ <http://lwp.ids-mannheim.de/> (Stand: 6.11.2017) beleuchten.
Das Lehnwortportal Deutsch (LWPD) ist ein Online-Informationssystem zu Entlehnungen von Wörtern aus dem Deutschen in andere Sprachen. Es beruht auf einer wachsenden Zahl von lexikographischen Ressourcen zu verschiedenen Sprachen und bietet eine einfache ressourcenübergreifende Suchfunktion an. Das Poster präsentiert eine derzeit in Entwicklung befindliche onomasiologische Suchfunktion für das LWPD.
The present study examines the dynamics of the kanji combinations that form common (or general) and proper nouns in Japanese. The following three results were obtained. First, the degree of distribution results from two similar processes which are based on a steady-state of birth-and-death processes with different birth and death rates, resulting in a positive negative binomial distribution with the proper nouns and in a positive Waring distribution with common nouns. Second, all rank-frequency distributions follow the negative hypergeometric distribution used very frequently in ranking problems. Third, the building of kanji compounds follows a dissortative strategy. The higher the outdegree of a kanji, the more it prefers kanji with lower indegrees. A linear dependence can be observed with common nouns, whereas the relationship between compounded kanji is rather curvilinear with proper nouns. The actual analytical expression is not yet known.
In this paper, we address issues of inconsistencies of dictionary information and how different corpus methods and computer tools can assist in providing systematic cross-referencing. The question is raised how hyperlinking in an electronic reference work can be approached systematically in order to warrant consistent symmetrical links between synonyms or antonyms. Firstly, it is argued that working with a comprehensive corpus does not account for consistent cross-referencing. It is shown that a top-down corpus-driven linguistic analysis also does not guarantee the lexicographic documentation of binary lexico-semantic relations covered by corpus data, as proposed by Paradis/Willners (2006a, b). Secondly, with the help of dictionary examples taken from elexiko (an online dictionary of contemporary German) we demonstrate how a combination of both corpus-driven and corpus-based procedures enables lexicographers to systematically exploit corpus material in more depth than by using only one of these methods. It is also discussed where and why lexicographers are still prone to inconsistencies in the editing processes, irrespective of their underlying corpus methodologies. Finally, we introduce a cross-reference management tool that has been developed for elexiko and we explain its technological prerequisites and implications. This software supports lexicographers in detecting existing and missing references from and to a specific headword. It also offers options to automatically and comfortably correct discrepancies. Overall, we suggest a method that includes linguistic competence, complementary corpus approaches and additional software in order to ensure that links or references between synonymic and antonymic pairings are given in both directions.
In this paper we present an experimental semantic search function, based on word embeddings, for an integrated online information system on German lexical borrowings into other languages, the Lehnwortportal Deutsch (LWPD). The LWPD synthesizes an increasing number of lexicographical resources and provides basic cross-resource search options. Onomasiological access to the lexical units of the portal is a highly desirable feature for many research questions, such as the likelihood of borrowing lexical units with a given meaning (Haspelmath & Tadmor, 2009; Zeller, 2015). The search technology is based on multilingual pre-trained word embeddings, and individual word senses in the portal are associated with word vectors. Users may select one or more among a very large number of search terms, and the database returns lexical items with word sense vectors similar to these terms. We give a preliminary assessment of the feasibility, usability and efficacy of our approach, in particular in comparison to search options based on semantic domains or fields.
The representation of semantic relations between word senses of different entries in a dictionary is subject to a number of consistency requirements. This paper discusses the issue of maintaining and accessing consistent information on cross-references between sense-related items in electronic dictionaries from a mainly text-technological point of view. We present a number of consistency criteria for cross-referencing related senses and propose a practical approach to handling sense relations in an online dictionary. Our proposal is currently being tested in a large ongoing online dictionary project for German called elexiko. We focus on three different aspects of the dictionary development and editing process where consistency is an important issue: lexicographic data modelling, implementation of a lexicographic database system for an electronic dictionary, and development of practical tools for the lexicographer’s workbench.
In dem Beitrag präsentieren und diskutieren die Autoren zunächst einige Untersuchungen aus der Benutzungsforschung zu elektronischen Wörterbüchern, die sich mit der nutzerseitigen Beurteilung des Mehrwerts multimedialer und benutzeradaptiver Elemente befassen (Kap. 1. In einem zweiten Teil versuchen sie, ausgehend von den Stärken und Schwächen vorhandener Ansätze in diesem Bereich, Antworten auf die Frage zu finden, welche Anforderungen an Visualisierungstechniken und ‑strategien in elektronischen Wörterbüchern gestellt werden müssen, um einen solchen Mehrwert zu erhalten (Kap. 2). Abschließend stellen sie als praktisches Beispiel für eine mögliche Umsetzung solcher Anforderungen den Prototyp einer Software zur interaktiven Erkundung von Wortbildungsangaben im Wörterbuch vor.
Making 1:n explorable: a search interface for the ZAS database of clause-embedding predicates
(2017)
We introduce a recently published corpus-based database of German clause-embedding predicates and present an innovative web application for exploring it. The application displays the predicates and the corpus examples for these predicates in two separate tables that can be browsed and searched in real time. While familiar web interface paradigms make it easy for users to get started, the data presentation and the interactive advanced search components for the two tables are designed to accommodate remarkably complex query needs without the need for resorting to a dedicated query language or a more specialized tool. The 1:n relationship between predicates and their examples is exploited in the two tables in that, e.g. the predicate table also shows, for each predicate and each example attribute, all values that occur in the examples for this predicate. An easy-to-use visual query builder for arbitrary Boolean combinations of search criteria can optionally be displayed to pre-filter the underlying data presented in both tables. Several options for altering quantifier scope can be activated with simple checkboxes and considerably widen the space of searchable constellations.