OPUS 4 | Search

Wie misst man Textqualität im digitalen Zeitalter? (MIT.Qualität) [How do we measure text quality in the digital age?] (2019)

Abel, Andrea ; Frey, Jennifer-Carmen ; Glaznieks, Aivars ; Linthe, Maja ; Müller-Spitzer, Carolin ; Storrer, Angelika ; Wolfer, Sascha

Einführung in das Themenheft „Textqualität im digitalen Zeitalter“ [Introduction to the special issue "Text quality in the digital age"] (2020)

Abel, Andrea ; Glaznieks, Aivars ; Müller-Spitzer, Carolin ; Storrer, Angelika

In 2020, communicating in social media and dealing with hypertexts is no longer a marginal phenomenon. The linguistic characteristics of internet-based communication and social media have by now been well researched and described; German grammars, however, with the exception of Hoffmann (2014), have so far treated them only in passing, if at all. Even more recent approaches to text analysis, e.g. Ágel (2017), concentrate on written texts that are stable in form and linearly organized. The same holds for approaches developed primarily for assessing written products in educational contexts.

Einführung [Introduction] (2022)

Beißwenger, Michael ; Lemnitzer, Lothar ; Müller-Spitzer, Carolin

Challenges to Internet lexicography : the Internet dictionary portal at the Institute for German Language (2009)

Engelberg, Stefan ; Klosa, Annette ; Müller-Spitzer, Carolin

Internet lexicography at the Leibniz-Institute for the German Language (2020)

Engelberg, Stefan ; Klosa-Kückelhaus, Annette ; Müller-Spitzer, Carolin

Lexikographie zwischen Grimm und Google [Lexicography between Grimm and Google] (2019)

Engelberg, Stefan ; Klosa-Kückelhaus, Annette ; Müller-Spitzer, Carolin

Dictionary portals (2013)

Engelberg, Stefan ; Müller-Spitzer, Carolin

Vernetzungs- und Zugriffsstrukturen [Linking and access structures] (2016)

Engelberg, Stefan ; Müller-Spitzer, Carolin ; Schmidt, Thomas

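elexiko – das elektronische, lexikografisch-lexikologische korpusbasierte Wortschatzinformationssystem : zur Neukonzeption, Erweiterung und Revision einzelner Angabebereiche [elexiko, the electronic, lexicographic-lexicological, corpus-based lexical information system: on the redesign, extension and revision of individual information categories] (2008)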

Hahn, Marion ; Klosa, Annette ; Müller-Spitzer, Carolin ; Schnörch, Ulrich ; Storjohann, Petra

This contribution explains important redesigns and extensive revisions of individual information categories in elexiko. The linguistic conception of these categories is a further development of the conception presented in the volume „Grundfragen der elektronischen Lexikographie. elexiko – das Online-Informationssystem zum deutschen Wortschatz“ (2005). The categories affected include, for example, typical uses, sense-related words, and particularities of usage.

Datenmodellierung [Data modelling] (2016)

Herold, Axel ; Meyer, Peter ; Müller-Spitzer, Carolin

Einleitung [Introduction] (2011)

Klosa, Annette ; Müller-Spitzer, Carolin

Grammatische Angaben in elexiko und ihre Modellierung [Grammatical information in elexiko and its modelling] (2007)

Klosa, Annette ; Müller-Spitzer, Carolin

The elexiko project is compiling an extensive monolingual dictionary of contemporary German. This contribution deals with the grammatical information in this dictionary; it explains not only how the content of this information is shaped by the principle of corpus-basedness, but also how it was modelled.

Internetlexikografie: Einleitung [Internet lexicography: introduction] (2016)

Klosa, Annette ; Müller-Spitzer, Carolin

OWID und OWIDplus: lexikographische und lexikalische Ressourcen am IDS Mannheim [OWID and OWIDplus: lexicographic and lexical resources at the IDS Mannheim] (2019)

Klosa-Kückelhaus, Annette ; Müller-Spitzer, Carolin

Lexicographic and lexical resources for German are produced at many different institutions. These include the Dudenverlag, which, with the printed dictionaries of the Duden series and with "Duden online", produces the most frequently consulted dictionaries of contemporary German, and the Union of German Academies, under whose umbrella numerous historical as well as synchronic dictionaries of German are compiled at various individual academies (e.g. the "Digitales Wörterbuch der deutschen Sprache", the "Wörterbuchnetz", and the planned information system of the new "Zentrum für digitale Lexikographie der deutschen Sprache"). At the Institut für Deutsche Sprache in Mannheim, too, scholarly vocabulary-related resources for German are developed and presented to the (specialist) public under the umbrella of OWID, the "Online-Wortschatz-Informationssystem Deutsch". Although we have concentrated in OWID on resources for specialized areas of vocabulary, we reach users in a wide variety of countries around the world. We would like to take this opportunity to present our resources in OWID and OWIDplus to the readers of the ZGL in more detail.

Dictionary users do look up frequent words. A log file analysis (2014)

Koplenig, Alexander ; Meyer, Peter ; Müller-Spitzer, Carolin

In this paper, we use the 2012 log files of two German online dictionaries (Digital Dictionary of the German Language and the German version of Wiktionary) and the 100,000 most frequent words in the Mannheim German Reference Corpus from 2009 to answer the question of whether dictionary users really do look up frequent words, first asked by de Schryver et al. (2006). By using an approach to the comparison of log files and corpus data which is completely different from that of the aforementioned authors, we provide empirical evidence that indicates, contrary to the results of de Schryver et al. and Verlinde/Binon (2010), that the corpus frequency of a word can indeed be an important factor in determining what online dictionary users look up. Finally, we incorporate word class information readily available in Wiktionary into our analysis to improve our results considerably.

The statistical trade-off between word order and word structure – Large-scale evidence for the principle of least effort (2017)

Koplenig, Alexander ; Meyer, Peter ; Wolfer, Sascha ; Müller-Spitzer, Carolin

Languages employ different strategies to transmit structural and grammatical information. While, for example, grammatical dependency relationships in sentences are mainly conveyed by the ordering of the words for languages like Mandarin Chinese or Vietnamese, the word ordering is much less restricted for languages such as Inupiatun or Quechua, as these languages (also) use the internal structure of words (e.g. inflectional morphology) to mark grammatical relationships in a sentence. Based on a quantitative analysis of more than 1,500 unique translations of different books of the Bible in almost 1,200 different languages that are spoken as a native language by approximately 6 billion people (more than 80% of the world population), we present large-scale evidence for a statistical trade-off between the amount of information conveyed by the ordering of words and the amount of information conveyed by internal word structure: languages that rely more strongly on word order information tend to rely less on word structure information and vice versa. Or put differently, if less information is carried within the word, more information has to be spread among words in order to communicate successfully. In addition, we find that, despite differences in the way information is expressed, there is also evidence for a trade-off between different books of the biblical canon that recurs with little variation across languages: the more informative the word order of the book, the less informative its word structure and vice versa. We argue that this might suggest that, on the one hand, languages encode information in very different (but efficient) ways. On the other hand, content-related and stylistic features are statistically encoded in very similar ways.

The first two international studies on online dictionaries - background information (2014)

Koplenig, Alexander ; Müller-Spitzer, Carolin

Population Size Predicts Lexical Diversity, but so Does the Mean Sea Level – Why It Is Important to Correctly Account for the Structure of Temporal Data (2016)

Koplenig, Alexander ; Müller-Spitzer, Carolin

In order to demonstrate why it is important to correctly account for the (serially dependent) structure of temporal data, we document an apparently spectacular relationship between population size and lexical diversity: for five out of seven investigated languages, there is a strong relationship between population size and lexical diversity of the primary language of the respective country. We show that this relationship is the result of a misspecified model that does not consider the temporal aspect of the data by presenting a similar but nonsensical relationship between the global annual mean sea level and lexical diversity. Given the fact that in the recent past, several studies were published that present surprising links between different economic, cultural, political and (socio-)demographical variables on the one hand and cultural or linguistic characteristics on the other hand, but seem to suffer from exactly this problem, we explain the cause of the misspecification and show that it has profound consequences. We demonstrate how a simple transformation of the time series can often solve problems of this type and argue that the evaluation of the plausibility of a relationship is important in this context. We hope that our paper will help both researchers and reviewers to understand why it is important to use special models for the analysis of data with a natural temporal ordering.

Was zeichnet ein gutes Onlinewörterbuch aus? : Ergebnisse von empirischen Studien zur Wörterbuchbenutzung [What makes a good online dictionary? Results of empirical studies on dictionary use] (2013)

Koplenig, Alexander ; Müller-Spitzer, Carolin

The use of online dictionaries has so far been little researched. At the Institut für Deutsche Sprache in Mannheim, an attempt was made to close this research gap at least in part with a project on dictionary use research (see www.benutzungsforschung.de). Methodologically, the empirical studies were conducted both as online questionnaires, which contained experimental as well as survey elements, and as a laboratory test (using eye tracking). The first study examined in general the occasions and social situations in which online dictionaries are used, as well as the demands users place on them. Almost 700 participants from around the world took part in this bilingual online study (German/English). Because of the strong response to the first study and the resulting wish to deepen the findings empirically, the second study also addressed an international audience and followed on from the first in content. Later studies concentrated on monolingual German online dictionaries such as elexiko (studies 3 and 4) and on the dictionary portal OWID (study 5). The talk presents selected results from the various studies.

Questions of design (2014)

Koplenig, Alexander ; Müller-Spitzer, Carolin

All lexicographers working on online dictionary projects who do not wish to use an established form of design for their online dictionary, or who simply have new kinds of lexicographic data to present, face the problem of what kind of arrangement is best suited for the intended users of the dictionary. In this chapter, we present data about questions relating to the design of online dictionaries. This will provide projects that use these or similar ways of presenting their lexicographic data with valuable information about how potential dictionary users assess and evaluate them. In addition, the answers to corresponding open-ended questions show, detached from concrete design models, which criteria potential users value in a good online representation. Clarity and an uncluttered look seem to dominate in many answers, as well as the possibility of customization, provided the latter does not entail an overly complex usability model.


130 search hits