Lebenslauf bis 2019 [Curriculum vitae up to 2019]
(2019)
Ulrich Engel recounts the individual stages of his life: as a child in pre-war Germany and as a young soldier, followed by his work as a teacher and his academic career, in particular his role as director of the Institut für Deutsche Sprache in Mannheim. He highlights his work as head of several projects on contrastive bilingual grammars and valency dictionaries. In doing so, he presents his family background as a mirror of socio-political change in pre-war and post-war Germany.
This edited collection provides an overview of linguistic diversity, societal discourses and interaction between majorities and minorities in the Baltic States. It presents a wide range of methods and research paradigms including folk linguistics, discourse analysis, narrative analyses, code alternation, ethnographic observations, language learning motivation, languages in education and language acquisition. Grouped thematically, its chapters examine regional varieties and minority languages (Latgalian, Võro, urban dialects in Lithuania, Polish in Lithuania); the integration of the Russian language and its speakers; and the role of international languages like English in Baltic societies. The editors’ introductory and concluding chapters provide a comparative perspective that situates these issues within the particular history of the region and broader debates on language and nationalism at a time of both increased globalization and ethno-regionalism. This book will appeal in particular to students and scholars of multilingualism, sociolinguistics, language discourses and language policy, and will provide a valuable resource for researchers focusing on the Baltic States, Northern Europe and the post-Soviet world in the related fields of history, political science, sociology and anthropology.
Gute Argumente. Wo beginnen? [Good arguments. Where to begin?]
(2019)
More general verbs in particular show a range of variation in use that is not easily captured in a single coherent picture. Using the verb beginnen (‘to begin’) as an example, the paper shows how the interaction between the structure of the actants and the grammatical regularities works here. It attempts to sketch a coherent picture of this verb in its various uses by combining valency information, argument-structure positions and patterns of usage.
This contribution reports not only on “news from present-day German” but also on “old German” that has so far remained unexplored. Deeply embedded in grammatical structures lie hidden (historical) concepts of gender that go far beyond what linguistics, too one-dimensionally, understands by sexus. Rather, what is at issue is gender, that is, gender orders that assign women and men their social places. Violations are sanctioned by grammatical deviance and ‘misclassifications’. The two nominal classification systems of grammatical gender (die Tunte, das Weib) and declension class (die Vögte vs. die Strolche) are analysed. Thirdly, syntactically entrenched usage patterns in the form of binomials are examined. As fixed sequences of coordinated personal nouns, they encode gender-hierarchical rankings (Mann und Frau, Mama und Papa) and likewise prove to be reflexes of social and gender orders: men (still) precede women, but mothers increasingly precede fathers, and above all Mamas precede Papas.
This contribution deals with the use of connectors that can be integrated into one of their connects (konnektintegrierbare Konnektoren) in spoken German. The analysis takes the adverbial connectors deshalb and deswegen as correlates of the subordinator weil as its example and proceeds from theoretical premises of traditional grammar and conversation research. The use of these connectors is observed in a selection of spoken-language corpus data covering several different genres of everyday and institutional communication.
This paper investigates two verbal constructions containing the German verb verdienen (‘to earn / deserve’), e.g. er verdient sich sein Brot ‘he earns his living’ (lit. he earns himself his bread) and er verdient gewürdigt zu werden ‘he deserves to be appreciated’. It is shown that the notion of analogy allows for motivating some important features of particular constructions with verdienen. Two interpretations of analogy are employed: analogy in the sense of non-hierarchical family resemblance on the one hand, and analogy leading to changes by mapping a structure from one domain to another on the other hand. It is suggested that both verdienen in combination with sich and verdienen in combination with a verbal complement can be accounted for by focusing on the formal and semantic similarities connecting them to other constructions from the same construction family. Moreover, it is shown that versprechen and vermögen could be regarded as analogical models for verdienen.
Julia Kaiser’s remarkably mature monograph, which grew out of a doctoral dissertation, is a solid piece of linguistic work. It is an engaging read that broadens the reader’s horizon and thus enriches many linguists, from the epigones of structuralism through adherents of dependency or construction grammar to representatives of modern semantics. The work focuses on “infinitive-less” (= absolutely used) modal verbs (MV) in spoken German. In detail, it addresses full-verb uses, ellipses, analepses, MV with directional complements and idiomatized absolute uses.
Both compounds and multi-word expressions are complex lexical units, made up of at least two constituents. The most basic difference is that the former are morphological objects and the latter result from syntactic processes. However, the exact demarcation between compounds and multi-word expressions differs greatly from language to language and is often a matter of debate in and across languages. Similarly debated is whether and how these two different kinds of units complement or compete with each other.
The volume presents an overview of compounds and multi-word expressions in a variety of European languages. Central questions that are discussed for each language concern the formal distinction between compounds and multi-word expressions, their formation and their status in lexicon and grammar.
The volume contains chapters on German, English, Dutch, French, Italian, Spanish, Greek, Russian, Polish, Finnish, and Hungarian as well as a contrastive overview with a focus on German. It brings together insights from word-formation theory, phraseology and theory of grammar and aims to contribute to the understanding of the lexicon, both from a language-specific and cross-linguistic perspective.
Common Crawl is a very large, heterogeneous multilingual corpus composed of crawled documents from the internet, surpassing 20TB of data and distributed as a set of more than 50 thousand plain text files, each of which contains many documents written in a wide variety of languages. Even though each document has a metadata block associated with it, this data lacks any information about the language in which each document is written, making it extremely difficult to use Common Crawl for monolingual applications. We propose a general, highly parallel, multithreaded pipeline to clean and classify Common Crawl by language; we specifically design it so that it runs efficiently on medium to low resource infrastructures where I/O speeds are the main constraint. We develop the pipeline so that it can be easily reapplied to any kind of heterogeneous corpus and so that it can be parameterised to a wide range of infrastructures. We also distribute a 6.3TB version of Common Crawl, filtered, classified by language, shuffled at line level in order to avoid copyright issues, and ready to be used for NLP applications.
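The idea of classifying a sharded corpus by language in parallel can be illustrated with a minimal sketch. It is not the pipeline described in the abstract: the stopword-based `guess_language` function is a toy stand-in for a real language identifier, and all names (`STOPWORDS`, `classify_shard`, `classify_corpus`) are hypothetical. Shards are in-memory lists here; in an I/O-bound setting each worker would instead read one file.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor

# Toy stopword lists: an assumed stand-in for a real language identifier.
STOPWORDS = {
    "en": {"the", "and", "of", "is", "to"},
    "de": {"der", "die", "und", "ist", "das"},
    "fr": {"le", "la", "et", "est", "les"},
}


def guess_language(line):
    """Return the language whose stopwords overlap most with the line."""
    words = set(line.lower().split())
    scores = {lang: len(words & sw) for lang, sw in STOPWORDS.items()}
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else "unknown"


def classify_shard(lines):
    """Drop empty lines and bucket the remaining lines of one shard by language."""
    buckets = defaultdict(list)
    for line in lines:
        line = line.strip()
        if line:
            buckets[guess_language(line)].append(line)
    return buckets


def classify_corpus(shards, workers=4):
    """Classify shards concurrently and merge the per-language buckets."""
    merged = defaultdict(list)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map yields results in shard order, so merging is deterministic.
        for buckets in pool.map(classify_shard, shards):
            for lang, lines in buckets.items():
                merged[lang].extend(lines)
    return dict(merged)
```

Threads are used because the abstract names I/O as the main constraint; for CPU-bound classification in Python, a process pool would be the more usual choice.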
Text corpora come in many different shapes and sizes and carry heterogeneous annotations, depending on their purpose and design. The true benefit of corpora is rooted in their annotation and the method by which this data is encoded is an important factor in their interoperability. We have accumulated a large collection of multilingual and parallel corpora and encoded it in a unified format which is compatible with a broad range of NLP tools and corpus linguistic applications. In this paper, we present our corpus collection and describe a data model and the extensions to the popular CoNLL-U format that enable us to encode it.
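For orientation, standard CoNLL-U stores one token per line in ten tab-separated columns (ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC), with `#` comment lines and a blank line between sentences. The sketch below reads that base format only; the extensions mentioned in the abstract are not specified here and are not modelled. The function name `parse_conllu` and the sample sentence are illustrative assumptions.

```python
# The ten standard CoNLL-U columns, in order.
FIELDS = ("id", "form", "lemma", "upos", "xpos",
          "feats", "head", "deprel", "deps", "misc")


def parse_conllu(text):
    """Parse CoNLL-U text into a list of (comments, tokens) pairs.

    Each token is a dict keyed by the lowercase column names in FIELDS.
    """
    sentences, comments, tokens = [], [], []
    for line in text.splitlines():
        if not line.strip():
            # A blank line closes the current sentence.
            if tokens:
                sentences.append((comments, tokens))
            comments, tokens = [], []
        elif line.startswith("#"):
            comments.append(line[1:].strip())
        else:
            cols = line.split("\t")
            if len(cols) != len(FIELDS):
                raise ValueError(f"expected 10 columns, got {len(cols)}: {line!r}")
            tokens.append(dict(zip(FIELDS, cols)))
    if tokens:  # input may lack a trailing blank line
        sentences.append((comments, tokens))
    return sentences


# Hypothetical sample sentence, built with explicit tabs for clarity.
SAMPLE = "\n".join([
    "# text = Sie beginnt.",
    "\t".join(["1", "Sie", "sie", "PRON", "_", "_", "2", "nsubj", "_", "_"]),
    "\t".join(["2", "beginnt", "beginnen", "VERB", "_", "_", "0", "root", "_", "SpaceAfter=No"]),
    "\t".join(["3", ".", ".", "PUNCT", "_", "_", "2", "punct", "_", "_"]),
    "",
])
```

Calling `parse_conllu(SAMPLE)` yields one sentence whose tokens can be queried by column name, for example the lemma of the second token.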
As the Web ought to be considered as a series of sources rather than as a source in itself, a problem facing corpus construction resides in meta-information and categorization. In addition, we need focused data to shed light on particular subfields of the digital public sphere. Blogs are relevant to that end, especially if the resulting web texts can be extracted along with metadata and made available in coherent and clearly describable collections.
Nearly all of the very large corpora of English are “static”, which allows a wide range of one-time, pre-processed data, such as collocates. The challenge comes with large “dynamic” corpora, which are updated regularly, and where preprocessing is much more difficult. This paper provides an overview of the NOW corpus (News on the Web), which is currently 8.2 billion words in size, and which grows by about 170 million words each month. We discuss the architecture of NOW, and provide many examples that show how data from NOW can (uniquely) be extracted to look at a wide range of ongoing changes in English.
How are words written in German and in English? Where are the commonalities, and where are the differences? These questions are addressed from a morphological-graphematic perspective. The concern here is therefore not with relations between writing and sound form (traditionally often the focus of graphematics) but with correspondences between writing and morphology. This concerns, on the one hand, the structure of morphemes. What constraints can be formulated on the sequence of letters? What are minimal and what are prototypical stems and affixes? On the other hand, it concerns questions of uniformity (how uniformly is a morpheme represented in writing?) and unambiguity (how distinctly does a spelling point to a morpheme?). Overall, it emerges that in English affixes tend to be encoded reliably (often unambiguously and uniformly), whereas in German stems are frequently encoded uniformly. These are two fundamentally different strategies for facilitating reading.
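Sprechen im Umbruch. Zeitzeugen erzählen und argumentieren rund um den Fall der Mauer im Wendekorpus [Speaking in times of upheaval. Contemporary witnesses narrate and argue about the fall of the Wall in the Wendekorpus]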
(2019)
This contribution aims to describe privacy, publicness and anonymity as essential analytic dimensions for media linguistic research. The dimensions are not inherent in and predetermined by the technical features and forms of communication provided by mobile devices, but are used by the participants as an orientation grid for shaping their online and offline practices in and with mobile media. Considering both mobile device use in the public realm and the dissemination of increasingly private content in social media (which is said to lead to ‘blurred boundaries’ between the private and the public), the paper provides a brief overview of the main developments in mobile media research: Studies adopting various approaches – e.g. sociological-ethnographic, linguistic and media studies – illustrate how publicness, privacy and anonymity are actively shaped and brought about by mobile media users in face-to-face and remote social encounters. As this shows that publicness, privacy and anonymity are still relevant concepts for users, future media linguistics studies should focus on the dynamic multimodal practices by which they are contextualized and accomplished.
This paper investigates the use in everyday spoken German of the present subjunctive (Konjunktiv I), which is traditionally labelled a feature of standard written language and therefore regarded as typical of communication genres based on it, such as press texts and reporting. Through an analysis of corpus data performed according to the theory and methods of Interactional Linguistics and encompassing private, institutional and public interactional domains, the paper shows how this particular verb form expresses different epistemic stances depending on its syntactic embedding.
Central complements: good arguments are self-explanatory.
Together with their central complements, verbs model basic patterns of interaction. The constellations of these complements in turn correspond to central patterns of the argument structure. Nominative and accusative complements formally occupy the first and second positions (subject and object), but they also have certain semantic preferences. The formal function of the dative is less pronounced; where it occurs (ditransitive verbs), the semantic imprint of the frame (“transfer”) is very strong. This corresponds to the meaning of a core group of corresponding verbs. Other verbs that allow this pattern are used more often in other valence structures, and the ditransitive use appears as a systematic way of personal extension of object-related activities. This will be discussed with reference to the verbs zeigen and (in a different way) lehren.
The present research unites two emergent trends in the area of language attitudes: (a) research on perceptions of nonnative speakers by nonnative listeners and (b) the search for general, basic mechanisms underlying the evaluation of nonnative accented speakers. In three experiments featuring an employment situation, German participants listened to a presentation given in English by a German speaker with a strong versus native-like accent (in Studies 1–3) versus a native speaker of English (in Study 1). They evaluated candidates with a strong accent worse than candidates with a native(-like) pronunciation—even to the degree that the quality of arguments was of no relevance (Study 1). Study 2 introduces an effective intervention to reduce these discriminatory tendencies. Across studies, affect and competence emerged as major mediators of hirability evaluations. Study 3 further revealed sequential indirect influences, which advance our understanding of previous inconsistent findings regarding disfluency and warmth perceptions.
We report on a new project building a Natural Language Processing resource for Zulu by making use of resources already available. By semi-automatically combining tagging results with the results of morphological analysis, we expect to reduce the amount of manual work when generating a fine-grained gold standard corpus usable for training a tagger. From the tagged corpus, we plan to extract verb-argument pairs with the aim of compiling a verb valency lexicon for Zulu.
This paper focuses on so-called syntactic projection phenomena in the German language. This term from the German Gesprächsforschung refers to the fact that an utterance or part of it foreshadows another one. This paper aims at pointing out how such projection phenomena are consciously exploited for rhetorical purposes. This will be observed on the basis of excerpts from the Stuttgart 21 mediation talks. The linguistic analysis carried out in this paper will focus on syntactic projection phenomena involving the use of the causal adverbial connectives deshalb and deswegen.
Research on language politics, policy, and planning is of importance to contact linguistics, since political relations between groups of language users, the way in which the use of language(s) is organized, and how language issues are politicized fundamentally shape the political and social conditions under which language varieties are in contact. This chapter first provides a short sketch of how language policy, planning, and politics have so far been conceptualized. Major subfields will be discussed, and then relevant actors and factors in these processes will be introduced. At the end, these aspects will be discussed from a contact linguistic perspective and summarized in a graphic visualization.
Studies on the Linguistic Landscapes (LLs) investigate frequencies, functions, and power relations between languages and their speakers in public space. Research on the LL thereby aims to understand how the production and perception of signs reflect and simultaneously shape realities. In this sense, the LL is one of the most dynamic places where processes of minoritization take place: the (in)visibility of minority languages and the functional and symbolic relationships to majority languages are in direct relationship with negotiations of minorities’ place in society. This chapter looks at minority languages in the LL from two major perspectives. Firstly, it discusses language policies, focussing on which policy categories and which domains of language use are of particular relevance for understanding minority languages in the LL. Then, it turns to issues of conflict, contestation, and exclusion by providing examples from a range of geographically and typologically prototypical case studies, including Israel, Canada, Belgium, the Basque Country, and Friesland.
Resistance and adaptation to newspeakerness in educational institutions: two tales from Estonia
(2019)
The term ‘new speaker’ has recently emerged as an attempt by sociolinguists not only to understand the different types of speaker profiles that can be found in contemporary societies, but also to grasp the underlying processes of becoming a legitimate speaker in a given society. In this article, we combine the results from two studies situated in two educational institutions in Estonia in order to find out about speakers’ language attitudes and experiences in connection to learning and using Estonian. We concentrate on members of the international community who have relatively recently arrived in the country. Our results indicate that these speakers fluctuate between two prototypical discourses, which we broadly dub as ‘resistance’ and ‘adaptation’ to newspeakerness. Our study thereby adds to current debates on ‘new speaker’ and language policy issues by illustrating how tensions around language legitimacy are played out on the ground in a small nation state such as Estonia.
This chapter investigates differences in language regards in Latvia and Estonia. Based on the results of a survey that had about 1000 respondents in each country, it analyses general views on languages and language-learning motivation, as well as specific regards of Estonian, Latvian, Russian, English, German and other languages. The results show that languages and language learning are generally important for the respondents; language-learning motivation is overwhelmingly instrumental. Besides the obvious value of the titular languages of each country, English and Russian are to differing degrees considered of importance for professional and leisure purposes, ahead of German, Finnish (in Estonia) and French, whereas other languages are of little relevance. In more emotionally related categories, differences are more salient. L1-speakers of Russian differ in their views from L1-speakers of Estonian and Latvian, indicating that the linguistic acculturation of society in Estonia tends to be more monodirectional towards Estonian, whereas in Latvia there are more bidirectional tendencies as both Latvian and Russian L1-speakers regard each other’s languages as at least moderately relevant.
This chapter introduces readers to the context and concept of this volume. It starts by providing an historical overview of languages and multilingualism in Lithuania, Estonia and Latvia, highlighting the 100th anniversary of statehood which the three Baltic states are celebrating in 2018. Then, the chapter briefly presents important strands of research on multilingualism in the region throughout the past decades; in particular, questions about language policies and the status of the national languages (Estonian, Latvian and Lithuanian) and Russian. It also touches on debates about languages in education and the roles of other languages such as the regional languages of Latgalian and Võro and the changing roles of international languages such as English and German. The chapter concludes by providing short summaries of the contributions to this book.