Refine
Year of publication
- 2017 (370) (remove)
Document Type
- Part of a Book (161)
- Article (101)
- Conference Proceeding (43)
- Book (33)
- Part of Periodical (13)
- Other (7)
- Working Paper (6)
- Report (4)
- Doctoral Thesis (2)
Keywords
- Deutsch (154)
- Korpus <Linguistik> (64)
- Gesprochene Sprache (30)
- Grammatik (22)
- Sprachvariante (22)
- Englisch (14)
- Linguistik (14)
- Sprache (14)
- Diskursanalyse (13)
- Interaktion (13)
Publicationstate
- Veröffentlichungsversion (163)
- Zweitveröffentlichung (87)
- Postprint (20)
- Erstveröffentlichung (1)
- Preprint (1)
Reviewstate
- (Verlags)-Lektorat (135)
- Peer-Review (114)
- Peer-review (12)
- (Verlags-)Lektorat (2)
- Peer-Revied (2)
- (Verlags-)lektorat (1)
- Peer Review (1)
- Qualifikationsarbeit (Dissertation, Habilitationsschrift) (1)
Publisher
- Institut für Deutsche Sprache (56)
- de Gruyter (50)
- Narr Francke Attempto (39)
- Narr (19)
- De Gruyter (17)
- Verlag für Gesprächsforschung (11)
- Stauffenburg (10)
- Hempen (9)
- Springer (6)
- TUDpress (6)
Begegnungen mit neuen Wörtern: Zu lexikografischen Praktiken im Neologismenwörterbuch des IDS
(2017)
Meine folgenden Überlegungen gehen weit über rein „linguistische Theorien und Methoden" hinaus. Sie beziehen sich auch weniger als seine auf innersprachliche Fragen und mehr auf sprachensoziologische und -politische. Allerdings entziehen sie sich auch damit nicht Poppers pauschalem Urteil, die mit „human society and human history" befassten Wissenschaften seien generell unfähig zu Prognosen - im Gegensatz zu manchen (wenn auch nicht allen) Naturwissenschaften. Abgesehen davon räume ich für das Folgende jedoch gerne Abstriche ein vom Grad der von Popper für Prognosen offenbar vorausgesetzten Zuverlässigkeit und Exaktheit. Sie entsprechen auch verbreiteten Auffassungen, dass sich die Weiterentwicklung der Technik zuverlässiger Voraussagen lässt als die der menschlichen Sozialbeziehungen, angesichts unkalkulierbarer „Anarchie und Ignoranz, die das Gefüge unserer Gesellschaft zerstören könnten" (Kaku 2016, S. 33). Bei einer solchen Abschwächung der Ansprüche im Sinne derartiger Vorbehalte erscheint es mir aber dennoch treffender, die folgenden Überlegungen, soweit sie zukunftsgerichtet sind, eher den Prognosen zuzuordnen als den bloßen Prophezeiungen, denen man ja dann - bei ihrer typischen Stütze durch „göttliche Offenbarung" - jegliche theoretische oder faktische, also wissenschaftliche Grundlage absprechen darf. Freilich verliert mit der genannten Abschwächung die Opposition zwischen den Begriffen 'Prognose' und 'Prophezeiung' ihre strenge Disjunktheit und wird in Richtung eines abgestuften oder kontinuierlichen Übergangs aufgelockert. Jedoch widerspricht dies keineswegs gängigem wissenschaftlichen Procedere. Damit nun aber genug an allgemeinen methodischen Vorüberlegungen! Im Übrigen geht es mir im Folgenden weniger um die Auseinandersetzung mit bisherigen Publikationen zum Thema, auch nicht denen des mit diesem Band Geehrten, die - bei einem nicht zu engen Verständnis - in großer Zahl vorliegen, als um die Skizzierung meiner eigenen Einschätzungen.
Das von der Leibniz-Gemeinschaft geförderte Projekt „Lexik des gesprochenen Deutsch“(LeGeDe, Leibniz-Wettbewerb 2016, Förderlinie I: „Innovative Vorhaben“) nahm im September 2016 am Institut für Deutsche Sprache (IDS) seine Arbeit auf.1 Das Hauptziel ist die Erstellung einer korpusbasierten lexikografischen Online-Ressource zur Lexik des gesprochenen Deutsch auf der Grundlage von lexikologischen und gesprächsanalytischen Untersuchungen authentischer gesprochensprachlicher Daten. Als Kooperationsprojekt der Abteilungen Lexik und Pragmatik arbeiten Mitarbeiter/innen aus der Lexikologie, Lexikografie, Interaktionalen bzw. Gesprächslinguistik, Korpus- und Computerlinguistik und den Empirischen Methoden zusammen, wodurch sowohl aus der Sicht der Gesprochene- Sprache-Forschung als auch aus lexikografischer Perspektive eine innovative Form der Sprachbeschreibung entstehen soll.
Der Auftaktworkshop "Lexik des gesprochenen Deutsch: Forschungsstand, Erwartungen und Anforderungen an die Entwicklung einer innovativen lexikografischen Ressource" fand am 16. und 17. Februar 2017 am Institut fur Deutsche Sprache (IDS) in Mannheim statt. Das von der Leibniz-Gemeinschaft geforderte Projekt "Lexik des gesprochenen Deutsch" (=LeGeDe, Leibniz-Wettbewerb 2016, Forderlinie "Innovative Vorhaben") nahm im September 2016 am IDS seine Arbeit auf. Das Hauptziel ist die Erstellung einer korpusbasierten elektronischen Ressource zur Lexik des gesprochenen Deutsch auf der Grundlage von lexikologischen und gesprachsanalytischen Untersuchungen authentischer gesprochensprachlicher Daten.
Sound units play a pivotal role in cognitive models of auditory comprehension. The general consensus is that during perception listeners break down speech into auditory words and subsequently phones. Indeed, cognitive speech recognition is typically taken to be computationally intractable without phones. Here we present a computational model trained on 20 hours of conversational speech that recognizes word meanings within the range of human performance (model 25%, native speakers 20–44%), without making use of phone or word form representations. Our model also generates successfully predictions about the speed and accuracy of human auditory comprehension. At the heart of the model is a ‘wide’ yet sparse two-layer artificial neural network with some hundred thousand input units representing summaries of changes in acoustic frequency bands, and proxies for lexical meanings as output units. We believe that our model holds promise for resolving longstanding theoretical problems surrounding the notion of the phone in linguistic theory.
We present a method to identify and document a phenomenon on which there is very little empirical data: German phrasal compounds occurring in the form of as a single token (without punctuation between their components). Relying on linguistic criteria, our approach implies to have an operational notion of compounds which can be systematically applied as well as (web) corpora which are large and diverse enough to contain rarely seen phenomena. The method is based on word segmentation and morphological analysis, it takes advantage of a data-driven learning process. Our results show that coarse-grained identification of phrasal compounds is best performed with empirical data, whereas fine-grained detection could be improved with a combination of rule-based and frequency-based word lists. Along with the characteristics of web texts, the orthographic realizations seem to be linked to the degree of expressivity.
In conversation, turn-taking is usually fluid, with next speakers taking their turn right after the end of the previous turn. Most, but not all, previous studies show that next speakers start to plan their turn early, if possible already during the incoming turn. The present study makes use of the list-completion paradigm (Barthel et al., 2016), analyzing speech onset latencies and eye-movements of participants in a task-oriented dialogue with a confederate. The measures are used to disentangle the contributions to the timing of turn-taking of early planning of content on the one hand and initiation of articulation as a reaction to the upcoming turn-end on the other hand. Participants named objects visible on their computer screen in response to utterances that did, or did not, contain lexical and prosodic cues to the end of the incoming turn. In the presence of an early lexical cue, participants showed earlier gaze shifts toward the target objects and responded faster than in its absence, whereas the presence of a late intonational cue only led to faster response times and did not affect the timing of participants' eye movements. The results show that with a combination of eye-movement and turn-transition time measures it is possible to tease apart the effects of early planning and response initiation on turn timing. They are consistent with models of turn-taking that assume that next speakers (a) start planning their response as soon as the incoming turn's message can be understood and (b) monitor the incoming turn for cues to turn-completion so as to initiate their response when turn-transition becomes relevant.
In this paper we present the results of an automatic classification of Russian texts into three levels of difficulty. Our aim is to build a study corpus of Russian, in which a L2 student is able to select texts of a desired complexity. We are building on a pilot study, in which we classified Russian texts into two levels of difficulty. In the current paper, we apply the classification to an extended corpus of 577 labelled texts. The best-performing combination of features achieves an accuracy of 0,74 within at most one level difference.
The paper reviews the results of work done in the context of TEI-Lex0, a joint ENeL / DARIAH / PARTHENOS initiative aimed at formulating guidelines for the encoding of retrodigitized dictionaries by streamlining and simplifying the recommendations of the “Print Dictionaries” chapter of the TEI Guidelines. TEI-Lex0 work is performed by teams concentrating on each of the main components of dictionary entries. The work presented here concerns proposals for constraining TEI-based encoding of orthographic, phonetic, and grammatical information on written and spoken forms of the lemma (headword), including auxiliary inflected forms. We also adduce examples of handling various types of orthographic and phonetic variants, as well as examples of handling the representation of inflectional paradigms, which have received less attention in the TEI Guidelines but which are nonetheless essential for properly exposing data content to the various uses that digitized lexica may have.
CoMParS is a resource under construction in the context of the long-term project German Grammar in European Comparison (GDE) at the IDS Mannheim. The principal goal of GDE is to create a novel contrastive grammar of German against the background of other European languages. Alongside German, which is the central focus, the core languages for comparison are English, French, Hungarian and Polish, representing different typological classes. Unlike traditional contrastive grammars available for German, which usually cover language pairs and are based on formal grammatical categories, the new GDE grammar is developed in the spirit of functionalist typology. This implies that, instead of formal criteria, cognitively motivated functional domains in terms of Givón (1984) are used as tertia comparationis. The purpose of CoMParS is to document the empirical basis of the theoretical assumptions of GDE-V and to illustrate the otherwise rather abstract content of grammar books by as many as possible naturally occurring and adequately presented multilingual examples, including information on their use in specific contexts and registers. These examples come from existing parallel corpora, and our presentation will focus on the legal aspects and consequences of this choice of language data.
In this chapter, a conversation-analytic approach is used to study medical recommendations as an essential part of medical advice. Tlte analyses are based on renal treatment planning conversations in which physicians inform patients about an upcoming dialysis therapy. The data reveals that medical recommendations are marked throughout by their strikingly tentative and relativistic phrasing in which the conflict between physicians duty of care and the patient’s autonomy is obvious. The observed discrepancy between what should be said and what patients and physicians want to be said - and heard - not only gives reason to challenge the ethical and legal requirements concerning medical recommendations and their implications for medical practice, but also to rethink the current models of decision-making in medical communication.
Zeitungsartikel mit wirtschaftlichem Inhalt sind nicht immer nach dem Textmuster „Bericht“ geschrieben, sie können auch erzähltechnische Elemente enthalten. Die Autorinnen untersuchen wirtschaftliche Krisenberichterstattungen aus deutschen, schweizerischen und österreichischen (Wochen-)Zeitungen; sie postulieren, dass Bericht und Erzählung nicht dichotomische Textmuster darstellen, sondern Pole einer Skala, auf der die konkreten Texte verortet werden können. Sie differenzieren vier Grade der Narrativität: nicht /schwach/mittel/stark narrativ. Es zeigt sich, dass der Anteil der schwach und mittel narrativen Texte zwischen 1973 und 2010-12 stark zunimmt. Außerdem werden die Positionen der Gesamtnarration „Krise“ ebenfalls je nach Untersuchungszeitraum bzw. Zeitung verschieden besetzt. Insgesamt dient der Einsatz narrativer Techniken dazu, durch eine textuelle Umsetzung der Krankheitsmetapher zunehmend abstraktere Prozesse zu veranschaulichen.
Reden über Geld
(2017)
The paper presents best practices and results from projects dedicated to the creation of corpora of computer-mediated communication and social media interactions (CMC) from four different countries. Even though there are still many open issues related to building and annotating corpora of this type, there already exists a range of tested solutions which may serve as a starting point for a comprehensive discussion on how future standards for CMC corpora could (and should) be shaped like.
The paper reports on the results of a scientific colloquium dedicated to the creation of standards and best practices which are needed to facilitate the integration of language resources for CMC stemming from different origins and the linguistic analysis of CMC phenomena in different languages and genres. The key issue to be solved is that of interoperability – with respect to the structural representation of CMC genres, linguistic annotations metadata, and anonymization/pseudonymization schemas. The objective of the paper is to convince more projects to partake in a discussion about standards for CMC corpora and for the creation of a CMC corpus infrastructure across languages and genres. In view of the broad range of corpus projects which are currently underway all over Europe, there is a great window of opportunity for the creation of standards in a bottom-up approach.