OPUS 4 | Search

"... durch Worte heilen" - Linguistik und Psychotherapie (2016)

Marciniak, Agnieszka ; Nikendei, Christoph ; Ehrenthal, Johannes C. ; Spranz-Fogasy, Thomas

A CUP of CoFee: A Large Collection of Feedback Utterances Provided with Communicative Function Annotations (2016)

Prévot, Laurent ; Gorisch, Jan ; Bertrand, Roxane

There have been several attempts to annotate communicative functions to utterances of verbal feedback in English previously. Here, we suggest an annotation scheme for verbal and non-verbal feedback utterances in French including the categories base, attitude, previous and visual. The data comprises conversations, maptasks and negotiations from which we extracted ca. 13,000 candidate feedback utterances and gestures. 12 students were recruited for the annotation campaign of ca. 9,500 instances. Each instance was annotated by between 2 and 7 raters. The evaluation of the annotation agreement resulted in an average best-pair kappa of 0.6. While the base category with the values acknowledgement, evaluation, answer, elicit and other achieves good agreement, this is not the case for the other main categories. The data sets, which also include automatic extractions of lexical, positional and acoustic features, are freely available and will further be used for machine learning classification experiments to analyse the form-function relationship of feedback.

Annotating Discourse Relations in Spoken Language: A Comparison of the PDTB and CCR Frameworks (2016)

Rehbein, Ines ; Scholman, Merel ; Demberg, Vera

In discourse relation annotation, there is currently a variety of different frameworks being used, and most of them have been developed and employed mostly on written data. This raises a number of questions regarding interoperability of discourse relation annotation schemes, as well as regarding differences in discourse annotation for written vs. spoken domains. In this paper, we describe ouron annotating two spoken domains from the SPICE Ireland corpus (telephone conversations and broadcast interviews) according todifferent discourse annotation schemes, PDTB 3.0 and CCR. We show that annotations in the two schemes can largely be mappedone another, and discuss differences in operationalisations of discourse relation schemes which present a challenge to automatic mapping. We also observe systematic differences in the prevalence of implicit discourse relations in spoken data compared to written texts,find that there are also differences in the types of causal relations between the domains. Finally, we find that PDTB 3.0 addresses many shortcomings of PDTB 2.0 wrt. the annotation of spoken discourse, and suggest further extensions. The new corpus has roughly theof the CoNLL 2015 Shared Task test set, and we hence hope that it will be a valuable resource for the evaluation of automatic discourse relation labellers.

Aufgabenorientierung Jungen - Küchenschatz [Transkript 5.1] (2016)

Torres Cajo, Sarah

Bericht über die 19. Arbeitstagung zur Gesprächsforschung am Institut für Deutsche Sprache (Mannheim) vom 16.-18. März 2016, Rahmenthema: Diskursmarker (2016)

Koblischke, Kristina

Bericht über die 19. Arbeitstagung zur Gesprächsforschung vom 16. bis 18. März 2016 in Mannheim (2016)

Koblischke, Kristina

Comparaison de deux marqueurs d’affirmation dans des séquences de co-construction: voilà et genau (2016)

Oloff, Florence

This contribution investigates the German response particle genau and the French response particle voilà within collaborative turn sequences in videotaped ordinary conversations. Adopting a conversation analytic approach to cross-linguistic comparison, I will show that the basic epistemic value of both particles allows them to be used in similar sequential environments. When a co-participant formulates a candidate conclusion in environments where it can be easily inferred from previous talk, first speakers may confirm the adequacy of the pre-emptive completion by voilà or genau. These particles may then also be followed by self- or other-repeats. The analyses aim to illustrate that participants rely on a variety of practices in order to positively assess a pre-emptive completion, and to refute a supposed binary opposition of refusal vs. acceptance in the receipt slot.

Construction and dissemination of a corpus of spoken interaction - tools and workflows in the FOLK project (2016)

Schmidt, Thomas

This paper is about the workflow for construction and dissemination of FOLK (Forschungs - und Lehrkorpus Gesprochenes Deutsch – Research and Teaching Corpus of Spoken German), a large corpus of authentic spoken interaction data, recorded on audio and video. Section 2 describes in detail the tools used in the individual steps of transcription, anonymization, orthographic normalization, lemmatization and POS tagging of the data, as well as some utilities used for corpus management. Section 3 deals with the DGD (Datenbank für Gesprochenes Deutsch - Database of Spoken German) as a tool for distributing completed data sets and making them available for qualitative and quantitative analysis. In section 4, some plans for further development are sketched.

Datenbank für Gesprochenes Deutsch (DGD) (2016)

Schmidt, Thomas

Einführung in die Benutzung der Ressourcen DGD und FOLK für gesprächsanalytische Zwecke. Handreichung: "sprich" als Reformulierungsindikator (2016)

Kaiser, Julia ; Schmidt, Thomas

Diese Handreichung stellt die Datenbank für Gesprochenes Deutsch (DGD) und speziell das Forschungs- und Lehrkorpus Gesprochenes Deutsch (FOLK) als Instrumente gesprächsanalytischer Arbeit vor. Nach einem kurzen einführenden Überblick werden anhand des Beispiels "sprich" als Diskursmarker bzw. Reformulierungsindikator Schritt für Schritt die Ressourcen und Tools für systematische korpus- und datenbankgesteuerte Recherchen und Analysen vorgestellt und illustriert.

Einführung in die Benutzung der Ressourcen DGD und FOLK für gesprächsanalytische Zwecke. Handreichung: Einfache Recherche-Anfragen als Übungsbeispiele (2016)

Kaiser, Julia ; Schmidt, Thomas

Diese Handreichung stellt die Datenbank für Gesprochenes Deutsch (DGD) und speziell das Forschungs- und Lehrkorpus Gesprochenes Deutsch (FOLK) als Instrumente gesprächsanalytischer Arbeit vor. Nach einem kurzen einführenden Überblick werden anhand vier verschiedener Beispiele Schritt für Schritt die Ressourcen und Tools für systematische korpus- und datenbankgesteuerte Recherchen und Analysen vorgestellt und illustriert.

Einführung in die Benutzung der Ressourcen DGD und FOLK für gesprächsanalytische Zwecke. Handreichung: Metapragmatische Modalisierungen. (2016)

Kaiser, Julia ; Schmidt, Thomas

Diese Handreichung stellt die Datenbank für Gesprochenes Deutsch (DGD) und speziell das Forschungs- und Lehrkorpus Gesprochenes Deutsch (FOLK) als Instrumente gesprächsanalytischer Arbeit vor. Nach einem kurzen einführenden Überblick werden anhand des Beispiels metapragmatischer Modalisierungen mit den Adverbien "sozusagen" und "gewissermaßen" und mit der Formel "in Anführungszeichen/-strichen" Schritt für Schritt die Ressourcen und Tools für systematische korpus- und datenbankgesteuerte Recherchen und Analysen vorgestellt und illustriert.

FOLK-Gold ― A gold standard for part-of-speech-tagging of spoken German (2016)

Westpfahl, Swantje ; Schmidt, Thomas

In this paper, we present a GOLD standard of part-of-speech tagged transcripts of spoken German. The GOLD standard data consists of four annotation layers – transcription (modified orthography), normalization (standard orthography), lemmatization and POS tags – all of which have undergone careful manual quality control. It comes with guidelines for the manual POS annotation of transcripts of German spoken data and an extended version of the STTS (Stuttgart Tübingen Tagset) which accounts for phenomena typically found in spontaneous spoken German. The GOLD standard was developed on the basis of the Research and Teaching Corpus of Spoken German, FOLK, and is, to our knowledge, the first such dataset based on a wide variety of spontaneous and authentic interaction types. It can be used as a basis for further development of language technology and corpus linguistic applications for German spoken language.

Fragmente Jungen - Gewalterfahrung (King Ali) [Transkript 6.2] (2016)

Torres Cajo, Sarah

Fragmente Jungen - Gewalterfahrung (Schlägerei) [Transkript 6.3] (2016)

Torres Cajo, Sarah

Fragmente Jungen [Transkript 6.4] (2016)

Arens, Katja

Gesprochene Sprache in DaF-Lernwörterbüchern (2016)

Meliss, Meike

Good practices in the compilation of FOLK, the research and teaching corpus of spoken German (2016)

Schmidt, Thomas

The paper presents practices in the compilation of FOLK, the Research and Teaching Corpus of Spoken German, a large collection of spontaneous verbal interaction from diverse discourse domains. After introducing the aims and organisational circumstances of the construction of FOLK, the general idea discussed is that good practices cannot be developed without considering methodological, technological and organisational aspects on equal footing. Starting from this idea, this paper inspects more closely some actual practices in FOLK, namely the handling of legal (especially privacy protection) issues, the decisions taken for the transcription and annotation workflow, and the question of how to best disseminate a corpus like FOLK. The final section sketches some possible future improvements for practices in FOLK.

Im Zweifel für den Zweifel: Praktiken des Zweifelns (2016)

Imo, Wolfgang

Aufgrund der Tatsache, dass wir häufig Zweifel an Behauptungen von Gesprächspartnern, an Sachverhalten, an Wahrheitsgehalten von Aussagen etc. hegen, ist davon auszugehen, dass wir entsprechend über mehr oder weniger stark verfestigte interaktionale Praktiken des Anzeigens und des Behebens von Zweifeln verfügen, die Problemlösungsroutinen für die Bearbeitung von Zweifeln bereitstellen. Anhand einer empirischen Untersuchung von gesprochenem Alltagsdeutsch (ca. dreieinhalb Stunden Audiomaterial) soll versucht werden, exemplarisch solche Praktiken des Zweifelns im Deutschen zu beschreiben.

Martin Pfeiffer: Selbstreparaturen im Deutschen. Syntaktische und interaktionale Analysen und Laura Di Venanzio: Die Syntax von Selbstreparaturen. Sprach- und erwerbsspezifische Reparaturorganisation im Deutschen und Spanischen [Rezension] (2016)

Zifonun, Gisela

Methodological approaches to people's notions of spoken Standard German (2016)

Koplenig, Alexander ; Knöbl, Ralf ; Deppermann, Arnulf

This paper explores speakers’ notions of the situational appropriacy of linguistic variants. We conducted a web-based survey in which we collected ratings of the appropriacy of variants of linguistic variables in spoken German. A range of quantitative methods (cluster analysis, factor analysis and various forms of visualization techniques) is applied in order to analyze metalinguistic awareness and the differences in the evaluation of written vs. spoken stimuli. First, our data show that speakers’ ratings of the appropriacy of linguistic variants vary reliably with two rough clusters representing formal and informal speech situations and genres. The findings confirm that speakers adhere to a notion of spoken standard German which takes genre and register-related variation into account. Secondly, our analysis reveals a written language bias: metalinguistic awareness is strongly influenced by the physical mode of the presentation of linguistic items (spoken vs. written).

Narrationen – Erotische Fremderfahrungen [Transkript 4.2] (2016)

Torres Cajo, Sarah

Provokation Mädchen [Transkript 2.2] (2016)

Torres Cajo, Sarah

Reformulierungsindikatoren im gesprochenen Deutsch: Die Benutzung der Ressourcen DGD und FOLK für gesprächsanalytische Zwecke (2016)

Kaiser, Julia

Dieser Beitrag stellt nach einer kurzen allgemeinen Einführung die Datenbank für Gesprochenes Deutsch (DGD) und das Forschungs- und Lehrkorpus Gesprochenes Deutsch (FOLK) als Instrumente speziell für gesprächsanalytisches Arbeiten vor. Anhand des Beispiels sprich als Diskursmarker für Reformulierungen werden Schritt für Schritt die Ressourcen und Tools für systematische korpus- und datenbankgesteuerte Recherchen illustriert: Nutzungsmöglichkeiten der Token-, Kontext-, Metadaten- und Positionssuche werden gezeigt, jeweils in Bezug auf und im wechselseitigen Verhältnis mit qualitativen Fallanalysen, auch mit Belegannotationen nach analyserelevanten (strukturellen und funktionalen) Kategorien. Schließlich wird das heißt als weiterer Reformulierungsindikator für eine vergleichende Analyse herangezogen. Dieser Beitrag stellt eine detailliertere Ausarbeitung einer kürzeren, eher technisch-didaktischen Online-Handreichung (Kaiser/ Schmidt 2016) zu diesem Thema dar, und hat einen stärker inhaltlich-analytischen Fokus.

Small-Talk Jungen [Transkript 3.1] (2016)

Arens, Katja

Spaß haben Mädchen [Transkript 1.1] (2016)

Torres Cajo, Sarah

The IFCASL Corpus of French and German Non-native and Native Read Speech (2016)

Trouvain, Jürgen ; Bonneau, Anne ; Colotte, Vincent ; Fauth, Camille ; Fohr, Dominique ; Jouvet, Denis ; Jügler, Jeanin ; Laprie, Yves ; Mella, Odile ; Möbius, Bernd ; Zimmerer, Frank

The IFCASL corpus is a French-German bilingual phonetic learner corpus designed, recorded and annotated in a project on individualized feedback in computer-assisted spoken language learning. The motivation for setting up this corpus was that there is no phonetically annotated and segmented corpus for this language pair of comparable of size and coverage. In contrast to most learner corpora, the IFCASL corpus incorporate data for a language pair in both directions, i.e. in our case French learners of German, and German learners of French. In addition, the corpus is complemented by two sub-corpora of native speech by the same speakers. The corpus provides spoken data by about 100 speakers with comparable productions, annotated and segmented on the word and the phone level, with more than 50% manually corrected data. The paper reports on inter-annotator agreement and the optimization of the acoustic models for forced speech-text alignment in exercises for computer-assisted pronunciation training. Example studies based on the corpus data with a phonetic focus include topics such as the realization of /h/ and glottal stop, final devoicing of obstruents, vowel quantity and quality, pitch range, and tempo.

The Karl Eberhards Corpus of spontaneously spoken southern German in dialogues - audio and articulatory recordings (2016)

Arnold, Denis ; Tomaschek, Fabian

The current paper presents a corpus containing 35 dialogues of spontaneously spoken southern German, including half an hour of articulography for 13 of the speakers. Speakers were seated in separate recording chambers, mimicking a telephone call, and recorded on individual audio channels. The corpus provides manually corrected word boundaries and automatically aligned segment boundaries. Annotations are provided in the Praat format. In addition to audio recordings, speakers filled out a detailed questionnaire, assessing among others their audio-visual consumption habits.

User, who art thou? User profiling for oral corpus platforms (2016)

Fandrych, Christian ; Frick, Elena ; Hedeland, Hanna ; Iliash, Anna ; Jettka, Daniel ; Meißner, Cordula ; Schmidt, Thomas ; Wallner, Franziska ; Weigert, Kathrin ; Westpfahl, Swantje

This contribution presents the background, design and results of a study of users of three oral corpus platforms in Germany. Roughly 5.000 registered users of the Database for Spoken German (DGD), the GeWiss corpus and the corpora of the Hamburg Centre for Language Corpora (HZSK) were asked to participate in a user survey. This quantitative approach was complemented by qualitative interviews with selected users. We briefly introduce the corpus resources involved in the study in section 2. Section 3 describes the methods employed in the user studies. Section 4 summarizes results of the studies focusing on selected key topics. Section 5 attempts a generalization of these results to larger contexts.

Was denkt der Arzt, was sagt er? Hypothesenbildungsprozesse in einem ärztlichen Gespräch (2016)

Spranz-Fogasy, Thomas

Zur Perspektivierung von verbalen Handlungen und kognitiven Prozessen durch die Verwendung von Bewegungsverben im gesprochenen Deutsch (2016)

Proske, Nadine

Open Access

Refine

Author

Year of publication

Document Type

Language

Has Fulltext

Is part of the Bibliography

Keywords

Publicationstate

Reviewstate

Publisher

31 search hits