OPUS 4 | Search

14 search hits

1 to 10

Sort by

"Corpus-driven": Systematische Auswertung automatisch ermittelter sprachlicher Muster (2007)

Annotation and classification of French feedback communicative functions (2015)

Prévot, Laurent ; Gorisch, Jan ; Mukherjee, Sankar

Feedback utterances are among the most frequent in dialogue. Feedback is also a crucial aspect of all linguistic theories that take social interaction involving language into account. However, determining communicative functions is a notoriously difficult task both for human interpreters and systems. It involves an interpretative process that integrates various sources of information. Existing work on communicative function classification comes from either dialogue act tagging where it is generally coarse grained concerning the feed- back phenomena or it is token-based and does not address the variety of forms that feed- back utterances can take. This paper introduces an annotation framework, the dataset and the related annotation campaign (involving 7 raters to annotate nearly 6000 utterances). We present its evaluation not merely in terms of inter-rater agreement but also in terms of usability of the resulting reference dataset both from a linguistic research perspective and from a more applicative viewpoint.

Automatic Prominence Annotation of a German Speech Synthesis Corpus: Towards Prominence-Based Prosody Generation for Unit Selection Synthesis (2010)

Windmann, Andreas ; Wagner, Petra ; Tamburini, Fabio ; Arnold, Denis ; Oertel, Catharine

This paper describes work directed towards the development of a syllable prominence-based prosody generation functionality for a German unit selection speech synthesis system. A general concept for syllable prominence-based prosody generation in unit selection synthesis is proposed. As a first step towards its implementation, an automated syllable prominence annotation procedure based on acoustic analyses has been performed on the BOSS speech corpus. The prominence labeling has been evaluated against an existing annotation of lexical stress levels and manual prominence labeling on a subset of the corpus. We discuss methods and results and give an outlook on further implementation steps.

Combining quantitative and qualitative methods in a study on dictionary use (2018)

Wolfer, Sascha ; Nied Curcio, Martina ; Silva Dias, Idalete Maria ; Müller-Spitzer, Carolin ; Dominguez Vazquez, Maria Jose

Many studies on dictionary use presuppose that users do indeed consult lexicographic resources. However, little is known about what users actually do when they try to solve language problems on their own. We present an observation study where learners of German were allowed to browse the web freely while correcting erroneous German sentences. In this paper, we are focusing on the multi-methodological approach of the study, especially the interplay between quantitative and qualitative approaches. In one example study, we will show how the analysis of verbal protocols, the correction task and the screen recordings can reveal the effects of intuition, language (learning) awareness, and determination on the accuracy of the corrections. In another example study, we will show how preconceived hypotheses about the problem at hand might hinder participants from arriving at the correct solution.

Comparing methods for deriving intensity scores for adjectives (2014)

Ruppenhofer, Josef ; Wiegand, Michael ; Brandes, Jasper

We compare several different corpus- based and lexicon-based methods for the scalar ordering of adjectives. Among them, we examine for the first time a low- resource approach based on distinctive- collexeme analysis that just requires a small predefined set of adverbial modifiers. While previous work on adjective intensity mostly assumes one single scale for all adjectives, we group adjectives into different scales which is more faithful to human perception. We also apply the methods to both polar and non-polar adjectives, showing that not all methods are equally suitable for both types of adjectives.

Corpus-driven description of multi-word patterns (2016)

Steyer, Kathrin

This paper presents our model of ‘MultiWord Patterns’ (MWPs). MWPs are defined as recurrent frozen schemes with fixed lexical components and productive slots that have a holistic – but not necessarily idiomatic – meaning and/or function, sometimes only on an abstract level. These patterns can only be reconstructed with corpus-driven, iterative (qualitative-quantitative) methods. This methodology includes complex phrase searches, collocation analysis that not only detects significant word pairs, but also significant syntagmatic cotext patterns and slot analysis with our UWV Tool. This tool allows us to bundle KWICs in order to detect the nature of lexical fillers for and to visualize MWP hierarchies.

Das Problem der subsumtionslogischen Konstitution von Interkulturalität (1995)

Schmitt, Reinhold ; Keim, Inken

Ziel des Beitrags sind methodisch orientierte Überlegungen zur Analyse interkultureller Kommunikation, deren Relevanz wir an einem Beispielfall demonstrieren wollen. Wir beziehen uns dabei auf das Datenmaterial und den Beitrag von Slavo Ondrejoviö in diesem Band. Angeregt zu diesem Beitrag wurden wir durch unsere zeitweilige Teilnahme an den Sitzungen der Forschungsgruppe. Im Zusammenhang mit dieser Thematik kamen immer wieder Probleme der interkulturellen Kommunikation zur Sprache, konnten aber in dem der Gruppe vorgegebenen Rahmen nicht als eigenständiger Untersuchungsgegenstand behandelt werden. Wir wollen uns nun im Folgenden mit einigen methodisch-methodologischen Problemen der Analyse interkultureller Kommunikation befassen.

Die Gesprächspause: Verbale "Auszeiten" aus multimodaler Perspektive (2004)

Schmitt, Reinhold

Der Beitrag hat zwei Schwerpunkte: Zum einen wird die interaktive Struktur einer 12 Sekunden dauernden Pause und deren soziale Bedeutung in einem spezifischen Kontext aus gesprächsanalytischgesprächsrhetorischer Perspektive analysiert. Damit wird ein interaktives Phänomen zum Gegenstand der Untersuchung, das – aus triftigen Gründen und in motivierter Weise – in der bisherigen gesprächsanalytischen Forschung einen eher peripheren Stellenwert besaß: der temporäre Verzicht der Interaktionsbeteiligten auf verbalen Ausdruck. Zum anderen wird diese Abwesenheit von Verbalität zum Anlass genommen, einen Aspekt zu thematisieren, der sich meines Erachtens auf die künftige Entwicklung der Gesprächsanalyse gravierend auswirken wird: die Tatsache, dass die empirische Grundlage gesprächsanalytischer Untersuchungen immer häufiger aus audiovisuellen Daten besteht. Diese „visuelle Revolution“, die durch die technologische Entwicklung in Form problemlos einsetzbarer Videokameras angestoßen wird, hat weit reichende Konsequenzen für die Theorie, Methodologie und Methode eines Analyseansatzes, der sich bislang und programmatisch bei der Analyse interaktiver Ordnungsstrukturen primär auf Verbalität konzentriert hat.

Korpusanalytische Zugänge zu sprachlichem Usus (2008)

Belica, Cyril ; Steyer, Kathrin

CONTRIBUTIONS TO THE STUDY OF GERMAN USAGE A CORPUS-BASED APPROACH This paper outlines some basic assumptions and principles underlying the corpus linguistics research and some application domains at the Institute for German Language in Mannheim. We briefly address three complementary but closely related tasks: first, the acquisition of very large corpora, second, the research on statistical methods for automatically extracting information about associations between word configurations, and, third, meeting the challenge of understanding the explanatory power of such methods both in theoretical linguistics and in other fields such as second language acquisition or lexicography. We argue that a systematic statistical analysis of huge bodies of text can reveal substantial insights into the language usage und change, far beyond just collocational patterning.

Methodological approaches to people's notions of spoken Standard German (2016)

Koplenig, Alexander ; Knöbl, Ralf ; Deppermann, Arnulf

This paper explores speakers’ notions of the situational appropriacy of linguistic variants. We conducted a web-based survey in which we collected ratings of the appropriacy of variants of linguistic variables in spoken German. A range of quantitative methods (cluster analysis, factor analysis and various forms of visualization techniques) is applied in order to analyze metalinguistic awareness and the differences in the evaluation of written vs. spoken stimuli. First, our data show that speakers’ ratings of the appropriacy of linguistic variants vary reliably with two rough clusters representing formal and informal speech situations and genres. The findings confirm that speakers adhere to a notion of spoken standard German which takes genre and register-related variation into account. Secondly, our analysis reveals a written language bias: metalinguistic awareness is strongly influenced by the physical mode of the presentation of linguistic items (spoken vs. written).

1 to 10

Open Access

Refine

Author

Year of publication

Document Type

Language

Has Fulltext

Is part of the Bibliography

Keywords

Publicationstate

Reviewstate

Publisher

14 search hits