Refine
Year of publication
- 2019 (136) (remove)
Document Type
- Article (52)
- Conference Proceeding (30)
- Part of a Book (23)
- Book (15)
- Other (5)
- Review (5)
- Working Paper (4)
- Part of Periodical (2)
Language
- German (76)
- English (57)
- Multiple languages (2)
- Ukrainian (1)
Keywords
- Deutsch (45)
- Korpus <Linguistik> (31)
- Gesprochene Sprache (13)
- Automatische Sprachanalyse (12)
- Konversationsanalyse (9)
- corpus linguistics (9)
- Social Media (7)
- Interaktion (6)
- Sprachstatistik (6)
- corpus processing (6)
Publicationstate
- Veröffentlichungsversion (136) (remove)
Reviewstate
- Peer-Review (74)
- (Verlags)-Lektorat (57)
Publisher
- Leibniz-Institut für Deutsche Sprache (IDS) (26)
- Leibniz-Institut für Deutsche Sprache (19)
- German Society for Computational Linguistics & Language Technology und Friedrich-Alexander-Universität Erlangen-Nürnberg (9)
- Lexical Computing CZ s.r.o. (6)
- Institut für Deutsche Sprache (5)
- de Gruyter (5)
- Heidelberg University Publishing (4)
- Narr (4)
- The Association for Computational Linguistics (4)
- Spektrum der Wissenschaft Verlagsgesellschaft (3)
Smooth turn-taking in conversation depends in part on speakers being able to communicate their intention to hold or cede the floor. Both prosodic and gestural cues have been shown to be used in this context. We investigate the interplay of pitch movements and hand gestures at locations at which speaker change becomes relevant, comparing their use in German and Swedish. We find that there are some shared functions of prosody and gesture with regard to turn-taking in the two languages, but that these shared functions appear to be mediated by the different phonological demands on pitch in the two languages.
Looking at gestures as a means for communication, they can serve conversational participants at several levels. As co-speech gestures, they can add information to the verbally expressed content and they can serve to manage turn-taking. In order to look closer at the interplay between these resources in face-to face conversation, we annotated hand gestures, syntactic completion points and the related turn-organisation, and measured the timing of gesture strokes and their lexical/phrasal referent. In a case study on German, we observe the trend that speakers vary less in gesturelexis on- and offsets when keeping the turn after syntactic completions than at speaker changes, backchannel or other locations of a conversation. This indicates that timing properties of non-verbal cues interact with verbal cues to manage turn-taking.
We examine the new task of detecting derogatory compounds (e.g. curry muncher). Derogatory compounds are much more difficult to detect than derogatory unigrams (e.g. idiot) since they are more sparsely represented in lexical resources previously found effective for this task (e.g. Wiktionary). We propose an unsupervised classification approach that incorporates linguistic properties of compounds. It mostly depends on a simple distributional representation. We compare our approach against previously established methods proposed for extracting derogatory unigrams.
We discuss the impact of data bias on abusive language detection. We show that classification scores on popular datasets reported in previous work are much lower under realistic settings in which this bias is reduced. Such biases are most notably observed on datasets that are created by focused sampling instead of random sampling. Datasets with a higher proportion of implicit abuse are more affected than datasets with a lower proportion.
We present a descriptive analysis on the two datasets from the shared task on Source, Subjective Expression and Target Extraction from Political Speeches (STEPS), the only existing German dataset for opinion role extraction of its size. Our analysis discusses the individual properties of the three components, subjective expressions, sources and targets and their relations towards each other. Our observations should help practitioners and researchers when building a system to extract opinion roles from German data.
A Supervised learning approach for the extraction of opinion sources and targets from German text
(2019)
We present the first systematic supervised learning approach for the extraction of opinion sources and targets on German language data. A wide choice of different features is presented, particularly syntactic features and generalization features. We point out specific differences between opinion sources and targets. Moreover, we explain why implicit sources can be extracted even with fairly generic features. In order to ensure comparability our classifier is trained and tested on the dataset of the STEPS shared task.