430 Deutsch
Refine
Document Type
- Part of a Book (8)
- Book (1)
- Conference Proceeding (1)
Has Fulltext
- yes (10)
Keywords
- Automatische Sprachverarbeitung (10) (remove)
Publicationstate
Reviewstate
- (Verlags)-Lektorat (5)
- Peer-Review (5)
We study German affixoids, a type of morpheme in between affixes and free stems. Several properties have been associated with them – increased productivity; a bleached semantics, which is often evaluative and/or intensifying and thus of relevance to sentiment analysis; and the existence of a free morpheme counterpart – but not been validated empirically. In experiments on a new data set that we make available, we put these key assumptions from the morphological literature to the test and show that despite the fact that affixoids generate many low-frequency formations, we can classify these as affixoid or non-affixoid instances with a best F1-score of 74%.
In this paper we use methods for creating a large lexicon of verbal polarity shifters and apply them to German. Polarity shifters are content words that can move the polarity of a phrase towards its opposite, such as the verb “abandon” in “abandon all hope”. This is similar to how negation words like “not” can influence polarity. Both shifters and negation are required for high precision sentiment analysis. Lists of negation words are available for many languages, but the only language for which a sizable lexicon of verbal polarity shifters exists is English. This lexicon was created by bootstrapping a sample of annotated verbs with a supervised classifier that uses a set of data- and resource-driven features. We reproduce and adapt this approach to create a German lexicon of verbal polarity shifters. Thereby, we confirm that the approach works for multiple languages. We further improve classification by leveraging cross-lingual information from the English shifter lexicon. Using this improved approach, we bootstrap a large number of German verbal polarity shifters, reducing the annotation effort drastically. The resulting German lexicon of verbal polarity shifters is made publicly available.
Both for psychology and linguistics, emotion concepts are a continuing challenge for analysis in several respects. In this contribution, we take up the language of emotion as an object of study from several angles. First, we consider how frame semantic analyses of this domain by the FrameNet project have been developing over time, due to theory-internal as well as application-oriented goals, towards ever more fine-grained distinctions and greater within-frame consistency. Second, we compare how FrameNet’s linguistically oriented analysis of lexical items in the emotion domain compares to the analysis by domain experts of the experiences that give rise (directly or indirectly) to the lexical items. And finally, we consider to what extent frame semantic analysis can capture phenomena such as connotation and inference about attitudes, which are important in the field of sentiment analysis and opinion mining, even if they do not involve the direct evocation of emotion.
We present the pilot edition of the GermEval Shared Task on the Identification of Offensive Language. This shared task deals with the classification of German tweets from Twitter. It comprises two tasks, a coarse-grained binary classification task and a fine-grained multi-class classification task. The shared task had 20 participants submitting 51 runs for the coarse-grained task and 25 runs for the fine-grained task. Since this is a pilot task, we describe the process of extracting the raw-data for the data collection and the annotation schema. We evaluate the results of the systems submitted to the shared task. The shared task homepage can be found at https://projects.cai. fbi.h-da.de/iggsa/
Offensive language in social media is a problem currently widely discussed. Researchers in language technology have started to work on solutions to support the classification of offensive posts. We present the pilot edition of the GermEval Shared Task on the Identification of Offensive Language. This shared task deals with the classification of German tweets from Twitter. GermEval 2018 is the fourth workshop in a series of shared tasks on German processing.
Der Beitrag gibt zunächst einen Überblick über die Herangehensweisen der Computerlinguistik an das Automatische Textverstehen seit den Siebziger Jahren. Nach einer psycholinguistisch geprägten Frühphase rückten zunächst die wissensbasierten Ansätze der Künstlichen Intelligenz in den Mittelpunkt des Interesses, die allerdings nicht über den Status kleiner experimenteller Systeme hinaus gelangen konnten. Es folgte eine Hinwendung zur Linguistik mit syntaktischer und semantischer Analyse, bevor um 1990 die „statistische Wende“ der Computerlinguistik begann; seither stehen quantitative Verfahren im Vordergrund. Am Beispiel der Aufgabe der Automatischen Textzusammenfassung plädiert der Beitrag dafür, intelligente Verknüpfungen von symbolischen linguistisch-fundierten und statistischen Verfahren zu suchen, um die Robustheit statistischer Verfahren auch durch eine insgesamt höhere Qualität zu verbessern.