Volltext-Downloads (blau) und Frontdoor-Views (grau)

Automatic generation of lexica for sentiment polarity shifters

  • Alleviating pain is good and abandoning hope is bad. We instinctively understand how words like alleviate and abandon affect the polarity of a phrase, inverting or weakening it. When these words are content words, such as verbs, nouns, and adjectives, we refer to them as polarity shifters. Shifters are a frequent occurrence in human language and an important part of successfully modeling negation in sentiment analysis; yet research on negation modeling has focused almost exclusively on a small handful of closed-class negation words, such as not, no, and without. A major reason for this is that shifters are far more lexically diverse than negation words, but no resources exist to help identify them. We seek to remedy this lack of shifter resources by introducing a large lexicon of polarity shifters that covers English verbs, nouns, and adjectives. Creating the lexicon entirely by hand would be prohibitively expensive. Instead, we develop a bootstrapping approach that combines automatic classification with human verification to ensure the high quality of our lexicon while reducing annotation costs by over 70%. Our approach leverages a number of linguistic insights; while some features are based on textual patterns, others use semantic resources or syntactic relatedness. The created lexicon is evaluated both on a polarity shifter gold standard and on a polarity classification task.

Export metadata

Additional Services

Share in Twitter Search Google Scholar


Author:Marc SchulderORCiDGND, Michael WiegandGND, Josef RuppenhoferGND
Parent Title (English):Natural Language Engineering
Publisher:Cambridge University Press
Place of publication:Cambridge
Document Type:Article
Year of first Publication:2020
Date of Publication (online):2020/07/31
Tag:lexical semantics; lexicon generation; negation content words; sentiment analysis; sentiment polarity
GND Keyword:Klassifikation; Lexikalische Semantik; Lexikon; Maschinelles Lernen; Negativer Polaritätsausdruck; Polarität
First Page:153
Last Page:179
DDC classes:400 Sprache / 400 Sprache, Linguistik
Open Access?:ja
Leibniz-Classification:Sprache, Linguistik
Licence (English):License LogoCreative Commons - Attribution 4.0 International