OPUS 4 | Textwissenschaft

Textwissenschaft

9 search hits

1 to 9

Sort by

Kombinierbare Textanalyseverfahren für die Korpusannotation und Informationsextraktion (2006)

Klatt, Stefan

Assessing the benefits of partial automatic pre-labeling for frame-semantic annotation (2009)

Rehbein, Ines ; Ruppenhofer, Josef ; Sporleder, Caroline

In this paper, we present the results of an experiment in which we assess the usefulness of partial semi-automatic annotation for frame labeling. While we found no conclusive evidence that it can speed up human annotation, automatic pre-annotation does increase its overall quality.

Ordering adverbs by their scaling effect on adjective intensity (2015)

Ruppenhofer, Josef ; Brandes, Jasper ; Steiner, Petra ; Wiegand, Michael

In recent years, theoretical and computational linguistics has paid much attention to linguistic items that form scales. In NLP, much research has focused on ordering adjectives by intensity (tiny < small). Here, we address the task of automatically ordering English adverbs by their intensifying or diminishing effect on adjectives (e.g. extremely small < very small). We experiment with 4 different methods: 1) using the association strength between adverbs and adjectives; 2) exploiting scalar patterns (such as not only X but Y); 3) using the metadata of product reviews; 4) clustering. The method that performs best is based on the use of metadata and ranks adverbs by their scaling factor relative to unmodified adjectives.

Collocational Information in the FrameNet Database (2002)

Ruppenhofer, Josef ; Baker, Collin F. ; Fillmore, Charles J.

The FrameNet lexical database yields information about collocations and multiword expressions in various ways. In some cases phrasal units have been entered from the start as lexical entries (write down). In other cases headword + preposition pairs can be recognized as special collocations Where the preposition in question is a necessary and lexically specified marker of an argument of the headword + fond of, hostile to). Nominal compounds are annotated with respect to noun or (pertinative) adjective modifiers, some of which are analyzable but also entrenched (wheel chair, fiscal year). Nouns that name aggregates, portions, types, etc., sometimes hold lexically specified relations to their dependents (flock of geese). And event nouns frequently Select the support verbs which permit them to enter into predications (file an objection, enter a plea). A subproject aims at extracting, as structured clusters of lexical items, the minimal semantically central kernel dependency graphs from the set of annotations. Such research will yield not only commonplace groupings (eat: dog, bone) but will also yield hitherto unnoticed collocations within such graphs (answer: you, door) where certain dependency links within them are idiomatic or otherwise lexically special, here answer > door. Collocational information can also be retrieved by various types of queries within our MySQL search tool

Towards Weakly Supervised Resolution of Null Instantiations (2013)

Gorinski, Philip ; Ruppenhofer, Josef ; Sporleder, Caroline

This paper addresses the task of finding antecedents for locally uninstantiated arguments. To resolve such null instantiations, we develop a weakly supervised approach that investigates and combines a number of linguistically motivated strategies that are inspired by work on semantic role labeling and corefence resolution. The performance of the system is competitive with the current state-of-the-art supervised system.

Opinion Holder and Target Extraction for Verb-based Opinion Predicates – The Problem is Not Solved (2015)

Wiegand, Michael ; Schulder, Marc ; Ruppenhofer, Josef

We offer a critical review of the current state of opinion role extraction involving opinion verbs. We argue that neither the currently available lexical resources nor the manually annotated text corpora are sufficient to appropriately study this task. We introduce a new corpus focusing on opinion roles of opinion verbs from the Subjectivity Lexicon and show potential benefits of this corpus. We also demonstrate that state-of-the-art classifiers perform rather poorly on this new dataset compared to the standard dataset for the task showing that there still remains significant research to be done.

IGGSA-STEPS: Shared Task on Source and Target Extraction from Political Speeches (2013)

Ruppenhofer, Josef ; Struß, Julia Maria ; Sonntag, Jonathan ; Grindl, Stefan

In this paper, we report on the definition of a shared task considering source (whose opinion?) and target (about what?) extraction in protocols of the Swiss parliament that will be conducted by the Interest Group on German Sentiment Analysis (IGGSA)1.

IGGSA-STEPS: Shared Task on Source and Target Extraction from Political Speeches (2014)

Ruppenhofer, Josef ; Struß, Julia Maria

Accurate opinion mining requires the exact identification of the source and target of an opinion. To evaluate diverse tools, the research community relies on the existence of a gold standard corpus covering this need. Since such a corpus is currently not available for German, the Interest Group on German Sentiment Analysis decided to create such a resource and make it available to the research community in the context of a shared task. In this paper, we describe the selection of textual sources, development of annotation guidelines, and first evaluation results in the creation of a gold standard corpus for the German language.

Korpuslinguistik – Das unbekannte Wesen oder Mythen über Korpora und Korpuslinguistik (2006)

Perkuhn, Rainer ; Belica, Cyril

Eine angemessene, sachgemäße Diskussion über Stärken und Schwächen, Möglichkeiten und Grenzen der Korpuslinguistik ist überschattet von vielen Mythen, die sich mittlerweile eingebürgert haben und die in vielen Diskussionen – gerade unter Linguisten – immer wieder aufkommen. An dieser Stelle möchten wir einige der verbreitetsten Mythen zusammenstellen und die Hintergründe aus dieser korpuslinguistischen Perspektive erörtern.

1 to 9

Open Access

Textwissenschaft

Refine

Author

Year of publication

Document Type

Language

Has Fulltext

Is part of the Bibliography

Keywords

Publicationstate

Reviewstate

Publisher

9 search hits