Processing Text-Technological Resources in Discourse Parsing

Discourse parsing of complex text types such as scientific research articles requires the analysis of an input document on linguistic and structural levels that go beyond traditionally employed lexical discourse markers. This chapter describes a text-technological approach to discourse parsing. Discourse parsing with the aim of providing a discourse structure is seen as the addition of a new annotation layer for input documents marked up on several linguistic annotation levels. The discourse parser generates discourse structures according to Rhetorical Structure Theory. An overview of the knowledge sources and components for parsing scientific journal articles is given. The parser’s core consists of cascaded applications of the GAP, a Generic Annotation Parser. Details of the chart parsing algorithm are provided, as well as a short evaluation in terms of comparisons with reference annotations from our corpus and with recently developed systems with a similar task.

Metadata
Author:Henning Lobin, Harald Lüngen, Mirco Hilbert, Maja Bärenfänger
URN:urn:nbn:de:bsz:mh39-48317
DOI:https://doi.org/10.1007/978-3-642-22613-7_3
ISBN:978-3-642-22612-0 (Print)
ISBN:978-3-642-22613-7 (Online)
Parent Title (English):Modeling, Learning, and Processing of Text-Technological Data Structures
Series (Serial Number):Studies in Computational Intelligence (370)
Publisher:Springer
Place of publication:Berlin/Heidelberg
Editor:Alexander Mehler, Kai-Uwe Kühnberger, Henning Lobin, Harald Lüngen, Angelika Storrer, Andreas Witt
Document Type:Part of a Book
Language:English
Year of first Publication:2011
Date of Publication (online):2016/05/02
Publicationstate:Postprint
Reviewstate:(Publisher's) editorial review
Tag:Parsing
GND Keyword:Computational linguistics; Text analysis
First Page:35
Last Page:58
Note:
The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-642-22613-7_3
DDC classes:400 Language / 410 Linguistics
Open Access?:yes
Leibniz-Classification:Language, Linguistics
Linguistics-Classification:Computational linguistics
Licence (German):Creative Commons - Attribution-NonCommercial-NoDerivs 3.0 Germany