Volltext-Downloads (blau) und Frontdoor-Views (grau)

OWL ontologies as a resource for discourse parsing

  • In the project SemDok (Generic document structures in linearly organised texts) funded by the German Research Foundation DFG, a discourse parser for a complex type (scientific articles by example), is being developed. Discourse parsing (henceforth DP) according to the Rhetorical Structure Theory (RST) (Mann and Taboada, 2005; Marcu, 2000) deals with automatically assigning a text a tree structure in which discourse segments and rhetorical relations between them are marked, such as Concession. For identifying the combinable segments, declarative rules are employed, which describe linguistic and structural cues and constraints about possible combinations by referring to different XML annotation layers of the input text, and external knowledge bases such as a discourse marker lexicon, a lexico-semantic ontology (later to be combined with a domain ontology), and an ontology of rhetorical relations. In our text-technological environment, the obvious choice of formalism to represent such ontologies is OWL (Smith et al., 2004). In this paper, we describe two OWL ontologies and how they are consulted from the discourse parser to solve certain tasks within DP. The first ontology is a taxononomy of rhetorical relations which was developed in the project. The second one is an OWL version of GermaNet, the model of which we designed together with our project partners.

Export metadata

Additional Services

Share in Twitter Search Google Scholar


Author:Maja Bärenfänger, Mirco Hilbert, Henning LobinGND, Harald LüngenGND
Parent Title (English):LDV-Forum - GLDV-Journal for Computational 
Linguistics and Language Technology
Publisher:Gesellschaft für Linguistische Datenverarbeitung
Place of publication:Bonn
Document Type:Article
Year of first Publication:2008
Date of Publication (online):2018/07/04
GND Keyword:Ontologie <Wissensverarbeitung>; Parser; Strukturbaum; Textstruktur
First Page:17
Last Page:26
DDC classes:400 Sprache / 430 Deutsch
Open Access?:ja
Licence (German):License LogoCreative Commons - CC BY-NC-ND - Namensnennung - Nicht kommerziell - Keine Bearbeitungen 4.0 International