OWL ontologies as a resource for discourse parsing
- In the project SemDok (Generic document structures in linearly organised texts) funded by the German Research Foundation DFG, a discourse parser for a complex type (scientific articles by example), is being developed. Discourse parsing (henceforth DP) according to the Rhetorical Structure Theory (RST) (Mann and Taboada, 2005; Marcu, 2000) deals with automatically assigning a text a tree structure in which discourse segments and rhetorical relations between them are marked, such as Concession. For identifying the combinable segments, declarative rules are employed, which describe linguistic and structural cues and constraints about possible combinations by referring to different XML annotation layers of the input text, and external knowledge bases such as a discourse marker lexicon, a lexico-semantic ontology (later to be combined with a domain ontology), and an ontology of rhetorical relations. In our text-technological environment, the obvious choice of formalism to represent such ontologies is OWL (Smith et al., 2004). In this paper, we describe two OWL ontologies and how they are consulted from the discourse parser to solve certain tasks within DP. The first ontology is a taxononomy of rhetorical relations which was developed in the project. The second one is an OWL version of GermaNet, the model of which we designed together with our project partners.
Author: | Maja Bärenfänger, Mirco Hilbert, Henning LobinORCiDGND, Harald LüngenGND |
---|---|
URN: | urn:nbn:de:bsz:mh39-76105 |
URL: | http://www.jlcl.org/2008_Heft1/LDV_Forum_23_(1).pdf |
ISSN: | 0175-1336 |
Parent Title (English): | LDV-Forum - GLDV-Journal for Computational Linguistics and Language Technology |
Publisher: | Gesellschaft für Linguistische Datenverarbeitung |
Place of publication: | Bonn |
Document Type: | Article |
Language: | English |
Year of first Publication: | 2008 |
Date of Publication (online): | 2018/07/04 |
Publicationstate: | Zweitveröffentlichung |
Reviewstate: | (Verlags)-Lektorat |
GND Keyword: | Ontologie <Wissensverarbeitung>; Parser; Strukturbaum; Textstruktur |
Volume: | 23 |
Issue: | 1 |
First Page: | 17 |
Last Page: | 26 |
DDC classes: | 400 Sprache / 430 Deutsch |
Open Access?: | ja |
Linguistics-Classification: | Computerlinguistik |
Licence (German): | ![]() |