OPUS 4 | 430 Deutsch

430 Deutsch

430 Deutsch (130)
431 Schriftsysteme und Phonologie des Deutschen (1)
432 Etymologie des Deutschen (20)
433 Deutsche Wörterbücher (51)
435 Deutsche Grammatik (111)
437 Varianten des Deutschen (121)
438 Gebrauch des Standard-Deutsch (27)
439 Andere germanische Sprachen (40)

6 search hits

1 to 6

Sort by

Dokumentgrammatiken als Grundlage von XML-Tools (2004)

Lobin, Henning

OWL ontologies as a resource for discourse parsing (2008)

Bärenfänger, Maja ; Hilbert, Mirco ; Lobin, Henning ; Lüngen, Harald

In the project SemDok (Generic document structures in linearly organised texts) funded by the German Research Foundation DFG, a discourse parser for a complex type (scientific articles by example), is being developed. Discourse parsing (henceforth DP) according to the Rhetorical Structure Theory (RST) (Mann and Taboada, 2005; Marcu, 2000) deals with automatically assigning a text a tree structure in which discourse segments and rhetorical relations between them are marked, such as Concession. For identifying the combinable segments, declarative rules are employed, which describe linguistic and structural cues and constraints about possible combinations by referring to different XML annotation layers of the input text, and external knowledge bases such as a discourse marker lexicon, a lexico-semantic ontology (later to be combined with a domain ontology), and an ontology of rhetorical relations. In our text-technological environment, the obvious choice of formalism to represent such ontologies is OWL (Smith et al., 2004). In this paper, we describe two OWL ontologies and how they are consulted from the discourse parser to solve certain tasks within DP. The first ontology is a taxononomy of rhetorical relations which was developed in the project. The second one is an OWL version of GermaNet, the model of which we designed together with our project partners.

Erweiterte Dokumentgrammatiken als Grundlage innovativer XML-Tools (Extended Document Grammars as a Basis for Innovative XML Tools) (2003)

Lobin, Henning

Wohlgeformte XML-Dokumente lassen sich als Bäume interpretieren und diese wiederum durch Grammatiken beschreiben. Dokumentgrammatiken weisen einige Besonderheiten auf, die sie von Grammatiken für natürliche Sprachen oder Programmiersprachen unterscheidet. Dieser Beitrag erläutert die Verarbeitungsmöglichkeiten, die aus der Nutzung von formalen Dokumentgrammatiken erwachsen.

Textauszeichnungssprachen und Dokumentgrammatiken (2004)

Lobin, Henning

Using OWL ontologies in discourse parsing (2007)

Bärenfänger, Maja ; Hilbert, Mirco ; Lobin, Henning ; Lüngen, Harald

Interrelating Treebanks with Language-Specific Descriptions of Information Structure (2004)

Storbeck, Daniel ; Kwon, Sanghee ; Sasaki, Felix ; Witt, Andreas

The motivation for this article is to describe a methodology for interrelating and analyzing language and theory-specific corpus data from various languages. As an example phenomeon we use information structure (IS, see [3]) in treebanks from three languages: Spanish, Korean and Japanese. Korean and Japanese are typologically close, while both are typologically different from Spanish. Therefore, the problem of annotating IS is that there are diverging language-specific formal linguistic means for the realization of IS-functions (like “topicalization / contrast”) on various levels like prosody, morphology and word-order. Hence, it is necessary to describe the relations between language-specific formal means and functional views on IS, and how to operationalize these relations for corpus analysis.

1 to 6

Open Access

430 Deutsch

Refine

Author

Year of publication

Document Type

Language

Has Fulltext

Is part of the Bibliography

Keywords

Publicationstate

Reviewstate

Publisher

6 search hits