Text, Speech and Language Technology
Refine
Year of publication
- 2010 (1)
Document Type
- Part of a Book (1)
Language
- English (1)
Has Fulltext
- yes (1)
Is part of the Bibliography
- no (1)
Keywords
- Discourse parsing (1)
- Discourse relations (1)
- Document structure (1)
- Linguistic annotations (1)
- Text technology (1)
- XML (1)
Publicationstate
- Postprint (1)
Publisher
- Springer (1)
41
This chapter addresses the requirements and linguistic foundations of automatic relational discourse analysis of complex text types such as scientific journal articles. It is argued that besides lexical and grammatical discourse markers, which have traditionally been employed in discourse parsing, cues derived from the logical and generical document structure and the thematic structure of a text must be taken into account. An approach to modelling such types of linguistic information in terms of XML-based multi-layer annotations and to a text-technological representation of additional knowledge sources is presented. By means of quantitative and qualitative corpus analyses, cues and constraints for automatic discourse analysis can be derived. Furthermore, the proposed representations are used as the input sources for discourse parsing. A short overview of the projected parsing architecture is given.