TY  - CHAP
U1  - Book chapter
A1  - Lüngen, Harald
A1  - Bärenfänger, Maja
A1  - Hilbert, Mirco
A1  - Lobin, Henning
A1  - Puskás, Csilla
ED  - Witt, Andreas
ED  - Metzing, Dieter
T1  - Discourse Relations and Document Structure
T2  - Linguistic Modeling of Information and Markup Languages. Contributions to Language Technology
N2  - This chapter addresses the requirements and linguistic foundations of automatic relational discourse analysis of complex text types such as scientific journal articles. It is argued that besides lexical and grammatical discourse markers, which have traditionally been employed in discourse parsing, cues derived from the logical and generic document structure and the thematic structure of a text must be taken into account. An approach to modelling such types of linguistic information in terms of XML-based multi-layer annotations and to a text-technological representation of additional knowledge sources is presented. By means of quantitative and qualitative corpus analyses, cues and constraints for automatic discourse analysis can be derived. Furthermore, the proposed representations are used as the input sources for discourse parsing. A short overview of the projected parsing architecture is given.
T3  - Text, Speech and Language Technology - 41
KW  - Discourse parsing
KW  - Discourse relations
KW  - Document structure
KW  - Text technology
KW  - Linguistic annotations
KW  - XML
Y1  - 2010
U6  - https://nbn-resolving.org/urn:nbn:de:bsz:mh39-48005
UN  - https://nbn-resolving.org/urn:nbn:de:bsz:mh39-48005
SN  - 978-90-481-3330-7
SB  - 978-90-481-3330-7
N1  - The final publication is available at Springer via https://dx.doi.org/10.1007/978-90-481-3331-4
SP  - 97
EP  - 123
PB  - Springer
CY  - Dordrecht
ER  -