TY - CHAP U1 - Buchbeitrag A1 - Lüngen, Harald A1 - Puskás, Csilla A1 - Bärenfänger, Maja A1 - Hilbert, Mirco A1 - Lobin, Henning ED - Pahikkala, Tapio ED - Pyysalo, Sampo ED - Ginter, Filip ED - Salakoski, Tapio T1 - Discourse segmentation of German written texts T2 - Advance in natural language processing. 5th International Conference on NLP FinTAL 2006 Turku, Finnland, August 23-25 N2 - Discourse segmentation is the division of a text into minimal discourse segments, which form the leaves in the trees that are used to represent discourse structures. A definition of elementary discourse segments in German is provided by adapting widely used segmentation principles for English minimal units, while considering punctuation, morphology, sytax, and aspects of the logical document structure of a complex text type, namely scientific articles. The algorithm and implementation of a discourse segmenter based on these principles is presented, as well an evaluation of test runs. KW - Computerlinguistik KW - Diskursanalyse KW - Automatische Sprachanalyse KW - Computational linguistics KW - Discourse annotation KW - Tag KW - Annotation KW - Discourse analysis Y1 - 2006 UN - https://nbn-resolving.org/urn:nbn:de:bsz:mh39-23 SN - 978-3-540-37334-6 SB - 978-3-540-37334-6 U6 - https://doi.org/10.1007/11816508_26 DO - https://doi.org/10.1007/11816508_26 N1 - The final publication is available at Springer via http://dx.doi.org/10.1007/11816508_26 SP - 245 EP - 256 S1 - 12 PB - Springer-Verlag CY - Berlin [u.a.] ER -