Discourse segmentation of German written texts
- Discourse segmentation is the division of a text into minimal discourse segments, which form the leaves of the trees used to represent discourse structures. A definition of elementary discourse segments in German is provided by adapting widely used segmentation principles for English minimal units, while considering punctuation, morphology, syntax, and aspects of the logical document structure of a complex text type, namely scientific articles. The algorithm and implementation of a discourse segmenter based on these principles are presented, as well as an evaluation of test runs.
Author: | Harald Lüngen, Csilla Puskás, Maja Bärenfänger, Mirco Hilbert, Henning Lobin |
---|---|
URN: | urn:nbn:de:bsz:mh39-23 |
DOI: | https://doi.org/10.1007/11816508_26 |
ISBN: | 978-3-540-37334-6 |
Parent Title (English): | Advances in natural language processing. 5th International Conference on NLP, FinTAL 2006, Turku, Finland, August 23-25 |
Publisher: | Springer-Verlag |
Place of publication: | Berlin [u.a.] |
Editor: | Tapio Pahikkala, Sampo Pyysalo, Filip Ginter, Tapio Salakoski |
Document Type: | Part of a Book |
Language: | English |
Year of first Publication: | 2006 |
Publication state: | Postprint |
Review state: | (Publisher's) editorial review |
Tag: | Computational linguistics; Discourse analysis; Discourse annotation; Tag |
GND Keyword: | Annotation; Automatische Sprachanalyse; Computerlinguistik; Diskursanalyse |
Page Number: | 12 |
First Page: | 245 |
Last Page: | 256 |
Note: | The final publication is available at Springer via http://dx.doi.org/10.1007/11816508_26 |
DDC classes: | 400 Language / 410 Linguistics |
Open Access?: | yes |
Linguistics-Classification: | Corpus linguistics |
Licence (German): | Urheberrechtlich geschützt (protected by copyright) |