Extending the STTS for the Annotation of Spoken Language
- This paper presents an extension to the Stuttgart-Tübingen TagSet, the standard part-of-speech tag set for German, for the annotation of spoken language. The additional tags deal with hesitations, backchannel signals, interruptions, onomatopoeia and uninterpretable material. They allow one to capture phenomena specific to spoken language while, at the same time, preserving inter-operability with already existing corpora of written language.
Author: | Ines Rehbein, Sören Schalowski |
---|---|
URN: | urn:nbn:de:bsz:mh39-56026 |
URL: | http://www.oegai.at/konvens2012/proceedings.shtml |
ISBN: | 3-85027-005-X |
Parent Title (English): | Proceedings of the 11th Edition of the Conference on Natural Language Processing (KONVENS). Vienna, September 19-21, 2012. |
Series (Serial Number): | Schriftenreihe der Österreichischen Gesellschaft für Artificial Intelligence (ÖGAI) (5) |
Publisher: | Eigenverlag ÖGAI |
Place of publication: | Wien |
Editor: | Jeremy Jancsary |
Document Type: | Conference Proceeding |
Language: | English |
Year of first Publication: | 2012 |
Date of Publication (online): | 2016/11/21 |
Publicationstate: | Veröffentlichungsversion |
Reviewstate: | (Verlags)-Lektorat |
GND Keyword: | Annotation; Automatische Sprachanalyse; Gesprochene Sprache; Interoperabilität; Korpus <Linguistik> |
First Page: | 238 |
Last Page: | 242 |
DDC classes: | 400 Sprache / 400 Sprache, Linguistik |
Open Access?: | ja |
Linguistics-Classification: | Korpuslinguistik |
Licence (German): | ![]() |