Schriftenreihe der Österreichischen Gesellschaft für Artificial Intelligence (ÖGAI)
Wien: Eigenverlag ÖGAI
5
This paper presents an extension to the Stuttgart-Tübingen TagSet, the standard part-of-speech tag set for German, for the annotation of spoken language. The additional tags cover hesitations, backchannel signals, interruptions, onomatopoeia and uninterpretable material. They make it possible to capture phenomena specific to spoken language while preserving interoperability with existing corpora of written language.
5
This paper takes a fresh look at computer-assisted transcription as it is commonly practised in discourse analysis and language acquisition studies. The first part proposes a bridge between discourse-analytical methodology and text-technological methods, with the concept of modelling as its central idea. The second part demonstrates the EXMARaLDA system, a set of formats and tools for computer-assisted transcription that builds on the ideas developed in the first part and implements them in a way that can significantly improve current research practice.