OPUS 4 | Search

Refine

Has Fulltext

yes (4)

4 search hits

1 to 4

Sort by

Extending the STTS for the Annotation of Spoken Language (2012)

This paper presents an extension to the Stuttgart-Tübingen TagSet, the standard part-of-speech tag set for German, for the annotation of spoken language. The additional tags deal with hesitations, backchannel signals, interruptions, onomatopoeia and uninterpretable material. They allow one to capture phenomena specific to spoken language while, at the same time, preserving inter-operability with already existing corpora of written language.

STTS goes Kiez – Experiments on Annotating and Tagging Urban Youth Language (2013)

Rehbein, Ines ; Schalowski, Sören

The KiezDeutsch Korpus (KiDKo) Release 1.0 (2014)

Rehbein, Ines ; Schalowski, Sören ; Wiese, Heike

This paper presents the first release of the KiezDeutsch Korpus (KiDKo), a new language resource with multiparty spoken dialogues of Kiezdeutsch, a newly emerging language variety spoken by adolescents from multi-ethnic urban areas in Germany. The first release of the corpus includes the transcriptions of the data as well as a normalisation layer and part-of-speech annotations. In the paper, we describe the main features of the new resource and then focus on automatic POS tagging of informal spoken language. Our tagger achieves an accuracy of nearly 97% on KiDKo. While we did not succeed in further improving the tagger using ensemble tagging, we present our approach to using the tagger ensembles for identifying error patterns in the automatically tagged data.

Annotating Spoken Language (2014)

Rehbein, Ines ; Schalowski, Sören ; Wiese, Heike

1 to 4

Open Access

Refine

Author

Year of publication

Document Type

Language

Has Fulltext

Is part of the Bibliography

Keywords

Publicationstate

Reviewstate

Publisher

4 search hits