OPUS 4 | Search

Refine

Has Fulltext

yes (3)

3 search hits

1 to 3

Sort by

Technological and methodological challenges in creating, annotating and sharing a learner corpus of spoken German (2012)

Hedeland, Hanna ; Schmidt, Thomas

This article discusses questions concerning the creation, annotation and sharing of spoken language corpora. We use the Hamburg Map Task Corpus (HAMATAC), a small corpus in which advanced learners of German were recorded solving a map task, as an example to illustrate our main points. We first give an overview of the corpus creation and annotation process including recording, metadata documentation, transcription and semi-automatic annotation of the data. We then discuss the manual annotation of disfluencies as an example case in which many of the typical and challenging problems for data reuse – in particular the reliability of interpretative annotations – are revealed.

Vorhersage von Fugenelementen in nominalen Komposita (2012)

Bubenhofer, Noah ; Hein, Katrin ; Brinckmann, Caren

Ein korpusbasiertes Beschreibungsmodell für die elektronische Sprichwortlexikografie (2012)

Steyer, Kathrin ; Ďurčo, Peter