This article discusses questions concerning the creation, annotation and sharing of spoken language corpora. We use the Hamburg Map Task Corpus (HAMATAC), a small corpus in which advanced learners of German were recorded solving a map task, as an example to illustrate our main points. We first give an overview of the corpus creation and annotation process, including recording, metadata documentation, transcription and semi-automatic annotation of the data. We then discuss the manual annotation of disfluencies as an example case that reveals many of the typical and challenging problems for data reuse, in particular the reliability of interpretative annotations.