Hamburg Studies on Multilingualism
Refine
Year of publication
- 2012 (1)
Document Type
- Part of a Book (1)
Language
- English (1)
Has Fulltext
- yes (1)
Is part of the Bibliography
- yes (1)
Keywords
- Annotation (1)
- Gesprochene Sprache (1)
- Korpus <Linguistik> (1)
- Transkription (1)
Publicationstate
- Postprint (1)
Reviewstate
Publisher
- Benjamins (1)
14
This article discusses questions concerning the creation, annotation and sharing of spoken language corpora. We use the Hamburg Map Task Corpus (HAMATAC), a small corpus in which advanced learners of German were recorded solving a map task, as an example to illustrate our main points. We first give an overview of the corpus creation and annotation process including recording, metadata documentation, transcription and semi-automatic annotation of the data. We then discuss the manual annotation of disfluencies as an example case in which many of the typical and challenging problems for data reuse – in particular the reliability of interpretative annotations – are revealed.