This article discusses questions concerning the creation, annotation and sharing of spoken language corpora. We use the Hamburg Map Task Corpus (HAMATAC), a small corpus in which advanced learners of German were recorded solving a map task, as an example to illustrate our main points. We first give an overview of the corpus creation and annotation process, including recording, metadata documentation, transcription and semi-automatic annotation of the data. We then discuss the manual annotation of disfluencies as an example case that reveals many of the typical and challenging problems for data reuse, in particular the reliability of interpretative annotations.