TY - CHAP U1 - Konferenzveröffentlichung A1 - Westpfahl, Swantje A1 - Schmidt, Thomas ED - Calzolari, Nicoletta ED - Choukri, Khalid ED - Declerck, Thierry ED - Goggi, Sara ED - Grobelnik, Marko ED - Maegaard, Bente ED - Mariani, Joseph ED - Mazo, Helene ED - Moreno, Asuncion ED - Odijk, Jan ED - Piperidis, Stelios T1 - FOLK-Gold ― A gold standard for part-of-speech-tagging of spoken German T2 - Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), Portorož, Slovenia N2 - In this paper, we present a GOLD standard of part-of-speech tagged transcripts of spoken German. The GOLD standard data consists of four annotation layers – transcription (modified orthography), normalization (standard orthography), lemmatization and POS tags – all of which have undergone careful manual quality control. It comes with guidelines for the manual POS annotation of transcripts of German spoken data and an extended version of the STTS (Stuttgart Tübingen Tagset) which accounts for phenomena typically found in spontaneous spoken German. The GOLD standard was developed on the basis of the Research and Teaching Corpus of Spoken German, FOLK, and is, to our knowledge, the first such dataset based on a wide variety of spontaneous and authentic interaction types. It can be used as a basis for further development of language technology and corpus linguistic applications for German spoken language. KW - German spoken language KW - GOLD standard KW - Deutsch KW - Gesprochene Sprache KW - Korpus KW - Part-of-Speech-Tagging = POS Y1 - 2016 U6 - https://nbn-resolving.org/urn:nbn:de:bsz:mh39-50786 UN - https://nbn-resolving.org/urn:nbn:de:bsz:mh39-50786 SN - 978-2-9517408-9-1 SB - 978-2-9517408-9-1 SP - 1493 EP - 1499 PB - European Language Resources Association (ELRA) CY - Paris ER -