TY - CHAP U1 - Konferenzveröffentlichung A1 - Rehbein, Ines A1 - Schalowski, Sören A1 - Wiese, Heike ED - Calzolari, Nicoletta ED - Choukri, Khalid ED - Declerck, Thierry ED - Loftsson, Hrafn ED - Maegaard, Bente ED - Mariani, Joseph ED - Moreno, Asuncion ED - Odijk, Jan ED - Piperidis, Stelios T1 - The KiezDeutsch Korpus (KiDKo) Release 1.0 T2 - Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14). May 26-31, 2014. Harpa Concert Hall and Conference Center. Reykjavik, Iceland N2 - This paper presents the first release of the KiezDeutsch Korpus (KiDKo), a new language resource with multiparty spoken dialogues of Kiezdeutsch, a newly emerging language variety spoken by adolescents from multi-ethnic urban areas in Germany. The first release of the corpus includes the transcriptions of the data as well as a normalisation layer and part-of-speech annotations. In the paper, we describe the main features of the new resource and then focus on automatic POS tagging of informal spoken language. Our tagger achieves an accuracy of nearly 97% on KiDKo. While we did not succeed in further improving the tagger using ensemble tagging, we present our approach to using the tagger ensembles for identifying error patterns in the automatically tagged data. KW - spoken language corpora KW - urban youth language KW - Kiezdeutsch KW - Gesprochene Sprache KW - Stadtmundart KW - Jugendsprache KW - Multikulturelle Gesellschaft KW - Korpus Y1 - 2014 U6 - https://nbn-resolving.org/urn:nbn:de:bsz:mh39-55999 UN - https://nbn-resolving.org/urn:nbn:de:bsz:mh39-55999 UR - www.lrec-conf.org/proceedings/lrec2014/index.html SN - 978-2-9517408-8-4 SB - 978-2-9517408-8-4 SP - 3927 EP - 3934 PB - European Language Resources Association CY - Paris ER -