TY - JOUR U1 - Zeitschriftenartikel, wissenschaftlich - begutachtet (reviewed) A1 - Schmidt, Thomas ED - Kupietz, Marc ED - Geyken, Alexander T1 - Construction and dissemination of a corpus of spoken interaction - tools and workflows in the FOLK project JF - Journal for language technology and computational linguistics (JLCL) N2 - This paper is about the workflow for construction and dissemination of FOLK (Forschungs - und Lehrkorpus Gesprochenes Deutsch – Research and Teaching Corpus of Spoken German), a large corpus of authentic spoken interaction data, recorded on audio and video. Section 2 describes in detail the tools used in the individual steps of transcription, anonymization, orthographic normalization, lemmatization and POS tagging of the data, as well as some utilities used for corpus management. Section 3 deals with the DGD (Datenbank für Gesprochenes Deutsch - Database of Spoken German) as a tool for distributing completed data sets and making them available for qualitative and quantitative analysis. In section 4, some plans for further development are sketched. KW - Gesprochene Sprache KW - Korpus KW - Deutsch KW - Datenbank Y1 - 2016 U6 - https://nbn-resolving.org/urn:nbn:de:bsz:mh39-62156 UN - https://nbn-resolving.org/urn:nbn:de:bsz:mh39-62156 UR - http://www.jlcl.org/2016_Heft1/jlcl-2016-1-7Schmidt.pdf SN - 2190-6858 SS - 2190-6858 VL - 31 IS - 1 SP - 127 EP - 154 S1 - 28 CY - Berlin ER -