TY - CHAP U1 - Konferenzveröffentlichung A1 - Knight, Dawn A1 - Fitzpatrick, Tess A1 - Morris, Steve A1 - Evas, Jeremy A1 - Rayson, Paul A1 - Spasić, Irena A1 - Stonelake, Mark A1 - Thomas, Enlli Môn A1 - Neale, Steven A1 - Needs, Jennifer A1 - Piao, Scott A1 - Rees, Mair A1 - Watkins, Gareth A1 - Anthony, Laurence A1 - Cobb, Thomas Michael A1 - Deuchar, Margaret A1 - Donnelly, Kevin A1 - McCarthy, Michael A1 - Scannell, Kevin ED - Bański, Piotr ED - Kupietz, Marc ED - Lüngen, Harald ED - Rayson, Paul ED - Biber, Hanno ED - Breiteneder, Evelyn ED - Clematide, Simon ED - Mariani, John ED - Stevenson, Mark ED - Sick, Theresa T1 - Creating CorCenCC (Corpws Cenedlaethol Cymraeg Cyfoes - The National Corpus of Contemporary Welsh) T2 - Proceedings of the Workshop on Challenges in the Management of Large Corpora and Big Data and Natural Language Processing (CMLC-5+BigNLP) 2017 including the papers from the Web-as-Corpus (WAC-XI) guest section. Birmingham, 24 July 2017 N2 - CorCenCC is an interdisciplinary and multiinstitutional project that is creating a large-scale, open-source corpus of contemporary Welsh. CorCenCC will be the first ever large-scale corpus to represent spoken, written and electronicallymediated Welsh (compiling an initial data set of 10 million Welsh words), with a functional design informed, from the outset, by representatives of all anticipated academic and community user groups. KW - Korpus KW - Kymrisch KW - Walisisch KW - Corpus linguistics KW - National corpus KW - Welsh Y1 - 2017 U6 - https://nbn-resolving.org/urn:nbn:de:bsz:mh39-62578 UN - https://nbn-resolving.org/urn:nbn:de:bsz:mh39-62578 SP - 13 EP - 14 S1 - 2 PB - Institut für Deutsche Sprache CY - Mannheim ER -