TY - CHAP U1 - Konferenzveröffentlichung A1 - Buschjäger, Sebastian A1 - Pfahler, Lukas A1 - Morik, Katharina ED - Bański, Piotr ED - Biber, Hanno ED - Breiteneder, Evelyn ED - Kupietz, Marc ED - Lüngen, Harald ED - Witt, Andreas T1 - Discovering Subtle Word Relations in Large German Corpora T2 - Proceedings of the 3rd Workshop on Challenges in the Management of Large Corpora (CMLC-3), Lancaster, 20 July 2015 N2 - With an increasing amount of text data available it is possible to automatically extract a variety of information about language. One way to obtain knowledge about subtle relations and analogies between words is to observe words which are used in the same context. Recently, Mikolov et al. proposed a method to efficiently compute Euclidean word representations which seem to capture subtle relations and analogies between words in the English language. We demonstrate that this method also captures analogies in the German language. Furthermore, we show that we can transfer information extracted from large non-annotated corpora into small annotated corpora, which are then, in turn, used for training NLP systems. KW - Korpus KW - Datenbanksystem KW - Annotation KW - Large corpora KW - National corpus KW - Corpus technology KW - Corpus management KW - Corpus annotation KW - Corpus linguistics Y1 - 2015 U6 - https://nbn-resolving.org/urn:nbn:de:bsz:mh39-38317 UN - https://nbn-resolving.org/urn:nbn:de:bsz:mh39-38317 SP - 11 EP - 14 PB - Institut für Deutsche Sprache CY - Mannheim ER -