TY - JOUR U1 - Zeitschriftenartikel, wissenschaftlich - begutachtet (reviewed) A1 - Koplenig, Alexander T1 - The Impact of Lacking Metadata for the Measurement of Cultural and Linguistic Change Using the Google Ngram Data Sets—Reconstructing the Composition of the German Corpus in Times of WWII JF - Digital Scholarship in the Humanities N2 - The Google Ngram Corpora seem to offer a unique opportunity to study linguistic and cultural change in quantitative terms. To avoid breaking any copyright laws, the data sets are not accompanied by any metadata regarding the texts the corpora consist of. Some of the consequences of this strategy are analyzed in this article. I chose the example of measuring censorship in Nazi Germany, which received widespread attention and was published in a paper that accompanied the release of the Google Ngram data (Michel et al. (2010): Quantitative analysis of culture using millions of digitized books. Science, 331(6014): 176–82). I show that without proper metadata, it is unclear whether the results actually reflect any kind of censorship at all. Collectively, the findings imply that observed changes in this period of time can only be linked directly to World War II to a certain extent. Therefore, instead of speaking about general linguistic or cultural change, it seems to be preferable to explicitly restrict the results to linguistic or cultural change ‘as it is represented in the Google Ngram data’. On a more general level, the analysis demonstrates the importance of metadata, the availability of which is not just a nice add-on, but a powerful source of information for the digital humanities. KW - Sprachwandel KW - Sprachstatistik KW - Metadaten KW - Kulturwandel KW - Korpus KW - Datenstruktur Y1 - 2017 UN - https://nbn-resolving.org/urn:nbn:de:bsz:mh39-49493 U6 - https://doi.org/10.1093/llc/fqv037 DO - https://doi.org/10.1093/llc/fqv037 N1 - Preprint is published under http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bsz:mh39-31557 Advance Access published September, 12, 2015 VL - 32 IS - 1 SP - 169 EP - 188 PB - Oxford University Press (OUP) CY - Oxford ER -