TY - JOUR U1 - Zeitschriftenartikel, wissenschaftlich - nicht begutachtet (unreviewed) A1 - Koplenig, Alexander T1 - The impact of lacking metadata and data truncation for the measurement of cultural and linguistic change using the Google Ngram datasets N2 - As a result of legal restrictions the Google Ngram Corpora datasets are a) not accompanied by any metadata regarding the texts the corpora consist of and the data are b) truncated to prevent an indirect conclusion from the n-gram to the author of the text. Some of the consequences of this strategy are discussed in this article. KW - Sprachwandel KW - Kulturwandel KW - Sprachstatistik KW - Korpus KW - Datenstruktur KW - Metadaten KW - N-Gramm Y1 - 2014 U6 - https://nbn-resolving.org/urn:nbn:de:bsz:mh39-31557 UN - https://nbn-resolving.org/urn:nbn:de:bsz:mh39-31557 N1 - An updated version of this paper entitled: "The impact of lacking metadata for the measurement of cultural and linguistic change using the Google Ngram datasets – reconstructing the composition of the German corpus in times of WWII" is accepted for publication in the the journal "Digital Scholarship in the Humanities" (http://dsh.oxfordjournals.org/content/early/2015/09/02/llc.fqv037). SP - 28 S., 2 Anhänge S1 - 28 S., 2 Anhänge PB - Institut für Deutsche Sprache CY - Mannheim ER -