Refine
Year of publication
- 2014 (2) (remove)
Document Type
- Preprint (2) (remove)
Is part of the Bibliography
- yes (2)
Keywords
- Datenstruktur (1)
- Korpus <Linguistik> (1)
- Kulturwandel (1)
- Metadaten (1)
- N-Gramm (1)
- Sprachstatistik (1)
- Sprachwandel (1)
- ado file (1)
- maximum likelihood (1)
- zipf (1)
Publisher
As a result of legal restrictions the Google Ngram Corpora datasets are a) not accompanied by any metadata regarding the texts the corpora consist of and the data are b) truncated to prevent an indirect conclusion from the n-gram to the author of the text. Some of the consequences of this strategy are discussed in this article.