A non-parametric significance test to compare corpora
- Classical null hypothesis significance tests are not appropriate in corpus linguistics, because the randomness assumption underlying these testing procedures is not fulfilled. Nevertheless, there are numerous scenarios where it would be beneficial to have some kind of test in order to judge the relevance of a result (e.g. a difference between two corpora) by answering the question whether the attribute of interest is pronounced enough to warrant the conclusion that it is substantial and not due to chance. In this paper, I outline such a test.
Author: | Alexander KoplenigORCiDGND |
---|---|
URN: | urn:nbn:de:bsz:mh39-92494 |
DOI: | https://doi.org/10.1371/journal.pone.0222703 |
ISSN: | 1932-6203 |
Parent Title (English): | PLoS One |
Publisher: | PLOS |
Place of publication: | San Francisco, California, USA |
Document Type: | Article |
Language: | English |
Year of first Publication: | 2019 |
Date of Publication (online): | 2019/09/20 |
Publicationstate: | Veröffentlichungsversion |
Reviewstate: | Peer-Review |
Tag: | corpus linguistic methodology; null-hypothesis testing; quantitative approaches |
GND Keyword: | Korpus <Linguistik>; Sprachstatistik; Statistischer Test |
Volume: | 14 |
Issue: | 9 |
First Page: | 1 |
Last Page: | 18 |
Note: | The publication of this article was partially funded by the Open Access Fund of the Leibniz Association |
DDC classes: | 400 Sprache / 400 Sprache, Linguistik |
Open Access?: | ja |
Leibniz-Classification: | Sprache, Linguistik |
Program areas: | Lexik |
Licence (German): | ![]() |