Still no evidence for an effect of the proportion of non-native speakers on natural language complexity
- In a recent study, I demonstrated that large numbers of L2 (second language) speakers do not appear to influence the morphological or information-theoretic complexity of natural languages. This paper has three primary aims: First, I address recent criticisms of my analyses, showing that the points raised by my critics were already explicitly considered and analysed in my original work. Furthermore, I show that the proposed alternative analyses fail to withstand detailed examination. Second, I introduce new data on the information-theoretic complexity of natural languages, with the estimates derived from various language models—ranging from simple statistical models to advanced neural networks—based on a database of 40 multilingual text collections that represent a wide range of text types. Third, I re-analyse the information-theoretic and morphological complexity data using novel methods that better account for model uncertainty in parameter estimation, as well as the genealogical relatedness and geographic proximity of languages. In line with my earlier findings, the results show no evidence that large numbers of L2 speakers have an effect on natural language complexity.
Author: | Alexander KoplenigORCiDGND |
---|---|
URN: | urn:nbn:de:bsz:mh39-129111 |
DOI: | https://doi.org/10.3390/e26110993 |
ISSN: | 1099-4300 |
Parent Title (English): | Entropy |
Publisher: | MDPI |
Place of publication: | Basel |
Document Type: | Article |
Language: | English |
Year of first Publication: | 2024 |
Date of Publication (online): | 2024/11/22 |
Publishing Institution: | Leibniz-Institut für Deutsche Sprache (IDS) |
Publicationstate: | Veröffentlichungsversion |
Reviewstate: | Peer-Review |
Tag: | language complexity; language models; language typology; linguistic niche hypothesis; non-native speakers; quantitative linguistics |
GND Keyword: | Computerlinguistik; Natürliche Sprache; Non-native speaker; Sprachstatistik; Sprachtypologie; Statistische Analyse |
Volume: | 26 |
Issue: | 11 |
Article Number: | 993 |
Page Number: | 26 |
DDC classes: | 400 Sprache / 400 Sprache, Linguistik |
Open Access?: | ja |
Linguistics-Classification: | Computerlinguistik |
Linguistics-Classification: | Quantitative Linguistik |
Linguistics-Classification: | Sprachtypologie |
Program areas: | Lexik |
Licence (English): | Creative Commons - Attribution 4.0 International |