Lexical semantic change discovery
- While there is a large amount of research in the field of Lexical Semantic Change Detection, only few approaches go beyond a standard benchmark evaluation of existing models. In this paper, we propose a shift of focus from change detection to change discovery, i.e., discovering novel word senses over time from the full corpus vocabulary. By heavily fine-tuning a type-based and a token-based approach on recently published German data, we demonstrate that both models can successfully be applied to discover new words undergoing meaning change. Furthermore, we provide an almost fully automated framework for both evaluation and discovery.
Author: | Sinan Kurtyigit, Maike ParkORCiD, Dominik Schlechtweg, Jonas KuhnGND, Sabine Schulte im WaldeORCiDGND |
---|---|
URN: | urn:nbn:de:bsz:mh39-106930 |
DOI: | https://doi.org/10.18653/v1/2021.acl-long.543 |
ISBN: | 978-1-954085-52-7 |
Parent Title (English): | Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers) |
Publisher: | Association for Computational Linguistics |
Place of publication: | Stroudsburg |
Editor: | Chengqing Zong, Fei Xia, Wenjie Li, Roberto Navigli |
Document Type: | Conference Proceeding |
Language: | English |
Year of first Publication: | 2021 |
Date of Publication (online): | 2021/09/24 |
Publishing Institution: | Leibniz-Institut für Deutsche Sprache (IDS) |
Publicationstate: | Veröffentlichungsversion |
Reviewstate: | Peer-Review |
GND Keyword: | Deutsch; Korpus <Linguistik>; Semantik; Semasiologie; Sprachwandel; Wortschatz |
First Page: | 6985 |
Last Page: | 6998 |
DDC classes: | 400 Sprache / 400 Sprache, Linguistik |
Open Access?: | ja |
Leibniz-Classification: | Sprache, Linguistik |
Linguistics-Classification: | Lexikografie |
Linguistics-Classification: | Lexikologie / Etymologie |
Linguistics-Classification: | Semantik |
Program areas: | L1: Lexikographie und Sprachdokumentation |
Licence (English): | ![]() |