A corpus-assisted approach to paronym categorisation
- In this paper, we will present a first attempt to classify commonly confused words in German by consulting their communicative functions in corpora. Although the use of so-called paronyms causes frequent uncertainties due to similarities in spelling, sound and semantics, up until now the phenomenon has attracted little attention either from the perspective of corpus linguistics or from cognitive linguistics. Existing investigations rely on structuralist models, which do not account for empirical evidence. Still, they have developed an elaborate model based on formal criteria, primarily on word formation (cf. Lăzărescu 1999). Looking from a corpus perspective, such classifications are incompatible with language in use and cognitive elements of misuse. This article sketches first lexicological insights into a classification model as derived from semantic analyses of written communication. Firstly, a brief description of the project will be provided. Secondly, corpus-assisted paronym detection will be focused. Thirdly, in the main section the paper concerns the description of the datasets for paronym classification and the classification procedures. As a work in progress, new insights will continually be extended once spoken and CMC data are added to the investigations.
Author: | Ruth Maria MellGND, Petra StorjohannGND |
---|---|
URN: | urn:nbn:de:bsz:mh39-64256 |
ISSN: | 2533-5626 |
Parent Title (English): | Electronic lexicography in the 21st century. Proceedings of eLex 2017 conference. Leiden, the Netherlands, 19 – 21 September 2017 |
Publisher: | Lexical Computing CZ s.r.o. |
Place of publication: | Brno, Czech Republic |
Editor: | Iztok Kosem, Carole Tiberius, Miloš Jakubíček, Jelena Kallas, Simon Krek, Vít Baisa |
Document Type: | Conference Proceeding |
Language: | English |
Year of first Publication: | 2017 |
Date of Publication (online): | 2017/09/19 |
Publicationstate: | Veröffentlichungsversion |
Reviewstate: | Peer-Review |
Tag: | categorisation; commonly confused words; e-dictionary; paronyms; semantic classification |
GND Keyword: | Computerunterstützte Lexikografie; Deutsch; Korpus <Linguistik>; Online-Wörterbuch; Paronym; Semantische Analyse |
First Page: | 342 |
Last Page: | 354 |
DDC classes: | 400 Sprache / 430 Deutsch |
Open Access?: | ja |
Leibniz-Classification: | Sprache, Linguistik |
Program areas: | Lexik |
Licence (English): | Creative Commons - Attribution-ShareAlike 4.0 International |