A corpus-assisted approach to paronym categorisation

In this paper, we present a first attempt to classify commonly confused words in German by examining their communicative functions in corpora. Although so-called paronyms frequently cause uncertainty because of similarities in spelling, sound and semantics, the phenomenon has so far attracted little attention from either corpus linguistics or cognitive linguistics. Existing investigations rely on structuralist models, which do not account for empirical evidence. Still, they have developed an elaborate model based on formal criteria, primarily word formation (cf. Lăzărescu 1999). From a corpus perspective, such classifications are incompatible with language in use and with the cognitive aspects of misuse. This article sketches first lexicological insights into a classification model derived from semantic analyses of written communication. First, the project is briefly described. Second, corpus-assisted paronym detection is discussed. Third, the main section describes the datasets used for paronym classification and the classification procedures. As this is work in progress, the insights will be extended continually once spoken and CMC data are added to the investigations.

Author:Ruth Maria Mell, Petra Storjohann
Parent Title (English):Electronic lexicography in the 21st century. Proceedings of eLex 2017 conference. Leiden, the Netherlands, 19 – 21 September 2017
Publisher:Lexical Computing CZ s.r.o.
Place of publication:Brno, Czech Republic
Editor:Iztok Kosem, Carole Tiberius, Miloš Jakubíček, Jelena Kallas, Simon Krek, Vít Baisa
Document Type:Conference Proceeding
Year of first Publication:2017
Date of Publication (online):2017/09/19
Tag:categorisation; commonly confused words; e-dictionary; paronyms; semantic classification
GND Keyword:Computerunterstützte Lexikografie; Deutsch; Korpus <Linguistik>; Online-Wörterbuch; Paronym; Semantische Analyse
First Page:342
Last Page:354
DDC classes:400 Language / 430 German
Open Access?:yes
Leibniz-Classification:Language, Linguistics
Program areas:Lexik
Licence (English):Creative Commons - Attribution-ShareAlike 4.0 International