Description and acquisition of multiword lexemes
- This paper deals with multiword lexemes (MWLs), focussing on two types of verbal MWLs: verbal idioms and support verb constructions. We discuss the characteristic properties of MWLs, namely nonstandard compositionality, restricted substitutability of components, and restricted morpho-syntactic flexibility, and we show how these properties may cause serious problems during the analysis, generation, and transfer steps of machine translation systems. In order to cope with these problems, MT lexicons need to provide detailed descriptions of MWL properties. We list the types of information which we consider the necessary minimum for a successful processing of MWLs, and report on some feasibility studies aimed at the automatic extraction of German verbal multiword lexemes from text corpora and machine-readable dictionaries.
Author: | Ulrike Schwall, Angelika Storrer |
---|---|
URN: | urn:nbn:de:bsz:mh39-66273 |
URL: | https://link.springer.com/content/pdf/10.1007%2F3-540-59040-4_19.pdf |
DOI: | https://doi.org/10.1007/3-540-59040-4_19 |
ISBN: | 978-3-540-49174-3 |
Parent Title (English): | Machine translation and the lexicon. Proceedings of the Third International EAMT Workshop, Heidelberg, Germany, April 26-28, 1993 |
Series (Serial Number): | Lecture Notes in Computer Science (898) |
Publisher: | Springer |
Place of publication: | Berlin [u.a.] |
Editor: | Petra Steffens |
Document Type: | Part of a Book |
Language: | English |
Year of first Publication: | 1995 |
Date of Publication (online): | 2017/10/27 |
Publicationstate: | Postprint |
Reviewstate: | (Verlags)-Lektorat |
GND Keyword: | Computerunterstützte Lexikografie; Korpus <Linguistik>; Lexem; Maschinelle Sprachverarbeitung; Maschinelle Übersetzung; Spracherkennung |
First Page: | 35 |
Last Page: | 50 |
DDC classes: | 400 Sprache / 400 Sprache, Linguistik |
Open Access?: | ja |
BDSL-Classification: | Lexikographie, Wörterbücher |
Leibniz-Classification: | Sprache, Linguistik |
Linguistics-Classification: | Computerlinguistik |
Licence (German): | ![]() |