Corpus-based identification and disambiguation of reading indicators for German nominalizations
- Corpus data is often structurally and lexically ambiguous; corpus extraction methodologies thus must be made aware of ambiguities. Therefore, given an extraction task, all relevant ambiguities must be identified. To resolve these ambiguities, contextual data responsible for one or another reading is to be considered. In the context of our present work, German -ung-nominalizations and their sortal readings are under examination. A number of these nominalizations may be read as an event or a result, depending on the semantic group they belong to. Here, we concentrate on nominalizations of verbs of saying (henceforth: "verba dicendi"), identify their context partners and their influence on the sortal reading of the nominalizations in question. We present a tool which calculates the sortal reading of such nominalizations and thus may improve not only corpus extraction, but also e.g. machine translation. Lastly, we describe successful attempts to identify the correct sortal reading, conclusions and future work.
Author: | Kurt EberleGND, Gertrud FaaßORCiD, Ulrich HeidORCiDGND |
---|---|
URN: | urn:nbn:de:bsz:mh39-117800 |
URL: | https://ucrel.lancs.ac.uk/publications/cl2009/315_FullPaper.doc |
Parent Title (English): | Proceedings of the Corpus Linguistics Conference 2009, Liverpool |
Publisher: | University of Liverpool |
Place of publication: | Liverpool |
Editor: | Michaela Mahlberg, Victorina González-Díaz, Catherine Smith |
Document Type: | Conference Proceeding |
Language: | English |
Year of first Publication: | 2010 |
Date of Publication (online): | 2023/05/25 |
Publishing Institution: | Leibniz-Institut für Deutsche Sprache (IDS) |
Publicationstate: | Veröffentlichungsversion |
Reviewstate: | Peer-Review |
GND Keyword: | Ambiguität; Deutsch; Implementation; Indikator; Korpus <Linguistik>; Nominalisierung |
Page Number: | 13 |
DDC classes: | 400 Sprache / 400 Sprache, Linguistik |
Open Access?: | ja |
Linguistics-Classification: | Korpuslinguistik |
Licence (German): | Urheberrechtlich geschützt |