Co-reference annotation and resources: a multilingual corpus of typologically diverse languages
- This article introduces a dialogue corpus containing data from two typologically different languages, Japanese and Kilivila. The corpus is annotated in accordance with language specific annotation schemes for co-referential and similar relations. The article describes the corpus data, the properties of language specific co-reference in the two languages and a methodology for its annotation. Examples from the corpus show how this methodology is used in the workflow of the annotation process.
Author: | Felix Sasaki, Claudia Wegener, Andreas WittORCiDGND, Dieter Metzing, Jens Pönninghaus |
---|---|
URN: | urn:nbn:de:bsz:mh39-45529 |
URL: | http://www.lrec-conf.org/proceedings/lrec2002/ |
Parent Title (English): | Proceedings of the 3rd International Conference on Language Resources and Evaluation (LREC-2002). Las Palmas, Gran Canaria |
Publisher: | European Language Resources Association (ELRA) |
Place of publication: | Paris |
Document Type: | Conference Proceeding |
Language: | English |
Year of first Publication: | 2002 |
Date of Publication (online): | 2016/01/11 |
Publicationstate: | Veröffentlichungsversion |
Reviewstate: | (Verlags)-Lektorat |
Tag: | Coreference; Interrelated document grammars; Multilingual corpus; Multiple annotations |
First Page: | 1225 |
Last Page: | 1230 |
DDC classes: | 400 Sprache / 410 Linguistik |
Open Access?: | ja |
Linguistics-Classification: | Korpuslinguistik |
Licence (German): | Urheberrechtlich geschützt |