TY - CHAP U1 - Konferenzveröffentlichung A1 - Müller, Mark-Christoph A1 - Strube, Michael ED - González Rodríguez, Manuel ED - Suarez Araujo, Carmen Paz T1 - An API for discourse-level access to XML-encoded corpora T2 - Proceedings of the Third International Conference on Language Resources and Evaluation (LREC’02). May 29-31, 2002, Las Palmas, Canary Islands, Spain N2 - We describe a simple and efficient Java object model and application programming interface (API) for (possibly multi-modal) annotated natural language corpora. Corpora are represented as elements like Sentences, Turns, Utterances, Words, Gestures and Markables. The API allows linguists to access corpora in terms of these discourse-level elements, i.e. at a conceptual level they are familiar with, with the flexibility offered by a general purpose programming language. It is also a contribution to corpus standardization efforts because it is based on a straightforward and easily extensible data model which can serve as a target for conversion of different corpus formats. KW - corpus exploitation KW - standardization KW - discourse processing KW - XML KW - reusability KW - API KW - XML KW - Korpus KW - Natürliche Sprache KW - Vereinheitlichung KW - Datenmodell KW - Softwarewiederverwendung Y1 - 2002 U6 - https://nbn-resolving.org/urn:nbn:de:bsz:mh39-111602 UN - https://nbn-resolving.org/urn:nbn:de:bsz:mh39-111602 UR - http://www.lrec-conf.org/proceedings/lrec2002/pdf/296.pdf UR - https://aclanthology.org/L02-1296/ SP - 26 EP - 30 PB - European Language Resources Association (ELRA) CY - Paris ER -