TY  - CHAP
U1  - Konferenzveröffentlichung
A1  - Baumann, Stefan
A1  - Brinckmann, Caren
A1  - Hansen-Schirra, Silvia
A1  - Kruijff, Geert-Jan
A1  - Kruijff-Korbayová, Ivana
A1  - Neumann, Stella
A1  - Steiner, Erich
A1  - Teich, Elke
A1  - Uszkoreit, Hans
T1  - The MULI Project: Annotation and Analysis of Information Structure in German and English
T2  - Proceedings of the 4th International Conference on Language Resources and Evaluation (LREC 2004). Lisbon, Portugal
N2  - The goal of the MULI (MUltiLingual Information structure) project is to empirically analyse information structure in German and English newspaper texts. In contrast to other projects in which information structure is annotated and investigated (e.g. in the Prague Dependency Treebank, which mirrors the basic information about the topic-focus articulation of the sentence), we do not annotate theory-biased categories like topic-focus or theme-rheme. Trying to be as theory-independent as possible, we annotate those features which are relevant to information structure and on the basis of which typical patterns, co-occurrences or correlations can be determined. We distinguish between three annotation levels: syntax, discourse and prosody. The data is based on the TIGER Corpus for German and the Penn Treebank for English, since the existing information on part-of-speech and syntactic structure can be re-used for our purposes. The actual annotation of an English example sequence illustrates our choice of categories on each level. Their combination offers the possibility to investigate how information structure is realised and can be interpreted.
KW  - Deutsch
KW  - Automatische Sprachanalyse
KW  - Englisch
KW  - Annotation
Y1  - 2004
U6  - https://nbn-resolving.org/urn:nbn:de:bsz:mh39-68474
UN  - https://nbn-resolving.org/urn:nbn:de:bsz:mh39-68474
UR  - http://www.lrec-conf.org/proceedings/lrec2004/
SN  - 2-9517408-1-6
SB  - 2-9517408-1-6
SP  - 1
EP  - 4
PB  - European Language Resources Association (ELRA)
CY  - Paris
ER  -