OPUS 4 | Search

76 search hits


Applying co-training to reference resolution (2002)

Müller, Mark-Christoph ; Rapp, Stefan ; Strube, Michael

In this paper, we investigate the practical applicability of Co-Training to the task of building a classifier for reference resolution. We examine whether Co-Training can significantly reduce the amount of manual labeling work and still produce a classifier with acceptable performance.

An API for discourse-level access to XML-encoded corpora (2002)

Müller, Mark-Christoph ; Strube, Michael

We describe a simple and efficient Java object model and application programming interface (API) for (possibly multi-modal) annotated natural language corpora. Corpora are represented as elements like Sentences, Turns, Utterances, Words, Gestures and Markables. The API allows linguists to access corpora in terms of these discourse-level elements, i.e. at a conceptual level they are familiar with, with the flexibility offered by a general purpose programming language. It is also a contribution to corpus standardization efforts because it is based on a straightforward and easily extensible data model which can serve as a target for conversion of different corpus formats.
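To make the idea of discourse-level access concrete, here is a minimal Java sketch of an object model in which an utterance holds words and markables defined over them. It is not the API described in the paper: every class, field, and method name (`DiscourseCorpusSketch`, `Utterance`, `Markable`, `addMarkable`, `textOf`) is a hypothetical choice made for illustration, and XML loading is left out entirely.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// Illustrative sketch only: all names here are hypothetical, not the paper's API.
final class DiscourseCorpusSketch {

    /** A single token of the corpus. */
    static final class Word {
        final String text;
        Word(String text) { this.text = text; }
    }

    /** An annotated span of words, e.g. a referring expression. */
    static final class Markable {
        final int start;   // index of the first word, inclusive
        final int end;     // index of the last word, inclusive
        final String type; // free-form annotation label (assumed)
        Markable(int start, int end, String type) {
            this.start = start;
            this.end = end;
            this.type = type;
        }
    }

    /** An utterance: an ordered list of words plus the markables over them. */
    static final class Utterance {
        private final List<Word> words;
        private final List<Markable> markables = new ArrayList<>();

        Utterance(String... tokens) {
            List<Word> ws = new ArrayList<>();
            for (String t : tokens) {
                ws.add(new Word(t));
            }
            this.words = Collections.unmodifiableList(ws);
        }

        /** Registers a markable covering words start..end (both inclusive). */
        void addMarkable(int start, int end, String type) {
            if (start < 0 || end >= words.size() || start > end) {
                throw new IllegalArgumentException("span out of range");
            }
            markables.add(new Markable(start, end, type));
        }

        List<Word> getWords() { return words; }

        List<Markable> getMarkables() {
            return Collections.unmodifiableList(markables);
        }

        /** Returns the surface text covered by a markable, space-separated. */
        String textOf(Markable m) {
            StringBuilder sb = new StringBuilder();
            for (int i = m.start; i <= m.end; i++) {
                if (i > m.start) {
                    sb.append(' ');
                }
                sb.append(words.get(i).text);
            }
            return sb.toString();
        }
    }

    public static void main(String[] args) {
        Utterance u = new Utterance("the", "old", "man", "saw", "her");
        u.addMarkable(0, 2, "np");
        u.addMarkable(4, 4, "pronoun");
        for (Markable m : u.getMarkables()) {
            System.out.println(m.type + ": " + u.textOf(m));
        }
    }
}
```

In this sketch a client iterates over markables and asks the utterance for their text, without touching the underlying file format, which is the kind of conceptual-level access the abstract describes.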

Ein gesamtdeutscher Westdeutscher. Laudatio auf Horst Dieter Schlosser (2002)

Hellmann, Manfred W.

Neologismen der neunziger Jahre. Vom Textkorpus zur Datenbank (2002)

Tellenbach, Elke

Frühneuhochdeutsche Zustände im Spätneuhochdeutschen? (2002)

Schmidt, Hartmut

Online Access Tools for Spoken German: The Resources of the Deutsches Spracharchiv in a Database (2002)

Wagener, Peter

This paper presents details of the modernization of the Deutsches Spracharchiv (DSAv) and explores future possibilities for linguistic documentation and analysis using the Web. The Institut für Deutsche Sprache (IDS) in Mannheim is the central institution for linguistic research in Germany, and the DSAv at the IDS is the center for documentation of and research on spoken German. Its archives include the largest collection of sound recordings of spoken German (dialects and colloquial speech, including many extinct dialects of former German territories in Eastern Europe), more than 15,000 sound recordings altogether. The insufficient documentation and accessibility of this material has been felt to be a serious deficit. The ability to edit the sound signal digitally makes spoken language much easier to access. Integrating the existing information about the corpora and the transcribed texts into an information and full-text database, and linking these data with the acoustic signal (alignment), produces a data pool with considerably better documentation of the materials and fast, direct access to the recorded sounds. The DSAv thus opens up entirely new research questions for work at the IDS and for linguistics as a whole.

Die Verwendung ethnischer Stereotypen im interethnischen Erstkontakt: Zum Zusammenhang von Selbst- und Fremddarstellung, Interaktionsmodalität und Perspektivität (2002)

Keim, Inken

René Métrich / Eugène Faucher / Gilbert Courdier: Les Invariables Difficiles. Dictionnaire allemand-français des particules, connecteurs, interjections et autres "mots de la communication". Tome 4: obendrein - zwar. ATILF UMR 7118 - CNRS/Université Nancy II. 2001. 388 pages, paperback, € 15 [Review] (2002)

Engel, Ulrich

Traditionen der parlamentarischen Rede. Alte und neue Wörter, Formulierungen und Konstruktionen in den Texten der Frankfurter Nationalversammlung (2002)

Schmidt, Hartmut

Social style of communication and bilingual speech practices: Case study of three migrant youth groups of Turkish origin in Mannheim/Germany (2002)

Keim, Inken