Refine
Year of publication
- 2002 (167) (remove)
Document Type
- Part of a Book (73)
- Article (51)
- Conference Proceeding (15)
- Book (9)
- Part of Periodical (5)
- Review (5)
- Other (3)
- Working Paper (3)
- Report (2)
- Lecture (1)
Language
- German (137)
- English (26)
- Multiple languages (2)
- French (1)
- Spanish (1)
Has Fulltext
- yes (167) (remove)
Is part of the Bibliography
- no (167) (remove)
Keywords
- Deutsch (73)
- Konversationsanalyse (25)
- Korpus <Linguistik> (11)
- Rezension (11)
- Gesprochene Sprache (9)
- Computerlinguistik (8)
- Neologismus (8)
- Kommunikation (7)
- Interaktion (6)
- Politische Sprache (6)
Publicationstate
- Veröffentlichungsversion (76)
- Zweitveröffentlichung (20)
- Postprint (10)
- Preprint (1)
Reviewstate
- (Verlags)-Lektorat (85)
- Peer-Review (17)
- Verlags-Lektorat (2)
- Peer-review (1)
Publisher
- Narr (25)
- Institut für Deutsche Sprache (17)
- Lang (11)
- Verlag für Gesprächsforschung (11)
- de Gruyter (7)
- Benjamins (6)
- Niemeyer (5)
- Stauffenburg (4)
- Erich Schmidt Verlag (2)
- European Language Resources Association (ELRA) (2)
In this paper, we investigate the practical applicability of Co-Training for the task of building a classifier for reference resolution. We are concerned with the question if Co-Training can significantly reduce the amount of manual labeling work and still produce a classifier with an acceptable performance.
We describe a simple and efficient Java object model and application programming interface (API) for (possibly multi-modal) annotated natural language corpora. Corpora are represented as elements like Sentences, Turns, Utterances, Words, Gestures and Markables. The API allows linguists to access corpora in terms of these discourse-level elements, i.e. at a conceptual level they are familiar with, with the flexibility offered by a general purpose programming language. It is also a contribution to corpus standardization efforts because it is based on a straightforward and easily extensible data model which can serve as a target for conversion of different corpus formats.
Kein Grund zur Panikmache
(2002)