Schriftenreihe der Österreichischen Gesellschaft für Artificial Intelligence (ÖGAI)
Wien: Eigenverlag ÖGAI
Refine
Year of publication
- 2012 (2)
Document Type
Language
- English (2) (remove)
Has Fulltext
- yes (2)
Is part of the Bibliography
- no (2) (remove)
Keywords
- Korpus <Linguistik> (2)
- Annotation (1)
- Automatische Sprachanalyse (1)
- Deutsch (1)
- Frame-Semantik (1)
- Gesprochene Sprache (1)
- Interoperabilität (1)
- SALSA (1)
Publicationstate
Reviewstate
- (Verlags)-Lektorat (1)
- Peer-Review (1)
Publisher
- Eigenverlag ÖGAI (2)
5
This paper presents Release 2.0 of the SALSA corpus, a German resource for lexical semantics. The new corpus release provides new annotations for German nouns, complementing the existing annotations of German verbs in Release 1.0. The corpus now includes around 24,000 sentences with more than 36,000 annotated instances. It was designed with an eye towards NLP applications such as semantic role labeling but will also be a useful resource for linguistic studies in lexical semantics.
5
This paper presents an extension to the Stuttgart-Tübingen TagSet, the standard part-of-speech tag set for German, for the annotation of spoken language. The additional tags deal with hesitations, backchannel signals, interruptions, onomatopoeia and uninterpretable material. They allow one to capture phenomena specific to spoken language while, at the same time, preserving inter-operability with already existing corpora of written language.