Schriftenreihe der Österreichischen Gesellschaft für Artificial Intelligence (ÖGAI)
Wien: Eigenverlag ÖGAI
Year of publication: 2012
Language: English
Full text available: yes
Part of the bibliography: no
Review state: Peer-Review
Keywords: Corpus <linguistics>, computational linguistics, German, empirical linguistics, frame semantics, information extraction, food, SALSA
Volume 5
In this paper, we examine methods for automatically extracting domain-specific knowledge about the food domain from unlabeled natural language text. We employ different extraction methods, ranging from surface patterns to co-occurrence measures, applied to different parts of a document. We show that the effectiveness of a particular method depends heavily on the relation type considered and that no single method works equally well for every relation type. We also examine a combination of extraction methods and consider relationships between different relation types. The extraction methods are applied both to a domain-specific corpus and to the domain-independent factual knowledge base Wikipedia. Moreover, we assess the suitability of an open-domain lexical ontology.
Volume 5
This paper presents Release 2.0 of the SALSA corpus, a German resource for lexical semantics. The new corpus release provides new annotations for German nouns, complementing the existing annotations of German verbs in Release 1.0. The corpus now includes around 24,000 sentences with more than 36,000 annotated instances. It was designed with an eye towards NLP applications such as semantic role labeling but will also be a useful resource for linguistic studies in lexical semantics.