Refine
Year of publication
Document Type
- Part of a Book (9)
- Article (8)
- Conference Proceeding (2)
- Other (1)
- Working Paper (1)
Language
- German (21) (remove)
Keywords
- Korpus <Linguistik> (15)
- Deutsch (10)
- Computerlinguistik (4)
- Deutsches Referenzkorpus (DeReKo) (4)
- Institut für Deutsche Sprache <Mannheim> (3)
- Methode (3)
- Neologismus (3)
- Gesprochene Sprache (2)
- Studium (2)
- Annotation (1)
Publicationstate
Reviewstate
- (Verlags)-Lektorat (10)
- Peer-Review (3)
- Verlags-Lektorat (1)
- Verlagslektorat (1)
This paper describes an approach to modelling a general-language wordnet, GermaNet, and a domain-specific wordnet, TermNet, in the web ontology language OWL. While the modelling process for GermaNet adopts relevant recommendations with respect to the English Princeton WordNet, for Term-Net an alternative modelling concept is developed that considers the special characteristics of domain-specific terminologies. We present a proposal for linking a general-language wordnet and a terminological wordnet within the framework of OWL and on this basis discuss problems and alternative modelling approaches.
Linguistische Annotationen für die Analyse von Gliederungsstrukturen wissenschaftlicher Texte
(2012)
Editorial
(2011)
Most research on automated categorization of documents has concentrated on the assignment of one or many categories to a whole text. However, new applications, e.g. in the area of the Semantic Web, require a richer and more fine-grained annotation of documents, such as detailed thematic information about the parts of a document. Hence we investigate the automatic categorization of text segments of scientific articles with XML markup into 16 topic types from a text type structure schema. A corpus of 47 linguistic articles was provided with XML markup on different annotation layers representing text type structure, logical document structure, and grammatical categories. Six different feature extraction strategies were applied to this corpus and combined in various parametrizations in different classifiers. The aim was to explore the contribution of each type of information, in particular the logical structure features, to the classification accuracy. The results suggest that some of the topic types of our hierarchy are successfully learnable, while the features from the logical structure layer had no particular impact on the results.