OPUS 4 | Search

3 search hits

1 to 3

Sort by

Flexible UIMA components for information retrieval research (2008)

Müller, Christof ; Zesch, Torsten ; Müller, Mark-Christoph ; Bernhard, Delphine ; Ignatova, Kateryna ; Gurevych, Iryna ; Mühlhäuser, Max

In this paper, we present a suite of flexible UIMA-based components for information retrieval research which have been successfully used (and re-used) in several projects in different application domains. Implementing the whole system as UIMA components is beneficial for configuration management, component reuse, implementation costs, analysis and visualization.

Information extraction with the Darmstadt Knowledge Processing Software Repository (Extended Abstract) (2008)

Gurevych, Iryna ; Müller, Mark-Christoph

Current Natural Language Processing (NLP) systems feature high-complexity processing pipelines that require the use of components at different levels of linguistic and application specific processing. These components often have to interface with external e.g. machine learning and information retrieval libraries as well as tools for human annotation and visualization. At the UKP Lab, we are working on the Darmstadt Knowledge Processing Software Repository (DKPro) (Gurevych et al., 2007a; Müller et al., 2008) to create a highly flexible, scalable and easy-to-use toolkit that allows rapid creation of complex NLP pipelines for semantic information processing on demand. The DKPro repository consists of several main parts created to serve the purposes of different NLP application areas

LRTwiki: enriching the likelihood ratio test with encyclopedic information for the extraction of relevant terms (2009)

Jakob, Niklas ; Müller, Mark-Christoph ; Gurevych, Iryna

This paper introduces LRTwiki, an improved variant of the Likelihood Ratio Test (LRT). The central idea of LRTwiki is to employ a comprehensive domain specific knowledge source as additional “on-topic” data sets, and to modify the calculation of the LRT algorithm to take advantage of this new information. The knowledge source is created on the basis of Wikipedia articles. We evaluate on the two related tasks product feature extraction and keyphrase extraction, and find LRTwiki to yield a significant improvement over the original LRT in both tasks.

1 to 3

Open Access

Refine

Author

Year of publication

Document Type

Language

Has Fulltext

Is part of the Bibliography

Keywords

Publicationstate

Reviewstate

3 search hits