The compilation of terminological vocabularies plays a central role in the organization and retrieval of scientific texts. Both simple keyword lists and sophisticated models of the relationships between terminological concepts can contribute substantially to the analysis, classification, and retrieval of relevant digital documents, whether on the Web or in local repositories. This seems especially true for long-established scientific fields with various theoretical and historical branches, such as linguistics, where the use of terminology across documents of different origins is sometimes far from consistent. In this short paper, we report on the early stages of a project that aims to redesign grammis, an existing domain-specific knowledge organization system (KOS) for grammatical content. In particular, we deal with the terminological part of grammis and present the current state of this online resource as well as the key redesign principles. Further, we raise questions about the ramifications of the Linked Open Data and Semantic Web approaches for our redesign decisions.
Editorial
(2013)
Linguistic query systems are special-purpose IR applications. We present a novel approach for the efficient exploitation of very large linguistic corpora, combining the advantages of relational database management systems (RDBMS) with the functional MapReduce programming model. Our implementation uses the German DEREKO reference corpus with multi-layer linguistic annotations and several types of text-specific metadata, but the proposed strategy is language-independent and adaptable to large-scale multilingual corpora.
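The combination of relational storage and MapReduce-style processing described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the `token` table, its columns, the toy data, and all function names are assumptions made for illustration. Each text is treated as one partition, a map step queries the database for that partition, and a reduce step merges the partial counts.

```python
import sqlite3
from collections import Counter
from functools import reduce


def build_toy_corpus():
    """Create an in-memory table of annotated tokens (hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE token (text_id INTEGER, position INTEGER, word TEXT, pos TEXT)"
    )
    rows = [
        (1, 0, "der", "ART"), (1, 1, "Hund", "NN"), (1, 2, "bellt", "VVFIN"),
        (2, 0, "die", "ART"), (2, 1, "Katze", "NN"), (2, 2, "schläft", "VVFIN"),
        (3, 0, "der", "ART"), (3, 1, "Hund", "NN"), (3, 2, "schläft", "VVFIN"),
    ]
    conn.executemany("INSERT INTO token VALUES (?, ?, ?, ?)", rows)
    return conn


def map_partition(conn, text_id, pos_tag):
    """Map step: count words with the given POS tag inside one text."""
    cursor = conn.execute(
        "SELECT word FROM token WHERE text_id = ? AND pos = ?", (text_id, pos_tag)
    )
    return Counter(word for (word,) in cursor)


def reduce_counts(left, right):
    """Reduce step: merge two partial frequency tables."""
    return left + right


def count_words_by_pos(conn, pos_tag):
    """Run the map step per text, then fold the partial results together."""
    text_ids = [
        row[0]
        for row in conn.execute("SELECT DISTINCT text_id FROM token ORDER BY text_id")
    ]
    partials = [map_partition(conn, text_id, pos_tag) for text_id in text_ids]
    return reduce(reduce_counts, partials, Counter())
```

In this sketch the map calls run sequentially. Because each call touches only its own partition, they could in principle be distributed across workers, which is the property the MapReduce model relies on.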
The main objective of this article is to describe the current activities at the Mannheim Institute for German Language regarding the implementation of a domain-specific ontology for German grammar. We differentiate ontology bases from ontology management systems, point out the benefits of database-driven solutions, and go step by step through all phases of the ontology lifecycle. In order to demonstrate the practical use of our approach, we outline the interface between our ontology and the grammis web information system, and compare the ontology-based retrieval mechanism with traditional full-text search.
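The contrast between ontology-based retrieval and full-text search can be sketched roughly as follows. The concept entries, relations, documents, and function names are toy assumptions for illustration only and do not reflect the actual grammis ontology or its interface. The idea shown is that a concept query is expanded with synonyms and narrower concepts before matching, whereas full-text search matches only the literal string.

```python
# Toy ontology: each concept has synonyms and narrower concepts (assumed data).
CONCEPTS = {
    "Substantiv": {"synonyms": {"Nomen", "Hauptwort"}, "narrower": {"Eigenname"}},
    "Eigenname": {"synonyms": set(), "narrower": set()},
}

# Toy document collection (assumed data).
DOCUMENTS = {
    "doc1": "Das Nomen wird im Deutschen großgeschrieben.",
    "doc2": "Ein Eigenname bezeichnet ein einzelnes Individuum.",
    "doc3": "Das Verb steht im Hauptsatz an zweiter Stelle.",
}


def full_text_search(query, documents):
    """Return ids of documents that contain the literal query string."""
    needle = query.lower()
    return sorted(
        doc_id for doc_id, text in documents.items() if needle in text.lower()
    )


def expand_concept(concept, concepts):
    """Collect the concept, its synonyms, and all narrower concepts' terms."""
    terms, seen, stack = set(), set(), [concept]
    while stack:
        current = stack.pop()
        if current in seen:
            continue
        seen.add(current)
        terms.add(current)
        entry = concepts.get(current, {})
        terms |= entry.get("synonyms", set())
        stack.extend(entry.get("narrower", set()))
    return terms


def ontology_search(concept, concepts, documents):
    """Search for every term the ontology associates with the concept."""
    hits = set()
    for term in expand_concept(concept, concepts):
        hits.update(full_text_search(term, documents))
    return sorted(hits)
```

With this toy data, a full-text search for "Substantiv" finds nothing because no document uses that exact word, while the ontology-based search also reaches documents that use a synonym or a narrower term.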
Song lyrics can be considered a text genre that has features of both written and spoken discourse, and that potentially provides extensive linguistic and cultural information to scientists from various disciplines. However, pop songs have so far played a rather subordinate role in empirical language research, most likely due to the absence of scientifically valid and sustainable resources. The present paper introduces a multiply annotated corpus of German lyrics as a publicly available basis for multidisciplinary research. The resource contains three types of data for the investigation and evaluation of quite distinct phenomena: TEI-compliant song lyrics as primary data, linguistically and literarily motivated annotations, and extralinguistic metadata. It promotes empirically and statistically grounded analyses of genre-specific features, systemic-structural correlations, and tendencies in the texts of contemporary pop music. The corpus has been stratified into thematic and author-specific archives; the paper presents some basic descriptive statistics, as well as the public online frontend with its built-in evaluation forms and live visualisations.