Korpuslinguistik
Refine
Year of publication
- 2013 (8) (remove)
Document Type
- Part of a Book (4)
- Conference Proceeding (3)
- Article (1)
Has Fulltext
- yes (8)
Is part of the Bibliography
- no (8)
Keywords
- Korpus <Linguistik> (8)
- Deutsch (4)
- Korpusanalyseplattform (KorAP) (2)
- Automatische Sprachanalyse (1)
- Französisch (1)
- Fremdsprachenlernen (1)
- French-German (1)
- Internet (1)
- Methode (1)
- Neologismus (1)
Publicationstate
- Veröffentlichungsversion (8) (remove)
Reviewstate
- (Verlags)-Lektorat (6)
- Peer-Review (1)
Publisher
- UCREL (2)
- GSCL (1)
- Institut für Deutsche Sprache (1)
- Köllen (1)
- Narr (1)
- Université de Strasbourg (1)
Linguistic query systems are special purpose IR applications. We present a novel state-of-the-art approach for the efficient exploitation of very large linguistic corpora, combining the advantages of relational database management systems (RDBMS) with the functional MapReduce programming model. Our implementation uses the German DEREKO reference corpus with multi-layer
linguistic annotations and several types of text-specific metadata, but the proposed strategy is language-independent and adaptable to large-scale multilingual corpora.
Editorial
(2013)
Igel is a small XQuery-based web application for examining a collection of document grammars; in particular, for comparing related document grammars to get a better overview of their differences and similarities. In its initial form, Igel reads only DTDs and provides only simple lists of constructs in them (elements, attributes, notations, parameter entities). Our continuing work is aimed at making Igel provide more sophisticated and useful information about document grammars and building the application into a useful tool for the analysis (and the maintenance!) of families of related document grammars