OPUS 4 | Search

22 search hits

1 to 10

Sort by

Year
Year
Title
Title
Author
Author

Towards the Detection of Reliable Food-Health Relationships (2013)

We investigate the task of detecting reliable statements about food-health relationships from natural language texts. For that purpose, we created a specially annotated web corpus from forum entries discussing the healthiness of certain food items. We examine a set of task-specific features (mostly) based on linguistic insights that are instrumental in finding utterances that are commonly perceived as reliable. These features are incorporated in a supervised classifier and compared against standard features that are widely used for various tasks in natural language processing, such as bag of words, part-of speech and syntactic parse information.

POS für(s) FOLK – Part of Speech Tagging des Forschungs- und Lehrkorpus Gesprochenes Deutsch (2013)

Westpfahl, Swantje ; Schmidt, Thomas

Designing a bilingual speech corpus for French and German language learners (2013)

Trouvain, Jürgen ; Laprie, Yves ; Möbius, Bernd ; Andreeva, Bistra ; Bonneau, Anne ; Colotte, Vincent ; Fauth, Camille ; Fohr, Dominique ; Jouvet, Denis ; Mella, Odile ; Jügler, Jeanin ; Zimmerer, Frank

Präsenz und Absenz lokaler Diskursgebrauchsmuster am Beispiel des deutschen und britischen Krisendiskurses (2013)

Storjohann, Petra ; Schröter, Melani

Lexical, corpus-methodological and lexicographic approaches to paronyms (2013)

Storjohann, Petra

Igel: Comparing document grammars using XQuery (2013)

Sperberg-McQueen, Christopher M. ; Schonefeld, Oliver ; Kupietz, Marc ; Lüngen, Harald ; Witt, Andreas

Igel is a small XQuery-based web application for examining a collection of document grammars; in particular, for comparing related document grammars to get a better overview of their differences and similarities. In its initial form, Igel reads only DTDs and provides only simple lists of constructs in them (elements, attributes, notations, parameter entities). Our continuing work is aimed at making Igel provide more sophisticated and useful information about document grammars and building the application into a useful tool for the analysis (and the maintenance!) of families of related document grammars

Metadaten für Gesprächsdatenbanken. Ein Überblick und ihre Verwaltung in der IDS-Datenbank Gesprochenes Deutsch (DGD) (2013)

Schütte, Wilfried

Editorial (2013)

Schneider, Roman ; Storrer, Angelika ; Mehler, Alexander

KoGra-DB: Using MapReduce for language corpora (2013)

Schneider, Roman

Linguistic query systems are special purpose IR applications. We present a novel state-of-the-art approach for the efficient exploitation of very large linguistic corpora, combining the advantages of relational database management systems (RDBMS) with the functional MapReduce programming model. Our implementation uses the German DEREKO reference corpus with multi-layer linguistic annotations and several types of text-specific metadata, but the proposed strategy is language-independent and adaptable to large-scale multilingual corpora.

Die Datenbank für Gesprochenes Deutsch - DGD2 (2013)

Schmidt, Thomas ; Dickgießer, Sylvia ; Gasch, Joachim

Die „Datenbank für Gesprochenes Deutsch“ (DGD2) ist ein Korpusmanagementsystem im Archiv für Gesprochenes Deutsch (AGD) am Institut für Deutsche Sprache. Über die DGD2 werden Teilbestände des Archivs (Audioaufnahmen gesprochener Sprache, sowie zugehörige Metadaten, Transkripte und Zusatzmaterialien) der wissenschaftlichen Öffentlichkeit online zur Verfügung gestellt. Sie enthält derzeit knapp 9000 Datensätze aus 18 Korpora. Die DGD2 ist das Nachfolgesystem der älteren „Datenbank Gesprochenes Deutsch“ (ab hier: DGD1, siehe Fiehler/Wagener 2005). Da die DGD1 aufgrund ihrer technischen Realisierung mittelfristig kaum wartbar und erweiterbar ist, wurde die DGD2 auf eine neue technische Basis gestellt und stellt insofern keine direkte Weiterentwicklung der DGD1 dar, sondern eine Neuentwicklung, die freilich einen Großteil der Datenbestände und Funktionalität mit der DGD1 teilt. Die DGD2 wurde der Öffentlichkeit erstmals in einem Beta-Release im Februar 2012 zugänglich gemacht. In diesem Beitrag stellen wir die Datenbestände, die technische Realisierung sowie die Funktionalität des ersten offiziellen Release der DGD2 vom Dezember 2012 vor. Wir schließen mit einem Ausblick auf geplante Weiterentwicklungen.

1 to 10

Open Access

Refine

Author

Year of publication

Document Type

Language

Has Fulltext

Is part of the Bibliography

Keywords

Publicationstate

Reviewstate

Publisher

22 search hits