OPUS 4 | Search

Refine

Has Fulltext

yes (71)

71 search hits

1 to 10

Sort by

KoGra-DB: Using MapReduce for language corpora (2013)

Linguistic query systems are special purpose IR applications. We present a novel state-of-the-art approach for the efficient exploitation of very large linguistic corpora, combining the advantages of relational database management systems (RDBMS) with the functional MapReduce programming model. Our implementation uses the German DEREKO reference corpus with multi-layer linguistic annotations and several types of text-specific metadata, but the proposed strategy is language-independent and adaptable to large-scale multilingual corpora.

Mediatisierung und Medialität in Social Media: Das Diskurssystem „Twitter“ (2013)

Dang-Anh, Mark ; Einspänner, Jessica ; Thimm, Caja

Mediatization and Mediality in Social Media: the Discourse System Twitter The article contributes to the debate about mediatization and the use of language in social media. The theoretical approach evolves from the intersection of linguistics, media and communication studies. While the concept of mediatization describes relations between medial and sociocultural change and the ubiquity of media in everyday life, the concept of mediality sheds light on the inseparability of media and language. From this interdisciplinary perspective, specific practices of media and language use within the microblogging service Twitter were analyzed. Examples from different case studies reveal certain user practices that can be described as formed by ‘moulding forces’ of the medium Twitter without considering technology as determining or symptomatic. Our analysis shows that the use of specific semiotic and functional operators (#, @, RT, http://) establish user practices of creating personal and semantic references and thus constitute Twitter as a multi-referential discourse system.

Lesen, Schreiben, Erzählen: Vorwort (2013)

Lobin, Henning ; Leitenstern, Regine ; Lehnen, Katrin ; Klawitter, Jana

Die Macht der Algorithmen - Selektive Distribution in Twitter (2013)

Dang-Anh, Mark ; Einspänner, Jessica ; Thimm, Caja

Der Beitrag analysiert die Strukturen der Inhaltsdistribution im Microblogging-System Twitter. Den Ausgangspunkt hierfür bildet eine Fokussierung der Medienforschung auf Produktion und Rezeption von „User Generated Content“ im Social Web, die ebenso wie die Annahme einer „freien“ Wahl von Themen- und Informationsquellen im Web hinterfragt werden soll. Die zentrale These lautet hierbei, dass nicht nur Nutzerinnen und Nutzer über die Verteilung der Inhalte bestimmen, sondern in hohem Maße auch Algorithmen. Im Konzept der selektiven Distribution werden die typischen Distributionsmodi sowie deren Erzeugungsmechanismen herausgearbeitet und dargestellt. Die medienethische Verantwortung für die Verteilung der nutzergenerierten Inhalte liegt (auch) bei den Medienunternehmen, die die Macht über algorithmische Distributionsstrukturen haben. Die Unternehmen geraten dadurch, wie abschließend argumentiert wird, in einen Konflikt zwischen wirtschaftlichen Interessen und gesellschaftlicher Verantwortung. Aus der Analyse ergeben sich Forderungen nach mehr Transparenz der algorithmischen Distributionsprinzipien sowie mehr Kontrollmöglichkeiten für die User.

Extending the possibilities for collaborative work with TEI/XML through the usage of a wiki system (2013)

Entrup, Bastian ; Binder, Frank ; Lobin, Henning

This paper presents and discusses an integrated project-specific working environment for editing TEI/XML-files and linking entities of interest to a dedicated wiki system. This working environment has been specifically tailored to the workflow in our interdisciplinary digital humanities project GeoBib. It addresses some challenges that arose while working with person-related data and geographical references in a growing collection of TEI/XML-files. While our current solution provides some essential benefits, we also discuss several critical issues and challenges that remain.

Vom Konnektorenhandbuch zum Konnektorenwörterbuch (2013)

Breindl, Eva

Monika Schwarz-Friesel/Jehuda Reinharz: Die Sprache der Judenfeindschaft im 21. Jahrhundert. (Europäisch-judische Studien; Beiträge. Herausgegeben vom Moses Mendelssohn Zentrum für europäisch-judische Studien Berlin-Brandenburg. Band 7) Berlin, Boston: de Gruyter 2013.) [Rezension] (2013)

Kämper, Heidrun

Stefan Engelberg u.a. (Hg.): Sprachliches Wissen zwischen Lexikon und Grammatik [Rezension] (2013)

Gerdes, Jens

Von Inseln und Kernen: Gebrauchsbasierte Standard-Begriffe (2013)

Fisseni, Bernhard ; Schröder, Bernhard

Decision Tree-Based Evaluation of Genitive Classification – An Empirical Study on CMC and Text Corpora. Language Processing and Knowledge in the Web (2013)

Hansen, Sandra ; Schneider, Roman

Contemporary studies on the characteristics of natural language benefit enormously from the increasing amount of linguistic corpora. Aside from text and speech corpora, corpora of computer-mediated communication (CMC) Position themselves between orality and literacy, and beyond that provide in- sight into the impact of "new", mainly intemet-based media on language beha- viour. In this paper, we present an empirical attempt to work with annotated CMC corpora for the explanation of linguistic phenomena. In concrete terms, we implement machine leaming algorithms to produce decision trees that reveal rules and tendencies about the use of genitive markers in German.

1 to 10

Open Access

Refine

Author

Year of publication

Document Type

Language

Has Fulltext

Is part of the Bibliography

Keywords

Publicationstate

Reviewstate

Publisher

71 search hits