Refine
Year of publication
- 2013 (71) (remove)
Document Type
- Part of a Book (44)
- Article (16)
- Book (4)
- Conference Proceeding (4)
- Review (3)
Has Fulltext
- yes (71)
Keywords
- Deutsch (28)
- Korpus <Linguistik> (9)
- Lexikographie (6)
- Gesprochene Sprache (5)
- Sprachwandel (5)
- Wörterbuch (5)
- Diskursanalyse (4)
- Rezension (4)
- Cyber-Mobbing (3)
- Funktionswort (3)
Publicationstate
- Veröffentlichungsversion (53)
- Zweitveröffentlichung (13)
- Postprint (1)
Reviewstate
- (Verlags)-Lektorat (71) (remove)
Publisher
- de Gruyter (17)
- De Gruyter (4)
- Lang (4)
- Narr (3)
- Olms (3)
- Winter (3)
- ATUT / Neisse (2)
- Hempen (2)
- Springer (2)
- UCREL (2)
Linguistic query systems are special purpose IR applications. We present a novel state-of-the-art approach for the efficient exploitation of very large linguistic corpora, combining the advantages of relational database management systems (RDBMS) with the functional MapReduce programming model. Our implementation uses the German DEREKO reference corpus with multi-layer
linguistic annotations and several types of text-specific metadata, but the proposed strategy is language-independent and adaptable to large-scale multilingual corpora.
Mediatization and Mediality in Social Media: the Discourse System Twitter
The article contributes to the debate about mediatization and the use of language in social media. The theoretical approach evolves from the intersection of linguistics, media and communication studies. While the concept of mediatization describes relations between medial and sociocultural change and the ubiquity of media in everyday life, the concept of mediality sheds light on the inseparability of media and language. From this interdisciplinary perspective, specific practices of media and language use within the microblogging service Twitter were analyzed. Examples from different case studies reveal certain user practices that can be described as formed by ‘moulding forces’ of the medium Twitter without considering technology as determining or symptomatic. Our analysis shows that the use of specific semiotic and functional operators (#, @, RT, http://) establish user practices of creating personal and semantic references and thus constitute Twitter as a multi-referential discourse system.
Der Beitrag analysiert die Strukturen der Inhaltsdistribution im Microblogging-System Twitter. Den Ausgangspunkt hierfür bildet eine Fokussierung der Medienforschung auf Produktion und Rezeption von „User Generated Content“ im Social Web, die ebenso wie die Annahme einer „freien“ Wahl von Themen- und Informationsquellen im Web hinterfragt werden soll. Die zentrale These lautet hierbei, dass nicht nur Nutzerinnen und Nutzer über die Verteilung der Inhalte bestimmen, sondern in hohem Maße auch Algorithmen. Im Konzept der selektiven Distribution werden die typischen Distributionsmodi sowie deren Erzeugungsmechanismen herausgearbeitet und dargestellt. Die medienethische Verantwortung für die Verteilung der nutzergenerierten Inhalte liegt (auch) bei den Medienunternehmen, die die Macht über algorithmische Distributionsstrukturen haben. Die Unternehmen geraten dadurch, wie abschließend argumentiert wird, in einen Konflikt zwischen wirtschaftlichen Interessen und gesellschaftlicher Verantwortung. Aus der Analyse ergeben sich Forderungen nach mehr Transparenz der algorithmischen Distributionsprinzipien sowie mehr Kontrollmöglichkeiten für die User.
Extending the possibilities for collaborative work with TEI/XML through the usage of a wiki system
(2013)
This paper presents and discusses an integrated project-specific working environment for editing TEI/XML-files and linking entities of interest to a dedicated wiki system. This working environment has been specifically tailored to the workflow in our interdisciplinary digital humanities project GeoBib. It addresses some challenges that arose while working with person-related data and geographical references in a growing collection of TEI/XML-files. While our current solution provides some essential benefits, we also discuss several critical issues and challenges that remain.
Contemporary studies on the characteristics of natural language benefit enormously from the increasing amount of linguistic corpora. Aside from text and speech corpora, corpora of computer-mediated communication (CMC) Position themselves between orality and literacy, and beyond that provide in- sight into the impact of "new", mainly intemet-based media on language beha- viour. In this paper, we present an empirical attempt to work with annotated CMC corpora for the explanation of linguistic phenomena. In concrete terms, we implement machine leaming algorithms to produce decision trees that reveal rules and tendencies about the use of genitive markers in German.