Refine
Year of publication
Document Type
- Part of a Book (3005)
- Article (1980)
- Book (679)
- Conference Proceeding (456)
- Part of Periodical (227)
- Review (114)
- Working Paper (58)
- Doctoral Thesis (49)
- Other (42)
- Contribution to a Periodical (14)
- Preprint (8)
- Report (7)
- Habilitation (6)
- (3)
- Master's Thesis (3)
- Bachelor Thesis (2)
- Periodical (2)
- Lecture (1)
Language
- German (5547)
- English (928)
- Russian (120)
- Spanish (16)
- Portuguese (14)
- French (12)
- Multiple languages (9)
- Polish (5)
- Bulgarian (1)
- Danish (1)
Keywords
- Deutsch (3787)
- Korpus <Linguistik> (492)
- Wörterbuch (405)
- Rezension (292)
- Sprachgeschichte (285)
- Konversationsanalyse (261)
- Grammatik (227)
- Gesprochene Sprache (223)
- Rechtschreibung (204)
- Verb (198)
Publicationstate
- Veröffentlichungsversion (2222)
- Zweitveröffentlichung (365)
- Postprint (154)
- Preprint (8)
- Erstveröffentlichung (4)
- (Verlags)-Lektorat (2)
- Verlags-Lektorat (1)
Reviewstate
- (Verlags)-Lektorat (1914)
- Peer-Review (615)
- Verlags-Lektorat (94)
- Peer-review (57)
- Qualifikationsarbeit (Dissertation, Habilitationsschrift) (25)
- Peer-Revied (20)
- Review-Status-unbekannt (13)
- (Verlags-)Lektorat (7)
- Abschlussarbeit (Bachelor, Master, Diplom, Magister) (Bachelor, Master, Diss.) (4)
- Peer-reviewed (4)
Traditionally, research on language change has been a post-mortem activity, focused on isolated changes that are complete and often only documented in written texts. In the 1960s the field was advanced considerably by Labovian sociolinguistics and the investigation of “change in progress” adduced through patterns of community-internal linguistic variation correlated with external facts about speakers such as age and class (see Labov 1994 for an overview). However, despite the many benefits of such work on “dynamic synchrony,” we still know relatively little about how language change unfolds over the lifetimes of individual speakers, that is, in real time (cf. Bailey et al. 1991). The logistical challenges of such research are, of course, considerable. Whereas it is straightforward for psycholinguists to observe language development in children over the course of a few years, documenting changes in the verbal behavior of individuals over several decades is by contrast much less feasible. Nevertheless, present theoretical models of language change could be considerably improved by the results of real-time studies.
Berichtet wird aus einem Forschungsprojekt des Instituts für deutsche Sprache, Mannheim, das sich zum Ziel gesetzt hat, Sprachwandel in statu nascendi zu beobachten, den Sprecher und die individuellen Veränderungen seines Sprechens und seiner Einstellung zur Sprache nach Ablauf von etwa vier Jahrzehnten in den Blick zu nehmen. Erneut interviewt werden Sprecher deutscher Dialekte oder Umgangssprachen, die in verschiedenen Forschungsprojekten in den 50er und 60er Jahren aufgenommen wurden und von denen eine Tonbandaufnahme im Deutschen Spracharchiv archiviert ist. Im Rahmen einer dem Forschungsprojekt vorgeschalteten inzwischen abgeschlossenen Pilotstudie wurde ein umfängliches methodisches Instrumentarium erprobt, um aussagekräftiges Vergleichsmaterial und Sprachbiographien einiger ausgewählter Sprecher elizitieren zu können. Auf der Basis dieser Studie werden das Projektdesign und die Analysekategorien für die Hauptuntersuchung festgelegt.
In this paper, we describe MLSA, a publicly available multi-layered reference corpus for German-language sentiment analysis. The construction of the corpus is based on the manual annotation of 270 German-language sentences considering three different layers of granularity. The sentence-layer annotation, as the most coarse-grained annotation, focuses on aspects of objectivity, subjectivity and the overall polarity of the respective sentences. Layer 2 is concerned with polarity on the word- and phrase-level, annotating both subjective and factual language. The annotations on Layer 3 focus on the expression-level, denoting frames of private states such as objective and direct speech events. These three layers and their respective annotations are intended to be fully independent of each other. At the same time, exploring for and discovering interactions that may exist between different layers should also be possible. The reliability of the respective annotations was assessed using the average pairwise agreement and Fleiss’ multi-rater measures. We believe that MLSA is a beneficial resource for sentiment analysis research, algorithms and applications that focus on the German language.
We use a convolutional neural network to perform authorship identification on a very homogeneous dataset of scientific publications. In order to investigate the effect of domain biases, we obscure words below a certain frequency threshold, retaining only their POS-tags. This procedure improves test performance due to better generalization on unseen data. Using our method, we are able to predict the authors of scientific publications in the same discipline at levels well above chance.
Though polarity classification has been extensively explored at document level, there has been little work investigating feature design at sentence level. Due to the small number of words within a sentence, polarity classification at sentence level differs substantially from document-level classification in that resulting bag-of-words feature vectors tend to be very sparse resulting in a lower classification accuracy.
In this paper, we show that performance can be improved by adding features specifically designed for sentence-level polarity classification. We consider both explicit polarity information and various linguistic features. A great proportion of the improvement that can be obtained by using polarity information can also be achieved by using a set of simple domain-independent linguistic features.