430 German
This article details the process of creating the Nottinghamer Korpus deutscher YouTube-Sprache ('The Nottingham German YouTube Language Corpus', or NottDeuYTSch corpus) and outlines potential research opportunities. The corpus was compiled to analyse the online language produced by young German speakers and offers significant opportunity for in-depth research across several linguistic fields, including lexis, morphology, syntax, orthography, and conversational and discursive analysis. The NottDeuYTSch corpus contains over 33 million words taken from approximately 3 million YouTube comments on videos published between 2008 and 2018 that were targeted at a young, German-speaking demographic, and it represents an authentic language snapshot of young German speakers. The corpus was proportionally sampled by video category and year from a database of 112 popular German-speaking YouTube channels in the DACH region for optimal representativeness and balance. It contains a considerable amount of associated metadata for each comment, which enables further longitudinal cross-sectional analyses. The NottDeuYTSch corpus is available for analysis as part of the German Reference Corpus (DeReKo).
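The proportional sampling by video category and year mentioned in this abstract can be illustrated with a minimal sketch. This is not the authors' actual procedure: the function name `proportional_sample`, the tuple layout of the toy comments, and the target size are all hypothetical and chosen only for illustration.

```python
import random
from collections import defaultdict


def proportional_sample(items, key, total, seed=0):
    """Draw roughly `total` items so that each stratum keeps its population share.

    Hypothetical helper for illustration; not taken from the NottDeuYTSch pipeline.
    """
    rng = random.Random(seed)

    # Group the items into strata, e.g. one stratum per (category, year) pair.
    strata = defaultdict(list)
    for item in items:
        strata[key(item)].append(item)

    sample = []
    for group in strata.values():
        # Quota is proportional to the stratum's size; because of rounding,
        # the final sample size can differ slightly from `total`.
        quota = round(total * len(group) / len(items))
        sample.extend(rng.sample(group, min(quota, len(group))))
    return sample


# Hypothetical toy data: (category, year, comment_id)
comments = (
    [("gaming", 2015, i) for i in range(600)]
    + [("music", 2015, i) for i in range(300)]
    + [("comedy", 2016, i) for i in range(100)]
)
sample = proportional_sample(comments, key=lambda c: (c[0], c[1]), total=100)
```

With these toy numbers, the strata make up 60%, 30% and 10% of the population, so the sample keeps the same shares across category and year.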
Comprehending conditional statements is fundamental for hypothetical reasoning about situations. However, the online comprehension of conditional statements containing different conditional connectives is still debated. We report two self-paced reading experiments on German conditionals presenting the conditional connectives wenn (‘if’) and nur wenn (‘only if’) in identical discourse contexts. In Experiment 1, participants read a conditional sentence followed by the confirmed antecedent p and the confirmed or negated consequent q. The final, critical sentence was presented word by word and contained a positive or negative quantifier (ein/kein ‘one/no’). Reading times of the two quantifiers did not differ between the two conditional connectives. In Experiment 2, presenting a negated antecedent, reading times for the critical positive quantifier (ein) did not differ between conditional connectives, while reading times for the negative quantifier (kein) were shorter for nur wenn than for wenn. The results show that comprehenders form distinct predictions about discourse continuations due to differences in the lexical semantics of the tested conditional connectives, shedding light on the role of conditional connectives in the online interpretation of conditionals in general.
With recourse to a broader understanding of the concept of translation, the transfer of source texts in one variety into another variety of the same language can also be called translation. This paper focuses on the target language, or rather the target variety, “easy-to-read language”, which is meant to make texts comprehensible for people with communication limitations. Considering its origins in the disability rights movement, the aim is to inform affected persons about their rights and democratic processes, i.e. to translate legal texts in particular into so-called easy-to-read language. Although there is a whole range of rules and guidelines for formulating in easy-to-read language, “none offers a sufficient approach for translation into easy-to-read language” (Bredel & Maaß, 2016a, p. 109). Standardization of the variety is also still a long way off. On the one hand, the paper takes stock of legal regulations in easy-to-read language. On the other hand, four versions of the Federal Participation Law in easy-to-read language are analysed with regard to their external features and the constructions used to explain technical terminology. The analysis shows that legal texts in easy-to-read language are (still) quite limited in number and are also difficult to find. Concerning the second part, the constructions used exhibit great structural variance, both intra- and intertextually. It is therefore questionable whether the addressees can access the texts independently. It also remains necessary to make the rules, the formulations of the rules and the implementations clearer so that the translations fulfil their function.
Are borrowed neologisms accepted more slowly into the German language than German words resulting from the application of word formation rules? This study addresses this question by focusing on two possible indicators for the acceptance of neologisms: a) frequency development of 239 German neologisms from the 1990s (loanwords as well as new words resulting from the application of word formation rules) in the German reference corpus DeReKo and b) frequency development in the use of pragmatic markers (‘flags’, namely quotation marks and phrases such as sogenannt ‘so-called’) with these words. The second part of the article outlines a psycholinguistic approach to evaluating the (psychological) status of different neologisms and non-words in an experimentally controlled study, as well as plans to carry out interviews in a field test to collect speakers’ opinions on the acceptance of the analysed neologisms. Finally, implications for the lexicographic treatment of both types of neologisms are discussed.
This paper deals with the creation of the first morphological treebank for German by merging two pre-existing linguistic databases. The first of these is the linguistic database CELEX, which is a standard resource for German morphology. We build on its refurbished and modernized version. The second resource is GermaNet, a lexical-semantic network which also provides partial markup for compounds. We describe the state of the art and the essential characteristics of both databases and our latest revisions. As the merging involves two data sources with distinct annotation schemes, the derivation of the morphological trees for the unified resource is not trivial. We discuss how we overcome problems with the data and format, in particular how we deal with overlaps and complementary scopes. The resulting database comprises about 100,000 trees whose format can be chosen according to the requirements of the application at hand. In our discussion, we show some future directions for morphological treebanks. The Perl script for the generation of the data from the sources will be made publicly available on our website.
Instruction practices in German driving lessons: Differential uses of declaratives and imperatives
(2018)
Building on a corpus of 70 hours of German driving lessons, this paper studies the use of declaratives vs. imperatives for instruction. It shows how these linguistic resources are adapted to different praxeological, temporal and participant-related environments. Declaratives are used for first instructions, task-setting and post-trial discussions. They exhibit complex syntax and do not call for immediate compliance. Their high degree of explicitness conveys how the action is to be carried out. Imperative instructions overwhelmingly correct ongoing actions of students or respond to their failure to produce expected actions. They exhibit minimal argument structure. They are reminders which presuppose that the student monitors the scene and can perform the action unproblematically. They index that requests have to be complied with immediately or even urgently.
The naturalness of synthetic speech depends strongly on the prediction of appropriate prosody. For the present study the original annotation of the German speech database “Kiel Corpus of Read Speech” was extended automatically with syntactic features, word frequency, and syllable boundaries. Several classification and regression trees for predicting symbolic prosody features, postlexical phonological processes, duration, and F0 were trained on this database. The perceptual evaluation showed that the overall perceptual quality of the German text-to-speech system MARY can be significantly improved by training all models that contribute to prosody prediction on the same database. Furthermore, it showed that the error introduced by symbolic prosody prediction perceptually equals the error produced by a direct method that does not exploit any symbolic prosody features.
Our paper deals with the use of ICH WEIß NICHT (‘I don’t know’) in German talk-in-interaction. Pursuing an Interactional Linguistics approach, we identify different interactional uses of ICH WEIß NICHT and discuss their relationship to variation in argument structure (SV (O), (O)VS, V-only). After ICH WEIß NICHT with full complementation, speakers emphasize their lack of knowledge or display reluctance to answer. After variants without an object complement, in contrast, speakers display uncertainty about the truth of the following proposition or about its sufficiency as an answer. Thus, while uses with both subject and object tend to close a sequence or display lack of knowledge, responses without an object function as a prepositioned epistemic hedge or a pragmatic marker framing the following TCU. When ICH WEIß NICHT is used in response to a statement, it indexes disagreement (independently of the complementation pattern).
The present paper deals with grammaticalization as a comprehensive model of erosive processes in the history of natural languages, exemplified in German and Brazilian Portuguese. Grammaticalization is conceived of as the reduction of pragmatic versatility, semantic concreteness, syntactic liberty and phonetic substance of linguistic elements. It is subdivided into the processes of lexicalization, which transforms polylexematic into monolexematic elements, and deslexicalization, which reduces lexematic to sublexematic elements. In the middle of these processes stands the lexicon, which is seen as the central stock of linguistic elements. Within the lexicon, the process of grammaticalization continues, from lexical word classes through intermediate classes to grammatical word classes. The lower boundary of the lexicon is a critical threshold, down to which the process of grammaticalization is compensated for by linguistic recycling that leads lexematic elements back into the linguistic circuit, through the formation of new polylexematic units. Beyond this threshold, however, no recycling is possible any more, so that elements which have once lost their lexical character are condemned to disappear in the long run. The different stages of grammaticalization are introduced and illustrated by means of concrete examples, first from Brazilian Portuguese and afterwards from German.