Sprache im 20. Jahrhundert. Gegenwartssprache
Refine
Year of publication
Document Type
- Article (31)
- Part of a Book (31)
- Conference Proceeding (27)
- Doctoral Thesis (1)
Language
- English (90) (remove)
Keywords
- Deutsch (51)
- Englisch (18)
- Korpus <Linguistik> (16)
- Konversationsanalyse (12)
- Gesprochene Sprache (11)
- Interaktion (9)
- Verb (8)
- Bedeutung (5)
- German (5)
- Massenmedien (5)
Publicationstate
- Veröffentlichungsversion (26)
- Postprint (19)
- Zweitveröffentlichung (5)
Reviewstate
Publisher
- Benjamins (8)
- Institut für Deutsche Sprache (7)
- Elsevier (5)
- European Language Resources Association (ELRA) (4)
- Lang (3)
- Verlag für Gesprächsforschung (3)
- de Gruyter (3)
- ISCA (2)
- Niemeyer (2)
- University of Birmingham (2)
"Standard language" is a contested concept, ideologically, empirically and theoretically. This is particularly true for a language such as German, where the standardization of the spoken language was based on the written standard and was established with respect to a communicative situation, i.e. public speech on stage (Bühnenaussprache), which most speakers never come across. As a consequence, the norms of the oral standard exhibit many features which are infrequent in the everyday speech even of educated speakers. This paper discusses ways to arrive at a more realistic conception of (spoken) standard German, which will be termed "standard usage". It must be founded on empirical observations of speakers linguistic choices in everyday situations. Arguments in favor of a corpus-based notion of standard have to consider sociolinguistic, political, and didactic concerns. We report on the design of a large study of linguistic variation conducted at the Institute for the German Language (project "Variation in Spoken German", Variation des gesprochenen Deutsch) with the aim of arriving at a representative picture of "standard usage" in contemporary German. It systematically takes into account both diatopic variation covering the multi-national space in which German an official language, and diastratic variation in terms of varying degrees of formality. Results of the study of phonetic and morphosyntactic variation are discussed. At least for German, a corpus-based notion of "standard usage" inevitably includes some degree of pluralism concerning areal variation, and it needs to do justice to register-based variation as well.
This thesis is a corpus linguistic investigation of the language used by young German speakers online, examining lexical, morphological, orthographic, and syntactic features and changes in language use over time. The study analyses the language in the Nottinghamer Korpus deutscher YouTube‐Sprache ("Nottingham corpus of German YouTube language", or NottDeuYTSch corpus), one of the first large corpora of German‐language comments taken from the videosharing website YouTube, and built specifically for this project. The metadatarich corpus comprises c.33 million tokens from more than 3 million comments posted underneath videos uploaded by mainstream German‐language youthorientated YouTube channels from 2008‐2018.
The NottDeuYTSch corpus was created to enable corpus linguistic approaches to studying digital German youth language (Jugendsprache), having identified the need for more specialised web corpora (see Barbaresi 2019). The methodology for compiling the corpus is described in detail in the thesis to facilitate future construction of web corpora. The thesis is situated at the intersection of Computer‐Mediated Communication (CMC) and youth language, which have been important areas of sociolinguistic scholarship since the 1980s, and explores what we can learn from a corpus‐driven, longitudinal approach to (online) youth language. To do so, the thesis uses corpus linguistic methods to analyse three main areas:
1. Lexical trends and the morphology of polysemous lexical items. For this purpose, the analysis focuses on geil, one of the most iconic and productive words in youth language, and presents a longitudinal analysis, demonstrating that usage of geil has decreased, and identifies lexical items that have emerged as potential replacements. Additionally, geil is used to analyse innovative morphological productiveness, demonstrating how different senses of geil are used as a base lexeme or affixoid in compounding and derivation.
2. Syntactic developments. The novel grammaticalization of several subordinating conjunctions into both coordinating conjunctions and discourse markers is examined. The investigation is supported by statistical analyses that demonstrate an increase in the use of non‐standard syntax over the timeframe of the corpus and compares the results with other corpora of written language.
3. Orthography and the metacommunicative features of digital writing. This analysis identifies orthographic features and strategies in the corpus, e.g. the repetition of certain emoji, and develops a holistic framework to study metacommunicative functions, such as the communication of illocutionary force, information structure, or the expression of identities. The framework unifies previous research that had focused on individual features, integrating a wide range of metacommunicative strategies within a single, robust system of analysis.
By using qualitative and computational analytical frameworks within corpus linguistic methods, the thesis identifies emergent linguistic features in digital youth language in German and sheds further light on lexical and morphosyntactic changes and trends in the language of young people over the period 2008‐2018. The study has also further developed and augmented existing analytical frameworks to widen the scope of their application to orthographic features associated with digital writing.
This paper outlines the generation process of a specifi computational linguistic representation termed the Multilingual Time Map, conceptually a multi-tape finit state transducer encoding linguistic data at different levels of granularity. The fi st component acquires phonological data from syllable labeled speech data, the second component define feature profiles the third component generates feature hierarchies and augments the acquired data with the define feature profiles and the fourth component displays the Multilingual Time Map as a graph.
The paper deals with the use of ICH WEIß NICHT (‘I don’t know’) in German talk-in-interaction. Pursuing an Interactional Linguistics approach, we identify different interactional uses of ICH WEIß NICHT and discuss their relationship to variation in argument structure (SV (O), (O)VS, V-only). After ICH WEIß NICHT with full complementation, speakers emphasize their lack of knowledge or display reluctance to answer. In contrast, after variants without an object complement, in contrast, speakers display uncertainty about the truth of the following proposition or about its sufficiency as an answer. Thus, while uses with both subject and object tend to close a sequence or display lack of knowledge, responses without an object, in contrast, function as a prepositioned epistemic hedge or a pragmatic marker framing the following TCU. When ICH WEIß NICHT is used in response to a statement, it indexes disagreement (independently from all complementation patterns).
Our paper deals with the use of ICH WEIß NICHT (‘I don’t know’) in German talk-in-interaction. Pursuing an Interactional Linguistics approach, we identify different interactional uses of ICH WEIß NICHT and discuss their relationship to variation in argument structure (SV (O), (O)VS, V-only). After ICH WEIß NICHT with full complementation, speakers emphasize their lack of knowledge or display reluctance to answer. In contrast, after variants without an object complement, in contrast, speakers display uncertainty about the truth of the following proposition or about its sufficiency as an answer. Thus, while uses with both subject and object tend to close a sequence or display lack of knowledge, responses without an object, in contrast, function as a prepositioned epistemic hedge or a pragmatic marker framing the following TCU. When ICH WEIß NICHT is used in response to a statement, it indexes disagreement (independently from all complementation patterns).
This paper describes work in progress on I5, a TEI-based document grammar for the corpus holdings of the Institut für Deutsche Sprache (IDS) in Mannheim and the text model used by IDS in its work. The paper begins with background information on the nature and purposes of the corpora collected at IDS and the motivation for the I5 project (section 1). It continues with a description of the origin and history of the IDS text model (section 2), and a description (section 3) of the techniques used to automate, as far as possible, the preparation of the ODD file documenting the IDS text model. It ends with some concluding remarks (section 4). A survey of the additional features of the IDS-XCES realization of the IDS text model is given in an appendix.
The paper will give a concise account of the theory of Lexical Event Structures. The paper has three objectives which correspond to the following three sections. In section 2 I will sketch the theory and discuss the empirical goals the theory pursues (section 2.1) and the semantic components Lexical Event Structures consist of (section 2.2). Section 3 is devoted to linguistic phenomena whose explanation depends on Lexical Event Structures. In section 3.1 I will briefly illustrate in how far Lexical Event Structures are related to phenomena from five central empirical domains of lexical semantics and in section 3.2 it will be shown how Lexical Event Structures function in a linking theory. Section 4 aims to show how the central semantic concepts in Lexical Event Structures can be anchored to concepts which are well-founded in cognitive science. Section 4.1 discusses the event concept employed and illustrates the relation between the perception of movements and the use of verbs of movement. Section 4.2 deals with the concept of volition with respect to the licensing conditions for intransitive verb passives. In section 4.3 the distinction between durativity and punctuality, which has proven relevant for a number of verb semantic phenomena, is tied to the way we perceive events and structure our own actions. Section 5 provides a conclusion.
This paper is concerned with a novel methodology for generating phonetic questions used in tree-based state tying for speech recognition. In order to implement a speech recognition system, language-dependent knowledge which goes beyond annotated material is usually required. The approach presented here generates phonetic questions for decision trees are based on a feature table that summarizes the articulatory characteristics of each sound. On the one hand, this method allows better language-specific triphone models to be defined given only a feature-table as linguistic input. On the other hand, the feature-table approach facilitates efficient definition of triphone models for other languages since again only a feature table for this language is required. The approach is exemplified with speech recognition systems for English and Thai.
The transition between phases of activities is a practical problem which participants in an interaction have to deal with routinely. In meetings, the sequence of phases of activity is often outlined by a written agenda. However, transitions still have to be accomplished by local interactional work of the participants. In a detailed conversation analytic case study based on video-data, it is shown how participants collaboratively accomplish an emergent interactional state of affairs (a break-like activity) which differs widely from the state of affairs which was projected by awritten agenda (the next presentation), although in doing so, the participants still show their continuous orientation to the agenda. The paper argues that the reconstruction of emergent developments in interaction calls for a multimodal analysis of interaction, because the fine-grained multimodal co-ordination of bodily and verbal resources provides for opportunities of sequentially motivated, relevant next actions. These, however, can amount to emergent activity sequences, which may be at odds with the activity types which are projected by an interactional agenda or expected on behalf of some institutional routine.
Language attitudes may be differentiated into attitudes towards speakers and attitudes towards languages. However, to date, no systematic and differentiated instrument exists that measures attitudes towards language. Accordingly, we developed, validated, and applied the Attitudes Towards Languages (AToL) scale in four studies. In Study 1, we selected 15 items for the AToL scale, which represented the three dimensions of value, sound, and structure. The following studies replicated and validated the three-factor structure and differential mean profiles along the three dimensions for different languages (a) in a more diverse German sample (Study 2), (b) in different countries (Study 3), and (c) when participants based their evaluations on speech samples (Study 4). Moreover, we investigated the relation between the AToL dimensions and stereotypic speaker evaluations. Results confirm the reliability, validity, and generalizability of the AToL scale and its incremental value to mere speaker evaluations.
This article is concerned with the use of speech variation in a classroom in south-west Germany. It shows how a teacher uses dialect-standard variation as a resource for carrying out different tasks. An analysis of the variational practices of the teacher reveals two different code-alternating procedures with different functional scopes. On the one hand, the teacher uses code-shifting along a continuum of standard forms, especially to draw attention to relevant aspects of the instructional activities, and to guide participation in the unfolding discourse. On the other hand, she uses the context-cueing function of code-switching between standard and dialect, especially to locally manage the key of interaction (interaction modalities). It is shown that, for the teacher analysed, switching to dialect is a methodological resource which matches the intricate pedagogical tasks involved in the evaluation moves which follow pupils' 'troublesome' answers.
Within cognitive linguistics, there is an increasing awareness that the study of linguistic phenomena needs to be grounded in usage. Ideally, research in cognitive linguistics should be based on authentic language use, its results should be replicable, and its claims falsifiable. Consequently, more and more studies now turn to corpora as a source of data. While corpus-based methodologies have increased in sophistication, the use of corpus data is also associated with a number of unresolved problems. The study of cognition through off-line linguistic data is, arguably, indirect, even if such data fulfils desirable qualities such as being natural, representative and plentiful. Several topics in this context stand out as particularly pressing issues. This discussion note addresses (1) converging evidence from corpora and experimentation, (2) whether corpora mirror psychological reality, (3) the theoretical value of corpus linguistic studies of ‘alternations’, (4) the relation of corpus linguistics and grammaticality judgments, and, lastly, (5) the nature of explanations in cognitive corpus linguistics. We do not claim to resolve these issues nor to cover all possible angles; instead, we strongly encourage reactions and further discussion.
Content analysis provides a useful and multifaceted, methodological framework for Twitter analysis. CAQDAS tools support the structuring of textual data by enabling categorising and coding. Depending on the research objective, it may be appropriate to choose a mixed-methods approach that combines quantitative and qualitative elements of analysis and plays out their respective advantages to the greatest possible extent while minimising their shortcomings. In this chapter, we will discuss CAQDAS speech act analysis of tweets as an example of software-assisted content analysis. We start with some elementary thoughts on the challenges of the collection and evaluation of Twitter data before we give a brief description of the potentials and limitations of using the software QDA Miner (as one typical example for possible analysis programmes). Our focus will lie on analytical features that can be particularly helpful in speech act analysis of tweets.
One major issue in the accomplishment of contrasts in conversation is lexical choice of items which carry the semantic Ioad of the two states of affair which are represented as being opposed to one another. These items or expressions are co-selected to be understood as being contrastively related to each other. In this paper, it is argued that the activity of contrasting itself provides them with a specific local opposite meaning which they would not obtain in other contexts. Practices of contrastingare thus seen as an example of conversational activities which creatively and systematically affect situated meanings. Basedon data from various genres, such as meetings, mediation sessions and conversations, the paper discusses two practices of contrasting, their sequential construction and their interpretative effects. It is concluded that the interpretative effects of conversational contrasting rest on the sequential deployment oflinguistic resources and on the cognitive procedures of frame-based interpretation and constructing a maximally contrastive interpretation for the co-selected expressions.
In the context of the HyTex project, our goal is to convert a corpus into a hypertext, basing conversion strategies on annotations which explicitly mark up the text-grammatical structures and relations between text segments. Domain-specific knowledge is represented in the form of a knowledge net, using topic maps. We use XML as an interchange format. In this paper, we focus on a declarative rule language designed to express conversion strategies in terms of text-grammatical structures and hypertext results. The strategies can be formulated in a concise formal syntax which is independend of the markup, and which can be transformed automatically into executable program code.
Although most of the relevant dictionary productions of the recent past have relied on digital data and methods, there is little consensus on formats and standards. The Institute for Corpus Linguistics and Text Technology (ICLTT) of the Austrian Academy of Sciences has been conducting a number of varied lexicographic projects, both digitising print dictionaries and working on the creation of genuinely digital lexicographic data. This data was designed to serve varying purposes: machine-readability was only one. A second goal was interoperability with digital NLP tools. To achieve this end, a uniform encoding system applicable across all the projects was developed. The paper describes the constraints imposed on the content models of the various elements of the TEI dictionary module and provides arguments in favour of TEI P5 as an encoding system not only being used to represent digitised print dictionaries but also for NLP purposes.
We present evidence for the analysis of the vowels in English <say> and <so> as biphonemic diphthongs /ɛi/ and /əu/, based on neutralization patterns, regular alternations, and foot structure. /ɛi/ and /əu/ are hence structurally on a par with the so called “true diphthongs” /ɑi/, /ɐu/, /ɔi/, but also share prosodic organization with the monophthongs /i/ and /u/. The phonological evidence is supported by dynamic measurements based on the American English TIMIT database.
Calculations of F2-slopes proved to be especially suited to distinguish the relevant groups in accordance with their phonologically motivated prosodic organizations.
In this paper, we analyze a dramatically aggravated conflict interaction taking place in the course of an association’s meeting in an urban community center. The interaction can be seen as the culmination point of a social conflict developing and increasing over a period of years. In this conflict, one of the crucial points of the sociocultural development in the city under study is to be seen in an exemplary way. Our analysis started with the question, why this conflict is unsolvable although the interest divergences of the opposing parties are not irreconcilable. Our analysis shows that the protagonists practice different communicative social styles. These stylistic differences however, are not the cause for misunderstandings, but the protagonists use stylistic differences and different cultural orientations as a resource for political action. Thereby a process of increasing hardening of perspective divergence emerges together with an interaction modality of drama and of the fundamental grounding of divergent views. Theoretically we are concerned with the explication of a sociolinguistic theory which includes as constitutive components the concepts of communicative social style, of perspectivation and of interaction modality. We want to show, that the analyzed type of sociocultural conflict can be explained by virtue of considering the interplay of features on these three levels.
Based on German data from history-taking in doctor-patient interaction, the paper shows that the three basic syntactic types of questions (questions fronted by a question-word (w-questions), verb-first (V1) questions, and declarative questions) provide different opportunities for displaying understanding in medical interaction. Each syntactic questionformat is predominantly used in a different stage of topical sequences in history taking: w-questions presuppose less knowledge and are thus used to open up topical sequences; declarative questions are used to check already achieved understandings and to close topical sequences. Still, the expected scope of answers to yes/no-questions and to declarative questions is less restricted than previously thought. The paper focuses in detail on the doctors’ use of formulations as declarative questions, which are designed to make patients elaborate on already established topics, giving more details or accounting for a confirmation. Formulations often involve a shift to psychological aspects of the illness. Although patients confirm doctors’ empathetic formulations, they, however, regularly do not align with this shift, returning to the description of symptoms and to biomedical accounts instead. The study shows how displays of understanding are responded to not only in terms of correctness, but also (and more importantly) in terms of their relevance for further action.
Co-development of action, conceptualization and social interaction mutually scaffold and support each other within a virtuous feedback cycle in the development of human language in children. Within this framework, the purpose of this article is to bring together diverse but complementary accounts of research methods that jointly contribute to our understanding of cognitive development and in particular, language acquisition in robots. Thus, we include research pertaining to developmental robotics, cognitive science, psychology, linguistics and neuroscience, as well as practical computer science and engineering. The different studies are not at this stage all connected into a cohesive whole; rather, they are presented to illuminate the need for multiple different approaches that complement each other in the pursuit of understanding cognitive development in robots. Extensive experiments involving the humanoid robot iCub are reported, while human learning relevant to developmental robotics has also contributed useful results.
Disparate approaches are brought together via common underlying design principles. Without claiming to model human language acquisition directly, we are nonetheless inspired by analogous development in humans and consequently, our investigations include the parallel co-development of action, conceptualization and social interaction. Though these different approaches need to ultimately be integrated into a coherent, unified body of knowledge, progress is currently also being made by pursuing individual methods.
This paper investigates emergent pseudo-coordination in spoken German. In a corpus-based study, seven verbs in the first conjunct are analyzed regarding the degree of semantic bleaching and the development of subjective or aspectual meaning components. Moreover, it is shown that each verb shows distinct tendencies for co-ocurrences, especially with deictic adverbs in the first conjunct and with specific verbs and verb classes in the second conjunct. It is argued that pseudo-coordination is originally motivated by the need for ‘chunking’ in unplanned speech and that it is still prominently used in this function in German, in contrast to languages in which pseudo-coordination is grammaticalized further.
In this paper, we present a GOLD standard of part-of-speech tagged transcripts of spoken German. The GOLD standard data consists of four annotation layers – transcription (modified orthography), normalization (standard orthography), lemmatization and POS tags – all of which have undergone careful manual quality control. It comes with guidelines for the manual POS annotation of transcripts of German spoken data and an extended version of the STTS (Stuttgart Tübingen Tagset) which accounts for phenomena typically found in spontaneous spoken German. The GOLD standard was developed on the basis of the Research and Teaching Corpus of Spoken German, FOLK, and is, to our knowledge, the first such dataset based on a wide variety of spontaneous and authentic interaction types. It can be used as a basis for further development of language technology and corpus linguistic applications for German spoken language.
This paper aims at showing how quantitative corpus linguistic analysis can inform qualitative analysis of digital media discourse with respect to the mediality of language in use. Using the example of protest discourse in Twitter, in the field of anti-Islamic ‘Pegida’ demonstrations, a three-step method of collecting, reducing and interpreting salient data is proposed. Each step is aligned with operative medial features of the microblog: hashtags, retweets and @-interactions. The exemplary analysis reveals the importance of discussions of attendance numbers in protest discourse and the asymmetry between administrative (i.e. the police) and non-administrative discourse agents. Furthermore, it exemplifies how frequency analysis and sequence analysis can be combined for research in media linguistics.
In her overview, Margret Selting makes the case for the claim that dealing with authentic conversation necessarily lies at the heart of an interactionallinguistic approach to prosody (see Selting this volume, Section 3.3). However, collecting and transcribing corpora of authentic interaction is a time-consuming enterprise. This fact often severely restricts what the individual researcher is able to do in terms of analysis within the scope of his or her resources. Still, for dealing with many of the desiderata Margret Selting points out in Section 5 of her extensive overview, the use of larger corpora seems to be required. In this commenting paper, I want to argue that future progress in research on prosody in interaction will essentially rest on the availability and use of large public corpora. After reviewing arguments for and against the use of public corpora, I will discuss some upshots regarding corpus design and issues of transcription of public corpora.
The research project “German Today” aims to determine the amount of regional variation in (near-)standard German spoken by young and older educated adults and to identify and locate regional features. To this end, we compile an areally extensive corpus of read and spontaneous German speech. Secondary school students and 50-to-60-year-old locals are recorded in 160 cities throughout the German speaking area of Europe. All participants read a number of short texts and a word list, name pictures, translate words and sentences from English, answer questions in a sociobiographic interview, and take part in a map task experiment. The resulting corpus comprises over 1000 hours of speech, which is transcribed orthographically. Automatically derived broad phonetic transcriptions, selective manual narrow phonetic transcriptions, and variationalist annotations are added. Focussing on phonetic variation we aim to show to what extent national or regional standards exist in spoken German. Furthermore, the linguistic variation due to different contextual styles (read vs. spontaneous speech) shall be analysed. Finally, the corpus enables us to investigate whether linguistic change has occurred in spoken (near-)standard German.
Conversation Analysis (CA) and Discursive Psychology (DP) reject the view that assumptions
about cognitive processes should be used to account for discursive phenomena. Instead, cognitive
issues are respecified as discursive phenomena. Discursive psychologists do this by
studying discursive practices of talking about mental phenomena and using mental predicates.
This approach is exemplified by a study of the use of constructions with German verstehen
(‘to understand’) in conversation. Some conversation analysts take another approach,
namely, inquiring into how participants display mental states in talk-in-interaction. This is
exemplified by a study of how grammatical constructions are used to display different types
of inferences drawn from a partner’s prior turn. It will be argued that the constructivist, antiessentialist
stance which CA and DP take with regard to cognition is a prosperous line of
research, which has much in its favor from a methodological point of view. However, it
can be shown that tacit assumptions about cognitive processes are still inevitable when
doing CA and DP. As a conclusion, the paper pleads for an enhanced awareness of how cognitive
processes come into play when analysing talk-in-interaction and it advocates the integration
of a more explicit cognitive perspective into research on talk-in-interaction.
This article advocates an understanding of ‘positioning’ as a key to the analysis of identities in interaction within the methodological framework of conversation analysis. Building on research by Bamberg, Georgakopoulou and others, a performative, interaction-based approach to positioning is outlined and compared to membership categorization analysis. An interactional episode involving mock stories to reveal and reproach an inadequate identity-claim of a co-participant is analysed both in terms of practices of membership categorization and positioning. It is concluded that membership categorization is a core element of positioning. Still, positioning goes beyond membership categorization in a) revealing biographical dimensions accomplished by narration and b) by uncovering implicit performative claims of identity, which are not established by categorization or description.
In this paper, I argue that the main questions that arise in the process of making a dictionary of political metaphors - that of identifying live conceptual metaphors in a corpus of text - may be solved on the basis of a pragmatic approach, taking into account the reflections in a text of cognitive processes in the minds of its author and its reader. Certainly, this goal cannot be attained without a further fine-grained semantic analysis o f presumably metaphoric expressions in their linguistic and cultural context.
Following a welcome in Lithuanian and English to the guests and members on the occa- sion of the 10"’ anniversary of EFNIL, the history of this European language Organization is sketched. A brief survey of the sociolinguistic themes treated at previous Conferences and the state of the inajor projects is given, followed by an introduction (in German) to the general topic of the present Conference. The importance that translation and interpretation have for European language diversity and the individual national languages beside foreign language education of all Europeans is being stressed.
The present study introduces articulography, the measurement of the position of tongue and lips during speech, as a promising method to the study of dialect variation. By using generalized additive modeling to analyze articulatory trajectories, we are able to reliably detect aggregate group differences, while simultaneously taking into account the individual variation across dozens of speakers. Our results on the basis of Dutch dialect data show clear differences between the southern and the northern dialect with respect to tongue position, with a more frontal tongue position in the dialect from Ubbergen (in the southern half of the Netherlands) than in the dialect of Ter Apel (in the northern half of the Netherlands). Thus articulography appears to be a suitable tool to investigate structural differences in pronunciation at the dialect level.
This paper explores on the basis of empirical research, how patterns of interaction and argumentation in political discourse on Twitter evolve as translocal communities in the creative shape of “joint digital storytelling”. Joint storytelling embraces coordinated activities by multiple actors focusing on a shared topic. By adding personal information and evaluation, participants construct an open narrative format, which can be inviting and inspiring for others, who then join in with their own narratives. This model will be exemplified by analyzing a large amount of tweets (107,000) collected during a political conflict between proponents and adversaries of a local traffic project in Germany. Analysis is based on (1) the textual level, (2) the operative level (hashtags, @- and RT-Symbol, hyperlinks etc.) and (3) the visual level of storytelling (embedded photos, videos). Results show a new way of creating translocal online communities and political deliberation.
Europe is a continent of many languages. We all know that, but normally when we think about this fact, we focus on national languages, the type of language that shapes our political and our linguistic geography. But as natural as it may seem today, the idea of a language closely being interrelated with one's identity does not have a very long tradition. In fact it is only since the late 18th century that we think there is some type of intimate connection between the language spoken and the identity of a person as belonging to a nation. And even if the stabilization of European nation states was closely connected with this type of reasoning, European language communities differ considerably in their way of dealing with natural variation within their national language. For some of them, it is only the standardized national language that is relevant in this respect; for others, a certain amount of variation is a central part of their linguistic identity.
The paper introduces a theory of Lexical Event Structures as a means to represent the meaning of verbs. The theory is guided by the assumption that verbs refer to events that are internally structured in the sense that they consist of several subevents and states. The temporal properties and relations of these have to be specified. The occurrence of subevents is either implied or presupposed by the verb, and event participants are related to some, but not necessarily all subevents by semantic relations.
This is a revised and translated version of my article "Die doppelte Wende - Zur Verbindung von Sprache, Sprachwissenschaft und zeitgebundener politischer Bewertung am Beispiel deutsch-deutscher Sprachdifferenzierung" which appeared in Politische Semantik - Bedeutungsanalytische und sprachkritische Beiträge zur politischen Sprachverwendung, ed. Josef Klein (Opladen: Westdeutscher Verlag, 1989), pp. 297-326. I am indepted to Colin Good, Norwich, England, for having translated the text into English.
The metadata management system for speech corpora “memasysco” has been developed at the Institut für Deutsche Sprache (IDS) and is applied for the first time to document the speech corpus “German Today”. memasysco is based on a data model for the documentation of speech corpora and contains two generic XML schemas that drive data capture, XML native database storage, dynamic publishing, and information retrieval. The development of memasysco’s information architecture was mainly based on the ISLE MetaData Initiative (IMDI) guidelines for publishing metadata of linguistic resources. However, since we also have to support the corpus management process in research projects at the IDS, we need a finer atomic granularity for some documentation components as well as more restrictive categories to ensure data integrity. The XML metadata of different speech corpus projects are centrally validated and natively stored in an Oracle XML database. The extension of the system to the management of annotations of audio and video signals (e.g. orthographic and phonetic transcriptions) is planned for the near future.
This paper deals with the creation of the first morphological treebank for German by merging two pre-existing linguistic databases. The first of these is the linguistic database CELEX which is a standard resource for German morphology. We build on its refurbished and modernized version. The second resource is GermaNet, a lexical-semantic network which also provides partial markup for compounds. We describe the state of the art and the essential characteristics of both databases and our latest revisions. As the merging involves two data sources with distinct annotation schemes, the derivation of the morphological trees for the unified resource is not trivial. We discuss how we overcome problems with the data and format, in particular how we deal with overlaps and complementary scopes. The resulting database comprises about 100,000 trees whose format can be chosen according to the requirements of the application at hand. In our discussion, we show some future directions for morphological treebanks. The Perl script for the generation of the data from the sources will be made publicly available on our website.
We present an XML-based metadata standard for the documentation of speech and multimedia corpora that was developed at the Institute for German Language (IDS) in Mannheim, Germany. The IDS is one of the major institutions providing German speech and language corpora to researchers. These corpora stem from many different sources and were previously documented in a rather heterogeneous fashion using a variety of data models and formats. In order to unify the documentation for existing and future corpora, the IDS- internal Archive for Spoken German collaborated with several projects and developed a set of standardised XML metadata schemas. These XML schemas build on existing internal and external documentation schemas (such as IMDI) and take into account the workflow of speech corpus production. In order to minimise redundancy, separate schemas were designed for projects, speakers, recording sessions, and entire corpora. The resulting schemas are tested in ongoing speech and multi-media projects at the IDS and are regularly revised. They are accompanied by element definitions, guidelines, and examples. In addition, a mapping to IMDI will be provided.
We present the annotation of information structure in the MULI project. To learn more about the information structuring means in prosody, syntax and discourse, theory- independent features were defined for each level. We describe the features and illustrate them on an example sentence. To investigate the interplay of features, the representation has to allow for inspecting all three layers at the same time. This is realised by a stand-off XML mark-up with the word as the basic unit. The theory-neutral XML stand-off annotation allows integrating this resource with other linguistic resources such as the Tiger Treebank for German or the Penn treebank for English.
This special issue of the Journal of Pragmatics has its origins in the International Conference on Conversation Analysis 10 (ICCA10), which took place in Mannheim (Germany) in July 2010. More than 650 scholars attended the conference, whose theme was ‘‘multimodal interaction’’. This volume includes papers based on the four plenary talks given at ICCA10 and four additional contributions related to the conference theme.
High word frequency and neighborhood density contribute to the accuracy and speed of word production in English adults (e.g., Vitevitch & Sommers 2003), and characterize early words in child English (e.g., Storkel 2004). The present study investigated a speech corpus of child German (ages 2;00-3;00) to further the understanding of the influence of frequency and density on production. Results for four children suggest that, contrary to English, words produced early are not from denser neighborhoods in an adult lexicon than later words. As in English, frequent words are produced before less frequent words. Implications on theory and methodology are discussed.
In order to determine priorities for the improvement of timing in synthetic speech this study looks at the role of segmental duration prediction and the role of phonological symbolic representation in listeners' preferences. In perception experiments using German speech synthesis, two standard duration models (Klatt rules and CART) were tested. The input to these models consisted of symbolic strings which were either derived from a database or a text-to-speech system. Results of the perception experiments show that different duration models can only be distinguished when the symbolic string is appropriate. Considering the relative importance of the symbolic representation, "post-lexical" segmental rules were investigated with the outcome that listeners differ in their preferences regarding the degree of segmental reduction. As a conclusion, before fine-tuning the duration prediction, it is important to calculate an appropriate phonological symbolic representation in order to improve timing in synthetic speech.
Playing with the voice of the other : stylized kanaksprak in conversations among German adolescents
(2007)
Rejecting the validity of inferred attributions of incompetence in German talk-in-interaction
(2024)
This paper deals with pragmatic inference from the perspective of Conversation Analysis. In particular, we examine a specific variety of inferences - the attribution of incompetence which Self constructs on the basis of Other's prior action, hearable as positioning Self as incompetent (e.g., instructions, offers of assistance, advice); this attribution of incompetence concerns Self's execution of some practical task. This inference is indexed in Self's response, which highlights Self's expertise, or competence concerning the task at hand. We focus on two recurrent types of such responses in our data: (i) accounting for competence through formulations of prior experience with carrying out a practical action and (ii) explicit claims of competence for accomplishing this action. We analyze the interactional environments in which these responses occur, the ways in which the two practices index Self's understanding of being positioned as incompetent and the interactional work they do. Finally, we discuss how through rejecting and inferred attribution of incompetence, Self implicitly seeks to restore their face and defend their autonomy as an agent, yet, without entering an explicit identity-negotiation. Findings rest on the analysis of 20 cases found in video-recordings of naturally occurring talk-in-interaction in German from the corpus FOLK.
This paper is concerned with relative constructions in non-standard varieties of European languages, which will be analyzed on the basis of three typological parameters (word order, relative element, syntactic role of the relativized item). The validity of claims raised in studies on the areal distribution of relative constructions in Europe will be checked against the results of the analysis, so as to ascertain whether they still hold when non-standard varieties are examined.
This paper contributes to the growing body of knowledge on current listeners' responses in talk-in-interaction. In particular, it complements earlier findings on double sayings of German JA by describing some additional prosodic-phonetic parameters and a visual feature of its realization in institutional and semi-private interaction (doctor-patient interaction, Big Brother, TV talk shows). These include pitch contour, pitch range and phonetic ending, on the one hand, and nodding on the other. The paper shows that JAJA is a truly multimodal phenomenon, with the individual features accomplishing interactional functions across sequence-organizational habitats, including re)claiming epistemic priority in an aside, making continuation relevant, agreeing/ acknowledging with reservation and aligning with the continuation of a sequence. Lack of nodding is suggested to have situational as well as misalignment reasons. On the basis of its observations, the paper also raises the question whether it is the applicability of response token variants across action and sequence types which makes them memorizable despite their variability.
As can be shown for English data, the assimilation of the alveolar stop can result from an increased gestural overlap of the following oral closure gesture. Our experiment with German synthetic speech showed similar results. Further, it suggests that it is neccessary to complete the gestural specification of the glottal state. A voiced stop should be represented not only by an oral gesture, but by a glottal one as well.
The paper explores how verbs like helfen "help" should be treated within event semantics. These verbs allow both agentive NP-subjects and sentential CP-subjects. Their behavior with respect to adverbial modification reveals that in their agentive variant these verbs refer to events, while in their sentential variant they refer to states. The meaning that sentential helfen conveys is that the beneficiary is in a good disposition and that this state is brought about by what is expressed by the sentential subject. This involves a kind of subjective value statement about what is good for the beneficiary and what is not. The relation of "bringing about" involved here is not mainly one of causal dependence - lacking the typical denseness of causal chains - but one that involves supervenience. Supervenience, a notion widely used in moral theory and philosophy of the mind, allows accounting for the dependence of the rather subjective nature of the resultant state of helfen on particular events which occur in the world. The agentive variant of helfen is derived by embedding the meaning of sentential helfen into an event description.
The "imperfective-paradox" paradox and other problems with the semantics of the progressive aspect
(2000)
This paper is about the meaning of the progressive aspect, of which it has been notoriously difficult to give a satisfying account. 1 A number of intriguing properties of its meaning were first brought out in formal semantic treatments. An event semantics approach to the progressive that integrates concepts of nonnality and perspective as well as adequate lexical representations seems to be particularly promising. In section 1 I will present several problems connected with the semantics of the progressive that are crucial for shaping its truth conditions. Several solutions to these problems that have been suggested in the literature will be discussed. 2 In section 2 I will sketch a preliminary account of the meaning of the progressive aspect. In section 2.1 the basic components that underlie the truth conditions of the progressive will be described. In section 2.2 I will present underlying lexical assumptions and the truth conditions for the progressive. Finally, in section 2.3, I will evaluate the proposal by revisiting the problems discussed.
The naturalness of synthetic speech depends strongly on the prediction of appropriate prosody. For the present study the original annotation of the German speech database “Kiel Corpus of Read Speech” was extended automatically with syntactic features, word frequency, and syllable boundaries. Several classification and regression trees for predicting symbolic prosody features, postlexical phonological processes, duration, and F0 were trained on this database. The perceptual evaluation showed that the overall perceptual quality of the German text-to-speech system MARY can be significantly improved by training all models that contribute to prosody prediction on the same database. Furthermore, it showed that the error introduced by symbolic prosody prediction perceptually equals the error produced by a direct method that does not exploit any symbolic prosody features.
Discourse analysis in general, and media discourse analysis in particular, are currently attracting increased attention from linguists. This interest can be seen in the tendency to apply the term ‘discourse’ to various sciences and academic disciplines. It is possible to trace its dispersion both horizontally, i.e. in different sciences, and vertically, i.e. on various linguistic levels. Furthermore, the majority of interpretations of the term ‘discourse’ appearing in the works of modern scholars have arisen as a result of the interdisciplinary nature of language study within the cognitive paradigm in linguistics.
The current paper presents a corpus containing 35 dialogues of spontaneously spoken southern German, including half an hour of articulography for 13 of the speakers. Speakers were seated in separate recording chambers, mimicking a telephone call, and recorded on individual audio channels. The corpus provides manually corrected word boundaries and automatically aligned segment boundaries. Annotations are provided in the Praat format. In addition to audio recordings, speakers filled out a detailed questionnaire, assessing among others their audio-visual consumption habits.
One of the specific historical and cultural characteristics of the Russian political discourse is its orientation to precedents. It is considered correct to follow the behaviouristic models shown by one of the “heroes” (Peter I, Lenin, Stalin, etc.), to reproduce standard texts, and to compare the present situations with past situations (The Time of Troubles, Weimar Republic, NEP “New Economic Policy” (1921-1928), etc.). One of the peculiarities of the present time in Russia is the deep conflict between different social groups orientated to different precedents. Each group has its own variant of the national myth using the same means of the language for actualisation of this myth. Therefore, it is very important to analyse changes in the national cognitive foundation. Precedential phenomena are the central components of this foundation.
Lexical-semantic theories often suffer from the imprecision of the concepts they employ in their representations. This leads to a considerable decrease in empirical strength by inviting circular argumentation. A demonstration of how to go about overcoming such shortcomings will be carried out, using the lexical semantic concept of "punctuality" as an example. Firstly, I will argue that the distinction between punctuality and durativity plays a crucial role for the explanation of a wide range of syntactic and semantic phenomena. Secondly, I will discuss methodological issues involved in arriving at a more precise definition of punctuality and, finally, the notion of "punctuality" will be given an interpretation on the basis of extensive consultation of research on cognitive time concepts.
The goal of the MULI (MUltiLingual Information structure) project is to empirically analyse information structure in German and English newspaper texts. In contrast to other projects in which information structure is annotated and investigated (e.g. in the Prague Dependency Treebank, which mirrors the basic information about the topic-focus articulation of the sentence), we do not annotate theory-biased categories like topic-focus or theme-rheme. Trying to be as theory-independent as possible, we annotate those features which are relevant to information structure and on the basis of which typical patterns, co-occurrences or correlations can be determined. We distinguish between three annotation levels: syntax, discourse and prosody. The data is based on the TIGER Corpus for German and the Penn Treebank for English, since the existing information on part-of-speech and syntactic structure can be re-used for our purposes. The actual annotation of an English example sequence illustrates our choice of categories on each level. Their combination offers the possibility to investigate how information structure is realised and can be interpreted.
This paper presents the concept of the "participant perspective" as an approach to the study of spoken language. It discusses three aspects of this concept and shows that they can offer helpful tools in spoken language research. Employing the participant perspective provides us with an alternative to many of the approaches currently in use in the study of spoken language in that it favours small-scale, qualitative research that aims to uncover categories relevant for the participants. Its results can usefully complement large-scale studies of phenomena on all linguistic dimensions of talk.
In spring 2002, we celebrated the inauguration of the first German-Russian-Jewish kindergarten in Berlin. Nowadays, there are seven bilingual German-Russian kindergartens with 4 60 places and 78 bilingual kindergartens with other combinations of languages [SENBWF]. Maybe it is not enough, taking into account the large proportion o f immigrants in the population of Berlin1. And yet, much progress has been achieved, endorsing the fact that German society has begun to change its attitude towards other languages on its territory. The initial request for German monolingualism first changed into societal tolerance of multilingualism and eventually to the recognition o f the value of multilingualism. This process is a very slow one, and it is not yet complete. In my article, I would like to look at the development in the last few years of the political framework that has made possible, on the one hand, the opening of bilingual kindergartens in Berlin, and on the other hand, to consider what has hampered this process until now. I would like to emphasise three most important political spheres: linguistic, educational and integrational.
In this study we investigate the intonational characteristics of the four utterance types statement, wh-question, yes/no-question and declarative question. Readings of two German scripted dialogues were examined to ascertain characteristic features of the F0 contour for each utterance type. Final boundary tone, nuclear pitch accent, F0 offset, F0 onset, F0 range, and the slopes of a topline and a bottomline were determined for each utterance and compared for the four utterance types. Results show that for an average speaker, the final boundary tone, the F0 range, and the slope of the topline can be used to distinguish between the four utterance types. However, speakers may deviate from this pattern and exploit other intonational means to distinguish certain utterance types or choose not to mark a syntactic difference at all.
This paper is about the meaning of the progressive aspect, which has been notoriously difficult to give a satisfying account of. A number of intriguing properties of its meaning were first brought out in formal semantic treatments. An event semantics approach to the progressive which integrates concepts of normality and perspective as well as adequate lexical representations seems to be particularly promising. In section 2 I will present several problems connected with the semantics of the progressive that are crucial for shaping its truth conditions. Several solutions to these problems that have been suggested in the literature will be discussed. In section 3 I will sketch a preliminary account of the meaning of the progressive aspect. In section 3.1 the basic components that underlie the truth conditions of the progressive will be described. In section 3.2 I will present underlying lexical assumptions and the truth conditions for the progressive. Finally, in section 4, I will evaluate the proposal by revisiting the problems discussed.
Both for psychology and linguistics, emotion concepts are a continuing challenge for analysis in several respects. In this contribution, we take up the language of emotion as an object of study from several angles. First, we consider how frame semantic analyses of this domain by the FrameNet project have been developing over time, due to theory-internal as well as application-oriented goals, towards ever more fine-grained distinctions and greater within-frame consistency. Second, we compare how FrameNet’s linguistically oriented analysis of lexical items in the emotion domain compares to the analysis by domain experts of the experiences that give rise (directly or indirectly) to the lexical items. And finally, we consider to what extent frame semantic analysis can capture phenomena such as connotation and inference about attitudes, which are important in the field of sentiment analysis and opinion mining, even if they do not involve the direct evocation of emotion.
The aim of this paper is to highlight the actual need for corpora that have been annotated based on acoustic information. The acoustic information should be coded in features or properties and is needed to inform further processing systems, i.e. to present a basis for a speech recognition system using linguistic information. Feature annotation of existing corpora in combination with segmental annotation can provide a powerful training material for speech recognition systems, but will as well challenge the further processing of features to segments and syllables. We present here the theoretical preliminaries for our multilingual feature extraction system, that we are currently working on.
While written corpora can be exploited without any linguistic annotations, speech corpora need at least a basic transcription to be of any use for linguistic research. The basic annotation of speech data usually consists of time-aligned orthographic transcriptions. To answer phonetic or phonological research questions, phonetic transcriptions are needed as well. However, manual annotation is very time-consuming and requires considerable skill and near-native competence. Therefore it can take years of speech corpus compilation and annotation before any analyses can be carried out. In this paper, approaches that address the transcription bottleneck of speech corpus exploitation are presented and discussed, including crowdsourcing the orthographic transcription, automatic phonetic alignment, and query-driven annotation. Currently, query-driven annotation and automatic phonetic alignment are being combined and applied in two speech research projects at the Institut für Deutsche Sprache (IDS), whereas crowdsourcing the orthographic transcription still awaits implementation.
Based on German speaking data from various activity types, the range of multimodal resources used to construct turn-beginnings is reviewed. It is claimed that participants in talk-in-interaction need to deal with four tasks in order to construct a turn which precisely fits the interactional moment of its production:
1. Achieve joint orientation: The accomplishment of the socio-spatial prerequisites necessary for producing a turn which is to become part of the participants’ common ground.
2. Display uptake: Next speaker needs to display his/her understanding of the interaction so far as the backdrop on which the production of the upcoming turn is based.
3. Deal with projections from prior talk: The speaker has to deal with projections which have been established by (the) previous turn(s) with respect to the upcoming turn.
4. Project properties of turn-in-progress: The speaker needs to orient the recipient to properties of the turn s/he is about to produce.
Turn-design thus can be seen to be informed by tasks related to the multimodal, embodied, and interactive contingencies of online-construction of turns. The four tasks are ordered in terms of prior tasks providing the prerequisite for accomplishing a later task.
Semantic theories based on predicate-argument structures have always acknowledged that lexical information associated with verbs is the basic source for the rudimentary semantic structure of sentences. The central role of verbs in sentence structure has become a major insight of modern syntactic theories since the lexical turn in linguistics, too. As a result of this development there has been an increasing interest in theories on the lexical representation of verbs. This paper will briefly review prevailing theories on verb semantics (section 1), showing that they can capture only a part of the wide range of syntactic and semantic phenomena dependent on verb meaning. For several of these phenomena (section 2) it will turn out that a theory based on highly structured events is more suitable for representing verb meaning. This theory is based on the idea that verbs refer to events that consist of several subevents which are temporally related, classified according to their duration, and whose event participants are connected to some but not necessarily all subevents by semantic relations (section 3).
The following analysis explores the nature of everyday activities of people in economic leadership positions. It inquires as to the institutions and people they are in contact with, and the way they communicate with them. First, I will present existing studies in this field and the ethnographic procedure of this study. I will then describe the communicative activities of high-level personnel and their communication networks based on observations. These are then linked to their communicative tasks and types of interaction. Finally, I will discuss some characteristics of the communicative style.
Whether verbs have to be marked as punctual vs. durative has been a controversial issue from the very beginnings of research on aktionsarten in the last century right on up to modern theories of aspectual classes and aspect composition. Debates about the linguistic necessity of this distinction have often been accompanied by the question of what it means for a verb to be temporally punctual. In this paper I will, firstly, sketch the history of research on the punctual-durative distinction and present several linguistic arguments in its favor. Secondly, I will show how this distinction is captured in an eventstructure- based approach to lexical semantics. Thirdly, I will discuss the extent to which a precise definition of the notions used in lexical
representations helps avoid circular argumentation in lexical semantics. Finally, I will demonstrate how this can be done for the notion of ‘punctuality’ by clarifying the logical type of this predicate and relating it to central cognitive time concepts.