Refine
Year of publication
Document Type
- Part of a Book (171)
- Conference Proceeding (94)
- Article (21)
- Book (12)
- Working Paper (3)
- Review (2)
Language
- English (303) (remove)
Is part of the Bibliography
- no (303) (remove)
Keywords
- Korpus <Linguistik> (75)
- Deutsch (64)
- Annotation (22)
- Computerlinguistik (19)
- Englisch (18)
- Sprachpolitik (17)
- Mehrsprachigkeit (15)
- Semantik (15)
- Wörterbuch (15)
- Lexikografie (14)
Publicationstate
- Veröffentlichungsversion (200)
- Zweitveröffentlichung (55)
- Postprint (52)
Reviewstate
- (Verlags)-Lektorat (303) (remove)
Publisher
- de Gruyter (39)
- Benjamins (30)
- Springer (18)
- European Language Resources Association (ELRA) (13)
- Lang (13)
- Narr (12)
- Palgrave Macmillan (8)
- Oxford University Press (6)
- CSLI Publications (5)
- Association for Computational Linguistics (4)
Since Lerner coined the notion of delayed completion in 1989, this recurrent social practice of continuing one’s speaking turn while disregarding an intermediate co-participant’s utterance has not been investigated with regard to embodied displays and actions. A sequential approach to videotaped mundane conversations in German will explain the occurrence and use of delayed completions. First, especially in multi-party and multi-activity settings, delayed completions can result from reduced monitoring and coordinating activities. Second, recipients can use intra-turn response slots for more extended responsive actions than the current speaker initially projected, leading to delayed completion sequences. Finally, delayed completions are used for blocking possibly misaligned co-participant actions. The investigation of visible action illustrates that delayed completions are a basic practice for retrospectively managing co-participant response slots.
The chapter on formats and models for lexicons deals with different available data formats of lexical resources. It elaborates on their structure and possible uses. Motivated by the restrictions in merging different lexical resources based on widely spread formalisms and international standards, a formal lexicon model for lexical resources is developed which is related to graph structures in annotations. For lexicons this model is termed the Lexicon Graph. Within this model the concepts of lexicon entries and lexical structures frequently described in the literature are formally defined and examples are given. The article addresses the problem of ambiguity in those formal terms. An implementation based on XML and XML technology such as XQuery for the defined structures is given. The relation to international standards is included as well.
In this chapter, we discuss steps toward extending CMDI’s semantic interoperability beyond the Social Sciences and Humanities: We stress the need for an initial data curation step, in part supported by a relation registry that helps impose some structure on CMDI vocabulary; we describe the use of authority file information and other controlled vocabulary to help connecting CMDI-based metadata to existing Linked Data; we show how significant parts of CMDI-based metadata can be converted to bibliographic metadata standards and hence entered into library catalogs; and finally we describe first steps to convert CMDI-based metadata to RDF. The initial grassroots approach of CMDI (meaning that anybody can define metadata descriptors and components) mirrors the AAA slogan of the Semantic Web (“Anyone can say Anything about Any topic”). Ironically, this makes it hard to fully link CMDI-based metadata to other Semantic Web datasets. This paper discusses the challenges of this enterprise.
Concurrent standardization as a necessity: The genesis of the new official orthographic guidelines
(2009)
The new official orthographic guidelines were brought into force by the official state authorities on August 1st, 1998 and its principle goals were a standardized representation of the guidelines and a «gentle simplification in respect of content». This regulation was not supported by the public and in fact it was the starting point for a struggle for conceptual solutions and a quest for the achievement of' a consensus between different possible norms. Since orthography is an officially codified standard taking up a prominent position among linguistic standards, it is of particular socio-political importance. It was the foremost task of the Council for German Orthography (Rat für deutsche Rechtschreibung), instituted in December 2004, to elaborate a compromise in order to bring the «Orthographical war» (Die Zeit) to an end, which was led enthusiastically for more than a decade. - The concern of this article is to classify historically the agreement reached in 2006. Against this background, it can be stated that official guidelines will only be accepted, if they are based upon the usage in writing and if they take into account the interests of the reader. Both principles are characterizing the proposal made by the Council for German Orthography. An outlook on the Council's activities concerning orthographic standardization expected in the future will conclude this article.
EFNIL, the European Federation of National Institutions for Language, promotes the standard languages and the linguistic diversity of the European countries as an essential characteristic of their cultural diversity and wealth. The 17th annual conference of EFNIL in Tallinn dealt with the relation between language and economy.
• Language politics often have economic intentions, the language use of the individual is embedded in economic conditions, languages seem to differ in their economic value. In recent years, economists and sociolinguists have developed models of describing these interdependencies.
• The interaction in multilingual settings needs professional handling. There are traditional instances such as language teaching or translation and new professional fields of the digital age such as multilingual databases. Lots of economic needs and opportunities appear in this field.
• Digitization and societal diversity are two elements leading to more successful interaction, assisted by the use of automatic everyday translation, the development of plain language etc.
This volume presents an extensive overview of the interplay of language and economy.
This chapter focuses on the contributions of German scholars to two of the three main research questions that have defined EU studies. Leaving aside the debate on the drivers of European integration, i.e. European integration theory, we will discuss the «governance turn» Fritz Scharpf, Beate Kohler-Koch, Arthur Benz, Ingeborg Tömmel and others promoted in studying EU institutions as well as the more policy-oriented approaches by Adrienne Héritier and again Fritz Scharpf and their students. We will then address the ever-growing literature on Europeanization on how EU policies, institutions and political processes have been affecting the domestic structures of member states, membership candidates, as well as neighborhood and third countries. In this context, German scholars also contributed to EU studies in what could be coined in methodological rather than substantial terms. Whereas Thomas König, Gerald Schneider, and others promoted the application of quantitative approaches, scientists like Bernhard Ebbinghaus and Markus Haverland dealt with general questions on research designs like case selection and causal inference. Finally, we will also discuss German contributions to diffusion research. The European Union as a most likely case for the diffusion of policies has attracted considerable attention by scholars dealing with the question of when and how policies spread across time and space. So it comes as no surprise that EU studies as well as diffusion research mutually benefitted from each other. In this regard, German scientists like Katharina Holzinger, Christoph Knill, Tanja Börzel, Thomas Plümper, Thomas Risse and others played a prominent role, too.
Dieser Beitrag gibt einen Überblick über CoDII, die Collection of Distributionally Idiosyncratic Items. CoDII ist eine elektronische Sammlung verschiedener Untergruppen lexikalischer Elemente, die sich durch idiosynkratische Distribution auszeichnen. Das bedeutet, dass sich die Verteilung dieser Lexeme im Text nicht alleine aufgrund ihrer syntaktischen Kategorie Vorhersagen lässt. Die Methoden, die in der Entwicklung von CoDII angewandt werden, greifen über traditionelle Fachgrenzen hinaus und umfassen Korpuslinguistik, Computerlinguistik, Phraseologie und theoretische Sprachwissenschaft. Ein wichtiger Schwerpunkt unserer Diskussion liegt auf der Darstellung, inwiefern die in CoDII gesammelten, annotierten und unter anderem mit Suchwerkzeugen abfragbaren Daten dazu beitragen können, die linguistische Theoriebildung durch die Bereitstellung sorgfältig aufbereiteter Datensammlungen bei der Überprüfung ihrer Datengrundlage zu unterstützen.
Syntactic theory has tended to vacillate between implausible methodological extremes. Some linguists hold that our theories are accountable solely for the corpus of attested utterances; others assume our subject matter is unobservable intuitive feelings about sentences. Both extremes should be rejected. The subject matter of syntax is neither past utterance production nor the functioning of inaccessible mental machinery; it is normative - a system of tacitly grasped constraints defining correctness of structure. There are interesting parallels between syntactic and moral systems, modulo the key difference that linguistic systems are diverse whereas morality is universal. The appropriate epistemology for justifying formulations of normative systems is familiar in philosophy: it is known as the method of reflective equilibrium.
This study investigates the question of whether the processing of complex anaphors require more cognitive effort than the processing of NP-anaphors. Complex anaphors refer to abstract objects which are not introduced as a noun phrase and bring about the creation of a new discourse referent. This creation is called “complexation process”. We describe ERP findings which provide converging support for the assumption that the cognitive cost of this complexation process is higher than the cognitive cost of processing NP-anaphors.
Preface
(2010)
This paper is concerned with relative constructions in non-standard varieties of European languages, which will be analyzed on the basis of three typological parameters (word order, relative element, syntactic role of the relativized item). The validity of claims raised in studies on the areal distribution of relative constructions in Europe will be checked against the results of the analysis, so as to ascertain whether they still hold when non-standard varieties are examined.
This paper argues that a lectometric approach may shed light on the distinction between destandardization and demotization, a pair of concepts that plays a key role in ongoing discussions about contemporary trends in standard languages. Instead of a binary distinction, the paper proposes three different types of destandardization, defined as quantitatively measurable changes in a stratigraphic language continuum. The three types are illustrated on the basis of a case study describing changes in the vocabulary of Dutch in The Netherlands and Flanders between 1990 and 2010.
Corpora with high-quality linguistic annotations are an essential component in many NLP applications and a valuable resource for linguistic research. For obtaining these annotations, a large amount of manual effort is needed, making the creation of these resources time-consuming and costly. One attempt to speed up the annotation process is to use supervised machine-learning systems to automatically assign (possibly erroneous) labels to the data and ask human annotators to correct them where necessary. However, it is not clear to what extent these automatic pre-annotations are successful in reducing human annotation effort, and what impact they have on the quality of the resulting resource. In this article, we present the results of an experiment in which we assess the usefulness of partial semi-automatic annotation for frame labeling. We investigate the impact of automatic pre-annotation of differing quality on annotation time, consistency and accuracy. While we found no conclusive evidence that it can speed up human annotation, we found that automatic pre-annotation does increase its overall quality.
This paper deals with multiword lexemes (MWLs), focussing on two types of verbal MWLs: verbal idioms and support verb constructions. We discuss the characteristic properties of MWLs, namely nonstandard compositionality, restricted substitutability of components, and restricted morpho-syntactic flexibility, and we show how these properties may cause serious problems during the analysis, generation, and transfer steps of machine translation systems. In order to cope with these problems, MT lexicons need to provide detailed descriptions of MWL properties. We list the types of information which we consider the necessary minimum for a successful processing of MWLs, and report on some feasibility studies aimed at the automatic extraction of German verbal multiword lexemes from text corpora and machine-readable dictionaries.
On valence-binding grammars
(1978)
The valence of a verb determines the number, and the syntactic class, of those expressions that must co-occur with it in a sentence. Definitions of "valence-term" and "valence-boundness" are provided whereby the precise conditions are formulated that a valence-binding grammar must satisfy. These conditions are exemplified in the framework of a simple categorial grammar, in which various reductions of the general notions can be carried out.
This is a revised and translated version of my article "Die doppelte Wende - Zur Verbindung von Sprache, Sprachwissenschaft und zeitgebundener politischer Bewertung am Beispiel deutsch-deutscher Sprachdifferenzierung" which appeared in Politische Semantik - Bedeutungsanalytische und sprachkritische Beiträge zur politischen Sprachverwendung, ed. Josef Klein (Opladen: Westdeutscher Verlag, 1989), pp. 297-326. I am indepted to Colin Good, Norwich, England, for having translated the text into English.
This paper focuses on the interaction of interrogativity and information structure in Slavic polarity questions where the clitic li may indicate interrogativity as well as focusation. We will see how the semantic category sentence force as well as the pragmatically induced information structuring are anchored syntactically and represented semantically. Even though we will introduce two notions of li for methodological reasons, there is only one li in each language. Within the framework of Rizzi's (1997) theory of the split C-Domain, we will see that li only occurs in Force[0] in Russian and Serbian/Croatian indicating that li is some kind of complementizer. In Bulgarian and Macedonian, on the other hand, li is generated more or less 'independently' from Force[0], but forces the constituent it is adjoined to to move up to FocP. We will further show that Rizzi's theory also accounts for the compositional derivation of meaning of yes/no-interrogatives with information or identificational focus.
Departing from Rooth's focus interpretation theory the article discusses two types of (German) ellipsis phenomena: direct alternative and implicit alternative coordinative ellipsis. For the first type, which includes Stripping, Gapping, ATB, and RNR, it is characteristic that the semantic value of either conjunct instantiates the context variable of the respective focus operator in the other. For German Polarity ellipsis and Sluicing, which constitute the other type, it is characteristic that the semantic value, which instantiates the variable given by the focus operator in the second conjunct, must be derived from the semantic value of the first conjunct and that the second conjunct always hosts an alternative set inducing item which demands new information focus in the first conjunct.
Perhaps the biggest challenge in derivational morphology is to reconcile morphological idiosyncrasy with semantic regularity. How can it be explained that words with dead affixes and irregulär allomorphy can nonetheless exhibit straightforward and stable semantic relations to their etymological bases (cf. strength ‘property of being strong’, obedience ‘act of obeying’, ‘property of being obedient’)? Theories based on the idea of capturing regularity in terms of synthetic rules for building up complex words out of morphemes along with rules for interpreting such structures in a compositional fashion have not made - and arguably cannot make - sense of this phenomenon. Taking the perspective of the learner in acquisition, I propose an alternative approach to meaning assignment based, not on syntagmatic relations among their constituent morphemes, but on paradigmatic relations between whole words. This approach not only explains the conditions under which meaning relations between words are expected to be stable but also accounts for another notorious mystery in derivational morphology, the frequent occurrence of total synonymy among affixes, as opposed to words.
Annotating Spoken Language
(2014)
Designed as a contribution to contrastive linguistics, the present volume brings up-to-date the comparison of German with its closest neighbour, Dutch, and other Germanic relatives like English, Afrikaans, and the Scandinavian languages. It takes its inspiration from the idea of a "Germanic Sandwich", i.e. the hypothesis that sets of genetically related languages diverge in systematic ways in diverse domains of the linguistic system. Its contributions set out to test this approach against new phenomena or data from synchronic, diachronic and, for the first time in a Sandwich-related volume, psycholinguistic perspectives. With topics ranging from nickname formation to the IPP (aka 'Ersatzinfinitiv'), from the grammaticalisation of the definite article to /s/-retraction, and from the role of verb-second order in the acquisition of L2 English to the psycholinguistics of gender, the volume appeals to students and specialists in modern and historical linguistics, psycholinguistics, translation studies, language pedagogy and cognitive science, providing a wealth of fresh insights into the relationships of German with its closest relatives while highlighting the potential inherent in the integration of different methodological traditions.
This English language version (translated by Martin Wynne) is a reworked and slightly abridged version of the paper given in Canterbury in February 1999. A German language version has appeared as Hellmann, Manfred W.: ‘Wörter in Texten der Wendezeit’ 1989-1990 - Ein Wörterbuch zur lexikographischen Erschließung des ‘Wendekorpus’ in Jordanova, Ljubima (edj: 10 godini promjana v Iztotschna Evropa (10 Jahre Wende in Osteuropa), (= Socio- linguistika Bd. 4), BULLEKS: Sofia (Bulgaria) 1999, S. 11-39.
In the first part of this contribution, we will present, as a starting point for the following discussions, a simple formal language P containing one stative predicate. We will then discuss, on an intuitive level, how a treatment of predicates of change could be conceived, and how the progressive could be rendered in a formal language.
We will then give a formal definition of a language, TP1, based on P, and we will construct a semantics for TP1, which incorporates the ideas discussed.
Der Aufsatz nähert sich der Frage, wie Sprachwandel beobachtet und beschrieben werden kann, auf empirischen Wege: Es werden Sprachbiographien von deutschstämmigen Amerikaner(inne)n aus Wisconsin nachgezeichnet. Diese Fallstudien - von denen hier zwei etwas näher beleuchtet werden - lassen ganz unterschiedliche Entwicklungen in der Lebenszeit eines Sprechers erkennbar werden. Der Beibehaltung und behutsamen Wandlung im Sprachgebrauch einer schweizerdeutschen Sprecherin steht der beinahe komplette Verlust der deutschen Sprachkompetenz einer Niederdeutsch-Sprecherin gegenüber.
Für die Rekonstruktion dieser Wandlungsprozesse in realer Zeit wird die Methode des Re-Recordings präsentiert - der erneuten Aufnahme von Sprechern, die in früheren Tonaufnahme-Aktionen in Wisconsin bereits einmal erfasst wurden (hier: 1968 und 2001). Erste Ergebnisse der zu Grunde liegenden linguistischen Analysen werden in Tabellen dargestellt.
This paper discusses the semi-formal language of mathematics and presents the Naproche CNL, a controlled natural language for mathematical authoring. Proof Representation Structures, an adaptation of Discourse Representation Structures, are used to represent the semantics of texts written in the Naproche CNL. We discuss how the Naproche CNL can be used in formal mathematics, and present our prototypical Naproche system, a computer program for parsing texts in the Naproche CNL and checking the proofs in them for logical correctness.
Positioning analysis, a variant of discourse analysis, was used to explore the narratives of 40 psychiatric patients (11 females and 29 males; mean age = 40 years) who had manifest difficulties with engagement with statutory mental health services. Positioning analysis is a qualitative method that captures how people linguistically position the roles and identities of themselves and others in their day-to-day lives and narratives. The language of disengagement incorporated the passive positioning of self in relation to their lives and treatment through the use of metaphor, the passive voice and them and us attribution, while the discourse of engagement incorporated more active positioning of self, achieved through the use of the personal pronoun we and metaphoric references to balanced relationships. The findings corroborate previous thematic analysis that highlighted the importance of identity and agency in the ‘making or breaking’ of therapeutic relationships (Priebe et al. 2005). Implications are discussed in relation to how positioning analysis may help signal and emphasize important life and therapeutic experiences in spoken narratives as well as clinical consultations.
Introduction
(2010)
The present thesis investigates the syntagmatic relations of certain Finnish emotion verbs that are formed by the derivational suffix -ua/-yä (e.g. suuttua ‘get angry’, pelästyä ‘get frightened’). Prototypically, the suffix expresses reflexivity, but in the case of the “inchoative” emotion verbs, it indicates a change of state on behalf of the experiencer, from a non-emotional state to an emotional state.
The analysis which we present in this paper is part of an ethnographically based sociolinguistic study of various immigrant youth groups and their social style of communication. The study describes the wide variety of migrant groups and their socio-cultural orientation in relation to different migrant worlds as well as to different social worlds of the dominant German society. The development of a social style of communication is grounded in the groups’ socio-cultural orientation as well as in the perception of themselves in relation to relevant others. The main purpose of our study is to analyse the construction of the groups’ social identity in terms of their social style of communication.
The main point of this chapter is to demonstrate how a speaker’s concept of his/her professional role can be inferred from his/her perspectival work (perspective setting and relating different perspectives to one another) in professional encounters. Thereby some risks of complex perspectival work in discourse will become manifest which result - at one point in the talk - in perspectival inconsistency, revealing a deeply grounded social problem for the speaker. This will be examined in the framework of a rhetorical conversation analysis.
In this paper, I present some aspects of a youth group’s construction of a communicative style and show how the group’s stylistic repertoire changes over the course of their growing into adulthood. My paper is based on an ethnographic case study of a group of Turkish girls, the ‘Powergirls’, who grew up in a typical Turkish migrant neighborhood in the inner city of Mannheim, Germany. The aim of the case study was, on the basis of biographical interviews with group members and long-term observation of group interactions, to reconstruct the formation of an ethnically defined ‘ghetto’-clique and its style of communication and to describe the group’s development into educated, modern, German-Turkish young women. In this process, a change in the group’s stylistic repertoire could be observed.
The Stuttgart-Tübingen Tagset (STTS) is a widely used POS annotation scheme for German which provides 54 different tags for the analysis on the part of speech level. The tagset, however, does not distinguish between adverbs and different types of particles used for expressing modality, intensity, graduation, or to mark the focus of the sentence. In the paper, we present an extension to the STTS which provides tags for a more fine-grained analysis of modification, based on a syntactic perspective on parts of speech. We argue that the new classification not only enables us to do corpus-based linguistic studies on modification, but also improves statistical parsing. We give proof of concept by training a data-driven dependency parser on data from the TiGer treebank, providing the parser a) with the original STTS tags and b) with the new tags. Results show an improved labelled accuracy for the new, syntactically motivated classification.
Traditionally, research on language change has been a post-mortem activity, focused on isolated changes that are complete and often only documented in written texts. In the 1960s the field was advanced considerably by Labovian sociolinguistics and the investigation of “change in progress” adduced through patterns of community-internal linguistic variation correlated with external facts about speakers such as age and class (see Labov 1994 for an overview). However, despite the many benefits of such work on “dynamic synchrony,” we still know relatively little about how language change unfolds over the lifetimes of individual speakers, that is, in real time (cf. Bailey et al. 1991). The logistical challenges of such research are, of course, considerable. Whereas it is straightforward for psycholinguists to observe language development in children over the course of a few years, documenting changes in the verbal behavior of individuals over several decades is by contrast much less feasible. Nevertheless, present theoretical models of language change could be considerably improved by the results of real-time studies.