The Internet as a medium is changing, and with it the conditions under which content is published and received. What opportunities do the Social Web and the Semantic Web, two visions of the future currently being discussed in parallel, offer? To answer this question, the contribution examines the foundations of both models with regard to application and technology, and also sheds light on their shortcomings and on the added value of combining them in a way suited to the medium. Using the online grammatical information system grammis as an example, it sketches a strategy for exploiting the respective strengths of both in an integrated way.
Slowakei
(2008)
In literate societies, linguistic competence comprises both speaking and writing. This holds for the mother tongue as well as for foreign languages. Speaking and writing are rather different activities, so one would expect them to receive equal attention in foreign-language teaching, including the teaching of German as a foreign language (DaF). Teaching practice shows, however, that written language dominates while spoken language leads a shadowy existence. In this contribution I name five reasons why spoken language remains in the background in this way and is an unwieldy object that is hard to handle (Section 2). I then try to make clear how far-reaching the differences between spoken and written language are (Section 3). Finally, I formulate some consequences for foreign-language and DaF teaching and argue for becoming aware of, and facing up to, the difficulties involved in taking spoken language into account, because spoken language is, in my view, an indispensable component of foreign-language teaching.
The bibliography of the project "Deutsch in Russland" contains 359 titles, two thirds of which are in Russian. The contents of most of the Russian-language publications are briefly summarised in the text of the bibliography. The introduction offers some remarks on the state of research after 1990 and a description of the contents of the titles.
In German and other European languages, demonstratives can form the antecedent of relative clauses or function as the determiner of such an antecedent. Constructions of this kind show peculiarities of form and meaning: on the one hand, there are demonstratives that cannot, or can only marginally, be combined with appositive relative clauses; on the other hand, there are demonstratives that either do not permit restrictive relative clauses or combine with them only in special, non-deictic and non-phoric meanings. At least some of these peculiarities seem to point to more general, cross-linguistic constraints. There is thus a tendency for the combinability of demonstratives with restrictive relative clauses to correlate with the deictic strength of the demonstrative: distance-marking demonstratives, which are deictically strong in this sense, tend to exclude restrictive relative clauses, whereas distance-neutral demonstratives or those that can be used non-deictically generally allow them. Constraints of this kind are demonstrated for German, French and Swedish.
The contribution deals with the interactive structure of doctor-patient communication. After a short discussion of the relevance of doctor-patient communication within public health policy, an outline is given of the medical and linguistic research on doctor-patient communication in Germany. Basic features of conversations and the conversation-analytic methodology are then presented. Conversation analyses of doctor-patient communication reveal five main interactive components, which are discussed in detail. Finally, some considerations concerning the implementation of linguistic research in medical practice are discussed.
Digital Text Collections, Linguistic Research Data, and Mashups: Notes on the Legal Situation
(2008)
Comprehensive data repositories are an essential part of practically all research carried out in the digital humanities nowadays. For example, library science, literary studies, and computational and corpus linguistics strongly depend on online archives that are highly sustainable and that contain not only digitized texts but also audio and video data as well as additional information such as metadata and arbitrary annotations. Current Web technologies, especially those that are related to what is commonly referred to as the Web 2.0, provide a number of novel functions such as multiuser editing or the inclusion of third-party content and applications that are also highly attractive for research applications in the areas mentioned above. Hand in hand with this development goes a high degree of legal uncertainty. The special nature of the data entails that, in quite a few cases, there are multiple holders of personal rights (mostly copyright) to different layers of data that often have different origins. This article discusses the legal problems of multiple authorships in private, commercial, and research environments. We also introduce significant differences between European and U.S. law with regard to the handling of this kind of data for scientific purposes.
The Meta-data-Database of a Next Generation Sustainability Web-Platform for Language Resources
(2008)
Our goal is to provide a web-based platform for the long-term preservation and distribution of a heterogeneous collection of linguistic resources. We discuss the corpus preprocessing and normalisation phase that results in sets of multi-rooted trees. At the same time we transform the original metadata records, just like the corpora annotated using different annotation approaches and exhibiting different levels of granularity, into the all-encompassing and highly flexible format eTEI for which we present editing and parsing tools. We also discuss the architecture of the sustainability platform. Its primary components are an XML database that contains corpus and metadata files and an SQL database that contains user accounts and access control lists. A staging area, whose structure, contents, and consistency can be checked using tools, is used to make sure that new resources about to be imported into the platform have the correct structure.
The multiple gradations of German strong verbs are but manifestations of a rather uncomplicated system. There is a small number of ways to make up ablaut forms; these types of formation are identifiable in formal terms and, what is more, they have definite functions as morphological markers. Using classifications of stem forms according to quality, complexity and quantity of vowels, three types of operations involved in ablaut formation are identified. Ablaut always includes a change of quality type or a change of complexity type, and in addition it may include a change of quantity type. Ablaut forms are clearly distinguished as against bases (and against each other): their vocalism meets a defined standard of dissimilarity. On this basis, gradations are collected into inflectional classes that are defined in strictly synchronic terms. These classes continue the historical seven classes known from reference grammars. For the majority of strong verbs, membership in these classes (and thus ablaut) is predictable.
Lexical chaining has become an important part of many NLP tasks. However, the goodness of a chaining process and hence its annotation output depends on the quality of the chaining resource. Therefore, a framework for chaining is needed which integrates divergent resources in order to balance their deficits and to compare their strengths and weaknesses. In this paper we present an application that incorporates the framework of a meta model of lexical chaining exemplified on three resources and its generalized exchange format.
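As a rough illustration of what lexical chaining involves, the following Python sketch groups the words of a text into chains using a small relatedness resource. The `RELATED` table, the function names, and the greedy attachment strategy are assumptions made for this example only; they are not taken from the paper's meta model or its exchange format.

```python
from typing import FrozenSet, List, Set

# Hypothetical toy resource: pairs of words treated as semantically related.
RELATED: Set[FrozenSet[str]] = {
    frozenset({"car", "vehicle"}),
    frozenset({"car", "engine"}),
    frozenset({"bank", "money"}),
}


def related(a: str, b: str, resource: Set[FrozenSet[str]]) -> bool:
    """Two words are related if they are identical or listed as a pair."""
    return a == b or frozenset({a, b}) in resource


def build_chains(words: List[str], resource: Set[FrozenSet[str]]) -> List[List[str]]:
    """Greedily attach each word to the first chain with a related member."""
    chains: List[List[str]] = []
    for word in words:
        for chain in chains:
            if any(related(word, member, resource) for member in chain):
                chain.append(word)
                break
        else:
            chains.append([word])  # no related chain found: start a new one
    return chains


chains = build_chains(["car", "bank", "engine", "money", "vehicle"], RELATED)
# chains == [["car", "engine", "vehicle"], ["bank", "money"]]
```

Passing a different `resource` argument changes the resulting chains, which is one simple way to see why the quality of the chaining output depends on the resource used.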
Anakoluthe dependenziell
(2008)
Electronic corpora play an ever growing role in lexicography. On the one hand, new access to linguistic usage is made possible through the use of text corpora and intelligent corpus-based query tools; however, the final results are still interpreted and described by lexicographers. In this case corpora are used for data acquisition. On the other hand, there are also projects that provide purely automatically acquired data in the form of "dictionaries". Lexicographers play only a minor role here. This latter type of corpus use creates a completely new kind of electronic dictionary. This article addresses the questions as to what extent these dictionaries differ from lexicographic tradition and whether they must be considered in metalexicography. Starting from previously compiled electronic dictionary typologies, we try to supplement the formulation of lexicographic data as a distinguishing feature. Finally, based on the findings of the project elexiko (Institute for the German Language - IDS), we demonstrate that the distinction between electronic versus man-made lexicographic data is also relevant to lexicographical practice.
CONTRIBUTIONS TO THE STUDY OF GERMAN USAGE A CORPUS-BASED APPROACH
This paper outlines some basic assumptions and principles underlying the corpus linguistics research and some application domains at the Institute for German Language in Mannheim. We briefly address three complementary but closely related tasks: first, the acquisition of very large corpora, second, the research on statistical methods for automatically extracting information about associations between word configurations, and, third, meeting the challenge of understanding the explanatory power of such methods both in theoretical linguistics and in other fields such as second language acquisition or lexicography. We argue that a systematic statistical analysis of huge bodies of text can reveal substantial insights into language usage and change, far beyond just collocational patterning.
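To make the idea of statistically extracting associations between words concrete, here is a minimal Python sketch. The abstract does not name a particular statistic, so pointwise mutual information (PMI) over sentence-level co-occurrence is used purely as a stand-in; the function name `pmi_scores` and the toy corpus are hypothetical.

```python
import math
from collections import Counter
from itertools import combinations
from typing import Dict, FrozenSet, List


def pmi_scores(sentences: List[List[str]]) -> Dict[FrozenSet[str], float]:
    """Pointwise mutual information for word pairs co-occurring in a sentence."""
    n = len(sentences)
    word_count: Counter = Counter()
    pair_count: Counter = Counter()
    for sentence in sentences:
        types = sorted(set(sentence))  # count each word once per sentence
        word_count.update(types)
        pair_count.update(frozenset(p) for p in combinations(types, 2))
    scores: Dict[FrozenSet[str], float] = {}
    for pair, joint in pair_count.items():
        a, b = tuple(pair)
        # PMI = log2( P(a, b) / (P(a) * P(b)) ), probabilities over sentences
        scores[pair] = math.log2(
            (joint / n) / ((word_count[a] / n) * (word_count[b] / n))
        )
    return scores


# Toy corpus (illustrative only): "strong tea" recurs, "strong computer" is rare.
corpus = [
    ["strong", "tea"],
    ["strong", "tea", "please"],
    ["powerful", "computer"],
    ["strong", "computer"],
]
scores = pmi_scores(corpus)
```

In this toy corpus the pair "strong"/"tea" receives a higher score than "strong"/"computer". Real cooccurrence analysis works on far larger corpora and typically with more robust measures than plain PMI.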
The development of user-adapted views of lexicographic data is frequently called for by dictionary research on electronic reference works and hypertext information systems. For the printed dictionary it has been indispensable to develop a complete dictionary for a particular user group and particular usage situations. In contrast, any electronic presentation of lexicographic data offers the possibility of defining user-specific views of an initially user-unspecific resource. However, research on dictionary use in general still has to answer several open questions on this subject. This paper first provides an overview of the present state of research on dictionary use with respect to electronic lexicography. It then explains further prerequisites for possible user-adapted access to data, as exemplified by OWID, the Online Vocabulary Information System of the Institut für Deutsche Sprache. Finally, it outlines the results achieved on the subject so far and highlights the prospects of potential user-adapted presentations of lexicographic data.
elexiko is an online dictionary of contemporary German that is compiled in a corpus-based and modular way. One focus is the detailed corpus-based description of the meaning and use of linguistic expressions and of the links between them. The presentation of the dictionary is intended in particular to show how corpus data are prepared in the dictionary entries and how elexiko can be used to obtain lexical knowledge from the entries in different usage situations.
COOCCURRENCE ANALYSIS SEEN CONTRASTIVELY
On applying collocational patterning in bilingual lexicography - some examples from the large German-Czech academic dictionary
This paper takes up some of the thoughts presented in the study by C. Belica and K. Steyer in this volume. It shows how bilingual lexicographers can take advantage of cooccurrence analysis results when dealing with German-Czech contrasts and structuring word configurations in an entry. The authors also sketch the corpus data in the form of structural types based on collocational patterns and stress the importance of cooccurrence analysis for an enlarged offer of equivalents. They plead for more consideration of syntactic variability and argue that cooccurrence analysis, applied to both German and Czech, should be an important step.
This is a study of how aspects of information structure can be captured within a formal grammar of Spanish, couched in the framework of Head-Driven Phrase Structure Grammar (HPSG, Pollard and Sag 1994). While a large number of morphological, syntactic and semantic aspects in a variety of languages have been successfully analysed in this theory, information structure has not been paid the same attention in the HPSG literature. However, as a theory of signs, HPSG should include all levels of description without which the structural descriptions offered by the grammar would ultimately remain incomplete. Languages often explicitly mark the information-structural partitioning of utterances. Depending on the particular language, linguistic resources used for this purpose include prosody (stress/intonation), syntax (e.g. constituent order, special syntactic constructions) and morphology (e.g. special affixes). In HPSG, phonological, syntactic, semantic and pragmatic information is represented in parallel, which would seem to be a well-suited architecture for modelling the sort of interfaces called for.
The research project “German Today” aims to determine the amount of regional variation in (near-)standard German spoken by young and older educated adults and to identify and locate regional features. To this end, we compile an areally extensive corpus of read and spontaneous German speech. Secondary school students and 50-to-60-year-old locals are recorded in 160 cities throughout the German speaking area of Europe. All participants read a number of short texts and a word list, name pictures, translate words and sentences from English, answer questions in a sociobiographic interview, and take part in a map task experiment. The resulting corpus comprises over 1000 hours of speech, which is transcribed orthographically. Automatically derived broad phonetic transcriptions, selective manual narrow phonetic transcriptions, and variationalist annotations are added. Focussing on phonetic variation we aim to show to what extent national or regional standards exist in spoken German. Furthermore, the linguistic variation due to different contextual styles (read vs. spontaneous speech) shall be analysed. Finally, the corpus enables us to investigate whether linguistic change has occurred in spoken (near-)standard German.
The present study examines the dynamics of the kanji combinations that form common (or general) and proper nouns in Japanese. The following three results were obtained. First, the degree of distribution results from two similar processes which are based on a steady-state of birth-and-death processes with different birth and death rates, resulting in a positive negative binomial distribution with the proper nouns and in a positive Waring distribution with common nouns. Second, all rank-frequency distributions follow the negative hypergeometric distribution used very frequently in ranking problems. Third, the building of kanji compounds follows a dissortative strategy. The higher the outdegree of a kanji, the more it prefers kanji with lower indegrees. A linear dependence can be observed with common nouns, whereas the relationship between compounded kanji is rather curvilinear with proper nouns. The actual analytical expression is not yet known.
Open peer commentary on the target article “Who Conceives of Society?” by Ernst von Glasersfeld. Excerpt: I will focus on one crucial step in von Glasersfeld’s argumentation, viz. his view that every individual constructs his own private meanings (understood as conceptual structures or elements thereof) for linguistic expressions, so that linguistic interaction and even communication in general is based on a notion of compatibility between different speakers’ private conceptual schemes. The central question here is: “Just what does it mean that different private conceptual schemes (private meanings) are compatible, or what constitutes a viable criterion to this end?” As von Glasersfeld himself stresses twice (§28, §37), the criteria to be looked for can only be “public,” residing in properties of verbal and non-verbal actions of the interacting individuals, properties that can be sensed and processed by the participating system.
This paper is a project report on the lexicographic Internet portal OWID, an Online Vocabulary Information System of German which is being built at the Institute of German Language in Mannheim (IDS). The contents of the portal and its technical approaches are presented. The lexical database is structured in a granular way, which makes it possible to extend the search options for lexicographers. Against the background of current research on the use of electronic dictionaries, the project OWID is also working on first ideas for user-adapted access to and user-adapted views of the lexicographic data. Because the portal OWID comprises dictionaries that are available online, the design and functions of the website can be changed easily (in comparison to printed dictionaries). Ideas for implementing user-adapted views of the lexicographic data are demonstrated using an example taken from one of the dictionaries of the portal, namely elexiko.
Research on syntactic ambiguity resolution in language comprehension has shown that subjects' processing decisions are influenced by a variety of heterogeneous factors such as syntactic complexity, semantic fit and the discourse frequency of the competing structures. The present paper investigates a further potentially relevant factor in such processes: effects of syntagmatic lexical chunking (or matching to a complex memorized prefab) whose occurrence would be predicted from usage-based assumptions about linguistic categorisation. Focusing on the widely studied so-called DO/SC-ambiguity in which a post-verbal NP is syntactically ambiguous between a direct object and the subject of an embedded clause, potentially biasing collocational chunks of the relevant type are identified in a number of corpus-linguistic pretests and then investigated in a self-paced reading experiment. The results show a significant increase in processing difficulty from a collocationally neutral over a lexically biasing to a strongly biasing condition. This suggests that syntagmatically complex and partially schematic templates of the kind envisioned in usage-based Construction Grammar may impinge on speakers' online processing decisions during sentence comprehension.
Introduction
(2008)
In usage-based Construction Grammar, grammatical structure is assumed to ‘sediment’ from concrete linguistic experience as an automatic by-product of repeated similar categorisation judgments (a process known as schematisation). At the same time, there is functional pressure on prospective inputs to such schematisations to retain or develop specialised properties that differentiate them from their near neighbours, i.e. other stored units in the constructicon (Goldberg 1995). Moreover, speakers are not assumed to necessarily extract all possible generalisations from their input. Using the example of a group of German support verb constructions, the present study outlines a corpus-linguistic approach to identifying those schemas that really seem to be formed by speakers, and how they can be kept apart from mere potential generalisations.
Ethnolinguistic research has gained considerable popularity over the last two decades. The most important metaphorical formula describing the main subject of this research is the "linguistic picture of the world". Since large-scale comparative study projects are currently emerging, it may be worth considering what such a conception of ethnolinguistics fails to take into account. The visual metaphor of pictures implies that speakers are able to step outside the world and look at it (and name it) from the outside. The article discusses two consequences of this metaphor that may cause problems. First, isolating language from the world of human action, of which language is after all a part, leads to the adoption of a cognitivist model of meaning as a separate stream of communication. Such a model does not fit the everyday experience of the transparency of language. Second, isolating language from life encourages the use of "timeless" methods and the study of words abstracted from the situation in which they were used (if not taken out of context). By adopting such metaphors and methods, we may lose sight of a considerable part of what is essential to everyday language, the subject matter of ethnoscience.
Elexiko is a lexicological-lexicographic, corpus-guided German Internet reference work (cf. www.elexiko.de). Compared to printed dictionaries, restrictions on space disappear in elexiko. Specific comments on the use of a word do not need to be given in traditional abbreviated forms, such as the so-called field or usage labels. In this paper, I will show its advantages for the description of the particular pragmatic characteristics of a word: I will argue that traditional labelling such as formal, informal, institutional, etc. cannot account for the comprehensive pragmatic dimension of a word and that these labels are not transparent, particularly for non-native speakers of German. The main focus of the paper will be on an alternative approach to this dictionary information, as suggested by elexiko. I will demonstrate how narrative, descriptive and user-friendly notes can be formulated to explain the discursive contextual embedding or tendencies of evaluative use. I will outline how lexicographers can derive such information from language data in an underlying corpus which was designed and compiled for specific lexicographic purposes. Both the theoretical-conceptual ideas and their lexicographic realisation in elexiko will be explained and illustrated with the help of relevant dictionary entries.
Language-aware text editing
(2008)
While software developers have various power tools at their disposal that make the writing of computer programs more efficient, authors of texts do not have the support of such power tools. Text processors still operate on the level of characters and strings rather than on the level of word forms and grammatical constructions. This forces authors to constantly switch between low-level, character oriented, editing operations and high-level, conceptual, verbalisation processes. We suggest the development of language-aware text editing tools that simplify certain frequent, yet complex editing operations by defining them on the level of linguistic units. Pluralizing an entire noun phrase plus the verb forms governed by it would be an ambitious example, swapping the elements of a conjunctive construction a more modest one. We describe a pilot implementation for German where these operations are seamlessly integrated with the standard functions of an existing open-source editor. The operations can be invoked on demand and do not intrude on the authoring process. Changes can be performed locally or globally, thus simplifying the writing process considerably, and making the resulting texts more consistent.
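The more modest operation mentioned above, swapping the elements of a conjunctive construction, can be sketched in a few lines of Python. This is only an illustration under strong assumptions: the function `swap_conjuncts` is hypothetical, it works on a pre-tokenised sentence, and the conjunct lengths are supplied by hand, whereas a language-aware editor would derive them from linguistic analysis. It does not reproduce the pilot implementation described in the abstract.

```python
from typing import List


def swap_conjuncts(
    tokens: List[str], conj_index: int, left_len: int, right_len: int
) -> List[str]:
    """Swap the two conjuncts around the conjunction at conj_index.

    left_len and right_len give the number of tokens in each conjunct; a real
    tool would obtain these spans from a parser or chunker instead.
    """
    left_start = conj_index - left_len
    right_end = conj_index + 1 + right_len
    if left_start < 0 or right_end > len(tokens):
        raise ValueError("conjunct spans fall outside the token list")
    left = tokens[left_start:conj_index]
    right = tokens[conj_index + 1:right_end]
    # Rebuild the sentence with the conjuncts exchanged, conjunction in place.
    return tokens[:left_start] + right + [tokens[conj_index]] + left + tokens[right_end:]


sentence = "Sie kauft Äpfel und reife Birnen .".split()
swapped = swap_conjuncts(sentence, conj_index=3, left_len=1, right_len=2)
# " ".join(swapped) == "Sie kauft reife Birnen und Äpfel ."
```

Defining the operation on token spans rather than on characters is the point of the example: the author names the linguistic unit, and the tool handles the low-level cut-and-paste.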
In this paper the authors briefly outline editing functions which use methods from computational linguistics and take the structures of natural languages into consideration. Such functions could reduce errors and better support writers in realizing their communicative goals. However, linguistic methods have limits, and there are various aspects software developers have to take into account to avoid creating a solution looking for a problem: Language-aware functions could be powerful tools for writers, but writers must not be forced to adapt to their tools.
Media literacy is regarded as a key qualification in the information and knowledge society, affecting life, learning and work. Individuals as well as organisations and systems are responsible for the acquisition of this competence. Since it has become a prerequisite for active participation in and creative co-determination of this society, all target groups and age groups should possess it. In Germany, however, there are major deficits both in media research on and in the promotion of media literacy for people with a migration background. Current integration initiatives and official statements stress the need to remedy these shortcomings and to make more efficient use of the great potential of integration through media. Studies on the media use of adults and children show that the preconditions for this are relatively good. For example, people with a migration background are often better equipped with media in their households than German households. In addition, the majority of immigrants are reached by German-language media and by media in their native languages. Media use is determined more strongly by sociodemographic factors than by ethnic affiliation. In order to reach the heterogeneous group of people with a migration background for intercultural media work, a wide range of aspects and interrelations must be taken into account. Selected projects and activities offer suggestions for the practical promotion of media literacy for this target group.
‘Linguistic relativity’ has become a major keyword in debates on the psychological significance of language diversity. In this context, the term ‘relativity’ was originally taken on loan from Einstein’s then-recent theories by Edward Sapir (1924) and Benjamin L. Whorf (1940). The present paper assesses how far the idea of linguistic relativity does analogically build on relevant insights in modern physics, and fails to find any substantial analogies. The term was used rhetorically by Sapir and Whorf, and has since been incorporated into a cognitivist research programme that seeks to answer whether ‘language influences thought’. Contemporary research on ‘linguistic relativity’ has developed into a distinct way of studying language diversity, which shares a lot with the universalistic cognitivist framework it opposes, but little with relational approaches in science.
Einleitung
(2008)
Diskurswörterbuch
(2008)
After a brief discussion of the term discourse, discourse is related to the tasks of a discourse dictionary. The paper goes on to develop the subject of discourse lexicography, which is a lexicographic presentation of discourse vocabulary, of the net of its semantic relations, and of the societal and historical circumstances of the usage people have made of it. This background is useful for the presentation of two types of discourse dictionaries. On the one hand, they are based on the same primary conception. On the other hand, they are adapted to the respective discourse constellations. The first example is the result of a project on the early post-war period and presents the already-existing discourse dictionary of this project. The content of this dictionary is the vocabulary of three different groups, which participate in one discourse and specifically represent its main item. Since this dictionary also exists in an electronic version, the concept is demonstrated with examples taken from this version. The second example refers to an ongoing project on the 1967/68 protest period. The vocabulary of this discourse makes up a set of several single discourse items, while these items constitute the leading subject of the discourse of 1967/68: democracy. Thus, the task of the lexicographic description of a complex discourse like this is not least to assign the discourse vocabulary to the single discourses and to describe the different usages relating to these single discourses. The paper ends with a draft of a lexicographic program based on the discourse dictionary type.
This article examines, from a conversation-analytic perspective, how the task of saying goodbye is interactively addressed and negotiated in the final session of the psychoanalytic treatment of »Amalie«. On the basis of three central interaction sequences, it reconstructs how the patient systematically de-thematises the farewell. She uses various practices of self-positioning and of other-positioning of the therapist to legitimise her refusal to work through the farewell and to negate its relevance. Moreover, she resolves the task of saying goodbye by symbolically suspending it through the claim of a mental merging with the therapist, which supposedly renders the impending loss of the real relationship irrelevant.
One of the most popular techniques used in HPSG-based studies to describe linguistic phenomena is the raising mechanism. Besides ordinary raising verbs or adjectives, this tool has been applied for handling verbal complexes and discontinuous constituents, among other phenomena. In this paper, a new application for raising within the HPSG paradigm will be discussed, thereby investigating data from the prepositional domain. We will analyze linguistic properties of word combinations in German consisting of a preposition, a noun, and another preposition (such as auf Grund von (‘by virtue of’)), thus arguing that raising is the most appropriate method for satisfactorily describing the crucial syntactic features which are typical for those expressions. The objective of this paper is thus to demonstrate the efficiency of the raising mechanism as used in HPSG, and therefore, to emphasize the importance of designing a satisfactory uniform theory of raising within this grammar framework.
The authors present a multilingual electronic database of lexical items with idiosyncratic occurrence patterns. Currently, our database consists of: (1) a collection of 444 bound words in German; (2) a collection of 77 bound words in English; (3) a collection of 58 negative polarity items in Romanian; (4) a collection of 84 negative polarity items in German; and (5) a collection of 52 positive polarity items in German. The database is encoded in XML and is available via the Internet, offering dynamic and flexible access.
This paper presents three electronic collections of polarity items: (i) negative polarity items in Romanian, (ii) negative polarity items in German, and (iii) positive polarity items in German. The presented collections are a part of a linguistic resource on lexical units with highly idiosyncratic occurrence patterns. The motivation for collecting and documenting polarity items was to provide a solid empirical basis for linguistic investigations of these expressions. Our database provides general information about the collected items, specifies their syntactic properties, and describes the environment that licenses a given item. For each licensing context, examples from various corpora and the Internet are introduced. Finally, the type of polarity (negative or positive) and the class (superstrong, strong, weak or open) associated with a given item is specified. Our database is encoded in XML and is available via the Internet, offering dynamic and flexible access.
The authors describe two data sets submitted to the database of MWE evaluation resources: (1) cranberry expressions in English and (2) cranberry expressions in German. The first package contains a collection of 444 cranberry words in German (CWde.txt) and a collection of the corresponding cranberry expressions (CCde.txt). The second package consists of a collection of 77 cranberry words in English (CWen.txt) and a collection of the corresponding cranberry expressions (CCen.txt). The data included in these packages was extracted from the Collection of Distributionally Idiosyncratic Items (CoDII), an electronic linguistic resource of lexical items with idiosyncratic occurrence patterns. Each package contains a readme file, and can be downloaded from multiword.wiki.sourceforge.net/Resources.
The Online-Wortschatz-Informationssystem Deutsch (OWID; Online Vocabulary Information System German) of the Institut für Deutsche Sprache (IDS; German Language Institute) in Mannheim is a lexicographic Internet portal for various electronic dictionary resources that are being compiled at the IDS. It is an explicit goal of OWID not to present a random collection of unrelated reference works but to build a network of genuinely related lexicographic products. Hence, the core of the project is the design of an innovative concept of data modelling and structuring. The goal of this granular data modelling is to allow flexible access to each individual lexicographic resource as well as access across diverse dictionary resources. At the same time, fine-grained interconnectedness of all resources should be made possible. Every lexicographic resource within OWID (elexiko, Neologismenwörterbuch, Wortverbindungen online, Schulddiskurs im ersten Nachkriegsjahrzehnt) meets this requirement with regard to data modelling and structuring. The paper explains the consistent underlying concept of data modelling for these heterogeneous lexicographic resources. It also shows how the modelling potential has been converted into the Internet presence of OWID.
The decline of linguistic skills (or: language attrition) is a phenomenon encountered in various contexts when access to what has been acquired in a language (L1, L2 or foreign language) diminishes. Research on the subject shows, for example, that the influence of the L2 makes it difficult for L1 speakers to exploit all the stylistic or pragmatic variation that their L1 would normally allow them. The question that arises is what is actually lost: is it linguistic competence, the mental representation of knowledge, that is affected, or is it rather a limitation of access to and control of acquired knowledge, which itself remains intact? In the context of current discussions about the advantages and risks of plurilingualism, it is not only interesting but indeed necessary to deepen research on the processes of attrition. Moreover, for plurilingual speakers to truly benefit from their potential, society must recognise and concretely value these competences and encourage speakers to display their bilingual identity with full confidence and openness.
Laz, a sister language of Georgian spoken on the southeastern coast of the Black Sea, is the only member of the South Caucasian family which is spoken primarily in Turkey. Due to the socio-political circumstances, all speakers of Laz living in Turkey are bilingual and use Laz primarily in private communication. Using these observations as a starting point, the paper looks at the question of whether Laz is an endangered language. In order to clarify the sociolinguistic situation of Laz in Turkey, the different levels involved in the process of gradual language loss (language-external factors, speech behaviour and structural consequences within the language system) are dealt with in detail. To determine which data should be taken as the basis for the documentation of the language, the paper also discusses linguistic criteria for differentiating between fully competent speakers of Laz and speakers who show signs of language attrition.
The paper reports on experiments with acoustic recordings of a self-built replica of the historic speaking machine of Wolfgang von Kempelen. Several possibilities of the reed as the glottal excitation mechanism were tested. Perception tests with naïve listeners revealed that the machine-generated words 'mama' and 'papa' were partially recognised as an authentic child voice, as was also the case in von Kempelen's demonstrations in the late 18th century.
This contribution deals with the representation of verbs with multiple meanings or senses in general monolingual dictionaries. Criteria for differentiating senses in dictionary entries have traditionally been formulated with respect to the vocabulary in general. This paper argues that, while some criteria do indeed apply to the entire lexicon, many of them are relevant only to specific semantic classes. This will be demonstrated by considering two selected verb classes: speech-act verbs and perception verbs. Like verbs of other classes, speech-act verbs and perception verbs may be ambiguous in different but recurrent ways. Since recurrent patterns of ambiguity are always typical of particular semantic classes, class-specific semantic criteria are formulated to decide whether a particular ambiguous speech-act or perception verb should be treated as polysemous or homonymous in dictionary entries. In addition to these class-specific semantic criteria, the semantic-syntactic criterion of identity or difference of argument structure is suggested for the lexicographical representation of verbs which cannot be classified as polysemous or homonymous on the basis of semantic criteria alone. According to the suggested argument-structure criterion, these verbs should be treated as polysemous when their senses correlate with identical argument structures and as homonymous when their senses correlate with different argument structures. As opposed to the semantic criteria suggested, the semantic-syntactic criterion of identity vs. difference of argument structure applies to verbs of different semantic classes. However, as will be illustrated by the discussion of the different senses of smell, it may sometimes force us to treat different but related senses as corresponding to two distinct lexical items.
In order to solve this problem, the criteria suggested are supplemented by a preference rule stating that semantic criteria apply prior to the semantic-syntactic criterion of identity vs. difference of argument structure...
COSMAS II
(2008)
Der, die oder das Nutella
(2008)
This article gives an overview of the functional diversity of the noun phrase in contemporary German. Pragmatically, a distinction is made between referential and non-referential use. The latter divides further into descriptive and naming use. Syntactically, noun phrases can function as arguments, predicatives and adverbials. These functions differ mainly in the distribution of morphological case and thematic roles. With regard to internal structure, noun phrases with and without an article must be distinguished. Noun phrases with an article are the normal case in German, whereas those without an article are restricted to special contexts of use. Closely connected with this is the definiteness distinction: definite noun phrases contain a definiteness marker, indefinite ones do not. Semantically, a distinction must be made between individual and mass readings and between particular and generic readings. The article is intended as a guide for students and teachers of German linguistics and of German as a foreign language.
This paper discusses the classification of nouns and/or noun phrases as count and non-count in German and Brazilian Portuguese. We propose a structural model, valid for both languages, in which countability is built compositionally at the level of the noun phrase by means of three syntactic-semantic features: [±individuated], [±incremented] and [±delimited]. The value of the first feature is fixed by the quantifier, that of the second by number, and that of the third by the noun. In German, all three features contribute to the constitution of countability, the third being the least important. In Brazilian Portuguese, only the first two prove productive, while the third is irrelevant. This amounts to saying that count and non-count nouns are not distinguished in the lexicon of Brazilian Portuguese. For both languages, the proposals are illustrated with authentic examples of usage.
This article develops seven theses on the concept of coherence and other basic concepts of discourse analysis and text linguistics. The first part begins with some historical observations on the notions of text, discourse and communication. The second part discusses the relations between coherence and cohesion, between intertextuality and polyphony, and between coherence and intertextuality; it defines cohesion as a special type of coherence and polyphony as a special type of intertextuality, and argues that the classical notions of coherence and intertextuality represent opposing perspectives within text linguistics. The third part seeks a redefinition of coherence that can explain this concept simultaneously for discourse, cognition and text. It rejects definitions of coherence as the result of the constitution of meaning and as the stable target state of a system, and proposes to define it as the relative local uniformity of a system according to parameters considered relevant by the observer. The final section postulates that coherence and incoherence are equally necessary within any natural system to guarantee its historical evolution.
Belemnon's Curiöses Bauern-Lexicon (CBL) of 1728 is an unusual dictionary of difficult expressions and phrases (almost exclusively foreign words) that were used incorrectly by uneducated speakers of the early 18th century (the "Bauern", i.e. peasants). The CBL lists around 800 of these foreign words alphabetically and, after brief information on correct pronunciation, meaning and usage, contrasts them with the respective corruptions or misuses, mostly illustrated by (often comical) usage examples. This contribution first describes the dictionary's outward form, its transmission and later reception, its aims and addressees, and its macro- and microstructure. The full inventory of correct and incorrect word forms is then surveyed and sorted in two ways: first in the order of the dictionary, to gain an overview of its macrostructure, and then, reversing the user's perspective, as an alphabetical list of the 2000 "Falschwörter" (wrong words) with the correct form(s) underlying each. A first review then reveals various types of misuse, depending on the social and dialectal environment of the language users. In the background is the question of how far the CBL is a language-historical source for the everyday language of the early 18th century: does it primarily serve to amuse educated circles at the expense of the less educated, to whom invented, particularly ridiculous linguistic blunders may also have been attributed, or does it actually document the deficient use of foreign words by language users from the rural lower class of its time?
A photographic reproduction of the CBL is included as a PDF file, which is intended to give researchers easier access to this source text, interesting in several respects, until a critical edition, hopefully soon available, appears.
Data and transcription
(2008)
Using different constructions with the German item verstehen (Engl. understand), the current study addresses the relationship between lexical and constructional meaning. Construction grammar and cognitive grammar reject the theoretical distinction between (a semantic) lexicon and (a formal) syntax (e.g., Langacker: 2000). Instead, they take constructions to be the units of linguistic competence. It is claimed that constructions consist of form-meaning-pairings (e.g. Goldberg: 1995; Croft: 2001). From this view, it follows that formal variation should result in functional variation. Lexical items should therefore acquire different meanings depending on the constructions in which they occur. To test this claim, 300 instances of uses of the German lexical item verstehen in talk-in-interaction were inspected for the local meanings verstehen acquires in each case. The article compares the semantics of verstehen in two different constructions: the discourse marker verstehst du? (Engl. do you understand?) and the negative construction [NP] nicht verstehen [COMP]. The data show a polysemic spectrum of meanings of verstehen, which is similar for both constructions. The precise local meaning of verstehen in most cases depends on pragmatic and discursive factors and is not provided for by the constructions themselves. There are, however, subtypes of the two constructions that satisfy the condition of being a form-meaning-pair. As a conclusion, some prospects for the conceptualization of different sources of meaning within a construction grammar approach are suggested.
Emotionale Kommunikation
(2008)
This article examines the interrelation between communicative behavior and emotion. First, it clarifies the concepts of emotion (section 2) and communication (section 3). Then, it outlines the need to develop a model for emotions in communicative interaction (section 4). Communicative behavior and emotion are interdependent: on the one hand, communicative behavior can influence a person’s own emotions and those of another person and, on the other hand, emotions can affect a person’s own and another person’s communicative behavior (section 5).
EuroGr@mm
(2008)
This paper presents the Kicktionary, a multilingual (English - German - French) electronic lexical resource of the language of football. In the Kicktionary, methods from corpus linguistics and two approaches to lexical semantics - the theory of frame semantics and the concept of semantic relations - are combined to construct a lexical resource in which the user can explore relationships between lexical units in various ways. This paper explains the theoretical background of the Kicktionary, sketches the data and methods which were used in its construction, and describes how the resulting resource is presented to users via a set of hyperlinked webpages.
Rescuing Legacy Data
(2008)
This paper discusses issues that arise in the transformation of electronic language data from outdated to modern, sustainable formats. We first describe the problem and then present four different cases in which corpora of spoken language were converted from legacy formats to an XML-based representation. For each of the four cases, we describe the conversion workflow and discuss the difficulties that we had to overcome. Based on this experience, we formulate some more general observations about transforming legacy data and conclude with a set of best practice recommendations for a more sustainable handling of language corpora.
This paper presents the results of a joint effort of a group of multimodality researchers and tool developers to improve the interoperability between several tools used for the annotation and analysis of multimodality. Each of the tools has specific strengths so that a variety of different tools, working on the same data, can be desirable for project work. However, this usually requires tedious conversion between formats. We propose a common exchange format for multimodal annotation, based on the annotation graph (AG) formalism, which is supported by import and export routines in the respective tools. In the current version of this format the common denominator information can be reliably exchanged between the tools, and additional information can be stored in a standardized way.
In this paper, the authors describe a semi-automated approach to refining the dictionary-entry structure of the digital version of the Wörterbuch der deutschen Gegenwartssprache (WDG, en.: Dictionary of Present-day German), a dictionary compiled and published between 1952 and 1977 by the Deutsche Akademie der Wissenschaften that comprises six volumes with over 4,500 pages containing more than 120,000 headwords. We discuss the benefits of such a refinement in the context of the dictionary project Digitales Wörterbuch der deutschen Sprache (DWDS, en.: Digital Dictionary of the German Language). In the current phase of the DWDS project, we aim to integrate multiple dictionary and corpus resources for the German language into a digital lexical system (DLS). In this context, we plan to expand the current DWDS interface with several special-purpose components, which are adaptive in the sense that they offer specialized data views and search mechanisms for different dictionary functions (e.g. text comprehension, text production) and different user groups (e.g. journalists, translators, linguistic researchers, computational linguists). One prerequisite for generating such data views is selective access to the lexical items in the article structure of the dictionaries under study. For this purpose, the representation of the eWDG has to be refined. The focus of this paper is on the semi-automated approach used to transform the eWDG into a refined version in which the main structural units can be explicitly accessed. We will show how this refinement opens new and flexible ways of visualizing and querying the lexicographic content of the refined version in the context of the DLS project.
There has been a long tradition of discussing the advantages and disadvantages of using foreign words in the German language. In the first part of this paper, an historical example of this discussion will be presented. It shows that at the end of the 18th century a highly differentiated approach to this question had been developed. The type of functional reasoning applied there could also be useful for the present discussion about the influence of English on the German language. A functional interpretation of the use of indigenous and foreign words respectively in a language like German unavoidably leads to the conclusion that the use of elements of foreign origin is an integral part of what it means to be a modern European language. Of course languages differ in the ways in which they technically deal with this fact. To document the fact that the integration of the European tradition of mutual cultural and linguistic contact is a characteristic feature of European languages, and that different languages deal with this in technically different ways, the second part of this article compares a German non-fictional text with its counterparts in seven other European languages.
Only since the 19th century has standard German in its spoken form gained importance among broad sections of the population. Until then, most of the population spoke one of those regional varieties of German which, under the impact of the spread of the standard language and of so-called colloquial languages (Umgangssprachen), underwent a shift in function, came to be understood as the pole opposite the standard language, and were thus scientifically described and ideologically integrated as ‘dialect’. Since the middle of the 20th century at the latest, language use has been changing in a direction that makes such a dichotomous classification appear obsolete. Over the last two decades, an accelerated, far-reaching and profound convergence towards the standard language has been observed in speech as well. This has consequences for normative conceptions of such a form of language, for which the picture of German as a pluricentric language no longer provides a sufficient basis. A separate question is how these developments can be adequately modelled and what role the category of regionality plays in this.
In this volume, ageing is examined as a task that all people have to cope with, albeit in different ways, and in which they actively participate. Ageing is therefore not something that merely happens to a person; it takes place in a social process in which those involved engage with ageing and shape it interactively. As a task, ageing thus also implies reflecting on the changes that occur over the life course and working through them interactively and communicatively. In coping communicatively with these changes, identity work is carried out at the same time and aspects of age identity are formed. These interrelations between ageing, communication and identity work are worked out on the basis of excerpts from authentic conversations and examined with conversation-analytic methods. In the appendix, two long transcript excerpts give insight into the communicative practices of older people and provide material for further analyses.
Between 1884 and 1900, Germany established protectorates in large areas of the South Pacific. The authorities assumed that the linguistically extremely diverse areas would pose communication problems. Thus the question arose whether German should become the lingua franca in the South Pacific. After a controversial discussion, the German government implemented language policies to promote the German language in the colonies. This chapter shows why, on the one hand, German language policies were doomed to failure and why, on the other, they unintentionally supported other linguistic developments such as the introduction of borrowings from German into indigenous languages, the development of German settler varieties, and the spread of pidgin languages.
In spring 2002, we celebrated the inauguration of the first German-Russian-Jewish kindergarten in Berlin. Nowadays, there are seven bilingual German-Russian kindergartens with 460 places and 78 bilingual kindergartens with other combinations of languages [SENBWF]. This may not be enough, taking into account the large proportion of immigrants in the population of Berlin. And yet, much progress has been achieved, endorsing the fact that German society has begun to change its attitude towards other languages on its territory. The initial demand for German monolingualism first changed into societal tolerance of multilingualism and eventually into recognition of the value of multilingualism. This process is a very slow one, and it is not yet complete. In my article, I would like to look at the development in the last few years of the political framework that has made possible the opening of bilingual kindergartens in Berlin, and also to consider what has hampered this process until now. I would like to emphasise the three most important political spheres: linguistic, educational and integrational.
The contribution presents recent developments in language policy, education policy and integration policy in Germany that reveal a new attitude towards multilingualism and make the creation of bilingual educational institutions possible. This contribution has also been published in an English version entitled "The political framework for creation and development of bilingual Kindergartens in Berlin", which is accessible via the IDS document server. The German version of the contribution is entitled "Politische Rahmenbedingungen für zweisprachige Kindertagesstätten in Berlin". It has not been published but is likewise available via the IDS document server.
Lexikografie im Internet
(2008)
This article focuses on German-language collocation research and lexicographic practice from a corpus linguistic perspective. Although there is no dictionary called “Deutsches Kollokationswörterbuch” (German collocation dictionary), the collocation perspective is gaining popularity in linguistic research and dictionary work in the German-speaking area. On the one hand, this tendency is due to the growing number of studies dealing with German as a contrast language and works on foreign language didactics. On the other hand, powerful electronic resources such as large corpora and lexical databases, which are nowadays available for the German language, are recognised as a valuable empirical basis. Nevertheless, the application of novel corpus linguistic methods in lexicographic practice is still unsatisfactory. Therefore, this article concludes by discussing innovative aspects of corpus linguistic empirical research on the basis of collocations. These ideas are presented as an incentive for further research as well as practical application.