410 Linguistik
In their analysis of methods that participants use to manage the realization of practical courses of action, Kendrick and Drew (2016/this issue) focus on cases of assistance, where the need to be addressed is Self’s, and Other lends a helping hand. In our commentary, we point to other forms of cooperative engagement that are ubiquitously recruited in interaction. Imperative requests characteristically expect compliance on the grounds of Other’s already established commitment to a wider and shared course of actions. Established commitments can also provide the engine behind recruitment sequences that proceed nonverbally. And forms of cooperative engagement that are well glossed as assistance can nevertheless be demonstrably oriented to established commitments. In sum, we find commitment to shared courses of action to be an important element in the design and progression of certain recruitment sequences, where the involvement of Other is best defined as contribution. The commentary highlights the importance of interdependent orientations in the organization of cooperation. Data are in German, Italian, and Polish.
Psychological research has emphasized the importance of narrative for a person’s sense of self. Building a coherent narrative of past events is one objective of psychotherapy. However, in guided self-help therapy the patient has to develop this narrative autonomously. Identifying patients’ narrative skills in relation to psychological distress could provide useful information about their suitability for self-help. The aim of this study was to explore whether the syntactic integration of clauses into narrative in texts written by prospective psychotherapy patients was related to mild to moderate psychological distress. Cross-clausal syntax of texts by 97 people who had contacted a primary care mental health service was analyzed. Severity of symptoms associated with mental health difficulties was assessed by a standardized scale (Clinical Outcomes in Routine Evaluation outcome measure). Cross-clausal syntactic integration was negatively correlated with the severity of symptoms. A multiple regression analysis confirmed that the use of simple sentences, finite complement clauses, and coordinated clauses was associated with symptoms (R² = .26). The results suggest that the analysis of cross-clausal syntax can provide information on patients’ narrative skills in relation to distressing events and can therefore provide additional information to support treatment decisions.
Cognitive linguists have long been interested in analogies people habitually use in thinking and speaking, but little is known about the nature of the relationship between verbal behaviour and such analogical schemas. This article proposes that discourse metaphors are an important link between the two. Discourse metaphors are verbal expressions containing a construction that evokes an analogy negotiated in the discourse community. Results of an analysis of metaphors in a corpus of newspaper texts support the prediction that regular analogies are form-specific, i.e., bound to particular lexical items. Implications of these results for assumptions about the generality of habitual analogies are discussed.
This article explores the role that metaphors play in the ideological interpretation of events. Research in cognitive linguistics has brought rich evidence of the enormous influence that body experience has on (metaphorical) conceptualization. However, the role of the cultural net in which an individual is embedded has mostly been neglected. As a step towards the integration of cultural experience into the experientialist framework in cognitive metaphor research I propose to differentiate two ideal types of motivation for metaphor: correlation and intertextuality. Evidence for the important role that intertextual metaphors play in ideological discourse comes from an analysis of Polish newspaper discourse on the tenth anniversary of the end of communism.
When formulating a request for an object, speakers can choose among different grammatical resources that would all serve the overall purpose. This paper examines the social contexts indexed and created by the choice of the turn format can I have x to request a shared good (the pepper grinder, a tissue from a box on the table, etc.) in British English informal interaction. The analysis is based on a video corpus of approximately 25 h of everyday interaction among family and friends. In its home environment, a request in the format can I have x treats the other as being in control over the relevant material object, a control that is the contingent outcome of ongoing courses of action. This contingent control over a shared good produces an obligation to make it available. This analysis is supported by an examination of similarly formatted request turns in other languages, of can I have x in another interactional environment (after a relevant offer has been made) in British English, and of deviant cases. The results highlight the intimate connection of request format selection to the present engagements of (prospective) request recipients.
Ethnolinguistic research has gained considerable popularity over the last two decades. The most important metaphorical formula describing the main object of this research is the “linguistic picture of the world” (językowy obraz świata). Since large-scale comparative studies are now being planned, it may be worth considering what this conception of ethnolinguistics leaves out. The visual metaphor of pictures implies that speakers are able to step outside the world and look at it (and name it) from the outside. The article discusses two consequences of this metaphor that may cause problems. First, isolating language from the world of human action, of which language is after all a part, leads to the adoption of a cognitivist model of meaning as a separate stream of communication. Such a model does not fit the everyday experience of the transparency of language. Second, isolating language from life encourages the use of “timeless” methods and the study of words abstracted from the situation in which they were used (if not taken out of context). By adopting such metaphors and methods, we may lose sight of a considerable part of what matters for everyday language, the object of study of ethnoscience.
In this article I examine the opposition between own (swój) and foreign (obcy), which is fundamental to the linguistic picture of the world, in sample texts belonging to Polish and German ideological (political) discourse. Following van Dijk, I assume that ideological discourse is characterized by the establishment and reproduction of a distinction between the in-group and other groups. The function of ideological discourse is to legitimize the actions and beliefs of the in-group and to delegitimize the actions and beliefs of other groups. In popular Polish and German magazines dealing with political topics (Wprost and Spiegel), this understanding of own and foreign appears to be accepted. The concretization of the abstract concepts swój and obcy is not fixed, however, but functionally variable, depending on who is to be perceived as belonging to the in-group and who is to be excluded from it.
Relational adjectives, that is, adjectives that are derived from nouns and that, in attributive construction with a head noun, express an unspecific relation between the concept of the head and the concept of the base, play an important role in the classical languages. Starting from the silvestris musa, Virgil’s woodland muse, this paper traces the after-effects of this pattern in European languages: French, English, and above all German. The semantic function of such adjectives is assigned to the functional domain of ‘classificatory modification’. Cross-linguistic commonalities and differences are worked out. Relational adjectives in Polish and Hungarian, the other comparison languages of the project „Grammatik des Deutschen im europäischen Vergleich“, are also briefly included. Against this background, the question of the relationship between universal, language-family-specific, areal, and language-specific properties of the construction pattern, and of the degree of Latin influence, can be formulated more precisely.
Introduction
(2008)
This paper presents EXMARaLDA, a system for the computer-assisted creation and analysis of spoken language corpora. The first part contains some general observations about technological and methodological requirements for doing corpus-based pragmatics. The second part explains the system’s architecture and gives an overview of its most important software components: a transcription editor, a corpus management tool, and a corpus query tool. The last part presents some corpora which have been or are currently being compiled with the help of EXMARaLDA.
Precisely because the topic of this year’s workshop has been the subject of various research traditions for several decades and is today studied in equally diverse ways, the conference was intended to present current projects from different disciplines and to discuss them across disciplinary boundaries. The aim of the conference was to offer physicians, psychologists, and conversation analysts a platform for getting in touch with one another, for jointly discussing the approaches, research interests, and methods presented, and for working out where these differ from their own.
Geschlossene Klassen?
(2002)
The authors present a multilingual electronic database of lexical items with idiosyncratic occurrence patterns. Currently, our database consists of: (1) a collection of 444 bound words in German; (2) a collection of 77 bound words in English; (3) a collection of 58 negative polarity items in Romanian; (4) a collection of 84 negative polarity items in German; and (5) a collection of 52 positive polarity items in German. The database is encoded in XML and is available via the Internet, offering dynamic and flexible access.
The TEI has served for many years as a mature annotation format for corpora of different types, including linguistically annotated data. Although it is based on the consensus of a large community, it does not have the legal status of a standard. During the last decade, efforts have been undertaken to develop definitive de jure standards for linguistic data that not only act as a normative basis for the exchange of language corpora but also address recent advancements in technology, such as web-based standards, and the use of large and multiply annotated corpora.
In this article we will provide an overview of the process of international standardization and discuss some of the international standards currently being developed under the auspices of ISO/TC 37, a technical committee called “Terminology and other Language and Content Resources”. After that the relationship between the TEI Guidelines and these specifications, according to their formal model, notation format, and annotation model, will be discussed. The conclusion of the paper provides recommendations for dealing with language corpora.
The decline of linguistic competence (or: language attrition) is a phenomenon encountered in various contexts when access to what has been acquired in a language (L1, L2, or foreign language) diminishes. Research on the subject shows, for example, that the influence of the L2 makes it difficult for L1 speakers to exploit all the stylistic or pragmatic variation that their L1 would normally allow. The question is what is actually lost: is it linguistic competence, the mental representation of knowledge, that is affected, or is it rather a limitation of access to and control over acquired knowledge that itself remains intact? In the context of current discussions about the advantages and risks of plurilingualism, it is not only interesting but indeed necessary to deepen research into the processes of attrition. Moreover, for plurilingual speakers to benefit fully from their potential, society must recognize and concretely value these competences and encourage speakers to display their bilingual identity with full confidence and transparency.
Vom IDS an die Uni Trier
(1995)
This paper discusses a theoretical and empirical approach to language fixedness that we have developed at the Institut für Deutsche Sprache (IDS) (‘Institute for German Language’) in Mannheim in the project Usuelle Wortverbindungen (UWV) over the last decade. The analysis described is based on the Deutsches Referenzkorpus (‘German Reference Corpus’; DeReKo) which is located at the IDS. The corpus analysis tool used for accessing the corpus data is COSMAS II (CII) and – for statistical analysis – the IDS collocation analysis tool (Belica, 1995; CA). For detecting lexical patterns and describing their semantic and pragmatic nature we use the tool lexpan (or ‘Lexical Pattern Analyzer’) that was developed in our project. We discuss a new corpus-driven pattern dictionary that is relevant not only to the field of phraseology, but also to usage-based linguistics and lexicography as a whole.
In the project Wechselwirkungen zwischen linguistischen Verfahren, Methoden und Algorithmen, the representation and modelling of variance is covered on the linguistic side by, among other things, the meta-lemma list (Metalemmaliste), which links lemmas of the New High German (NHG) standard language with diachronically and diatopically marked lemmas. The temporally and regionally marked variants come from dictionaries of the Trierer Wörterbuchnetz. The lemmas of the NHG standard language are provided in a corpus-generated base lemma list (Basislemmaliste, BLL), which records, alongside the lemmas, their part(s) of speech and frequency of use. The lemmas of the BLL form the common third term onto which the lemmas of the variety dictionaries are mapped in the meta-lemma list; the BLL lemmas of the NHG standard language constitute the meta-lemmas of the meta-lemma list. In its function as tertium comparationis, the BLL is meant to reflect language use in present-day Standard German. This ensures that the various instances of the variety lemmas are mapped onto lemmas that are currently in use in the standard language. Via the meta-lemma, the equivalent expressions in the varieties can be found without any knowledge of their regional or historical forms. The semasiological access to all variety lemmas via a lemma of the NHG standard language is implemented on the basis of an XML-based database that follows current standards for the encoding of dictionary entries (TEI P5). The meta-lemma list is designed to be dynamic and network-like, so that new subareas, branches, and ontologies can be attached at any time (cf. TV 2). The variety lemmas are linked to the NHG standard-language lemmas from the BLL by means of algorithms that were implemented in TV 3.2 (Informatik Würzburg).
As the nature of negative polarity items (NPIs) and their licensing contexts is still under much debate, a broad empirical basis is an important cornerstone to support further insights in this area of research. The work discussed in this paper is intended as a contribution to realizing this objective. The authors briefly introduce the phenomenon of NPIs and outline major theories about their licensing and also various licensing contexts before discussing our major topics: Firstly, a corpus-based retrieval method for NPI candidates is described that ranks the candidates according to their distributional dependence on the licensing contexts. Our method extracts single-word candidates and is extended to also capture multi-word candidates. The basic idea for automatically collecting NPI candidates from a large corpus is that an NPI behaves like a kind of collocate to its licensing contexts. Manual inspection and interpretation of the candidate lists identify the actual NPIs. Secondly, an online repository for NPIs and other items that show distributional idiosyncrasies is presented, which offers an empirical database for further (theoretical) research on these items in a sustainable way.
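The collocation idea in this abstract can be illustrated with a minimal Python sketch. It is not the authors' retrieval method: the licensor list, the function name `rank_npi_candidates`, the whitespace tokenization, and the toy sentences are all assumptions made for illustration. Each word is scored by the share of its sentences that also contain a licensor.

```python
from collections import Counter

# Hypothetical licensor list; real licensing contexts are far richer.
LICENSORS = {"not", "never", "no", "nobody"}


def rank_npi_candidates(sentences, min_freq=2):
    """Rank words by the share of their sentences that contain a licensor."""
    total = Counter()
    licensed = Counter()
    for sentence in sentences:
        tokens = sentence.lower().split()
        has_licensor = any(token in LICENSORS for token in tokens)
        # Count each word once per sentence and skip the licensors themselves.
        for word in set(tokens) - LICENSORS:
            total[word] += 1
            if has_licensor:
                licensed[word] += 1
    scores = {w: licensed[w] / total[w] for w in total if total[w] >= min_freq}
    # Highest dependence first; ties are broken alphabetically.
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))


sentences = [
    "she has not seen him ever",
    "nobody has ever seen him",
    "she has seen him often",
    "he has never been there ever",
    "she has often been there",
]
ranking = rank_npi_candidates(sentences)
```

In this toy sample "ever" occurs only in licensed sentences and therefore heads the list. As the abstract notes, manual inspection is still needed to separate actual NPIs from other high-scoring candidates.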
This article presents a revised version of GAT, a transcription system first developed by a group of German conversation analysts and interactional linguists in 1998. GAT tries to follow as many principles and conventions as possible of the Jefferson-style transcription used in Conversation Analysis, yet proposes some conventions which are more compatible with linguistic and phonetic analyses of spoken language, especially for the representation of prosody in talk-in-interaction. After ten years of use by researchers in conversation and discourse analysis, the original GAT has been revised, against the background of past experience and in light of new necessities for the transcription of corpora arising from technological advances and methodological developments over recent years. The present text makes GAT accessible for the English-speaking community. It presents the GAT 2 transcription system with all its conventions and gives detailed instructions on how to transcribe spoken interaction at three levels of delicacy: minimal, basic and fine. In addition, it briefly introduces some tools that may be helpful for the user: the German online tutorial GAT-TO and the transcription editing software FOLKER.
This paper presents an updated version of the Gesprächsanalytisches Transkriptionssystem (GAT). Since GAT has been widely used in conversation research since its first presentation in 1998, it was time to revise it carefully in light of experience to date and of new demands on transcription. This text presents the updated GAT 2 transcription system with all its old and new conventions, and attempts to clarify known cases of doubt and to remedy known weaknesses of the first version. GAT 2 gives detailed instructions for producing conversation-analytic transcripts at three levels of detail (minimal, basic, and fine transcript), as well as new proposals for representing more complex phenomena in special lines. In addition, some supplementary tools were developed for GAT 2, which are briefly presented in the appendix: the online tutorial GAT-TO and the transcription editor FOLKER.
With the cGAT handbook, the FOLK project provides a guideline for computer-assisted transcription according to GAT 2. The handbook was developed on the basis of transcription practice in FOLK and contains a large number of authentic examples, which can also be retrieved, together with the corresponding audio, via the Datenbank für Gesprochenes Deutsch (DGD).
This paper presents the results of a joint effort of a group of multimodality researchers and tool developers to improve the interoperability between several tools used for the annotation and analysis of multimodality. Each of the tools has specific strengths so that a variety of different tools, working on the same data, can be desirable for project work. However, this usually requires tedious conversion between formats. We propose a common exchange format for multimodal annotation, based on the annotation graph (AG) formalism, which is supported by import and export routines in the respective tools. In the current version of this format the common denominator information can be reliably exchanged between the tools, and additional information can be stored in a standardized way.
This paper describes a new research initiative addressing the issue of sustainability of linguistic resources. The initiative is a cooperation between three collaborative research centres in Germany – the SFB 441 “Linguistic Data Structures” in Tübingen, the SFB 538 “Multilingualism” in Hamburg, and the SFB 632 “Information Structure” in Potsdam/Berlin. The aim of the project is to develop methods for sustainable archiving of the diverse bodies of linguistic data used at the three sites. In the first half of the paper, the data handling solutions developed so far at the three centres are briefly introduced. This is followed by an assessment of their commonalities and differences and of what these entail for the work of the new joint initiative. The second part then sketches seven areas of open questions with respect to sustainable data handling and gives a more detailed account of two of them – integration of linguistic terminologies and development of best practice guidelines.
Rescuing Legacy Data
(2008)
This paper discusses issues that arise in the transformation of electronic language data from outdated to modern, sustainable formats. We first describe the problem and then present four different cases in which corpora of spoken language were converted from legacy formats to an XML-based representation. For each of the four cases, we describe the conversion workflow and discuss the difficulties that we had to overcome. Based on this experience, we formulate some more general observations about transforming legacy data and conclude with a set of best practice recommendations for a more sustainable handling of language corpora.
This paper describes EXMARaLDA, a system for computer transcription of spoken discourse developed and used by the SFB "Mehrsprachigkeit" at the University of Hamburg. EXMARaLDA consists of several DTDs for XML coding of transcription data and some input and output tools for these formats. Apart from being a transcription system in its own right, EXMARaLDA also plays the role of a mediator between older existing data formats at the SFB and between these formats and a planned database of multilingual spoken discourse.
EXMARaLDA is a system for computer transcription of spoken discourse that is being developed at the SFB ‚Mehrsprachigkeit’ as a basis of a multilingual discourse database into which the transcriptions in use at the SFB will be integrated at a later point in time. The present paper describes the theoretical background of the development – a formal model of discourse transcription based on the annotation graph formalism (Bird/Liberman (2001)) – and its practical realisation in the form of an XML-based data format and several tools for input, output and manipulation of the data.
This paper describes EXMARaLDA, an XML-based framework for the construction, dissemination and analysis of corpora of spoken language transcriptions. Departing from a prototypical example of a “partitur” (musical score) transcription, the EXMARaLDA “single timeline, multiple tiers” data model and format is presented alongside with the EXMARaLDA Partitur-Editor, a tool for inputting and visualizing such data. This is followed by a discussion of the interaction of EXMARaLDA with other frameworks and tools that work with similar data models. Finally, this paper presents an extension of the “single timeline, multiple tiers” data model and describes its application within the EXMARaLDA system.
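The “single timeline, multiple tiers” idea can be pictured with a small Python sketch. This is only an illustration under stated assumptions, not the EXMARaLDA XML format or its API: the class names `Event`, `Tier`, and `Transcription`, the method `events_covering`, and the sample data are all hypothetical. Every tier's events point into one shared, ordered list of timeline points, which is what allows simultaneous events to be aligned as in a musical score.

```python
from dataclasses import dataclass, field


@dataclass
class Event:
    start: str  # id of the timeline point where the event begins
    end: str    # id of the timeline point where the event ends
    text: str


@dataclass
class Tier:
    speaker: str
    category: str  # e.g. "verbal" or "nonverbal" (illustrative labels)
    events: list = field(default_factory=list)


@dataclass
class Transcription:
    timeline: list  # ordered ids of timeline points shared by all tiers
    tiers: list

    def events_covering(self, point):
        """Return (speaker, text) for every event whose span includes `point`."""
        order = {p: i for i, p in enumerate(self.timeline)}
        at = order[point]
        return [
            (tier.speaker, event.text)
            for tier in self.tiers
            for event in tier.events
            if order[event.start] <= at < order[event.end]
        ]


transcription = Transcription(
    timeline=["T0", "T1", "T2", "T3"],
    tiers=[
        Tier("A", "verbal", [Event("T0", "T2", "okay so"), Event("T2", "T3", "right")]),
        Tier("B", "verbal", [Event("T1", "T2", "mhm")]),
    ],
)
overlap = transcription.events_covering("T1")
```

Because both speakers' events refer to the same timeline points, the overlap between A's "okay so" and B's "mhm" at T1 falls out of a simple comparison of positions on the shared timeline.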
This paper attempts a new look at computer assisted transcription as it is commonly practised within the fields of discourse analysis and language acquisition studies. The first part proposes a bridge between discourse analytical methodology and text technological methods with the concept of modelling as its central idea. The second part demonstrates the EXMARaLDA system, a set of formats and tools for computer assisted transcription that builds on the ideas developed in the first part and implements them in a way that can lead to significant improvement in current research practice.
The aim of this paper is to explore how speaking and acting that we know from everyday life are to be assessed when they appear on television, and above all in so-called reality TV. Mishaps of the kind familiar from news broadcasts such as the Tagesschau offer a good way into illustrating this problem.
A polarity-sensitive item (PSI), as traditionally defined, is an expression that is restricted to either an affirmative or negative context. PSIs like ‘lift a finger’ and ‘all the time in the world’ subserve discourse routines like understatement and emphasis. Lexical–semantic classes are increasingly invoked in descriptions of the properties of PSIs. Here, we use English corpus data and the tools of Frame Semantics (Fillmore, 1982, 1985) to explore Israel’s (2011) observation that the semantic role of a PSI determines how the expression fits into a contextually constructed scalar model. We focus on a class of exceptions implied by Israel’s model: cases in which a given PSI displays two countervailing patterns of polarity sensitivity, with attendant differences in scalar entailments. We offer a set of case studies of polarity-sensitive expressions – including verbs of attraction and aversion like ‘can live without’, monetary units like ‘a red cent’, comparative adjectives and time-span adverbials – that demonstrate that the interpretation of a given PSI in a given polar context is based on multiple factors. These factors include the speaker’s perspective on and affective stance towards the described event, available inferences about causality and, perhaps most critically, particulars of the predication, including the verb or adjective’s frame membership, the presence or absence of an ability modal like can, the grammatical construction used and the range of contingencies evoked by the utterance.
Bilingual Kindergarten programmes. The interaction of language management and language attitudes
(2015)
Die Kausalkonjunktionen denn, weil, da im Deutschen und perché, poiché, siccome im Italienischen
(2011)
This paper deals with the German causal conjunctions denn, weil, and da and their (partial) Italian equivalents perché, poiché, and siccome. They are compared syntactically and semantically, with the aim of identifying commonalities and differences between them.
Benefactive construction
(2013)
Von Buchscanner bis WorldCat : die Bibliothek des Instituts für Deutsche Sprache stellt sich vor
(2010)
This article discusses ethnographic research in the sociology of youth and problematizes its limits and scope. Based on a critique of research practice to date, it develops a proposal for a conceptual and methodological reorientation of ethnographic youth research. The discussion does not start from a theory-driven perspective but examines relevant studies from a methodological angle. It becomes clear that researchers remain bound to the actors’ point of view, because their data consist of the actors’ reconstructive accounts of everyday practice (= secondary data status) rather than of documentation of everyday practice itself. Research has thus not yet arrived at the actors’ everyday practice. This is particularly evident in the case of the so-called youth-sociological interview. As an alternative, the theoretical and methodological contours of an ethnography of youth communication cultures on a conversation-analytic basis are outlined. Finally, the fruitfulness of this research perspective for traditional and novel questions in the sociology of youth is discussed.
The Online-Wortschatz-Informationssystem Deutsch (OWID) is a digital dictionary portal of the Institut für Deutsche Sprache. All lexicographic data brought together in it are structured in a fine-grained way on an XML basis. Storage, management, and retrieval of these data are handled by the Oracle-based Electronic Dictionary Administration System (EDAS). This paper explains the XML-based modelling of the data, XML-specific questions of storage, and retrieval with XPath and SQL/XML.
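As a rough illustration of XPath retrieval over finely structured XML dictionary data, here is a minimal Python sketch using the standard library's `xml.etree.ElementTree`, which supports a limited XPath subset. The element and attribute names (`dictionary`, `entry`, `lemma`, `sense`, `pos`) and the sample entries are invented for this example; they are not the OWID or EDAS schema, and the sketch does not touch Oracle or SQL/XML.

```python
import xml.etree.ElementTree as ET

# Hypothetical dictionary fragment; the real OWID markup is not shown in the source.
XML = """<dictionary>
  <entry id="e1" pos="noun"><lemma>Haus</lemma><sense>building</sense></entry>
  <entry id="e2" pos="verb"><lemma>gehen</lemma><sense>to walk</sense></entry>
  <entry id="e3" pos="noun"><lemma>Baum</lemma><sense>tree</sense></entry>
</dictionary>"""

root = ET.fromstring(XML)

# XPath predicate on an attribute: lemmas of all entries marked as nouns.
noun_lemmas = [el.text for el in root.findall("./entry[@pos='noun']/lemma")]

# Look up a single entry by id and read one of its child elements.
sense_of_e2 = root.find("./entry[@id='e2']/sense").text
```

The finer the markup, the more precisely such path expressions can address individual parts of an entry, which is the practical benefit of fine-grained structuring.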
Digital or electronic lexicography has gained in importance in the last few years. This can be seen in the growing list of publications focusing on this field. In the OBELEX bibliography (http://www.owid.de/obelex/engl), the research contributions in this field are consolidated and are searchable by different criteria. The idea for OBELEX originated in the context of the dictionary portal OWID, which incorporates several dictionaries from the Institute for German Language (www.owid.de). OBELEX has been available online free of charge since December 2008. OBELEX includes articles, monographs, anthologies and reviews published since 2000 that relate to electronic lexicography, as well as some relevant older works. Our particular focus is on works about online lexicography. Systematically evaluated sources are relevant journals like International Journal of Lexicography, Lexicographica, Dictionaries, Lexikos; furthermore Euralex-Proceedings, proceedings of the International Symposium on Lexicography in Copenhagen as well as relevant monographs and anthologies. Information on dictionaries is currently not included in OBELEX; the main focus is on metalexicography. However, we are working on a database with information on online dictionaries as a supplement to OBELEX. All entries of OBELEX are stored in a database. Thus, all parts of the bibliographic entry (such as person, title, publication or year) are searchable. Furthermore, all publications are associated with our keyword list; therefore, a thematic search is possible. The subject language is also noted. With this type of content, the OBELEX bibliography supplements in a useful way other bibliographic projects such as the printed ‘Internationale Bibliographie zur germanistischen Lexikographie und Wörterbuchforschung’ by H. E. Wiegand (Wiegand 2006/2007), the ‘Bibliography of Lexicography’ by R. R. K. Hartmann (Hartmann 2007), and the ‘International Bibliography of Lexicography’ of Euralex (cf. also DeCesaris and Bernal 2006). OBELEX differs from all these bibliographic projects by its strong focus on electronic lexicography and its ability to retrieve bibliographic information.
We define collaborative commentary as the involvement of a research community in the interpretive annotation of electronic records. The goal of this process is the evaluation of competing theoretical claims. The process requires commentators to link their comments and related evidentiary materials to specific segments of either transcripts or electronic media. Here, we examine current work in the construction of technical methods for facilitating collaborative commentary through browser technology. To illustrate the relevance of this approach, we examine seven spoken language database projects that have reached a level of web-based publication that makes them good candidates as targets of collaborative commentary technology. For each database, we show how collaborative commentary can advance the relevant research agendas.
This paper describes work in progress on I5, a TEI-based document grammar for the corpus holdings of the Institut für Deutsche Sprache (IDS) in Mannheim and the text model used by IDS in its work. The paper begins with background information on the nature and purposes of the corpora collected at IDS and the motivation for the I5 project (section 1). It continues with a description of the origin and history of the IDS text model (section 2), and a description (section 3) of the techniques used to automate, as far as possible, the preparation of the ODD file documenting the IDS text model. It ends with some concluding remarks (section 4). A survey of the additional features of the IDS-XCES realization of the IDS text model is given in an appendix.
A text parsing component designed to be part of a system that assists students in academic reading and writing is presented. The parser can automatically add a relational discourse structure annotation to a scientific article that a user wants to explore. The discourse structure employed is defined in an XML format and is based on Rhetorical Structure Theory. The architecture of the parser comprises pre-processing components which provide an input text with XML annotations on different linguistic and structural layers. In the first version these are syntactic tagging, lexical discourse marker tagging, logical document structure, and segmentation into elementary discourse segments. The algorithm is based on the shift-reduce parser by Marcu (2000) and is controlled by reduce operations that are constrained by linguistic conditions derived from an XML-encoded discourse marker lexicon. The constraints are formulated over multiple annotation layers of the same text.
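The control loop described in this abstract can be illustrated with a minimal sketch. It is not the parser presented in the paper: the marker lexicon, the relation names, the `Node` type, and the `JOINT` fallback are all hypothetical, and the single constraint used here (the right-hand segment begins with a known discourse marker) stands in for the multi-layer XML constraints the abstract mentions. Nucleus and satellite roles are omitted.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class Node:
    """A discourse tree node; leaves are elementary discourse segments."""
    text: str
    relation: Optional[str] = None
    children: Tuple["Node", ...] = ()


# Hypothetical marker lexicon (assumption): marker -> relation label.
MARKER_LEXICON = {
    "because": "CAUSE",
    "however": "CONTRAST",
    "for example": "ELABORATION",
}


def licensed_relation(left: Node, right: Node) -> Optional[str]:
    """Return a relation if the right-hand span opens with a known marker."""
    lowered = right.text.lower()
    for marker, relation in MARKER_LEXICON.items():
        if lowered.startswith(marker):
            return relation
    return None


def reduce_top(stack: list, relation: str) -> None:
    """Replace the two topmost spans with one span labelled by relation."""
    right = stack.pop()
    left = stack.pop()
    stack.append(Node(left.text + " " + right.text, relation, (left, right)))


def parse(segments: list) -> Node:
    """Shift-reduce over segments; a reduce needs a licensing marker."""
    if not segments:
        raise ValueError("at least one segment is required")
    stack: list = []
    queue = [Node(s) for s in segments]
    while queue or len(stack) > 1:
        relation = None
        if len(stack) >= 2:
            relation = licensed_relation(stack[-2], stack[-1])
        if relation is not None:
            reduce_top(stack, relation)      # constrained reduce
        elif queue:
            stack.append(queue.pop(0))       # shift the next segment
        else:
            reduce_top(stack, "JOINT")       # assumed fallback, input exhausted
    return stack[0]
```

For instance, `parse(["The parser failed", "because the lexicon was empty"])` returns a single node whose two children are the input segments. The fallback reduce is only there so that the sketch always terminates with one tree; a real system would need a principled treatment of segments that no constraint connects.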
Most research on automated categorization of documents has concentrated on the assignment of one or many categories to a whole text. However, new applications, e.g. in the area of the Semantic Web, require a richer and more fine-grained annotation of documents, such as detailed thematic information about the parts of a document. Hence we investigate the automatic categorization of text segments of scientific articles with XML markup into 16 topic types from a text type structure schema. A corpus of 47 linguistic articles was provided with XML markup on different annotation layers representing text type structure, logical document structure, and grammatical categories. Six different feature extraction strategies were applied to this corpus and combined in various parametrizations in different classifiers. The aim was to explore the contribution of each type of information, in particular the logical structure features, to the classification accuracy. The results suggest that some of the topic types of our hierarchy are successfully learnable, while the features from the logical structure layer had no particular impact on the results.
Linguistic text comparisons are primarily intended to answer the question of whether two or more texts have the same author and/or writer. The term "linguistic fingerprint", which is also used in this context, suggests that a degree of certainty comparable to that of methods from the natural sciences could be achieved. The author, a research associate in the department "Historische Lexikologie und Lexikographie" at the Institut für deutsche Sprache in Mannheim, explains what can actually be expected of linguistic text comparisons in criminalistic and forensic terms.
COSMAS II
(2008)
As a result of legal restrictions, the Google Ngram Corpora datasets are a) not accompanied by any metadata regarding the texts the corpora consist of and b) truncated to prevent indirect inference from an n-gram to the author of the text. Some of the consequences of this strategy are discussed in this article.
Bericht über die 19. Arbeitstagung zur Gesprächsforschung vom 16. bis 18. März 2016 in Mannheim
(2016)
This paper discusses the advantages and disadvantages of combining automated information with lexicographically interpreted information in online dictionaries, namely elexiko, a hypertext dictionary and lexical data information system of contemporary German (http://www.owid.de/elexiko_/index.html), and DWDS, a digital dictionary of 20th century German (http://www.dwds.de). Examples of automatically derived information (e.g. automatically extracted citations from the underlying corpus, lists of paradigmatic relations) and lexicographically compiled information (e.g. information on paradigmatic partners) are provided and evaluated, reflecting on the need to develop guidelines as to how computerised information and lexicographically interpreted information may be combined profitably in online reference works.
Diplomatie mit Diplomen
(1989)
Kommunikation vor Gericht
(1986)
Understanding the design of talk-in-interaction is important in many domains, including speech technology. Although phonetic, linguistic and gestural correlates have been identified for some of the social actions that conversational participants accomplish, it is only recently that researchers have begun to take account of the immediately prior interactional context as an important factor influencing the design of a speaker’s turn. The present study explores the influence of context by focussing on characteristics of short turns produced by one speaker between turns from another speaker. The hypothesis is that the speaker designs her inserted turn as a match to the prior turn when wishing to align with the previous speaker’s agenda. By contrast, non-matching would display that the speaker is non-aligning, preferring instead to initiate a new action, for example. Data are taken from the AMI corpus, focussing on the spontaneous talk of first-language English participants. Using sequential analysis, such short turns are classified as either aligning or non-aligning in accordance with definitions in the Conversation Analysis literature. The degree of prosodic similarity between the inserted turn and the prior speaker’s turn is measured using novel acoustic techniques. The results show that aligning turns are significantly more similar to the immediately preceding turn, in terms of pitch contour, than non-aligning turns. In contrast to the prosodic-acoustic analysis, the results of the gestural analysis indicate that aligning and non-aligning are differentiated by the use of distinct gestures, rather than by the matching (or non-matching) of gestures across the adjacent turns. These results support the view that choice of pitch contour is managed locally, rather than by reference to an intonational lexicon. However, this is not the case for speakers’ use of gesture. The implications of these findings for a model of talk-in-interaction are considered, along with potential applications.
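The abstract does not specify the acoustic techniques used to measure prosodic similarity. As a generic stand-in, and explicitly not the authors' method, the idea of comparing the pitch contours of two adjacent turns can be sketched as a Pearson correlation between length-normalised, z-scored F0 sequences. The function names, the number of resampling points, and the input format (one F0 value in Hz per voiced frame) are assumptions.

```python
from statistics import mean, pstdev


def resample(contour, n):
    """Linearly interpolate a contour to n evenly spaced points."""
    if len(contour) < 2 or n < 2:
        raise ValueError("need at least two input and two output points")
    out = []
    for i in range(n):
        pos = i * (len(contour) - 1) / (n - 1)
        lo = int(pos)
        hi = min(lo + 1, len(contour) - 1)
        frac = pos - lo
        out.append(contour[lo] * (1 - frac) + contour[hi] * frac)
    return out


def z_normalise(values):
    """Remove speaker-specific pitch level and range."""
    mu, sigma = mean(values), pstdev(values)
    if sigma == 0:
        raise ValueError("a flat contour has no shape to compare")
    return [(v - mu) / sigma for v in values]


def contour_similarity(prior_turn, inserted_turn, n=20):
    """Pearson correlation of two contours after time and level normalisation."""
    a = z_normalise(resample(prior_turn, n))
    b = z_normalise(resample(inserted_turn, n))
    return sum(x * y for x, y in zip(a, b)) / n
```

Under this measure, two rising contours produced in different pitch registers count as similar, while a rise followed by a fall does not, which is the kind of contrast the matching hypothesis relies on. Z-scoring is one possible way to compare speakers with different pitch ranges; other normalisations would be equally defensible.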
This contribution presents the theoretical and methodological foundations of the learner's dictionary project DICONALE on the basis of several sample analyses. DICONALE is a bilingual, bidirectional verb dictionary with an onomasiological-conceptual orientation. It is intended both for consultation for production purposes from level B2 upward in the fields of DaF (German as a foreign language) and ELE (Spanish as a foreign language) and for translation into the respective foreign language. It is based on frequency-based data from comparable, electronically available corpora of both languages and is to be made accessible to users online. The dictionary is divided into different conceptual (sub)fields, to which lexical-semantic (mini)paradigms can be assigned. It rests on a modular, multilateral lexicological description model, which presents corpus-based information on form, meaning and use that is relevant both within each language and for cross-linguistic comparison, by means of information on various paradigmatic and syntagmatic relations of verbal and deverbal lexemes.
Emotionale Kommunikation
(2008)
This article examines the interrelation between communicative behavior and emotion. First, it clarifies the concept of emotion (section 2) and the concept of communication (section 3). Then, it outlines the need to develop a model for emotions in communicative interaction (section 4). Communicative behavior and emotion are interdependent: on the one hand, communicative behavior can influence a person’s own emotions and those of another person and, on the other hand, emotions can affect a person’s own and another person’s communicative behavior (section 5).