Discourse metaphors
(2008)
The article introduces the notion of discourse metaphor: relatively stable metaphorical mappings that function as a key framing device within a particular discourse over a certain period of time. Discourse metaphors are illustrated by case studies from three lines of research: on the cultural imprint of metaphors, on the negotiation of metaphors and on cross-linguistic occurrence. The source concepts of discourse metaphors refer to phenomenologically salient real or fictitious objects that are part of interactional space (i.e., can be pointed at, like MACHINES or HOUSES) and/or occupy an important place in cultural imagination. Discourse metaphors change both over time and across the discourses where they are used. The implications of focussing on different types of source domains for our thinking about the embodiment and sociocultural situatedness of metaphor are discussed, with particular reference to recent developments in Conceptual Metaphor Theory. Research on discourse suggests that situatedness is a crucial factor in the functioning and dynamics of metaphor.
‘Linguistic relativity’ has become a major keyword in debates on the psychological significance of language diversity. In this context, the term ‘relativity’ was originally taken on loan from Einstein’s then-recent theories by Edward Sapir (1924) and Benjamin L. Whorf (1940). The present paper assesses how far the idea of linguistic relativity does analogically build on relevant insights in modern physics, and fails to find any substantial analogies. The term was used rhetorically by Sapir and Whorf, and has since been incorporated into a cognitivist research programme that seeks to answer whether ‘language influences thought’. Contemporary research on ‘linguistic relativity’ has developed into a distinct way of studying language diversity, which shares a lot with the universalistic cognitivist framework it opposes, but little with relational approaches in science.
Ethnolinguistic research has gained considerable popularity over the last two decades. The most important metaphorical formula defining the main object of this research is the "linguistic picture of the world". Since large-scale comparative study projects are now being set up, it may be worth considering what this conception of ethnolinguistics leaves out. The visual metaphor of pictures implies that speakers are able to step outside the world and look at it (and name it) from the outside. The article discusses two consequences of this metaphor that may cause problems. First, isolating language from the world of human activities, of which language is after all a part, leads to adopting a cognitivist model of meaning as a separate stream of communication. Such a model does not fit the everyday experience of the transparency of language. Second, isolating language from life encourages the use of "timeless" methods and the study of words abstracted from the situation in which they were used (if not taken out of context). By adopting such metaphors and methods, we may lose sight of a considerable part of what matters for everyday language, the object of study of ethnoscience.
What does the typological-contrastive view contribute to the grammar of German? An interim assessment
(2008)
I take the distance of a few years, and the work done in those years, as an occasion for a brief interim assessment of the project on nominals. This is not an interim assessment in the sense of a quantitatively supported record of work delivered (the published results can be found in the publication list on the project website; cf. <www.ids-mannheim.de/gra/eurostudien.html>) but rather a reflective stocktaking: have the expectations that the project would open up an innovative approach to the grammar of German and yield new insights been fulfilled, or at least proved fulfillable?
The starting point is the thesis, advanced by Hawkins and König among others, that contrastive grammar writing is the 'complement' of typology. This thesis is critically examined and modified against the background of the project "Grammatik des Deutschen im europäischen Vergleich" (Grammar of German in European comparison). By way of illustration, two areas of German and Romanian grammar are compared, chiefly as they are presented in the German-Romanian contrastive grammar: the category of gender, and the marking of syntactic functions through case distinctions or other means, in particular 'differential object marking'. In both cases it can be shown that typological generalizations, for instance concerning the possible structure of gender systems or hierarchies such as the animacy and definiteness hierarchies, lend the contrastive comparison greater explanatory power.
Research on syntactic ambiguity resolution in language comprehension has shown that subjects' processing decisions are influenced by a variety of heterogeneous factors such as syntactic complexity, semantic fit and the discourse frequency of the competing structures. The present paper investigates a further potentially relevant factor in such processes: effects of syntagmatic lexical chunking (or matching to a complex memorized prefab) whose occurrence would be predicted from usage-based assumptions about linguistic categorisation. Focusing on the widely studied so-called DO/SC-ambiguity in which a post-verbal NP is syntactically ambiguous between a direct object and the subject of an embedded clause, potentially biasing collocational chunks of the relevant type are identified in a number of corpus-linguistic pretests and then investigated in a self-paced reading experiment. The results show a significant increase in processing difficulty from a collocationally neutral over a lexically biasing to a strongly biasing condition. This suggests that syntagmatically complex and partially schematic templates of the kind envisioned in usage-based Construction Grammar may impinge on speakers' online processing decisions during sentence comprehension.
Introduction
(2008)
In usage-based Construction Grammar, grammatical structure is assumed to ‘sediment’ from concrete linguistic experience as an automatic by-product of repeated similar categorisation judgments (a process known as schematisation). At the same time, there is functional pressure on prospective inputs to such schematisations to retain or develop specialised properties that differentiate them from their near neighbours, i.e. other stored units in the constructicon (Goldberg 1995). Moreover, speakers are not assumed to necessarily extract all possible generalisations from their input. Using the example of a group of German support verb constructions, the present study outlines a corpus-linguistic approach to identifying those schemas that really seem to be formed by speakers, and how they can be kept apart from mere potential generalisations.
World views from a linguistic and legal perspective. On the ontologization of legal concepts
(2008)
The multiple gradations of German strong verbs are but manifestations of a rather uncomplicated system. There is a small number of ways to make up ablaut forms; these types of formation are identifiable in formal terms and, what is more, they have definite functions as morphological markers. Using classifications of stem forms according to quality, complexity and quantity of vowels, three types of operations involved in ablaut formation are identified. Ablaut always includes a change of quality type or a change of complexity type, and in addition it may include a change of quantity type. Ablaut forms are clearly distinguished as against bases (and against each other): their vocalism meets a defined standard of dissimilarity. On this basis, gradations are collected into inflectional classes that are defined in strictly synchronic terms. These classes continue the historical seven classes known from reference grammars. For the majority of strong verbs, membership in these classes (and thus ablaut) is predictable.
One problem of data-driven answer extraction in open-domain factoid question answering is that the class distribution of labeled training data is fairly imbalanced. In an ordinary training set, there are far more incorrect answers than correct answers. The class-imbalance is, thus, inherent to the classification task. It has a deteriorating effect on the performance of classifiers trained by standard machine learning algorithms. They usually have a heavy bias towards the majority class, i.e. the class which occurs most often in the training set. In this paper, we propose a method to tackle class imbalance by applying some form of cost-sensitive learning which is preferable to sampling. We present a simple but effective way of estimating the misclassification costs on the basis of class distribution. This approach offers three benefits. Firstly, it maintains the distribution of the classes of the labeled training data. Secondly, this form of meta-learning can be applied to a wide range of common learning algorithms. Thirdly, this approach can be easily implemented with the help of state-of-the-art machine learning software.
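The abstract above does not state how the misclassification costs are derived from the class distribution. As a minimal sketch only, and assuming an inverse-frequency weighting (a common choice, not confirmed as the authors' method), the idea can be illustrated as follows. The function names `estimate_costs` and `weighted_error` and the toy labels are hypothetical.

```python
from collections import Counter


def estimate_costs(labels):
    """Assign each class a cost inversely proportional to its frequency.

    Assumption: cost = N / (K * n_c), where N is the number of examples,
    K the number of classes and n_c the count of class c. The training
    data itself is left untouched (no over- or undersampling).
    """
    counts = Counter(labels)
    total = len(labels)
    n_classes = len(counts)
    return {cls: total / (n_classes * n) for cls, n in counts.items()}


def weighted_error(y_true, y_pred, costs):
    """Cost-weighted error: every mistake counts with the cost of its true class."""
    wrong = sum(costs[t] for t, p in zip(y_true, y_pred) if t != p)
    return wrong / sum(costs[t] for t in y_true)


# Toy data: far more incorrect answer candidates than correct ones.
labels = ["incorrect"] * 95 + ["correct"] * 5
costs = estimate_costs(labels)

# A classifier that always predicts the majority class.
majority_baseline = ["incorrect"] * len(labels)
baseline_error = weighted_error(labels, majority_baseline, costs)
```

Under this weighting, a classifier that always predicts the majority class is no longer rewarded for ignoring the rare correct answers, and a cost table of this kind can be passed to any learner that accepts per-class weights.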
A small survey of 33 non-linguists showed that, according to their linguistic knowledge, "monogamy" prevails with weder and entweder. The only partners ever named were noch and oder, respectively. For comparison, respondents were also asked about zwar, where from their point of view "polygamy" seems to hold. Most did name a "main wife", namely aber, but a considerable number named one or more further partners instead or in addition. What follows, however, is not concerned with this form of abstract, lexical-paradigmatic linguistic knowledge but with the empirical investigation of grammatical-syntagmatic reality in texts, taking into account the underlying textual competence.
Lexical chaining has become an important part of many NLP tasks. However, the goodness of a chaining process and hence its annotation output depends on the quality of the chaining resource. Therefore, a framework for chaining is needed which integrates divergent resources in order to balance their deficits and to compare their strengths and weaknesses. In this paper we present an application that incorporates the framework of a meta model of lexical chaining exemplified on three resources and its generalized exchange format.
Language is an astonishing phenomenon indeed when one considers that something like Standard German exists even without a steering authority, and that the German language does not drift apart into countless variations and varieties. Astonishment at the cohesion of the language was voiced repeatedly during this year's IDS annual conference, held under the motto "Deutsche Grammatik. Regeln, Normen, Sprachgebrauch" (German grammar: rules, norms, language use) from 11 to 13 March 2008 in the newly refurbished Rosengarten in Mannheim. Since a scholarly conference does not stop at astonishment, the assembled linguists set out to explain the nature of linguistic rules and norms. How do linguistic norms arise? Which factors determine that some new grammatical forms prevail and become the norm while others do not? What role does language standardization play in different areas of society such as school, business or law? And not least: how can the grammatical rule system be captured?
Lexicon schemas and their use are discussed in this paper from the perspective of lexicographers and field linguists. A variety of lexicon schemas have been developed, with goals ranging from computational lexicography (DATR) through archiving (LIFT, TEI) to standardization (LMF, FSR). A number of requirements for lexicon schemas are given. The lexicon schemas are introduced and compared to each other in terms of conversion and usability for this particular user group, using a common lexicon entry and providing examples for each schema under consideration. The formats are assessed and the final recommendation is given for the potential users, namely to request standard compliance from the developers of the tools used. This paper should foster a discussion between authors of standards, lexicographers and field linguists.
Lexicography
(2008)
The authors present a multilingual electronic database of lexical items with idiosyncratic occurrence patterns. Currently, our database consists of: (1) a collection of 444 bound words in German; (2) a collection of 77 bound words in English; (3) a collection of 58 negative polarity items in Romanian; (4) a collection of 84 negative polarity items in German; and (5) a collection of 52 positive polarity items in German. The database is encoded in XML and is available via the Internet, offering dynamic and flexible access.
This paper presents three electronic collections of polarity items: (i) negative polarity items in Romanian, (ii) negative polarity items in German, and (iii) positive polarity items in German. The presented collections are a part of a linguistic resource on lexical units with highly idiosyncratic occurrence patterns. The motivation for collecting and documenting polarity items was to provide a solid empirical basis for linguistic investigations of these expressions. Our database provides general information about the collected items, specifies their syntactic properties, and describes the environment that licenses a given item. For each licensing context, examples from various corpora and the Internet are introduced. Finally, the type of polarity (negative or positive) and the class (superstrong, strong, weak or open) associated with a given item are specified. Our database is encoded in XML and is available via the Internet, offering dynamic and flexible access.
The authors describe two data sets submitted to the database of MWE evaluation resources: (1) cranberry expressions in English and (2) cranberry expressions in German. The first package contains a collection of 444 cranberry words in German (CWde.txt) and a collection of the corresponding cranberry expressions (CCde.txt). The second package consists of a collection of 77 cranberry words in English (CWen.txt) and a collection of the corresponding cranberry expressions (CCen.txt). The data included in these packages was extracted from the Collection of Distributionally Idiosyncratic Items (CoDII), an electronic linguistic resource of lexical items with idiosyncratic occurrence patterns. Each package contains a readme file, and can be downloaded from multiword.wiki.sourceforge.net/Resources.
One of the most popular techniques used in HPSG-based studies to describe linguistic phenomena is the raising mechanism. Besides ordinary raising verbs or adjectives, this tool has been applied for handling verbal complexes and discontinuous constituents, among other phenomena. In this paper, a new application for raising within the HPSG paradigm will be discussed, thereby investigating data from the prepositional domain. We will analyze linguistic properties of word combinations in German consisting of a preposition, a noun, and another preposition (such as auf Grund von (‘by virtue of’)), thus arguing that raising is the most appropriate method for satisfactorily describing the crucial syntactic features which are typical for those expressions. The objective of this paper is thus to demonstrate the efficiency of the raising mechanism as used in HPSG, and therefore, to emphasize the importance of designing a satisfactory uniform theory of raising within this grammar framework.
The present study examines the dynamics of the kanji combinations that form common (or general) and proper nouns in Japanese. The following three results were obtained. First, the degree of distribution results from two similar processes which are based on a steady-state of birth-and-death processes with different birth and death rates, resulting in a positive negative binomial distribution with the proper nouns and in a positive Waring distribution with common nouns. Second, all rank-frequency distributions follow the negative hypergeometric distribution used very frequently in ranking problems. Third, the building of kanji compounds follows a dissortative strategy. The higher the outdegree of a kanji, the more it prefers kanji with lower indegrees. A linear dependence can be observed with common nouns, whereas the relationship between compounded kanji is rather curvilinear with proper nouns. The actual analytical expression is not yet known.
Teaching and learning verbs, adjectives and nouns ... A never-ending topic of discussion
(2008)
The decline of linguistic competence (or language attrition) is a phenomenon encountered in various contexts whenever access to what has been acquired in a language (L1, L2 or foreign language) diminishes. Research on the subject shows, for example, that the influence of the L2 makes it difficult for L1 speakers to exploit all the stylistic or pragmatic variation that their L1 would normally allow them. The question is what is actually lost: is it linguistic competence, the mental representation of knowledge, that is affected, or is it rather a restriction of access to and control over acquired knowledge, which itself remains intact? In the context of current debates about the advantages and risks of plurilingualism, it is not only interesting but indeed necessary to deepen research into the processes of attrition. Moreover, for plurilingual speakers to benefit fully from their potential, society must recognize and concretely value these competences and encourage speakers to display their bilingual identity with full confidence and openness.
European standard languages in a tight spot. Between global English, dialects and regional languages
(2008)
Starting from declarations of the EU, the value of European languages and their diversity according to their different territorial, social, and legal extensions are discussed. The standard varieties of the various languages are emphasized as being especially important for national and European language policies and for individual language cultivation. They contributed and may continue to contribute more than other language varieties to the cultural wealth of Europe. On the other hand, their development is especially impaired by the increasing use of ‘global’ English. The increasing tendency toward a diaglossia (English plus one other language) and the use of languages within the institutions of the EU are discussed. In conclusion, it is argued that although tolerance is necessary, it is not sufficient for a thriving further development of European linguistic diversity.
An overview is given of the language-policy background, the prehistory, the founding, the aims and the activities of EFNIL, the European Federation of National Institutions for Language. It is a network organization in which language academies and central language institutes from the countries of the European Union joined together in 2003, with the task of contributing to the preservation and further development of linguistic diversity in Europe. It focuses its attention and activities on the standard languages of the states of the European Union.
The Institut für Deutsche Sprache is one of the founding members of the European Federation of National Institutions for Language, to which the language academies and central language institutes of the states of the European Union belong. It is usually referred to as EFNIL, the acronym of its English name (for details see <www.efnil.org>). At its annual conference in Riga in 2007, the General Assembly of EFNIL adopted a resolution that has been submitted to the bodies responsible for language policy in the EU and the member states. It is now available in all 23 official languages of the Union. This contribution reproduces the German version.
COOCCURRENCE ANALYSIS SEEN CONTRASTIVELY
On applying collocational patterning in bilingual lexicography - some examples from the large German-Czech academic dictionary
This paper takes up some of the thoughts presented in the study by C. Belica and K. Steyer in this volume. The authors show how bilingual lexicographers can take advantage of the results of cooccurrence analysis when dealing with German-Czech contrasts and when structuring word configurations in an entry. They also sketch the corpus data in the form of structural types based on collocational patterns and stress the importance of cooccurrence analysis for a broader range of equivalents. They plead for more consideration of syntactic variability and argue that cooccurrence analysis, applied to both German and Czech, should be an important step.
This article summarizes key aspects of the concept developed by the project ‘Usuelle Wortverbindungen’ (UWV) for the corpus-based lexicographic description of word combinations in OWID. The focus of this subproject is on the lexicographic description of the typical use of conventionalized word combinations on the basis of a very large corpus of German. Corpus-analytic methods are used for a differentiated investigation of language use, and the results are presented in a user-friendly hypertext format. A further aim is to represent adequately, through a large number of authentic corpus examples, the linguistic diversity found in the corpora, particularly with regard to word combinations.
This article focuses on German-language collocation research and lexicographic practice from a corpus linguistic perspective. Although there is no dictionary called “Deutsches Kollokationswörterbuch” (German collocation dictionary), the collocation perspective is gaining popularity in linguistic research and dictionary work in the German-speaking area. On the one hand, this tendency is due to the growing number of studies dealing with German as a contrast language and works on foreign language didactics. On the other hand, powerful electronic resources such as large corpora and lexical databases, which are nowadays available for the German language, are recognised as a valuable empirical basis. Nevertheless, the application of novel corpus linguistic methods in lexicographic practice is still unsatisfactory. Therefore, this article concludes by discussing innovative aspects of corpus linguistic empirical research on the basis of collocations. These ideas are presented as an incentive for further research as well as practical application.
The essay first offers an overview of recent trends and open questions in international research on conceptual history and argues for ‘Historische Semantik’ (historical semantics) as the name of the discipline for a field that is expanding beyond classical conceptual history. It then asks about explanatory models for semantic change in history. Three models of change are discussed in more detail: loss of plausibility of ways of speaking due to surprising events and upheavals; increase in the strategic use value of ways of speaking in recurring communicative situations; and disturbance of a language's stock of words and meanings by word imports from another language. Starting from the last of these models, the essay concludes by discussing theoretical problems arising from the call for a transnational or comparative historical semantics.
Starting from an account of the new possibilities of use offered by the online version of the Neologismenwörterbuch (dictionary of neologisms) compared with its print version, the contribution shows which links are currently set within the Neologismenwörterbuch and from it to the dictionaries of the OWID portal and to other electronic dictionaries. Using the entries Adresse and Klammeraffe, which have been fully compiled in both the neologism dictionary and the elexiko dictionary, it considers the planned linking of entries with identical headwords. These considerations concern in particular the senses and their labelling as well as the linking of semantically related words. Agreement on these points can help to do better justice to each project's concept and to make the presentation clearer and thus ultimately more user-friendly.
The contribution deals with the interactive structure of doctor-patient communication. After a short discussion of the relevance of doctor-patient communication within public health policy, an outline is given of the medical and linguistic research on doctor-patient communication in Germany. Basic features of conversations and the conversation analytic methodology are then presented. Conversation analyses of doctor-patient communication reveal five main interactive components, which are discussed in detail. Finally, some considerations concerning the implementation of linguistic research in medical practice are discussed.
This work proposes opinion frames as a representation of discourse-level associations which arise from related opinion topics. We illustrate how opinion frames help gather more information and also assist disambiguation. Finally we present the results of our experiments to detect these associations.
This work proposes opinion frames as a representation of discourse-level associations that arise from related opinion targets and which are common in task-oriented meeting dialogs. We define the opinion frames and explain their interpretation. Additionally we present an annotation scheme that realizes the opinion frames and via human annotation studies, we show that these can be reliably identified.
Traditional conceptualizations of EMOTION (as a complex of phenomena irrelevant to explaining human cognition) are contrasted with an integrative approach according to which cognition and emotion interact as two mental systems and have interfaces that are relevant both representationally and procedurally. Emotions are defined as systems of knowledge and evaluation, feelings as cognitively experienceable emotions. Using examples, the paper discusses to what extent cognitive thoughts and emotional feelings (contrary to the prevailing view) have more in common than divides them.
Starting from the three entries "Gegenwart", "blind" and "Globalisierung" and their respective links to one of the three OWID products, this contribution attempts to show whether and how the elexiko dictionary has gained in substance and lexicographic information content through its integration into OWID.
Elexiko is a lexicological-lexicographic, corpus-guided German Internet reference work (cf. www.elexiko.de). Compared to printed dictionaries, in elexiko, restrictions on space disappear. Specific comments on the use of a word do not need to be given in traditional abbreviated forms, like the so-called field labels or usage. In this paper, I will show its advantages for the description of the particular pragmatic characteristics of a word: I will argue that traditional labelling such as formal, informal, institutional, etc. cannot account for the comprehensive pragmatic dimension of a word and that these labels are not transparent, particularly for non-native speakers of German. The main focus of the paper will be on an alternative approach to this dictionary information, as suggested by elexiko. I will demonstrate how narrative, descriptive and user-friendly notes can be formulated for the explanation of the discursive contextual embedding or tendencies of evaluative use. I will outline how lexicographers can derive such information from language data in an underlying corpus which was designed and compiled for specific lexicographic purposes. Both the theoretical-conceptual ideas and their lexicographic realisation in elexiko will be explained and illustrated with the help of relevant dictionary entries.
E-VALBU: Advanced SQL/XML processing of dictionary data using an object-relational XML database
(2008)
Contemporary practical lexicography uses a wide range of advanced technological aids, most prominently database systems for the administration of dictionary content. Since XML has become a de facto standard for the coding of lexicographic articles, integrated markup functionality – such as query, update, or transformation of instances – is of particular importance. Even the multi-channel distribution of dictionary data benefits from powerful XML database services. Exemplified by E-VALBU, the most comprehensive electronic dictionary on German verb valency, we outline an integrated approach for advanced XML storing and processing within an object-relational database, and for a public retrieval frontend using Web Services and AJAX technology.
Foreword
(2008)
The Internet as a medium is changing, and with it the conditions of publication and reception. What opportunities do the two visions of the future currently being discussed in parallel, the Social Web and the Semantic Web, offer? To answer this question, the contribution examines the foundations of both models with regard to application and technology, but also sheds light on their shortcomings and on the added value of combining them in a way appropriate to the medium. Using the online grammatical information system grammis as an example, a strategy for the integrative use of their respective strengths is outlined.
In this paper, the authors describe a semi-automated approach to refine the dictionary-entry structure of the digital version of the Wörterbuch der deutschen Gegenwartssprache (WDG, en.: Dictionary of Present-day German), a dictionary compiled and published between 1952 and 1977 by the Deutsche Akademie der Wissenschaften that comprises six volumes with over 4,500 pages containing more than 120,000 headwords. We discuss the benefits of such a refinement in the context of the dictionary project Digitales Wörterbuch der deutschen Sprache (DWDS, en: Digital Dictionary of the German language). In the current phase of the DWDS project, we aim to integrate multiple dictionary and corpus resources in German language into a digital lexical system (DLS). In this context, we plan to expand the current DWDS interface with several special purpose components, which are adaptive in the sense that they offer specialized data views and search mechanisms for different dictionary functions-e.g. text comprehension, text production-and different user groups-e.g. journalists, translators, linguistic researchers, computational linguists. One prerequisite for generating such data views is the selective access to the lexical items in the article structure of the dictionaries which are the object of study. For this purpose, the representation of the eWDG has to be refined. The focus of this paper is on the semiautomated approach used to transform eWDG into a refined version in which the main structural units can be explicitly accessed. We will show how this refinement opens new and flexible ways of visualizing and querying the lexicographic content of the refined version in the context of the DLS project.
This paper presents the results of a joint effort of a group of multimodality researchers and tool developers to improve the interoperability between several tools used for the annotation and analysis of multimodality. Each of the tools has specific strengths so that a variety of different tools, working on the same data, can be desirable for project work. However, this usually requires tedious conversion between formats. We propose a common exchange format for multimodal annotation, based on the annotation graph (AG) formalism, which is supported by import and export routines in the respective tools. In the current version of this format the common denominator information can be reliably exchanged between the tools, and additional information can be stored in a standardized way.
Rescuing Legacy Data
(2008)
This paper discusses issues that arise in the transformation of electronic language data from outdated to modern, sustainable formats. We first describe the problem and then present four different cases in which corpora of spoken language were converted from legacy formats to an XML-based representation. For each of the four cases, we describe the conversion workflow and discuss the difficulties that we had to overcome. Based on this experience, we formulate some more general observations about transforming legacy data and conclude with a set of best practice recommendations for a more sustainable handling of language corpora.
This paper presents the Kicktionary, a multilingual (English - German - French) electronic lexical resource of the language of football. In the Kicktionary, methods from corpus linguistics and two approaches to lexical semantics - the theory of frame semantics and the concept of semantic relations - are combined to construct a lexical resource in which the user can explore relationships between lexical units in various ways. This paper explains the theoretical background of the Kicktionary, sketches the data and methods which were used in its construction, and describes how the resulting resource is presented to users via a set of hyperlinked webpages.
Belemnon's Curiöses Bauern-Lexicon (CBL) of 1728 is an unusual dictionary of difficult expressions and phrases (almost exclusively foreign words) that were used incorrectly by uneducated speakers of the early 18th century ("peasants"). The CBL lists around 800 of these foreign words alphabetically and, after brief notes on their correct pronunciation, meaning and use, contrasts them with the respective corruptions or misuses, mostly illustrated by (often comical) usage examples. This article first describes the dictionary's physical form, its transmission and reception, its aims and addressees, and its macro- and microstructure. The complete inventory of correct and incorrect word forms is then surveyed and sorted in two ways: first in the order of the dictionary, to gain an overview of its macrostructure, and then, reversing the user's perspective, as an alphabetical list of the 2000 "wrong words", each assigned to the underlying correct form(s). A first review then reveals different types of misuse, depending on the social and dialectal environment of the language users. In the background is the question of how far the CBL is a language-historical source for the everyday language of the early 18th century: does it primarily serve to amuse educated circles at the expense of the less educated, to whom possibly invented, especially ridiculous linguistic blunders are also attributed, or does it actually document the deficient use of foreign words by speakers from the rural lower class of its time?
A photographic reproduction of the CBL is included as a PDF file. Until a critical edition appears, hopefully soon, it is intended to give researchers easier access to this source text, which is of interest in several respects.
As many popular text genres such as blogs or news contain opinions by multiple sources and about multiple targets, finding the sources and targets of subjective expressions becomes an important sub-task for automatic opinion analysis systems. We argue that while automatic semantic role labeling systems (ASRL) have an important contribution to make, they cannot solve the problem for all cases. Based on the experience of manually annotating opinions, sources, and targets in various genres, we present linguistic phenomena that require knowledge beyond that of ASRL systems. In particular, we address issues relating to the attribution of opinions to sources; sources and targets that are realized as zero-forms; and inferred opinions. We also discuss in some depth that for arguing attitudes we need to be able to recover propositions and not only argued-about entities. A recurrent theme of the discussion is that close attention to specific discourse contexts is needed to identify sources and targets correctly.
The Meta-data-Database of a Next Generation Sustainability Web-Platform for Language Resources
(2008)
Our goal is to provide a web-based platform for the long-term preservation and distribution of a heterogeneous collection of linguistic resources. We discuss the corpus preprocessing and normalisation phase that results in sets of multi-rooted trees. At the same time we transform the original metadata records, just like the corpora annotated using different annotation approaches and exhibiting different levels of granularity, into the all-encompassing and highly flexible format eTEI for which we present editing and parsing tools. We also discuss the architecture of the sustainability platform. Its primary components are an XML database that contains corpus and metadata files and an SQL database that contains user accounts and access control lists. A staging area, whose structure, contents, and consistency can be checked using tools, is used to make sure that new resources about to be imported into the platform have the correct structure.
We present SPLICR, the Web-based Sustainability Platform for Linguistic Corpora and Resources. The system is aimed at people who work in Linguistics or Computational Linguistics: a comprehensive database of metadata records can be explored in order to find language resources that could be appropriate for one’s specific research needs. SPLICR also provides an interface that enables users to query and to visualise corpora. The project in which the system is being developed aims at sustainably archiving the ca. 60 language resources that have been constructed in three collaborative research centres. Our project has two primary goals: (a) To process and to archive sustainably the resources so that they are still available to the research community in five, ten, or even 20 years' time. (b) To enable researchers to query the resources both on the level of their metadata as well as on the level of linguistic annotations. In more general terms, our goal is to enable solutions that leverage the interoperability, reusability, and sustainability of heterogeneous collections of language resources.
We present SPLICR, the Web-based Sustainability Platform for Linguistic Corpora and Resources. The system is aimed at people who work in Linguistics or Computational Linguistics: a comprehensive database of metadata records can be explored in order to find language resources that could be appropriate for one’s specific research needs. SPLICR also provides a graphical interface that enables users to query and to visualise corpora. The project in which the system is developed aims at sustainably archiving the ca. 60 language resources that have been constructed in three collaborative research centres. Our project has two primary goals: (a) To process and to archive sustainably the resources so that they are still available to the research community in five, ten, or even 20 years' time. (b) To enable researchers to query the resources both on the level of their metadata as well as on the level of linguistic annotations. In more general terms, our goal is to enable solutions that leverage the interoperability, reusability, and sustainability of heterogeneous collections of language resources.
This contribution deals with the representation of verbs with multiple meanings or senses in general monolingual dictionaries. Criteria for differentiating senses in dictionary entries have traditionally been formulated with respect to the vocabulary in general. This paper argues that, while some criteria do indeed apply to the entire lexicon, many of them are relevant only to specific semantic classes. This will be demonstrated by considering two selected verb classes: speech-act verbs and perception verbs. Like verbs of other classes, speech-act verbs and perception verbs may be ambiguous in different but recurrent ways. Since recurrent patterns of ambiguity are always typical of particular semantic classes, class-specific semantic criteria are formulated to decide whether a particular ambiguous speech-act or perception verb should be treated as being polysemous or homonymous in dictionary entries. In addition to these class-specific semantic criteria, the semantic-syntactic criterion of identity or difference of argument structure is suggested for the lexicographical representation of verbs which may not be considered to be polysemous or homonymous on the basis of semantic criteria alone. According to the suggested argument-structure criterion, these verbs should be treated as polysemous when their senses correlate with identical argument structures and as homonymous when their senses correlate with different argument-structure properties. As opposed to the semantic criteria suggested, the semantic-syntactic criterion of identity vs. difference of argument structure applies to verbs of different semantic classes. However, as will be illustrated by the discussion of the different senses of smell, it may sometimes force us to treat different but related senses as corresponding to two distinct lexical items.
In order to solve this problem, the criteria suggested are supplemented by a preference rule stating that semantic criteria apply prior to the semantic-syntactic criterion of identity vs. difference of argument structure...
Lingvistiskās ainavas metode – netradicionāls ceļš multilingvisma jautājumu izpētē un mācīšanā
(2008)
The aim of this article is to introduce the linguistic landscape method and to explain its advantages not only for linguists' research but also for its introduction into teaching at schools and universities. After this brief introduction, we want to show not only how the method has been implemented but also its current stage of development. We will also present the project "Latvijas lingvistiskā ainava Baltijas valstu kontekstā" (The linguistic landscape of Latvia in the context of the Baltic states), developed at the beginning of 2008, which we are currently carrying out at Rēzeknes Augstskola (students of the master's programme "Filoloģija" and two lecturers). A brief insight will also be given into the results obtained in the project and the problems we encountered during the research, and we will present the experience newly gained.
Media literacy is regarded as a central qualification in the information and knowledge society, one that affects life, learning and work. Individuals as well as organisations and systems are responsible for the attainment of this competence. Since it has become a prerequisite for active participation in and creative co-determination of this society, all target and age groups should possess it. In Germany, however, there are major deficits both in media research and in the promotion of media literacy for people with a migration background. Current integration initiatives and official statements stress the need to remedy these shortcomings and to use the great potential of integration through media more efficiently. Studies on the media use of adults and children show that the conditions for this are relatively good. Households of people with a migration background are often better equipped with media than German households. The majority of immigrants are also reached by both German-language and heritage-language media. Media use is determined more strongly by sociodemographic factors than by ethnicity. To reach the heterogeneous group of people with a migration background for intercultural media work, a wide range of aspects and interrelations must be taken into account. Selected projects and activities offer ideas for the practical promotion of media literacy for this target group.
Slowakei
(2008)
Anakoluthe dependenziell
(2008)
In the situated negotiation of turn-taking in interaction (Sacks, Schegloff and Jefferson, 1974), participants orient to the possible completion of turn-constructional units. Through the delayed completion of a previous turn, a speaker can claim the right to speak beyond an intervening turn by another speaker. This article examines different forms of this "delayed completion" (Lerner, 1989) in spoken French. Using the theoretical framework of Conversation Analysis (ten Have, 1999), we show that this practice arises not only from problematic turn-taking but also from collaborative sequences, which are closely linked to the phenomenon of collaborative syntactic constructions. By examining these emergent syntactic structures, it is possible to demonstrate the situated and local, turn-by-turn negotiation of the right to speak and of the dynamics of turn-taking in ordinary conversation. On the basis of a collection of excerpts from natural interactions recorded on audio or video, different ways of claiming or sharing one's turn are illustrated. In the analyses, particular attention is paid to some recurrent phenomena in delayed-completion sequences. Thus, the use of certain conjunctions as discourse markers or the presence of vowel lengthening at the end of the first segment seems to indicate co-occurrences of audible resources specific to different types of delayed completion in French conversation.
This article is based on a collection of repetitions following overlap, drawn from video data in German and French. The systematic description of this turn-resumption device is built on a comparison between clear and deviant cases of the phenomenon. It is shown that recycling is a resource of the next speaker as well as of the current speaker.
Gespräche mit Patienten. Ein alltägliches und komplexes Arbeits- und Steuerungsinstrument für Ärzte
(2008)
This paper is a project report on the lexicographic Internet portal OWID, an Online Vocabulary Information System of German which is being built at the Institute of German Language in Mannheim (IDS). The contents of the portal and its technical approaches are presented. The lexical database is structured in a granular way, which makes it possible to extend the search options available to lexicographers. Against the background of current research on the use of electronic dictionaries, the OWID project is also working on first ideas for user-adapted access to and user-adapted views of the lexicographic data. Because the dictionaries in the OWID portal are available online, the design and functions of the website can be changed easily (in comparison to printed dictionaries). Ideas for implementing user-adapted views of the lexicographic data are demonstrated using an example taken from one of the dictionaries of the portal, namely elexiko.
This article explains the design of a tailor-made XML model for a dictionary network. The written version is based on a talk of the same title given at the first working meeting of the DFG network "Internetlexikografie" in Mannheim in May 2011. The article is to be understood as a workshop report, i.e. as a practically oriented look at how we designed our modelling for OWID, what consequences this has for lexicographic work and for users' search options, and what advantages and disadvantages we see in this modelling approach. It therefore does not offer a comprehensive theoretical discussion of the various possibilities of modelling. Only in the following chapter are the main features of the modelling approach briefly explained, with references to further project-related literature.
Lexikografie im Internet
(2008)
Electronic corpora play an ever growing role in lexicography. On the one hand, new access to linguistic usage is made possible through the use of text corpora and intelligent corpus-based query tools; however, the final results are still interpreted and described by lexicographers. In this case corpora are used for data acquisition. On the other hand, there are also projects that provide purely automatically acquired data in the form of "dictionaries". Lexicographers play only a minor role here. This latter type of corpus use creates a completely new kind of electronic dictionary. This article addresses the questions as to what extent these dictionaries differ from lexicographic tradition and whether they must be considered in metalexicography. Starting from previously compiled electronic dictionary typologies, we try to supplement the formulation of lexicographic data as a distinguishing feature. Finally, based on the findings of the project elexiko (Institute for the German Language - IDS), we demonstrate that the distinction between electronic versus man-made lexicographic data is also relevant to lexicographical practice.
The development of user-adapted views of lexicographic data is frequently called for in dictionary research on electronic reference works and hypertext information systems. For printed dictionaries it has been indispensable to develop a complete dictionary for a particular user group and particular situations of use. In contrast, any electronic presentation of lexicographic data allows user-specific views to be defined on an initially user-unspecific resource. However, research on dictionary use in general still has to answer several open questions on this subject. This paper first provides an overview of the present state of research on dictionary use with respect to electronic lexicography. It then explains further prerequisites for possible user-adapted access to data, as exemplified by OWID, the Online Vocabulary Information System of the Institut für Deutsche Sprache. Finally, it outlines the results achieved on the subject so far and highlights the prospects of potential user-adapted presentations of lexicographic data.
The Online-Wortschatz-Informationssystem Deutsch (OWID; Online Vocabulary Information System German) of the Institut für Deutsche Sprache (IDS; German Language Institute) in Mannheim is a lexicographic Internet portal for various electronic dictionary resources that are being compiled at the IDS. It is an explicit goal of OWID not to present a random collection of unrelated reference works but to build a network of actually related lexicographic products. Hence, the core of the project is the design of an innovative concept of data modelling and structuring. The goal of this granular data modelling is to allow flexible access to each individual lexicographic resource as well as access across diverse dictionary resources. At the same time, fine-grained interconnectedness of all resources should be made possible. Every lexicographic resource within OWID (elexiko, Neologismenwörterbuch, Wortverbindungen online, Schulddiskurs im ersten Nachkriegsjahrzehnt) meets this requirement with regard to data modelling and structuring. The paper explains the underlying consistent concept of data modelling for the overall heterogeneous lexicographical resources. It also shows how the modelling potential has been translated into the Internet presence of OWID.
In this paper we investigate the coverage of the two knowledge sources WordNet and Wikipedia for the task of bridging resolution. We report on an annotation experiment which yielded pairs of bridging anaphors and their antecedents in spoken multi-party dialog. Manual inspection of the two knowledge sources showed that, with some interesting exceptions, Wikipedia is superior to WordNet when it comes to the coverage of information necessary to resolve the bridging anaphors in our data set. We further describe a simple procedure for the automatic extraction of the required knowledge from Wikipedia by means of an API, and discuss some of the implications of the procedure’s performance.
The thesis describes a fully automatic system for the resolution of the pronouns 'it', 'this', and 'that' in English unrestricted multi-party dialog. Referential relations considered include both normal NP-antecedence as well as discourse-deictic pronouns. The thesis contains a theoretical part with a comprehensive empirical study, and a practical part describing machine learning experiments.
In this paper, we present a suite of flexible UIMA-based components for information retrieval research which have been successfully used (and re-used) in several projects in different application domains. Implementing the whole system as UIMA components is beneficial for configuration management, component reuse, implementation costs, analysis and visualization.
Der, die, das is traditionally classified as a demonstrative pronoun, although, especially in spoken language, it occurs in complementary distribution with the personal pronoun er, sie, es when referring to third persons. The article first examines the relationship between der and er contrastively against the background of other European languages. It then raises the question of how to teach this: a concrete example shows how learners of German can be made aware of the use of der as a personal pronoun.
Our research task consists in the study of the way in which multilingual resources are mobilized in team work within collaborative activities; how they are exploited in a specific way in order both to enhance collaboration and to respect the specificities of the members’ linguistic competences and practices within the team. Central to our analytical work, which is inspired by ethnomethodological conversation analysis, is the relationship between multilingual resources and the situated organization of linguistic uses and of social practices. These two aspects are reflexively articulated, multilingual resources being shaped by the very contexts of their use and activities being constrained and thus structured by the available resources.
ANW and elexiko represent a new generation of scholarly electronic (online) dictionaries: they are not digitised clones of existing print dictionaries but are newly conceived in content and take full account of the possibilities of the new medium. This article first gives a broad overview of some important parallels and differences between the ANW and elexiko. It then discusses in detail the substantial difference in search options. Elexiko works with the distinction between "simple search" and "expert search", a familiar system. The ANW has developed its own new system with the following search options: search for information on a word, search for a word (starting from its meaning), search for words (on the basis of one or more shared features), search for examples with shared features, and search for information about the dictionary itself. In the onomasiological search types, which lead from content to word, the "semagram" plays a substantial role: it represents the knowledge associated with a word in a frame with "slots" and "fillers". The semagram is a further innovation of the ANW.
Open peer commentary on the target article “Who Conceives of Society?” by Ernst von Glasersfeld. Excerpt: I will focus on one crucial step in von Glasersfeld’s argumentation, viz. his view that every individual constructs his own private meanings (understood as conceptual structures or elements thereof) for linguistic expressions, so that linguistic interaction and even communication in general is based on a notion of compatibility between different speakers’ private conceptual schemes. The central question here is: “Just what does it mean that different private conceptual schemes (private meanings) are compatible, or what constitutes a viable criterion to this end?” As von Glasersfeld himself stresses twice (§28, §37), the criteria to be looked for can only be “public,” residing in properties of verbal and non-verbal actions of the interacting individuals, properties that can be sensed and processed by the participating system.
In spring 2002, we celebrated the inauguration of the first German-Russian-Jewish kindergarten in Berlin. Nowadays, there are seven bilingual German-Russian kindergartens with 460 places and 78 bilingual kindergartens with other combinations of languages [SENBWF]. Maybe it is not enough, taking into account the large proportion of immigrants in the population of Berlin. And yet, much progress has been achieved, endorsing the fact that German society has begun to change its attitude towards other languages on its territory. The initial request for German monolingualism first changed into societal tolerance of multilingualism and eventually to the recognition of the value of multilingualism. This process is a very slow one, and it is not yet complete. In my article, I would like to look at the development in the last few years of the political framework that has made possible, on the one hand, the opening of bilingual kindergartens in Berlin, and on the other hand, to consider what has hampered this process until now. I would like to emphasise the three most important political spheres: linguistic, educational and integrational.
The article presents recent developments in language policy, education policy and integration policy in Germany that reveal a new attitude towards multilingualism and make the creation of bilingual educational institutions possible. It has also been published in an English version entitled "The political framework for creation and development of bilingual Kindergartens in Berlin", which is accessible via the IDS document server. The German version is entitled "Politische Rahmenbedingungen für zweisprachige Kindertagesstätten in Berlin". It is unpublished but likewise available via the IDS document server.
In the context of a Nordic Conference on Bilingualism, it can be a rewarding task to look at issues such as language planning, policy and legislation from a perspective of the southern neighbours of the Nordic world. This paper therefore intends to point attention towards a case of societal multilingualism at the periphery of the Nordic world by dealing with recent developments in language policy and legislation with regard to the North Frisian speech community in the German Land of Schleswig-Holstein. As I will show, it is striking to what degree there are considerable differences in the discourse on minority protection and language legislation between the Nordic countries and a cultural area which may arguably be considered to be part of the Nordic fringe - and which itself occasionally takes Scandinavia as a reference point, e.g. in the recent adoption of a pan-Frisian flag modelled on the Nordic cross (Falkena 2006).
The main focus of the paper will be on the Frisian Act which was passed in the Parliament of Schleswig-Holstein in late 2004. It provides a certain legal basis for some political activities with regard to Frisian, but falls short of creating a true spirit of minority language protection and/or revitalisation. In contrast to the traditions of the German and Danish minorities along the German-Danish border and to minority protection in Northern Scandinavia (in particular to Sámi language rights), the approach chosen in the Frisian Act is extremely weak and has no connotation of long-term oriented language-planning, let alone a rights-based perspective.
The paper will then look at policy developments in the time since the Act was passed, e.g. in the Schleswig-Holstein election campaign in 2005, and on latest perceptions of the Frisian language situation in the discourse on North Frisian Policy in Schleswig-Holstein majority society. In the final part of the paper, I will discuss reasons for the differences in minority language policy discourse between Germany and the Nordic countries, and try to provide an outlook on how Frisian could benefit from its geographic proximity to the Nordic world.
Language-aware text editing
(2008)
While software developers have various power tools at their disposal that make the writing of computer programs more efficient, authors of texts do not have the support of such power tools. Text processors still operate on the level of characters and strings rather than on the level of word forms and grammatical constructions. This forces authors to constantly switch between low-level, character oriented, editing operations and high-level, conceptual, verbalisation processes. We suggest the development of language-aware text editing tools that simplify certain frequent, yet complex editing operations by defining them on the level of linguistic units. Pluralizing an entire noun phrase plus the verb forms governed by it would be an ambitious example, swapping the elements of a conjunctive construction a more modest one. We describe a pilot implementation for German where these operations are seamlessly integrated with the standard functions of an existing open-source editor. The operations can be invoked on demand and do not intrude on the authoring process. Changes can be performed locally or globally, thus simplifying the writing process considerably, and making the resulting texts more consistent.
In this paper the authors briefly outline editing functions which use methods from computational linguistics and take the structures of natural languages into consideration. Such functions could reduce errors and better support writers in realizing their communicative goals. However, linguistic methods have limits, and there are various aspects software developers have to take into account to avoid creating a solution looking for a problem: Language-aware functions could be powerful tools for writers, but writers must not be forced to adapt to their tools.