Refine
Year of publication
Document Type
- Article (330)
- Conference Proceeding (290)
- Part of a Book (174)
- Book (18)
- Part of Periodical (8)
- Other (3)
- Working Paper (3)
- Image (1)
- Review (1)
Language
- English (828) (remove)
Keywords
- Korpus <Linguistik> (218)
- Deutsch (180)
- Computerlinguistik (96)
- Konversationsanalyse (83)
- Interaktion (75)
- Gesprochene Sprache (54)
- Annotation (52)
- Forschungsdaten (44)
- Automatische Sprachanalyse (40)
- Natürliche Sprache (39)
Publicationstate
- Veröffentlichungsversion (590)
- Zweitveröffentlichung (135)
- Postprint (113)
- Ahead of Print (6)
- Preprint (1)
Reviewstate
- Peer-Review (828) (remove)
Publisher
- IDS-Verlag (78)
- Association for Computational Linguistics (35)
- European Language Resources Association (34)
- Elsevier (24)
- Springer (24)
- de Gruyter (24)
- Taylor & Francis (22)
- Institut für Deutsche Sprache (20)
- European Language Resources Association (ELRA) (19)
- Benjamins (17)
This paper describes the lexical database tool LOLA (Linguistic-Oriented Lexical database Approach) which has been developed for the construction and maintenance of lexicons for the machine translation system LMT. First, the requirements such a tool should meet are discussed, then LMT and the lexical information it requires, and some issues concerning vocabulary acquisition are presented. Afterwards the architecture and the components of the LOLA system are described and it is shown how we tried to meet the requirements worked out earlier. Although LOLA originally has been designed and implemented for the German-English LMT prototype, it aimed from the beginning at a representation of lexical data that can be reused for other LMT or MT prototypes or even other NLP applications. A special point of discussion will therefore be the adaptability of the tool and its components as well as the reusability of the lexical data stored in the database for the lexicon development for LMT or for other applications.
In this paper, we analyze a dramatically aggravated conflict interaction taking place in the course of an association’s meeting in an urban community center. The interaction can be seen as the culmination point of a social conflict developing and increasing over a period of years. In this conflict, one of the crucial points of the sociocultural development in the city under study is to be seen in an exemplary way. Our analysis started with the question, why this conflict is unsolvable although the interest divergences of the opposing parties are not irreconcilable. Our analysis shows that the protagonists practice different communicative social styles. These stylistic differences however, are not the cause for misunderstandings, but the protagonists use stylistic differences and different cultural orientations as a resource for political action. Thereby a process of increasing hardening of perspective divergence emerges together with an interaction modality of drama and of the fundamental grounding of divergent views. Theoretically we are concerned with the explication of a sociolinguistic theory which includes as constitutive components the concepts of communicative social style, of perspectivation and of interaction modality. We want to show, that the analyzed type of sociocultural conflict can be explained by virtue of considering the interplay of features on these three levels.
This paper deals with the distribution of word length in short native mythological and historical Eskimo narrative texts. To my knowledge, no Eskimo‐Aleut data have been the object of quantitative linguistic investigation so far. Due to the strong linguistic and Stylistic homogeneity of the examined texts it was assumed that these texts can be subsumed under a single law of word length distribution, if word length distribution of a text is considered as a function of certain of its properties, such as author, language, and genre. So far, word length distribution in texts of a wide variety of languages and genres has been demonstrated to follow distributions of the compound Poisson family of discrete probability distributions. In view of the morphological idiosyncrasies of the Eskimo language in general, which are responsible for an unusually high mean word length of about 4.5 to 5.2 syllables per word in the texts, it is interesting to see whether Eskimo texts show a significantly different behaviour with respect to word length. The results demonstrate that the Eskimo data employed in this study can be fitted well by the Hyperpoisson distribution. Two further discrete probability distributions will be deduced from certain morphology‐based assumptions about Eskimo. It turns out that most of the Eskimo data can be fitted by these two distributions. The question to what extent these results point to a more grammar‐oriented theory of word length is also discussed.
The concept of text coherence was developed for linear text, i.e. text of sequentially organized content. The present article addresses to what extent this concept can be applied to hypertext. Following the introduction (section 1), I will define different aspects of text coherence (section 2). I will then explain the importance of the sequential order of text constituents for coherence-building, as explored by empirical studies on text comprehension (section 3). Section 4 discusses how hypertext-specific forms of reading affect the processes of coherence-building and coherence-design. Section 5 explores how the new challenges of hypertext comprehension may be met by hypertext-specific coherence cues. A summary and outlook is included (section 6).
In the context of the HyTex project, our goal is to convert a corpus into a hypertext, basing conversion strategies on annotations which explicitly mark up the text-grammatical structures and relations between text segments. Domain-specific knowledge is represented in the form of a knowledge net, using topic maps. We use XML as an interchange format. In this paper, we focus on a declarative rule language designed to express conversion strategies in terms of text-grammatical structures and hypertext results. The strategies can be formulated in a concise formal syntax which is independend of the markup, and which can be transformed automatically into executable program code.