Refine
Year of publication
- 2006 (115) (remove)
Document Type
- Part of a Book (87)
- Conference Proceeding (12)
- Article (11)
- Book (3)
- Master's Thesis (1)
- Other (1)
Keywords
- Deutsch (53)
- Textverstehen (20)
- Korpus <Linguistik> (11)
- Grammatik (8)
- Gesprochene Sprache (6)
- Metapher (6)
- Konversationsanalyse (5)
- Syntax (5)
- Textproduktion (5)
- Russisch (4)
Publicationstate
- Veröffentlichungsversion (115) (remove)
Reviewstate
Publisher
- de Gruyter (35)
- Narr (26)
- Lang (7)
- Oficyna Wydawnicza ATUT | Neisse (4)
- Association for Computational Linguistics (3)
- Schmidt (3)
- Verlag für Gesprächsforschung (3)
- Extreme Markup Languages Conference (2)
- Institut für Deutsche Sprache (2)
- Stauffenburg (2)
Linguistic corpora have been annotated by means of SGML-based markup languages for almost 20 years. We can, very roughly, differentiate between three distinct evolutionary stages of markup technologies. (1)Originally, single SGML tree-based document instances were deemed sufficient for the representation of linguistic structures. (2) Linguists began to realize that alternatives and extensions to the traditional model are needed. Formalisms such as, for example, NITE were proposed: the NITE Object Model (NOM) consists of multi-rooted trees. (3) We are now on the threshold of the third evolutionary stage: even NITE's very flexible approach is not suited for all linguistic purposes. As some structures, such as these, cannot be modeled by multi-rooted trees, an even more flexible approach is needed in order to provide a generic annotation format that is able to represent genuinely arbitrary linguistic data structures.
In the mid-1990s, the Faculty of Linguistics and Literary-Studies at Bielefeld University began to establish the field Text technology, both in research and education. Text technology is a new field of research on the border of Computational Linguistics and Computational Philology.
This paper focuses on Text technology in academic education. In 2002, Text Technology was introduced as a minor subject for B.A. Programs. It is organized in modules: Module 1 introduces the characteristics of electronic texts and documents, typography, typesetting systems and hypertext. Module 2 introduces one or two programming languages relevant to the field of humanities computing. Markup languages and the principles of information structuring are the main topics of Module 3. The formal fundamentals of computer-based text processing, as formal languages and their grammars, Logics et cetera are subjects of another module. The paper ends with a short description of other Bachelor- and Master-Programs at Bielefeld University which contain text technological themes.
Die textlinguistische Grundthese dieses Beitrags besagt, dass alle Texte elementar aus Zeit gemacht sind. Diese These gilt nicht nur für die Verbalgrammatik, wo sie sich schon wegen der Verbaltempora fast von selbst versteht, sondern auch für die Nominalgrammatik, die im Zentrum dieses Beitrags steht. Das wird am Beispiel von Kafkas Erzählung „Die Verwandlung“ zunächst an den Pronominalisierungen, dann an den Renominalisierungen des Textes gezeigt. Beide sind „Zeit-Zeichen“, die auf unterschiedliche Weise die Geltung eines Nomens in der Textzeit verlängern und gegebenenfalls modifizieren. Auch der Satz ist ein Textstück, in dem die Zeit nicht angehalten wird, sondern fortlaufend den Sinn des Textes verändert.