Refine
Year of publication
- 2009 (9) (remove)
Document Type
- Article (9) (remove)
Has Fulltext
- yes (9)
Is part of the Bibliography
- no (9) (remove)
Keywords
- Deutsch (2)
- Korpus <Linguistik> (2)
- Annotation (1)
- Arzt (1)
- Automatische Sprachanalyse (1)
- Computerlinguistik (1)
- Digital Humanities (1)
- Diskurs (1)
- Flexion (1)
- Format <Fernsehsendung> (1)
Publicationstate
- Veröffentlichungsversion (4)
- Postprint (3)
- Zweitveröffentlichung (2)
Reviewstate
- (Verlags)-Lektorat (9) (remove)
Publisher
Die Flexionsmorphologie des Deutschen ist ein zentraler Forschungsgegenstand des europäischen Forschungsnetzwerks EuroGr@mm, dessen Erschließung für Forschung und Lehre seit Anfang 2007 vorangetrieben wird. Das europäische Projekt hatte sich zur Aufgabe gemacht, diesen grammatischen Themenbereich aus französischer, italienischer, norwegischer, polnischer und ungarischer Perspektive kontrastiv zu beleuchten. Die ersten Ergebnisse wurden nun in Form von didaktisch aufbereiteten Wissenseinheiten auf der Lemplattform ProGr@mm kontrastiv veröffentlicht.
This article introduces the topic of ‘‘Multilingual language resources and interoperability’’. We start with a taxonomy and parameters for classifying language resources. Later we provide examples and issues of interoperatability, and resource architectures to solve such issues. Finally we discuss aspects of linguistic formalisms and interoperability.
We report on finished work in a project that is concerned with providing methods, tools, best practice guidelines, and solutions for sustainable linguistic resources. The article discusses several general aspects of sustainability and introduces an approach to normalizing corpus data and metadata records. Moreover, the architecture of the sustainability platform implemented by the authors is described.
This article shows that the TEI tag set for feature structures can be adopted to represent a heterogeneous set of linguistic corpora. The majority of corpora is annotated using markup languages that are based on the Annotation Graph framework, the upcoming Linguistic Annotation Format ISO standard, or according to tag sets defined by or based upon the TEI guidelines. A unified representation comprises the separation of conceptually different annotation layers contained in the original corpus data (e.g. syntax, phonology, and semantics) into multiple XML files. These annotation layers are linked to each other implicitly by the identical textual content of all files. A suitable data structure for the representation of these annotations is a multi-rooted tree that again can be represented by the TEI and ISO tag set for feature structures. The mapping process and representational issues are discussed as well as the advantages and drawbacks associated with the use of the TEI tag set for feature structures as a storage and exchange format for linguistically annotated data.