Studien
Refine
Document Type
- Conference Proceeding (3)
- Article (1)
Language
- English (4) (remove)
Has Fulltext
- yes (4)
Is part of the Bibliography
- no (4)
Keywords
- Korpus <Linguistik> (2)
- Burgenland (1)
- Datenverarbeitung (1)
- Deutsch (1)
- Kroatisch (1)
- Metadaten (1)
- Minderheitensprache (1)
- Ontologie <Wissensverarbeitung> (1)
- Semantische Relation (1)
- Slawisch (1)
Publicationstate
Reviewstate
Knowledge in textual form is always presented as visually and hierarchically structured units of text, which is particularly true in the case of academic texts. One research hypothesis of the ongoing project Knowledge ordering in texts - text structure and structure visualisations as sources of natural ontologies1 is that the textual structure of academic texts effectively mirrors essential parts of the knowledge structure that is built up in the text. The structuring of a modern dissertation thesis (e.g. in the form of an automatically generated table of contents - toes), for example, represents a compromise between requirements of the text type and the methodological and conceptual structure of its subject-matter. The aim of the project is to examine how visual-hierarchical structuring systems are constructed, how knowledge structures are encoded in them, and how they can be exploited to automatically derive ontological knowledge for navigation, archiving, or search tasks. The idea to extract domain concepts and semantic relations mainly from the structural and linguistic information gathered from tables of contents represents a novel approach to ontology learning.
In this paper we present an approach to faceted search in large language resource repositories. This kind of search which enables users to browse through the repository by choosing their personal sequence of facets heavily relies on the availability of descriptive metadata for the objects in the repository. This approach therefore informs the collection of a minimal set of metatdata for language resources. The work described in this paper has been funded by the EC within the ESFRI infrastructure project CLARIN.
This paper presents the application of the <tiger2/> format to various linguistic scenarios with the aim of making it the standard serialisation for the ISO 24615 [1] (SynAF) standard. After outlining the main characteristics of both the SynAF metamodel and the <tiger2/> format, as extended from the initial Tiger XML format [2], we show through a range of different language families how <tiger2/> covers a variety of constituency and dependency based analyses.
This article aims to show that it is only by comparing different multilingual communities that a typologically relevant description of such communities is made possible. An example (Brix, 1982) is presented to demonstrate that a usable description of such regions is implicitly based on comparison and what factors are important in this respect. It is shown that only a model of variables which can comprise the analogous traits of the situations as well as the differences between them enables an adequate comparison to be made. The examples of the Croats in the Burgenland and of the Slovenes in Carinthia show what the consequences for the description of the situations are and what difficulties arise with such a description. The domains of the semi-official use of language are examined to find out what parallels and differences in the factors which are used for the description of multilingual communities (e.g. economic situation, legislation, historical development, sociolinguistic situation) can be related the peculiarities of language use in the two situations. In this way typological similarities and certain idiosyncratic characteristics of the two regions can be understood.