400 Language
A comparison between morphological complexity measures: typological data vs. language corpora
(2016)
Language complexity is an intriguing phenomenon argued to play an important role in both language learning and processing. The need to compare languages with regard to their complexity resulted in a multitude of approaches and methods, ranging from accounts targeting specific structural features to global quantification of variation more generally. In this paper, we investigate the degree to which morphological complexity measures are mutually correlated in a sample of more than 500 languages of 101 language families. We use human expert judgements from the World Atlas of Language Structures (WALS), and compare them to four quantitative measures automatically calculated from language corpora. These consist of three previously defined corpus-derived measures, which are all monolingual, and one new measure based on automatic word-alignment across pairs of languages. We find strong correlations between all the measures, illustrating that both expert judgements and automated approaches converge to similar complexity ratings, and can be used interchangeably.
This paper describes a new approach to improving the analysis and categorization of web documents, using statistical methods for template-based clustering as well as semantic analysis based on terminological ontologies. A domain-specific environment serves as a proof of concept. In order to demonstrate the widespread practical benefit of our approach, we outline a combined mathematical and semantic framework for information retrieval on internet resources.
affiziertes Objekt
(2020)
The paper presents the process of developing the AirFrame database, a specialized lexical resource in which aviation terminology is defined in the form of semantic frames, following the methodology of the Berkeley FrameNet (FN). First, the structure of the database is presented, and then the methodology applied in developing and populating the database is described. The link between specialized aviation frames and general language semantic frames, of which frames defining entities, processes, attributes and events are particularly relevant, is discussed using the example of the semantic frame of Flight and its related frames. The paper concludes by discussing possibilities of using AirFrame as a model for further developing resources in which general and specialized knowledge are linked.
Wortgeschichte digital (Digital Word History) is an emerging historical dictionary of the German language that focuses on describing semantic shifts from about 1600 through today. This article provides deeper insight into the dictionary’s “cross-reference clusters,” one of its software tools, which visualizes the dictionary’s reference network. The clusters are thus a part of the project’s macrostructure. They serve as both a means for users to find entries of interest and a tool to elucidate relations among dictionary entries. Rather than delve into technical aspects, this article focuses on the logic applied by the software and discusses the approach in light of the dictionary’s microstructure. The article concludes with some considerations about the clusters’ advantages and limitations.
Applying terminological methods to lexicography helps lexicographers deal with the terms occurring in general language dictionaries, especially when it comes to writing the definitions of concepts belonging to special fields. In the context of the lexicographic work of the Dicionário da Língua Portuguesa, an updated digital version of the most recent Academia das Ciências de Lisboa dictionary, published in 2001, we have assumed that terminology – in its dual dimension, both linguistic and conceptual – and lexicography are complementary in their methodological approaches. Both disciplines deal with lexical items, which can be lexical units or terms. In this paper, we apply terminological methods to improve the treatment of terms in general language dictionaries, to write definitions with greater precision and accuracy, and to specify the domains to which the terms belong. Additionally, we highlight the consistent modelling of lexicographic components, namely a hierarchy of domain labels, which serve as term identification markers, in place of a flat list of domains. The need to create and make available structured, organised and interoperable lexicographic resources has led us to follow a path in which the application of standards and best practices for treating and representing specialised lexicographic content is a fundamental requirement.