Grammatikforschung
Refine
Year of publication
Document Type
- Part of a Book (14)
- Other (12)
- Article (8)
- Book (2)
Keywords
Publicationstate
Reviewstate
- (Verlags)-Lektorat (23)
- Peer-Review (2)
Publisher
- Institut für Deutsche Sprache (16)
- Narr (5)
- de Gruyter (5)
- Heidelberg University Publishing (3)
- Lang (1)
- Univerzita Hradec Králové (1)
The variation of the strong genitive marker of the singular noun has been treated by diverse accounts. Still there is a consensus that it is to a large extent systematic but can be approached appropriately only if many heterogeneous factors are taken into account. Over thirty variables influencing this variation have been proposed. However, it is actually unclear how effective they can be, and above all, how they interact. In this paper, the potential influencing variables are evaluated statistically in a machine learning approach and modelled in decision trees in order to predict the genitive marking variants. Working with decision trees based exclusively on statistically significant data enables us to determine what combination of factors is decisive in the choice of a marking variant of a given noun. Consequently the variation factors can be assessed with respect to their explanatory power for corpus data and put in a hierarchized order.
Der korpuslinguistische Ansatz des Projekts »Korpusgrammatik« eröffnet neue Perspektiven auf unsere Sprachwirklichkeit allgemein und grammatische Regularitäten im Besonderen. Der vorliegende Band klärt auf, wie man korpuslinguistisch nach dem Standard fragen kann, wie die Projektkorpora aufgebaut und in einer Korpusdatenbank erschlossen sind, wie man in einem automatischen Abfragesystem der Variabilität der Sprache zu Leibe rückt und sie sogar messbar macht, schließlich aber auch, wo die Grenzen quantitativer Korpusanalysen liegen. Pilotstudien deuten an, wie der Ansatz unsere grammatischen Horizonte erweitert und die Grammatikografie voranbringt.
In recent years, the availability of large annotated and searchable corpora, together with a new interest in the empirical foundation and validation of linguistic theory and description, has sparked a surge of novel and interesting work using corpus-based methods to study the grammar of natural languages. However, a look at relevant current research on the grammar of the Germanic, Romance, and Slavic languages reveals a variety of different theoretical approaches and empirical foci, which can be traced back to different philological and linguistic traditions. Still, this current state of affairs should not be seen as an obstacle but as an ideal basis for a fruitful exchange of ideas between different research paradigms.
Einleitung
(2019)
Einleitung
(2020)
A corpus-based academic grammar of German is an enormous undertaking, especially if it aims at using state-of-the-art methodology while ensuring that its study results are verifiable. The Bausteine-series, which is being developed at the Leibniz Institute for the German Language (IDS), presents individual “building blocks” for such a grammar. In addition to the peer-reviewed texts, the series publishes the results of statistical analyses and, for selected topics, the underlying data sets.
The syntactic rules of today's High German are generally thought to have crystallized during the 18th century. One can therefore expect that during that time the processes of grammaticalization had a particular influence on doubtful cases in the usage of language. The syntactic development varied from region to region and was accompanied by theoretical controversies. One of the controversial issues was word order. The debates focused primarily on the order in verbal complexes and on the possibility of extraposing simple components and dependent clauses. This paper is based on the assumption that the theoretical controversies in some way reflected the doubtful cases in the usage of language. In order to identify the actual variants, the theoretical controversies will be outlined first. Then the analysis will focus on whether and how these variants were used in a corpus of 18th century texts. The objective is to determine the language-internal, sociological, and geographical factors of the variants' usage and thus to model the situations in which the doubtful cases ocurred. In conclusion, the following issues will be discussed: the relationship between the doubtful cases and diachronic language developments, the language-external factors of the doubtful cases, and the approach of language theorists to doubtful cases.