TY - JOUR U1 - Zeitschriftenartikel, wissenschaftlich - begutachtet (reviewed) A1 - Mell, Ruth Maria T1 - Tagungsbericht. 5. Tagung des Netzwerkes "Diskurs - interdisziplinär": Diskurs - holistisch. Perspektiven integrierender Diskursforschung, Universität Bern, 13. und 14. November 2015 JF - Zeitschrift für Diskursforschung KW - Diskursanalyse KW - Kongressbericht Y1 - 2016 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bsz:mh39-52521 UN - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bsz:mh39-52521 SN - 2195-867X SS - 2195-867X N1 - Dieser Beitrag ist aus urheberrechtlichen Gründen nicht frei zugänglich. IS - 2 SP - 202 EP - 207 PB - Beltz Juventa CY - Weinheim ER - TY - JOUR U1 - Zeitschriftenartikel, wissenschaftlich - begutachtet (reviewed) A1 - Raffelsiefen, Renate T1 - Allomorphy and the question of abstractness: evidence from German JF - Morphology N2 - A model of grammar needs to reconcile the undesirability inherent to allomorphy, the apparent extra burden on learning and memory, with its occurrence and possible stability. OT approaches this task by positing an anti-allomorphy constraint, henceforth referred to as "OO-correspondence", which requires leveling (i.e. sameness of sound structure) in related word forms (Benua 1997). The occurrence of allomorphy then indicates crucial domination of OO-correspondence by other constraints. To assess the adequacy of this proposal it is necessary to establish the level of abstractness at which OO-correspondence applies and to examine the consequences of this decision for ranking order. While proponents of OT tacitly assume the level in question to be rather concrete, the notion of allomorphy as originally envisioned in Structuralism was defined by distinctness at a more abstract level referred to as "phonemic" (Harris 1942; Nida 1944). 
The basic intuition here is that the defining property of subphonemic sound properties, their conditionedness by context, entails that whatever burden they put on learning and memory is of a fundamentally different nature than that entailed by phonemic distinctness. The evidence from German supports that intuition in that leveling can be shown to target phonemic sound structure to the exclusion of subphonemic properties. Allomorphy, defined by phonemic alternation, tends to serve phonological optimization in closed class items (function words, affixes) while serving to express morphological distinctions in open class items. The key to demonstrating the correlations in question lies in the discernment of phonemic structure, which is therefore at the core of the article. KW - Allomorph KW - Deutsch KW - Phonologie KW - Allomorphy KW - OO-correspondence KW - Phonemic level KW - Abstractness KW - German Y1 - 2016 UN - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bsz:mh39-51433 SN - 1871-5656 SS - 1871-5656 U6 - https://dx.doi.org/10.1007/s11525-016-9289-0 DO - https://dx.doi.org/10.1007/s11525-016-9289-0 N1 - Dieser Beitrag ist aus urheberrechtlichen Gründen nicht frei zugänglich. N1 - The final publication is available at Springer via http://dx.doi.org/10.1007/s11525-016-9289-0 VL - 26 IS - 3/4 SP - 235 EP - 267 PB - Springer CY - Heidelberg u.a. ET - First online ER - TY - JOUR U1 - Zeitschriftenartikel, wissenschaftlich - begutachtet (reviewed) A1 - Steyer, Kathrin T1 - Patterns. Phraseology in a state of flux JF - International Journal of Lexicography N2 - Linguistic usage patterns are not just coincidental phenomena on the textual surface but constitute a fundamental constructional principle of language. At the same time, however, linguistic patterns are highly idiosyncratic in the sense that they tend to be item-specific and unpredictable, thus defying all attempts at capturing them by general abstract rules. 
[…] What all these approaches [that deal with constructions, collocations, patterns, etc. K.S.] share, in addition to their interest in recurrent patterns, is a strong commitment to the value of usage, be it in the wider sense of usage as an empirical basis for sound linguistic analysis and description or in the narrower sense of usage as constituting the basis for the emergence and consolidation of linguistic knowledge. (Herbst et al. 2014: 1) Because language data can now be studied in new quantitative dimensions, phraseology faces a paradigm shift. The traditional focus on strongly lexicalized, often idiomatic multi-word expressions (MWE) has led to an overestimation of their unique status in the mental lexicon. The majority of MWEs are typical lexical realisations of templates (‘MW patterns’) that emerged from repeated usage and can be instantiated with ever changing lexical elements. The – primarily functional – pattern restrictions cannot always be predicted with rules, but are the result of recurring context factors. This article first shows the nature and the interrelations of MW patterns that are reconstructed with complex corpus-driven methods. It then discusses a vision of a new phraseography of MW patterns that describes their hierarchies and functions based on authentic corpus data such as KWIC bundles, slot filler tables and collocation profiles. KW - Deutsch KW - Kollokation KW - Mehrworteinheit KW - Sprachgebrauch Y1 - 2015 UN - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bsz:mh39-50678 SN - 0950-3846 SS - 0950-3846 U6 - https://dx.doi.org/10.1093/ijl/ecv021 DO - https://dx.doi.org/10.1093/ijl/ecv021 N1 - Dieser Beitrag ist aus urheberrechtlichen Gründen nicht frei zugänglich. 
VL - 28 IS - 3 SP - 279 EP - 298 PB - Oxford University Press CY - Oxford ER - TY - JOUR U1 - Zeitschriftenartikel, wissenschaftlich - begutachtet (reviewed) A1 - Fürbacher, Monica T1 - Empirische Erfassung und theoretische Modellierung grammatischer Variation. Bericht zur 52. Jahrestagung "Grammatische Variation - Empirische Zugänge und theoretische Modellierung" des Instituts für Deutsche Sprache, 8. bis 10. März 2016 JF - Deutsche Sprache KW - Deutsch KW - Grammatik KW - Grammatikalität KW - Sprachvariante KW - Kongress KW - Mannheim <2016> Y1 - 2016 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bsz:mh39-50333 UN - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bsz:mh39-50333 SN - 0340-9341 SS - 0340-9341 N1 - Dieser Beitrag ist aus urheberrechtlichen Gründen nicht frei zugänglich. VL - 44 IS - 2 SP - 183 EP - 188 PB - E. Schmidt CY - Berlin ER - TY - JOUR U1 - Zeitschriftenartikel, wissenschaftlich - begutachtet (reviewed) A1 - Volodina, Anna T1 - Dmitrij Dobrovol'skij. 2013. Besedy o nemeckom slove. Studien zur deutschen Lexik. (=Studia Philologica). Moskau: Jazyki Slavjansko Kul'tury [Rezension] JF - Deutsche Sprache KW - Deutsch KW - Russisch KW - Korpus KW - Lexikologie KW - Phraseologie KW - Lexikographie KW - Rezension Y1 - 2016 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bsz:mh39-50349 UN - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bsz:mh39-50349 UR - http://www.DSdigital.de/DS.02.2016.183 SN - 0340-9341 SS - 0340-9341 N1 - Dieser Beitrag ist aus urheberrechtlichen Gründen nicht frei zugänglich. VL - 44 IS - 2 SP - 189 EP - 191 PB - E. 
Schmidt CY - Berlin ER - TY - JOUR U1 - Zeitschriftenartikel, wissenschaftlich - begutachtet (reviewed) A1 - Koplenig, Alexander T1 - The Impact of Lacking Metadata for the Measurement of Cultural and Linguistic Change Using the Google Ngram Data Sets—Reconstructing the Composition of the German Corpus in Times of WWII JF - Digital Scholarship in the Humanities N2 - The Google Ngram Corpora seem to offer a unique opportunity to study linguistic and cultural change in quantitative terms. To avoid breaking any copyright laws, the data sets are not accompanied by any metadata regarding the texts the corpora consist of. Some of the consequences of this strategy are analyzed in this article. I chose the example of measuring censorship in Nazi Germany, which received widespread attention and was published in a paper that accompanied the release of the Google Ngram data (Michel et al. (2010): Quantitative analysis of culture using millions of digitized books. Science, 331(6014): 176–82). I show that without proper metadata, it is unclear whether the results actually reflect any kind of censorship at all. Collectively, the findings imply that observed changes in this period of time can only be linked directly to World War II to a certain extent. Therefore, instead of speaking about general linguistic or cultural change, it seems to be preferable to explicitly restrict the results to linguistic or cultural change ‘as it is represented in the Google Ngram data’. On a more general level, the analysis demonstrates the importance of metadata, the availability of which is not just a nice add-on, but a powerful source of information for the digital humanities. 
KW - Sprachwandel KW - Sprachstatistik KW - Metadaten KW - Kulturwandel KW - Korpus KW - Datenstruktur Y1 - 2017 UN - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bsz:mh39-49493 U6 - https://dx.doi.org/10.1093/llc/fqv037 DO - https://dx.doi.org/10.1093/llc/fqv037 N1 - Preprint is published under http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bsz:mh39-31557 Advance Access published September, 12, 2015 VL - 32 IS - 1 SP - 169 EP - 188 PB - Oxford University Press (OUP) CY - Oxford ER - TY - JOUR U1 - Zeitschriftenartikel, wissenschaftlich - begutachtet (reviewed) A1 - Teich, Elke A1 - Degaetano-Ortlieb, Stefania A1 - Fankhauser, Peter A1 - Kermes, Hannah A1 - Lapshinova-Koltunski, Ekaterina T1 - The linguistic construal of disciplinarity: A data-mining approach using register features JF - Journal of the Association for Information Science and Technology N2 - We analyze the linguistic evolution of selected scientific disciplines over a 30-year time span (1970s to 2000s). Our focus is on four highly specialized disciplines at the boundaries of computer science that emerged during that time: computational linguistics, bioinformatics, digital construction, and microelectronics. Our analysis is driven by the question whether these disciplines develop a distinctive language use—both individually and collectively—over the given time period. The data set is the English Scientific Text Corpus (scitex), which includes texts from the 1970s/1980s and early 2000s. Our theoretical basis is register theory. In terms of methods, we combine corpus-based methods of feature extraction (various aggregated features [part-of-speech based], n-grams, lexico-grammatical patterns) and automatic text classification. 
The results of our research are directly relevant to the study of linguistic variation and languages for specific purposes (LSP) and have implications for various natural language processing (NLP) tasks, for example, authorship attribution, text mining, or training NLP tools. KW - data mining KW - automatic classification Y1 - 2015 UN - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bsz:mh39-44369 SN - 1097-4571 SS - 1097-4571 U6 - https://dx.doi.org/10.1002/asi.23457 DO - https://dx.doi.org/10.1002/asi.23457 SP - 1 EP - 11 ET - Early View (Online Version of Record published before inclusion in an issue) ER - TY - JOUR U1 - Zeitschriftenartikel, wissenschaftlich - begutachtet (reviewed) A1 - Koplenig, Alexander T1 - Why the quantitative analysis of diachronic corpora that does not consider the temporal aspect of time-series can lead to wrong conclusions JF - Digital Scholarship in the Humanities N2 - Recently, a claim was made, on the basis of the German Google Books 1-gram corpus (Michel et al., Quantitative Analysis of Culture Using Millions of Digitized Books. Science 2010; 331: 176–82), that there was a linear relationship between six non-technical non-Nazi words and three ‘explicitly Nazi words’ in times of World War II (Caruana-Galizia. 2015. Politics and the German language: Testing Orwell’s hypothesis using the Google N-Gram corpus. Digital Scholarship in the Humanities [Online]. http://dsh.oxfordjournals.org/cgi/doi/10.1093/llc/fqv011 (accessed 15 April 2015)). Here, I try to show that apparent relationships like this are the result of misspecified models that do not take into account the temporal aspect of time-series data. The main point of this article is to demonstrate why such analyses run the risk of incorrect statistical inference, where potential effects are both meaningless and can potentially lead to wrong conclusions. 
KW - Sprachstatistik KW - Korpus KW - Internet Y1 - 2017 UN - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bsz:mh39-43679 SN - 2055-768X SS - 2055-768X U6 - https://dx.doi.org/10.1093/llc/fqv030 DO - https://dx.doi.org/10.1093/llc/fqv030 N1 - Advance Access published August 13, 2015 Print version: Digital Scholarship Humanities (2017) 32 (1): 159-168. VL - 32 IS - 1 SP - 159 EP - 168 PB - Oxford University Press (OUP) CY - Oxford ER - TY - JOUR U1 - Zeitschriftenartikel, wissenschaftlich - begutachtet (reviewed) A1 - Klosa, Annette T1 - Zur Optimierung von Wortbildungsangaben in DaF-Wörterbüchern auf der Basis systematischer Korpusuntersuchungen JF - Deutsch als Fremdsprache KW - Deutsch als Fremdsprache KW - Wörterbuch KW - Grammatik Y1 - 2008 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bsz:mh39-38098 UN - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bsz:mh39-38098 UR - http://www.DaFdigital.de/DaF.04.2008.208 SN - 0011-9741 SS - 0011-9741 VL - 45 IS - 4 SP - 208 EP - 215 ER - TY - JOUR U1 - Zeitschriftenartikel, wissenschaftlich - begutachtet (reviewed) A1 - Mahlow, Cerstin A1 - Piotrowski, Michael T1 - SMM: Detailed, Structured Morphological Analysis for Spanish JF - Polibits. An open access research journal on Computer Science and Computer Engineering N2 - We present a morphological analyzer for Spanish called SMM. SMM is implemented in the grammar development framework Malaga, which is based on the formalism of Left-Associative Grammar. We briefly present the Malaga framework, describe the implementation decisions for some interesting morphological phenomena of Spanish, and report on the evaluation results from the analysis of corpora. SMM was originally only designed for analyzing word forms; in this article we outline two approaches for using SMM and the facilities provided by Malaga to also generate verbal paradigms. 
SMM can also be embedded into applications by making use of the Malaga programming interface; we briefly discuss some application scenarios. KW - Natural language processing KW - morphology KW - Malaga KW - Spanish KW - Computerlinguistik KW - Morphologie KW - Spanisch Y1 - 2009 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bsz:mh39-37073 UN - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bsz:mh39-37073 UR - http://polibits.gelbukh.com/2009_39/issue39.htm VL - 39 SP - 41 EP - 48 ER - TY - JOUR U1 - Zeitschriftenartikel, wissenschaftlich - begutachtet (reviewed) A1 - Schröter, Melani A1 - Storjohann, Petra T1 - Patterns of discourse semantics. A corpus-assisted study of financial crisis in British newspaper discourse in 2009 JF - Pragmatics and Society N2 - Corpus-assisted analyses of public discourse often focus on the level of the lexicon. This article argues in favour of corpus-assisted analyses of discourse, but also in favour of conceptualising salient lexical items in public discourse in a more determined way. It draws partly on non-Anglophone academic traditions in order to promote a conceptualisation of discourse keywords, thereby highlighting how their meaning is determined by their use in discourse contexts. It also argues in favour of emphasising the cognitive and epistemic dimensions of discourse-determined semantic structures. These points will be exemplified by means of a corpus-assisted, as well as a frame-based analysis of the discourse keyword financial crisis in British newspaper articles from 2009. Collocations of financial crisis are assigned to a generic matrix frame for ‘event’ which contains slots that specify possible statements about events. By looking at which slots are more, respectively less filled with collocates of financial crisis, we will trace semantic presence as well as absence, and thereby highlight the pragmatic dimensions of lexical semantics in public discourse. 
The article also advocates the suitability of discourse keyword analyses for systematic contrastive analyses of public/political discourse and for lexicographical projects that could serve to extend the insights drawn from corpus-guided approaches to discourse analysis. KW - Diskursanalyse KW - Diskurssemantik KW - Korpus KW - Englisch KW - Zeitung KW - discourse keywords (DKW) KW - discourse semantics KW - semantic presence/absence KW - frame-based contrastive analysis KW - corpus linguistics KW - public/political discourse Y1 - 2015 UN - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bsz:mh39-36153 SN - 1878-9714 SS - 1878-9714 U6 - https://dx.doi.org/10.1075/ps.6.1.03sch DO - https://dx.doi.org/10.1075/ps.6.1.03sch N1 - Dieser Beitrag ist aus urheberrechtlichen Gründen nicht frei zugänglich. VL - 6 IS - 1 SP - 43 EP - 66 PB - John Benjamins CY - Amsterdam ER - TY - JOUR U1 - Zeitschriftenartikel, wissenschaftlich - begutachtet (reviewed) A1 - Brandt, Patrick A1 - Fuß, Eric T1 - Most questionable pronouns: variation between das- vs. was-relatives in German JF - Linguistische Berichte N2 - The article investigates the conditions under which the w-relativizer was appears instead of the d-relativizer das in German relative clauses. Building on Wiese 2013, we argue that was constitutes the elsewhere case that applies when identification with the antecedent cannot be established by syntactic means via upward agreement with respect to phi-features. Corpus-linguistic results point to the conclusion that this is the case whenever there is no lexical nominal in the antecedent that, following Geach 1962 and Baker 2003, supplies a criterion of identity needed to establish sameness of reference between the antecedent and the relativizer. 
KW - Generative Transformationsgrammatik KW - Sprachstatistik KW - Indefinitpronomen KW - Relativpronomen KW - Deutsch KW - German KW - Frequency KW - Indefinite pronoun KW - Relative pronoun Y1 - 2014 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bsz:mh39-30373 UN - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bsz:mh39-30373 SN - 0024-3930 SS - 0024-3930 VL - 2014 IS - 239 SP - 297 EP - 329 PB - Buske CY - Hamburg ER -