OPUS 4 | 410 Linguistik

An XML Annotation Schema for speech, thought and writing representation (2014)

This contribution presents an XML Schema for annotating a high-level narratological category: speech, thought and writing representation (ST&WR). It focuses on two aspects. First, the original Schema is presented as an example of the challenge of encoding a narrative feature in a structured and flexible way. Second, ways of adapting this Schema to TEI are considered, in order to make it usable for other, TEI-based projects.

Verbal feedback: positioning and acoustics of French “ouais” and “oui” (2014)

Prévot, Laurent ; Gorisch, Jan

Anticipatory reactions. Patients’ answers to doctors’ questions (2014)

Spranz-Fogasy, Thomas

Discourses of helping professions. Concepts and contextualization (2014)

Graf, Eva-Maria ; Sator, Marlene ; Spranz-Fogasy, Thomas

Creative commons and language resources: general issues and what's new in CC 4.0 (2014)

Kamocki, Paweł ; Ketzan, Erik

The impact of lacking metadata and data truncation for the measurement of cultural and linguistic change using the Google Ngram datasets (2014)

Koplenig, Alexander

As a result of legal restrictions, the Google Ngram Corpora datasets (a) are not accompanied by any metadata regarding the texts the corpora consist of and (b) are truncated to prevent an indirect conclusion from the n-gram to the author of the text. Some of the consequences of this strategy are discussed in this article.

Multilingual corpora at the Hamburg centre for language corpora (2014)

Hedeland, Hanna ; Lehmberg, Timm ; Schmidt, Thomas ; Wörner, Kai

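Zum Zusammenhang von Sprache und ethnischer Identität der zweiten Generation der Deutschen aus der ehemaligen Sowjetunion [On the relationship between language and ethnic identity in the second generation of Germans from the former Soviet Union] (2014)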

Dück, Katharina

A survey of recent linguistic studies shows a frequently emphasized close connection between language and identity, above all between one's own language and ethnic identity. It is rarely pointed out, however, that in a bilingual or multilingual context language can be only one resource for constructing identity. This article examines the minority of ethnic German resettlers (Aussiedler) from the former Soviet Union as a characteristic example of a loosened bond between language and ethnic identity. The focus is on the second generation, whose sense of belonging to the ethnic identity as Germans has changed rarely or not at all, despite the language shift that has taken place.

Forschungsinfrastrukturen in außeruniversitären Forschungseinrichtungen. Forschungsbericht [Research infrastructures in non-university research institutions. Research report] (2014)

Fiedler, Norman ; Werthmann, Antonina ; Stührenberg, Maik ; Schonefeld, Oliver ; Bingel, Joachim ; Witt, Andreas

Data Mining with Shallow vs. Linguistic Features to Study Diversification of Scientific Registers (2014)

Degaetano-Ortlieb, Stefania ; Fankhauser, Peter ; Kermes, Hannah ; Lapshinova-Koltunski, Ekaterina ; Ordan, Noam ; Teich, Elke

We present a methodology for analyzing the linguistic evolution of scientific registers with data mining techniques, comparing the insights gained from shallow vs. linguistic features. The focus is on selected scientific disciplines at the boundary with computer science (computational linguistics, bioinformatics, digital construction, microelectronics). The data are drawn from the English Scientific Text Corpus (SCITEX), which covers a time range of roughly thirty years (1970/80s to early 2000s) (Degaetano-Ortlieb et al., 2013; Teich and Fankhauser, 2010). In particular, we investigate the diversification of scientific registers over time. Our theoretical basis is Systemic Functional Linguistics (SFL) and its specific incarnation of register theory (Halliday and Hasan, 1985). In terms of methods, we combine corpus-based feature extraction with data mining techniques.

