OPUS 4 | Search

3 search hits

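Was für Enthüllungen! heulte die wohlgekleidete respektable Menge – Eine korpus-linguistische Untersuchung zur lexikalischen Vielfalt von Redeeinleitern [English: "What revelations!" howled the well-dressed, respectable crowd: A corpus-linguistic study of the lexical diversity of speech introducers] (2019)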

Tu, Ngoc Duyen Tanja ; Engelberg, Stefan ; Weimer, Lukas

Corpus REDEWIEDERGABE (2020)

Brunner, Annelen ; Engelberg, Stefan ; Jannidis, Fotis ; Tu, Ngoc Duyen Tanja ; Weimer, Lukas

This article presents the corpus REDEWIEDERGABE, a German-language historical corpus with detailed annotations for speech, thought and writing representation (ST&WR). With approximately 490,000 tokens, it is the largest resource of its kind. It can be used to answer literary and linguistic research questions and serve as training material for machine learning. This paper describes the composition of the corpus and the annotation structure, discusses some methodological decisions and gives basic statistics about the forms of ST&WR found in this corpus.

Making Non-Normalized Content Retrievable – A Tagging Pipeline for a Corpus of Expert-Layperson Texts (2023)

Lang, Christian ; Tu, Ngoc Duyen Tanja ; Zeidler, Laura

Conventional terminology resources reach their limits when it comes to automatic content classification of texts in the domain of expert-layperson communication. This can be attributed to the fact that (non-normalized) language usage does not necessarily reflect the terminological elements stored in such resources. We present several strategies to extend a terminological resource with term-related elements in order to optimize automatic content classification of expert-layperson texts.
