OPUS 4 | Search

3 search hits

1 to 3

Sort by

A distributional comparison between FOLK and DeReKo (2023)

Kupietz, Marc ; Fankhauser, Peter ; Ruppenhofer, Josef

Count-based and predictive language models for exploring DeReKo (2022)

We present the use of count-based and predictive language models for exploring language use in the German Reference Corpus DeReKo. For collocation analysis along the syntagmatic axis we employ traditional association measures based on co-occurrence counts as well as predictive association measures derived from the output weights of skipgram word embeddings. For inspecting the semantic neighbourhood of words along the paradigmatic axis we visualize the high dimensional word embeddings in two dimensions using t-stochastic neighbourhood embeddings. Together, these visualizations provide a complementary, explorative approach to analysing very large corpora in addition to corpus querying. Moreover, we discuss count-based and predictive models w.r.t. scalability and maintainability in very large corpora.

Visualizing Language Change in a Corpus of Contemporary German (2017)

Fankhauser, Peter ; Kupietz, Marc

1 to 3

Open Access

Refine

Author

Year of publication

Document Type

Language

Has Fulltext

Is part of the Bibliography

Keywords

Publicationstate

Reviewstate

Publisher

3 search hits