Modeling and Measuring Short Text Similarities. On the Multi-Dimensional Differences between German Poetry of Realism and Modernism
- This study contributes to the ongoing discussion on how to operationalize text similarity for the purposes of computational literary studies by defining, justifying theoretically and employing a multi-dimensional text model. Additionally, we evaluate a set of strategies to implement this model for very short texts like poetry using a range of methods from weighted sparse vectors up to very recent neural sentence embeddings based on annotations of emotions, genre and similarity. And finally, we show the relevance of using such a complex text model by applying the best method to a research question about the development of early modernism in German poetry. While we can confirm some important hypotheses from literary studies, we are also able to differentiate or relativize others. In particular, our findings do not support the widely held thesis that the change from realism to modernism was a revolutionary 'rupture'.
Author: | Anton EhrmanntrautORCiD, Thora HagenORCiD, Fotis JannidisORCiDGND, Leonard KonleORCiD, Merten KrönckeORCiD, Simone WinkoORCiDGND |
---|---|
URN: | urn:nbn:de:bsz:mh39-130792 |
DOI: | https://doi.org/10.48694/jcls.116 |
Parent Title (English): | Journal of Computational Literary Studies |
Publisher: | Universitäts- und Landesbibliothek Darmstadt |
Place of publication: | Darmstadt |
Document Type: | Article |
Language: | German |
Year of first Publication: | 2022 |
Date of Publication (online): | 2025/03/27 |
Publishing Institution: | Leibniz-Institut für Deutsche Sprache (IDS) |
Publicationstate: | Veröffentlichungsversion |
Reviewstate: | Peer-Review |
Tag: | multi-dimensional text model computational literary studies; short text |
GND Keyword: | Lyrik; Modernismus; Realismus; Ähnlichkeit |
Volume: | 1 |
Issue: | 1 |
First Page: | 1 |
Last Page: | 30 |
DDC classes: | 400 Sprache / 430 Deutsch |
Open Access?: | ja |
BDSL-Classification: | Textwissenschaft |
Linguistics-Classification: | Textlinguistik / Schriftsprache |
Licence (English): | ![]() |