Volltext-Downloads (blau) und Frontdoor-Views (grau)

Exploring and visualizing variation in language resources

  • Language resources are often compiled for the purpose of variational analysis, such as studying differences between genres, registers, and disciplines, regional and diachronic variation, influence of gender, cultural context, etc. Often the sheer number of potentially interesting contrastive pairs can get overwhelming due to the combinatorial explosion of possible combinations. In this paper, we present an approach that combines well understood techniques for visualization heatmaps and word clouds with intuitive paradigms for exploration drill down and side by side comparison to facilitate the analysis of language variation in such highly combinatorial situations. Heatmaps assist in analyzing the overall pattern of variation in a corpus, and word clouds allow for inspecting variation at the level of words.

Export metadata

Additional Services

Share in Twitter Search Google Scholar


Author:Peter Fankhauser, Jörg Knappen, Elke Teich
Parent Title (English):Proceedings of the ninth international conference on language resources and evaluation (LREC '14)
Publisher:European Language Resources Association (ELRA)
Place of publication:Reykjavik
Document Type:Conference Proceeding
Year of first Publication:2014
Date of Publication (online):2014/06/13
Corpus Comparison; Language Variation; Visualization
GND Keyword:Amerikanisches Englisch; Englisch; Kontrastive Linguistik; Korpus <Linguistik>; Sprachvariante; Visualisierung
First Page:4125
Last Page:4128
DDC classes:400 Sprache / 420 Englisch / 427 Varianten des Englischen, Mittelenglisch
Open Access?:ja
Leibniz-Classification:Sprache, Linguistik
Linguistics-Classification:Kontrastive Linguistik
Licence (German):License LogoUrheberrechtlich geschützt