ZipfExplorer : a tool for the comparison of shared lexis
Coats, Steven (2021-05-14)
Coats, S. (2021) ZipfExplorer: A Tool for the Comparison of Shared Lexis. In Post-Proceedings of the 5th Conference Digital Humanities in the Nordic Countries (DHN 2020) Riga, Latvia, October 21-23, 2020, Article 6, pp. 145-155, http://ceur-ws.org/Vol-2865/short6.pdf
© 2021 for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
https://creativecommons.org/licenses/by/4.0/
https://urn.fi/URN:NBN:fi-fe2021052131113
Tiivistelmä
Abstract
Word frequency statistics and lexical diversity measures can provide insights into discourse differences between texts. The ZipfExplorer, a tool and online app for the interactive visualization and comparison of word frequencies in two texts, shows side-by-side rank-frequency profiles and interactive tables of shared lexis,enabling keyword analysis and shedding light on discourse differences. Four lexical diversity measures (type-token ratio, Gini coefficient, power-law alpha parameter, and Shannon entropy) are calculated for the shared word types. Word frequency information is provided for a selection of mainly literary texts, and users can upload their own files. This paper provides an overview of the visualization of word frequency distributions, describes the functionality of the ZipfExplorer tool and demonstrates some of its features, and briefly discusses the lexical diversity measures calculated by the tool.
Kokoelmat
- Avoin saatavuus [32026]