The influence of font scale on semantic expression of word cloud

Yang, Lu; Li, Jie; Lu, Wenhuan; Chen, Yi; Zhang, Kang; Li, Yan

doi:10.1007/s12650-020-00678-3

The influence of font scale on semantic expression of word cloud

Regular Paper
Published: 22 July 2020

Volume 23, pages 981–998, (2020)
Cite this article

Journal of Visualization Aims and scope Submit manuscript

Lu Yang¹,
Jie Li¹,
Wenhuan Lu¹,
Yi Chen²,
Kang Zhang³ &
…
Yan Li ORCID: orcid.org/0000-0002-5019-2114⁴

761 Accesses
10 Citations
Explore all metrics

Abstract

Word cloud is a common text visualization technique. With the ability of presenting the keywords of a document in a direct way, it has been widely applied in many real-world situations. However, to better represent the main idea of a document, a critical aspect for word cloud design is to set an appropriate font size to facilitate semantic expression. In this paper, we explore the influence of font scale on semantic expression and evaluate font size of word cloud in a more systematic approach. To quantify semantic information of a document, we utilize an LDA ensemble-based method to support interactive selection of topics and obtain the semantics of documents in a scientific way. We conducted two pilot studies to decide important attributes of word clouds for the formal study. Through formal study 1, we find that the scale affects the semantic expression of word cloud, including accuracy, time and confidence in making judgments. In study 2, we explored different semantic expression patterns of word clouds under different document categories. Our findings aimed at optimizing the scale of word cloud and improving its semantic expressing ability.

Graphic abstract

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Making data visualization more efficient and effective: a survey

Article 19 November 2019

Xuedi Qin, Yuyu Luo, … Guoliang Li

Automating creativity assessment with SemDis: An open platform for computing semantic distance

Article Open access 31 August 2020

Roger E. Beaty & Dan R. Johnson

Recent automatic text summarization techniques: a survey

Article 29 March 2016

Mahak Gambhir & Vishal Gupta

References

Alexander E, Chang CC, Shimabukuro M, Franconeri S, Collins C, Gleicher M (2018) Perceptual biases in font size as a data encoding. IEEE Trans Visualization Comp Gr 24(8):2397–2410
Article Google Scholar
Barth L, Fabrikant SI, Kobourov SG, Lubiw A, Nöllenburg M, Okamoto Y, Pupyrev S, Squarcella C, Ueckerdt T, Wolff A (2014) Semantic word cloud representations: Hardness and approximation algorithms. In: Latin American Symposium on Theoretical Informatics, pp. 514–525. Springer
Barth L, Kobourov SG, Pupyrev S (2014) Experimental comparison of semantic word clouds. In: International Symposium on Experimental Algorithms, pp. 247–258. Springer
Bateman S, Gutwin C, Nacenta M (2008) Seeing things in the clouds: the effect of visual features on tag cloud selections. In: Proceedings of the nineteenth ACM conference on Hypertext and hypermedia, pp. 193–202. ACM (2008)
Blei DM, Ng AY, Jordan MI (2003) Latent dirichlet allocation. J Mach Learn Res 3(Jan):993–1022
MATH Google Scholar
Chen S, Li J, Andrienko G, Andrienko N, Wang Y, Nguyen PH, Turkay C (2019) Supporting story synthesis: bridging the gap between visual analytics and storytelling. IEEE Transactions on Visualization and Computer Graphics, pp. 1–1
Chi MT, Lin SS, Chen SY, Lin CH, Lee TY (2015) Morphable word clouds for time-varying text data visualization. IEEE Trans Vis Comp Gr 21(12):1415–1426
Article Google Scholar
Cui W, Wu Y, Liu S, Wei F, Zhou MX, Qu H (2010) Context preserving dynamic word cloud visualization. In: 2010 IEEE Pacific Visualization Symposium (PacificVis), pp. 121–128. IEEE
Danielsson PE (1980) Euclidean distance mapping. Comp Gr Image Process 14(3):227–248
Article Google Scholar
Deutsch S, Schrammel J, Tscheligi M (2009) Comparing different layouts of tag clouds: Findings on visual perception. In: Workshop on Human-Computer Interaction and Visualization, pp. 23–37. Springer
Gansner ER, Hu Y, North SC (2013) Interactive visualization of streaming text data with dynamic maps. J Gr Algorithms Appl 17(4):515–540
Article MathSciNet Google Scholar
Haans RF, Pieters C, He ZL (2016) Thinking about u: theorizing and testing u-and inverted u-shaped relationships in strategy research. Strat Manag J 37(7):1177–1195
Article Google Scholar
Halvey MJ and Keane MT (2007) An assessment of tag presentation techniques. In: Proceedings of the 16th international conference on World Wide Web, pp. 1313–1314. ACM
Heimerl F, Lohmann S, Lange S, Ertl T (2014) Word cloud explorer: Text analytics based on word clouds. In: 2014 47th Hawaii International Conference on System Sciences, pp. 1833–1842. IEEE
King D, Janiszewski C (2011) The sources and consequences of the fluent processing of numbers. J Market Res 48(2):327–341
Article Google Scholar
Koh K, Lee B, Kim B, Seo J (2010) Maniwordle: providing flexible control over wordle. IEEE Trans Vis Comp Gr 16(6):1190–1197
Article Google Scholar
Krishna A (2012) An integrative review of sensory marketing: engaging the senses to affect perception, judgment and behavior. J Consum Psychol 22(3):332–351
Article Google Scholar
Li D, Mei H, Shen Y, Su S, Zhang W, Wang J, Zu M, Chen W (2018) Echarts: a declarative framework for rapid construction of web-based visualization. Vis Inf 2(2):136–146
Google Scholar
Li J, Chen S, Chen W, Andrienko G, Andrienko N (2020) Semantics-space-time cube: a conceptual framework for systematic analysis of texts in space and time. IEEE Trans Vis Comp Gr 26(4):1789–1806
Google Scholar
Liu X, Shen HW, Hu Y (2015) Supporting multifaceted viewing of word clouds with focus+ context display. Inf Vis 14(2):168–180
Article Google Scholar
Lohmann S, Heimerl F, Bopp F, Burch M, Ertl T (2015) Concentri cloud: Word cloud visualization for multiple text documents. In: 2015 19th International Conference on Information Visualisation, pp. 114–120. IEEE
Lohmann S, Ziegler J, Tetzlaff L (2009) Comparison of tag cloud layouts: Task-related performance and visual exploration. In: IFIP Conference on Human-Computer Interaction, pp. 392–404. Springer
Ma Y, Tung AK, Wang W, Gao X, Pan Z, Chen W (2018) Scatternet: a deep subjective similarity model for visual analysis of scatterplots. IEEE transactions on visualization and computer graphics
Mei H, Chen W, Ma Y, Guan H, Hu W (2018) Viscomposer: a visual programmable composition environment for information visualization. Vis Inf 2(1):71–81
Google Scholar
Pope D, Simonsohn U (2011) Round numbers as goals: evidence from baseball, sat takers, and the lab. Psychol Sci 22(1):71–79
Article Google Scholar
Rivadeneira AW, Gruen DM, Muller MJ, Millen DR (2007) Getting our head in the clouds: toward evaluation studies of tagclouds. In: Proceedings of the SIGCHI conference on Human factors in computing systems, pp. 995–998. ACM
Saravia E, Argueta C, Chen YS (2015) Emoviz: Mining the world’s interest through emotion analysis. In: Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2015, pp. 753–756. ACM
Schrammel J, Leitner M, Tscheligi M (2009) Semantically structured tag clouds: an empirical evaluation of clustered presentation approaches. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp. 2037–2040. ACM
Wang J (2012) Clustered layout word cloud for user generated online reviews. Ph.D. thesis, Virginia Tech
Wang J, Zhao J, Guo S, North C, Ramakrishnan N (2014) Recloud: semantics-based word cloud visualization of user reviews. In: Proceedings of Graphics Interface 2014, pp. 151–158. Canadian Information Processing Society
Wang X, Cui Z, Jiang L, Lu W, Li J (2020) Wordlenet: a visualization approach for relationship exploration in document collection. Tsinghua Sci Technol 25(3):384–400
Article Google Scholar
Wang Y, Chu X, Bao C, Zhu L, Deussen O, Chen B, Sedlmair M (2018) Edwordle: consistency-preserving word cloud editing. IEEE Trans Vis Comp Gr 24(1):647–656
Article Google Scholar
Wei Y, Mei H, Zhao Y, Zhou S, Lin B, Jiang H, Chen W (2020) Evaluating perceptual bias during geometric scaling of scatterplots. IEEE Trans Vis Comp Gr 26(1):321–331
Article Google Scholar
Wu Y, Provan T, Wei F, Liu S, Ma KL (2011) Semantic-preserving word clouds by seam carving. Computer graphics forum, vol 30. Wiley, New York, pp 741–750
Google Scholar
Xu J, Tao Y, Lin H (2016) Semantic word cloud generation based on word embeddings. In: 2016 IEEE Pacific Visualization Symposium (PacificVis), pp. 239–243. IEEE
Zhao Y, Luo F, Chen M, Wang Y, Xia J, Zhou F, Wang Y, Chen Y, Chen W (2018) Evaluating multi-dimensional visualizations for understanding fuzzy clusters. IEEE Trans Vis Comp Gr 25(1):12–21
Article Google Scholar

Download references

Acknowledgements

The work was partially supported by the National Key Research and Development Program of China (No. 2018YFC0831700) and the National Natural Science Foundation of China (No. 61972278).

Author information

Authors and Affiliations

College of Intelligence and Computing, Tianjin University, Tianjin, China
Lu Yang, Jie Li & Wenhuan Lu
Beijing Key Laboratory of Big Data Technology for Food Safety, Beijing Technology and Business University, Fangshan, China
Yi Chen
The University of Texas at Dallas, Richardson, USA
Kang Zhang
University of Science and Technology of China, Hefei, China
Yan Li

Authors

Lu Yang
View author publications
You can also search for this author in PubMed Google Scholar
Jie Li
View author publications
You can also search for this author in PubMed Google Scholar
Wenhuan Lu
View author publications
You can also search for this author in PubMed Google Scholar
Yi Chen
View author publications
You can also search for this author in PubMed Google Scholar
Kang Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Yan Li
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yan Li.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Yang, L., Li, J., Lu, W. et al. The influence of font scale on semantic expression of word cloud. J Vis 23, 981–998 (2020). https://doi.org/10.1007/s12650-020-00678-3

Download citation

Received: 14 October 2019
Revised: 27 May 2020
Accepted: 19 June 2020
Published: 22 July 2020
Issue Date: December 2020
DOI: https://doi.org/10.1007/s12650-020-00678-3

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions