Research Article

Visual Model Fit Estimation in Scatterplots and Distribution of Attention

Influence of Slope and Noise Level

Daniel Reimann

https://orcid.org/0000-0003-4687-8858

Department of Psychology, FernUniversität in Hagen, Hagen, Germany

Search for more papers by this author

Christine Blech

Department of Psychology, FernUniversität in Hagen, Hagen, Germany

Search for more papers by this author

, and

Robert Gaschler

Department of Psychology, FernUniversität in Hagen, Hagen, Germany

Search for more papers by this author

Published Online:December 04, 2020https://doi.org/10.1027/1618-3169/a000499

Abstract

Abstract. Scatterplots are ubiquitous data graphs and can be used to depict how well data fit to a quantitative theory. We investigated which information is used for such estimates. In Experiment 1 (N = 25), we tested the influence of slope and noise on perceived fit between a linear model and data points. Additionally, eye tracking was used to analyze the deployment of attention. Visual fit estimation might mimic one or the other statistical estimate: If participants were influenced by noise only, this would suggest that their subjective judgment was similar to root mean square error. If slope was relevant, subjective estimation would mimic variance explained. While the influence of noise on estimated fit was stronger, we also found an influence of slope. As most of the fixations fell into the center of the scatterplot, in Experiment 2 (N = 51), we tested whether location of noise affects judgment. Indeed, high noise influenced the judgment of fit more strongly if it was located in the middle of the scatterplot. Visual fit estimates seem to be driven by the center of the scatterplot and to mimic variance explained.

References

Bergstrom, C. T., & West, J. D. (2018). Why scatter plots suggest causality, and what we can do about it. ArXiv. https://arxiv.org/abs/1809.09328 First citation in article Google Scholar
Bindemann, M. (2010). Scene and screen center bias early eye movements in scene viewing. Vision Research, 50(23), 2577–2587. 10.1016/j.visres.2010.08.016 First citation in article Crossref Medline, Google Scholar
Bobko, P., & Karren, R. (1979). The perception of Pearson product moment correlations from bivariate scatterplots. Personnel Psychology, 32(2), 313–325. 10.1111/j.1744-6570.1979.tb02137.x First citation in article Crossref, Google Scholar
Bogen, J., & Woodward, J. (1992). Observations, theories and the evolution of the human spirit, Philosophy of Science, 59(4), 590–611. 10.1086/289697 First citation in article Crossref, Google Scholar
Brewer, W. F. (2012). The theory ladenness of the mental processes used in the scientific enterprise: Evidence from cognitive psychology and the history of science. In R. W. ProctorE. J. Capaldi (Eds.), Psychology of science: Implicit and explicit processes psychology of science: Implicit and explicit processes (pp. 289–334). Oxford University Press. 10.1093/acprof:oso/9780199753628.003.0013 First citation in article Crossref, Google Scholar
Cleveland, W. S., Diaconis, P., & McGill, R. (1982). Variables on scatterplots look more highly correlated when the scales are increased. Science, 216(4550), 1138–1141. 10.1126/science.216.4550.1138 First citation in article Crossref Medline, Google Scholar
Cohen, J. (1988). Statistical power analysis for the behavioral sciences. Routledge. 10.4324/9780203771587 First citation in article Crossref, Google Scholar
Doherty, M. E., & Anderson, R. B. (2009). Variation in scatterplot displays. Behavior Research Methods, 41(1), 55–60. 10.3758/BRM.41.1.55 First citation in article Crossref Medline, Google Scholar
Evans, N. J., Brown, S. D., Mewhort, D. J. K., & Heathcote, A. (2018). Refining the law of practice. Psychological Review, 125(4), 592–605. 10.1037/rev0000105 First citation in article Crossref Medline, Google Scholar
Faul, F., Erdfelder, E., Buchner, A., & Lang, A.-G. (2009). Statistical power analyses using G*Power 3.1: Tests for correlation and regression analyses. Behavior Research Methods, 41(4), 1149–1160. 10.3758/brm.41.4.1149 First citation in article Crossref Medline, Google Scholar
Friendly, M., & Denis, D. (2005). The early origins and development of the scatterplot. Journal of the History of the Behavioral Sciences, 41(2), 103–130. 10.1002/jhbs.20078 First citation in article Crossref Medline, Google Scholar
Godau, C., Vogelgesang, T., & Gaschler, R. (2016). Perception of bar graphs—A biased impression? Computers in Human Behavior, 59, 67–73. 10.1016/j.chb.2016.01.036 First citation in article Crossref, Google Scholar
Heathcote, A., Brown, S., & Mewhort, D. J. K. (2000). The power law repealed: The case for an exponential law of practice. Psychonomic Bulletin & Review, 7(2), 185–207. 10.3758/BF03212979 First citation in article Crossref Medline, Google Scholar
Kubovy, M., & van den Berg, M. (2008). The whole is equal to the sum of its parts: A probabilistic model of grouping by proximity and similarity in regular patterns. Psychological Review, 115(1), 131–154. 10.1037/0033-295X.115.1.131 First citation in article Crossref Medline, Google Scholar
Lane, D. M., Anderson, C. A., & Kellam, K. L. (1985). Judging the relatedness of variables: The psychophysics of covariation detection. Journal of Experimental Psychology: Human Perception and Performance, 11(5), 640–649. 10.1037/0096-1523.11.5.640 First citation in article Crossref, Google Scholar
Lauer, T. W., & Post, G. V. (1989). Density in scatterplots and the estimation of correlation. Behaviour & Information Technology, 8(3), 235–244. 10.1080/01449298908914554 First citation in article Crossref, Google Scholar
Masson, M. E. J., & Loftus, G. R. (2003). Using confidence intervals for graphically based data interpretation. Canadian Journal of Experimental Psychology, 57(3), 203–220. 10.1037/h0087426 First citation in article Crossref Medline, Google Scholar
Meyer, J., & Shinar, D. (1992). Estimating correlations from scatterplots. Human Factors: The Journal of the Human Factors and Ergonomics Society, 34(3), 335–349. 10.1177/001872089203400307 First citation in article Crossref, Google Scholar
Meyer, J., Taieb, M., & Flascher, I. (1997). Correlation estimates as perceptual judgments. Journal of Experimental Psychology: Applied, 3(1), 3–20. 10.1037/1076-898X.3.1.3 First citation in article Crossref, Google Scholar
Osborne, J. W., & Waters, E. (2002). Four assumptions of multiple regression that researchers should always test. Practical Assessment, Research and Evaluation, 8(2), 1–5. 10.7275/r222-hv23 First citation in article Crossref, Google Scholar
Palmeri, T. J. (1999). Theories of automaticity and the power law of practice. Journal of Experimental Psychology: Learning, Memory, and Cognition, 25(2), 543–551. 10.1037/0278-7393.25.2.543 First citation in article Crossref, Google Scholar
Pitt, M. A., Myung, I. J., & Zhang, S. (2002). Toward a method of selecting among computational models of cognition. Psychological Review, 109(3), 472–491. 10.1037/0033-295x.109.3.472 First citation in article Crossref Medline, Google Scholar
Reimann, D. (2019). Visual model fit estimation in scatterplots and distribution of attention: Influence of slope and noise level. 10.17605/OSF.IO/TG62S First citation in article Crossref, Google Scholar
Rensink, R. A. (2017). The nature of correlation perception in scatterplots. Psychonomic Bulletin & Review, 24(3), 776–797. 10.3758/s13423-016-1174-7 First citation in article Crossref Medline, Google Scholar
Rensink, R. A., & Baldridge, G. (2010). The perception of correlation in scatterplots. Computer Graphics Forum, 29(3), 1203–1210. 10.1111/j.1467-8659.2009.01694.x First citation in article Crossref, Google Scholar
Roberts, S., & Pashler, H. (2000). How persuasive is a good fit? A comment on theory testing. Psychological Review, 107(2), 358–367. 10.1037/0033-295x.107.2.358 First citation in article Crossref Medline, Google Scholar
Sarikaya, A., & Gleicher, M. (2018). Scatterplots: Tasks, data, and designs. IEEE Transactions on Visualization and Computer Graphics, 24(1), 402–412. 10.1109/TVCG.2017.2744184 First citation in article Crossref Medline, Google Scholar
Schnotz, W., & Bannert, M. (2003). Construction and interference in learning from multiple representation. Learning and Instruction, 13(2), 141–156. 10.1016/S0959-4752(02)00017-8 First citation in article Crossref, Google Scholar
Schunn, C., & Wallach, D. (2005). Evaluating goodness-of-fit in comparison of models to data. In W. Tack (Ed.), Psychologie der Kognition: Reden und Vorträge anlässlich der Emeritierung von Werner Tack (pp. 115–154). University of Saarland Press. First citation in article Google Scholar
Smith, L. D., Best, L. A., Stubbs, D. A., Johnston, J., & Archibald, A. B. (2000). Scientific graphs and the hierarchy of the sciences: A Latourian survey of inscription practices. Social Studies of Science, 30(1), 73–94. 10.1177/030631200030001003 First citation in article Crossref, Google Scholar
Smith, L. D., Best, L. A., Stubbs, D. A., Archibald, A. B., & Roberson-Nay, R. (2002). Constructing knowledge: The role of graphs and tables in hard and soft psychology. American Psychologist, 57(10), 749–761. 10.1037/0003-066X.57.10.749 First citation in article Crossref Medline, Google Scholar
Strahan, R. F., & Hansen, C. J. (1978). Underestimating correlation from scatterplots. Applied Psychological Measurement, 2(4), 543–550. 10.1177/014662167800200409 First citation in article Crossref, Google Scholar
Tatler, B. W. (2007). The central fixation bias in scene viewing: Selecting an optimal viewing position independently of motor biases and image feature distributions. Journal of Vision, 7(14), 4, 1–20. 10.1167/7.14.4 First citation in article Crossref Medline, Google Scholar
Wagenmakers, E.-J. (2003). How many parameters does it take to fit an elephant? Journal of Mathematical Psychology, 47(5–6), 580–586. 10.1016/S0022-2496(03)00064-6 First citation in article Crossref, Google Scholar
Wickham, H., Cook, D., & Hofmann, H. (2015). Visualizing statistical models: Removing the blindfold. Statistical Analysis and Data Mining: The ASA Data Science Journal, 8(4), 203–225. 10.1002/sam.11271 First citation in article Crossref, Google Scholar
Wixted, J. T., & Ebbesen, E. B. (1997). Genuine power curves in forgetting: A quantitative analysis of individual subject forgetting functions. Memory & Cognition, 25(5), 731–739. 10.3758/BF03211316 First citation in article Crossref Medline, Google Scholar
Yang, F., Harrison, L. T., Rensink, R. A., Franconeri, S. L., & Chang, R. (2019). Correlation judgment and visualization features: A comparative study. IEEE Transactions on Visualization and Computer Graphics, 25(3), 1474–1488. 10.1109/TVCG.2018.2810918 First citation in article Crossref Medline, Google Scholar

Volume 67Issue 5September 2020

ISSN: 1618-3169eISSN: 2190-5142

History

ReceivedDecember 4, 2019
RevisedAugust 19, 2020
AcceptedOctober 12, 2020
Published onlineDecember 4, 2020

Licenses & Copyright

Keywords

Open Data:

The raw data underlying the findings reported in the article as well as the materials are available at the public data repository https://osf.io/tg62s/

PDF download

Verify Phone

Congrats!

Visual Model Fit Estimation in Scatterplots and Distribution of Attention

Influence of Slope and Noise Level

Abstract

References

History

Licenses & Copyright

Support & Contact

Support & Contact

Legal information

Legal information

More offers

More offers

Our partners

Our partners

Change Password

Your password must have 8 characters or more and contain 3 of the following:

Password Changed Successfully

Create a new account

Request Username

Verify Phone

Congrats!

Visual Model Fit Estimation in Scatterplots and Distribution of Attention

Influence of Slope and Noise Level

Abstract

References

History

Licenses & Copyright

Support & Contact

Support & Contact

Legal information

Legal information

More offers

More offers

Our partners

Our partners