Aberrant individuals’ effects on fit indices both of confirmatory factor analysis and polytomous IRT models

Şengül Avşar, Asiye

doi:10.1007/s12144-021-01563-4

Aberrant individuals’ effects on fit indices both of confirmatory factor analysis and polytomous IRT models

Published: 23 March 2021

Volume 42, pages 2157–2166, (2023)
Cite this article

Current Psychology Aims and scope Submit manuscript

Asiye Şengül Avşar ORCID: orcid.org/0000-0001-5522-2514¹

508 Accesses
1 Citation
Explore all metrics

Abstract

Validity and reliability are important psychometric properties for specialists who try to understand human behavior with measurement tools. There are many factors that affect psychometric properties, and one of them is aberrant response behavior. Person-fit statistics are one way to detect aberrant individuals. The general purpose of the study was to specify the effects of possible aberrant individuals on fit indices in the Confirmatory Factor Analysis and their effects on estimated parameters by using the Mokken Homogeneity Model and the Graded Response Model. Satisfaction with Life Scale (SWLS) and the Positive and Negative Affect Scale (PANAS) were used in accordance with the purpose of this research. Flagged individuals were determined according to parametric and non-parametric person-fit statistics for both SWLS and PANAS. These individuals were removed from the data sets, and the trimmed data sets were created. Trimmed and untrimmed data sets were compared. It is seen that individuals who were flagged as aberrant generally affected the statistical results. Especially from the data sets trimmed by the parametric person-fit statistic more discriminative items were obtained. According to the research results, searching for aberrant responses can be suggested in scale development and scale adaptation studies.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

The Measurement of Subjective Wellbeing: Item-Order Effects in the Personal Wellbeing Index—Adult

Article 19 November 2016

Melissa K. Weinberg, Catherine Seton & Nikki Cameron

Psychometric Properties of the Persian Generalized Trust Scale: Confirmatory Factor Analysis and Rasch Models and Relationship with Quality of Life, Happiness, and Depression

Article 20 May 2020

Chung-Ying Lin, Vida Imani, … Amir H. Pakpour

A solution for factorial validity testing of three-item scales: An example of tau-equivalent strict measurement invariance of three-item loneliness scale

Article Open access 07 March 2021

Stanisław K. Czerwiński & Paweł Andrzej Atroszko

Data Availability

The data that support the findings of this study are available from LISS (Longitudinal Internet Studies for the Social Sciences) panel administered by CentERdata (Tilburg University, The Netherlands) but restrictions apply to the availability of these data, which were used under license for the current study and so they are not publicly available. Data are however available from the author(s) upon reasonable request and with permission of LISS.

References

Chalmers, P. (2012). Mirt: A multidimensional item response theory package for the r environment. Journal of Statistical Software, 48(6), 1–29. https://doi.org/10.18637/jss.v048.i06.
Article Google Scholar
Conijn, J. M., Emons, W. H., & Sijtsma, K. (2014). Statistic lz-based person-fit methods for noncognitive multiscale measures. Applied Psychological Measurement, 38(2), 122–136. https://doi.org/10.1177/0146621613497568.
Article Google Scholar
Conijn, J. M., Emons, W. H., De Jong, K., & Sijtsma, K. (2015). Detecting and explaining aberrant responding to the outcome Questionnaire-45. Assessment, 22(4), 513–524. https://doi.org/10.1177/1073191114560882.
Article PubMed Google Scholar
Crișan, D. R., Tendeiro, J., & Meijer, R. (2019, September 24). The crit value as an effect size measure for violations of model assumptions in Mokken scale analysis for binary data. https://doi.org/10.31234/osf.io/8ydmr.
Crocker, L., & Algina, J. (1986). Introduction to classical and modern test theory. Holt: Rinehart and Winston.
Google Scholar
Curtis, D. D. (2004). Person misfit in attitude surveys: Influences, impacts and implications. International Education Journal, 5(2), 125–143 https://files.eric.ed.gov/fulltext/EJ903843.pdf.
Google Scholar
De Ayala, R. J. (2009). The theory and practice of item response theory. New York: Guilford Publications.
Google Scholar
De Koning, E., Sijtsma, K., & Hamers, J. H. M. (2002). Comparison of four irt models when analyzing two tests for inductive reasoning. Applied Psychological Measurement, 26(3), 302–320. https://doi.org/10.1177/0146621602026003005.
Article Google Scholar
Diener, E. D., Emmons, R. A., Larsen, R. J., & Griffin, S. (1985). The satisfaction with life scale. Journal of Personality Assessment, 49(1), 71–75. https://doi.org/10.1207/s15327752jpa4901_13.
Article PubMed Google Scholar
DiStefano, C., & Hess, B. (2005). Using confirmatory factor analysis for construct validation: An empirical review. Journal of Psychoeducational Assessment, 23(3), 225–241. https://doi.org/10.1177/073428290502300303.
Article Google Scholar
Egberink, I. J. L. (2010). Applications of item response theory to non-cognitive data. [doctoral thesis]. Groningen, The Netherlands: University of Groningen.
Google Scholar
Embretson, S. E., & Reise, S. P. (2000). Item response theory for psychologists. New Jersey, NJ: Lawrence Erlbaum Associates.
Google Scholar
Emons, W. H. M. (2008). Nonparametric person-fit analysis of polytomous item scores. Applied Psychological Measurement, 32(3), 224–247. https://doi.org/10.1177/0146621607302479.
Article Google Scholar
Ferrando, P. J. (2012). Assessing inconsistent responding in e and n measures: An application of person-fit analysis in personality. Personality and Individual Differences, 52(6), 718–722. https://doi.org/10.1016/j.paid.2011.12.036.
Article Google Scholar
Ferrando, P. J., & Chico, E. (2001). Detecting dissimulation in personality test scores: A comparison between person-fit indices and detection scales. Educational and Psychological Measurement, 61(6), 997–1012. https://doi.org/10.1177/00131640121971617.
Article Google Scholar
Floyd, F. J., & Widaman, K. F. (1995). Factor analysis in the development and refinement of clinical assessment instruments. Psychological Assessment, 7(3), 286–299. https://doi.org/10.1037/1040-3590.7.3.286.
Article Google Scholar
Glas, C. A. W., & Khalid, N. (2018). Person fit. In W. J. van der Linden (Ed.), Handbook of item response theory volume three applications (pp. 107–126). Boca Raton, FL: CRC Press/Taylor & Francis Group.
Glass, C. A. W., & Dagohoy, A. V. T. (2007). A person-fit test for irt models for polytomous items. Psychometrika, 72(2), 159–180. https://doi.org/10.1007/s11336-003-1081-5.
Article Google Scholar
Guo, H., Rios, J. A., Haberman, S., Liu, O. L., Wang, J., & Paek, I. (2016). A new procedure for detection of students’ rapid guessing responses using response time. Applied Measurement in Education, 29(3), 173–183. https://doi.org/10.1080/08957347.2016.1171766.
Article Google Scholar
Hambleton, R. K., Swaminathan, H., & Rogers, H. J. (1991). Fundamentals of item response theory. Newbury Park, CA: Sage publications.
Google Scholar
Hambleton, R. K., van der Linden, W. J., & Wells, C. S. (2011). IRT models for the analysis of polytomous scored data: Brief and selected history of model building advances. In M. L. Nering & R. Ostini (Eds.), Handbook of polytomous item response theory models (pp. 21–42). New York: Routledge.
Google Scholar
Hemker, B. T., Sijtsma, K., & Molenaar, I. W. (1995). Selection of unidimensional scales from a multidimensional item bank in the polytomous Mokken IRT model. Applied Psychological Measurement, 19(4), 337–352. https://doi.org/10.1177/014662169501900404.
Article Google Scholar
Hopfenbeck, T. N., & Maul, A. (2011). Examining evidence for the validity of PISA learning strategy scales based on student response processes. International Journal of Testing, 11(2), 95–121. https://doi.org/10.1080/15305058.2010.529977.
Article Google Scholar
Karabatsos, G. (2003). Comparing the aberrant response detection performance of thirty-six person-fit statistics. Applied Measurement in Education, 16(4), 277–298. https://doi.org/10.1207/S15324818AME1604_2.
Article Google Scholar
Kline, R. B. (2011). Principles and practice of structural equation modeling. New York: Guilford publications.
Google Scholar
Liu, T., Sun, Y., Li, Z., & Xin, T. (2019). The impact of aberrant response on reliability and validity. Measurement: Interdisciplinary Research and Perspectives, 17(3), 133–142. https://doi.org/10.1080/15366367.2019.1584848.
Article Google Scholar
Meijer, R. R. (1996). Person-fit research: An introduction. Applied Measurement in Education, 9(1), 3–8. https://doi.org/10.1207/s15324818ame0901_2.
Article Google Scholar
Meijer, R. R. (1997). Person fit and criterion-related validity: An extension of the Schmitt, Cortina, and Whitney study. Applied Psychological Measurement, 21(2), 99–113. https://doi.org/10.1177/01466216970212001.
Article Google Scholar
Meijer, R. R. (2003). Diagnosing item score patterns on a test using item response theory-based person-fit statistics. Psychological Methods, 8(1), 72–87. https://doi.org/10.1037/1082-989X.8.1.72.
Article PubMed Google Scholar
Meijer, R. R., & Nering, M. L. (1997). Trait level estimation for nonfitting response vectors. Applied Psychological Measurement, 21(4), 321–336. https://doi.org/10.1177/01466216970214003.
Article Google Scholar
Meijer, R. R., & Tendeiro, J. N. (2012). The use of the l_z and l_z^* person-fit statistics and problems derived from model misspecification. Journal of Educational and Behavioral Statistics, 37(6), 758–766. https://doi.org/10.3102/1076998612466144.
Article Google Scholar
Meijer, R. R., Egberink, I. J., Emons, W. H., & Sijtsma, K. (2008). Detection and validation of unscalable item score patterns using item response theory: An illustration with Harter's self-perception profile for children. Journal of Personality Assessment, 90(3), 227–238. https://doi.org/10.1080/00223890701884921.
Article PubMed Google Scholar
Mertler, C. A., & Reinhart, R. V. (2017). Advanced and multivariate statistical methods: practical application and interpretation. New York: Taylor & Francis Group.
Messick, S. (1994). Alternative modes of assessment, uniform standards of validity 1. (ETS research report series). https://doi.org/10.1002/j.2333-8504.1994.tb01634.x.
Mokken, R. J. (1971). A theory and procedure of scale analysis: With applications in political research. The Hague: Mouton.
Book Google Scholar
Mokken, R. J., Lewis, C., & Sijtsma, K. (1986). Rejoinder to “the Mokken scale: A critical discussion”. Applied Psychological Measurement, 10(3), 279–285. https://doi.org/10.1177/014662168601000306.
Article Google Scholar
Reise, S. P., & Revicki, D. A. (2015). Introduction: Age-old problems and modern solutions. In S. P. Reise, & D. A. Revicki (Eds.), Handbook of item response theory modeling: Applications to typical performance assessment (pp. 3–12). New York: Taylor & Francis Group.
Reise, S. P., & Waller, N. G. (1993). Traitedness and the assessment of response pattern scalability. Journal of Personality and Social Psychology, 65(1), 143–151. https://doi.org/10.1037/0022-3514.65.1.143.
Article Google Scholar
Rosseel, Y. (2012). Lavaan: An r package for structural equation modeling. Journal of Statistical Software, 48(2), 1–26. https://doi.org/10.18637/jss.v048.i02.
Article Google Scholar
Schermelleh-Engel, K., Moosbrugger, H., & Müller, H. (2003). Evaluating the fit of structural equation models: Tests of significance and descriptive goodness-of-fit measures. Methods of Psychological Research Online, 8(2), 23–74.
Google Scholar
Şengül-Avşar, A. (2019). Comparison of person-fit statistics for polytomous items in different test conditions. Journal of Measurement and Evaluation in Education and Psychology, 10(4), 348–364. https://doi.org/10.21031/epod.525647.
Article Google Scholar
Seo, D. G., & Weiss, D. J. (2013). lz person-fit index to identify misfit students with achievement test data. Educational and Psychological Measurement, 73(6), 994–1016. https://doi.org/10.1177/0013164413497015.
Article Google Scholar
Sijtsma, K. (1998). Methodology review: Nonparametric irt approaches to the analysis of dichotomous item scores. Applied Psychological Measurement, 22(1), 3–31. https://doi.org/10.1177/01466216980221001.
Article Google Scholar
Sijtsma, K., & Molenaar, I. W. (2002). Introduction to nonparametric item response theory. Thousand Oaks, CA: Sage Publications.
Book Google Scholar
Stone, C. A., & Zhu, X. (2015). Bayesian analysis of item response theory models using SAS. Cary, NC: SAS Institute Inc..
Google Scholar
Syu, J. J. (2013). Applying person fit-in faking detection-the simulation and practice of nonparametric item response theory. [doctoral dissertation, National Chengchi University]. http://nccur.lib.nccu.edu.tw/bitstream/140.119/58646/1/251501.pdf
Tendeiro, J. N., & Meijer, R. R. (2014). Detection of invalid test scores: The usefulness of simple nonparametric statistics. Journal of Educational Measurement, 51(3), 239–259. https://doi.org/10.1111/jedm.12046.
Article Google Scholar
Tendeiro, J. N., Meijer, R. R., & Niessen, M. S. A. (2016). PerFit: An r package for person-fit analysis in irt. Journal of Statistical Software, 74(5), 1–27. https://doi.org/10.18637/jss.v074.i05.
Article Google Scholar
Toland, M. D. (2014). Practical guide to conducting an item response theory analysis. The Journal of Early Adolescence, 34(1), 120–151. https://doi.org/10.1177/0272431613511332.
Article Google Scholar
Van der Ark, L. A. (2012). New developments in Mokken scale analysis in r. Journal of Statistical Software, 48(5), 1–26. https://doi.org/10.18637/jss.v048.i05.
Article Google Scholar
VandenBos, G. R. (2015). APA dictionary of psychology (2nd ed.). Washington DC: American Psychiatric Pub. https://doi.org/10.1037/14646-000.
Book Google Scholar
Walker, A. A., & Wind, S. A. (2020). Identifying misfitting achievement estimates in performance assessments: An illustration using Rasch and Mokken scale analyses. International Journal of Testing, 20(3), 231–251. https://doi.org/10.1080/15305058.2019.1673758.
Article Google Scholar
Watson, D., Clark, L. A., & Tellegen, A. (1988). Development and validation of brief measures of positive and negative affect: The PANAS scales. Journal of Personality and Social Psychology, 54(6), 1063–1070. https://doi.org/10.1037/0022-3514.54.6.1063.
Article PubMed Google Scholar
Wise, S. L. (2015). Effort analysis: Individual score validation of achievement test data. Applied Measurement in Education, 28(3), 237–252. https://doi.org/10.1080/08957347.2015.1042155.
Article Google Scholar
Zanon, C., Hutz, C. S., Yoo, H. H., & Hambleton, R. K. (2016). An application of item response theory to psychological test development. Psicologia: Reflexão e Crítica, 29(1), 1–10. https://doi.org/10.1186/s41155-016-0040-x.
Article Google Scholar

Download references

Acknowledgments

In this paper we make use of data of the LISS (Longitudinal Internet Studies for the Social Sciences) panel administered by CentERdata (Tilburg University, The Netherlands). The author sincerely thanks the LISS panel for sharing data sets.

Author information

Authors and Affiliations

Faculty of Education, Recep Tayyip Erdoğan University, Rize, Turkey
Asiye Şengül Avşar

Authors

Asiye Şengül Avşar
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Asiye Şengül Avşar.

Ethics declarations

Ethical Approval

Data that were used in this study are from a publicly available database. As such, we have depended on the original researchers to comply with the 1964 Declaration of Helsinki and its later addenda and to obtain informed consent from their subjects. We had no interaction with human subjects in the current project.

Informed Consent

Informed consent is not required in this study.

Conflict of Interest

The author declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

ESM 1

(DOCX 51 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Şengül Avşar, A. Aberrant individuals’ effects on fit indices both of confirmatory factor analysis and polytomous IRT models. Curr Psychol 42, 2157–2166 (2023). https://doi.org/10.1007/s12144-021-01563-4

Download citation

Accepted: 02 March 2021
Published: 23 March 2021
Issue Date: January 2023
DOI: https://doi.org/10.1007/s12144-021-01563-4

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Aberrant individuals’ effects on fit indices both of confirmatory factor analysis and polytomous IRT models

Abstract

Access this article

Similar content being viewed by others

The Measurement of Subjective Wellbeing: Item-Order Effects in the Personal Wellbeing Index—Adult

Psychometric Properties of the Persian Generalized Trust Scale: Confirmatory Factor Analysis and Rasch Models and Relationship with Quality of Life, Happiness, and Depression

A solution for factorial validity testing of three-item scales: An example of tau-equivalent strict measurement invariance of three-item loneliness scale

Data Availability

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Ethical Approval

Informed Consent

Conflict of Interest

Additional information

Publisher’s Note

Supplementary Information

ESM 1

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Aberrant individuals’ effects on fit indices both of confirmatory factor analysis and polytomous IRT models

Abstract

Access this article

Similar content being viewed by others

The Measurement of Subjective Wellbeing: Item-Order Effects in the Personal Wellbeing Index—Adult

Psychometric Properties of the Persian Generalized Trust Scale: Confirmatory Factor Analysis and Rasch Models and Relationship with Quality of Life, Happiness, and Depression

A solution for factorial validity testing of three-item scales: An example of tau-equivalent strict measurement invariance of three-item loneliness scale

Data Availability

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Ethical Approval

Informed Consent

Conflict of Interest

Additional information

Publisher’s Note

Supplementary Information

ESM 1

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation