Credit Scoring Based on the Set-Valued Identification Method

Journal of Systems Science and Complexity

Abstract

Credit scoring is one of the key problems in financial risk management. This paper studies the credit scoring problem using the set-valued identification method, which is used to explain the relation between individual attribute vectors and the classification of borrowers as creditworthy or non-creditworthy. In particular, the system parameters are estimated by the set-valued identification algorithm under a given recognition criterion. To illustrate the efficiency of the proposed method, practical experiments are conducted on credit card applicants from Australia and credit card holders from Taiwan. The empirical results show that the set-valued model achieves higher prediction accuracy than the logistic regression model on both small and large data sets. Furthermore, the parameters estimated by the set-valued identification method are more stable, which provides a meaningful and logical explanation of the factors that influence borrowers' credit scores.
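The abstract does not state the model equations, so the following is only a minimal sketch of the general idea, not the paper's algorithm. It assumes the binary-observation form often used in the set-valued identification literature: the observed label is s = 1 when φᵀθ + d ≤ C and s = 0 otherwise, where φ is the attribute vector, θ the unknown parameter, C a fixed threshold, and d standard normal noise. The Gaussian noise, the threshold C = 0, the plain gradient-ascent estimator, the synthetic data, and all function names (`normal_cdf`, `prob_one`, `fit`) are assumptions made for illustration.

```python
import math
import random


def normal_cdf(x):
    """Standard normal CDF, computed with the error function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def normal_pdf(x):
    """Standard normal density."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)


def prob_one(theta, phi, threshold=0.0):
    """P(s = 1 | phi) = P(phi.theta + d <= threshold), assuming d ~ N(0, 1)."""
    z = threshold - sum(t * p for t, p in zip(theta, phi))
    return normal_cdf(z)


def fit(features, labels, threshold=0.0, lr=0.5, epochs=300):
    """Estimate theta by gradient ascent on the average log-likelihood.

    This is a stand-in estimator for illustration only; it is not the
    set-valued identification algorithm described in the paper.
    """
    dim = len(features[0])
    n = len(features)
    theta = [0.0] * dim
    for _ in range(epochs):
        grad = [0.0] * dim
        for phi, s in zip(features, labels):
            z = threshold - sum(t * p for t, p in zip(theta, phi))
            # Clamp the CDF away from 0 and 1 to avoid division by zero.
            cdf = min(max(normal_cdf(z), 1e-9), 1.0 - 1e-9)
            pdf = normal_pdf(z)
            # Gradient of s*log(F(z)) + (1-s)*log(1-F(z)), with dz/dtheta = -phi.
            weight = -s * pdf / cdf + (1 - s) * pdf / (1.0 - cdf)
            for j in range(dim):
                grad[j] += weight * phi[j]
        theta = [t + lr * g / n for t, g in zip(theta, grad)]
    return theta


# Synthetic data (hypothetical, not the Australian or Taiwanese data sets).
random.seed(0)
true_theta = [1.0, -2.0]
features = [[random.gauss(0, 1), random.gauss(0, 1)] for _ in range(400)]
labels = [
    1 if sum(t * p for t, p in zip(true_theta, phi)) + random.gauss(0, 1) <= 0.0 else 0
    for phi in features
]

theta_hat = fit(features, labels)
predicted = [1 if prob_one(theta_hat, phi) >= 0.5 else 0 for phi in features]
accuracy = sum(p == s for p, s in zip(predicted, labels)) / len(labels)
print("estimated theta:", theta_hat)
print("training accuracy:", accuracy)
```

Under these assumptions, the signs and relative sizes of the estimated parameters indicate which attributes push an observation across the threshold, which is the kind of interpretability the abstract attributes to the set-valued model.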



Acknowledgment

We thank the anonymous researcher and I-Cheng Yeh, of the Department of Information Management, Chung Hua University, Taiwan, and the Department of Civil Engineering, Tamkang University, Taiwan, for sharing the data in the UCI Machine Learning Repository. We are also grateful to Natalino Busa, chief data officer of Teko Ventures, for his tutorial on data analysis and credit scoring prediction.

Author information

Corresponding author

Correspondence to Yanlong Zhao.

Additional information

This research was supported by the National Key R&D Program of China under Grant No. 2018YFA0703800, the National Natural Science Foundation of China under Grant No. 61622309, and the Verg Foundation (Sweden).

This paper was recommended for publication by Editor LIU Yungang.

About this article

Cite this article

Wang, X., Hu, M., Zhao, Y. et al. Credit Scoring Based on the Set-Valued Identification Method. J Syst Sci Complex 33, 1297–1309 (2020). https://doi.org/10.1007/s11424-020-9101-4

  • DOI: https://doi.org/10.1007/s11424-020-9101-4
