Abstract
Credit scoring is a process of calculating the risk associated with an applicant on the basis of applicant’s credentials such as social status, financial status, etc. and it plays a vital role to improve cash flow for financial industry. However, the credit scoring dataset may have a large number of irrelevant or redundant features which leads to poorer classification performances and higher complexity. So, by removing redundant and irrelevant features may overcome the problem with huge number of features. This work emphasized on the role of feature selection and proposed a hybrid model by combining feature selection by utilizing Binary BAT optimization technique with a novel fitness function and aggregated with for Radial Basis Function Neural Network (RBFN) for credit score classification. Further, proposed feature selection approach is aggregated with Support Vector Machine (SVM) & Random Forest (RF), and other optimization approaches namely: Hybrid Particle Swarm Optimization and Gravitational Search Algorithm (PSOGSA), Hybrid Particle Swarm Optimization and Genetic Algorithm (PSOGA), Improved Krill Herd (IKH), Improved Cuckoo Search (ICS), Firefly Algorithm (FF) and Differential Evolution (DE) are also applied for comparative analysis.
Similar content being viewed by others
References
Abualigah L (2020) Multi-verse optimizer algorithm: a comprehensive survey of its results, variants, and applications. Neural Comput & Applic 32:12381–12401
Abualigah L, Diabat A (2020) A novel hybrid antlion optimization algorithm for multi-objective task scheduling problems in cloud computing environments. Clust Comput, 1–19
Abualigah LM, Khader AT (2017) Unsupervised text feature selection technique based on hybrid particle swarm optimization algorithm with genetic operators for the text clustering. J Supercomput 73(11):4773–4795
Abualigah LM, Khader AT, Hanandeh ES (2018) A combination of objective functions and hybrid krill herd algorithm for text document clustering analysis. Eng Appl Artif Intell 73:111–125
Abualigah LM, Khader AT, Hanandeh ES (2018) Hybrid clustering analysis using improved krill herd algorithm. Appl Intell 48(11):4047–4071
Abualigah LM, Khader AT, Hanandeh ES, Gandomi AH (2017) A novel hybridization strategy for krill herd algorithm applied to clustering techniques. Appl Soft Comput 60:423–435
Abualigah LMQ (2019) Feature selection and enhanced krill herd algorithm for text document clustering. Springer
Abualigah LMQ, Hanandeh ES (2015) Applying genetic algorithms to information retrieval using vector space model. Int J Comput Sci Eng Appl 5(1):19
Bequé A, Lessmann S (2017) Extreme learning machines for credit scoring: An empirical evaluation. Expert Syst Appl 86:42–53
Brahim AB, Limam M (2018) Ensemble feature selection for high dimensional data: a new method and a comparative study. ADAC 12(4):937–952
Broomhead DS, Lowe D (1988) Radial basis functions, multi-variable functional interpolation and adaptive networks. Tech rep Royal Signals and Radar Establishment Malvern (United Kingdom)
Brown I, Mues C (2012) An experimental comparison of classification algorithms for imbalanced credit scoring data sets. Expert Syst Appl 39(3):3446–3453
Chakravarthy H, Bachan P, Roshini P, Ch RK (2012) Bio inspired approach as a problem solving technique. Netw Complex Syst 2:14–21
Chen FL, Li FC (2010) Combination of feature selection approaches with svm in credit scoring. Expert Syst Appl 37(7):4902–4909
Chen W, Ma C, Ma L (2009) Mining the customer credit using hybrid support vector machine technique. Expert Syst Appl 36(4):7611–7616
Chi BW, Hsu CC (2012) A hybrid approach to integrate genetic algorithm into dual scoring model in enhancing the performance of credit scoring model. Expert Syst Appl 39(3):2650–2661
Deng S, Xiang Z, Zhao P, Taheri J, Gao H, Yin J, Zomaya AY (2020) Dynamical resource allocation in edge for trustable internet-of-things systems: a reinforcement learning method. IEEE Trans Ind Informat 16(9):6103–6113
Dong Y, Zhang Z, Hong WC (2018) A hybrid seasonal mechanism with a chaotic cuckoo search algorithm with a support vector regression model for electric load forecasting. Energies 11(4):1009
Edla DR, Tripathi D, Cheruku R, Kuppili V (2018) An efficient multi-layer ensemble framework with bpsogsa-based feature selection for credit scoring data analysis. Arab J Sci Eng 43(12):6909–6928
Gandomi AH, Alavi AH (2012) Krill herd: a new bio-inspired optimization algorithm. Commun Nonlinear Sci Numer Simul 17(12):4831–4845
Gao H, Kuang L, Yin Y, Guo B, Dou K (2020) ’Mining consuming behaviors with temporal evolution for personalized recommendation in mobile marketing apps. Proc. ACM/springer mobile netw appl.(MONET)
Gao H, Xu Y, Yin Y, Zhang W, Li R, Wang X (2019) Context-aware qos prediction with neural collaborative filtering for internet-of-things services. IEEE Internet Things J 7(5):4532–4542
Harris T (2015) Credit scoring using the clustered support vector machine. Expert Syst Appl 42(2):741–750
Hens AB, Tiwari MK (2012) Computational time reduction for credit scoring: an integrated approach based on support vector machine and stratified sampling method. Expert Syst Appl 39(8):6774–6781
Hong WC, Dong Y, Lai CY, Chen LY, Wei SY (2011) Svr with hybrid chaotic immune algorithm for seasonal load demand forecasting. Energies 4(6):960–977
Hong WC, Li MW, Geng J, Zhang Y (2019) Novel chaotic bat algorithm for forecasting complex motion of floating platforms. Appl Math Model 72:425–443
Hu Q, Yu D, Liu J, Wu C (2008) Neighborhood rough set based heterogeneous feature subset selection. Inf Sci 178(18):3577–3594
Hu Z, Bao Y, Xiong T, Chiong R (2015) Hybrid filter–wrapper feature selection for short-term load forecasting. Eng Appl Artif Intell 40:17–27
Huang CL, Chen MC, Wang CJ (2007) Credit scoring with a data mining approach based on support vector machines. Expert Syst Appl 33 (4):847–856
Huang CL, Dun JF (2008) A distributed pso–svm hybrid system with feature selection and parameter optimization. Appl Soft Comput 8(4):1381–1391
Huang CL, Wang CJ (2006) A ga-based feature selection and parameters optimizationfor support vector machines. Expert Syst Appl 31(2):231–240
Kala R, Vazirani H, Khanwalkar N, Bhattacharya M (2010) Evolutionary radial basis function network for classificatory problems. IJCSA 7(4):34–49
Kuppili V, Tripathi D, Reddy Edla D (2020) Credit score classification using spiking extreme learning machine. Comput Intell 36(2):402–426
Lee TS, Chen IF (2005) A two-stage hybrid credit scoring model using artificial neural networks and multivariate adaptive regression splines. Expert Syst Appl 28(4):743–752
Liang D, Tsai CF, Wu HT (2015) The effect of feature selection on financial distress prediction. Knowl-Based Syst 73:289–297
Liao X, Li K, Yin J (2017) Separable data hiding in encrypted image based on compressive sensing and discrete fourier transform. Multimed Tools Appl 76(20):20,739–20,753
Liao X, Yu Y, Li B, Li Z, Qin Z (2019) A new payload partition strategy in color image steganography. IEEE Trans Circ Sys Vid Tech 30 (3):685–696
Lichman M (2013) UCI Machine learning repository. http://archive.ics.uci.edu/ml
Lin SW, Ying KC, Chen SC, Lee ZJ (2008) Particle swarm optimization for parameter determination and feature selection of support vector machines. Expert Syst Appl 35(4):1817–1824
Maldonado S, Weber R, Basak J (2011) Simultaneous feature selection and classification using kernel-penalized support vector machines. Inf Sci 181 (1):115–128
Mester LJ, et al. (1997) What’s the point of credit scoring? Bus Rev 3:3–16
Mirjalili S, Mirjalili SM, Yang XS (2014) Binary bat algorithm. Neural Comput & Applic 25(3-4):663–681
Mirjalili S, Wang GG, Coelho LDS (2014) Binary optimization using hybrid particle swarm optimization and gravitational search algorithm. Neural Comput & Applic 25(6):1423–1435
Neumann F, Witt C (2013) Bioinspired computation in combinatorial optimization: algorithms and their computational complexity. In: Proceedings of the 15th annual conference companion on Genetic and evolutionary computation, pp 567–590
Oreski S, Oreski G (2014) Genetic algorithm-based heuristic for feature selection in credit risk assessment. Expert Syst Appl 41(4):2052–2064
Paleologo G, Elisseeff A, Antonini G (2010) Subagging for credit scoring models. Eur J Oper Res 201(2):490–499
Ping Y, Yongheng L (2011) Neighborhood rough set and svm based hybrid credit scoring classifier. Expert Syst Appl 38(9):11,300–11,304
Rosenblatt F (1961) Principles of neurodynamics. perceptrons and the theory of brain mechanisms. Tech rep, Cornell Aeronautical Lab Inc Buffalo NY
Rumelhart DE, Hinton GE, Williams RJ (1985) Learning internal representations by error propagation. Tech rep, California Univ San Diego La Jolla Inst for Cognitive Science
Shukla AK, Singh P, Vardhan M (2018) A two-stage gene selection method for biomarker discovery from microarray data for cancer classification. Chemometr Intell Lab Syst 183:47–58
Shukla AK, Tripathi D (2020) Detecting biomarkers from microarray data using distributed correlation based gene selection. Genes & Genomics 42:449–465
Soleimani H, Kannan G (2015) A hybrid particle swarm optimization and genetic algorithm for closed-loop supply chain network design in large-scale networks. Appl Math Model 39(14):3990–4012
Storn R, Price K (1997) Differential evolution–a simple and efficient heuristic for global optimization over continuous spaces. J Glob Optim 11(4):341–359
Tawfik AS, Badr AA, Abdel-Rahman IF (2013) One rank cuckoo search algorithm with application to algorithmic trading systems optimization. Int J Comput Appl 64(6):30–37
Too J, Abdullah AR, Mohd Saad N, Tee W (2019) Emg feature selection and classification using a pbest-guide binary particle swarm optimization. Computation 7(1):12
Tripathi D, Cheruku R, Bablani A (2018) Relative performance evaluation of ensemble classification with feature reduction in credit scoring datasets. In: Advances in machine learning and data science. Springer, pp 293–304
Tripathi D, Edla DR, Cheruku R (2018) Hybrid credit scoring model using neighborhood rough set and multi-layer ensemble classification. J Intell Fuzzy Syst 34(3):1543–1549
Tripathi D, Edla DR, Cheruku R, Kuppili V (2019) A novel hybrid credit scoring model based on ensemble feature selection and multilayer ensemble classification. Comput Intell 35(2):371–394
Tripathi D, Edla DR, Kuppili V, Bablani A, Dharavath R (2018) Credit scoring model based on weighted voting and cluster based feature selection. Procedia Comput Sci 132:22–31
Tsai CF (2009) Feature selection in bankruptcy prediction. Knowl-Based Syst 22(2):120–127
Tsai CF, Wu JW (2008) Using neural network ensembles for bankruptcy prediction and credit scoring. Expert Syst Appl 34(4):2639–2649
Van Hulse J, Khoshgoftaar TM, Napolitano A (2007) Experimental perspectives on learning from imbalanced data. In: Proceedings of the 24th international conference on machine learning. ACM, pp 935–942
Wang G, Hao J, Ma J, Jiang H (2011) A comparative assessment of ensemble learning for credit scoring. Expert Syst Appl 38(1):223–230
Wang J, Guo K, Wang S (2010) Rough set and tabu search based feature selection for credit scoring. Procedia Comput Sci 1(1):2425–2432
Wang J, Hedar AR, Wang S, Ma J (2012) Rough set and scatter search metaheuristic based feature selection for credit scoring. Expert Syst Appl 39(6):6123–6128
West D (2000) Neural network credit scoring models. Comput Oper Res 27(11):1131–1152
Wongchinsri P, Kuratach W (2017) Sr-based binary classification in credit scoring. In: 14th international conference on electrical engineering/electronics, computer, telecommunications and information technology (ECTI-CON), 2017. IEEE, pp 385–388
Yang XS (2010) A new metaheuristic bat-inspired algorithm. In: Nature inspired cooperative strategies for optimization (NICSO 2010). Springer, pp 65–74
Yegnanarayana B (2009) Artificial neural networks. PHI Learning Pvt Ltd
Yu Q, Miche Y, Lendasse A, Séverin E (2011) Bankruptcy prediction with missing data. In: Proceedings of 2011 international conference on data mining, Las Vegas, USA, pp 279–285
Zhang Y, Song XF, Gong DW (2017) A return-cost-based binary firefly algorithm for feature selection. Inf Sci 418:561–574
Zhang Z, Hong WC (2019) Electric load forecasting by complete ensemble empirical mode decomposition adaptive noise and support vector regression with quantum-based dragonfly algorithm. Nonlinear Dyn 98(2):1107–1136
Zhang Z, Hong WC, Li J (2020) Electric load forecasting by hybrid self-recurrent support vector regression model with variational mode decomposition and improved cuckoo search algorithm. IEEE Access 8:14,642–14,658
Zhou L, Lai KK, Yu L (2009) Credit scoring using support vector machines with direct search for parameters selection. Soft Comput 13(2):149
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Tripathi, D., Edla, D.R., Kuppili, V. et al. Binary BAT algorithm and RBFN based hybrid credit scoring model. Multimed Tools Appl 79, 31889–31912 (2020). https://doi.org/10.1007/s11042-020-09538-6
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-020-09538-6