Abstract
As there is a rapid growth in healthcare systems and biomedical data. Machine learning algorithms are utilized in many researches for predicting the risk of the diseases. The major intuition of the present paper is to plan for a novel methodology for multi-disease prediction using deep learning. The overall prediction methodology involves several steps such as “(a) Data Acquisition, (b) Optimal Feature selection, (c) Statistical feature Extraction, and (d) prediction”. In the initial step, the medical datasets of diverse diseases is gathered from multiple benchmark sources. Further, the optimal feature selection is applied to the available set of attributes. This is accomplished by hybridizing two meta-heuristic algorithms such as Lion Algorithm (LA), and Butterfly Optimization Algorithm (BOA). In these prediction algorithms, the hidden neuron count of NN and DBN is finely tuned or optimized by the same hybrid Lion-based BOA (L-BOA). The experimental evaluation of various medical datasets validates that the prediction rate of the developed model outperforms several traditional methods.
Similar content being viewed by others
Abbreviations
- LA:
-
Lion algorithm
- BOA:
-
Butterfly optimization algorithm
- NN:
-
Neural network
- DBN:
-
Deep belief network
- L-BOA:
-
Lion-based BOA
- SVM:
-
Support vector machine
- EMR:
-
Electronic medical records
- MCI:
-
Mild cognitive impairment
- PD:
-
Parkinson's disease
- KNN:
-
K-nearest neighbor
- NB:
-
Naive Bayesian
- EPDP:
-
Efficient and privacy-preserving disease risk prediction scheme
- FPR:
-
False positive rate
- LM:
-
Linear method
- RF:
-
Random forest
- DSAE:
-
Deep sparse auto-encoder architecture
- ESD:
-
Energy spectral density
- HRFLM:
-
Hybrid random forest with linear model
- circRNA:
-
CircularRibo nucliec acid
- FNR:
-
False negative rate
- GIP:
-
Gaussian interaction profile
- GWO:
-
Grey Wolf optimizer
- MCC:
-
Matthew’s correlation coefficient
- FDR:
-
False discover rate
- OU:
-
Okamoto-Uchiyama
- CKD:
-
Chronic kidney disease
- NPV:
-
Negative predictive value
- LDA:
-
Linear discriminate analysis
- DT:
-
Decision tree
- WOA:
-
Whale optimization algorithm
- DBN:
-
Deep belief network
- FRQR:
-
Fuzzy rough quick reduct
- RBM:
-
Restricted Boltzmann machine
- QDA:
-
Quadratic discriminate analysis
- RNN:
-
Recurrent neural network
- CD:
-
Contrast divergence
References
Hong W, Xiong Z, Zheng N and Weng Y 2019 A medical-history-based potential disease prediction algorithm; IEEE Access. 7 131094–131101
Haq A Q, Li J P, Memon M H, Khan J, Malik A, Ahmad T, Ali A, Nazir S, Ahad I and Shahid M 2019 Feature selection based on L1-norm support vector machine and effective recognition system for Parkinson’s disease using voice recordings; IEEE Access 7: 37718–37734
Brisimi T S, Xu T, Wang T, Dai W, Adams W G and Paschalidis I C 2018 Predicting chronic disease hospitalizations from electronic health records: an interpretable classification approach; Proc. IEEE. 106 690–707
Abacha A B and Zweigenbaum P 2015 Means: a medical question answering system combining nlp techniques and semantic web technologies; Inform. Process Manag. 51 570–594
Sierra-Sosa D, Garcia-Zapirain M B, Castillo C, Oleagordia I, Nuño-Solinis R, Urtaran-Laresgoiti M, Elmaghraby A 2019 Scalable healthcare assessment for diabetic patients using deep learning on multiple GPUs; IEEE T. Ind. Inform. 15: 5682–5689
Lei H, Huang Z, Zhou F, Elazab A, Tan E-L, Li H, Qin J, Lei B 2019 Parkinson’s disease diagnosis via joint learning from multiple modalities and relations; IEEE J. Biomed. Health. 23: 1437–1449
Tong T, Gao Q, Guerrero R, Ledig C, Chen L, Rueckert D, Alzheimer's Disease Neuroimaging Initiative 2017 A novel grading biomarker for the prediction of conversion from mild cognitive impairment to Alzheimer’s disease; IEEE T Bio-Med. Eng. 64: 155–165
Vijayalaxmi A, Sridevi S, Sridhar N, Ambesange S 2020 Multi-disease prediction with artificial intelligence from core health parameters measured through non-invasive technique, IEEE Access, 1252-1258
Patil P B, Shastry P M and Ashokumar P S 2020 Machine learning based algorithm for risk prediction of cardio vascular disease (Cvd); J. Crit. Rev. 7 836–844
Saranya G and Pravin A 2020 A comprehensive study on disease risk predictions in machine learning; Int. J. Elec. Comput. Eng. 10 4217–4225
Luo J, Ding P, Liang C, Cao B and Chen X 2017 Collective prediction of disease-associated miRNAs based on transduction learning; IEEE ACM T Comput. Biol. 14 1468–1475
Benba A., Jilbab A and Hammouch A 2016 Discriminating between patients with Parkinson’s and neurological diseases using cepstral analysis; IEEE T. Neur. Sys. Reh. 24 1100–1108
Zhao Z, Wang K Y, Wu F, Wang W, Zhang K N, Hu H M, Liu Y W and Jiang T 2018 circRNA disease: a manually curated database of experimentally supported circRNA-disease associations, Cell Death Dis. 9
Alahmari S S, Cherezov D, Goldgof D B, Hall L O, Gillies R J and Schabath M B 2018 Delta radiomics improves pulmonary nodule malignancy prediction in lung cancer screening; IEEE Access. 6 77796–77806
Jadhav A S, Patil P B and Biradar S 2020 Computer-aided diabetic retinopathy diagnostic model using optimal thresholding merged with neural network; Int. J. Intell. Comput, Cybernet
Jadhav A S, Patil P B and Biradar S 2020 Optimal feature selection-based diabetic retinopathy detection using improved rider optimization algorithm enabled with deep learning; Evol, Intel
Tao R, Zhang S, Huang X, Tao M, Ma J, Ma S, Zhang C, Zhang T, Tang F, Lu J, Shen C and Xie X 2019 Magnetocardiography-based ischemic heart disease detection and localization using machine learning methods; IEEE T Bio-Med. Eng. 66: 1658–1667
Zhang J, Li Z, Pu Z and Xu C 2018 Comparing prediction performance for crash injury severity among various machine learning and statistical methods; IEEE Access. 6 60079–60087
Jiang S, Zhu X and Wang L 2015 EPPS: efficient and privacy-preserving personal health information sharing in mobile healthcare social networks; Sensors. 15 22419–22438
Samanthula B K, Elmehdwi Y and Jiang W 2015 k-nearest neighbor classification over semantically secure encrypted relational data; IEEE Trans. Knowl. Data Eng. 27 1261–1273
Zhu H, Liu X, Lu R and Li H 2017 Efficient and privacy-preserving online medical prediagnosis framework using nonlinear SVM; IEEE J. Biomed. Health 21 838–850
Yi W, Park J and Kim J 2020 GeCo: Classification restricted boltzmann machine hardware for on-chip semi-supervised learning and bayesian inference ; IEEE T Neur Net. Learn. 31 53–65
Prakaash A S and Sivakumar K 2020 Optimized Recurrent Neural Network with Fuzzy Classifier for Data Prediction using Hybrid Optimization Algorithm: Scope towards Diverse Applications, Int. J. Wavelets Multi.
Mohan S, Thirumalai C and Srivastava G 2019 Effective heart disease prediction using hybrid machine learning techniques; IEEE Access. 7 81542–81554
Haq A U, Li J P, Memon M H, Nazir S and Sun R 2018 A hybrid intelligent system framework for the prediction of heart disease using machine learning algorithms; Mob. Inf. Syst. 2018: 21–22
Xiao Q, Luo J and Dai J 2019 Computational prediction of human disease- associated circRNAs based on manifold regularization learning framework; IEEE J. Biomed. Health. 23 2661–2669
Yang X, Lu R, Shao J, Tang X and Yang H 2019 An efficient and privacy-preserving disease risk prediction scheme for E-healthcare; IEEE Internet Things. 6 3284–3297
Vásquez-Morales G R, Martínez-Monterrubio S M, Moreno-Ger P and Recio-García J A 2019 Explainable prediction of chronic renal disease in the colombian population using neural networks and case-based reasoning; IEEE Access. 7 152900–152910
Minhas S, Khanum A, Riaz F, Khan S A and Alvi A 2018 Predicting progression from mild cognitive impairment to Alzheimer’s disease using autoregressive modelling of longitudinal and multimodal biomarkers; IEEE J. Biomed. Health. 22 818–825
Escudero J, Ifeachor E, Zajicek J P, Green C, Shearer J and Pearson S 2013 Machine learning-based method for personalized and cost-effective detection of Alzheimer’s disease ; IEEE T Bio-Med Eng. 60 164–168
Arunkumar C and Ramakrishnan S 2019 Prediction of cancer using customised fuzzy rough machine learning approaches; Hlthc. Tech. Le. 6 13–18
Karim A M, Guzel M S, Tolun M R, Kaya H and Celebi F V 2018 A new framework using deep auto-encoder and energy spectral density for medical waveform data classification and processing; Biocybern. Biomed. Eng. 39 1–12
Yilmaz A A, Guzel M S, Bostanci E and Askerzade I 2020 A novel action recognition framework based on deep-learning and genetic algorithms; IEEE Access. 8 1–16
Mirjalili S, Mirjalili S M and Lewis, 2014 A grey wolf optimizer; Adv. Eng. Softw. 69 46–61
Mirjalili S and Lewis A 2016 The whale optimization algorithm; Adv Eng Softw. 95 51–67
Boothalingam R 2018 Optimization using lion algorithm: a biological inspiration from lion’s social behavior; Evol. Intel. 11 31–52
Arora S and Singh S 2018 Butterfly optimization algorithm: a novel approach for global optimization, Soft Comput. 715–734
Beno M. M., Valarmathi I. R., Swamy S. M. and Rajakumar B. R. 2014 Threshold prediction for segmenting tumour from brain MRI scans; Int. J. Imag. Syst. Tech. 24 129–137
Liu Y, Zhou H, Tsung F and Zhang S 2019 Real-time quality monitoring and diagnosis for manufacturing process profiles based on deep belief networks; Comput. Ind. Eng. 136 494–503
Yu S, Tan K K, Sng B L, Li S and Sia A T H 2015 Lumbar Ultrasound Image Feature Extraction and Classification with Support Vector Machine; Ultrasound Med. Biol. 41 2677–2689
Chen Y, Hu X, Fan W, Shen L, Zhang Z, Liu X, Du J, Li H, Chen Y and Li H 2019 Fast density peak clustering for large scale data based on kNN. Knowl-Based Syst.
Fernández-Navarro F, Carbonero-Ruz M, Alonso D B and Torres-Jiménez M 2017 Global Sensitivity estimates for neural network classifiers; IEEE T Neur. Netw. Learn. 28 2592–2604
Preetha N S N, Brammya G, Ramya R, Praveena S, Binu D and Rajakumar B R 2018 Grey wolf optimisation-based feature selection and classification for facial emotion recognition; IET Biom. 7 490–499
Alzheimer dataset, https://www.kaggle.com/hyunseokc/detecting-early-alzheimer-s/data.
Breast cancer dataset, https://archive.ics.uci.edu/ml/datasets/Breast+Cancer+Wisconsin+(Diagnostic).
Dermatology, https://www.kaggle.com/syslogg/dermatology-dataset.
Heart disease, http://archive.ics.uci.edu/ml/datasets/Heart+Disease.
Lung cancer, http://archive.ics.uci.edu/ml/datasets/Lung+Cancer.
Parkinson’s disease, https://archive.ics.uci.edu/ml/datasets/Parkinsons.
Thyroid, https://www.kaggle.com/kumar012/hypothyroid#hypothyroid.csv.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Dubey, A.K. Optimized hybrid learning for multi disease prediction enabled by lion with butterfly optimization algorithm. Sādhanā 46, 63 (2021). https://doi.org/10.1007/s12046-021-01574-8
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s12046-021-01574-8