Skip to main content
Log in

Grammar-based automatic programming for medical data classification: an experimental study

  • Published:
Artificial Intelligence Review Aims and scope Submit manuscript

Abstract

In a computational medical model, diagnosis is the classification of disease status in the terms of abnormal or positive, normal or negative or intermediate stages. Different Machine learning techniques such as artificial neural networks (ANNs) are extensively and successfully used in disease diagnosis. However, there is no single classifier that can solve all classification problems. Selecting an optimal classifier for a problem is difficult, and it has become a relevant subject in the area. This paper focuses on grammar-based automatic programming (GAP) to build optimized discriminant functions for medical data classification in any arbitrary language. These techniques have an implicit power of automatic feature selection and feature extraction. This work carries out an in-depth investigation of the use of different GAP algorithms in the medical data classification problem. The objective is to identify the benefits and limitations of algorithms of this nature in the current problem. Classical classifiers were also considered for comparison purposes. Fourteen medical data sets were used in the experiments, and seven performance measures such as accuracy, sensitivity, specificity, precision, geometric-mean, F-measure, and false-positive rate are used to evaluate the performance of the produced classifier. The multiple criteria decision analysis (MCDA) demonstrates that GAP approaches are able to produce suitable classifiers for a given problem, and the GS performs better than other classical classifiers in medical data classification.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15

Similar content being viewed by others

References

  • Assunçao F, Lourenço N, Machado P, Ribeiro B (2017) Automatic generation of neural networks with structured grammatical evolution. In: 2017 IEEE congress on evolutionary computation (CEC). IEEE, pp 1557–1564

  • Asuncion A, Newman D (2007) Uci machine learning repository. University of California, School of Information and Computer Science, Irvine, CA. http://archive.ics.uci.edu/ml/datasets.php

  • Barandela R, Sánchez JS, Garca V, Rangel E (2003) Strategies for learning in class imbalance problems. Pattern Recogn 36(3):849–851

    Article  Google Scholar 

  • Brameier M, Banzhaf W (2001) A comparison of linear genetic programming and neural networks in medical data mining. IEEE Trans Evolut Comput 5(1):17–26

    Article  Google Scholar 

  • Cerri R, Barros RC, de Carvalho AC, Freitas AA (2013) A grammatical evolution algorithm for generation of hierarchical multi-label classification rules. In: 2013 IEEE congress on evolutionary computation. IEEE, pp 454–461

  • Chareka T, Pillay N (2016) A study of fitness functions for data classification using grammatical evolution. In: 2016 Pattern recognition association of South Africa and robotics and mechatronics international conference (PRASA-RobMech). IEEE, pp. 1–4

  • Contreras, I., Bertachi, A., Biagi, L., Vehí, J., Oviedo, S.: Using grammatical evolution to generate short-term blood glucose prediction models. In: KHD@ IJCAI, pp 91–96

  • Dehuri S, Roy R, Cho SB, Ghosh A (2012) An improved swarm optimized functional link artificial neural network (iso-flann) for classification. J Syst Softw 85(6):1333–1345

    Article  Google Scholar 

  • Elleuch S, Jarboui B (2018) Variable neighborhood programming for evolving discriminent functions with dynamic thresholds. In: 2018 international conference on machine learning and cybernetics (ICMLC), vol 1. IEEE, pp 263–268

  • Espejo PG, Ventura S, Herrera F (2009) A survey on the application of genetic programming to classification. IEEE Trans Syst, Man, Cybern, Part C (Appl Rev) 40(2):121–144

    Article  Google Scholar 

  • Gray H, Maxwell R, Martinez-Perez I, Arus C, Cerdan S (1996) Genetic programming for classification of brain tumours from nuclear magnetic resonance biopsy spectra. Genet Progr 424:1–6

  • Han J, Pei J, Kamber M (2011) Data mining: concepts and techniques. Elsevier, Amsterdam

    MATH  Google Scholar 

  • Hope D, Munday E, Smith S (2007) Evolutionary algorithms in the classification of mammograms. In: 2007 IEEE symposium on computational intelligence in image and signal processing. IEEE, pp 258–265

  • Karaboga D (2005) An idea based on honey bee swarm for numerical optimization. In: Technical Report-TR06, Erciyes University. Engineering Faculty, Computer Engineering Department, Kayseri/Türkiye

  • Kou G, LU Y, Peng Y, Shi Y (2012) Evaluation of classification algorithms using mcdm and rank correlation. Int J Inf Technol Decis Mak 11:197–225. https://doi.org/10.1142/S0219622012500095

  • Lennartsson D, Nordin P (2004) A genetic programming method for the identification of signal peptides and prediction of their cleavage sites. EURASIP J Adv Signal Process 1:153697

    Article  Google Scholar 

  • Lin JY, Ke HR, Chien BC, Yang WP (2008) Classifier design with feature selection and feature extraction using layered genetic programming. Expert Syst Appl 34(2):1384–1393

    Article  Google Scholar 

  • López-Vázquez G, Ornelas-Rodriguez M, Espinal A, Soria-Alcaraz JA, Rojas-Domínguez A, Puga-Soberanes H, Carpio JM (2019) Rostro-Gonzalez H (2019) Evolutionary spiking neural networks for solving supervised classification problems. Comput Intell Neurosci 2019:4182639. https://doi.org/10.1155/2019/4182639

  • Luz EJS, Schwartz WR, Cámara-Chávez G, Menotti D (2016) Ecg-based heartbeat classification for arrhythmia detection: a survey. Comput Methods Progr Biomed 127:144–164

    Article  Google Scholar 

  • Mckay RI, Hoai NX, Whigham PA, Shan Y, O’neill M (2010a) Grammar-based genetic programming: a survey. Genet Progr Evolvable Mach 11(3–4):365–396

  • McKay RI, Hoai NX, Whigham PA, Shan Y, O’Neill M (2010b) Grammar-based genetic programming: a survey. Genet Progr Evolvable Mach 11:365–396

    Article  Google Scholar 

  • Motsinger-Reif AA, Deodhar S, Winham SJ, Hardison NE (2010) Grammatical evolution decision trees for detecting gene-gene interactions. BioData Min 3(1):8

    Article  Google Scholar 

  • Nag K, Pal NR (2015) A multiobjective genetic programming-based ensemble for simultaneous feature selection and classification. IEEE Trans Cybern 46(2):499–510

    Article  Google Scholar 

  • Neill M, Brabazon A (2004) Grammatical swarm. Genet Evolut Comput Conf 1:163–174

    Google Scholar 

  • O’Neill M, Brabazon A (2006) Grammatical swarm: the generation of programs by social programming. Nat Comput 5(4):443–462

    Article  MathSciNet  Google Scholar 

  • O’Neill M, Ryan C (2001) Grammatical evolution. IEEE Trans Evolut Comput 5(4):349–358

    Article  Google Scholar 

  • O’Neill M, Ryan C (2003) Grammatical evolution. In: Grammatical evolution. Springer, US, pp 33–47

  • Paul TK, Iba H (2006) Classification of scleroderma and normal biopsy data and identification of possible biomarkers of the disease. In: 2006 IEEE symposium on computational intelligence and bioinformatics and computational biology. IEEE, pp 1–6

  • Rivero D, Dorado J, Rabuñal J, Pazos A (2010) Generation and simplification of artificial neural networks by means of genetic programming. Neurocomputing 73(16–18):3200–3223

    Article  Google Scholar 

  • Si T (2016) Grammatical evolution using fireworks algorithm. In: Pant M, Deep K, Bansal J, Nagar A, Das K (eds) Proceedings of 5th international conference on soft computing for problem solving. Advances in intelligent systems and computing, vol 436, pp. 43–55. Springer, New York

  • Si T, De A, Bhattacharjee AK (2013) Grammatical bee colony. In: Panigrahi BK, Suganthan PN, Das S, Dash SS (eds) Swarm, evolutionary, and memetic computing. SEMCCO, Lecture Notes in computer science, vol 8297. Springer, New York, pp 436–445

  • Si T, De A, Bhattacharjee AK (2018) Segmentation of brain mri using wavelet transform and grammatical bee colony. J Circuits, Syst Comput 27(07):1850108

    Article  Google Scholar 

  • Si T, Dutta R (2019) Partial opposition-based particle swarm optimizer in artiżcial neural network training for medical data classiżcation. Int J Inf Technol Decis Mak 18(5):1717–1750

    Article  Google Scholar 

  • Si T, Sk S (2016) A comparison of grammatical bee colony and neural networks in medical data mining. Int J Comput Appl 134(6):1–4

    Google Scholar 

  • Tan Y, Zhu Y (2010) Fireworks algorithm for optimization. International conference in swarm intelligence. Springer, New York, pp 355–364

    Google Scholar 

  • Tharwat A (2018) Classification assessment methods. Appl Comput Inf. https://doi.org/10.1016/j.aci.2018.08.003

  • Triantaphyllou E (2000) Multi-criteria decision making methods: a comparative study 44. https://doi.org/10.1007/978-1-4757-3157-6

  • Valencia-Ramírez JM, Raya JA, Cedeno JR, Suárez RR, Escalante HJ, Graff M (2014) Comparison between genetic programming and full model selection on classification problems. In: 2014 IEEE international autumn meeting on power, electronics and computing (ROPEC). IEEE, pp 1–6

  • Winkler S, Affenzeller M, Wagner S (2007) Advanced genetic programming based machine learning. J Math Model Algor 6(3):455–480

    Article  MathSciNet  Google Scholar 

  • Yasodha P, Ananthanarayanan N (2018) Detecting the ovarian cancer using big data analysis with effective model

  • Yeh YC, Wang WJ, Chiou CW (2009) Cardiac arrhythmia diagnosis method using linear discriminant analysis on ecg signals. Measurement 42(5):778–789

    Article  Google Scholar 

  • Zhang M, Wong P (2008) Genetic programming for medical classification: a program simplification approach. Genet Progr Evolvable Mach 9(3):229–255

    Article  Google Scholar 

  • Zhao H (2007) A multi-objective genetic programming approach to developing pareto optimal decision trees. Decis Support Syst 43(3):809–826

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tapas Si.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Si, T., Miranda, P., Galdino, J.V. et al. Grammar-based automatic programming for medical data classification: an experimental study. Artif Intell Rev 54, 4097–4135 (2021). https://doi.org/10.1007/s10462-020-09949-9

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10462-020-09949-9

Keywords

Navigation