Skip to main content

Advertisement

Log in

Extraction of decision rules using genetic algorithms and simulated annealing for prediction of severity of traffic accidents by motorcyclists

  • Original Research
  • Published:
Journal of Ambient Intelligence and Humanized Computing Aims and scope Submit manuscript

Abstract

The objective of this study is to analysis of accident of motorcyclists on Bogotá roads in Colombia. For detection of conditions related to crashes and their severity, the proposed model develops the strategies to enhance road safety. In this context, data mining and machine learning techniques are used to investigate 34,232 accidents by motorcyclists during January 2013 to February 2018. Both the Genetic algorithm and simulated annealing are applied in conjunction with mining rules (support, confidence, lift, and comprehensibility) as per objectives of the problem. The application of a hybrid algorithm allows for the creation and definition of optimal hierarchical decision rules for the prediction of the severity of motorcycle traffic accidents. The proposed method yields good results in the metrics of recall (90.07%), precision (89.87%), and accuracy (90.06%) on the data set. The results increase the prediction by 20–21% in comparisons with the following methods: Decision Trees (CART, ID3, and C4.5), Support Vector Machines (SVMs), K-Nearest Neighbor (KNN), Naive Bayes, Neural Networks, Random Forest, and Random Tree. The proposed method defines 11 rules for the prediction of accidents with material damage, 24 rules with injuries, and 12 rules with fatalities. The variables with the most recurrence in the definition of rules are time, weather and road conditions, and the number of victims involved in the accidents. Finally, the interactions of the conditions and characteristics presented in motorcycle accidents are analyzed which contribute to the definition of countermeasures for road safety.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

References

  • Abdelwahab H, Abdel-Aty M (2001) Development of artificial neural network models to predict driver injury severity in traffic accidents at signalized intersections. Transp Res Rec J Transp Res Board 1746:6–13

    Article  Google Scholar 

  • Abdul Manan MM, Várhelyi A (2015) Motorcyclists’ road safety related behavior at access points on primary roads in Malaysia—a case study, (in English). Saf Sci 77:80–94

    Article  Google Scholar 

  • Abdul Manan MM, Ho JS, Syed Tajul Arif STM, Abdul Ghani MR, Várhelyi A (2017) Factors associated with motorcyclists’ speed behaviour on Malaysian roads, (in English). Transp Res Part F Traffic Psychol Behav 50:109–127

    Article  Google Scholar 

  • Abedi L, Sadeghi-Bazargani H (2017) Epidemiological patterns and risk factors of motorcycle injuries in Iran and Eastern Mediterranean Region countries: a systematic review, (in English). Int J Inj Control Saf Promot 24(2):263–270

    Article  Google Scholar 

  • Agrawal R, Imieliński T, Swami A (1993) Mining association rules between sets of items in large databases. ACM Sigmod Rec 22(2):207–216

    Article  Google Scholar 

  • Aha DW, Kibler D, Albert MK (1991) Instance-based learning algorithms. Mach Learn 6(1):37–66

    Google Scholar 

  • Ali M, Son D-H, Kang S-H, Nam S-R (2017) An accurate CT saturation classification using a deep learning approach based on unsupervised feature extraction and supervised fine-tuning strategy. Energies 10(11):1830

    Article  Google Scholar 

  • Araujo M, Illanes E, Chapman E, Rodrigues E (2017) Effectiveness of interventions to prevent motorcycle injuries: systematic review of the literature (in English). Int J Inj Control Saf Promot Rev 24(3):406–422

    Article  Google Scholar 

  • Breiman L (2001) Random forests. Mach Learn 45(1):5–32

    Article  MATH  Google Scholar 

  • Breiman L, Friedman JH, Olshen RA, Stone CJ (1984) Classification and regression trees, the wadsworth statistics and probability series. Wadsworth International Group, Belmont California, p 356

    Google Scholar 

  • C. d. B. (CB) (2017) El 53% de la malla vial local se encuentra en mal estado

  • Chang L-Y, Wang H-W (2006) Analysis of traffic injury severity: an application of non-parametric classification tree techniques. Accid Anal Prev 38(5):1019–1027

    Article  Google Scholar 

  • Cheng W, Gill GS, Sakrani T, Dasu M, Zhou J (2017) Predicting motorcycle crash injury severity using weather data and alternative Bayesian multivariate crash frequency models, (in English). Accid Anal Prev 108:172–180

    Article  Google Scholar 

  • Das S, Maurya AK (2018) Modelling of motorised two-wheelers: a review of the literature, (in English). Transp Rev 38(2):209–231

    Article  Google Scholar 

  • Das S, Dutta A, Dixon K, Minjares-Kyle L, Gillette G (2018) Using deep learning in severity analysis of at-fault motorcycle rider crashes. Transp Res Rec 1:0361198118797212

    Google Scholar 

  • de Oña J, Mujalli RO, Calvo FJ (2011) Analysis of traffic accident injury severity on Spanish rural highways using Bayesian networks. Accid Anal Prev 43(1):402–411

    Article  Google Scholar 

  • Deb K, Pratap A, Agarwal S, Meyarivan T (2002) A fast and elitist multiobjective genetic algorithm: NSGA-II. Evol Comput IEEE Trans 6(2):182–197

    Article  Google Scholar 

  • Delen D, Sharda R, Bessonov M (2006) Identifying significant predictors of injury severity in traffic accidents using a series of artificial neural networks. Accid Anal Prev 38(3):434–444

    Article  Google Scholar 

  • Delen D, Tomak L, Topuz K, Eryarsoy E (2017) Investigating injury severity risk factors in automobile crashes with predictive analytics and sensitivity analysis methods. J Transp Health 4:118–131

    Article  Google Scholar 

  • Elassad ZEA, Mousannif H, Al Moatassime H, Karkouch A (2020) The application of machine learning techniques for driving behavior analysis: a conceptual framework and a systematic literature review. Eng Appl Artif Intell 87:103312

    Article  Google Scholar 

  • Gabauer DJ, Li X (2015) Influence of horizontally curved roadway section characteristics on motorcycle-to-barrier crash frequency, (in English). Accid Anal Prev 77:105–112

    Article  Google Scholar 

  • Geedipally SR, Turner PA, Patil S (2011) Analysis of motorcycle crashes in texas with multinomial logit model, (in English). Transp Res Rec 2265:62–69

    Article  Google Scholar 

  • Haque MM, Chin HC (2010) Right-angle crash vulnerability of motorcycles at signalized intersections: mixed logit analysis. Transp Res Rec 1:82–90

    Article  Google Scholar 

  • Harnen S, Umar RSR, Wong SV, Wan Hashim WI (2003) Predictive model for motorcycle accidents at three-legged priority junctions. Traffic Inj Prev 4(4):363–369

    Article  Google Scholar 

  • Hashmienejad SH-A, Hasheminejad SMH (2017) Traffic accident severity prediction using a novel multi-objective genetic algorithm. Int J Crashworthiness 22(4):425–440

    Article  Google Scholar 

  • Holland JH (1975) Adaptation in natural and artificial systems. University of Michigan Press, Michigan

    Google Scholar 

  • Hoos HH, Stützle T (2004) Stochastic local search: foundations and applications. Elsevier, Amsterdam

    MATH  Google Scholar 

  • Huang H, Abdel-Aty M (2010) Multilevel data and Bayesian analysis in traffic safety. Accid Anal Prev 42(6):1556–1565

    Article  Google Scholar 

  • Huang H, Chin HC, Haque MM (2008) Severity of driver injury and vehicle damage in traffic crashes at intersections: a Bayesian hierarchical analysis. Accid Anal Prev 40(1):45–54

    Article  Google Scholar 

  • Huysmans J, Dejaeger K, Mues C, Vanthienen J, Baesens B (2011) An empirical evaluation of the comprehensibility of decision table, tree and rule based predictive models. Decis Support Syst 51(1):141–154

    Article  Google Scholar 

  • I. C. V. e. L.-. IDU (2016) Movilidad, estado de malla vial arterial, local e intermedia. Retrieved from http://www.bogotacomovamos.org/interactivo-como-vamos-en-localidades-2016/. Accessed Nov 2018

  • I. f. H. M. a. E.-. IHME (2018) Global burden of disease (GBD). Available from: http://www.healthdata.org/gbd). Accessed Nov 2018

  • I. N. d. M. L. y. C. F. INMLCF (2017) Forensis, datos para la vida. Colombia

  • IDEAM (2017) Precipitación mensual por año para Bogota. Retrieved from http://www.ideam.gov.co/web/tiempo-y-clima/seguimiento. Accessed Nov 2018

  • Jafari SA, Jahandideh S, Jahandideh M, Asadabadi EB (2015) Prediction of road traffic death rate using neural networks optimised by genetic algorithm. Int J Inj Control Saf Promot 22(2):153–157

    Article  Google Scholar 

  • Jensupakarn A, Kanitpong K (2018) Influences of motorcycle rider and driver characteristics and road environment on red light running behavior at signalized intersections, (in English). Accid Anal Prev 113:317–324

    Article  Google Scholar 

  • Jimenez A, Bocarejo JP, Zarama R, Yerpez J (2015) A case study analysis to examine motorcycle crashes in Bogota, Colombia, (in English). J Saf Res Artic 52:29–38

    Article  Google Scholar 

  • John GH, Langley P (1995) Estimating continuous distributions in Bayesian classifiers. In: Proceedings of the Eleventh conference on Uncertainty in artificial intelligence. Morgan Kaufmann Publishers Inc. pp. 338–345

  • Johnson DS, Aragon CR, McGeoch LA, Schevon C (1989) Optimization by simulated annealing: an experimental evaluation; part I, graph partitioning. Operations research 37(6):865–892

    Article  MATH  Google Scholar 

  • Jung S, Xiao Q, Yoon Y (2013) Evaluation of motorcycle safety strategies using the severity of injuries, (in English). Accid Anal Prev 59:357–364

    Article  Google Scholar 

  • Kanesalingam S, Nayak R (2020) Review of literature: motorcycle helmet. Sustainable phase change and polymeric water absorbent materials. Springer, Berlin, pp 7–61

    Chapter  Google Scholar 

  • Kashani AT, Mohaymany AS (2011) Analysis of the traffic injury severity on two-lane, two-way rural roads based on classification tree models. Saf Sci 49(10):1314–1320

    Article  Google Scholar 

  • Kashani AT, Rabieyan R, Besharati MM (2014) A data mining approach to investigate the factors influencing the crash severity of motorcycle pillion passengers, (in English). J Saf Res 51:93–98

    Article  Google Scholar 

  • Keerthi SS, Shevade SK, Bhattacharyya C, Murthy KRK (2001) Improvements to platt’s SMO algorithm for SVM classifier design. Neural Comput 13(3):637–649

    Article  MATH  Google Scholar 

  • Kirkpatrick S, Gelatt CD, Vecchi MP (1983) Optimization by simulated annealing. Science 220(4598):671–680

    Article  MathSciNet  MATH  Google Scholar 

  • Kumar V, Sarkar B, Sharma AN, Mittal M (2019) New product launching with pricing, free replacement, rework, and warranty policies via genetic algorithmic approach. Int J Comput Intell Syst 12(2):519–529

    Article  Google Scholar 

  • Li X, Lord D, Zhang Y, Xie Y (2008) Predicting motor vehicle crashes using support vector machine models. Accid Anal Prev 40(4):1611–1618

    Article  Google Scholar 

  • Li Z, Liu P, Wang W, Xu C (2012) Using support vector machine models for crash injury severity analysis. Accid Anal Prev 45:478–486

    Article  Google Scholar 

  • Li Y, Ma D, Zhu M, Zeng Z, Wang Y (2018) Identification of significant factors in fatal-injury highway crashes using genetic algorithm and neural network. Accid Anal Prev 111:354–363

    Article  Google Scholar 

  • Li Y, Yang L, Yang B, Wang N, Wu T (2019) Application of interpretable machine learning models for the intelligent decision. Neurocomputing 333:273–283

    Article  Google Scholar 

  • Lin M-R, Kraus JF (2009) A review of risk factors and patterns of motorcycle injuries. Accid Anal Prev 41(4):710–722

    Article  Google Scholar 

  • Mannering F (2018) Temporal instability and the analysis of highway accident data. Anal Methods Acid Res 17:1–13

    Google Scholar 

  • Martín L, Baena L, Garach L, López G, de Oña J (2014) Using data mining techniques to road safety improvement in Spanish roads. Procedia Soc Behav Sci 160:607–614

    Article  Google Scholar 

  • Moghaddam FR, Afandizadeh S, Ziyadi M (2011) Prediction of accident severity using artificial neural networks. Int J Civ Eng 9(1):41

    Google Scholar 

  • Montella A, Aria M, D’Ambrosio A, Mauriello F (2012) Analysis of powered two-wheeler crashes in Italy by classification trees and rules discovery, (in English). Accid Analy Prev 49:58–72

    Article  Google Scholar 

  • Nguyen-Phuoc DQ, Oviedo-Trespalacios O, Su DN, De Gruyter C, Nguyen T (2020a) Mobile phone use among car drivers and motorcycle riders: the effect of problematic mobile phone use, attitudes, beliefs and perceived risk. Accid Anal Prev 143:105592

    Article  Google Scholar 

  • Nguyen-Phuoc DQ, Oviedo-Trespalacios O, Nguyen T, Su DN (2020b) The effects of unhealthy lifestyle behaviours on risky riding behaviours–a study on app-based motorcycle taxi riders in Vietnam. J Transp Health 16:100666

    Article  Google Scholar 

  • ONSV (2019a) National road safety observatory. Statistical bulletins, deceased victims, injured by INMLCF in transit events in Colombia—National, Departmental and Municipal Data 2012–2018. Available: https://ansv.gov.co/observatorio/?op=Contenidos&sec=63&page=20. Accessed Mar 2019

  • ONSV (2019b) National road safety observatory. National automotive registry. Available: https://ansv.gov.co/observatorio/?op=Contenidos&sec=64. Accessed Mar 2019

  • Ospina-Mateus H, Quintana Jiménez LA (2019) Understanding the impact of physical fatigue and postural comfort experienced during motorcycling: a systematic review. J Transp Health 12:290–318

    Article  Google Scholar 

  • Ospina-Mateus H, Quintana Jiménez LA, Lopez-Valdes FJ, Salas-Navarro K (2019) Bibliometric analysis in motorcycle accident research: a global overview. Scientometrics 121(2):793–815

    Article  Google Scholar 

  • Ospina-Mateus H, Quintana Jiménez LA, Lopez-Valdes FJ (2020) Understanding motorcyclist-related accidents in Colombia. Int J Inj Control Saf Promot 27(2):215–231

    Article  Google Scholar 

  • Pai CW (2009) Motorcyclist injury severity in angle crashes at T-junctions: identifying significant factors and analysing what made motorists fail to yield to motorcycles, (in English). Saf Sci 47(8):1097–1106

    Article  Google Scholar 

  • Pai CW, Hwang KP, Saleh W (2009) A mixed logit analysis of motorists’ right-of-way violation in motorcycle accidents at priority T-junctions, (in English). Accid Anal Prev 41(3):565–573

    Article  Google Scholar 

  • Perez-Fuster P, Rodrigo MF, Ballestar ML, Sanmartin J (2013) Modeling offenses among motorcyclists involved in crashes in Spain, (in English). Accid Anal Prev 56:95–102

    Article  Google Scholar 

  • Platt JC (1999) 12 fast training of support vector machines using sequential minimal optimization. Adv Kernel Methods 1:185–208

    Google Scholar 

  • Quddus MA, Chin HC, Wang J (2001) Motorcycle crash prediction model for signalised intersections. In: Seventh international conference on urban transport and the environment for the 21st century, URBAN TRANSPORT VII. Vol. 8. Sucharov LJ, Brebbia CA (Eds.). Lemnos, 2001, pp. 609–617

  • Quddus MA, Noland RB, Chin HC (2002) An analysis of motorcycle injury and vehicle damage severity using ordered probit models, (in English). J Saf Res 33(4):445–462

    Article  Google Scholar 

  • Quinlan JR (1986) Induction of decision trees. Mach Learn 1(1):81–106

    Google Scholar 

  • Quinlan JR (1993) C4. 5: programming for machine learning. Morgan Kauffmann 38:48

    Google Scholar 

  • Rezapour M, Nazneen S, Ksaibati K (2020) Application of deep learning techniques in predicting motorcycle crash severity. Eng Rep 1:e12175

    Google Scholar 

  • S. D. d. M. d. B. (SDMB) (2018) Accident records Bogota—20013–2018

  • Sameen MI, Pradhan B (2017) Severity prediction of traffic accidents with recurrent neural networks. Appl Sci 7(6):476

    Article  Google Scholar 

  • Santosa B, Damayanti R, Sarkar B (2016) Solving multi-product inventory ship routing with a heterogeneous fleet model using a hybrid cross entropy-genetic algorithm: a case study in Indonesia. Produc Manuf Res 4(1):90–113

    Google Scholar 

  • Savolainen P, Mannering F (2007) Probabilistic models of motorcyclists’ injury severities in single- and multi-vehicle crashes, (in English). Accid Anal Prev 39(5):955–963

    Article  Google Scholar 

  • Shankar V, Mannering F (1996) An exploratory multinomial logit analysis of single-vehicle motorcycle accident severity, (in English). J Saf Res 27(3):183–194

    Article  Google Scholar 

  • Sohn S, Shin H (2001) Data mining for road traffic accident type classification. Ergonomics 44:107–117

    Article  Google Scholar 

  • Taamneh M, Alkheder S, Taamneh S (2017) Data-mining techniques for traffic accident modeling and prediction in the United Arab Emirates. J Transp Saf Secur 9(2):146–166

    Google Scholar 

  • Teoh ER, Campbell M (2010) Role of motorcycle type in fatal motorcycle crashes, (in English). J Saf Res 41(6):507–512

    Article  Google Scholar 

  • Theofilatos A, Yannis G (2015) A review of powered-two-wheeler behaviour and safety. Int J Inj Control Saf Promot 22(4):284–307

    Article  Google Scholar 

  • Theofilatos A, Ziakopoulos A (2018) Examining injury severity of moped and motorcycle occupants with real-time traffic and weather data, (in English). J Transp Eng Part A Syst 144(11):04018066

    Article  Google Scholar 

  • Truong LT, Nguyen HTT, Tay R (2020) A random parameter logistic model of fatigue-related motorcycle crash involvement in Hanoi, Vietnam. Accid Anal Prev 144:105627

    Article  Google Scholar 

  • van Haperen W, Riaz MS, Daniels S, Saunier N, Brijs T, Wets G (2019) Observing the observation of (vulnerable) road user behaviour and traffic safety: a scoping review. Accid Anal Prev 123:211–221

    Article  Google Scholar 

  • Wang X-W, Jiang Y-M (2011) Analysis and improvement of ID3 decision tree algorithm. Comput Eng Des 9:1

    Google Scholar 

  • Wedagama DMP (2010) Estimating the influence of accident related factors on motorcycle fatal accidents using logistic regression (case study: Denpasar-Bali). Civ Eng Dimens 12(2):106–112

    Google Scholar 

  • Weiss SM, Indurkhya N (1998) Predictive data mining: a practical guide. Morgan Kaufmann

  • WHO (2017) WHO, Powered two- and three-wheelers safety: a road safety manual for decision-makers and practitioners, 2017. [Online]. Available: https://apps.who.int/iris/bitstream/handle/10665/272757/9789243511924-spa.pdf?sequence=1&isAllowed=y. Accessed Nov 2018

  • WHO (2018) WHO, Global status report on road safety 2018. World Health Organization, Geneva. [Online]. Available: https://apps.who.int/iris/bitstream/handle/10665/276462/9789241565684-eng.pdf?ua=1. Accessed Nov 2018

  • Witten IH, Frank E, Hall MA, Pal CJ (2016) Data mining: practical machine learning tools and techniques. Morgan Kaufmann

  • Xi J, Gao Z, Niu S, Ding T, Ning G (2013) A hybrid algorithm of traffic accident data mining on cause analysis. Math Prob Eng 2013:1

    Google Scholar 

  • Zheng M et al (2019) Traffic accident’s severity prediction: A deep-learning approach-based CNN network. IEEE Access 7:39897–39910

    Article  Google Scholar 

Download references

Funding

The study is not funded by any agency.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shib Sankar Sana.

Ethics declarations

Conflict of interest

The authors do hereby declare that there is no conflict of interest in other works regarding the publication of this paper.

Ethical approval

This article does not contain any studies with human participants or animals performed by any of the authors.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendix 1

Algorithm 1: Management approach for fast non-dominated (P), (NSGA-II).

Input: parameters (population, chromosome, operators, generations).

Step 1. Create the initial individuals (population size, N) using the random creation of the chromosomes.

Step 2. Applying crossover and mutation operators.

Step 3. Compiling the population of size 2 N (parents and offspring).

Step 4. Fitness assessment of each chromosome.

Step 5. Calculation and rank in fronts of non-dominance to the population (2 N).

Step 6. Calculation of crowding of each chromosome.

Step 7. Creation of the offspring population (N) with the best ranges of non-dominance and crowding distance.

Step 8. Repeat steps 2–7 until the stop criterion is satisfied or the defined number of generations is reached.

The crowding distance is calculated as the sum of the values of all the distances of the elements that correspond to each objective. To calculate the distance in a non-dominated set (i), the following equation is used:

$${d}_{i}=\sum_{m=1}^{M}\left|\frac{{f}_{m}^{({I}_{i+1}^{m})}- {f}_{m}^{({I}_{i-1}^{m})}}{{f}_{m}^{(max)}- {f}_{m}^{(min)}}\right|$$
(9)

where I (m) is the vector that indicates the neighboring alternative solution to alternative I; \({f}_{m}^{(max)}\) and \({f}_{m}^{(min)}\) are the maximum and minimum values in the solution space of the objective function m, respectively; and M is the number of objective functions.

Appendix 2

Algorithm 2. Simulated Annealing algorithm.

Step 1: Start.

Step 2: Choose an initial solution S;

Step 3: Select the initial temperature and the final temperature T 0 , and T f  > 0;

Step 4: While, as the termination condition is not satisfied, do Generate a neighbor S′ of S.

Step 5: If, S' satisfies the acceptance criterion S = S'.

Step 6: Return T according to the criterion of decrease.

Note: The algorithm can take T as the number of search steps.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ospina-Mateus, H., Quintana Jiménez, L.A., Lopez-Valdes, F.J. et al. Extraction of decision rules using genetic algorithms and simulated annealing for prediction of severity of traffic accidents by motorcyclists. J Ambient Intell Human Comput 12, 10051–10072 (2021). https://doi.org/10.1007/s12652-020-02759-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12652-020-02759-5

Keywords

Navigation