The Impact of driver distraction and secondary tasks with and without other co-occurring driving behaviors on the level of road traffic crashes

https://doi.org/10.1016/j.aap.2021.106010Get rights and content

Abstract

Driving safety is typically affected by concurrent non-driving tasks. These activities might negatively impact the trips’ outcome and cause near-crash or crash incidents and accidents. The crashes impose a tremendous social and economic cost to society and might affect the involving individuals’ quality of life. As it stands, road injuries are ranked among top-ten leading causes of death by the World Health Organization. Distracted driving is defined as an attention diversion of the driver toward a competing activity. It was shown in numerous studies that distracted driving increase the probability of near-crash or crash events. By leveraging the statistical power of the large SHRP2 naturalistic data, we are able to quantify the preponderance of specific distractions during daily trips and confirm the causality factor of an ubiquitous non-driving task in the crash event. We show that, except for phone usage which happens more frequently in near-crash and crash categories than in baseline trips, both distracted driving and secondary tasks occur almost uniformly in different types of trips. In this study, we investigate the impact of the co-occurrence of distracted driving with other driving behaviors and secondary tasks. It is found that the co-occurrence of distracted driving with other driving behaviors or secondary tasks increase the chance of near-crash and crash events. This study's findings can inform the design and development of more precise and reliable driving assistance and warning systems.

Introduction

According to NHTSA, 25 percent of the police-reported crashes are due to driver inattention defined as “insufficient or no attention to activities critical for safe driving” (Regan et al., 2011). The most substantial form of driver inattention is distracted driving. Distracted driving is defined as events or activities within or outside the vehicle (Young et al., 2007) that negatively affect a driver's ability to process information that is necessary to operate a vehicle safely (Regan and Hallett, 2011, Regan et al., 2008, Regan et al., 2011). This includes talking on cell phones, texting, eating, drinking, and other non-driving activities, called secondary tasks (Regan and Hallett, 2011). Distracted driving accounts for approximately 16 percent of economic loss and 15 percent of societal harm. In addition, 10 percent of fatal and 18 percent of injury crashes have been reported as “distraction-affected crashes” (Blincoe et al., 2015). These numbers represent driving trips that ended in non-fatal injuries or deaths. However, it is shown that as much as 16.1% of driving time gets affected by inattention (Stutts et al., 2003). Moreover, distracted driving has adverse impacts on traffic operation due to greater fluctuation in speed and significant lane deviation (Stavrinos et al., 2013). Therefore, numerous research studies have focused on the definition, theoretical foundation, formulation, prediction, and prevention of distraction and distracted driving to inform the development of the technological, behavioral, and infrastructure mitigating measures to enhance driving safety.

Classically, studies that focus on distracted driving use different combinations of data collection and analytical approaches. For example, the data used for examining distracted driving may be collected from human-in-the-loop simulation studies for retrospective (Jin et al., 2012, Ameyoe et al., 2015, Stavrinos et al., 2013) and real-time analysis or prediction (Wang et al., 2015, Liang et al., 2007). Another common approach is collecting naturalistic driving data using an instrumented vehicle for retrospective (Dukic et al., 2013, Jenkins et al., 2017, Aksan et al., 2013, Li et al., 2018) or real-time analysis and prediction (Liu et al., 2016, Deshmukh and Dehzangi, 2017, Kircher and Ahlstrom, 2010, Botta et al., 2019). Another approach is adopting qualitative techniques for data collection, such as interviews (Bakiri et al., 2013). Different methods are also used for analysis, modeling, and prediction of distraction, such as driver modeling including perceptual and motor components (Hermannstädter and Yang, 2013, Ameyoe et al., 2015, Li et al., 2018), statistical analysis (Bakiri et al., 2013, Dukic et al., 2013), and machine learning algorithms such as classification and regression (Jin et al., 2012, Liu et al., 2016, Jenkins et al., 2017, Deshmukh and Dehzangi, 2017, Wang et al., 2015, Kircher and Ahlstrom, 2010, Liang et al., 2007, Botta et al., 2019).

An avenue of research on distracted driving now highlights the effects of secondary tasks. According to the Second Strategic Highway Research Program (SHRP 2) Researcher Dictionary for Video Data Reduction (VTTI, 2015), the secondary task is defined as any distraction that includes non-driving related glances away from the direction of vehicle movements such as radio adjustments, seatbelt adjustments, window adjustments, visor adjustment, and other non-critical tasks. It does not include tasks that are critical to the driving, such as speedometer checks, blind spot checks, activating wipers/headlights, and other critical tasks.

As a specific data collection approach gets adopted, different sets of variables get generated. These variables can be grouped into three categories. The first category includes variables related to the driver (such as age and prior experience, and visual, motor, and cognitive capabilities) or variables measured to collect the level of distraction or inattention of driver (such as physiological changes in the driver state, eye movement patterns, and brain activity measures). The second category includes variables collected from the instrumented vehicle or simulator dynamics (such as lateral and longitudinal speed and acceleration, lateral deviation, and steering angle over the course of driving). Then the third category is composed of variables associated with the environment. The latter category characterizes the sources of internal or external distractions such as cell-phone and billboards or time and physical characteristics of the environment, such as traffic signs, the surrounding vehicle dynamics, or road curvature.

The findings of the above studies can be summarized as follows. In the studies conducted to measure the impacts of distracted driving, it is shown that distracted driving significantly and adversely impact the performance of drivers. Besides, distracted drivers experience changes in their physiological and brain state, functionality, and performance. These changes are meaningful enough to be used for prediction purposes and for the design and development of warning systems.

However, the impact of distraction co-occurring with other driving behaviors are studied only in a few studies. One study shows the relationship between driving drowsy and distracted driving (Anderson and Horne, 2013), while another study investigates the distractive effects of cell phone use on safe driving (Unknown, 2003). None of the previously reported studies explicitly considered the data-driven co-occurrence of driving behaviors and secondary tasks to crash risk. In this study, we categorize driving epochs based on their outcome: (i) epochs ending in a crash, (ii) epochs with a near-crash incident (but no crash), and (iii) baseline epochs without any near-crash or crash incidents. We adopt a data-driven approach to identify co-occurring behaviors in a repository of driving behaviors. The data used in this study is collected in a naturalistic driving experiment. The objectives of this study are to identify: frequent driving behaviors and their co-occurrences, frequent secondary tasks and their co-occurrences with driving behaviors, and impacts and frequency of distraction or secondary tasks with and without other driving behaviors among different outcome categories.

To meet the objectives of this study, we mine frequent driving behaviors and secondary tasks in a data set. The association rule mining technique is utilized to identify frequent driving behaviors. This approach has been adopted for mining of co-occurring patterns in other applications in previous studies as well. For example, in Brossette et al. (1998), a data set of health surveillance data is mined to reveal unknown patterns. Or, Abdullah et al. (2008) uses a data set of medical billing data to identify frequent associations between diagnosis codes and treatment procedures. A similar approach is adopted in Shan et al. (2008), Kareem et al. (2017) for the identification of suspicious claims and potentially fraudulent individuals from billing records. Other applications are prediction and forecasting of cardiovascular diseases and heart attacks (Ordonez, 2006, Jabbar et al., 2011, Khare and Gupta, 2016), location-wise and time-wise mining of frequent diseases (Ilayaraja and Meyyappan, 2013), identification of associations among environmental exposure to different chemical compounds and adverse health outcomes (Bell and Edwards, 2014), and identification of patterns which can inform the diagnosis of asthma in pediatric using a sequential version of this method (Campbell et al., 2020). For a survey on the applications of association rule mining techniques in healthcare applications, please refer to Altaf et al. (2017).

Using the association rule mining technique, this study aims to identify frequent co-occurring behaviors. In other words, we would like to investigate and identify the non-driving behaviors, more specifically, distraction, and related secondary tasks, that are commonly observed in different types of epochs. At the same time, their co-occurrences are more frequent in near-crash or crash epochs. Overlooking potential differences in impacts of behaviors when they occur individually or co-occur with other behaviors might result in higher false positive and false negative rates. Therefore, identifying such sets of behaviors can inform more accurate predictions of epochs’ outcomes and, consequently, the design and development of more reliable warning systems.

Section snippets

The SHRP 2 dataset

This study uses a naturalistic driving data set collected under the Second Strategic Highway Research Program (SHRP 2) conducted by the Virginia Transportation Technology Institute (VTTI) (Hankey et al., 2016, Transportation Research Board of the National Academy of Sciences, 2013). The program involves multiple partner organizations such as the Federal Highway Administration (FHWA), the American Association of State Highway and Transportation Officials (AASHTO), and Transportation Research

Results

The association rule mining technique adopted to mine frequent driving behaviors and secondary tasks requires a user-specified threshold (intra-supp threshold). This threshold is used to identify frequent driving behaviors and secondary tasks. In this study, we use the support threshold of 0.5% to consider a set of behaviors or secondary tasks frequent. It means that a set of behaviors or tasks is deemed frequent if this set is observed in at least 0.5% of the epochs. Because we consider 5,529

Discussion

The results of the adopted approach can be summarized as follows. Table 3 shows that distracted driving is the most frequent behavior. This behavior is observed in almost half of the epochs of different outcome categories. The total number of epochs in each category is 1,843. Table 3 shows that distracted driving is happening in about 47%, 50%, and 39% of baseline, near-crash, and crash epochs, respectively. Fig. 2 visualizes the normalized values for these percentages.

Although not as frequent

Conclusion and future works

In this work, we studied the frequent individual and co-occurring behaviors and secondary tasks using a data set created based on a naturalistic driving experiment. The findings show that all the 13 behavior and seven secondary task categories are individually frequent. However, only some of their combinations might be frequent. Among all the 13 driving behaviors, distracted driving is the most frequent behavior. Also, it co-occurs with other behaviors more often than other behaviors.

CRediT Author statement

Ali Jazayeri: Conceptualization, Methodology, Software, Data Curation, Visualization, Writing – Original Draft

John Ray B. Martinez: Software, Data Curation, Visualization, Writing – Original Draft

Helen Loeb: Supervision, Writing – Review & Editing, Funding acquisition

Christopher C. Yang: Conceptualization, Methodology, Supervision, Writing – Review & Editing, Project administration, Funding acquisition

Acknowledgments

This work was supported in part by the National Science Foundation under the GrantNSF-1741306,IIS-1650531, andDIBBs-1443019. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation or the views of the VTTI, SHRP 2, the Transportation Research Board, or the National Academy of Sciences.

Conflict of interest: The authors declare no conflict of interest.

References (45)

  • Umair Abdullah et al.

    Analysis of effectiveness of apriori algorithm in medical billing data mining

    2008 4th International Conference on Emerging Technologies

    (2008)
  • Rakesh Agrawal et al.

    Fast Algorithms for Mining Association Rules in Large Databases

    Proceedings of the 20th International Conference on Very Large Data Bases (VLDB ’94)

    (1994)
  • Nazan Aksan et al.

    Naturalistic Distraction and Driving Safety in Older DriversHumandistraction and driving safety in older drivers

    Factors

    (2013)
  • Wasif Altaf et al.

    Applications of association rule mining in health informatics: a survey

    Artificial Intelligence Review. Intell. Rev.

    (2017)
  • Ablamvi Ameyoe et al.

    Estimation of Driver Distraction Using the Prediction Error of a Cybernetic Driver Model

    (2015)
  • Shannon M. Bell et al.

    Building Associations between Markers of Environmental Stressors and Adverse Human Health Impacts Using Frequent Itemset Mining

    (2014)
  • Lawrence Blincoe et al.

    The Economic and Societal Impact of Motor Vehicle Crashes, 2010 (Revised)

    (2015)
  • Marco Botta et al.

    Real-time detection of driver distraction: random projections for pseudo-inversion-based neural training

    Knowledge and Information Systems. Inf. Syst.

    (2019)
  • Stephen E. Brossette et al.

    Association Rules and Data Mining in Hospital Infection Control and Public Health SurveillanceJournal of the American Medical Informatics Associationrules and data mining in hospital infection control and public health surveillance

    J. Am. Med. Inform. Assoc.

    (1998)
  • Elizabeth A. Campbell et al.

    Temporal condition pattern mining in large, sparse electronic health record data: A case study in characterizing pediatric asthma

    Journal of the American Medical Informatics Association. Am. Med. Inform. Assoc.

    (2020)
  • Shantanu Deshmukh et al.

    Identification of real-time driver distraction using optimal subband detection powered by Wavelet Packet Transform

    2017 IEEE 14th International Conference on Wearable and Implantable Body Sensor Networks (BSN)

    (2017)
  • Thomas A. Dingus et al.

    Driver crash risk factors and prevalence evaluation using naturalistic driving data

    Proceedings of the National Academy of Sciences. Natl. Acad. Sci.

    (2016)
  • Cited by (18)

    • Optimal design of a driver assistance controller based on surrounding vehicle's social behavior game model

      2023, Applied Mathematical Modelling
      Citation Excerpt :

      However, the behavioral goals of human drivers are actually difficult to be represented by a set of simple functional expressions, and the personalization of human drivers makes it impractical to construct a unified functional form [50]. Besides, there are many stochastic influencing factors during the vehicle interactions in traffic, such as driver’s emotion [51] and degree of distraction [52]. In future studies, there are possibilities for the determination of the human driver’s internal cost function using the machine learning methods (e.g., inverse reinforcement learning [53]) based on the drivers daily driving data (e.g., environment and control signal information).

    • A user study of directional tactile and auditory user interfaces for take-over requests in conditionally automated vehicles

      2022, Accident Analysis and Prevention
      Citation Excerpt :

      Melnicuk et al. introduced a novel method to determine stabilisation time as an indicator of a TO quality based on driver’s cognitive load (Melnicuk et al., 2021). Although You et al. observed longer TOTs with drivers engaged in a visual secondary task during the automated mode (You et al., 2018), many studies state that the type of a secondary task (e.g. reading news, watching a movie, making a phone call) does not affect the TOT (Petermeijer et al., 2017; Zeeb et al., 2016; Wu et al., 2020; Jazayeri et al., 2021). However, the overall quality of a TO is deteriorated for distracted drivers, which is manifested in larger lane deviations (Zeeb et al., 2016).

    View all citing articles on Scopus
    View full text