Evaluating temporal variability of exogenous variable impacts over 25 years: An application of scaled generalized ordered logit model for driver injury severity

https://doi.org/10.1016/j.amar.2018.09.001Get rights and content

Highlights

  • We examine changes in the impact of exogenous variables on driver injury severity.

  • We consider scaled and mixed versions of generalized ordered logit model.

  • We demonstrate our model by using General Estimates System (GES) crash database.

  • We consider data for a span of 25-year from 1989 through 2014 in 5-year increments.

Abstract

The current study undertakes a unique research effort to quantify the impact of various exogenous factors on crash severity over time. Specifically, we examine if over time, the impact of exogenous variables has changed and if so what is the magnitude of the change. The research contributes to driver injury severity analysis both methodologically and empirically by proposing a framework that addresses the challenges associated with pooled (or pseudo-panel) data. For our analysis, we draw data from the General Estimates System (GES) over a span of twenty-five years. The data is compiled for driver injury severity in single or two vehicle crashes from 1989 through 2014 in 5-year increments (1989, 1994, 1999, 2004, 2009 and 2014). The alternative econometric frameworks considered for the analysis include ordered logit, generalized ordered logit, scaled generalized ordered logit and mixed generalized ordered logit models. A host of comparison metrics are computed to evaluate the performance of these alternative models in examining the pooled data. The model development exercise is conducted with a host of exogenous variables including driver characteristics, vehicle characteristics, roadway attributes, environmental factors, crash characteristics and temporal attributes. The model estimation results are further augmented by performing a detailed policy scenario analysis, probability profile representations and elasticity effects for different driving and situational conditions across different years.

Introduction

The negative consequences of road traffic crashes have a significant impact on the emotional and financial well-being of the society. In the United States in 2015, roadway crashes were responsible for over 35,000 fatalities, while about 2.35 million individuals were injured or disabled. Financially, road traffic crashes cost the US nearly 230 billion annually. Given the magnitude of consequences of road traffic crashes, it is not surprising that road safety is a well-researched field. Earlier research has examined factors affecting crash occurrence using crash frequency models (see Yasmin and Eluru, 2016 for a review) and crash consequence in the event of a crash using crash severity models (see Yasmin and Eluru, 2013 for a review). The current study contributes to road safety literature by focusing on crash severity models. Earlier literature on severity modeling has identified several factors that significantly influence severity of vehicle occupants involved in traffic crashes including vehicle occupant age, restraint system use, driving under the influence of alcohol or drugs, vehicle age and type (such as sedan, van or pickup truck) of the vehicles involved, and collision type (such as head-on, rear-end and angle).

To be sure, road safety in the US has improved over the years. The number of traffic fatalities have reduced from about 47,000 in 1965 to about 35,000 in 2015 (NHTSA, 2016). The fatality rate per 100 million vehicle miles travelled has dropped from 5.3 to 1.12 during the same period. The improvement in traffic crash associated fatalities can chiefly be attributed to: (1) design and enforcement of several policies such as mandatory seat belt use, vehicle regulations requiring airbags, child rear facing seats, (2) advances in vehicle technology to improve occupant safety and (3) concerted effort dedicated to educational awareness campaigns for different driver age groups to encourage safe driving behavior. However, in recent years (since 2000) instances of increase in the number of traffic fatalities compared to the previous year have occurred multiple times. Thus, in spite of the significant progress made over the years, there is further scope for improving road safety.

The current study undertakes a unique research effort to quantify the impact of various exogenous factors on crash severity over time (see Mannering, 2018 for discussion on temporal instability). Specifically, we examine if over time, the impact of exogenous variables has changed and if so what is the magnitude of the change. For example, the injury severity of a vehicle occupant in a crash that occurred in 1995 was a function of the vehicle safety features available at the time. An observationally identical crash occurring in 2015 is likely to result in either the same level of injury severity or lower. This is an example of how improvement in vehicle technology has affected injury severity. In our study, we systematically attempt to identify exogenous variables that offer time-varying effects and quantify the change in their impact. For this purpose, we draw data from the General Estimates System (GES), over a span of twenty-five years. The data is compiled for driver injury severity in single or two passenger vehicle involved crashes from 1989 through 2014 in 5-year increments (1989, 1994, 1999, 2004, 2009 and 2014). In our analysis, injury severity is classified in four levels as follows: no injury, possible injury, non-incapacitating injury, and incapacitating/fatal injury.

The data compiled is a pooled dataset obtained from stitching together 6 cross-sectional datasets providing us with a pseudo-panel data. Such data pooling of different observations across multiple years offers unique methodological challenges. The modeling methodology should recognize the differences across multiple time points adequately since the outcome process for the observations in a year might be influenced by various observed and unobserved attributes (Anowar et al., 2016, Train, 2009). For illustrative purposes, consider the possibility that airbags were made mandatory in vehicles only in 2000. The data compiled after this time period would possibly experience lower injury severity for crashes relative to the years prior due to the installation of airbags. This is an example of an observed attribute affecting either one or more cross-sections of data in the overall pooled dataset. On the other hand, consider the possibility of a cultural phenomenon that encourages good driving behavior – such as wearing a seatbelt – happened between 1995 and 2000. As a result, more drivers wear seatbelts after 2000, thus potentially reducing injury severity consequences for cross-sections compiled later. However, parsing the true impact of the cultural phenomenon in the model is quite challenging and usually remains hidden or unobserved for a long time. This is an example of how an unobserved effect specific to one-time period or multiple time periods can affect severity outcomes. In our study, we implement modeling approaches that simultaneously accommodate for the influence of observed and unobserved attributes on driver injury severity across multiple time points.

Given the inherent ordering of the data, we adopt a generalized ordered logit (GOL) model kernel structure for severity analysis. The GOL framework relaxes the restrictive assumption of the traditional ordered outcome (Ordered logit/probit) models (monotonic effect of exogenous variables) while simultaneously recognizing the inherent ordering of the injury severity variable – information that unordered model alternatives fail to consider (see Eluru et al., 2008, Eluru and Yasmin, 2015, Yasmin and Eluru, 2013). Further, to incorporate the effect of observed and unobserved temporal effects, we specifically consider two versions of the GOL model – the mixed GOL model and the scaled GOL model. The two variants differ in the way they incorporate the influence of unobserved attributes within the outcome process. The mixed GOL model captures unobserved heterogeneity by allowing the variable effects (including the constant) influencing injury severity to be distributed across the observations, while the scaled model accommodates for common unobserved heterogeneity by allowing the variance of the unobserved component to vary by time period. The two approaches also vary in how they accommodate for heteroscedasticity across observations. We estimate both models and employ data fit comparison metrics to determine the appropriate model structure. The model specification is undertaken to quantify how the impact of exogenous variables has altered over the 25-year period on driver injury severity. The model development exercise is conducted with a host of exogenous variables including driver characteristics, vehicle characteristics, roadway attributes, environmental factors, crash characteristics and temporal attributes. In summary, the current research effort contributes empirically by identifying if the impact of any exogenous factors has varied over time, and if so, quantifying the change in magnitude. Methodologically, the research contributes to driver injury severity analysis by proposing a framework that addresses the challenges associated with pooled (or pseudo-panel) data.

The remainder of the paper is organized as follows. Section 2 provides a brief literature review. Section 3 briefly outlines the econometric frameworks that are considered. Section 4 presents data preparation steps for the modeling exercise. In Section 5, we discuss the model results. Section 6 presents a summary of policy analysis. Finally, Section 6 provides the summary and concludes our findings.

Section snippets

Literature review

Several research efforts have examined driver injury severity in safety literature. It is beyond the scope of the paper to provide a detailed review (see Savolainen et al., 2011, Yasmin and Eluru, 2013 for a detailed review). Based on earlier literature, the most important factors influencing injury severity by attribute group are presented below with emphasis on factors contributing to or reducing injury severity outcome in the event of a crash.

Econometric framework

In this section, we provide a brief description of the methodology of all the models considered for examining driver injury severity in our research. For the ease of presentation, we describe the general mathematical structure first and then identify the different modeling structures for various models in the subsequent discussion. We will first introduce the traditional ordered logit (OL) model, then discuss about the generalized ordered logit (GOL) model, scaled generalized ordered logit

Data source and data description

The data for the current study is sourced from the “General Estimates System (GES)” database for the years 1989, 1994, 1999, 2004, 2009, and 2014. The GES database is a nationally representative sample of road crashes collected and compiled from about 60 jurisdictions across the United States. The data is obtained from the U. S. Department of Transportation, National Highway Traffic Safety Administration’s National Center for Statistics and Analysis (ftp://ftp.nhtsa.dot.gov/GES/). The data

Variable considered

In road crash reporting system, the definition of independent variables and the reporting approach may vary across different years. The Analytical User’s Manual of GES data has detailed information on the label of each variables reported in the GES database from 1988 to 2014 (NHTSA, 2015). From the report it is quite evident that there are variations in variable documentation across different years. However, the variables should be uniform for all years in a pooled dataset for estimating a

Estimation results

In presenting the effects of exogenous variables in the model specification, we will restrict ourselves to the discussion of the SGOL model. Table 3 presents the estimation results. In SGOL model, when the threshold parameter is positive (negative), the result implies that the threshold is bound to increase (decrease); the actual effect on the probability is quite non-linear and can only be judged in conjunction with the influence of the variable on propensity and other thresholds. In the

Model illustration

The exogenous variable coefficients do not directly provide the magnitude of impacts of variables on the probability of different injury severity levels. Moreover, the impacts of coefficients of the SGOL framework might not be readily interpretable due to the interactions between propensity, threshold and scale parameters. Hence, to provide a better understanding of the impacts of exogenous factors, we compute disaggregate level changes in driver injury severity levels. Specifically, we focus

Elasticity effects

The model illustration section provides a representation of how overall injury severity profiles vary over time. However, to identify the impact of various exogenous variables, we compute the aggregate level “elasticity effects” for independent variables (see Eluru and Bhat (2007) for a discussion on the methodology for computing elasticities). We present the elasticity effects for a set of variables – Female, Under the influence of alcohol, Unrestrained, Pickup, On interstate highway,

Conclusion

The current study undertook a unique research effort to quantify the impact of various exogenous factors on crash severity over time. Specifically, we examined if over time, the impact of exogenous variables has changed and if so what is the magnitude of the change. In our study, we systematically identify exogenous variables that offer time-varying effects and quantify the change in their impact. For this purpose, we drew data from the General Estimates System (GES) database, over a span of

References (47)

  • N. Eluru et al.

    A joint econometric analysis of seat belt use and crash-related injury severity

    Accident Analysis and Prevention

    (2007)
  • N. Eluru et al.

    A mixed generalized ordered response model for examining pedestrian and bicyclist injury severity level in traffic crashes

    Accident Analysis and Prevention

    (2008)
  • N. Eluru et al.

    A note on generalized ordered outcome models

    Analytic Methods in Accident Research

    (2015)
  • M. Fredette et al.

    Safety impacts due to the incompatibility of SUVs, minivans, and pickup trucks in two-vehicle collisions

    Accident Analysis and Prevention

    (2008)
  • P. Gårder

    Segment characteristics and severity of head-on crashes on two-lane rural highways in Maine

    Accident Analysis and Prevention

    (2006)
  • J.M. Holdridge et al.

    The crash severity impacts of fixed roadside objects

    Journal of Safety Research

    (2005)
  • H. Huang et al.

    Severity of driver injury and vehicle damage in traffic crashes at intersections: a Bayesian hierarchical analysis

    Accident Analysis and Prevention

    (2008)
  • S. Islam et al.

    Driver aging and its effect on male and female single-vehicle accident injuries: some additional evidence

    Journal of Safety Research

    (2006)
  • J.-K. Kim et al.

    Driver-injury severity in single-vehicle crashes in California: a mixed logit analysis of heterogeneity due to age and gender

    Accident Analysis and Prevention

    (2013)
  • K.M. Kockelman et al.

    Driver injury severity: an application of ordered probit models

    Accident Analysis and Prevention

    (2002)
  • F. Mannering

    Temporal instability and the analysis of highway accident data

    Analytic Methods in Accident Research

    (2018)
  • S.M. Rifaat et al.

    Effect of street pattern on the severity of crashes involving vulnerable road users

    Accident Analysis and Prevention

    (2011)
  • P.T. Savolainen et al.

    The statistical analysis of highway crash-injury severities: a review and assessment of methodological alternatives

    Accident Analysis and Prevention

    (2011)
  • Cited by (41)

    View all citing articles on Scopus
    View full text