Exploring nonlinear effects of the built environment on ridesplitting: Evidence from Chengdu

https://doi.org/10.1016/j.trd.2021.102776Get rights and content

Highlights

  • Built environment factors have strong non-linear effects on the ridesplitting ratio.

  • Gradient Boosting Decision Trees model is used to analyze the effects.

  • Distance to the city center is the most important among built environment factors.

  • Land use diversity and road density also have strong impacts on ridesplitting ratios.

  • Non-linear thresholds of built environment factors are identified to guide the planning.

Abstract

Ridesplitting, a form of ridesourcing services that matches riders with similar routes to the same driver, is a high occupancy travel mode that can bring considerable benefits. However, the current ratio of ridesplitting in the ridesourcing services is relatively low and its influencing factors remain unrevealed. Therefore, this paper uses a machine learning method, gradient boosting decision tree (GBDT) model, to explore the nonlinear effects of built environment on the ridesplitting ratio of origin–destination pairs (census tract to census tract). The GBDT model also provides the relative importance ranking of all the built environment factors. The results indicate that distance to city center, land use diversity and road density are the key influencing factors of ridesplitting ratio. In addition, the non-linear thresholds of built environment factors are identified based on partial dependence plots, which could provide policy implications for the government and transportation network companies to promote ridesplitting.

Introduction

On-demand ridesourcing services, operated by transportation network companies, match passengers and drivers through intelligent mobile phone applications (Rayle et al., 2016). Ridesourcing services have become increasingly popular due to their convenience. As of 2020, Uber has provided services in over 890 cities in 71 countries (Wyatt, 2020). As of 2018, Lyft has provided services in 200 cities in the US with over 315,000 drivers (Lyft, 2020). DiDi Chuxing has provided mobility services to more than 550 million users all over China. On average, there are more than 30 million travel orders on the DiDi Chuxing platform per day, representing more than 350 million vehicle kilometers per day nationwide (DiDi Chuxing, 2018).

Ridesplitting, one form of ridesourcing services, matches riders with similar routes to the same driver, such as “UberPool” and “Lyft Line” (Shaheen et al., 2016a). There are some debates claiming that on-demand ridesourcing services could increase congestion and pollution (Anair et al., 2020). However, ridesplitting, a new shared mobility service, is a more sustainable travel mode for improving traffic efficiency and reducing traffic congestion and air pollution problems. Policy makers have realized the benefits of ridesplitting services and encouraged ridesourcing companies to launch ridesplitting services for passengers. For DiDi Chuxing, the number of passengers who choose ridesplitting services has continued to increase since its launch at the end of 2015. By the end of 2019, the cumulative number of users reached 2.9 billion, with a compound annual growth rate of 143.3%. The passenger volume using ridesplitting services was equivalent to 1.2 times the civil aviation passenger volume in 2019 (DiDi Chuxing, 2020).

However, the current ridesplitting ratio in key ridesourcing services provided in the city of Chengdu is low, only 7% (Li et al., 2019). The average time usage rate of private cars in China is only 7% (Accenture, 2016), while the rate is only 4% in the United Kingdom (Bates & Leibling, 2012) and 17% in the United States (Santos et al, 2011). This low utilization rate leads to a serious waste of resources. With the development of technology for autonomous driving, idle private cars could be used for autonomous ridesourcing services or ridesplitting services in the future. Tu et al. (2019) showed that such ridesplitting services could potentially have great benefits in the city of Chengdu, specifically by improving the percentage of cost savings and time savings to 17% and 23%, respectively.

The existing literature has proven that the built environment has a strong relationship with travel behaviors (Ding et al., 2018, Ding et al., 2019, Durning and Townsend, 2015, Tao et al., 2020). However, to the best of our knowledge, no existing research has investigated how the built environment impacts ridesplitting services. To fill this gap, the aim of this study is to explore the relationship between the built environment and origin–destination (OD) ridesplitting ratio using observed ridesourcing data of Chengdu, China. Specifically, this study involves three main tasks. First, we use a machine learning method, gradient boosting decision tree (GBDT) model, to identify the important features of the built environment at the origins and destinations of ridesplitting services. Second, we explore the nonlinear relationship between the ridesplitting service and key explanatory variables by creating partial dependence plots. Third, we provide insightful results that can help transportation network companies improve existing ridesplitting services and have policy implications for urban planners seeking to better understand how the built environment, demographic factors and travel time impact ridesplitting services.

The remainder of this paper is organized as follows. Section 2 summarizes the related research. Section 3 presents the data and variables used in this study. The GBDT model is described in Section 4. Section 5 presents the results in detail and discusses the related insights regarding how the built environment features, demographic factors and travel time impact ridesplitting services. The final section summarizes the main findings, presents some policy implications and proposes future research directions.

Section snippets

Studies on ridesourcing services

Most studies on ridesourcing focus on three specific aspects: (1) identifying the characteristics of passengers who use ridesourcing services and their preferences (Shaheen et al., 2016b, Nielsen et al., 2015, Wang et al., 2020); (2) exploring the impacts of ridesourcing services on passengers, such as waiting time (Rayle et al., 2014, Hughes and MacKenzie, 2016), drivers, such as income (Angrist et al., 2017, Chen et al., 2019, Hall et al., 2018), the environment (Kent, 2014), other traffic

Study area

Chengdu is the capital of Sichuan Province and is one of the three most populous cities in Western China. As of 2020, the total area in Chengdu was 14,335 square kilometers, the local population was 16.58 million, the urban population was 12.34 million, and the urbanization rate was 74.42% (Chengdu Bureau of Statistics, 2020).

Since the launch of ridesplitting services on the DiDi Chuxing platform in 2015, the number of passengers who chose ridesplitting services has continued to increase.

Methodology

We choose the built environment features that are measured by the ‘four Ds’, namely, density, land use diversity, design, and distance to city center. Then, we use the GBDT model to predict the OD ridesplitting ratio and investigate the relative importance ranking of all the explanatory variables. Furthermore, we explore the nonlinear relationship between the ridesplitting ratio and key explanatory variables by generating partial dependence plots. We find some useful thresholds from the partial

Performance of the GBDT model

We use a five-fold cross-validation procedure to obtain the optimal parameter settings and a robust result. We fit the models with different numbers of trees (5000, 10000, 15000), shrinkage (0.005, 0.05, 0.01, 0.1) and tree complexity (1, 3, 5, 7) based on the experimental findings of previous studies. We fit the model 240 times and find that the best performance is obtained when the number of trees, shrinkage and tree complexity are set as 5000, 0.005 and 7, respectively.

As shown in Table 3,

Conclusions and implications

To explore the effects of built environment features, demographic factors and travel time on the OD ridesplitting ratio, this paper employs a GBDT model using observed ridesplitting data of Chengdu obtained from DiDi’s open data project. GBDT is a machine learning method without a prior hypothesis of a particular function such as a linear or log linear relationship. The GBDT model can be used to predict the OD ridesplitting ratio and even the nonlinear relationships with the explanatory

CRediT authorship contribution statement

Meiting Tu: Conceptualization, Data curation, Methodology, Writing - original draft. Wenxiang Li: Conceptualization, Data curation, Writing - review & editing, Funding acquisition. Olivier Orfila: Funding acquisition, Supervision, Writing - review & editing. Ye Li: Funding acquisition, Supervision, Writing - review & editing. Dominique Gruyer: Project administration, Funding acquisition, Supervision, Writing - review & editing.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

This study was supported by the National Natural Science Foundation of China [Grant numbers: 52002244, 71774118]; Shanghai Pujiang Program [Grant number: 2020PJC083]; Shanghai Planning Office of Philosophy and Social Sciences [Grant number: 2020EGL019]; and the Science and Technology Commission of Shanghai Municipality [Grant number: 20692192200].

References (50)

  • W. Li et al.

    Siting of carsharing stations based on spatial multi-criteria evaluation: a case study of Shanghai EVCARD

    Sustainability

    (2017)
  • W. Li et al.

    Characterization of ridesplitting based on observed data: a case study of Chengdu, China

    Transport. Res. Part C: Emerg. Technol.

    (2019)
  • J.R. Nielsen et al.

    Of “white crows” and “cash savers:” A qualitative study of travel behavior and perceptions of ridesharing in Denmark

    Transport. Res. Part A: Policy Practice

    (2015)
  • L. Rayle et al.

    Just a better taxi? A survey-based comparison of taxis, transit, and ridesourcing services in San Francisco

    Transp. Policy

    (2016)
  • S. Sabouri et al.

    Exploring the influence of built environment on Uber demand

    Transport. Res. Part D: Transp. Environ.

    (2020)
  • S.A. Shaheen et al.

    Casual carpooling in the San Francisco Bay Area: understanding user characteristics, behaviors, and motivations

    Transp. Policy

    (2016)
  • T. Tao et al.

    Exploring the non-linear associations between spatial attributes and walking distance to transit

    J. Transp. Geogr.

    (2020)
  • M. Tu et al.

    Improving ridesplitting services using optimization procedures on a shareability network: a case study of Chengdu

    Technol. Forecast. Soc. Chang.

    (2019)
  • W. Tu et al.

    Spatial variations in urban public ridership derived from GPS trajectories and smart card data

    J. Transp. Geogr.

    (2018)
  • X. Yan et al.

    Using machine learning for direct demand modeling of ridesourcing services in Chicago

    J. Transp. Geogr.

    (2020)
  • H. Yu et al.

    Exploring the spatial variation of ridesourcing demand and its relationship to built environment and socioeconomic factors with the geographically weighted Poisson regression

    J. Transp. Geogr.

    (2019)
  • Y. Zhang et al.

    Exploring the relationship between ridesharing and public transit use in the United States

    Int. J. Environ. Res. Publ. Health

    (2018)
  • Accenture, 2016, Car Sharing Service Outlook in China....
  • D. Anair et al.

    Ride-Hailing’s Climate Risks: Steering a Growing Industry Toward a Clean Transportation Future

    (2020)
  • Angrist, J.D., Caldwell, S., Hall, J.V., 2017. Uber vs. taxi: A driver’s eye view (No. w23891). National Bureau of...
  • Cited by (82)

    View all citing articles on Scopus
    View full text