Space-time routing in dedicated automated vehicle zones

https://doi.org/10.1016/j.trc.2020.102777Get rights and content

Highlights

Abstract

With the fast development of automated vehicle (AV) technologies, scholars have proposed various innovative local traffic control schemes for more effective management of AV traffic, especially at intersections. However, due to computational intractability, the investigation of network-level AV control is still at the initial stage. This study proposes a space-time routing framework applicable in dedicated AV zones. To relieve the computational load, we establish a node-based conflict point network to model realistic road networks, and at each conflict point, we record the space-time occupations of AVs in continuous timelines. Then, based on the conflict point network, we develop two space-time routing algorithms for each AV once it enters the dedicated AV zone to minimize its trip travel time while maintaining the non-collision insurances; these two algorithms can trade-off between solution quality and computational load. Furthermore, to enhance the network throughput for handling heavy traffic, we develop a ”platoon strategy” that forces AVs to pass through conflict points in platoons, and we adopt Deep Q-learning (DQN) to optimize the platoon sizes at different spots dynamically. Numerical tests show that both proposed algorithms perform well in that they can execute the routing tasks with very limited computational time, and the average vehicle delay approaches zero when the traffic is relatively mild. Meanwhile, compared with the FCFS policy and the optimization-based approach, the platoon strategy can greatly reduce the average vehicle delay under congested scenarios and give a better balance between the optimality and real-time performance.

Introduction

Automated vehicles (AVs) are considered a crucial component of the future of transportation by many scholars and practitioners. AVs collect environmental information through their sensors, as well as through some roadside facilities (V2I) and other vehicles (V2V). Decisions can then be made either by the AVs themselves or by some cloud controllers. Several promising benefits of AVs are foreseeable, and AV technologies can be used to affect traffic demand (Auld et al., 2017) and improve lane throughput (Talebpour and Mahmassani, 2016). Furthermore, with the development of a new generation of communication technologies and advanced decision-making techniques, it is envisioned that AVs will reshape the form of human transport. For example, the traffic organization of AVs can be taken over by some local or global controller in the future, and the controllers can acquire the optimal trajectory of each vehicle for the minimization of total travel time or energy consumption.

The management and planning of AVs is becoming a hot research area. In recent years, some scholars have developed approaches for traffic organization and management in the AV era, mainly focusing on corridor-level and intersection-level management. In particular, the corridor-level problem mainly includes mixed traffic speed harmonization (Ghiasi et al., 2019), trajectory-based management (Lu et al., 2019), and single-lane roundabout management (Martin-Gasulla and Elefteriadou, 2019). The intersection-level management of AVs can be classified into two main categories: decentralized control and centralized control strategies. Decentralized control allows the AVs to make decisions by themselves while considering V2V coordination. This category of study usually assumes that intersections can be divided into a set of grids to discretize the motions of vehicles. The objectives of vehicles are usually to minimize safety risks (Katrakazas et al., 2015, Zohdy, 2012), to maximize intersection efficiencies (Roncoli et al., 2015), or to minimize fuel consumption (Zhang et al., 2016). Further, reinforcement-learning(RL)-based methods have been introduced to mitigate the effects of the non-stationary environment problem in decentralized multi-agent learning (Wu et al., 2019). Besides, conflict point formulation has been used in a previous study on automated intersection control (Zhu and Ukkusuri, 2015). Also, Mirheli et al. (2019) defines conflict zones, allow vehicle to share states and formulate a MINLP problem to minimize delay and speed variance. Furthermore, based on the conflict relationship of involved vehicles, Xu et al. (2018) projects the approaching vehicles from different traffic movements into a virtual lane and introduces a conflict-free geometry topology. In contrast to decentralized control, the early-proposed centralized scheme controls the actions of all AVs through a centralized controller. The automated intersection management (AIM) problem, first proposed by Dresner and Stone (2004), is a typical centralized control scheme. As an alternative to traffic signals for AVs, AIM applies a reservation-based framework, i.e., an approaching vehicle sends a request to the intersection manager (IM), and the IM specifies its turning movement as well as the time at which it can enter the intersection. Following the AIM, various centralized methods have been developed. For example, Levin and Rey (2017) proposed a rolling-horizon method to decide the optimal reservation assignment for a large number of vehicles, and Li et al. (2019) provided accurate descriptions of temporal-spatial trajectories to further reduce the conflict-avoidance loss. In short, the trend of the research about the organization and coordination of AVs can be summarized as two stages: the first is to achieve a feasible and efficient conflict-free routing, and on this basis, the second stage is to realize some optimization goals while considering some practical constraints, such as the communication between vehicles (Xu et al., 2018).

In most cases, the pass order decision or trajectory management problems are formulated as a mixed integer programming, but the scalability of such method is poor, which leads to difficulties in extending the control to network-level (or zone level), because of the unacceptable computational burden. For a realistic-size transportation network, there could be hundreds to thousands of vehicles moving simultaneously, and there are hundreds of conflict points in the network that require a conflict-free guarantee for all pairs of vehicles, so the centralized real-time control of the network traffic inevitably leads to an optimization problem with enormous size; moreover, in practical implementation, the problem must be solved very efficiently for instantaneous decision-making. Thus, for network-wise control, a framework with a certain level of decentralization must be proposed to relieve the computational load.

In the field of logistics, the real-time management of automated guide vehicles (AGVs) also faces the difficulty of computational intractability, i.e., multiple AGVs are assigned to complete multiple pick-up and delivery tasks in a given network of warehouses or ports. Extensive research on AGV dispatching, routing, scheduling, and collision deadlock avoidance has been published in the literature. Similar to traffic control, the AGV control approaches can also be classified as centralized or decentralized. In centralized control, a hierarchical control framework is developed to reduce the scale and complexity of the problem (Cardarelli et al., 2017). Some decentralized approaches have been studied in recent years, e.g., the agent-based AGV controller (Wallace, 2001), zone-control-based decentralized control strategies (Fanti et al., 2018), and multiple AGV decentralized control (Draganjac et al., 2016). A time-dependent network is usually adopted in the AGV control system to ensure conflict avoidance; for example, Antakly et al. (2017) developed a policy-based conflict-free routing strategy in a time-dependent network. AGV-related applications are usually in a closed environment, with known workers and limited tasks, so the planning work is much easier than AVs. But Some issues need to be considered in AVs management research have been studied in the AGV system, and some heuristic methods are also helpful.

This study aims to address the problem of network-level AV traffic management in dedicated AV zones with some ideas borrowed from the AGV decentralized control. Unlike AGV control, traffic control is generally more complicated in the following aspects: (i) the behavior of vehicle traffic is much more complicated than AGV traffic, including turning at complicated intersections, lane-changing, etc.; (ii) the traffic demand is highly stochastic, which is extremely difficult to be modeled by some compact mathematical programming. This paper develops a space-time routing strategy on a continuous timeline for AVs on road networks, where each vehicle is assigned with a space-time route once it enters the network for ensuring conflict-free condition. In this paper, we model the underlying road network as a lane-based conflict point network, i.e., a link can only represent a lane instead of a road segment. With this approach, we can accurately describe a set of complicated vehicle motions at intersections or during lane changes by vehicles. Based on the conflict point network, two routing algorithms, namely Dijkstra algorithm with time windows (Dijkstra-TW) and k-shortest path algorithm (KSP), are proposed to solve the space-time routing of each vehicle. Both of the proposed routing algorithms apply to every single vehicle entering the zone instead of a group of vehicles.

However, it is difficult to guarantee the routing performances, especially in the cases with heavy traffic. To relieve this issue, we further propose a dynamic platoon strategy. The idea behind the platoon strategy is built upon the observation that the minimum time interval required between two crossing vehicles is larger than that between two following vehicles (Tachet et al., 2016, Yu et al., 2019), indicating that the vehicle traffic should be formed into platoons under heavy traffic while passing through conflict points. To determine the optimal platoon sizes at different spots within the zone dynamically, we adopt the Deep Q-learning (DQN) method that can utilize the real-time traffic information to update the current platoon sizes for the maximization of the all-time reward, i.e., the minimization of average vehicle delay in the whole time horizon. Both the proposed routing framework and the platoon strategy are tested using self-coded computer simulations in artificial dedicated AV zones.

The rest of the paper is organized as follows. Section 2 formulates the lane-based conflict point network as well as some specific realistic settings, and based on this, we formulate the space-time routing problem for each approaching vehicle. In Section 3, the Dijkstra-TW algorithm and the KSP algorithm for solving the routing problem are presented in detail. In Section 4, we introduce the platoon strategy to space-time routing and illustrate the DQN framework for determining the platoon sizes. Section 5 shows some results by testing the proposed methods in a dedicated AV zone and presents their performances under various scenarios. Finally, Section 6 concludes the paper.

Section snippets

Problem statement

In this section, the conflict point network is established and the conflict-free routing problem is formulated. In traditional network modeling, nodes typically represent intersections, and arcs represent roadway segments. This paper designs a lane-based conflict point network, in which nodes represent conflict points and arcs represent the lane segments. Conflict points are defined as the areas where lane segments from different directions occupy the same road space, and they occur at

Space-time routing algorithms

This section proposes two conflict-free routing algorithms based on the conflict point network to solve the optimization problem Eq. (7). Unlike the conventional shortest-path problem, the routing problem in this study must deal with a time-dependent network with node-specific time windows, and some additional constraints, e.g., the speed change constraints at intersections, must be taken into account. In the following subsections, we first present the Dijkstra algorithm with time windows

Motivations and settings

The conflict-free routing framework proposed in the previous section operates in a decentralized way, implying that each vehicle makes the routing decision without considering its potential impact on the decisions of other vehicles. Besides, the time occupations on conflict points of a vehicle will not be altered once it enters the zone. As a result, when the traffic in the dedicated AV zone is heavy, the strategies proposed in Section 2 may not be able to handle it well, and some additional

Test settings

In this section, we perform a variety of experiments to test the performances of our proposed framework, both the original conflict-free routing and the platoon strategy, for gaining some insights. We evaluate the solutions on the metrics of average delay d¯, which is defined by Eq. (19).d¯=iVtindi-tifreendi|V|where V is the set of all vehicles and tifree is the zone departure time of vehicle i traveling with free-flow speed on the shortest path. For each vehicle, we also define a delay ratio

Conclusions and discussions

This paper develops a methodological framework for autonomous vehicle control at the network level. Based on a conflict point network establishment, we develop a space-time routing protocol for the traffic organization in the network such that the routing decision is made for every single AV when it enters the network, and the information of time window occupancy is provided to avoid vehicle collision at conflict points. Two conflict-free routing algorithms, i.e., Dijkstra-TW and KSP, are

Declaration of Competing Interest

None.

Acknowledgements

This study is supported by grants from National Natural Science Foundation of China (71871126, 51622807). This study is supported in part by the Tsinghua University-Toyota Research Center.

References (34)

  • B. Xu et al.

    Distributed conflict-free cooperation for multiple connected vehicles at unsignalized intersections

    Transport. Res. Part C: Emerg. Technol.

    (2018)
  • C. Yu et al.

    Managing connected and automated vehicles at isolated intersections: From reservation- to optimization-based methods

    Transport. Res. Part B: Methodol.

    (2019)
  • F. Zhu et al.

    A linear programming formulation for autonomous intersection control within a dynamic traffic assignment and connected vehicle environment

    Transport. Res. Part C: Emerg. Technol.

    (2015)
  • J. Auld et al.

    Analysis of the effects of connected–automated vehicle technologies on travel demand

    Transp. Res. Rec.

    (2017)
  • J.E. Beasley et al.

    An algorithm for the resource constrained shortest path problem

    Networks

    (2010)
  • Desrochers, M., Soumis, F., 1988a. A generalized permanent labeling algorithm for the shortest path problem with time...
  • I. Draganjac et al.

    Decentralized control of multi-agv systems in autonomous warehousing applications

    IEEE Trans. Autom. Sci. Eng.

    (2016)
  • Cited by (15)

    • Network-level control of heterogeneous automated traffic guaranteeing bus priority

      2022, Transportation Research Part C: Emerging Technologies
      Citation Excerpt :

      Fruitful studies have been developed to enhance the traffic management and control under the CAV environment. For single intersections, both signalized control strategies (Li et al., 2014; Feng et al., 2018) and signal-free strategies (Dresner and Stone, 2004; Lee and Park, 2012) are proposed; besides, control strategies for arterials or networks are also investigated (Wang et al., 2020; Lu et al., 2019; An et al., 2020; Qian et al., 2021). In spite of the satisfactory performances of these control strategies, homogeneous traffic flow has been assumed in most of the studies.

    • Multi-rhythm control for heterogeneous traffic and road networks in CAV environments

      2022, Transportation Research Part E: Logistics and Transportation Review
      Citation Excerpt :

      The application of emerging technologies implies a tremendous revolution in traffic operations. Fruitful studies have attempted to enhance traffic management and control under a connected and automated vehicle (CAV) environment, including for single intersections (Dresner and Stone, 2004; Lee and Park, 2012; Levin and Rey, 2017; Feng et al., 2018) and arterials or networks (Wang et al., 2020; Lu et al., 2019; An et al., 2020; Qian et al., 2021; Wu et al., 2020). Despite the satisfactory performances of these control strategies, homogeneous traffic flow has been assumed in most of the studies.

    • Aggregate modeling and equilibrium analysis of the crowdsourcing market for autonomous vehicles

      2021, Transportation Research Part C: Emerging Technologies
      Citation Excerpt :

      Control of fully AVs may be the best investigated topic (Wu et al., 2021), including route planning (Wang et al., 2020), longitudinal control, i.e. Cooperative Adaptive Cruise Control (Gong and Du, 2018), and lateral control of autonomous vehicles (Yang et al., 2018). An et al. (2020) proposed a space–time routing framework for control of AVs in dedicated AV zones. Chen et al. (2020) studied control of AVs using dedicated bus rapid transit (BRT) lanes and Chen et al. (2020) studied control of AVs in mixed flow of AVs and manually driven vehicles (MVs).

    View all citing articles on Scopus
    View full text