Space-time routing in dedicated automated vehicle zones

doi:10.1016/j.trc.2020.102777

Transportation Research Part C: Emerging Technologies

Volume 120, November 2020, 102777

https://doi.org/10.1016/j.trc.2020.102777 Get rights and content

Highlights

•
Conflict-free routing algorithms for automated vehicles in dedicated zones.
•
Conflict-point networks with time occupations on conflict points.
•
Dynamic platoon strategy to reduce delay in heavy traffic.
•
Deep reinforcement learning for optimizing the platoon sizes.

Abstract

With the fast development of automated vehicle (AV) technologies, scholars have proposed various innovative local traffic control schemes for more effective management of AV traffic, especially at intersections. However, due to computational intractability, the investigation of network-level AV control is still at the initial stage. This study proposes a space-time routing framework applicable in dedicated AV zones. To relieve the computational load, we establish a node-based conflict point network to model realistic road networks, and at each conflict point, we record the space-time occupations of AVs in continuous timelines. Then, based on the conflict point network, we develop two space-time routing algorithms for each AV once it enters the dedicated AV zone to minimize its trip travel time while maintaining the non-collision insurances; these two algorithms can trade-off between solution quality and computational load. Furthermore, to enhance the network throughput for handling heavy traffic, we develop a ”platoon strategy” that forces AVs to pass through conflict points in platoons, and we adopt Deep Q-learning (DQN) to optimize the platoon sizes at different spots dynamically. Numerical tests show that both proposed algorithms perform well in that they can execute the routing tasks with very limited computational time, and the average vehicle delay approaches zero when the traffic is relatively mild. Meanwhile, compared with the FCFS policy and the optimization-based approach, the platoon strategy can greatly reduce the average vehicle delay under congested scenarios and give a better balance between the optimality and real-time performance.

Introduction

Automated vehicles (AVs) are considered a crucial component of the future of transportation by many scholars and practitioners. AVs collect environmental information through their sensors, as well as through some roadside facilities (V2I) and other vehicles (V2V). Decisions can then be made either by the AVs themselves or by some cloud controllers. Several promising benefits of AVs are foreseeable, and AV technologies can be used to affect traffic demand (Auld et al., 2017) and improve lane throughput (Talebpour and Mahmassani, 2016). Furthermore, with the development of a new generation of communication technologies and advanced decision-making techniques, it is envisioned that AVs will reshape the form of human transport. For example, the traffic organization of AVs can be taken over by some local or global controller in the future, and the controllers can acquire the optimal trajectory of each vehicle for the minimization of total travel time or energy consumption.

The management and planning of AVs is becoming a hot research area. In recent years, some scholars have developed approaches for traffic organization and management in the AV era, mainly focusing on corridor-level and intersection-level management. In particular, the corridor-level problem mainly includes mixed traffic speed harmonization (Ghiasi et al., 2019), trajectory-based management (Lu et al., 2019), and single-lane roundabout management (Martin-Gasulla and Elefteriadou, 2019). The intersection-level management of AVs can be classified into two main categories: decentralized control and centralized control strategies. Decentralized control allows the AVs to make decisions by themselves while considering V2V coordination. This category of study usually assumes that intersections can be divided into a set of grids to discretize the motions of vehicles. The objectives of vehicles are usually to minimize safety risks (Katrakazas et al., 2015, Zohdy, 2012), to maximize intersection efficiencies (Roncoli et al., 2015), or to minimize fuel consumption (Zhang et al., 2016). Further, reinforcement-learning(RL)-based methods have been introduced to mitigate the effects of the non-stationary environment problem in decentralized multi-agent learning (Wu et al., 2019). Besides, conflict point formulation has been used in a previous study on automated intersection control (Zhu and Ukkusuri, 2015). Also, Mirheli et al. (2019) defines conflict zones, allow vehicle to share states and formulate a MINLP problem to minimize delay and speed variance. Furthermore, based on the conflict relationship of involved vehicles, Xu et al. (2018) projects the approaching vehicles from different traffic movements into a virtual lane and introduces a conflict-free geometry topology. In contrast to decentralized control, the early-proposed centralized scheme controls the actions of all AVs through a centralized controller. The automated intersection management (AIM) problem, first proposed by Dresner and Stone (2004), is a typical centralized control scheme. As an alternative to traffic signals for AVs, AIM applies a reservation-based framework, i.e., an approaching vehicle sends a request to the intersection manager (IM), and the IM specifies its turning movement as well as the time at which it can enter the intersection. Following the AIM, various centralized methods have been developed. For example, Levin and Rey (2017) proposed a rolling-horizon method to decide the optimal reservation assignment for a large number of vehicles, and Li et al. (2019) provided accurate descriptions of temporal-spatial trajectories to further reduce the conflict-avoidance loss. In short, the trend of the research about the organization and coordination of AVs can be summarized as two stages: the first is to achieve a feasible and efficient conflict-free routing, and on this basis, the second stage is to realize some optimization goals while considering some practical constraints, such as the communication between vehicles (Xu et al., 2018).

In most cases, the pass order decision or trajectory management problems are formulated as a mixed integer programming, but the scalability of such method is poor, which leads to difficulties in extending the control to network-level (or zone level), because of the unacceptable computational burden. For a realistic-size transportation network, there could be hundreds to thousands of vehicles moving simultaneously, and there are hundreds of conflict points in the network that require a conflict-free guarantee for all pairs of vehicles, so the centralized real-time control of the network traffic inevitably leads to an optimization problem with enormous size; moreover, in practical implementation, the problem must be solved very efficiently for instantaneous decision-making. Thus, for network-wise control, a framework with a certain level of decentralization must be proposed to relieve the computational load.

In the field of logistics, the real-time management of automated guide vehicles (AGVs) also faces the difficulty of computational intractability, i.e., multiple AGVs are assigned to complete multiple pick-up and delivery tasks in a given network of warehouses or ports. Extensive research on AGV dispatching, routing, scheduling, and collision deadlock avoidance has been published in the literature. Similar to traffic control, the AGV control approaches can also be classified as centralized or decentralized. In centralized control, a hierarchical control framework is developed to reduce the scale and complexity of the problem (Cardarelli et al., 2017). Some decentralized approaches have been studied in recent years, e.g., the agent-based AGV controller (Wallace, 2001), zone-control-based decentralized control strategies (Fanti et al., 2018), and multiple AGV decentralized control (Draganjac et al., 2016). A time-dependent network is usually adopted in the AGV control system to ensure conflict avoidance; for example, Antakly et al. (2017) developed a policy-based conflict-free routing strategy in a time-dependent network. AGV-related applications are usually in a closed environment, with known workers and limited tasks, so the planning work is much easier than AVs. But Some issues need to be considered in AVs management research have been studied in the AGV system, and some heuristic methods are also helpful.

This study aims to address the problem of network-level AV traffic management in dedicated AV zones with some ideas borrowed from the AGV decentralized control. Unlike AGV control, traffic control is generally more complicated in the following aspects: (i) the behavior of vehicle traffic is much more complicated than AGV traffic, including turning at complicated intersections, lane-changing, etc.; (ii) the traffic demand is highly stochastic, which is extremely difficult to be modeled by some compact mathematical programming. This paper develops a space-time routing strategy on a continuous timeline for AVs on road networks, where each vehicle is assigned with a space-time route once it enters the network for ensuring conflict-free condition. In this paper, we model the underlying road network as a lane-based conflict point network, i.e., a link can only represent a lane instead of a road segment. With this approach, we can accurately describe a set of complicated vehicle motions at intersections or during lane changes by vehicles. Based on the conflict point network, two routing algorithms, namely Dijkstra algorithm with time windows (Dijkstra-TW) and k-shortest path algorithm (KSP), are proposed to solve the space-time routing of each vehicle. Both of the proposed routing algorithms apply to every single vehicle entering the zone instead of a group of vehicles.

However, it is difficult to guarantee the routing performances, especially in the cases with heavy traffic. To relieve this issue, we further propose a dynamic platoon strategy. The idea behind the platoon strategy is built upon the observation that the minimum time interval required between two crossing vehicles is larger than that between two following vehicles (Tachet et al., 2016, Yu et al., 2019), indicating that the vehicle traffic should be formed into platoons under heavy traffic while passing through conflict points. To determine the optimal platoon sizes at different spots within the zone dynamically, we adopt the Deep Q-learning (DQN) method that can utilize the real-time traffic information to update the current platoon sizes for the maximization of the all-time reward, i.e., the minimization of average vehicle delay in the whole time horizon. Both the proposed routing framework and the platoon strategy are tested using self-coded computer simulations in artificial dedicated AV zones.

The rest of the paper is organized as follows. Section 2 formulates the lane-based conflict point network as well as some specific realistic settings, and based on this, we formulate the space-time routing problem for each approaching vehicle. In Section 3, the Dijkstra-TW algorithm and the KSP algorithm for solving the routing problem are presented in detail. In Section 4, we introduce the platoon strategy to space-time routing and illustrate the DQN framework for determining the platoon sizes. Section 5 shows some results by testing the proposed methods in a dedicated AV zone and presents their performances under various scenarios. Finally, Section 6 concludes the paper.

Section snippets

Problem statement

In this section, the conflict point network is established and the conflict-free routing problem is formulated. In traditional network modeling, nodes typically represent intersections, and arcs represent roadway segments. This paper designs a lane-based conflict point network, in which nodes represent conflict points and arcs represent the lane segments. Conflict points are defined as the areas where lane segments from different directions occupy the same road space, and they occur at

Space-time routing algorithms

This section proposes two conflict-free routing algorithms based on the conflict point network to solve the optimization problem Eq. (7). Unlike the conventional shortest-path problem, the routing problem in this study must deal with a time-dependent network with node-specific time windows, and some additional constraints, e.g., the speed change constraints at intersections, must be taken into account. In the following subsections, we first present the Dijkstra algorithm with time windows

Motivations and settings

The conflict-free routing framework proposed in the previous section operates in a decentralized way, implying that each vehicle makes the routing decision without considering its potential impact on the decisions of other vehicles. Besides, the time occupations on conflict points of a vehicle will not be altered once it enters the zone. As a result, when the traffic in the dedicated AV zone is heavy, the strategies proposed in Section 2 may not be able to handle it well, and some additional

Test settings

In this section, we perform a variety of experiments to test the performances of our proposed framework, both the original conflict-free routing and the platoon strategy, for gaining some insights. We evaluate the solutions on the metrics of average delay $\bar{d}$ , which is defined by Eq. (19). $\bar{d} = \frac{\sum_{i \in V} [t_{i} (n_{d}^{i}) - t_{i}^{free} (n_{d}^{i})]}{| V |}$ where $V$ is the set of all vehicles and $t_{i}^{free}$ is the zone departure time of vehicle i traveling with free-flow speed on the shortest path. For each vehicle, we also define a delay ratio

Conclusions and discussions

This paper develops a methodological framework for autonomous vehicle control at the network level. Based on a conflict point network establishment, we develop a space-time routing protocol for the traffic organization in the network such that the routing decision is made for every single AV when it enters the network, and the information of time window occupancy is provided to avoid vehicle collision at conflict points. Two conflict-free routing algorithms, i.e., Dijkstra-TW and KSP, are

Declaration of Competing Interest

None.

Acknowledgements

This study is supported by grants from National Natural Science Foundation of China (71871126, 51622807). This study is supported in part by the Tsinghua University-Toyota Research Center.

References (34)

D. Antakly et al.
A temporised conflict-free routing policy for agvs
IFAC-PapersOnLine
(2017)
E. Cardarelli et al.
Cooperative cloud robotics architecture for the coordination of multi-agv systems in industrial warehouses
Mechatronics
(2017)
M. Desrochers et al.
A reoptimization algorithm for the shortest path problem with time windows?
Eur. J. Oper. Res.
(1988)
M.P. Fanti et al.
A decentralized control strategy for the coordination of agv systems
Control Eng. Pract.
(2018)
P. Gipps
A model for the structure of lane-changing decisions
Transport. Res. Part B: Methodol.
(1986)
C. Katrakazas et al.
Real-time motion planning methods for autonomous on-road driving: State-of-the-art and future research directions
Transport. Res. Part C: Emerg. Technol.
(2015)
Z. Li et al.
Temporal-spatial dimension extension-based intersection control formulation for connected and autonomous vehicle systems
Transport. Res. Part C Emerg. Technol.
(2019)
A. Mirheli et al.
A consensus-based distributed trajectory control in a signal-free intersection
Transport. Res. Part C: Emerg. Technol.
(2019)
A. Talebpour et al.
Influence of connected and autonomous vehicles on traffic flow stability and throughput
Transport. Res. Part C Emerg. Technol.
(2016)
Y. Wu et al.
Dcl-aim: Decentralized coordination learning of autonomous intersection management for connected and automated vehicles
Transport. Res. Part C: Emerg. Technol.
(2019)

B. Xu et al.

Distributed conflict-free cooperation for multiple connected vehicles at unsignalized intersections

Transport. Res. Part C: Emerg. Technol.

(2018)

C. Yu et al.

Managing connected and automated vehicles at isolated intersections: From reservation- to optimization-based methods

Transport. Res. Part B: Methodol.

(2019)

F. Zhu et al.

A linear programming formulation for autonomous intersection control within a dynamic traffic assignment and connected vehicle environment

Transport. Res. Part C: Emerg. Technol.

(2015)

J. Auld et al.

Analysis of the effects of connected–automated vehicle technologies on travel demand

Transp. Res. Rec.

(2017)

J.E. Beasley et al.

An algorithm for the resource constrained shortest path problem

Networks

(2010)

Desrochers, M., Soumis, F., 1988a. A generalized permanent labeling algorithm for the shortest path problem with time...

I. Draganjac et al.

Decentralized control of multi-agv systems in autonomous warehousing applications

IEEE Trans. Autom. Sci. Eng.

(2016)

Cited by (15)

A platoon-based cooperative optimal control for connected autonomous vehicles at highway on-ramps under heavy traffic
2023, Transportation Research Part C: Emerging Technologies
To improve traffic efficiency at highway on-ramps under heavy traffic, this study proposes a platoon-based cooperative optimal control algorithm for connected autonomous vehicles (CAVs). The proposed algorithm classifies CAVs on both mainline and on-ramp into multiple local platoons (LPs) according to their initial conditions (i.e., spacing and speed), which enables the algorithm to adapt to time-varying traffic volume. A distributed cooperative control for multiple LPs is designed which projects on-ramp LPs onto mainline to transform the complex 2-D multi-platoon cooperation problem into a 1-D platoon following control problem. An optimal control is applied to further consider the strict nonlinear safety spacing constraint and state limitations (e.g., maximum speed and acceleration), and an analytical solution to the optimal control is derived based on Pontryagin’s maximum principle. The consensus of intra-platoon and inter-platoon are analyzed, and sufficient conditions of the consensus are mathematically deducted based on Lyapunov stability theorem. Numerical simulations are conducted for different traffic demand levels and demand splits to verify the effectiveness of the proposed algorithm. The sensitivity analysis of maximum platoon sizes for mainline and on-ramp LPs is performed. A comparison with a baseline virtual platooning merging strategy is conducted, and results show that the proposed algorithm could significantly improve the average travel speed and traffic efficiency, and reduce total travel time.
Network-level control of heterogeneous automated traffic guaranteeing bus priority
2022, Transportation Research Part C: Emerging Technologies
Citation Excerpt :
Fruitful studies have been developed to enhance the traffic management and control under the CAV environment. For single intersections, both signalized control strategies (Li et al., 2014; Feng et al., 2018) and signal-free strategies (Dresner and Stone, 2004; Lee and Park, 2012) are proposed; besides, control strategies for arterials or networks are also investigated (Wang et al., 2020; Lu et al., 2019; An et al., 2020; Qian et al., 2021). In spite of the satisfactory performances of these control strategies, homogeneous traffic flow has been assumed in most of the studies.
Guaranteeing the quality of transit service is of great importance to promote the attractiveness of buses and alleviate urban traffic issues such as congestion and pollution. Emerging technologies of automated driving and V2X communication have the potential to enable the accurate control of vehicles and the efficient organization of traffic to enhance both the schedule adherence of buses and the overall network mobility. This study proposes a novel network-level control scheme for heterogeneous automated traffic composed of buses and private cars under a full connected and automated environment. Inheriting the idea of network-level rhythmic control proposed by Lin et al. (2021b), an innovative rhythmic control scheme for heterogeneous traffic, i.e., RC-H, is established to organize the mixed traffic in a rhythmic manner. Realized virtual platoons are designed for accommodating vehicles to pass through the network, including dedicated virtual platoons for buses to provide exclusive right-of-ways (ROWs) on their trips and regular virtual platoons for private cars along with an optimal assignment plan to minimize the total travel cost. A mixed-integer linear program (MILP) is formulated to optimize the RC-H scheme with the objective of minimizing the weighted travel costs of both buses and private cars. A bilevel heuristic solution method is designed to relieve the computational burden of MILP. Numerical examples and simulation experiments are conducted to evaluate the performance of the RC-H scheme under different scenarios. The results show that the bus punctuality can be guaranteed and the traveler delay can be minimized under various demand levels with transit priority. Moreover, compared with traffic signal control (TSC) strategies, including those with fixed timing and those with bus priority, the RC-H scheme shows significant advantages in handling heavy traffic demand, in terms of both reducing vehicle delay and enhancing network throughput.
Multi-rhythm control for heterogeneous traffic and road networks in CAV environments
2022, Transportation Research Part E: Logistics and Transportation Review
Citation Excerpt :
The application of emerging technologies implies a tremendous revolution in traffic operations. Fruitful studies have attempted to enhance traffic management and control under a connected and automated vehicle (CAV) environment, including for single intersections (Dresner and Stone, 2004; Lee and Park, 2012; Levin and Rey, 2017; Feng et al., 2018) and arterials or networks (Wang et al., 2020; Lu et al., 2019; An et al., 2020; Qian et al., 2021; Wu et al., 2020). Despite the satisfactory performances of these control strategies, homogeneous traffic flow has been assumed in most of the studies.
Heterogeneity is widely observed in various aspects of urban transport, such as road types, vehicle dynamics, and demand patterns, which greatly complexifies transport systems and hinders the successful control and management of traffic. On one hand, intelligent strategies are required to relieve the loss of efficiency caused by the mixed traffic flow composed of vehicles with different speeds; on the other hand, the spatial heterogeneity of travel demand distribution requires pertinent control schemes to improve the network-level mobility and the phenomenon of oversaturation under heavy traffic also demands an appropriate way to resolve. Leveraging the advantages of connected and automated vehicle (CAV) technology, this study proposes a multi-rhythm control (MRC) scheme to accommodate heterogeneous network traffic in a CAV environment with 100% penetration rate. While the rhythmic control (RC) proposed by (Lin et al., 2021) could organize traffic efficiently and in an orderly manner, the multi-rhythm strategy has greater potential to not only improve the efficiency of heterogeneous traffic by the design of different virtual platoons (VPs) catering to different vehicle types, but also address the spatial heterogeneity of network traffic by implementing differentiated cycle lengths among the road network. For the optimal design of the MRC scheme, a two-step model is developed to minimize the total travel cost, where the first step focuses on the mesoscopic level to jointly optimize the multi-rhythm scheme, VP design and lane configuration for both undersaturated and oversaturated cases, while the second step focuses on the microscopic design to optimize the space–time trajectories of VPs under the premise of collision free. Numerical examples and simulation experiments are conducted to test the performances of the MRC scheme compared to a naive version of RC with a homogeneous rhythm and a traffic signal control (TSC) strategy. The results show that among all control strategies, the MRC scheme has the best performance under all kinds of demand patterns, which is reflected in both the lowest travel cost and the highest network throughput.
Aggregate modeling and equilibrium analysis of the crowdsourcing market for autonomous vehicles
2021, Transportation Research Part C: Emerging Technologies
Citation Excerpt :
Control of fully AVs may be the best investigated topic (Wu et al., 2021), including route planning (Wang et al., 2020), longitudinal control, i.e. Cooperative Adaptive Cruise Control (Gong and Du, 2018), and lateral control of autonomous vehicles (Yang et al., 2018). An et al. (2020) proposed a space–time routing framework for control of AVs in dedicated AV zones. Chen et al. (2020) studied control of AVs using dedicated bus rapid transit (BRT) lanes and Chen et al. (2020) studied control of AVs in mixed flow of AVs and manually driven vehicles (MVs).
Autonomous vehicles (AVs) have the potential of reshaping the human mobility in a wide variety of aspects. This paper focuses on a new possibility that the AV owners have the option of “renting” their AVs to a company, which can use these collected AVs to provide on-demand ride services without any drivers. We call such a mobility market with AV renting options the “AV crowdsourcing market”. This paper establishes an aggregate equilibrium model with multiple transport modes to analyze the AV crowdsourcing market. The modeling framework can capture the customers’ mode choices and AV owners’ rental decisions with the presence of traffic congestion. Then, we explore different scenarios that either maximize the crowdsourcing platform’s profit or maximize social welfare. Gradient-based optimization algorithms are designed for solving the problems. The results obtained by numerical examples reveal the welfare enhancement and the strong profitability of the AV crowdsourcing service. However, when the crowdsourcing scale is small, the crowdsourcing platform might not be profitable. A second-best pricing scheme is able to avoid such undesirable cases. The insights generated from the analyses provide guidance for regulators, service providers and citizens to make future decisions regarding the utilization of the AV crowdsourcing markets for serving the good of the society.
A time and energy efficient merging control for platoon formation of connected and automated electric vehicles at on-ramps
2024, Nonlinear Dynamics
Theoretical Trade-Off Between Fairness and Efficiency in the Cooperative Driving Problem for CAVs at On-Ramps
2024, IEEE Open Journal of Intelligent Transportation Systems

View all citing articles on Scopus

View full text

Space-time routing in dedicated automated vehicle zones

Highlights

Abstract

Introduction

Section snippets

Problem statement

Space-time routing algorithms

Motivations and settings

Test settings

Conclusions and discussions

Declaration of Competing Interest

Acknowledgements

IFAC-PapersOnLine

Mechatronics

Eur. J. Oper. Res.

Control Eng. Pract.

Transport. Res. Part B: Methodol.

Transport. Res. Part C: Emerg. Technol.

Transport. Res. Part C Emerg. Technol.

Transport. Res. Part C: Emerg. Technol.

Transport. Res. Part C Emerg. Technol.

Transport. Res. Part C: Emerg. Technol.

Transport. Res. Part C: Emerg. Technol.

Transport. Res. Part B: Methodol.

Transport. Res. Part C: Emerg. Technol.

Analysis of the effects of connected–automated vehicle technologies on travel demand

Transp. Res. Rec.

An algorithm for the resource constrained shortest path problem

Networks

Decentralized control of multi-agv systems in autonomous warehousing applications

IEEE Trans. Autom. Sci. Eng.