Day-to-day dynamic origin–destination flow estimation using connected vehicle trajectories and automatic vehicle identification data

doi:10.1016/j.trc.2021.103241

Transportation Research Part C: Emerging Technologies

Volume 129, August 2021, 103241

https://doi.org/10.1016/j.trc.2021.103241 Get rights and content

Highlights

•
A novel methodology for recovering day-to-day dynamic OD flow.
•
Fusion of CV trajectories and AVI observations.
•
Obtaining prior OD flows by addressing penetration rate variation and sparsity issue.
•
Determining final estimates utilizing day-to-day traffic characteristics.

Abstract

Dynamic vehicular origin–destination (OD) flow is a fundamental component of traffic network modeling and its estimation has long been studied. Although ideal observing conditions and behavioral assumptions are often indispensable for estimation, day-to-day traffic recurrences and variations are seldom utilized to improve the estimation performance. In this paper, we propose a new method to recover day-to-day dynamic OD flows using both connected vehicle (CV) trajectories and automatic vehicle identification (AVI) observations. The method involves two modules: the first module provides reliable prior OD flows given limited observations, while the second module seeks the optimal estimates based on the prior OD flows. In the first module, linear projection is extended to consider temporal and spatial variation of the CV penetration rate, and non-negative Tucker decomposition (NTD) is adopted to address the data sparsity issue caused by the low CV penetration rate. In the second module, a self-supervised learning model called the latency-constrained autoencoder (LCAE) is established to search for the optimal OD flows according to the priors with given robust latent features. To avoid local minima and ensure consistency between estimates, a novel algorithm called adaptive sub-sample correction (ASC) is proposed and integrated into the optimization process of LCAE, which can iteratively correct the most inconsistent samples based on the day-to-day traffic flow characteristics. The proposed method is examined on an empirical urban arterial network, a calibrated simulation network, and a synthetic large-scale grid network. Our results indicated that the proposed method requires very few AVI detectors and CV trajectories to achieve competitive estimation performance against two benchmark models. Furthermore, general robustness to several factors with respect to observing conditions and data quality was investigated, and satisfactory scalability was also demonstrated in terms of both estimation accuracy and computational cost.

Introduction

Dynamic origin–destination (OD) flow reveals the time-dependent travel demand on road networks. It serves as the fundamental input for dynamic traffic assignment (DTA) models as well as for network optimization programs (Arsava et al., 2018, Peeta and Ziliaskopoulos, 2001). The dynamic OD flow fluctuates from day-to-day due to variations and stochasticity in trip patterns. Thus, active traffic network management also requires accurate estimation of day-to-day dynamic OD flows to handle the uncertainty of traffic demand.

Despite the extensive studies across decades, obtaining accurate time-dependent OD flows given network observations remains challenging due to the observability issue in traffic networks (Castillo et al., 2008a). The observations from the network are much less than the unknown OD flows, and thus, models may not produce a unique solution. Under such circumstances, the existing models often start with a prior OD estimate and solve an optimization program to identify the solution that is most consistent with available observations and assumptions. Then, the objective of such program is to minimize the deviation between estimated and observed (or prior) variables while maintaining network flow conservation described by DTA process. However, a reliable prior may not always exist, especially at central business districts or under rapid urbanization in many developing countries. Besides, DTA models are usually established based on departure time and route choice behavior assumptions to approach user equilibrium status, which could largely deviate from the realistic situation (Yildirimoglu and Kahraman, 2017, Zhu and Levinson, 2015). Furthermore, most current studies have focused on within-day situations while considering deterministic OD flows, such as estimation of the morning peak period; they have not considered day-to-day recurrence and variation of OD flows. Only a few studies have dealt with day-to-day OD flows, but they have mainly estimated the mean, variance and covariance assuming certain OD demand distribution, e.g. multivariate normal distribution (Ma and Qian, 2018b, Shao et al., 2014).

In recent years, connected vehicles, such as vehicles of DiDi and Uber that equipped with GPS unit or drivers that use navigation service on their mobile phone, have emerged as a promising mobile data source because they can provide detailed and accurate traffic flow information. Meanwhile, vehicle re-identification systems have also been rapidly deployed in many countries. The main components of these systems are automatic vehicle identification (AVI) detectors that could uniquely identify each vehicle, including radio frequency identification device (RFID)-based detectors, blue-tooth detectors, and license plate recognition (LPR) devices. Among them, the LPR cameras are mostly used in China, and the data could be accessed by the supplier as well as law enforcement apartment. These data sources can directly provide OD and path-related observations. Owing to the availability of these day-to-day continuous and multi-source heterogeneous observations, external priors and unrealistic assumptions may no longer be required (Ma and Qian, 2018b, Yang et al., 2018). Nevertheless, according to our literature review, few efforts have been devoted to fully utilize the day-to-day observations of these emerging sensors to address the external prior OD usage issue and assumptions in DTA modeling. Therefore, in this paper, we eliminate historical priors and behavioral assumptions and infer OD flows using both CV trajectories and AVI observations via a purely data-driven method.

Early attempts on OD flow estimation mainly focused on the modeling framework based on fixed link counts, including the entropy minimization model (Van Zuylen & Willumsen, 1980), the maximum likelihood estimator (Spiess, 1987), Bayesian inference method (Maher, 1983), and generalized least squares (GLS) models (Bell, 1991, Cascetta, 1984). Among them, the GLS-based models have been most frequently extended and tested, including bi-level and single-level GLS models. The bi-level models consider the effects of congestion, in which the upper level minimizes deviation terms in the form of least square error and the lower level performs DTA based on inference of the equilibrium states (Yang et al., 1991, Yang et al., 1992). To better describe the DTA process, simulators are often incorporated and models are then solved by the stochastic perturbation simultaneous approximation algorithm (Lu et al., 2015). However, this bi-level structure usually leads to non-convexity; therefore, single-level models have been proposed based on relaxation techniques (Lu et al., 2013, Nie and Zhang, 2010). Several recent studies have also shed light on data-driven approaches. Ma and Qian (2018a) used high-granular traffic count and speed data to estimate multi-year 24/7 dynamic OD demands, where a GLS model is established given estimated assignment ratio; Krishnakumari et al. (2020) also uses count and speed data to estimate OD matrix under the mild assumption of proportional flow on shortest paths. Both studies showed satisfactory results and demonstrated that data-driven approaches are promising and efficient. Generally, with only aggregated traffic counts available, reliable priors and effective assumptions are indispensable to fill the observability gap. However, obtaining reliable initial OD matrices is generally difficult and labor-intensive, and the aforementioned assumptions could possibly deviate from realistic conditions.

In terms of AVI observations, several researchers have derived travel times and traffic counts from AVI detectors and have conducted OD flow estimation and prediction by integrating link counts with these observations (Dixon and Rilett, 2002, Zhou and Mahmassani, 2006). In addition to traffic counts, these detectors can reproduce partial paths of vehicles and thus provide further flow and travel time constraints to facilitate path and OD flow estimation. Following this direction, Bayesian methods have been adopted to recover paths (Castillo et al., 2008b, Castillo et al., 2008c, Mo et al., 2020), and state-space models, especially particle filtering (Feng et al., 2015, Rao et al., 2018, Yang and Sun, 2015), have been also introduced to probabilistically reconstruct the path of each individual vehicle. Subsequently, path and OD flows can be obtained by aggregation. Despite the promising results, these works require a large AVI coverage rate (e.g., 40–80%), which is rare occasion in a realistic network, especially large ones. In addition, reducing the uncertainties in the exact travel origin and destination is difficult using AVI-based OD flow estimation methods. Although travel paths can be effectively recovered between AVI detectors, the path from the origin to the first detected location (or last detected location to destination) can hardly be recognized, and route choice assumption is still necessary to address this issue.

CVs have recently facilitated many tasks in traffic modeling including the OD flow estimation. Compared with fixed detectors, which use indirect variables to estimate OD flow and suffer from the observability issue, CVs can nearly cover the entire network and provide direct OD flow samples. With such high-coverage, time-continuous OD samples, the focus of modeling shifts from reducing the uncertainties of unobservable OD flow to measuring the reliability of sampled OD flow (i.e., CV OD flow). Following this direction, the quantity and quality requirements of probe data for estimating population OD flows have been discussed and examined by some early studies based on several toy network examples (Eisenman and List, 2004, Van Aerde et al., 1993), and penetration rates of 10–30% have been regarded as sufficient. Moreover, several studies have investigated the route choices and trip distributions of probe vehicles and have demonstrated the feasibility of using projected probe OD as prior OD for estimation (Ásmundsdóttir, 2008, Ásmundsdóttir et al., 2010). Based on these insights, bi-level GLS models with exogenous DTA simulators have been employed to further incorporate probe vehicles or floating car trajectories (Cao et al., 2013, Carrese et al., 2017). In another benchmark study, Yang et al. (2017) formulated two single-level GLS models based on both probe vehicle trajectories and link counts. The route choices of probe vehicles were used to compute traffic assignment fractions, and the relationship between OD and link penetration rates was established; thus, there were few assumptions made in this model. Generally, these studies projected the CV OD flow as an estimate or prior according to presumed or derived penetration rates. However, few studies have explicitly considered the error of projected OD flow in the model. Thus, optimization models may be trapped in local minima and the solution may be inaccurate. Furthermore, these existing studies often assumed penetration rates of more than 10%, which is considered to be rare in the current CV market (Tan et al., 2019, Yao et al., 2019), and the estimation performance rapidly deteriorated with a decrease in penetration rate.

To summarize, with the availability of detailed and continuous observations, recent AVI-based and CV-based studies require less historical data and assumptions compared with link count-based studies. The superiority in estimation accuracy has also been demonstrated under certain observation conditions, which often deviates from the currently prevailing market status. However, two research problems still need to be addressed. The first problem concerns obtaining a reliable prior estimate, when there are limited available AVI detectors or CVs. Several studies have recognized the minimum level of AVI coverage rate or CV penetration rate; however, only a few have explicitly dealt with the accompanying problem of data sparsity. The second research problem concerns ensuring an optimal estimate according to the prior. Most existing methods rely on the DTA process to establish constraints, such as link flow conservation and travel time consistency. In this way, estimation errors are prone to increase because of either unrealistic assumption regarding user behavior or improper simplification of the DTA process.

In view of the first problem, we translated the limited observation problem into the reliability of the CV OD flow projection and problem of data sparsity imputation. Linear projection is extended based on fusion of AVI observations and CV trajectories to reduce the bias in the prevailing simple scaling, and a low-rank approximation method facilitated by the multi-dimensional tensor is adopted to deal with the sparsity problem in projected OD flow. To deal with the second research problem, we propose to robustly reconstruct prior OD flows via self-supervised learning. Based on the day-to-day traffic flow characteristics, we developed an adaptive correction algorithm to dynamically adjust the objective surface during optimization. Thus, the local minima could be avoided and the final estimates could be obtained without any theoretical assumptions. The proposed methodology is comprehensively examined on an empirical dataset from an urban arterial, a simulation dataset from a regional network, and a synthetic large-scale grid network. The proposed method exhibited satisfactory estimation accuracy, robustness to several influencing factors, and good scalability.

In general, the main contributions of this paper are three-fold:

(1)
A novel methodology for estimating day-to-day dynamic OD flow estimation fusing AVI observations and CV trajectories is proposed. Within this methodology, the characteristics of both data sources are effectively utilized.
(2)
Linear projection is extended to deal with variations in CV penetration rates, and non-negative Tucker decomposition (NTD) is applied to impute sparsity values in projected OD flows. Reliable prior OD flows are provided through the two steps even under limited observing conditions.
(3)
A self-supervised learning model called the latency-constrained autoencoder (LCAE) is established to search for the optimal solution based on the estimated prior. Meanwhile, an adaptive sub-sample correction (ASC) algorithm is proposed to incorporate day-to-day traffic flow characteristics to facilitate the optimization process.

Section snippets

Background and notations

Table 1 presents all the variables and notations used in this paper. Considering a road network $G$ specified by link set $A$ and OD pair set $RS$ , and an analysis period consisting of multiple consecutive days represented by $D$ , for each OD pair $rs \in R S$ , a path set $K_{rs}$ exists; for each day $d \in D$ , a number of identical time intervals are split and denoted by $I$ . Here, a special note should be given to the day-to-day context of this paper, as the evolutionary dynamics is out of concern and the focus is

Evaluation metrics

In this study, the estimation performance is indicated by four error metrics—mean absolute error (MAE), root mean square error (RMSE), mean absolute percentage error (MAPE), and mean square percentage error (MSPE). MAE and MAPE are prevalent indicators of the estimation performances for various tasks. RMSE and MSPE are also included in this study because they can better reveal the estimation performance on larger values (larger OD flows tend to be more relevant). Corresponding formulas are

Conclusion and future work

In this paper, we developed a novel methodology for estimating the dynamic OD flows under day-to-day context based on the fusion of CV trajectories and AVI observations. This method requires neither any external or historical prior information nor assumptions on route choice behavior and dynamic network loading process, and thus, it could be recognized as a generalizable method. In this methodology, two remaining research problems are solved: obtaining reliable prior OD flows given limited

CRediT authorship contribution statement

Yumin Cao: Conceptualization, Methodology, Validation, Visualization, Writing - original draft. Keshuang Tang: Writing - review & editing, Supervision, Funding acquisition. Jian Sun: Writing - review & editing, Supervision, Funding acquisition. Yangbeibei Ji: Writing - review & editing, Supervision.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgement

This research is jointly sponsored by the National Key Research and Development Program of China (2018YFB16005), the National Natural Science Foundation of China (61673302, U1764261), and the Shanghai Science and Technology Commission Fund Project (19DZ1208800). The authors would also like to thank the constructive comments of the three anonymous reviewers. Any opinions, findings and conclusions are the responsibility of the authors alone.

References (65)

D. Bachir et al.
Inferring dynamic origin-destination flows by transport mode using mobile phone data
Transport. Res. Part C: Emerg. Technol.
(2019)
M.G.H. Bell
The estimation of origin-destination matrices by constrained generalised least squares
Transport. Res. Part B: Methodol.
(1991)
S. Carrese et al.
Dynamic demand estimation and prediction for traffic urban networks adopting new data sources
Transport. Res. Part C: Emerg. Technol.
(2017)
E. Cascetta
Estimation of trip matrices from traffic counts and survey data: a generalized least squares estimator
Transport. Res. Part B: Methodol.
(1984)
E. Cascetta et al.
Quasi-dynamic estimation of o–d flows from traffic counts: formulation, statistical validation and performance analysis on real data
Transport. Res. Part B: Methodol.
(2013)
E. Castillo et al.
Trip matrix and path flow reconstruction and estimation based on plate scanning and link observations
Transport. Res. Part B: Methodol.
(2008)
X. Chen et al.
Spatial-temporal traffic speed patterns discovery and incomplete data recovery via SVD-combined tensor decomposition
Transport. Res. Part C: Emerg. Technol.
(2018)
Q. Ge et al.
Updating origin–destination matrices with aggregated data of GPS traces
Transport. Res. Part C: Emerg. Technol.
(2016)
P. Krishnakumari et al.
A data driven method for OD matrix estimation
Transport. Res. Part C: Emerg. Technol.
(2020)
C.-C. Lu et al.
Dynamic origin–destination demand flow estimation under congested traffic conditions
Transport. Res. Part C: Emerg. Technol.
(2013)

L. Lu et al.

An enhanced SPSA algorithm for the calibration of Dynamic Traffic Assignment models

Transport. Res. Part C: Emerg. Technol.

(2015)

W. Ma et al.

Estimating multi-class dynamic origin-destination demand through a forward-backward algorithm on computational graphs

Transport. Res. Part C: Emerg. Technol.

(2020)

W. Ma et al.

Estimating multi-year 24/7 origin-destination demand using high-granular multi-source traffic data

Transport. Res. Part C: Emerg. Technol.

(2018)

W. Ma et al.

Statistical inference of probabilistic origin-destination demand using day-to-day traffic data

Transport. Res. Part C: Emerg. Technol.

(2018)

M.J. Maher

Inferences on trip matrices from observations on link volumes: a Bayesian statistical approach

Transport. Res. Part B: Methodol.

(1983)

V. Marzano et al.

Limits and perspectives of effective O-D matrix correction using traffic counts

Transport. Res. Part C: Emerg. Technol.

(2009)

C. Osorio

High-dimensional offline origin-destination (OD) demand calibration for stochastic traffic simulators of large-scale road networks

Transport. Res. Part B: Methodol.

(2019)

W. Rao et al.

Origin-destination pattern estimation based on trajectory reconstruction using automatic license plate recognition data

Transport. Res. Part C: Emerg. Technol.

(2018)

H. Shao et al.

Estimation of mean and covariance of peak hour origin–destination demands from day-to-day traffic counts

Transport. Res. Part B: Methodol.

(2014)

W. Song et al.

Statistical metamodeling of dynamic network loading

Transport. Res. Part B: Methodol.

(2018)

H. Spiess

A maximum likelihood model for estimating origin-destination matrices

Transport. Res. Part B: Methodol.

(1987)

H.J. Van Zuylen et al.

The most likely trip matrix estimated from traffic counts

Transport. Res. Part B: Methodol.

(1980)

C. Wu et al.

Cellpath: Fusion of cellular and traffic sensor data for route flow estimation via convex optimization

Transport. Res. Part C: Emerg. Technol.

(2015)

X. Wu et al.

Hierarchical travel demand estimation using multiple data sources: A forward and backward propagation algorithmic framework on a layered computational graph

Transport. Res. Part C: Emerg. Technol.

(2018)

H. Yang et al.

An analysis of the reliability of an origin-destination trip matrix estimated from traffic counts

Transport. Res. Part B: Methodol.

(1991)

H. Yang et al.

Estimation of origin-destination matrices from link traffic counts on congested networks

Transport. Res. Part B: Methodol.

(1992)

J. Yang et al.

Vehicle path reconstruction using automatic vehicle identification data: An integrated particle filter and path flow estimator

Transport. Res. Part C: Emerg. Technol.

(2015)

Y. Yang et al.

Stochastic travel demand estimation: Improving network identifiability using multi-day observation sets

Transport. Res. Part B: Methodol.

(2018)

H. Zhang et al.

Missing data detection and imputation for urban ANPR system using an iterative tensor decomposition approach

Transport. Res. Part C: Emerg. Technol.

(2019)

J. Zheng et al.

Estimating traffic volumes for signalized intersections using connected vehicle data

Transport. Res. Part C: Emerg. Technol.

(2017)

T. Arsava et al.

OD-NETBAND: an approach for origin-destination based network progression band optimization

Transport. Res. Rec.: J. Transport. Res. Board

(2018)

R. Ásmundsdóttir

Dynamic OD Matrix Estimation using Floating Car Data (Msc)

(2008)

Cited by (28)

Simulation-based dynamic origin–destination matrix estimation on freeways: A Bayesian optimization approach
2023, Transportation Research Part E: Logistics and Transportation Review
This study focuses on dynamic origin–destination demand estimation problem on freeway networks. Existing studies on this problem rely on high-coverage of traffic measurements and assumptions on travel times, exhibiting limitations in real-world applications. We formulate the problem as a bi-level programming model, where micro-simulations are incorporated to precisely model traffic flows/travel times on freeways. The bi-level programming model cannot provide explicit closed-form expressions for the objective function and its derivatives, and also intrinsically high-dimensional. Thus, it is highly challenging to find efficient solution algorithms. In this regard, a problem-specific and computationally efficient Bayesian optimization approach is designed. Herein, a novel surrogate model is proposed by embedding a physical surrogate model (it characterizes underlying physical mechanisms and provides global yet less precise approximations) into a functional surrogate model (it provides precise local approximations). The embedding provides problem-specific knowledge for the surrogate model. More importantly, it also restricts the feasible region, enabling the surrogate model to efficiently deal with high-dimensional problems. Gaussian process can be served as the functional surrogate model. Two linear physical surrogate models are proposed to capture interactions between travel demand and traffic measurements. To deal with constraints in the surrogate model, a projection-distance based acquisition function is designed. In searching for new points, the proposed acquisition function is capable of assigning unique weight of exploration to each feasible solution. The proposed approach is validated based on a freeway corridor example, which indicates its outperformance over existing dynamic origin–destination estimation methods in terms of computational efficiency and solution accuracy.
Optimizing vehicle dynamics co-simulation performance by introducing mesoscopic traffic simulation
2023, Simulation Modelling Practice and Theory
Microscopic traffic simulation is often used in conjunction with vehicle dynamics simulation to test cooperative or perception-based driver assistance functions. On the other hand, visualization and the interaction of a large number of swarm vehicles is computationally burdensome, limiting the size of scenarios. To remedy this problem, this paper introduces a mesoscopic traffic model, namely the extension of the shockwave profile model to road networks, that handles traffic as continuous flows of vehicles. Adopting the idea of level of detail from computer graphics, the modeling of swarm vehicles is carried out in less detail farther away from the EGO vehicle. These levels are defined using the classical (macroscopic, mesoscopic, and microscopic) categorization of traffic modeling. The macroscopic traffic model is responsible for the traffic demand and traffic assignment. The proposed mesoscopic model is capable of capturing the fluctuating nature of traffic on lane level. Closer to the EGO vehicle microscopic traffic simulation is employed while the EGO vehicle is modeled in full-detail including vehicle dynamics too. The 3D rendering of the simulation is performed by the vehicle dynamics simulator. The challenge in the proposed methodology is transitioning between the mesoscopic and the microscopic models, i.e., selecting the boundary and spawning/destroying vehicle agents. The paper addresses this challenge with a linear (w.r.t. the vehicle number) time complexity algorithm. Practically, a dynamic downscaling of the microscopic simulation to mesoscopic level is realized outside the vicinity of the EGO vehicle. The proposed methodology is generic, and can be adapted to most existing vehicle dynamics and microscopic traffic simulator software. The solution is tested with SUMO as microsimulation and Carla as vehicle dynamics simulation through a simple path following test case in two scenarios: a congested residential area and a complex scenario with both a highway section and an urban area. Simulation results suggest that the simulation performance could be improved by 200–500% while retaining modeling accuracy, compared to the case when only microscopic simulation is used.
Dynamic origin–destination flow estimation for urban road network solely using probe vehicle trajectory data
2023, Journal of Intelligent Transportation Systems: Technology, Planning, and Operations
Dynamic origin–destination (OD) flow is a fundamental input for dynamic network models and simulators. Numerous studies have conducted dynamic OD estimations based on fixed detectors, where a high device coverage rate and data quality are often required to accomplish the desired results. Several existing methods have used probe vehicle trajectories as an additional data source, and generalized least squares (GLS) is commonly recognized as an effective framework. However, the prior matrices used in these models either came from historical data or data obtained by uniform scaling that neglected the variation in penetration rates and suffer from sparsity issues. Moreover, the microscopic information contained in the high-resolution probe vehicle trajectories has not been fully utilized. The possibility of estimating OD flows using only vehicle trajectories without external information is rarely discussed in current literature. Therefore, this paper introduces a dynamic OD flow estimation model solely using probe vehicle trajectories. In the proposed model, two methods based on probe OD pair distribution are proposed to infer prior OD flows. Then the GLS framework is extended by including link travel times as another objective term, and the solution algorithm is adapted to deal with uncertain priors. To validate the proposed model, extensive experiments were conducted on a simulation network. The results show that the proposed model could reliably estimate dynamic OD flows and showed superiority to two existing models. In sensitivity analysis concerning the penetration rate and degree of saturation, the proposed model presented satisfactory performance and could adapt to various conditions.
Signalized arterial origin-destination flow estimation using flawed vehicle trajectories: A self-supervised learning approach without ground truth
2022, Transportation Research Part C: Emerging Technologies
Citation Excerpt :
However, computing efficiency and tractability are challenging due to a large number of unknown parameters. With the increasing data availability, data-driven OD flow estimation has been investigated to address this problem, such as license plate recognition (LPR) data (Castillo et al., 2013; Chiou et al., 2011; Sun et al., 2011, Mo et al., 2020), automatic vehicle identification (AVI) data (Van Der Zijpp, 1997; Asakura et al., 2000; Dixon and Rilett, 2002; Antoniou et al., 2004; Dixon and Rilett, 2005; Zhou and Mahmassani, 2006; Chen et al., 2011; Hadavi and Shafahi, 2016; Cao et al., 2021), cellphone data (Sohn and Kim, 2008; Iqbal et al., 2014), and probe vehicle data (Matsumoto et al., 2005; Yamamoto et al., 2009; Asmundsdottir et al., 2010; Baek et al., 2010; Cao et al., 2013; Yang et al., 2017). The basic idea of those methods is to boost estimation accuracy by supplementing information which is unavailable before.
For alleviating arterial congestion, most control strategies provide progression for through and turning traffic. A prerequisite input is the arterial origin–destination (OD) flow pattern, which can be estimated based on connected vehicle (CV) trajectories. However, the existing estimation methods require the ground-truth historical OD flow, which is difficult to obtain. To address this issue, this paper develops a method to estimate real-time OD flow along a signalized arterial without ground truth. A model based on the Generative Adversarial Network (GAN) network is proposed, which incorporates long short-term memory (LSTM), attention mechanism, and convolutional neural network (CNN) to capture the temporal and spatial correlations between OD flow patterns. This model is trained with the proposed self-supervised without historical OD flow. The proposed model is extensively tested based on a realistic signalized arterial, and the results indicate sufficient accuracy for progression control.
Truncated tensor Schatten p-norm based approach for spatiotemporal traffic data imputation with complicated missing patterns
2022, Transportation Research Part C: Emerging Technologies
Rapid advances in sensor, wireless communication, cloud computing and data science have brought unprecedented amount of data to assist transportation engineers and researchers in making better decisions. However, traffic data in reality often has corrupted or incomplete values due to detector and communication malfunctions. Data imputation is thus required to ensure the effectiveness of downstream data-driven applications. To this end, numerous tensor-based methods treating the imputation problem as the low-rank tensor completion (LRTC) have been attempted in previous works. To tackle rank minimization, which is at the core of the LRTC, most of aforementioned methods utilize the tensor nuclear norm (NN) as a convex surrogate for the minimization. However, the over-relaxation issue in NN refrains it from desirable performance in practice. In this paper, we define an innovative nonconvex truncated Schatten $p$ -norm for tensors (TSpN) to approximate tensor rank and impute missing spatiotemporal traffic data under the LRTC framework. We model traffic data into a third-order tensor structure of time intervals $\times$ locations (sensors) $\times$ days and introduce four complicated missing patterns, including random missing and three fiber-like missing cases according to the tensor mode- $n$ fibers. Despite nonconvexity of the objective function in our model, we derive the global optimal solutions by integrating the alternating direction method of multipliers (ADMM) with generalized soft-thresholding (GST). In addition, we design a truncation rate decay strategy to deal with varying missing rate scenarios. Comprehensive experiments are finally conducted using four different types of real-world spatiotemporal traffic datasets, which demonstrate that the proposed LRTC-TSpN method performs well under various missing cases even with high rates of data loss, meanwhile outperforming other state-of-the-art tensor-based imputation models in almost all scenarios. In general, the performance of the proposed method achieves up to a 52% improvement in mean absolute error (MAE) compared to baseline methods even when the missing ratio is as high as 90%. Moreover, both theoretical and empirical evidences indicate that our method has robust and efficient convergence performances.
Reliable location of automatic vehicle identification sensors to recognize origin-destination demands considering sensor failure
2022, Transportation Research Part C: Emerging Technologies
Citation Excerpt :
Fu et al. (2019) investigated the traffic counting locations for minimizing the weighted maximum deviation of estimated mean and covariance of OD demands. Cao et al. (2021) proposes a novel method for estimating the dynamic OD demand based on the fusion of CV trajectories and AVI observations. This method consists of two models: the first provides reliable prior OD demands given limited observations, and the second seeks the optimal estimation based on the prior OD demands.
Locating automatic vehicle identification (AVI) sensors to recognize Origin-Destination (OD) demand has been attracting extensive attentions in academia and industry. Although most scholars determined OD demand after deriving unique route flow based on AVI sensors, this traditional method does not necessarily obtain the smallest number of sensors to ensure the uniqueness of OD demand. Moreover, the sensors can fail in reality, which results in missing some information of observed links, thus the uniqueness of OD demand cannot be guaranteed. In this paper, we propose a new AVI sensor location model considering sensor failures to ensure the uniqueness of OD demand directly, without determining route flows. Typically, given the observation order of AVI sensors, this method can minimize the number of sensors to determine OD demand uniquely, in the meanwhile satisfying certain reliability given sensors’ failure. Moreover, under budget constraints, we develop a sensor location model to estimate OD demand under sensor failures, which maximizes information value of the differentiated OD pairs. Then we design several greedy heuristic algorithms to solve these two sensor location problems. Through numerical experiments, we show that the proposed models and algorithms can effectively determine the AVI sensor locations to recognize the OD demand and its uniqueness in the event of uncertain sensor failures.

View all citing articles on Scopus

View full text