Short-term prediction of passenger volume for urban rail systems: A deep learning approach based on smart-card data

https://doi.org/10.1016/j.ijpe.2020.107920Get rights and content

Abstract

Short-term prediction of passenger volume is a complex but critical task to urban rail companies, which desire prediction methods with high accuracy, time efficiency and good practicality. Good prediction results of the outbound passenger volume at urban rail stations are important to the organization of passenger flow, and helpful to the arrangement of shuttles, especially in large transit junctions. The application of automatic fare collection (AFC) devices in urban rail transit systems helps to collect large amounts of historical data of completed journeys, which can be used by metro operators to construct a database of the urban rail passenger volume. Based on deep learning techniques and big data, this paper develops an improved spatiotemporal long short-term memory model (Sp-LSTM) to forecast short-term outbound passenger volume at urban rail stations. The proposed model predicts the outbound passenger volume on the basis of the historical data of the spatial-temporal passenger volume, station origin–destination (OD) matrix and the operation data of the rail transit network. Finally, based on actual data of the Beijing Metro Airport Line, a case study is carried out to compare the proposed Sp-LSTM with other prediction methods, i.e., the general long short-term memory model (LSTM), the autoregressive integrated moving average model (ARIMA), and the non-linear autoregressive neural network model (NAR), and the results show that the proposed method outperforms the others.

Introduction

The construction of urban rail transits is in fast development in the beginning of the twenty first century of China. Urban railway is a safe and sustainable transportation mode with large capacity, thus plays an increasingly important role in modern cities (Kai et al., 2016; Yang et al., 2019). In China, by the end of 2018, up to 185 urban railway lines were put into operation in 35 cities, with a total mileage of 5761.4 km. The total mileage of urban railways in operation in megalopolises such as Beijing and Shanghai exceed 700 km. The instantaneous passenger volume at urban rail transit stations can be overwhelming during peak hours, under such circumstances, the problem of emergency safety of every passenger must be guaranteed in consideration of the closeness of stations (Lam et al., 1999). Therefore, good prediction results of short-term outbound passenger volume at stations not only help to organize passengers thus keep them safe, but also contribute to better arrangement of shuttles, especially in large transit hubs. Moreover, accurate short-term prediction is conducive to the appropriate staffing in stations, based on which the metro operators can organize and guide passengers appropriately in advance to avert accidents that can harm mass passengers (Hu, 2011; Li and Lo, 2014a; Yin et al., 2019). Accurate short-term prediction can also help metro operators to design efficient train timetable, and make effective operation plans, thus the travel demands of large amounts of citizens can be satisfied and the overall efficiency of the urban rail transit system can be improved (Xue et al., 2019). This paper focus on the study of the prediction of short-term outbound passenger volume at urban rail transit stations, which proposes an improved spatiotemporal long short-term memory model (Sp-LSTM) aiming to improve prediction performance and provide rational suggestions to metro operators for better management of the railway system.

Passenger demands change frequently in daily operations, which necessitate a prediction model that is robust and flexible enough (Li and Lo, 2014b). The prediction of short-term passenger volume is to estimate the passenger volume in the near future, based on the information given by the historical data. In the literature, the time span of the short-term prediction of passenger volume varies from 5 min to one week. The problem to determine proper prediction time scale is critical to the focused problem in this paper, which directly affects the accuracy of the prediction results. The time spans of prediction in this paper are selected as 15 min, 30 min and 60 min, and the prediction results under different choices of prediction time scales are discussed. Due to the application of automatic fare collection (AFC) devices in rail transit systems, metro operators are able to obtain a substantial amount of smart card data, enabling the construction of a large database that includes station inbound/outbound passenger volume, and a station origin–destination matrix (Reddy et al., 2009; Wang et al., 2019). At present, deep learning technologies, which have good abilities for learning and generalization, provide good theoretical methods to tackle large-scale data (Najafabadi et al., 2015; Qiu et al., 2016).

The inbound and outbound passenger volume at stations are measured at a fixed time interval. The long short-term memory model (LSTM) based on deep learning takes sequential data of passenger volume as input, and the output is influenced by the input and output of the model in previous time periods. Such a structure makes it possible to have a certain memory of the processed information (Gers et al., 1999). The LSTM is especially well adapted for learning time series data with long-term dependence and has been widely used in the prediction of passenger volume. However, a notable disadvantage of LSTM is that it pays close attention to data timeliness, but pays little attention to the spatiality of the data, which would greatly affect the prediction performance. Therefore, this paper proposes an improved LSTM (i.e., Sp-LSTM) based on spatiotemporal passenger volume to predict short-term passenger volume.

The remainder of this paper is organized as follows. In Section 2, we review the relevant literature and provide the problem statement. In Section 3, we elaborate on the LSTM and Sp-LSTM. Based on real-world passenger and operation data from Beijing Metro Airport Line, we conduct a case study in section 4. Section 5 concludes the work.

Section snippets

Literature review

The prediction of short-term passenger flow has been widely studied. Initially, models for short-term prediction were mainly based on various classical linear prediction models based on statistical theory. Ahmed and Cook (1979) presented the application of the autoregressive integrated moving average (ARIMA) model in the short-term traffic flow prediction of freeways. Williams et al. (1998) applied seasonal autoregressive integrated moving average (SARIMA) and Winters exponential smoothing

Problem statement

The data of inbound/outbound passenger volume at stations can be collected from each independent AFC terminal every 15 min. Taking Beijing Metro as an example, the data of passenger volume is collected in such a format as shown in Table 2.

We can use a one-dimensional time series to describe the station inbound/outbound passenger volume:Fs={fths,fth+1s,ftis,,ft1s,fts}where fts denotes the inbound/outbound passenger volume of station s during the t-th time slot.

The data related to the

Methodology

Deep learning is a deep-level machine learning method based on artificial neural networks, which can be used as an effective tool to analyze and process large amounts of data. Deep learning has been successfully applied in many fields such as computer vision, speech recognition, image recognition and natural language processing. In recent years, many researchers have achieved great results in ridership prediction in transportation systems by applying deep learning techniques. In this section,

Data analysis

Case studies are carried out based on the real data of passenger volume on Beijing Metro Airport Line. The historical data of the station inbound/outbound passenger volume are provided by Beijing Mass Transit Railway Operation Corp., Ltd. The time resolution of the recorded time stamps is 15 min. Each day is composed of 67 stamps, from 05:45 until 22:15.

Beijing Metro Airport Line (termed as Airport Line thereinafter) connects urban districts with Beijing Capital International Airport, which has

Conclusion

This study developed an improved spatiotemporal long short-term memory model (Sp-LSTM) to predict short-term ridership of urban rail transit. Both the space and time characteristics of the passenger flow are investigated in the formulation of the model. The inputs of Sp-LSTM are the historical inbound/outbound passenger volume of all stations, the OD matrix and some of the operation data, and the output is the predicted outbound passenger volume at the station in consideration. Based on the

Acknowledgments

This work was supported by the National Key R&D Program of China (No. 2020YFB1601020), the National Natural Science Foundation of China (Nos. 71701013, 71890972/71890970, 71621001), the Beijing Municipal Natural Science Foundation (No. L191024), and the State Key Laboratory of Rail Traffic Control and Safety (No. RCS2019ZZ001).

References (43)

  • W. Puchalsky et al.

    Agribusiness time series forecasting using Wavelet neural networks and metaheuristic optimization: an analysis of the soybean sack price and perishable products demand

    Int. J. Prod. Econ.

    (2018)
  • Y. Sun et al.

    A novel wavelet-SVM short-time passenger flow prediction in Beijing subway system

    Neurocomputing

    (2015)
  • T.H. Tsai et al.

    Neural network based temporal feature models for short-term railway passenger demand forecasting

    Expert Syst. Appl.

    (2009)
  • J. Wang et al.

    New Bayesian combination method for short-term traffic flow forecasting

    Transport. Res. C Emerg. Technol.

    (2014)
  • Y. Wei et al.

    Forecasting the short-term metro passenger flow with empirical mode decomposition and neural networks

    Transport. Res. C Emerg. Technol.

    (2012)
  • X. Yang et al.

    Recognizing the critical stations in urban rail networks: an analysis method based on the smart-card data

    IEEE Intelligent Transportation Systems Magazine

    (2019)
  • H. Yin et al.

    Optimizing the release of passenger flow guidance information in urban rail transit network via agent-based simulation

    Appl. Math. Model.

    (2019)
  • S.Z. Zhao et al.

    A new approach to the prediction of passenger flow in a transit system

    Comput. Math. Appl.

    (2011)
  • M.S. Ahmed et al.

    Analysis of Freeway Traffic Time-Series Data by Using Box-Jenkins Techniques (No. 722)

    (1979)
  • H. Chang et al.

    Dynamic near-term traffic flow prediction: system-oriented approach based on past experiences

    IET Intell. Transp. Syst.

    (2012)
  • Y. Chen et al.

    Prediction of shanghai metro line 16 passenger flow based on time series analysis

    Oper. Res. Fuzziol.

    (2016)
  • Cited by (71)

    • Resilience analysis of an urban rail transit for the passenger travel service

      2024, Transportation Research Part D: Transport and Environment
    View all citing articles on Scopus
    View full text