Using improved gradient-boosted decision tree algorithm based on Kalman filter (GBDT-KF) in time series prediction

Li, Ling; Dai, Sida; Cao, Zhiwei; Hong, Jinghui; Jiang, Shu; Yang, Kunmeng

doi:10.1007/s11227-019-03130-y

Published: 08 January 2020

Volume 76, pages 6887–6900, (2020)

The Journal of Supercomputing

Ling Li ORCID: orcid.org/0000-0002-8447-3205¹,
Sida Dai¹,
Zhiwei Cao¹,
Jinghui Hong²,
Shu Jiang³ &
Kunmeng Yang⁴

1542 Accesses
33 Citations

Abstract

In this study, we analyse two mobile phone activity datasets to predict the future traffic of mobile base stations in urban areas. The predicted time series can be used to reflect the trend of human activity flow. Although common methods such as recurrent neural networks and long short-term memory (LSTM) networks often achieve high precision, they have the drawback of being time-consuming. Because the original time series is noisy, and because the performance of GBDT usually degrades when it overfits the noise in the signal, we present an improved gradient-boosted decision tree algorithm based on the Kalman filter (GBDT-KF). According to our experiments, the RMSE between the predicted values of GBDT-KF and the ground truth is only 12–14% worse than that of the LSTM model, while the training time is reduced by a factor of more than 100; the proposed GBDT-KF algorithm thus makes a trade-off between precision and time complexity. By applying the results of our work, service providers could predict where and when network congestion will occur and therefore take action ahead of time. Such applications are especially useful in the era of 5G.
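The idea summarised in the abstract (filter the noisy series first, then fit boosted trees to the filtered signal) can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation, and everything in it is an assumption made for illustration: the scalar random-walk state model and its variances, the use of one-split stumps instead of full decision trees, the three lag features, the function names, and the synthetic sine series standing in for base-station traffic.

```python
import math
import random


def kalman_filter(series, process_var=0.5, measurement_var=4.0):
    """Scalar Kalman filter with a random-walk state model (an assumed model)."""
    estimate, error_var = series[0], 1.0
    smoothed = []
    for z in series:
        error_var += process_var                          # predict: uncertainty grows
        gain = error_var / (error_var + measurement_var)  # Kalman gain
        estimate += gain * (z - estimate)                 # correct with measurement z
        error_var *= 1.0 - gain
        smoothed.append(estimate)
    return smoothed


def rmse(predicted, actual):
    return math.sqrt(sum((p - a) ** 2 for p, a in zip(predicted, actual)) / len(actual))


def fit_stump(rows, residuals):
    """Find the one-split regression tree that minimises squared error."""
    best = None
    for j in range(len(rows[0])):
        for threshold in sorted({row[j] for row in rows}):
            left = [r for row, r in zip(rows, residuals) if row[j] <= threshold]
            right = [r for row, r in zip(rows, residuals) if row[j] > threshold]
            if not left or not right:
                continue
            left_mean, right_mean = sum(left) / len(left), sum(right) / len(right)
            sse = (sum((r - left_mean) ** 2 for r in left)
                   + sum((r - right_mean) ** 2 for r in right))
            if best is None or sse < best[0]:
                best = (sse, j, threshold, left_mean, right_mean)
    return best[1:]


def predict_stump(stump, row):
    j, threshold, left_mean, right_mean = stump
    return left_mean if row[j] <= threshold else right_mean


def fit_gbdt(rows, targets, n_rounds=20, learning_rate=0.3):
    """Gradient boosting for squared loss: each stump fits the current residuals."""
    base = sum(targets) / len(targets)
    predictions = [base] * len(targets)
    stumps = []
    for _ in range(n_rounds):
        residuals = [t - p for t, p in zip(targets, predictions)]
        stump = fit_stump(rows, residuals)
        stumps.append(stump)
        predictions = [p + learning_rate * predict_stump(stump, row)
                       for p, row in zip(predictions, rows)]
    return base, stumps, learning_rate


def predict_gbdt(model, row):
    base, stumps, learning_rate = model
    return base + learning_rate * sum(predict_stump(s, row) for s in stumps)


def lag_features(series, n_lags=3):
    """Use the previous n_lags values as features for predicting the next value."""
    rows = [series[i - n_lags:i] for i in range(n_lags, len(series))]
    return rows, series[n_lags:]


# Synthetic stand-in for traffic data: a slow sine wave plus Gaussian noise.
random.seed(0)
clean = [10.0 * math.sin(2 * math.pi * t / 100) for t in range(200)]
noisy = [c + random.gauss(0.0, 2.0) for c in clean]
smoothed = kalman_filter(noisy)

rows, targets = lag_features(smoothed)
model = fit_gbdt(rows, targets)
fitted = [predict_gbdt(model, row) for row in rows]
baseline = [sum(targets) / len(targets)] * len(targets)
```

Comparing `rmse(smoothed, clean)` with `rmse(noisy, clean)` shows how much noise the filter removes on this toy series, and `rmse(fitted, targets)` gives the training error of the boosted model. The filter variances would need tuning for real traffic data, and a held-out split would be needed for any comparison with the RMSE figures quoted in the abstract.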

Acknowledgements

This work was supported in part by a grant from a foundation project of the Science and Technology Department of Jilin Province (Grant No. 20170101140JC).

Author information

Authors and Affiliations

Department of Communication Engineering, Jilin University, Changchun, 130012, China
Ling Li, Sida Dai & Zhiwei Cao
Northeast Normal University, Changchun, 130032, China
Jinghui Hong
State Grid Jilin Changchun Power Supply Company, Changchun, 130012, China
Shu Jiang
High School Attached to Northeast Normal University, Changchun, 130032, China
Kunmeng Yang

Corresponding author

Correspondence to Ling Li.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Cite this article

Li, L., Dai, S., Cao, Z. et al. Using improved gradient-boosted decision tree algorithm based on Kalman filter (GBDT-KF) in time series prediction. J Supercomput 76, 6887–6900 (2020). https://doi.org/10.1007/s11227-019-03130-y

Issue Date: September 2020
DOI: https://doi.org/10.1007/s11227-019-03130-y
