Abstract
The ARIMA model is widely adopted by the financial industry as the standard statistical instrument for forecasting asset returns. Numerous studies have compared the accuracy of the ARIMA model with other competing models. However, there are no studies that cover a broad range of equities and their time series. Furthermore, there is no clear guideline on the time series window selected to fit the ARIMA model. In addition, there are no firm conclusions on whether older information in the sample should be abandoned. This makes it impossible to draw a definitive conclusion about the predictive power of the ARIMA model. This study sets out to address this gap in the literature. It summarizes more than two million ARIMA forecasts of future daily returns, using data from January 3, 1996 to May 12, 2017. The forecasts are run with different model parameter settings. We find that the five-year sliding fixed-width window fits US equity market asset prices to the highest degree, with an annual over-optimistic error of 2.6561%. However, when environments with positive and negative returns are separated, the ARIMA models generate forecasting errors of − 0.0009% and 0.011%, and both underestimate gain and loss. These errors are lower for low volatility equities. We conclude that the lack of nonlinearity of the ARIMA model is not a major concern, and that the ARIMA models do not lose their validity if the data windows are carefully selected. Our conclusions are not in conflict with the weak form market efficiency hypothesis and are robust in an environment with transaction cost.
Similar content being viewed by others
References
Adebiyi, A.A., A.O. Adewumi, and C.K. Ayo. 2014. Comparison of ARIMA and artificial neural networks models for stock price prediction. Journal of Applied Mathematics. https://doi.org/10.1155/2014/614342.
Aho, K., D. Derryberry, and T. Peterson. 2014. Model selection for ecologists: the worldviews of AIC and BIC. Ecology 95: 631–636. https://doi.org/10.1890/13-1452.1.
Akaike, H. 1973. Information theory and an extension of the maximum likelihood principle. In 2nd International Symposium on Information Theory, ed. B.N. Petrov and F. Csáki, Armenia: Tsahkadsor, USSR. 267–281. https://doi.org/10.1007/978-1-4612-1694-0_15.
Babu, A.M., and S.K. Reddy. 2015a. Exchange rate forecasting using ARIMA, neural network and fuzzy neuron. Journal of Stock & Forex Trading 4(3): 1–5. https://doi.org/10.4172/2168-9458.1000155.
Babu, C.N., and B.E. Reddy. 2015b. Prediction of selected Indian stock using a partitioning–interpolation based ARIMA–GARCH model. Applied Computing and Informatics 11(2): 130–143. https://doi.org/10.1016/j.aci.2014.09.002.
Beckers, S., and B. Blair. 2002. Non-parametric forecasting for conditional asset allocation. Journal of Asset Management 3(3): 213–228. https://doi.org/10.1057/palgrave.jam.2240076.
Bollerslev, H., and O. Mikkelsen. 1996. Modeling and pricing long memory in stock market volatility. Journal of Econometrics 73(1): 151–184. https://doi.org/10.1016/0304-4076(95)01736-4.
Brockwell, P.J., and R.A. Davis. 1991. Time Series: Theory and Methods, 2nd ed. New York: Springer.
de Santos, Júnior O., J.F. de Oliveira, and P.S. de Mattos Neto. 2019. An intelligent hybridization of ARIMA with machine learning models for time series forecasting. Knowledge-Based Systems. https://doi.org/10.1016/j.knosys.2019.03.011. Forthcoming.
Gomez, V. 1998. Automatic model identification in the presence of missing observations and outliers. Working paper, Ministerio de Economia y Hacienda.
Gomez, V., and Maravall, A. 1998. Programs TRAMO and SEATS, instructions for the users. Working paper, Ministerio de Economia y Hacienda.
Hannan, E.J., and J. Rissanen. 1982. Recursive estimation of mixed Autoregressive-Moving Average Order. Biometrika 69(1): 81–94. https://doi.org/10.2307/2335856.
Hassan, M.R., B. Nath, and M. Kirley. 2007. A fusion model of HMM, ANN and GA for stock market forecasting. Expert Systems with Applications 33(1): 171–180. https://doi.org/10.1016/j.eswa.2006.04.007.
Hyndman, R.J., and Y. Khandakar. 2008. Automatic time series forecasting: the forecast package for R. Journal of Statistical Software 27(3): 1–22. https://doi.org/10.18637/jss.v027.i03.
Ighravwe, D.E., and C.O. Anyaeche. 2019. A comparison of ARIMA and ANN techniques in predicting port productivity and berth effectiveness. International Journal of Data and Network Science 3(1): 13–22. https://doi.org/10.5267/j.ijdns.2018.11.003.
Karabiber, O.A., and G. Xydis. 2019. Electricity price forecasting in the Danish day-ahead market using the TBATS, ANN and ARIMA methods. Energies 12(5): 928–957. https://doi.org/10.3390/en12050928.
Khashei, M., and M. Bijari. 2010. An artificial neural network (p, d, q) model for timeseries forecasting. Expert Systems with Applications 37(1): 479–489. https://doi.org/10.1016/j.eswa.2009.05.044.
Li, M., S. Ji, and G. Liu. 2018. Forecasting of Chinese E-commerce sales: an empirical comparison of ARIMA, nonlinear autoregressive neural network, and a combined ARIMA–NARNN Model. Mathematical Problems in Engineering. https://doi.org/10.1155/2018/6924960.
Matyjaszek, M., P. Riesgo Fernández, A. Krzemień, K. Wodarski, and G. Fidalgo Valverde. 2019. Forecasting coking coal prices by means of ARIMA models and neural networks, considering the transgenic time series theory. Resources Policy 61: 283–292. https://doi.org/10.1016/j.resourpol.2019.02.017.
Ogneva, D.S., and D.Y. Golembiovskii. 2018. Option pricing with ARIMA–GARCH models of underlying asset returns. Computational Mathematics & Modeling 29(4): 461–473. https://doi.org/10.1007/s10598-018-9425-2.
Pai, P.-F., and C.-S. Lin. 2005. A hybrid ARIMA and support vector machines model in stock price forecasting. Omega 33(6): 497–505. https://doi.org/10.1016/j.omega.2004.07.024.
Sánchez Lasheras, F., F. de Cos Juez, A. Suárez Sánchez, A. Krzemień, and P. Riesgo Fernández. 2015. Forecasting the COMEX copper spot price by means of neural networks and ARIMA models. Resources Policy 45: 37–43. https://doi.org/10.1016/j.resourpol.2015.03.004.
Schwarz, G.E. 1978. Estimating the dimension of a model. Annals of Statistics 6(2): 461–464. https://doi.org/10.1214/aos/1176344136.
Shen, W., X. Guo, C. Wu, and D. Wu. 2011. Forecasting stock indices using radial basis function neural networks optimized by artificial fish swarm algorithm. Knowledge-Based Systems 24(3): 378–385. https://doi.org/10.1016/j.knosys.2010.11.001.
Wang, J.-J., J.-Z. Wang, Z.-G. Zhang, and S.-P. Guo. 2012. Stock index forecasting based on a hybrid model. Omega 40(6): 758–766. https://doi.org/10.1016/j.omega.2011.07.008.
Wang, Q., S. Li, R. Li, and M. Ma. 2018. Forecasting U.S. shale gas monthly production using a hybrid ARIMA and metabolic nonlinear grey model. Energy 160: 378–387. https://doi.org/10.1016/j.energy.2018.07.047.
Wit, E., E. van den Heuvel, and J.-W. Romeyn. 2012. ‘All models are wrong…’: an introduction to model uncertainty. Statistica Neerlandica 66(3): 217–236. https://doi.org/10.1111/j.1467-9574.2012.00530.x.
Yeh, C.-Y., C.-W. Huang, and S.-J. Lee. 2011. A multiple-kernel support vector regression approach for stock market price forecasting. Expert Systems with Applications 38(3): 2177–2186. https://doi.org/10.1016/j.eswa.2010.08.004.
Yıldıran, C.U., and A. Fettahoğlu. 2017. Forecasting USDTRY rate by ARIMA method. Cogent Economics & Finance 5: 1–11.
Funding
Funding was provided by Kate Tiedemann School of Business and Finance, University of South Florida.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix
Appendix
The first part of the Appendix presents the facts of the randomly selected companies and their stocks to conduct the ARIMA forecast accuracy measure. The purpose of this list is to demonstrate that our study covers a wide range of companies, regarding their sector, industry, market capitalization, systematic risk, and listing agents (Table 6).
The second part of the Appendix presents the detailed ARIMA forecast errors for the individual assets categorized in different volatility groups. The errors are presented in the order of growing window sample, one-year sliding window, two-year sliding window, three-year sliding window, and five-year sliding window. The error measures included are:
“Positive”, or “P”: ARIMA forecast deviation of the asset on the trading days when positive returns are realized;
“Negative”, or “N”: ARIMA forecast deviation of the asset on the trading days when negative returns are realized;
“Total”, or “T”: ARIMA forecast deviation of the asset with both positive and negative return environments;
“Skewness”, or “S”: The skewness of ARIMA forecast deviation of the asset;
“Kurtosis”, or “K”: The kurtosis of ARIMA forecast deviation of the asset (Tables 7, 8, 9);
Rights and permissions
About this article
Cite this article
Dong, H., Guo, X., Reichgelt, H. et al. Predictive power of ARIMA models in forecasting equity returns: a sliding window method. J Asset Manag 21, 549–566 (2020). https://doi.org/10.1057/s41260-020-00184-z
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1057/s41260-020-00184-z