Abstract
In this study, we developed and evaluated a hybrid framework for reservoir inflow forecast. This framework is unprecedented, which integrates new quasi-globally available observation-, satellite-, or model-based datasets using machine learing models to forecast inflow at the local scale. Under this framework, we compared random forests, gradient boosting machine, extreme learning machine, M5-cubist, elastic net, as well as their multi-model ensemble using Bayesian model averaging (BMA), and tested contributions from different input datasets, including retrospective forecast (reforecast) from florecast oriented low ocean resolution (FLOR) dynamic climate model, satellite-based hydrologic products, and climate phenomenon information. The performance was evaluated using Kling–Gupta efficiency (KGE) and correlation coefficient (R) in two headwater reservoirs, Harris reservoir in the humid Alabama–Coosa–Tallapoosa river basin and the Navajo reservoir in the arid Upper Colorado River Basin (UCRB). The results showed that for the Harris reservoir, the BMA combining five models with antecedent inflow and satellite-based hydrologic information as model inputs provided the best performance (KGE = 0.66, R = 0.76). For the Navajo reservoir, the gradient boosting machine model with all variables combined as input showed the best performance (KGE = 0.76, R = 0.83). Satellite-based soil moisture and evaporation consistently showed significant contributions to the inflow forecast. Benefits from climate indices and FLOR reforecast varied by locations, with more benefits coming from climate indices than FLOR potential evaporation reforecast at the Navajo reservoir in UCRB. Given the global coverage of the model inputs, our approach can be potentially applicable to improve reservoir inflow forecasts in different regions of the world.
Similar content being viewed by others
Availability of data and materials
Data used in this study are publicly available as indicated in the article and from authors.
Code availability
Computer codes are available from authors.
References
Alfieri L, Burek P, Dutra E, Krzeminski B, Muraro D, Thielen J, Pappenberger F (2013) GloFAS-global ensemble streamflow forecasting and flood early warning. Hydrol Earth Syst Sci 17(3):1161
Anghileri D, Voisin N, Castelletti A, Pianosi F, Nijssen B, Lettenmaier DP (2016) Value of long-term streamflow forecasts to reservoir operations for water supply in snow-dominated river catchments. Water Resour Res 52:4209–4225
Anghileri D, Monhart S, Zhou C, Bogner K, Castelletti A, Burlando P, Zappa M (2019) The value of subseasonal hydrometeorological forecasts to hydropower operations: How much does preprocessing matter? Water Resour Res 55(12):10159–10178
Asefa T, Kemblowski M, McKee M, Khalil A (2006) Multi-time scale streamflow predictions: the support vector machines approach. J Hydrol 318(1):7–16
Baugh C, de Rosnay P, Lawrence H, Jurlina T, Drusch M, Zsoter E, Prudhomme C (2020) The impact of SMOS soil moisture data assimilation within the Operational Global Flood Awareness System (GloFAS). Remote Sens 12(9):1490
Bennett JC, Wang QJ, Li M, Robertson DE, Schepen A (2016) Reliable long-range ensemble streamflow forecasts: combining calibrated climate forecasts with a conceptual runoff model and a staged error model. Water Resour Res 52(10):8238–8259
Biecek P (2018) DALEX: explainers for complex predictive models in R. J Mach Learn Res 19(84):1–5
Breiman L (2001) Random forests. Mach Learn 45(1):5–32
Chen X, Hao Z, Devineni N, Lall U (2014) Climate information based streamflow and rainfall forecasts for Huai River basin using hierarchical Bayesian modeling. Hydrol Earth Syst Sci 18(4):1539
Cloke HL, Pappenberger F (2008) Evaluating forecasts of extreme events for hydrological applications: an approach for screening unfamiliar performance measures. Meteorol Appl 15(1):181–197
Day GN (1985) Extended streamflow forecasting using NWSRFS. J Water Resour Plan Manag 111(2):157–170
De Viron O, Dickey JO, Ghil M (2013) Global modes of climate variability. Geophys Res Lett 40(9):1832–1837
Deo RC, Sahin M (2016) An extreme learning machine model for the simulation of monthly mean streamflow water level in eastern Queensland. Environ Monit Assess 188(2):90
Devineni N, Sankarasubramanian A (2010) Improving the prediction of winter precipitation and temperature over the continental United States: role of the ENSO state in developing multimodel combinations. Monthly Weather Rev 138(6):2447–2468
Emerton R, Zsoter E, Arnal L, Cloke HL, Muraro D, Prudhomme C, Stephens EM, Salamon P, Pappenberger F (2018) Developing a global operational seasonal hydro-meteorological forecasting system: GloFAS-Seasonal v1.0. Geosci Model Dev 11:3327–3346
Erdal HI, Karakurt O (2013) Advancing monthly streamflow prediction accuracy of CART models using ensemble learning paradigms. J Hydrol 477:119–128
Freund Y, Schapire RE (1997) A decision-theoretic generalization of on-line learning and an application to boosting. J Comput Syst Sci 55(1):119–139
Freund Y, Schapire R, Abe N (1999) A short introduction to boosting. J Jpn Soc Artif Intell 14(771–780):1612
Friedman JH (2001) Greedy function approximation: a gradient boosting machine. Ann Stat 29(5):1189–1232
Friedman JH (2002) Stochastic gradient boosting. Comput Stat Data Anal 38(4):367–378
Friedman JHT, Hastie RT (2000) Additive logistic regression: a statistical view of boosting. Ann Stat 28(2):337–374
Funk C, Peterson P, Landsfeld M, Pedreros D, Verdin J, Shukla S, Husak G, Rowland J, Harrison L, Hoell A, Michaelsen J (2015) The climate hazards infrared precipitation with stations—a new environmental record for monitoring extremes. Sci Data 2(1):1–21
Galelli S, Castelletti A (2013) Tree-based iterative input variable selection for hydrological modeling. Water Resour Res 49(7):4295–4310
Giuliani M, Zaniolo M, Castelletti A, Davoli G, Block P (2019) Detecting the state of the climate system via artificial intelligence to improve seasonal forecasts and inform reservoir operations. Water Resour Res. https://doi.org/10.1029/2019WR025035
Gneiting T, Raftery AE, Westveld AH III, Goldman T (2005) Calibrated probabilistic forecasting using ensemble model output statistics and minimum CRPS estimation. Mon Weather Rev 133(5):1098–1118
Grantz K, Rajagopalan B, Clark M, Zagona E (2005) A technique for incorporating large-scale climate information in basin-scale ensemble streamflow forecasts. Water Resour Res 41(10):W10410
Gupta HV, Kling H, Yilmaz KK, Martinez GF (2009) Decomposition of the mean squared error and NSE performance criteria: Implications for improving hydrological modelling. J Hydrol 377(1–2):80–91
Hamlet AF, Lettenmaier DP (1999) Columbia River streamflow forecasting based on ENSO and PDO climate signals. J Water Resour Plan Manag 125(6):333–341
Hargreaves GH (1994) Defining and using reference evapotranspiration. J Irrig Drain Eng 120:1132–1139
Harrigan S, Prudhomme C, Parry S, Smith K, Tanguy M (2018) Benchmarking ensemble streamflow prediction skill in the UK. Hydrol Earth Syst Sci 22:2018
Hastie T, Tibshirani R, Friedman J (2009) The elements of statistical learning: data mining, inference, and prediction. Springer Science & Business Media
Hautaniemi S, Kharait S, Iwabu A, Wells A, Lauffenburger DA (2005) Modeling of signal–response cascades using decision tree analysis. Bioinformatics 21(9):2027–2035
He X, Zhao T, Yang D (2013) Prediction of monthly inflow to the Danjiangkou reservoir based on distributed hydrological model and hydro-climatic teleconnections. J Hydroelectr Eng 32(3):4–9
He X, Chaney NW, Schleiss M, Sheffield J (2016) Spatial downscaling of precipitation using adaptable random forests. Water Resour Res 52(10):8217–8237
Higgins RW, Leetmaa A, Kousky VE (2002) Relationships between climate variability and winter temperature extremes in the United States. J Clim 15(13):1555–1572
Huang GB, Zhu QY, Siew CK (2006) Extreme learning machine: theory and applications. Neurocomputing 70(1):489–501
Huang GB, Zhou H, Ding X, Zhang R (2012) Extreme learning machine for regression and multiclass classification. IEEE Trans Syst Man Cybern Part B (Cybern) 42(2):513–529
Jia L, Yang X, Vecchi GA, Gudgel RG, Delworth TL, Rosati A, Stern WF, Wittenberg AT, Krishnamurthy L, Zhang S, Msadek R (2015) Improved seasonal prediction of temperature and precipitation over land in a high-resolution GFDL climate model. J Clim 28(5):2044–2062
Kalra A, Ahmad S (2012) Estimating annual precipitation for the Colorado River Basin using oceanic-atmospheric oscillations. Water Resour Res 48(6):2012
Kalra A, Miller WP, Lamb KW, Ahmad S, Piechota T (2013) Using large-scale climatic patterns for improving long lead time streamflow forecasts for Gunnison and San Juan River Basins. Hydrol Process 27:1543–1559. https://doi.org/10.1002/hyp.9236
Kirtman BP, Min D, Infanti JM, Kinter JL III, Paolino DA, Zhang Q, Van Den Dool H, Saha S, Mendez MP, Becker E, Peng P (2014) The North American multimodel ensemble: phase-1 seasonal-to-interannual prediction; phase-2 toward developing intraseasonal prediction. Bull Am Meteor Soc 95(4):585–601
Knoben WJ, Freer JE, Woods RA (2019) Inherent benchmark or not? Comparing nash-sutcliffe and Kling–Gupta efficiency scores. Hydrol Earth Syst Sci 23(10):4323–4331
Koster RD, Mahanama SP, Livneh B, Lettenmaier DP, Reichle RH (2010) Skill in streamflow forecasts derived from large-scale estimates of soil moisture and snow. Nat Geosci 3(9):613–616
Kuhn M (2012) Variable importance using the caret package. J Stat Softw 28:1–26
Kuhn M, Johnson K (2013) Applied predictive modeling, vol 26. Springer, New York
Li B, Yang G, Wan R, Dai X, Zhang Y (2016) Comparison of random forests and other statistical methods for the prediction of reservoir water level: a case study of the Poyang Reservoir in China. Hydrol Res 47(S1):69–83
Lima AR, Cannon AJ, Hsieh WW (2016) Forecasting daily streamflow using online sequential extreme learning machines. J Hydrol 537:431–443
Liu Z, Zhou P, Chen X, Guan Y (2015) A multivariate conditional model for streamflow prediction and spatial precipitation refinement. J Geophys Res Atmos 120(19):10116–10129
Maleski JJ, Martinez CJ (2018) Coupled impacts of ENSO AMO and PDO on temperature and precipitation in the Alabama–Coosa–Tallapoosa and Apalachicola–Chattahoochee–Flint river basins. Int J Climatol 38:e717–e728
Martens B, Miralles DG, Lievens H, van der Schalie R, de Jeu RAM, Fernández-Prieto D, Beck HE, Dorigo WA, Verhoest NEC (2017) GLEAM v3: satellite-based land evaporation and root-zone soil moisture. Geosci Model Dev 10:1903–1925. https://doi.org/10.5194/gmd-10-1903-2017
Mazrooei A, Sankarasubramanian A (2017) Utilizing probabilistic downscaling methods to develop streamflow forecasts from climate forecasts. J Hydrometeorol 18(11):2959–2972
McEvoy DJ, Huntington JL, Mejia JF, Hobbins MT (2016) Improved seasonal drought forecasts using reference evapotranspiration anomalies. Geophys Res Lett 43(1):377–385
McGovern A, Elmore KL, Gagne DJ, Haupt SE, Karstens CD, Lagerquist R, Smith T, Williams JK (2017) Using artificial intelligence to improve real-time decision making for high-impact weather. Bull Am Meteor Soc 98(10):2073–2090
Mendoza PA, Wood AW, Clark E, Rothwell E, Clark MP, Nijssen B, Brekke LD, Arnold JR (2017) An intercomparison of approaches for improving operational seasonal streamflow forecasts. Hydrol Earth Syst Sci 21(7):3915
Monhart S, Zappa M, Spirig C, Schär C, Bogner K (2019) Subseasonal hydrometeorological ensemble predictions in small- and medium-sized mountainous catchments: benefits of the NWP approach. Hydrol Earth Syst Sci 23:493–513
Muluye GY, Coulibaly P (2007) Seasonal reservoir inflow forecasting with low-frequency climatic indices: a comparison of data-driven methods. Hydrol Sci J 52(3):508–522
Nelson NG, Muñoz-Carpena R, Phlips EJ, Kaplan D, Sucsy P, Hendrickson J (2018) Revealing biotic and abiotic controls of harmful algal blooms in a shallow subtropical reservoir through statistical machine learning. Environ Sci Technol 52(6):3527–3535
Ni L, Wang D, Wu J, Wang Y, Tao Y, Zhang J, Liu J (2020) Streamflow forecasting using extreme gradient boosting model coupled with Gaussian mixture model. J Hydrol 586:124901
National Research Council (2007) Colorado River Basin Water management: evaluating and adjusting to hydroclimatic variability. The National Academies Press, Washington, DC. https://doi.org/10.17226/11857
Quinlan R (1992) Learning with continuous classes. In: Proceedings of the 5th Australian joint conference on artificial intelligence, pp 343–348
Raftery AE, Gneiting T, Balabdaoui F, Polakowski M (2005) Using Bayesian model averaging to calibrate forecast ensembles. Mon Weather Rev 133(5):1155–1174
Regonda SK, Rajagopalan B, Clark M, Zagona E (2006) A multimodel ensemble forecast framework: application to spring seasonal flows in the Gunnison River Basin. Water Resour Res 42:W09404. https://doi.org/10.1029/2005WR004653
Ridgeway G (1999) The state of boosting. Comput Sci Stat 31:172–181
Roy T, He X, Lin P, Beck HE, Wood EF (2019) Global evaluation of seasonal precipitation and temperature forecasts from NMME. J Hydrometeorol 21:2473–2486
Sagarika S, Kalra A, Ahmad S (2015) Interconnections between oceanic–atmospheric indices and variability in the US streamflow. J Hydrol 525:724–736
Sagarika S, Kalra A, Ahmad S (2016) Pacific ocean SST and Z500 climate variability and western US seasonal streamflow. Int J Climatol 36(3):1515–1533
Sahoo SS, Lampert CH, Martius G (2018) Learning equations for extrapolation and control. Preprint arXiv:1806.07259
Sahu N, Robertson AW, Boer R, Behera S, DeWitt DG, Takara K, Kumar M, Singh RB (2017) Probabilistic seasonal streamflow forecasts of the Citarum River, Indonesia, based on general circulation models. Stoch Env Res Risk Assess 31(7):1747–1758
Schapire RE (2003) The boosting approach to machine learning: an overview. Nonlinear Estim Classif 171:149–171
Schepen A, Wang QJ (2015) Model averaging methods to merge operational statistical and dynamic seasonal streamflow forecasts in Australia. Water Resour Res 51:1797–1812. https://doi.org/10.1002/2014WR016163
Shams MS, Anwar AF, Lamb KW, Bari M (2018) Relating ocean-atmospheric climate indices with Australian river streamflow. J Hydrol 556:294–309
Sheffield J, Wood EF, Pan M, Beck H, Coccia G, Serrat-Capdevila A, Verbist K (2018) Satellite remote sensing for water resources management: potential for supporting sustainable development in data-poor regions. Water Resour Res 54(12):9724–9758
Shukla S, Lettenmaier DP (2011) Seasonal hydrologic prediction in the United States: understanding the role of initial hydrologic conditions and seasonal climate forecast skill. Hydrol Earth Syst Sci 15(11):3529–3538
Slater LJ, Villarini G (2018) Enhancing the predictability of seasonal streamflow with a statistical-dynamical approach. Geophys Res Lett 45:6504–6513
Slater LJ, Villarini G, Bradley AA, Vecchi GA (2017) A dynamical statistical framework for seasonal streamflow forecasting in an agricultural watershed. Clim Dyn 53:1–17
Snoek J, Larochelle H, Adams RP (2012) Practical bayesian optimization of machine learning algorithms. In: Advances in neural information processing systems, pp 2951–2959
Tian D, Martinez CJ, Graham WD (2014) Seasonal prediction of regional reference evapotranspiration based on climate forecast system version 2. J Hydrometeorol 15(3):1166–1188
Wanders N, Bachas A, He XG, Huang H, Koppa A, Mekonnen ZT, Pagán BR, Peng LQ, Vergopolan N, Wang KJ, Xiao M (2017) Forecasting the hydroclimatic signature of the 2015/16 El Niño event on the western United States. J Hydrometeorol 18(1):177–186
Wang H, Asefa T, Wanakule N, Adams A (2020) Application of decision-support tools for seasonal water supply management that incorporates system uncertainties and operational constraints. J Water Resour Plan Manag 146(6):05020008
Wen Q, Sun L, Song X, Gao J, Wang X, Xu H (2020) Time series data augmentation for deep learning: a survey. Preprint arXiv:2002.12478
Wilks DS (2011) Statistical methods in the atmospheric sciences, vol 100. Academic Press, Berlin
Wood AW, Schaake JC (2008) Correcting errors in streamflow forecast ensemble mean and spread. J Hydrometeorol 9(1):132–148
Wood AW, Hopson T, Newman A, Brekke L, Arnold J, Clark M (2016) Quantifying streamflow forecast skill elasticity to initial condition and climate prediction skill. J Hydrometeorol 17(2):651–668
Worland SC, Farmer WH, Kiang JE (2018) Improving predictions of hydrological low-flow indices in ungaged basins using machine learning. Environ Model Softw 101:169–182
Xiao Y, Wu J, Lin Z, Zhao X (2018) A deep learning-based multi-model ensemble method for cancer prediction. Comput Methods Program Biomed 153:1–9
Xu L, Chen N, Zhang X (2018) A comparison of large-scale climate signals and the North American Multi-Model Ensemble (NMME) for drought prediction in China. J Hydrol 1(557):378–390
Xu CY, Xiong L, Singh VP (2019) Black–Box hydrological models. Handb Hydrometeorol Ensemble Forecast 2019:341–387
Yang T, Asanjan AA, Welles E, Gao X, Sorooshian S, Liu X (2017) Developing reservoir monthly inflow forecasts using artificial intelligence and climate phenomenon information. Water Resour Res 53(4):2786–2812
Yaseen ZM, Jaafar O, Deo RC, Kisi O, Adamowski J, Quilty J, El-Shafie A (2016) Stream-flow forecasting using extreme learning machines: a case study in a semi-arid region in Iraq. J Hydrol 542:603–614
Yuan X, Roundy JK, Wood EF, Sheffield J (2015) Seasonal forecasting of global hydrologic extremes: system development and evaluation over GEWEX basins. Bull Am Meteor Soc 96(11):1895–1912
Zhang D, Lin J, Peng Q, Wang D, Yang T, Sorooshian S, Liu X, Zhuang J (2018) Modeling and simulating of reservoir operation using the artificial neural network, support vector regression, deep learning algorithm. J Hydrol 565:720–736
Zhang D, Lin J, Peng Q, Wang D, Yang T, Sorooshian S, Liu X, Zhuang J (2018) Modeling and simulating of reservoir operation using the artificial neural network, support vector regression, deep learning algorithm. J Hydrol 565:720–736
Zhang F, Sun YQ, Magnusson L, Buizza R, Lin SJ, Chen JH, Emanuel K (2019) What is the predictability limit of midlatitude weather? J Atmos Sci 76(4):1077–1091
Acknowledgements
The authors thank the reservoir management sector of the Southern Company—Alabama Power for providing the inflow data of the Harris reservoir and Ms. Tayler Schillerberg for her assistance in processing the GLEAM data.
Funding
This research was supported in part by the USGS Alabama Water Resources Institute 104(b) Annual Grant Program, by the Auburn University Intramural Grant Program, and by the Alabama Agricultural Experiment Station and the Hatch Program of the National Institute of Food and Agriculture, US Department of Agriculture.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Competing interest
The authors declare that they have no known competing financial interest or personal relationships that could have appeared to influence the work reported in this paper.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Tian, D., He, X., Srivastava, P. et al. A hybrid framework for forecasting monthly reservoir inflow based on machine learning techniques with dynamic climate forecasts, satellite-based data, and climate phenomenon information. Stoch Environ Res Risk Assess 36, 2353–2375 (2022). https://doi.org/10.1007/s00477-021-02023-y
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00477-021-02023-y