Skip to main content
Log in

A hybrid framework for forecasting monthly reservoir inflow based on machine learning techniques with dynamic climate forecasts, satellite-based data, and climate phenomenon information

  • Original Paper
  • Published:
Stochastic Environmental Research and Risk Assessment Aims and scope Submit manuscript

Abstract

In this study, we developed and evaluated a hybrid framework for reservoir inflow forecast. This framework is unprecedented, which integrates new quasi-globally available observation-, satellite-, or model-based datasets using machine learing models to forecast inflow at the local scale. Under this framework, we compared random forests, gradient boosting machine, extreme learning machine, M5-cubist, elastic net, as well as their multi-model ensemble using Bayesian model averaging (BMA), and tested contributions from different input datasets, including retrospective forecast (reforecast) from florecast oriented low ocean resolution (FLOR) dynamic climate model, satellite-based hydrologic products, and climate phenomenon information. The performance was evaluated using Kling–Gupta efficiency (KGE) and correlation coefficient (R) in two headwater reservoirs, Harris reservoir in the humid Alabama–Coosa–Tallapoosa river basin and the Navajo reservoir in the arid Upper Colorado River Basin (UCRB). The results showed that for the Harris reservoir, the BMA combining five models with antecedent inflow and satellite-based hydrologic information as model inputs provided the best performance (KGE = 0.66, R = 0.76). For the Navajo reservoir, the gradient boosting machine model with all variables combined as input showed the best performance (KGE = 0.76, R = 0.83). Satellite-based soil moisture and evaporation consistently showed significant contributions to the inflow forecast. Benefits from climate indices and FLOR reforecast varied by locations, with more benefits coming from climate indices than FLOR potential evaporation reforecast at the Navajo reservoir in UCRB. Given the global coverage of the model inputs, our approach can be potentially applicable to improve reservoir inflow forecasts in different regions of the world.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

Availability of data and materials

Data used in this study are publicly available as indicated in the article and from authors.

Code availability

Computer codes are available from authors.

References

  • Alfieri L, Burek P, Dutra E, Krzeminski B, Muraro D, Thielen J, Pappenberger F (2013) GloFAS-global ensemble streamflow forecasting and flood early warning. Hydrol Earth Syst Sci 17(3):1161

    Google Scholar 

  • Anghileri D, Voisin N, Castelletti A, Pianosi F, Nijssen B, Lettenmaier DP (2016) Value of long-term streamflow forecasts to reservoir operations for water supply in snow-dominated river catchments. Water Resour Res 52:4209–4225

    Google Scholar 

  • Anghileri D, Monhart S, Zhou C, Bogner K, Castelletti A, Burlando P, Zappa M (2019) The value of subseasonal hydrometeorological forecasts to hydropower operations: How much does preprocessing matter? Water Resour Res 55(12):10159–10178

    Google Scholar 

  • Asefa T, Kemblowski M, McKee M, Khalil A (2006) Multi-time scale streamflow predictions: the support vector machines approach. J Hydrol 318(1):7–16

    Google Scholar 

  • Baugh C, de Rosnay P, Lawrence H, Jurlina T, Drusch M, Zsoter E, Prudhomme C (2020) The impact of SMOS soil moisture data assimilation within the Operational Global Flood Awareness System (GloFAS). Remote Sens 12(9):1490

    Google Scholar 

  • Bennett JC, Wang QJ, Li M, Robertson DE, Schepen A (2016) Reliable long-range ensemble streamflow forecasts: combining calibrated climate forecasts with a conceptual runoff model and a staged error model. Water Resour Res 52(10):8238–8259

    Google Scholar 

  • Biecek P (2018) DALEX: explainers for complex predictive models in R. J Mach Learn Res 19(84):1–5

    Google Scholar 

  • Breiman L (2001) Random forests. Mach Learn 45(1):5–32

    Google Scholar 

  • Chen X, Hao Z, Devineni N, Lall U (2014) Climate information based streamflow and rainfall forecasts for Huai River basin using hierarchical Bayesian modeling. Hydrol Earth Syst Sci 18(4):1539

    Google Scholar 

  • Cloke HL, Pappenberger F (2008) Evaluating forecasts of extreme events for hydrological applications: an approach for screening unfamiliar performance measures. Meteorol Appl 15(1):181–197

    Google Scholar 

  • Day GN (1985) Extended streamflow forecasting using NWSRFS. J Water Resour Plan Manag 111(2):157–170

    Google Scholar 

  • De Viron O, Dickey JO, Ghil M (2013) Global modes of climate variability. Geophys Res Lett 40(9):1832–1837

    Google Scholar 

  • Deo RC, Sahin M (2016) An extreme learning machine model for the simulation of monthly mean streamflow water level in eastern Queensland. Environ Monit Assess 188(2):90

    Google Scholar 

  • Devineni N, Sankarasubramanian A (2010) Improving the prediction of winter precipitation and temperature over the continental United States: role of the ENSO state in developing multimodel combinations. Monthly Weather Rev 138(6):2447–2468

    Google Scholar 

  • Emerton R, Zsoter E, Arnal L, Cloke HL, Muraro D, Prudhomme C, Stephens EM, Salamon P, Pappenberger F (2018) Developing a global operational seasonal hydro-meteorological forecasting system: GloFAS-Seasonal v1.0. Geosci Model Dev 11:3327–3346

    Google Scholar 

  • Erdal HI, Karakurt O (2013) Advancing monthly streamflow prediction accuracy of CART models using ensemble learning paradigms. J Hydrol 477:119–128

    Google Scholar 

  • Freund Y, Schapire RE (1997) A decision-theoretic generalization of on-line learning and an application to boosting. J Comput Syst Sci 55(1):119–139

    Google Scholar 

  • Freund Y, Schapire R, Abe N (1999) A short introduction to boosting. J Jpn Soc Artif Intell 14(771–780):1612

    Google Scholar 

  • Friedman JH (2001) Greedy function approximation: a gradient boosting machine. Ann Stat 29(5):1189–1232

    Google Scholar 

  • Friedman JH (2002) Stochastic gradient boosting. Comput Stat Data Anal 38(4):367–378

    Google Scholar 

  • Friedman JHT, Hastie RT (2000) Additive logistic regression: a statistical view of boosting. Ann Stat 28(2):337–374

    Google Scholar 

  • Funk C, Peterson P, Landsfeld M, Pedreros D, Verdin J, Shukla S, Husak G, Rowland J, Harrison L, Hoell A, Michaelsen J (2015) The climate hazards infrared precipitation with stations—a new environmental record for monitoring extremes. Sci Data 2(1):1–21

    Google Scholar 

  • Galelli S, Castelletti A (2013) Tree-based iterative input variable selection for hydrological modeling. Water Resour Res 49(7):4295–4310

    Google Scholar 

  • Giuliani M, Zaniolo M, Castelletti A, Davoli G, Block P (2019) Detecting the state of the climate system via artificial intelligence to improve seasonal forecasts and inform reservoir operations. Water Resour Res. https://doi.org/10.1029/2019WR025035

    Article  Google Scholar 

  • Gneiting T, Raftery AE, Westveld AH III, Goldman T (2005) Calibrated probabilistic forecasting using ensemble model output statistics and minimum CRPS estimation. Mon Weather Rev 133(5):1098–1118

    Google Scholar 

  • Grantz K, Rajagopalan B, Clark M, Zagona E (2005) A technique for incorporating large-scale climate information in basin-scale ensemble streamflow forecasts. Water Resour Res 41(10):W10410

    Google Scholar 

  • Gupta HV, Kling H, Yilmaz KK, Martinez GF (2009) Decomposition of the mean squared error and NSE performance criteria: Implications for improving hydrological modelling. J Hydrol 377(1–2):80–91

    Google Scholar 

  • Hamlet AF, Lettenmaier DP (1999) Columbia River streamflow forecasting based on ENSO and PDO climate signals. J Water Resour Plan Manag 125(6):333–341

    Google Scholar 

  • Hargreaves GH (1994) Defining and using reference evapotranspiration. J Irrig Drain Eng 120:1132–1139

    Google Scholar 

  • Harrigan S, Prudhomme C, Parry S, Smith K, Tanguy M (2018) Benchmarking ensemble streamflow prediction skill in the UK. Hydrol Earth Syst Sci 22:2018

    Google Scholar 

  • Hastie T, Tibshirani R, Friedman J (2009) The elements of statistical learning: data mining, inference, and prediction. Springer Science & Business Media

    Google Scholar 

  • Hautaniemi S, Kharait S, Iwabu A, Wells A, Lauffenburger DA (2005) Modeling of signal–response cascades using decision tree analysis. Bioinformatics 21(9):2027–2035

    CAS  Google Scholar 

  • He X, Zhao T, Yang D (2013) Prediction of monthly inflow to the Danjiangkou reservoir based on distributed hydrological model and hydro-climatic teleconnections. J Hydroelectr Eng 32(3):4–9

    Google Scholar 

  • He X, Chaney NW, Schleiss M, Sheffield J (2016) Spatial downscaling of precipitation using adaptable random forests. Water Resour Res 52(10):8217–8237

    Google Scholar 

  • Higgins RW, Leetmaa A, Kousky VE (2002) Relationships between climate variability and winter temperature extremes in the United States. J Clim 15(13):1555–1572

    Google Scholar 

  • Huang GB, Zhu QY, Siew CK (2006) Extreme learning machine: theory and applications. Neurocomputing 70(1):489–501

    Google Scholar 

  • Huang GB, Zhou H, Ding X, Zhang R (2012) Extreme learning machine for regression and multiclass classification. IEEE Trans Syst Man Cybern Part B (Cybern) 42(2):513–529

    Google Scholar 

  • Jia L, Yang X, Vecchi GA, Gudgel RG, Delworth TL, Rosati A, Stern WF, Wittenberg AT, Krishnamurthy L, Zhang S, Msadek R (2015) Improved seasonal prediction of temperature and precipitation over land in a high-resolution GFDL climate model. J Clim 28(5):2044–2062

    Google Scholar 

  • Kalra A, Ahmad S (2012) Estimating annual precipitation for the Colorado River Basin using oceanic-atmospheric oscillations. Water Resour Res 48(6):2012

    Google Scholar 

  • Kalra A, Miller WP, Lamb KW, Ahmad S, Piechota T (2013) Using large-scale climatic patterns for improving long lead time streamflow forecasts for Gunnison and San Juan River Basins. Hydrol Process 27:1543–1559. https://doi.org/10.1002/hyp.9236

    Article  Google Scholar 

  • Kirtman BP, Min D, Infanti JM, Kinter JL III, Paolino DA, Zhang Q, Van Den Dool H, Saha S, Mendez MP, Becker E, Peng P (2014) The North American multimodel ensemble: phase-1 seasonal-to-interannual prediction; phase-2 toward developing intraseasonal prediction. Bull Am Meteor Soc 95(4):585–601

    Google Scholar 

  • Knoben WJ, Freer JE, Woods RA (2019) Inherent benchmark or not? Comparing nash-sutcliffe and Kling–Gupta efficiency scores. Hydrol Earth Syst Sci 23(10):4323–4331

    Google Scholar 

  • Koster RD, Mahanama SP, Livneh B, Lettenmaier DP, Reichle RH (2010) Skill in streamflow forecasts derived from large-scale estimates of soil moisture and snow. Nat Geosci 3(9):613–616

    CAS  Google Scholar 

  • Kuhn M (2012) Variable importance using the caret package. J Stat Softw 28:1–26

    Google Scholar 

  • Kuhn M, Johnson K (2013) Applied predictive modeling, vol 26. Springer, New York

    Google Scholar 

  • Li B, Yang G, Wan R, Dai X, Zhang Y (2016) Comparison of random forests and other statistical methods for the prediction of reservoir water level: a case study of the Poyang Reservoir in China. Hydrol Res 47(S1):69–83

    Google Scholar 

  • Lima AR, Cannon AJ, Hsieh WW (2016) Forecasting daily streamflow using online sequential extreme learning machines. J Hydrol 537:431–443

    Google Scholar 

  • Liu Z, Zhou P, Chen X, Guan Y (2015) A multivariate conditional model for streamflow prediction and spatial precipitation refinement. J Geophys Res Atmos 120(19):10116–10129

    Google Scholar 

  • Maleski JJ, Martinez CJ (2018) Coupled impacts of ENSO AMO and PDO on temperature and precipitation in the Alabama–Coosa–Tallapoosa and Apalachicola–Chattahoochee–Flint river basins. Int J Climatol 38:e717–e728

    Google Scholar 

  • Martens B, Miralles DG, Lievens H, van der Schalie R, de Jeu RAM, Fernández-Prieto D, Beck HE, Dorigo WA, Verhoest NEC (2017) GLEAM v3: satellite-based land evaporation and root-zone soil moisture. Geosci Model Dev 10:1903–1925. https://doi.org/10.5194/gmd-10-1903-2017

    Article  Google Scholar 

  • Mazrooei A, Sankarasubramanian A (2017) Utilizing probabilistic downscaling methods to develop streamflow forecasts from climate forecasts. J Hydrometeorol 18(11):2959–2972

    Google Scholar 

  • McEvoy DJ, Huntington JL, Mejia JF, Hobbins MT (2016) Improved seasonal drought forecasts using reference evapotranspiration anomalies. Geophys Res Lett 43(1):377–385

    Google Scholar 

  • McGovern A, Elmore KL, Gagne DJ, Haupt SE, Karstens CD, Lagerquist R, Smith T, Williams JK (2017) Using artificial intelligence to improve real-time decision making for high-impact weather. Bull Am Meteor Soc 98(10):2073–2090

    Google Scholar 

  • Mendoza PA, Wood AW, Clark E, Rothwell E, Clark MP, Nijssen B, Brekke LD, Arnold JR (2017) An intercomparison of approaches for improving operational seasonal streamflow forecasts. Hydrol Earth Syst Sci 21(7):3915

    Google Scholar 

  • Monhart S, Zappa M, Spirig C, Schär C, Bogner K (2019) Subseasonal hydrometeorological ensemble predictions in small- and medium-sized mountainous catchments: benefits of the NWP approach. Hydrol Earth Syst Sci 23:493–513

    Google Scholar 

  • Muluye GY, Coulibaly P (2007) Seasonal reservoir inflow forecasting with low-frequency climatic indices: a comparison of data-driven methods. Hydrol Sci J 52(3):508–522

    Google Scholar 

  • Nelson NG, Muñoz-Carpena R, Phlips EJ, Kaplan D, Sucsy P, Hendrickson J (2018) Revealing biotic and abiotic controls of harmful algal blooms in a shallow subtropical reservoir through statistical machine learning. Environ Sci Technol 52(6):3527–3535

    CAS  Google Scholar 

  • Ni L, Wang D, Wu J, Wang Y, Tao Y, Zhang J, Liu J (2020) Streamflow forecasting using extreme gradient boosting model coupled with Gaussian mixture model. J Hydrol 586:124901

    Google Scholar 

  • National Research Council (2007) Colorado River Basin Water management: evaluating and adjusting to hydroclimatic variability. The National Academies Press, Washington, DC. https://doi.org/10.17226/11857

  • Quinlan R (1992) Learning with continuous classes. In: Proceedings of the 5th Australian joint conference on artificial intelligence, pp 343–348

  • Raftery AE, Gneiting T, Balabdaoui F, Polakowski M (2005) Using Bayesian model averaging to calibrate forecast ensembles. Mon Weather Rev 133(5):1155–1174

    Google Scholar 

  • Regonda SK, Rajagopalan B, Clark M, Zagona E (2006) A multimodel ensemble forecast framework: application to spring seasonal flows in the Gunnison River Basin. Water Resour Res 42:W09404. https://doi.org/10.1029/2005WR004653

    Article  Google Scholar 

  • Ridgeway G (1999) The state of boosting. Comput Sci Stat 31:172–181

    Google Scholar 

  • Roy T, He X, Lin P, Beck HE, Wood EF (2019) Global evaluation of seasonal precipitation and temperature forecasts from NMME. J Hydrometeorol 21:2473–2486

    Google Scholar 

  • Sagarika S, Kalra A, Ahmad S (2015) Interconnections between oceanic–atmospheric indices and variability in the US streamflow. J Hydrol 525:724–736

    Google Scholar 

  • Sagarika S, Kalra A, Ahmad S (2016) Pacific ocean SST and Z500 climate variability and western US seasonal streamflow. Int J Climatol 36(3):1515–1533

    Google Scholar 

  • Sahoo SS, Lampert CH, Martius G (2018) Learning equations for extrapolation and control. Preprint arXiv:1806.07259

  • Sahu N, Robertson AW, Boer R, Behera S, DeWitt DG, Takara K, Kumar M, Singh RB (2017) Probabilistic seasonal streamflow forecasts of the Citarum River, Indonesia, based on general circulation models. Stoch Env Res Risk Assess 31(7):1747–1758

    Google Scholar 

  • Schapire RE (2003) The boosting approach to machine learning: an overview. Nonlinear Estim Classif 171:149–171

    Google Scholar 

  • Schepen A, Wang QJ (2015) Model averaging methods to merge operational statistical and dynamic seasonal streamflow forecasts in Australia. Water Resour Res 51:1797–1812. https://doi.org/10.1002/2014WR016163

    Article  Google Scholar 

  • Shams MS, Anwar AF, Lamb KW, Bari M (2018) Relating ocean-atmospheric climate indices with Australian river streamflow. J Hydrol 556:294–309

    Google Scholar 

  • Sheffield J, Wood EF, Pan M, Beck H, Coccia G, Serrat-Capdevila A, Verbist K (2018) Satellite remote sensing for water resources management: potential for supporting sustainable development in data-poor regions. Water Resour Res 54(12):9724–9758

    Google Scholar 

  • Shukla S, Lettenmaier DP (2011) Seasonal hydrologic prediction in the United States: understanding the role of initial hydrologic conditions and seasonal climate forecast skill. Hydrol Earth Syst Sci 15(11):3529–3538

    Google Scholar 

  • Slater LJ, Villarini G (2018) Enhancing the predictability of seasonal streamflow with a statistical-dynamical approach. Geophys Res Lett 45:6504–6513

    Google Scholar 

  • Slater LJ, Villarini G, Bradley AA, Vecchi GA (2017) A dynamical statistical framework for seasonal streamflow forecasting in an agricultural watershed. Clim Dyn 53:1–17

    Google Scholar 

  • Snoek J, Larochelle H, Adams RP (2012) Practical bayesian optimization of machine learning algorithms. In: Advances in neural information processing systems, pp 2951–2959

  • Tian D, Martinez CJ, Graham WD (2014) Seasonal prediction of regional reference evapotranspiration based on climate forecast system version 2. J Hydrometeorol 15(3):1166–1188

    Google Scholar 

  • Wanders N, Bachas A, He XG, Huang H, Koppa A, Mekonnen ZT, Pagán BR, Peng LQ, Vergopolan N, Wang KJ, Xiao M (2017) Forecasting the hydroclimatic signature of the 2015/16 El Niño event on the western United States. J Hydrometeorol 18(1):177–186

    Google Scholar 

  • Wang H, Asefa T, Wanakule N, Adams A (2020) Application of decision-support tools for seasonal water supply management that incorporates system uncertainties and operational constraints. J Water Resour Plan Manag 146(6):05020008

    Google Scholar 

  • Wen Q, Sun L, Song X, Gao J, Wang X, Xu H (2020) Time series data augmentation for deep learning: a survey. Preprint arXiv:2002.12478

  • Wilks DS (2011) Statistical methods in the atmospheric sciences, vol 100. Academic Press, Berlin

    Google Scholar 

  • Wood AW, Schaake JC (2008) Correcting errors in streamflow forecast ensemble mean and spread. J Hydrometeorol 9(1):132–148

    Google Scholar 

  • Wood AW, Hopson T, Newman A, Brekke L, Arnold J, Clark M (2016) Quantifying streamflow forecast skill elasticity to initial condition and climate prediction skill. J Hydrometeorol 17(2):651–668

    Google Scholar 

  • Worland SC, Farmer WH, Kiang JE (2018) Improving predictions of hydrological low-flow indices in ungaged basins using machine learning. Environ Model Softw 101:169–182

    Google Scholar 

  • Xiao Y, Wu J, Lin Z, Zhao X (2018) A deep learning-based multi-model ensemble method for cancer prediction. Comput Methods Program Biomed 153:1–9

    Google Scholar 

  • Xu L, Chen N, Zhang X (2018) A comparison of large-scale climate signals and the North American Multi-Model Ensemble (NMME) for drought prediction in China. J Hydrol 1(557):378–390

    Google Scholar 

  • Xu CY, Xiong L, Singh VP (2019) Black–Box hydrological models. Handb Hydrometeorol Ensemble Forecast 2019:341–387

    Google Scholar 

  • Yang T, Asanjan AA, Welles E, Gao X, Sorooshian S, Liu X (2017) Developing reservoir monthly inflow forecasts using artificial intelligence and climate phenomenon information. Water Resour Res 53(4):2786–2812

    Google Scholar 

  • Yaseen ZM, Jaafar O, Deo RC, Kisi O, Adamowski J, Quilty J, El-Shafie A (2016) Stream-flow forecasting using extreme learning machines: a case study in a semi-arid region in Iraq. J Hydrol 542:603–614

    Google Scholar 

  • Yuan X, Roundy JK, Wood EF, Sheffield J (2015) Seasonal forecasting of global hydrologic extremes: system development and evaluation over GEWEX basins. Bull Am Meteor Soc 96(11):1895–1912

    Google Scholar 

  • Zhang D, Lin J, Peng Q, Wang D, Yang T, Sorooshian S, Liu X, Zhuang J (2018) Modeling and simulating of reservoir operation using the artificial neural network, support vector regression, deep learning algorithm. J Hydrol 565:720–736

    Google Scholar 

  • Zhang D, Lin J, Peng Q, Wang D, Yang T, Sorooshian S, Liu X, Zhuang J (2018) Modeling and simulating of reservoir operation using the artificial neural network, support vector regression, deep learning algorithm. J Hydrol 565:720–736

    Google Scholar 

  • Zhang F, Sun YQ, Magnusson L, Buizza R, Lin SJ, Chen JH, Emanuel K (2019) What is the predictability limit of midlatitude weather? J Atmos Sci 76(4):1077–1091

    Google Scholar 

Download references

Acknowledgements

The authors thank the reservoir management sector of the Southern Company—Alabama Power for providing the inflow data of the Harris reservoir and Ms. Tayler Schillerberg for her assistance in processing the GLEAM data.

Funding

This research was supported in part by the USGS Alabama Water Resources Institute 104(b) Annual Grant Program, by the Auburn University Intramural Grant Program, and by the Alabama Agricultural Experiment Station and the Hatch Program of the National Institute of Food and Agriculture, US Department of Agriculture.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Di Tian.

Ethics declarations

Competing interest

The authors declare that they have no known competing financial interest or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Tian, D., He, X., Srivastava, P. et al. A hybrid framework for forecasting monthly reservoir inflow based on machine learning techniques with dynamic climate forecasts, satellite-based data, and climate phenomenon information. Stoch Environ Res Risk Assess 36, 2353–2375 (2022). https://doi.org/10.1007/s00477-021-02023-y

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00477-021-02023-y

Keywords

Navigation