Abstract
The forecasting of lake water level is one of the complex problems in the hydrology field owing to the incorporating with various hydrological and morphological characteristics. In this research, newly hybrid data intelligence (DI) model based on the integration of the Multilayer Perceptron (MLP) and Whale Optimization Algorithm (WOA) is developed for lake water level forecasting. The potential of the proposed hybrid MLP_WOA model is validated against several well-established DI models over the literature including the Cascade-Correlation Neural Network Model (CCNNM), Self-Organizing Map (SOM), Decision Tree Regression (DTR), Random Forest Regression (RFR), and classical MLP. The applied predictive models are examined to forecast the Van Lake water level fluctuation with monthly scale over seven-decade time period (1943–2016). The input variables are abstracted using statistical correlation analysis procedure. The modeling is diagnosed using multiple statistical metrics (i.e., root mean square error (RMSE), mean absolute error (MAE), Nash-Sutcliffe coefficient (NSE), Willmott’s Index (WI), Legate and McCabe’s Index (LMI), determination coefficient (R2)). In addition, graphical distribution data such as the Taylor diagram, violin plot, and point density are investigated. Results indicated that the MLP_WOA model performed superior prediction results over the comparable models based on forecasting performance. Five-month lead times performed the best results for the prediction procedure. In quantitative terms, the RMSE and MAE are reduced by 29.8% and 33.9%, 48.3% and 52%, 57.6% and 59.7%, 53.9% and 58.3%, and 25.3% and 23.9% using the MLP_WOA model over CCNNM, SOM, DTR, RFR, and MLP models, respectively. In comparison with the literature studies, using longer span of historical data elevated the forecasting accuracy. In summary, MLP_WOA model provided an applicable and simple methodology for Van Lake water level forecasting owing to its simple learning procedure.
Similar content being viewed by others
References
Abdulwahab HA, Noraziah A, Alsewari AA, Salih SQ (2019) An enhanced version of black hole algorithm via levy flight for optimization and data clustering problems
Aljarah I, Faris H, Mirjalili S (2018) Optimizing connection weights in neural networks using the whale optimization algorithm. Soft Comput 22:1–15. https://doi.org/10.1007/s00500-016-2442-1
Altunkaynak A (2007) Forecasting surface water level fluctuations of lake van by artificial neural networks. Water Resour Manag 21:399–408. https://doi.org/10.1007/s11269-006-9022-6
Aytek A, Kisi O, Guven A (2014) A genetic programming technique for lake level modeling. Hydrol Res 45:529–539. https://doi.org/10.2166/nh.2013.069
Barros RC, Basgalupp MP, De Carvalho ACPLF, Freitas AA (2012) A survey of evolutionary algorithms for decision-tree induction. IEEE Trans Syst Man Cybern Part C Appl Rev 42:291–312
Breiman L (1996) Bagging predictors. Mach Learn 24:123–140. https://doi.org/10.1007/BF00058655
Céréghino R, Park Y-S (2009) Review of the self-organizing map (SOM) approach in water resources: commentary. Environ Model Softw 24:945–947
Chai T, Draxler RR (2014) Root mean square error (RMSE) or mean absolute error (MAE)? -Arguments against avoiding RMSE in the literature. Geosci Model Dev 7:1247–1250. https://doi.org/10.5194/gmd-7-1247-2014
Chidammodzi CL, Muhandiki VS (2016) An indicator framework for assessing the technology aspect of Integrated Lake Basin Management for Lake Malawi Basin. Ecol Indic 60:789–801. https://doi.org/10.1016/j.ecolind.2015.08.021
Cutler DR, Edwards TC, Beard KH et al (2007) Random forests for classification in ecology. Ecology 88:2783–2792. https://doi.org/10.1890/07-0539.1
Doulgeris C, Georgiou P, Papadimos D, Papamichail D (2012) Ecosystem approach to water resources management using the MIKE 11 modeling system in the Strymonas River and Lake Kerkini. J Environ Manag 94:132–143. https://doi.org/10.1016/j.jenvman.2011.06.023
Fahimi F, Yaseen ZM, El-shafie A (2017) Application of soft computing based hybrid models in hydrological variables modeling: a comprehensive review. Theor Appl Climatol 128:875–903. https://doi.org/10.1007/s00704-016-1735-8
Fahlman SE, Lebiere C (1990) The cascade-correlation learning architecture. Adv Neural Inf Proces Syst:524–532. https://doi.org/10.1190/1.1821929
Gantzer PA, Bryant LD, Little JC (2009) Lake and Reservoir Management. Water Environ Res 81:1854–1956. https://doi.org/10.2175/106143009X12445568400494
Ghorbani MA, Khatibi R, Karimi V et al (2018) Learning from multiple models using artificial intelligence to improve model prediction accuracies: application to river flows. Water Resour Manag. https://doi.org/10.1007/s11269-018-2038-x
Hintze JL, Nelson RD (1998) Violin plots: A box plot-density trace synergism. Am Stat 52:181–184. https://doi.org/10.1080/00031305.1998.10480559
Jiang H, Deng Y, Chen H-S et al (2004) Joint analysis of two microarray gene-expression data sets to select lung adenocarcinoma marker genes. BMC Bioinformatics 5:81
Khan N, Shahid S, Juneng L, Ahmed K, Ismail T, Nawaz N (2019) Prediction of heat waves in Pakistan using quantile regression forests. Atmos Res 221:1–11. https://doi.org/10.1016/j.atmosres.2019.01.024
Khatibi R, Ghorbani MA, Naghipour L, Jothiprakash V, Fathima TA, Fazelifard MH (2014) Inter-comparison of time series models of lake levels predicted by several modeling strategies. J Hydrol 511:530–545. https://doi.org/10.1016/j.jhydrol.2014.01.009
Khozani ZS, Khosravi K, Pham BT, et al (2019) Determination of compound channel apparent shear stress: application of novel data mining models.
Kohonen T (1982) Self-organized formation of topologically correct feature maps. Biol Cybern 43:59–69. https://doi.org/10.1007/BF00337288
Kovalishyn VV, Tetko IV, Luik AI, Kholodovych VV, Villa AEP, Livingstone DJ (1998) Neural network studies. 3. Variable selection in the cascade-correlation learning architecture. J Chem Inf Comput Sci 38:651–659. https://doi.org/10.1021/ci980325n
Legates DR, Mccabe GJ (1999) Evaluating the use of “goodness-of-fit” measures in hydrologic and hydroclimatic model validation. Water Resour Res 35:233–241
Li XY, Xu HY, Sun YL, Zhang DS, Yang ZP (2007) Lake-level change and water balance analysis at lake Qinghai, West China during recent decades. Water Resour Manag 21:1505–1516. https://doi.org/10.1007/s11269-006-9096-1
Li B, Yang G, Wan R et al (2016) Comparison of random forests and other statistical methods for the prediction of lake water level: a case study of the {Poyang} {Lake} in {China}. Hydrol Res. https://doi.org/10.2166/nh.2016.264
Lima AR, Cannon AJ, Hsieh WW (2016) Forecasting daily streamflow using online sequential extreme learning machines. J Hydrol 537:431–443. https://doi.org/10.1016/j.jhydrol.2016.03.017
McClelland JL, Rumelhart DE (1988) Explorations in parallel distributed processing: a handbook of models, programs, and exercises. Explorations in Parallel Distributed Processing a Handbook of Models Programs and Exercises 344:ix, 344 p. https://doi.org/10.2307/1423065
Mirjalili S, Lewis A (2016) The Whale Optimization Algorithm. Adv Eng Softw 95:51–67. https://doi.org/10.1016/j.advengsoft.2016.01.008
Mohanty S, Jha MK, Raul SK, Panda RK, Sudheer KP (2015) Using artificial neural network approach for simultaneous forecasting of weekly groundwater levels at multiple sites. Water Resour Manag 29:5521–5532. https://doi.org/10.1007/s11269-015-1132-6
Oǧuz AR, Yeltekin A (2014) Metal levels in the liver, muscle, gill, intestine, and gonad of Lake Van Fish (Chalcalburnus tarichi) with abnormal gonad. Biol Trace Elem Res 159:219–223. https://doi.org/10.1007/s12011-014-9980-0
Prasad AM, Iverson LR, Liaw A (2006) Newer classification and regression tree techniques: Bagging and random forests for ecological prediction. Ecosystems 9:181–199. https://doi.org/10.1007/s10021-005-0054-1
Quinlan JR (1992) Learning with continuous classes. In 5th Australian joint conference on artificial intelligence 92:343–348. https://doi.org/10.1142/9789814536271
Salih SQ, Alsewari AA (2019) A new algorithm for normal and large-scale optimization problems: Nomadic People Optimizer. Neural Comput & Applic:1–28
Salih SQ, Allawi MF, Yousif AA et al (2019) Viability of the advanced adaptive neuro-fuzzy inference system model on reservoir evaporation process simulation: case study of Nasser Lake in Egypt. Eng Appl Comput Fluid Mech 13:878–891
Sari M (2008) Threatened fishes of the world: Chalcalburnus tarichi (Pallas 1811) (Cyprinidae) living in the highly alkaline Lake Van, Turkey. Environ Biol Fish 81:21–23. https://doi.org/10.1007/s10641-006-9154-9
Schuurmans D, Southey F (2002) Metric-based methods for adaptive model selection and regularization. Mach Learn 48:51–84. https://doi.org/10.1023/A:1013947519741
Shafaei M, Kisi O (2016) Lake level forecasting using wavelet-SVR, wavelet-ANFIS and wavelet-ARMA conjunction models. Water Resour Manag 30:79–97. https://doi.org/10.1007/s11269-015-1147-z
Talebizadeh M, Moridnejad A (2011) Uncertainty analysis for the forecast of lake level fluctuations using ensembles of ANN and ANFIS models. Expert Syst Appl 38:4126–4135. https://doi.org/10.1016/j.eswa.2010.09.075
Tao H, Diop L, Bodian A et al (2018) Reference evapotranspiration prediction using hybridized fuzzy model with firefly algorithm: regional case study in Burkina Faso.
Taylor KE (2001) Summarizing multiple aspects of model performance in a single diagram. J Geophys Res-Atmos 106:7183–7192. https://doi.org/10.1029/2000JD900719
Ul-Saufie AZ, Yahya AS, Ramli NA (2011) Improving multiple linear regression model using principal component analysis for predicting PM 10 concentration in Seberang Prai , Pulau Pinang. Int J Environ Sci 2:403–410. https://doi.org/10.6088/ijes.00202020003
Utkucu M (2006) Implications for the water level change triggered moderate (M ≥ 4.0) earthquakes in Lake Van basin, Eastern Turkey. J Seismol 10:105–117. https://doi.org/10.1007/s10950-005-9002-y
Vaheddoost B, Aksoy H, Abghari H (2016) Prediction of Water Level using Monthly Lagged Data in Lake Urmia, Iran. Water Resour Manag 30:4951–4967. https://doi.org/10.1007/s11269-016-1463-y
Van Geest GJ, Wolters H, Roozen FCJM et al (2005) Water-level fluctuations affect macrophyte richness in floodplain lakes. In: Hydrobiologia
Vaziri M (1997) Predicting caspian sea surface water level by ANN and ARIMA models. J Waterw Port Coast Ocean Eng 123:158–162
Wang S, Yao X (2013) Using class imbalance learning for software defect prediction. IEEE Trans Reliab 62:434–443. https://doi.org/10.1109/TR.2013.2259203
Willmott CJ (1984) On the evaluation of model performance in physical geography. In: Spatial statistics and models. pp 443–446
Willmott CJ, Matsuura K (2005) Advantages of the mean absolute error (MAE) over the root mean square error (RMSE) in assessing average model performance. Clim Res 30:79–82. https://doi.org/10.3354/cr030079
Xu P, Jelinek F (2004) Random forests in language modeling. Emnlp-2004 325–332
Xu M, Watanachaturaporn P, Varshney PK, Arora MK (2005) Decision tree regression for soft classification of remote sensing data. Remote Sens Environ 97:322–336
Yadav B, Eliza K (2017) A hybrid wavelet-support vector machine model for prediction of Lake water level fluctuations using hydro-meteorological data. Measurement 103:294–301. https://doi.org/10.1016/j.measurement.2017.03.003
Yaseen ZM, Ebtehaj I, Bonakdari H et al (2017) Novel approach for streamflow forecasting using a hybrid ANFIS-FFA model
Yaseen ZM, Deo RC, Ebtehaj I, Bonakdari H (2018a) Hybrid data intelligent models and applications for water level prediction. In: Handbook of research on predictive modeling and optimization methods in science and engineering
Yaseen ZM, Sulaiman SO, Deo RC, Chau K-W (2018b) An enhanced extreme learning machine model for river flow forecasting: state-of-the-art, practical applications in water resource engineering area and future research direction. J Hydrol 569:387–408. https://doi.org/10.1016/j.jhydrol.2018.11.069
Young C-C, Liu W-C, Hsieh W-L (2015) Predicting the water level fluctuation in an alpine lake using physically based, artificial neural network, and time series forecasting models. Math Probl Eng 2015:1–11. https://doi.org/10.1155/2015/708204
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
All the assigned reviewers are not from the same author origins.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Yaseen, Z.M., Naghshara, S., Salih, S.Q. et al. Lake water level modeling using newly developed hybrid data intelligence model. Theor Appl Climatol 141, 1285–1300 (2020). https://doi.org/10.1007/s00704-020-03263-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00704-020-03263-8