Abstract
This study investigates the accuracy of three different techniques with the periodicity component for estimating monthly lake levels. The three techniques are multivariate adaptive regression splines (MARS), least-square support vector regression (LSSVR), and M5 model tree (M5-tree). Data from Lake Michigan, located in the USA, is used in the analysis. In the first stage of modeling, three techniques were applied to forecast monthly lake level fluctuations up to 8 months ahead of time intervals. In the second stage, the influence of the periodicity component was applied (month number of the year, e.g., 1, 2, 3, …12) as an external subset in modeling monthly lake levels. The root-mean-square error, mean absolute error, and coefficient of determination were used for evaluating the accuracy of the models. In both stages, the comparison results indicate that MARS generally outperforms LSSVR and M5-tree. Further, it has been discovered that including periodicity as an input to the models improves their accuracy in projecting monthly lake levels.
Similar content being viewed by others
Availability of data and material
Data are available from the U.S. Army Corps of Engineers.
Code availability
Not applicable.
References
Adamowski J, Chan HF, Prasher SO, Sharda VN (2012) Comparison of multivariate adaptive regression splines with coupled wavelet transform artificial neural networks for runoff forecasting in Himalayan micro-watersheds with limited data. J Hydroinformatics 14:731–744. https://doi.org/10.2166/hydro.2011.044
Altunkaynak A (2007) Forecasting surface water level fluctuations of lake van by artificial neural networks. Water Resour Manag 21:399–408. https://doi.org/10.1007/s11269-006-9022-6
Altunkaynak A, Özger M, Sen Z (2003) Triple diagram model of level fluctuations in Lake Van, Turkey. Hydrol Earth Syst Sci 7:235–244. https://doi.org/10.5194/hess-7-235-2003
Altunkaynak A, Şen Z (2007) Fuzzy logic model of lake water level fluctuations in Lake Van, Turkey. Theor Appl Climatol 90:227–233. https://doi.org/10.1007/s00704-006-0267-z
Başakın EE, Ekmekcioğlu Ö, Çıtakoğlu H, Özger M (2022) A new insight to the wind speed forecasting: robust multi-stage ensemble soft computing approach based on pre-processing uncertainty assessment. Neural Comput Appl 34:783–812. https://doi.org/10.1007/s00521-021-06424-6
Bera, S. O. Prasher, R. M. Patel, et al (2006) Application of MARS in simulating pesticide concentrations in soil. Trans ASABE 49:297–307. https://doi.org/10.13031/2013.20228
Bhattacharya B, Solomatine DP (2005) Neural networks and M5 model trees in modelling water level-discharge relationship. Neurocomputing 63:381–396. https://doi.org/10.1016/j.neucom.2004.04.016
Citakoglu H (2021) Comparison of multiple learning artificial intelligence models for estimation of long-term monthly temperatures in Turkey. Arab J Geosci 14:2131. https://doi.org/10.1007/s12517-021-08484-3
Coulibaly P (2010) Reservoir computing approach to Great Lakes water level forecasting. J Hydrol 381:76–88. https://doi.org/10.1016/j.jhydrol.2009.11.027
De Andrés J, Lorca P, De Cos Juez FJ, Sánchez-Lasheras F (2011) Bankruptcy forecasting: a hybrid approach using fuzzy c-means clustering and multivariate adaptive regression splines (MARS). Expert Syst Appl 38:1866–1875. https://doi.org/10.1016/j.eswa.2010.07.117
Demir V, Çubukçu EA (2021) Digital elevation modeling with heuristic regression techniques. Eur J Sci Technol 484–488. https://doi.org/10.31590/ejosat.916012
Deng S, Yeh TH (2010) Applying least squares support vector machines to the airframe wing-box structural design cost estimation. Expert Syst Appl 37:8417–8423. https://doi.org/10.1016/j.eswa.2010.05.038
Friedman JH (1991) Multivariate adaptive regression splines. Ann Stat 19:590–606. https://doi.org/10.1214/aos/1176347963
Görkemli B Citakoglu H Haktanir T Karaboga D 2022 A new method based on artificial bee colony programming for the regional standardized intensity–duration-frequency relationship Arab J Geosci 15https://doi.org/10.1007/s12517-021-09377-1
Goyal MK (2014) Modeling of sediment yield prediction using M5 model tree algorithm and wavelet regression. Water Resour Manag 28:1991–2003. https://doi.org/10.1007/s11269-014-0590-6
Hemmati-Sarapardeh A, Shokrollahi A, Tatar A et al (2014) Reservoir oil viscosity determination using a rigorous approach. Fuel 116:39–48. https://doi.org/10.1016/j.fuel.2013.07.072
Huang Z, Luo J, Li X, Zhou Y (2009) Prediction of effluent parameters of wastewater treatment plant based on ımproved least square support vector machine with PSO. In: 2009 First International Conference on Information Science and Engineering. IEEE, pp 4058–4061
Hwang SH, Ham DH, Kim JH (2012) Forecasting performance of LS-SVM for nonlinear hydrological time series. KSCE J Civ Eng 16:870–882. https://doi.org/10.1007/s12205-012-1519-3
Ji Z, Wang B, Deng S, You Z (2014) Predicting dynamic deformation of retaining structure by LSSVR-based time series method. Neurocomputing 137:165–172. https://doi.org/10.1016/j.neucom.2013.03.073
Kamari A, Nikookar M, Sahranavard L, Mohammadi AH (2014) Efficient screening of enhanced oil recovery methods and predictive economic analysis. Neural Comput Appl 25:815–824. https://doi.org/10.1007/s00521-014-1553-9
Karimi S, Shiri J, Kisi O, Makarynskyy O (2012) Forecasting water level fluctuations of Urmieh Lake using gene expression programming and adaptive neuro-fuzzy ınference system. Int J Ocean Clim Syst 3:109–125. https://doi.org/10.1260/1759-3131.3.2.109
Kisi O (2015) Pan evaporation modeling using least square support vector machine, multivariate adaptive regression splines and M5 model tree. J Hydrol 528:312–320. https://doi.org/10.1016/j.jhydrol.2015.06.052
Kisi O (2015) Streamflow forecasting and estimation using least square support vector regression and adaptive neuro-fuzzy embedded fuzzy c-means clustering. Water Resour Manag 29:5109–5127. https://doi.org/10.1007/s11269-015-1107-7
Kisi O (2012) Modeling discharge-suspended sediment relationship using least square support vector machine. J Hydrol 456–457:110–120. https://doi.org/10.1016/j.jhydrol.2012.06.019
Kisi O (2013) Least squares support vector machine for modeling daily reference evapotranspiration. Irrig Sci 31:611–619. https://doi.org/10.1007/s00271-012-0336-2
Kişi Ö (2009) Neural network and wavelet conjunction model for modelling monthly level fluctuations in Turkey. Hydrol Process 23:2081–2092. https://doi.org/10.1002/hyp.7340
Kisi O, Parmar KS (2016) Application of least square support vector machine and multivariate adaptive regression spline models in long term prediction of river water pollution. J Hydrol 534:104–112. https://doi.org/10.1016/j.jhydrol.2015.12.014
Kisi O Parmar KS Soni K Demir V 2017a Modeling of air pollutants using least square support vector regression, multivariate adaptive regression spline, and M5 model tree models Air Qual Atmos Heal 10https://doi.org/10.1007/s11869-017-0477-9
Kisi O, Shiri J, Demir V (2017b) Hydrological Time Series Forecasting Using Three Different Heuristic Regression Techniques. In: Handbook of Neural Computation. Elsevier, pp 45–65
Leathwick JR, Elith J, Hastie T (2006) Comparative performance of generalized additive models and multivariate adaptive regression splines for statistical modelling of species distributions. Ecol Modell 199:188–196. https://doi.org/10.1016/j.ecolmodel.2006.05.022
Legates DR, McCabe GJ (1999) Evaluating the use of “goodness-of-fit” measures in hydrologic and hydroclimatic model validation. Water Resour Res 35:233–241. https://doi.org/10.1029/1998WR900018
Legouhy A (2021) al_goodplot - boxblot & violin plot. In: MATLAB Cent. mathworks. https://www.mathworks.com/matlabcentral/fileexchange/91790-al_goodplot-boxblot-violin-plot
Liang C Li H Lei M Du Q 2018 Dongting Lake water level forecast and its relationship with the Three Gorges Dam based on a long short-term memory network Water (switzerland) 10https://doi.org/10.3390/w10101389
Luo Y Dong Z Liu Y 2021 Research on stage-divided water level prediction technology of rivers-connected lake based on machine learning: a case study of Hongze Lake China Stoch Environ Res Risk Assess 6https://doi.org/10.1007/s00477-021-01974-6
Mitchell TM 1997 Machine learning The McGraw-Hill Companies New York
Michigan (2021a) Sea Grant. https://www.michiganseagrant.org/topics/great-lakes-fast-facts/lake-michigan/. Accessed 15 Jul 2021
Michigan (2021b). https://www.goller.gen.tr/michigan-golu.html. Accessed 15 Jul 2021b
Okkan U, Ali Serbes Z (2013) The combined use of wavelet transform and black box models in reservoir inflow modeling. J Hydrol Hydromechanics 61:112–119. https://doi.org/10.2478/johh-2013-0015
Pahasa J, Ngamroo I (2011) A heuristic training-based least squares support vector machines for power system stabilization by SMES. Expert Syst Appl 38:13987–13993. https://doi.org/10.1016/j.eswa.2011.04.206
Pal M, Deswal S (2009) M5 model tree based modelling of reference evapotranspiration. Hydrol Process 23:1437–1443. https://doi.org/10.1002/hyp.7266
Peprah MS, Larbi EK, Peprah MS (2021) International Journal of Earth Sciences Knowledge and Applications lake water level prediction model based on autocorrelation regressive ıntegrated moving average and Kalman filtering techniques — an empirical study on Lake Volta Basin, Ghana. Int J Earth Sci Knowl Appl 3:1–11
Quinlan JR (1992) Learning with continuous classes. World Sci 92:343–348. https://doi.org/10.1.1.34.885
Sanikhani H, Kisi O, Kiafar H (2015) Comparison of different data-driven approaches for modeling lake level fluctuations : the case of Manyas and Tuz Lakes (Turkey). Water Resour Manag 29:1557–1574. https://doi.org/10.1007/s11269-014-0894-6
Şen Z, Kadioǧlu M, Batur E (2000) Stochastic modeling of the Van Lake monthly level fluctuations in Turkey. Theor Appl Climatol 65:99–110. https://doi.org/10.1007/s007040050007
Sephton P (2001) Forecasting recessions: can we do better on MARS? Review 83:39–49
Shafaei M, Kisi O (2016) Lake level forecasting using wavelet-SVR, wavelet-ANFIS and wavelet-ARMA conjunction models. Water Resour Manag 30:79–97. https://doi.org/10.1007/s11269-015-1147-z
Sharda VN, Patel RM, Prasher SO et al (2006) Modeling runoff from middle Himalayan watersheds employing artificial intelligence techniques. Agric Water Manag 83:233–242. https://doi.org/10.1016/j.agwat.2006.01.003
Shiri J, Shamshirband S, Kisi O et al (2016) Prediction of water-level in the Urmia Lake using the extreme learning machine approach. Water Resour Manag 30:5217–5229. https://doi.org/10.1007/s11269-016-1480-x
Shokrollahi A, Arabloo M, Gharagheizi F, Mohammadi AH (2013) Intelligent model for prediction of CO2 — reservoir oil minimum miscibility pressure. Fuel 112:375–384. https://doi.org/10.1016/j.fuel.2013.04.036
Shortridge JE, Guikema SD, Zaitchik BF (2015) Empirical streamflow simulation for water resource management in data-scarce seasonal watersheds. Hydrol Earth Syst Sci Discuss 12:11083–11127. https://doi.org/10.5194/hessd-12-11083-2015
Solomatine DP, Dulal KN (2003) Model trees as an alternative to neural networks in rainfall-runoff modelling. Hydrol Sci J 48:399–411. https://doi.org/10.1623/hysj.48.3.399.45291
Solomatine DP, Xue Y (2004) M5 Model trees and neural networks: application to flood forecasting in the upper reach of the Huai River in China. J Hydrol Eng 9:491–501. https://doi.org/10.1061/(ASCE)1084-0699(2004)9:6(491)
Sotomayor KAL (2010) Comparison of adaptive methods using multivariate regression splines ( MARS ) and artificial neural networks backpropagation (ANNB) for the forecast of rain and temperatures in the Mantaro river basin. Hydrol Days 2010:58–68
Suykens JAK, Vandewalle J (1999) No Title Neural Process Lett 9:293–300. https://doi.org/10.1023/A:1018628609742
Taylor KE (2001) Summarizing multiple aspects of model performance in a single diagram. J Geophys Res Atmos 106:7183–7192. https://doi.org/10.1029/2000JD900719
Vuglinskiy V (2009) Water level: water level in lakes and reservoirs, water storage. Assessment of the status of the development of the standards for the terrestrial essential climate variables, Global Terrestrial Observing System (GTOS), Rome, Italy
Yaseen ZM Kisi O Demir V 2016 Enhancing long-term streamflow forecasting and predicting using periodicity data component: application of artificial ıntelligence Water ResourManag 30https://doi.org/10.1007/s11269-016-1408-5
Young CC Liu WC Hsieh WL 2015 Predicting the water level fluctuation in an Alpine Lake using physically based, artificial neural network, and time series forecasting models Math ProblEng 2015https://doi.org/10.1155/2015/708204
Acknowledgements
The author would like to thank KTO Karatay University.
Author information
Authors and Affiliations
Contributions
All chapters have been prepared by Vahdettin Demir.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
This study did not involve any protected area, private land, and endangered or protected species, and no specific permissions were required for this activity.
Consent for publication
Written informed consent for publication was obtained from all authors.
Conflict of interest
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Demir, V. Enhancing monthly lake levels forecasting using heuristic regression techniques with periodicity data component: application of Lake Michigan. Theor Appl Climatol 148, 915–929 (2022). https://doi.org/10.1007/s00704-022-03982-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00704-022-03982-0