Abstract
This comprehensive study addresses the correlation between water levels and meteorological features, including air temperature, evaporation, and precipitation, to accurately predict water levels in lakes within the Great Lakes basin. Various models, namely multiple linear regression (MLR), nonlinear autoregressive network with exogenous inputs (NARX), Facebook Prophet (FB-Prophet), and long short-term memory (LSTM), are employed to enhance predictions of lake water levels. Results indicate that all models, except for FB-Prophet, perform well, particularly for Lakes Erie, Huron-Michigan, and Superior. However, MLR and LSTM show reduced performance for Lakes Ontario and St. Clair. NARX emerges as the top performer across all lakes, with Lakes Erie and Superior exhibiting low error metrics, namely root mean square error (RMSE: 0.048 and 0.034), mean absolute error (MAE: 0.036 and 0.026), and mean absolute percent error (MAPE: 0.021% and 0.014%), alongside high R-squared values (R2: 0.977 and 0.968), respectively. Similarly, for Lake Huron-Michigan, NARX demonstrates exceptional predictive precision with an RMSE of 0.029, an MAE of 0.022, a MAPE of 0.013%, and an outstanding R2 value of 0.995. Despite slightly higher error metrics, NARX consistently performs well for Lake Ontario. However, Lake St. Clair presents challenges for predictive performance across all models, with NARX maintaining relatively strong metrics (RMSE of 0.076, MAE of 0.050, MAPE of 0.029%, and R2 of 0.953), reaffirming its position as the leading model for water level prediction in the Great Lakes basin. The findings of this study suggest that the NARX model accurately predicts water levels, providing insights for managing water resources in the Great Lakes region.
Introduction
Freshwater resources play an indispensable role in sustaining our ecosystems due to their multifaceted contributions that include agricultural irrigation, economic progress through tourism, preservation of natural aesthetics, preservation of biodiversity, and many other important factors (Barzegar et al. 2021; Buyukyildiz et al. 2014; Choi et al. 2019; García Molinos et al. 2015; Zhu et al. 2020a). Despite providing myriad benefits to human societies and biodiversity, freshwater resources are inherently limited. Freshwater, which is essential to support life on Earth, makes up approximately 2.5% of the water that covers about 70% of the planet’s surface (Shiklomanov 2000). Freshwater is often challenging to access due to its predominant presence in glaciers and ice caps, which account for approximately 68.7% of the global freshwater supply (Gleick 1993; Shiklomanov 2000). Freshwater lakes hold about 20.9% of the 1.2% of Earth’s freshwater that is available as surface water (Gleick 1993). According to the United States Environmental Protection Agency, the Great Lakes of North America, constituting 21% of the world’s surface freshwater, are the largest group of freshwater systems on Earth (US EPA 2023). The Great Lakes, encompassing Lake Erie, Lake Huron-Michigan, Lake Ontario, Lake Superior, and Lake St. Clair, possess a combined volume of approximately 22,671 km³. Native Americans have historically relied heavily on these lakes as a source of water. The Great Lakes region was home to a large population of American Indians who fished, hunted, and used water for all manner of purposes (Hudson and Ziegler 2014). However, they respected and cherished these lakes, and as a result they kept them pristine. Both the surroundings near the lakes and the lakes themselves have seen significant change during the past few hundred years. 
The greatest contributors to environmental change near the Great Lakes have been industrialization leading to commercial and agricultural growth, followed by climate change and its potential long-term effects (Hartig et al. 2020; Mortsch 1998; Mortsch and Quinn 1996).
Several decades ago, Keddy and Reznicek (1986) discussed how water level fluctuations can alter ecosystems, especially vegetation. In their study, they compared periods of low water level with periods of high water level. Their observations suggest that during low water level periods, the soil becomes less anoxic, plants attempt to adapt to living in drier conditions, and vegetation generally changes as plants that cannot survive in dry conditions gradually give way to species that tolerate them. The opposite process can be seen at times of high water. According to studies conducted a few decades ago, the water level in some Great Lakes could rise or fall (depending on the pattern) by 4.5 feet (1.37 m) over the next hundred years (Annin 2006; Hall and Stuntz 2008). Water level changes are a particularly important issue due to potential environmental changes, so monitoring and recording the water level in these large freshwater reservoirs over an extended period of time is essential. In addition to having a significant impact on businesses, homes, and habitats of aquatic species in the Great Lakes region, these changes have the potential to change the weather, resulting in prolonged rain. Therefore, analyzing the findings can help effectively manage the water resources of the Great Lakes. The Great Lakes Environmental Research Laboratory (GLERL), a facility of the National Oceanic and Atmospheric Administration (NOAA) Office of Oceanic and Atmospheric Research (OAR), has been involved in a number of observation and monitoring initiatives (https://www.glerl.noaa.gov/data/dashboard/data/). Since 1860, GLERL has been collecting information on the water levels of all the lakes in the Great Lakes basin, with the exception of Lake St. Clair, where water levels have been monitored since 1898. 
In addition to water levels data, NOAA has also been collecting data on air temperature, evaporation and precipitation in the Great Lakes basin over an extended period of time.
In recent years, significant research has been conducted to study water level fluctuations of lakes in various parts of the world, including the Great Lakes, using machine learning methods. Machine learning (ML) is a subdivision of artificial intelligence (AI) that allows us to train the computer to accurately predict the output for given inputs (Ciaburro and Iannace 2021; Janiesch et al. 2021; Jordan and Mitchell 2015). Recently, ML algorithms have become very popular, especially in engineering and science, and are used in numerous applications such as image recognition, medicine, language translation, computer vision, social media platforms, and others (Demir and Yaseen 2023; Injadat et al. 2021; Jordan and Mitchell 2015; Mirzania et al. 2023a; Sarker 2021; Sharma et al. 2021; Zhou et al. 2023). However, the application of ML algorithms to predict water levels in lakes and reservoirs is still under development. Zhu et al. (2023a) developed two deep data-driven models, including gated recurrent unit (GRU) and long short-term memory (LSTM), coupled with an attention mechanism for forecasting daily lake water levels in Poland. Their study revealed that LSTM with attention mechanism generally outperforms GRU with attention mechanism, although on average across different lakes, GRU emerges as the best-performing deep learning model. Furthermore, zero-order forecast models excel in predicting the next day, while deep learning models demonstrate improved performance with longer prediction horizons. Zhu et al. (2020b) employed a feed forward neural network (FFNN) and a deep learning (DL) technique to predict monthly lake water levels in Poland. Their results demonstrated that the FFNN model slightly outperformed the DL model, suggesting that traditional ML models are sufficient for predicting water levels if properly trained. Saroughi et al. (2024) utilized the Shannon entropy method and developed a hybrid model for predicting groundwater level (GWL) in the Tabriz plain of Iran. 
They reported that the performance of the standalone model significantly improved with the proposed method, with the Honey Badger algorithm (HBA)-ANN performing marginally better than the Coot-ANN model. Additionally, Mirzania et al. (2023b) employed a hybrid algorithm of innovative gunner-support vector regression (AIG-SVR) and SVR models to accurately estimate daily reference evapotranspiration (ET0) in Australia. Their results showed that the AIG-SVR provides better results than the classic SVR. Similarly, Mirzania et al. (2023c) developed a hybrid COOT-ANN predictive model and evaluated its performance in predicting ET0 in Australia, comparing its performance with that of the standalone ANN model. Their findings indicated that the COOT-ANN hybrid model surpasses the ANN model in performance. Bonakdari et al. (2019) employed several AI models such as the Minimax Probability Machine Regression (MPMR), Relevance Vector Machine (RVM), Gaussian Process Regression (GPR) and Extreme Learning Machine (ELM) to predict water level fluctuations in Lake Huron. The study reported that the MPMR is the best model for predicting water level fluctuations in Lake Huron. Similarly, Altunkaynak (2014) estimated water level fluctuations in Lake Michigan-Huron using a combination of three methods: wavelet transform, fuzzy logic, and multilayer perceptron (artificial neural network). In addition, Wang and Wang (2020) used a set of ML algorithms, namely Gaussian process (GP), multiple linear regression (MLR), multilayer perceptron (MLP), M5P model tree, random forest (RF), and k-nearest neighbor (KNN) to predict the water level in Lake Erie. While these studies are useful for estimating the water levels of Lake Huron, Lake Michigan-Huron, and Lake Erie, they do not provide complete statistical information about all the lakes in the Great Lakes basin. Furthermore, Coulibaly (2010) compared the results of water levels of only major lakes (excluding Lake St. Clair) in the Great Lakes region using several neural networks such as echo state network (ESN), recurrent neural networks (RNN), and Bayesian neural networks (BNN).
This study addresses the correlation between water levels and key meteorological features, including air temperature, evaporation, and precipitation, to accurately predict fluctuations in water levels in the Great Lakes. To achieve this goal, the study employs various models, namely multiple linear regression (MLR), the nonlinear autoregressive network with exogenous inputs (NARX), Facebook Prophet (FB-Prophet), and long short-term memory (LSTM), all of which are based on historical datasets. The selection of these models for water level prediction was based on their respective strengths and suitability for the task. MLR, chosen for its simplicity and interpretability, captures linear relationships between predictor variables and water levels effectively. NARX models accommodate nonlinear relationships and temporal dependencies, which are crucial for capturing the complex dynamics of water level fluctuations (Zhu et al. 2023b). LSTM’s capability to capture long-term dependencies in sequential data is essential for modeling the intricate temporal relationships inherent in water level prediction tasks. These models were preferred over others due to their alignment with the specific requirements and objectives of the study, highlighting their effectiveness in addressing the complexities of water level forecasting. In this investigation, air temperature, evaporation, precipitation, and lagged water levels from the preceding month were selected as the independent variables for analysis. Simultaneously, current water levels were identified as the dependent variable, serving as the target for predictive modeling efforts within the Great Lakes basin. Based on the comprehensive evaluation metrics and accurately predicted results, the findings strongly suggest that the NARX model emerges as a well-suited and reliable tool for predicting lake water levels in the Great Lakes. 
These results hold significant implications for enhancing our understanding of the region’s hydrological dynamics and can be instrumental in guiding effective water resource management strategies. Although prior studies have contributed valuable insights into water level prediction within the Great Lakes basin, they have not comprehensively addressed statistical information for all lakes within the region. To the best of my knowledge, this study is the first to provide detailed assessments for all lakes, including Lake St. Clair, in the Great Lakes basin using a variety of models.
Materials and methods
Study area
Freshwater is a vital necessity for the survival of human life and a wide range of biota. The distribution of global water, including freshwater, on Earth is illustrated in Fig. 1(a) (USGS 1993). The Great Lakes, also known as the Laurentian Great Lakes, account for approximately 20.9% of global surface freshwater and lie along the border between the United States (59%) and Canada (41%) (Bonakdari et al. 2019; Gleick 1993; Xue et al. 2022). These lakes are Lake Erie, Lake Huron-Michigan, Lake Ontario, Lake Superior, and Lake St. Clair, as shown in Fig. 1(b).
It is important to note that Lake Huron and Lake Michigan have the same hydrological characteristics and are connected to each other by the Straits of Mackinac, and hence are treated as a single lake (Anderson and Schwab 2013). The Great Lakes basin covers an area of approximately 764,046 km² (295,000 mi²) (Neff and Nicholas 2005). It stretches for about 1126 km (700 miles) from north to south and about 1448 km (900 miles) from the west to the outlet of Lake Ontario at Cornwall and Massena in the east. The region encompasses eight states (Minnesota, Wisconsin, Illinois, Indiana, Michigan, Ohio, Pennsylvania, and New York) and one province, Ontario, in addition to the Great Lakes (Neff and Nicholas 2005; Wilcox et al. 2007). The Great Lakes basin is home to approximately 33 million people, representing roughly 10% of the U.S. population and 30% of the Canadian population (Danz et al. 2007; Wilcox et al. 2007). Lake Superior is the largest individual lake in terms of volume and surface area in the Great Lakes basin, with a surface area of approximately 82,100 km² (31,700 mi²). However, when Lake Michigan and Lake Huron are combined into a single lake, they not only become the largest lake in the region but also claim the title of the world’s largest freshwater lake by surface area, with a total surface area of approximately 117,400 km² (45,300 mi²). On the other hand, Lake St. Clair, which is considered part of the Great Lakes system, is the smallest lake, with a surface area of approximately 1,114 km² (430 mi²). The physical characteristics and basic statistical parameters of all lakes in the Great Lakes basin are given in Table 1 (Inn and Port Huron 1999; US EPA 2023).
Data source
A time series, comprising chronologically ordered observations, serves as a valuable resource for researchers across diverse fields, including engineering, physical sciences, and social sciences (Ghaderpour et al. 2021; Moraffah et al. 2021; Parzen 1961). In the present study, monthly mean time series data of water level, air temperature, evaporation, and precipitation of the Great Lakes, including Lake St. Clair, for the period from 1950 to 2010 were used. These monthly time series data were acquired from the NOAA website. All analyses were conducted using various models in Python (version 3.9.13) and MATLAB (version R2022a) environments. The flowchart of the proposed methodology for water level prediction is illustrated in Fig. 2.
When working with ML models, there isn’t a single, universally applicable method for input selection (Babel and Shinde 2011). The body of research demonstrates that researchers have thus far employed heuristic methodologies, sensitivity analysis, and linear cross-correlation techniques (Bowden et al. 2005; Piasecki et al. 2015). As hydrological factors such as air temperature, evaporation, and precipitation have an impact on lake water levels, the selection of input for forecasting Great Lakes water levels was made depending on the specific ML models utilized.
Before the raw data were used in some ML models, they were normalized using the min-max normalization method and scaled between 0 and 1 with the following equation:
\[ {x}_{norm}=\frac{x-{x}_{min}}{{x}_{max}-{x}_{min}} \tag{1} \]
Here, xnorm represents the normalized data, x represents the observed data, and xmin and xmax denote the minimum and maximum values of the observed data, respectively.
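The min-max scaling described above can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions: the function names and the sample water levels are hypothetical and do not come from the study's code or data.

```python
def min_max_normalize(values):
    """Scale a sequence of numbers to the range [0, 1] (min-max normalization)."""
    x_min, x_max = min(values), max(values)
    span = x_max - x_min
    if span == 0:
        # A constant series has no spread; map every entry to 0.0 by convention.
        return [0.0 for _ in values]
    return [(x - x_min) / span for x in values]


def min_max_denormalize(scaled, x_min, x_max):
    """Invert the scaling so predictions can be read in the original units."""
    return [s * (x_max - x_min) + x_min for s in scaled]


# Hypothetical monthly mean water levels in metres (illustrative values only).
levels = [174.0, 174.2, 174.5, 174.1, 174.4]
scaled = min_max_normalize(levels)
restored = min_max_denormalize(scaled, min(levels), max(levels))
```

Keeping the minimum and maximum of the training data is useful in practice, since the same two values are needed to map model outputs back to metres.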
There are several approaches to employing data partitioning in ML. When training models, the given data is typically divided into three distinct sets: training, validation, and testing, usually with ratios of 70/15/15 or 80/10/10, respectively. In this study, the datasets were randomly split into training, validation, and testing with a ratio of 70/15/15.
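A 70/15/15 partition can be sketched as follows. The helper name `split_indices`, its parameters, and the seed are assumptions made for illustration; the sketch supports both a chronological split (the default here) and a shuffled one, and it is not the study's actual partitioning code.

```python
import random


def split_indices(n, train_frac=0.70, val_frac=0.15, shuffle=False, seed=0):
    """Return (train, val, test) index lists for n samples.

    With shuffle=False the split is chronological; with shuffle=True the
    indices are permuted first using a fixed seed for reproducibility.
    """
    indices = list(range(n))
    if shuffle:
        random.Random(seed).shuffle(indices)
    n_train = int(n * train_frac)
    n_val = int(n * val_frac)
    train = indices[:n_train]
    val = indices[n_train:n_train + n_val]
    test = indices[n_train + n_val:]  # remainder goes to the test set
    return train, val, test


# Monthly data from 1950 to 2010 inclusive gives 61 * 12 = 732 samples.
n_months = 61 * 12
train_idx, val_idx, test_idx = split_indices(n_months)
shuffled = split_indices(n_months, shuffle=True, seed=42)
```

For time series, a chronological split avoids training on observations that come after the test period, which is why it is the default in this sketch.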
Prediction model
ML has recently gained popularity and proven to be a useful model for classification, clustering, pattern recognition, and prediction in a variety of fields (Lee et al. 2018; Mirzania et al. 2023a; Saroughi et al. 2024; Wu et al. 2014). Additionally, artificial neural networks (ANNs), a subset of ML, serve as nonlinear statistical data models that mimic the functionality of biological neurons (Ghimire et al. 2021; Zhu et al. 2023b). While numerous studies have significantly benefited from the use of classical approaches, ML, particularly ANNs, offer several advantages over their counterparts in time series forecasting because ML and ANNs can handle nonlinear data that does not follow a normal distribution (Hansen et al. 1999). Moreover, ML can efficiently process extensive datasets and identify potential interactions among predictor variables (Ghiasi and Koushki 2020).
In the course of this study, diverse models, particularly MLR, NARX, FB-Prophet, and LSTM, were employed to predict lake water levels within the Great Lakes basin. The optimization of each model was carried out by adjusting the hyperparameters (Jamous et al. 2021). Each of the models for predicting water levels is briefly described below.
Multiple linear regression (MLR)
MLR, an extension of linear regression, is a statistical approach employed in data analysis and modeling to investigate the correlation between a dependent variable and two or more independent predictors (Choden et al. 2022; Uyanık and Güler 2013). The following expression presents the equation that characterizes the MLR model:
\[ Y={\beta }_{0}+{\beta }_{1}{X}_{1}+{\beta }_{2}{X}_{2}+{\beta }_{3}{X}_{3}+\dots +{\beta }_{p}{X}_{p}+\epsilon \tag{2} \]
where \(Y\) is the estimated value of the dependent variable, \({\beta }_{0}\) is the \(Y\)-intercept, \({\beta }_{1},{\beta }_{2},{\beta }_{3},\dots ,{\beta }_{p}\) represent the regression coefficients of the independent predictors \({X}_{1},{X}_{2},{X}_{3}, \dots , {X}_{p}\), and \(\epsilon\) is the error term. In the context of MLR, water level values lagged by one month were employed to predict the water levels in the lakes.
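As a rough illustration of how the coefficients of such a model can be estimated, the sketch below fits an MLR by ordinary least squares through the normal equations, using only the Python standard library. The function names and the synthetic data are assumptions for illustration; in the study the predictors would be air temperature, evaporation, precipitation, and the lagged water level, and a library routine would normally be used instead.

```python
def fit_mlr(X, y):
    """Ordinary least squares via the normal equations (X^T X) b = X^T y.

    X is a list of predictor rows, y the list of targets. Returns
    [beta_0, beta_1, ..., beta_p] with beta_0 as the intercept.
    """
    rows = [[1.0] + list(r) for r in X]  # prepend the intercept column
    p = len(rows[0])
    # Augmented matrix [X^T X | X^T y].
    A = [[sum(r[i] * r[j] for r in rows) for j in range(p)]
         + [sum(r[i] * t for r, t in zip(rows, y))] for i in range(p)]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(p):
        pivot = max(range(col, p), key=lambda r: abs(A[r][col]))
        A[col], A[pivot] = A[pivot], A[col]
        if abs(A[col][col]) < 1e-12:
            raise ValueError("predictors are linearly dependent")
        for r in range(p):
            if r != col:
                factor = A[r][col] / A[col][col]
                A[r] = [a - factor * b for a, b in zip(A[r], A[col])]
    return [A[i][p] / A[i][i] for i in range(p)]


def predict_mlr(beta, x_row):
    """Evaluate Y = beta_0 + sum_k beta_k * X_k for one row of predictors."""
    return beta[0] + sum(b * x for b, x in zip(beta[1:], x_row))


# Synthetic, noise-free data generated from Y = 1.0 + 0.5*X1 - 2.0*X2.
X = [(0, 0), (1, 0), (0, 1), (1, 1), (2, 1), (1, 3)]
y = [1.0 + 0.5 * x1 - 2.0 * x2 for x1, x2 in X]
beta = fit_mlr(X, y)
```

Because the synthetic data contain no noise, the fitted coefficients recover the generating values up to floating-point error.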
Nonlinear autoregressive network with exogenous inputs (NARX)
The NARX model is a modification of the nonlinear autoregressive (NAR) model that adds a relevant exogenous time-series variable as an extra input to the forecasting model. The NARX, which is based on the linear ARX model, is a recurrent dynamic neural network that is enclosed by feedback connections at several layers (Demuth et al. 1992). The equation describing the NARX model is given by the following expression:
\[ y\left(t\right)=f\left(y\left(t-1\right),y\left(t-2\right),\dots ,y\left(t-{n}_{y}\right),u\left(t-1\right),u\left(t-2\right),\dots ,u\left(t-{n}_{u}\right)\right) \tag{3} \]
where \(y\left(t\right)\) is the dependent (predicted) output value, \({n}_{u}\) is the number of time delays in the input, \({n}_{y}\) is the number of time delays in the output, \(y\left(t-1\right),y\left(t-2\right),\dots ,y\left(t-{n}_{y}\right)\) are previous values of the output, \(u\left(t-1\right),u\left(t-2\right),\dots ,u\left(t-{n}_{u}\right)\) are previous values of the independent (exogenous) input, and \(f\) is typically a nonlinear function. In the NARX model, the dependent output value \(y\left(t\right)\) is predicted based on previous values of the output and previous values of the independent (exogenous) input, as shown in Eq. 3. The NARX model can be implemented by approximating the function f with a feedforward neural network. The architecture of the NARX neural network is shown in Fig. 3. Here, IW1,1 is the connection weight between the input neuron and hidden neuron; LW1,3 is the connection weight between the hidden neuron and output feedback neuron; LW2,1 is the connection weight between the hidden neuron and predicted output; b1 is the bias of the hidden neuron; b2 is the bias of the predicted output; f1 is the hidden layer activation function; and f2 is the output layer activation function. In this architecture, the approximation is performed using a two-layer feedforward network (Demuth et al. 1992). In this model, previous water level values along with the aforementioned independent exogenous inputs were used to predict future values of the lake water levels. During training, the hidden layer used 10 neurons, with input and feedback delays of 1:2. The network was trained with the Levenberg-Marquardt backpropagation algorithm, discussed in detail elsewhere (Lv et al. 2017). Although the maximum number of epochs was set to 1000, the validation criteria were met between 15 and 35 epochs for all lakes.
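To make the role of the delays concrete, the sketch below builds the lagged input vectors that a NARX network receives in its open-loop (series-parallel) form, where past observed outputs are fed back as inputs. It assumes that "1:2" means delays of one and two time steps for both the exogenous inputs and the feedback; the function name and the toy series are hypothetical. The nonlinear function f itself (the trained network) is not part of this sketch.

```python
def build_narx_samples(y, u, n_y=2, n_u=2):
    """Build (inputs, targets) pairs for an open-loop NARX model.

    y : list of past output values (e.g. monthly water levels)
    u : list of exogenous input vectors, one list per time step
    Each input row is [y(t-1), ..., y(t-n_y), u(t-1), ..., u(t-n_u)],
    and the matching target is y(t).
    """
    start = max(n_y, n_u)  # first t for which every delayed value exists
    inputs, targets = [], []
    for t in range(start, len(y)):
        past_y = [y[t - k] for k in range(1, n_y + 1)]
        past_u = [v for k in range(1, n_u + 1) for v in u[t - k]]
        inputs.append(past_y + past_u)
        targets.append(y[t])
    return inputs, targets


# Toy series: one output and one exogenous variable per time step.
y = [10, 11, 12, 13, 14]
u = [[1], [2], [3], [4], [5]]
inputs, targets = build_narx_samples(y, u)
```

With five time steps and delays of up to two steps, three training pairs are produced; the first row pairs y(1), y(0), u(1), u(0) with the target y(2).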
Facebook prophet (FB-Prophet)
FB-Prophet, developed by Facebook’s data science team as open source, is a forecasting tool designed for time series analysis and forecasting (Battineni et al. 2020; ChikkaKrishna et al. 2022). FB-Prophet finds extensive application across various domains, including business, finance, and economics, for forecasting purposes. The primary methodology employed by FB-Prophet involves a decomposable time series model that encompasses three key model components: trend, seasonality, and holidays (Daraghmeh et al. 2021; Garlapati et al. 2021). The combination of these three components can be mathematically represented by the following equation:
\[ y\left(t\right)=g\left(t\right)+s\left(t\right)+h\left(t\right)+{\epsilon }_{t} \]
where \(y\left(t\right)\) represents the additive regression model, \(g\left(t\right)\) represents the trend function (or growth term), \(s\left(t\right)\) describes seasonality, \(h\left(t\right)\) represents holiday effects, and \({\epsilon }_{t}\) is an error term. For training the FB-Prophet model, the interval width, change point prior scale, seasonality prior scale, change point range, and uncertainty samples were set to 0.9, 2, 9, 0.6, and 1000, respectively.
Long short-term memory (LSTM)
The LSTM represents a specialized variant within the domain of recurrent neural networks (RNNs) (Sherstinsky 2020; Wunsch et al. 2021). It is specifically engineered for the management and analysis of sequential data, rendering it highly suitable for applications associated with time series data, natural language processing, and a variety of other sequential data processing tasks (Cao et al. 2018; Sagheer and Kotb 2019; Wunsch et al. 2021; Yang and Zhang 2022). The LSTM addresses certain limitations present in conventional RNNs, including the vanishing gradient problem, which has the potential to prevent deep networks from being trained on long sequences. In that sense, the LSTM is distinguished from traditional RNNs by its ability to capture and remember long-range dependencies in data. A typical LSTM unit comprises four main components: a cell, an input gate, an output gate, and a forget gate, which allow it to selectively store and retrieve information over time (Mohan and Gaitonde 2018; Wunsch et al. 2021). The schematic diagram of the recurrent neural network (top) and the LSTM architecture (bottom) is shown in Fig. 4, and the corresponding LSTM equations are as follows:
\[ {f}_{t}=\sigma \left({W}_{f}\cdot \left[{h}_{t-1},{x}_{t}\right]+{b}_{f}\right) \]
\[ {i}_{t}=\sigma \left({W}_{i}\cdot \left[{h}_{t-1},{x}_{t}\right]+{b}_{i}\right) \]
\[ {\stackrel{\sim}{C}}_{t}=\mathrm{tanh}\left({W}_{C}\cdot \left[{h}_{t-1},{x}_{t}\right]+{b}_{C}\right) \]
\[ {C}_{t}={f}_{t}\odot {C}_{t-1}+{i}_{t}\odot {\stackrel{\sim}{C}}_{t} \]
\[ {o}_{t}=\sigma \left({W}_{o}\cdot \left[{h}_{t-1},{x}_{t}\right]+{b}_{o}\right) \]
\[ {h}_{t}={o}_{t}\odot \mathrm{tanh}\left({C}_{t}\right) \]
Here, \({f}_{t}\) represents the forget gate, which specifies what information from the prior cell state \(\left({C}_{t-1}\right)\) should be forgotten; the input gate \({i}_{t}\) controls what new information should be added to the cell state \(\left({C}_{t}\right)\); \({\stackrel{\sim}{C}}_{t}\) represents the new candidate values to be added to the cell state; \({C}_{t}\) is the updated cell state, which combines the retained part of the previous state with the new information; and the output gate \({o}_{t}\) regulates what information from the cell state \(\left({C}_{t}\right)\) is output as the hidden state \(\left({h}_{t}\right)\). In the above equations, \(\sigma\) is the sigmoid activation function, tanh is the hyperbolic tangent activation function, \({W}_{f}, {W}_{i},{W}_{C}, {W}_{o}\) represent weight matrices for the forget gate, input gate, candidate cell state, and output gate, respectively, \({b}_{f},{b}_{i},{b}_{C},{b}_{o}\) represent bias vectors for the corresponding gates, \({x}_{t}\) is the input at time step \(t\), and \({h}_{t-1}\) is the hidden state at the previous time step \(t-1\). These equations provide a comprehensive description of how an LSTM cell manages sequential data, maintains and refreshes information, and governs the transmission of data to subsequent time steps or the output. In this model, one-month lagged water level values and three stacked LSTM layers with varying units (128, 64, and 64) were utilized to predict the water levels of lakes. The Rectified Linear Unit (ReLU) and Adam were used as the activation function and optimizer, respectively, with 100 epochs and a batch size of 64.
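The gate computations can be traced with a single-unit (scalar) LSTM step, sketched below in plain Python. It follows the standard LSTM formulation with sigmoid gates and tanh candidate values; the function names, the dictionary layout of the weights, and the numeric inputs are assumptions for illustration only, and the sketch does not reproduce the stacked 128/64/64-unit network used in the study.

```python
import math


def sigmoid(z):
    """Logistic function used by the forget, input, and output gates."""
    return 1.0 / (1.0 + math.exp(-z))


def lstm_step(x_t, h_prev, c_prev, W, b):
    """One time step of a scalar LSTM cell.

    W[g] = (w_h, w_x) holds the weights applied to h_{t-1} and x_t for
    gate g, and b[g] is the matching bias, with g in {"f", "i", "c", "o"}.
    Returns the new hidden state h_t and cell state C_t.
    """
    def pre_activation(g):
        w_h, w_x = W[g]
        return w_h * h_prev + w_x * x_t + b[g]

    f_t = sigmoid(pre_activation("f"))        # forget gate
    i_t = sigmoid(pre_activation("i"))        # input gate
    c_tilde = math.tanh(pre_activation("c"))  # candidate cell values
    c_t = f_t * c_prev + i_t * c_tilde        # updated cell state
    o_t = sigmoid(pre_activation("o"))        # output gate
    h_t = o_t * math.tanh(c_t)                # new hidden state
    return h_t, c_t


# With all weights and biases at zero, every gate equals sigmoid(0) = 0.5
# and the candidate equals tanh(0) = 0, so the cell state is simply halved.
zero_W = {g: (0.0, 0.0) for g in "fico"}
zero_b = {g: 0.0 for g in "fico"}
h_t, c_t = lstm_step(x_t=0.3, h_prev=0.1, c_prev=1.0, W=zero_W, b=zero_b)
```

The zero-weight case is a convenient sanity check: the forget gate keeps half of the previous cell state, nothing new is written, and the hidden state is half of tanh of the new cell state.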
The parameters of each algorithm used for predicting water levels in the Great Lakes region are given in Table 2.
Model performance evaluation
In this study, four widely used statistical indicators were employed to assess the effectiveness of the models used to forecast lake water levels in the Great Lakes basin. These four indicators are defined as follows:
i. Root mean square error (RMSE) measures the residuals between the predicted and observed values. The RMSE is non-negative; for data scaled between 0 and 1, it generally falls between 0 and 1. Higher prediction accuracy is associated with a lower RMSE value. The mathematical expression for the RMSE is given by the following equation:
\[ \mathrm{RMSE}=\sqrt{\frac{1}{n}\sum_{i=1}^{n}{\left({y}_{i}-\widehat{{y}_{i}}\right)}^{2}} \]
where n is the total number of data points, \({y}_{i}\) is the observed (actual) value, and \(\widehat{{y}_{i}}\) denotes the predicted value.
ii. Mean absolute error (MAE) measures the average magnitude of the errors between the observed and predicted values. A lower MAE signifies higher model performance. The equation for MAE can be expressed as:
\[ \mathrm{MAE}=\frac{1}{n}\sum_{i=1}^{n}\left|{y}_{i}-\widehat{{y}_{i}}\right| \]
where n is the total number of data points, \({y}_{i}\) is the observed value, and \(\widehat{{y}_{i}}\) is the predicted value.
iii. The mean absolute percent error (MAPE) measures the average absolute percentage difference between the observed values and the predicted values. A lower MAPE signifies an improved predictive model because it indicates that the predictions are closer to the actual values in terms of percentage error. The equation for MAPE can be written as:
\[ \mathrm{MAPE}=\frac{100}{n}\sum_{i=1}^{n}\left|\frac{{y}_{i}-\widehat{{y}_{i}}}{{y}_{i}}\right| \]
where n is the total number of data points, \({y}_{i}\) is the observed value, and \(\widehat{{y}_{i}}\) is the predicted value.
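iv. Coefficient of determination (R2), which ranges from 0 to 1, expresses how accurately a statistical model forecasts an outcome. Forecasting performance improves when the value of R2 is close to 1. The expression for R2 is given by:
\[ {R}^{2}=\frac{{\left[\sum_{i=1}^{n}\left({y}_{i}-\overline{{y}_{i}}\right)\left(\widehat{{y}_{i}}-\overline{\widehat{{y}_{i}}}\right)\right]}^{2}}{\sum_{i=1}^{n}{\left({y}_{i}-\overline{{y}_{i}}\right)}^{2}\sum_{i=1}^{n}{\left(\widehat{{y}_{i}}-\overline{\widehat{{y}_{i}}}\right)}^{2}} \]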
Here, n is the total number of data points, \({y}_{i}\), \(\overline{{y}_{i}}\), \(\widehat{{y}_{i}}\), and \(\overline{\widehat{{y}_{i}}}\) represent the observed value, mean of the observed value, the predicted value, and mean of the predicted value, respectively.
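The four indicators can be computed with a few lines of standard-library Python, as sketched below. The function names and the sample values are hypothetical. For R2, the sketch assumes the squared-correlation form, which is the form suggested by the definitions that involve the means of both the observed and the predicted values; other definitions of R2 exist and can give different numbers.

```python
import math


def rmse(obs, pred):
    """Root mean square error."""
    return math.sqrt(sum((y - p) ** 2 for y, p in zip(obs, pred)) / len(obs))


def mae(obs, pred):
    """Mean absolute error."""
    return sum(abs(y - p) for y, p in zip(obs, pred)) / len(obs)


def mape(obs, pred):
    """Mean absolute percent error (observed values must be non-zero)."""
    return 100.0 / len(obs) * sum(abs((y - p) / y) for y, p in zip(obs, pred))


def r_squared(obs, pred):
    """Squared Pearson correlation between observed and predicted values."""
    mean_obs = sum(obs) / len(obs)
    mean_pred = sum(pred) / len(pred)
    cov = sum((y - mean_obs) * (p - mean_pred) for y, p in zip(obs, pred))
    var_obs = sum((y - mean_obs) ** 2 for y in obs)
    var_pred = sum((p - mean_pred) ** 2 for p in pred)
    return cov ** 2 / (var_obs * var_pred)


# Hypothetical observed and predicted values (illustration only).
observed = [2.0, 4.0, 6.0, 8.0]
predicted = [2.5, 3.5, 6.5, 7.5]
scores = {
    "RMSE": rmse(observed, predicted),
    "MAE": mae(observed, predicted),
    "MAPE": mape(observed, predicted),
    "R2": r_squared(observed, predicted),
}
```

Note that MAPE divides by the observed value, so for water levels of roughly 75 to 185 m an absolute error of a few centimetres translates into a very small percentage, which is consistent with the small MAPE values reported in this study.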
Results
In order to accurately estimate the monthly water level fluctuations in Lakes Erie, Huron-Michigan, Ontario, Superior, and St. Clair, located within the Great Lakes basin, a data partitioning strategy was employed. The dataset was subdivided into three distinct sets, consisting of a training set (70%), a validation set (15%), and a testing set (15%). The initial training phase of each model utilized the training dataset, spanning from 01/01/1950 to 01/10/2001. During this phase, the model learned how to process information and make predictions. Subsequently, hyperparameter optimization was performed using the validation dataset. Once the model was constructed and optimized using the training and validation datasets, test data covering the time period from 01/11/2001 to 01/12/2010 were employed to assess its predictive capabilities. A diverse set of performance indicators, including RMSE, MAE, MAPE, and R2, was employed to assess the effectiveness and accuracy of the various models.
Figure 5 displays graphical representations illustrating the partitioning of data for all lakes in the Great Lakes region. The training and testing parts are represented by solid magenta and solid blue lines, respectively. For the sake of clarity, a vertical black dashed line is used to distinguish the training section from the testing section.
Figure 6 shows regression fits for a comparative analysis between the observed and estimated monthly water levels for each model across all lakes in the Great Lakes basin. Each row corresponds to a lake, while each column is associated with a model. The results obtained from the MLR, NARX, and LSTM models are in good agreement with the observed water levels, yet the best performance is achieved from the NARX model for all lakes. The regression lines of the NARX model for all lakes have a slope of almost unity, indicating that all data points are roughly distributed very close to the regression lines.
Referring to the performance metrics in Table 3 for all models across each lake, the NARX model consistently exhibits the best values for RMSE, MAE, MAPE, and R2. Additionally, application of the NARX model yields the most favorable performance metrics for Lake Huron-Michigan, with the lowest values recorded for RMSE (0.029), MAE (0.022), and MAPE (0.013%), alongside the highest R2 (0.995) value, when compared to all other lakes.
The performance of each model is visually presented in Fig. 7, where bar graphs depict the RMSE, MAE, and MAPE for all models across every lake, excluding R2 for clarity due to its significantly larger values compared to other performance indicators. While comparing performance metrics for all lakes, it is evident that the NARX outperforms other models. However, the least satisfactory water level predictions across all lakes are observed with FB-Prophet.
Similarly, Fig. 8 displays quantitative analysis plots for the observed and predicted water levels as a function of time for various models across all lakes. The blue solid line shows the observed water level, while the green dotted line, red dashed line, purple square, and mustard dash-dotted line represent the predicted water levels of MLR, NARX, FB-Prophet, and LSTM, respectively. The time series plots for all lakes clearly demonstrate that the predicted water level values for all models except FB-Prophet are in excellent agreement with the observed water level values, as the predicted lines closely follow the trend of the observed line.
Figure 9 shows notched boxplots illustrating the comparison between observed and predicted lake water levels for each model. The notched boxplots clearly demonstrate that each model, across all lakes, could accurately predict median water levels, as their notches overlap with those of the observed water levels. Similarly, each model, with the exception of FB-Prophet, nearly captured both the first (Q25) and third (Q75) quartiles of the observed water level values and accurately predicted the maximum values of the observed water levels for all lakes. As for the minimum water level values across all lakes, the best prediction was obtained using the NARX and LSTM models. Furthermore, an outlier beyond the whiskers is clearly visible for MLR and LSTM in the case of Lake St. Clair. Although all models except FB-Prophet provided good accuracy for water level prediction in all lakes, the overall best performance was achieved using the NARX model.
Discussion
The prediction and monitoring of freshwater levels are essential for both foreseeing and addressing the challenges posed by water scarcity, floods, and environmental degradation. This enables the effective management of resources and promotes sustainable development. There has been great interest in studies focusing on predicting freshwater levels of the Great Lakes. According to the evaluation metric values presented in Table 3, the predictions generated by the ML models in this study demonstrate an acceptable level of accuracy when compared to findings from previous studies. For instance, Coulibaly (2010) utilized various neural networks to forecast water levels exclusively for the major lakes in the Great Lakes region, except for Lake St. Clair, and assessed the accuracy of the predictions using RMSE and correlation coefficient (r). In his study, the best RMSE values obtained for Lakes Erie, Huron-Michigan, Ontario, and Superior are 0.06, 0.04, 0.08, and 0.03, respectively. These values are higher than those presented in Table 3 in this study, with the exception of Lake Superior, where the value is almost the same. In addition, Altunkaynak (2014) predicted water levels for Lake Michigan-Huron, attaining an RMSE value of 0.112 by employing a combination of three methods. It is evident that this value significantly exceeds the one presented in Table 3 in this study. In a recently published paper, Wang and Wang (2020) used various ML models to predict Lake Erie water levels and evaluated them using RMSE, MAE, r, and mutual information (MI) performance metrics. The RMSE and MAE values achieved for Lake Erie in their study are 0.02 and 0.01, respectively, which closely resemble those obtained in this study. Similarly, Barzegar et al. (2021) conducted water level forecasting in Lakes Ontario and Michigan utilizing various ML models, which were evaluated based on numerous metrics. 
Their report showed that Lake Ontario achieved RMSE and MAE values of 0.082 and 0.064, respectively, as the best outcomes, which are higher than the values obtained in this study.
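For readers who wish to reproduce such comparisons, the four metrics can be computed as in the minimal sketch below. It assumes the common textbook definitions, with R2 taken as the coefficient of determination; the exact formulas used in this study may differ. The function names and sample values are hypothetical.

```python
import math


def rmse(obs, pred):
    """Root mean square error."""
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(obs, pred)) / len(obs))


def mae(obs, pred):
    """Mean absolute error."""
    return sum(abs(o - p) for o, p in zip(obs, pred)) / len(obs)


def mape(obs, pred):
    """Mean absolute percent error, in percent (observations must be nonzero)."""
    return 100.0 * sum(abs((o - p) / o) for o, p in zip(obs, pred)) / len(obs)


def r_squared(obs, pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_obs = sum(obs) / len(obs)
    ss_res = sum((o - p) ** 2 for o, p in zip(obs, pred))
    ss_tot = sum((o - mean_obs) ** 2 for o in obs)
    return 1.0 - ss_res / ss_tot


# Hypothetical observed and predicted water levels in metres (illustration only).
observed = [174.10, 174.22, 174.35, 174.41, 174.30]
predicted = [174.12, 174.20, 174.33, 174.44, 174.28]

for name, metric in [("RMSE", rmse), ("MAE", mae), ("MAPE", mape), ("R2", r_squared)]:
    print(name, metric(observed, predicted))
```

Because MAPE divides each error by the observed level, an error of a few centimetres on a level of well over a hundred metres yields a value of only hundredths of a percent, which is why MAPE is numerically much smaller than MAE here.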
The findings of this study suggest that the NARX model has produced highly promising and encouraging results compared to other models. Performance assessments across various metrics during the testing period revealed significant improvements with the NARX model. Specifically, for Lake Erie, RMSE decreased by up to 21.3% and 11.1%, MAE decreased by up to 25% and 18.2%, MAPE decreased by up to 22.2% and 16%, and R2 increased by up to 10.4% and 7.6% compared to the MLR and LSTM models, respectively. Similarly, for Lake Huron-Michigan, RMSE decreased by up to 32.6% and 29.3%, MAE decreased by up to 37.1% and 33.3%, MAPE decreased by up to 35% and 31.6%, and R2 increased by up to 6.8% and 5.9% compared to the MLR and LSTM models, respectively. Furthermore, for Lake Ontario, RMSE decreased by up to 39.6% and 34.4%, MAE decreased by up to 42% and 37.3%, MAPE decreased by up to 43.1% and 38%, and R2 increased by up to 16.4% and 12.7% compared to the MLR and LSTM models, respectively. Concerning Lake Superior, RMSE decreased by up to 21% and 30.6%, MAE decreased by up to 16.1% and 27.8%, MAPE decreased by up to 17.6% and 30%, and R2 increased by up to 5.3% and 8.3% compared to the MLR and LSTM models, respectively. Lastly, for Lake St. Clair, RMSE decreased by up to 14.6% and 13.6%, MAE decreased by up to 23.1% and 25.4%, MAPE decreased by up to 21.6% and 23.7%, and R2 increased by up to 33.8% and 32.2% compared to the MLR and LSTM models, respectively.
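The relative changes quoted above can be reproduced with a short calculation. The sketch below assumes that each percentage is computed relative to the comparison model's value (MLR or LSTM); that convention is an assumption, as are the function names. The NARX RMSE for Lake Erie (0.048) is taken from the text, while the baseline value is hypothetical and is not taken from Table 3.

```python
def percent_decrease(baseline, improved):
    """Percent reduction of an error metric relative to the baseline model."""
    return 100.0 * (baseline - improved) / baseline


def percent_increase(baseline, improved):
    """Percent gain of a score such as R2 relative to the baseline model."""
    return 100.0 * (improved - baseline) / baseline


narx_rmse_erie = 0.048  # NARX RMSE for Lake Erie, as reported in the text
baseline_rmse = 0.060   # hypothetical baseline RMSE, for illustration only

print(round(percent_decrease(baseline_rmse, narx_rmse_erie), 1))
```

The same two functions apply to MAE, MAPE, and R2 once the corresponding values for the comparison model are read from Table 3.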
Furthermore, to provide a comprehensive understanding of water level prediction performance across different models, visual comparisons are presented in both Figs. 6 and 8. These figures offer detailed insights into how the MLR, NARX, FB-Prophet, and LSTM models perform in capturing the underlying trends of the actual data. From the visualization, it is evident that all three models (MLR, NARX, and LSTM) exhibit commendable abilities to capture the overall trend of the observed data. However, upon closer examination, the NARX model emerges as the most accurate in fitting the data, showcasing its superior predictive capabilities. Notably, although the performance of all models, except for FB-Prophet, is considered satisfactory, particularly for Lakes Erie, Huron-Michigan, and Superior, it is the NARX model that consistently outperforms the others across all lakes in the Great Lakes basin. This highlights the robustness and reliability of the NARX model in predicting water levels across various hydrological settings.
Conclusions
The sustainability of our ecosystems significantly depends on the vital contribution of freshwater, highlighting the crucial need for monitoring water levels and ensuring the effective management of freshwater resources. The present study reports the prediction of monthly mean water levels for lakes in the Great Lakes basin based on historical datasets, including air temperature, evaporation, and precipitation, using a variety of models. By utilizing a comprehensive range of evaluation metrics, which encompass RMSE, MAE, MAPE, and R2, the effectiveness of four prominent models (MLR, NARX, FB-Prophet, and LSTM) was systematically evaluated across five significant lakes: Erie, Huron-Michigan, Ontario, Superior, and St. Clair. The results of the current study reveal that the predictive capability of all models, except for FB-Prophet, is in good agreement with the observed water levels, particularly for Lakes Erie, Huron-Michigan, and Superior. However, it is apparent that the predictive performance of MLR and LSTM is diminished for Lakes Ontario and St. Clair (Table 3). While MLR and LSTM offer satisfactory performance in predicting water levels in the Great Lakes, the NARX model achieves the best overall performance across all lakes, demonstrating superior prediction ability for water levels. In the case of Lake Erie, NARX emerges as the standout performer, boasting the lowest RMSE (0.048), MAE (0.036), and MAPE (0.021%) values, coupled with an impressive R2 score of 0.977. Similarly, for Lake Huron-Michigan, NARX once again stands out, demonstrating unparalleled predictive precision with an RMSE (0.029), MAE (0.022), MAPE (0.013%), and an exceptional R2 value of 0.995. When the focus shifts to Lake Ontario, NARX consistently demonstrates strong performance metrics, with an RMSE of 0.061, MAE of 0.047, MAPE of 0.062%, and R2 of 0.960, despite exhibiting slightly higher error metrics compared to other lakes. This observation underscores the complex interplay between the dynamics of the model and the unique characteristics of each lake. Meanwhile, in the case of Lake Superior, both NARX and LSTM exhibit commendable predictive accuracy, underscoring their efficacy in capturing the complex hydrological dynamics inherent to the region (Table 3). However, prediction becomes more challenging for Lake St. Clair, where predictive performance encounters significant obstacles across all models. Despite this, NARX maintains relatively strong performance metrics with an RMSE of 0.076, MAE of 0.050, MAPE of 0.029%, and R2 of 0.953, solidifying its position as the leading model for water level prediction in the Great Lakes basin. The findings of this research suggest that the current study can help effectively manage water resources and advance the knowledge of water level prediction in the Great Lakes region. Moreover, this study focuses solely on the correlation between water levels and meteorological features such as air temperature, evaporation, and precipitation for water level prediction in the Great Lakes basin. Other relevant variables like wind speed and direction, humidity, atmospheric pressure, and solar radiation, which could impact water levels, have not been taken into account. However, further studies will be performed to investigate the inclusion of these factors to enhance the understanding of their correlation with water levels and their influence on prediction accuracy. Furthermore, future research will also include the development of machine learning hybrid models to enhance the analysis of water level prediction in the Great Lakes basin.
Data availability
All data were gathered from The Great Lakes Environmental Research Laboratory (GLERL) website, www.glerl.noaa.gov/data. The data that support the findings of this study are described in the manuscript and available from the corresponding author, Onur Kurt, upon reasonable request.
References
Altunkaynak A (2014) Predicting water level fluctuations in Lake Michigan-Huron using wavelet-expert system methods. Water Resour Manag 28:2293–2314
Anderson EJ, Schwab DJ (2013) Predicting the oscillating bi-directional exchange flow in the Straits of Mackinac. J Great Lakes Res 39:663–671
Annin P (2006) The great lakes water wars (Springer)
Babel MS, Shinde VR (2011) Identifying prominent explanatory variables for water demand prediction using artificial neural networks: a case study of Bangkok. Water Resour Manag 25:1653–1676
Barzegar R, Aalami MT, Adamowski J (2021) Coupling a hybrid CNN-LSTM deep learning model with a boundary corrected maximal overlap discrete wavelet transform for multiscale lake water level forecasting. J Hydrol 598:126196
Battineni G, Chintalapudi N, Amenta F (2020) Forecasting of COVID-19 epidemic size in four high hitting nations (USA, Brazil, India and Russia) by Fb-Prophet machine learning model. Appl Comput Inform
Bonakdari H, Ebtehaj I, Samui P, Gharabaghi B (2019) Lake water-level fluctuations forecasting using minimax probability machine regression, relevance vector machine, gaussian process regression, and extreme learning machine. Water Resour Manag 33:3965–3984
Bowden GJ, Dandy GC, Maier HR (2005) Input determination for neural network models in water resources applications. Part 1—background and methodology. J Hydrol 301:75–92
Buyukyildiz M, Tezel G, Yilmaz V (2014) Estimation of the change in lake water level by artificial intelligence methods. Water Resour Manag 28:4747–4763
Cao K, Kim H, Hwang C, Jung H (2018) CNN-LSTM coupled model for prediction of waterworks operation data. J Inf Process Syst 14:1508–1520
ChikkaKrishna NK, Rachakonda P, Tallam T (2022) Short-term traffic prediction using Fb-PROPHET and neural-PROPHET. Paper presented at: 2022 IEEE Delhi Section Conference (DELCON) (IEEE)
Choden Y, Chokden S, Rabten T, Chhetri N, Aryan KR, Al Abdouli KM (2022) Performance assessment of data driven water models using water quality parameters of Wangchu river, Bhutan. SN Appl Sci 4:290
Choi C, Kim J, Han H, Han D, Kim HS (2019) Development of water level prediction models using machine learning in wetlands: a case study of Upo Wetland in South Korea. Water 12:93
Ciaburro G, Iannace G (2021) Machine learning-based algorithms to knowledge extraction from time series data: a review. Data 6:55
Coulibaly P (2010) Reservoir computing approach to great lakes water level forecasting. J Hydrol 381:76–88
Danz NP, Niemi GJ, Regal RR, Hollenhorst T, Johnson LB, Hanowski JM, Axler RP, Ciborowski JJ, Hrabik T, Brady VJ (2007) Integrated measures of anthropogenic stress in the US Great Lakes Basin. Environ Manage 39:631–647
Daraghmeh M, Agarwal A, Manzano R, Zaman M (2021) Time series forecasting using Facebook Prophet for cloud resource management. Paper presented at: 2021 IEEE International Conference on Communications Workshops (ICC Workshops) (IEEE)
Demir V, Yaseen ZM (2023) Neurocomputing intelligence models for lakes water level forecasting: a comprehensive review. Neural Comput Appl 35:303–343
Demuth H, Beale M, Hagan M (1992) Neural network toolbox. For Use with MATLAB. The MathWorks Inc 2000
García Molinos J, Viana M, Brennan M, Donohue I (2015) Importance of long-term cycles for predicting water level dynamics in natural lakes. PLoS ONE 10:e0119253
Garlapati A, Krishna DR, Garlapati K, Rahul U, Narayanan G (2021) Stock price prediction using Facebook Prophet and Arima models. Paper presented at: 2021 6th International Conference for Convergence in Technology (I2CT) (IEEE)
Ghaderpour E, Pagiatakis SD, Hassan QK (2021) A survey on change detection and time series analysis with applications. Appl Sci 11:6141
Ghiasi V, Koushki M (2020) Numerical and artificial neural network analyses of ground surface settlement of tunnel in saturated soil. SN Appl Sci 2:939
Ghimire S, Yaseen ZM, Farooque AA, Deo RC, Zhang J, Tao X (2021) Streamflow prediction using an integrated methodology based on convolutional neural network and long short-term memory networks. Sci Rep 11:17497
Gleick PH (1993) Water in crisis. Pacific Institute for Studies in Dev., Environment & Security. Stockholm Env. Institute, Oxford Univ. Press. 473p 9: 1051 – 0761
Hall ND, Stuntz BB (2008) Climate change and great lakes water resources: avoiding future conflicts with conservation. Hamline L Rev 31:639
Hansen JV, McDonald JB, Nelson RD (1999) Time series prediction with genetic-algorithm designed neural networks: an empirical comparison with modern statistical models. Comput Intell 15:171–184
Hartig JH, Krantzberg G, Alsip P (2020) Thirty-five years of restoring great lakes areas of concern: gradual progress, hopeful future. J Great Lakes Res 46:429–442
Hudson JC, Ziegler SS (2014) Environment, culture, and the great lakes fisheries. Geogr Rev 104:391–413
Injadat M, Moubayed A, Nassif AB, Shami A (2021) Machine learning towards intelligent systems: applications, challenges, and opportunities. Artif Intell Rev 54:3299–3348
Inn TE, Port Huron M (1999) Lake St. Its Current State, Clair
Jamous R, ALRahhal H, El-Darieby M (2021) Neural Network Architecture Selection using particle swarm optimization technique. Appl Artif Intell 35:1219–1236
Janiesch C, Zschech P, Heinrich K (2021) Machine learning and deep learning. Electron Mark 31:685–695
Jordan MI, Mitchell TM (2015) Machine learning: Trends, perspectives, and prospects. Science 349:255–260
Keddy P, Reznicek A (1986) Great lakes vegetation dynamics: the role of fluctuating water levels and buried seeds. J Great Lakes Res 12:25–36
Lee JH, Shin J, Realff MJ (2018) Machine learning: overview of the recent progresses and implications for the process systems engineering field. Comput Chem Eng 114:111–121
Lv C, Xing Y, Zhang J, Na X, Li Y, Liu T, Cao D, Wang F-Y (2017) Levenberg–Marquardt backpropagation training of multilayer neural networks for state estimation of a safety-critical cyber-physical system. IEEE Trans Ind Inf 14:3436–3446
Mirzania E, Kashani MH, Golmohammadi G, Ibrahim OR, Saroughi M (2023a) Hybrid COOT-ANN: a novel optimization algorithm for prediction of daily crop reference evapotranspiration in Australia. Theor Appl Climatol 154:201–218
Mirzania E, Roshni T, Kashani MH, Bickici B, Ibrahim OR (2023b) Forecasting of lake water level based on a hybrid model of innovative gunner algorithm. Acta Geophys: 1–19
Mirzania E, Vishwakarma DK, Bui Q-AT, Band SS, Dehghani R (2023c) A novel hybrid AIG-SVR model for estimating daily reference evapotranspiration. Arab J Geosci 16:301
Mohan AT, Gaitonde DV (2018) A deep learning based approach to reduced order modeling for turbulent flow control using LSTM neural networks. arXiv preprint arXiv:1804.09269
Moraffah R, Sheth P, Karami M, Bhattacharya A, Wang Q, Tahir A, Raglin A, Liu H (2021) Causal inference for time series analysis: problems, methods and evaluation. Knowl Inf Syst 63:3041–3085
Mortsch LD (1998) Assessing the impact of climate change on the Great Lakes shoreline wetlands. Clim Change 40:391–416
Mortsch LD, Quinn FH (1996) Climate change scenarios for Great Lakes Basin ecosystem studies. Limnol Oceanogr 41:903–911
Neff BP, Nicholas J (2005) Uncertainty in the Great Lakes water balance (US Geological Survey)
Parzen E (1961) An approach to time series analysis. Ann Math Stat 32:951–989
Piasecki A, Jurasz J, Skowron R (2015) Application of artificial neural networks (ANN) in Lake Drwęckie water level modelling. Limnol Rev 15:21–29
Sagheer A, Kotb M (2019) Time series forecasting of petroleum production using deep LSTM recurrent networks. Neurocomputing 323:203–213
Sarker IH (2021) Machine learning: algorithms, real-world applications and research directions. SN Comput Sci 2:160
Saroughi M, Mirzania E, Achite M, Katipoğlu OM, Ehteram M (2024) Shannon Entropy of performance metrics to choose the best novel hybrid algorithm to predict groundwater level (case study: Tabriz plain, Iran). Environ Monit Assess 196:1–20
Sharma N, Sharma R, Jindal N (2021) Machine learning and deep learning applications-a vision. Glob Transit Proc 2:24–28
Sherstinsky A (2020) Fundamentals of recurrent neural network (RNN) and long short-term memory (LSTM) network. Phys D: Nonlinear Phenom 404:132306
Shiklomanov IA (2000) Appraisal and assessment of world water resources. Water Int 25:11–32
US EPA (2023) The United States Environmental Protection Agency. https://www.epa.gov/greatlakes/great-lakes-facts-and-figures. Accessed September 2023
USGS (1993) United States Geological Survey by Water Science School. https://www.usgs.gov/media/images/distribution-water-and-above-earth-0. Accessed September 2023
Uyanık GK, Güler N (2013) A study on multiple linear regression analysis. Procedia Soc Behav Sci 106:234–240
Wang Q, Wang S (2020) Machine learning-based water level prediction in Lake Erie. Water 12:2654
Wilcox DA, Thompson TA, Booth RK, Nicholas J (2007) Lake-level variability and water availability in the Great Lakes (US Geological Survey Reston, VA, USA)
Wu W, Dandy GC, Maier HR (2014) Protocol for developing ANN models and its application to the assessment of the quality of the ANN model development process in drinking water quality modelling. Environ Model Softw 54:108–127
Wunsch A, Liesch T, Broda S (2021) Groundwater level forecasting with artificial neural networks: a comparison of long short-term memory (LSTM), convolutional neural networks (CNNs), and non-linear autoregressive networks with exogenous input (NARX). Hydrol Earth Syst Sci 25:1671–1687
Xue P, Wagh A, Ma G, Wang Y, Yang Y, Liu T, Huang C (2022) Integrating Deep Learning and Hydrodynamic modeling to improve the Great Lakes Forecast. Remote Sens 14:2640
Yang X, Zhang Z (2022) A CNN-LSTM Model based on a Meta-learning algorithm to Predict Groundwater Level in the Middle and Lower reaches of the Heihe River, China. Water 14:2377
Zhu S, Hrnjica B, Ptak M, Choiński A, Sivakumar B (2020a) Forecasting of water level in multiple temperate lakes using machine learning models. J Hydrol 585:124819
Zhu S, Lu H, Ptak M, Dai J, Ji Q (2020b) Lake water-level fluctuation forecasting using machine learning models: a systematic review. Environ Sci Pollut Res 27:44807–44819
Zhou J, Wang D, Band SS, Mirzania E, Roshni T (2023) Atmosphere air temperature forecasting using the honey badger optimization algorithm: on the warmest and coldest areas of the world. Eng Appl Comput Fluid Mech 17:2174189
Zhu S, Di Nunno F, Ptak M, Sojka M, Granata F (2023a) A novel optimized model based on NARX networks for predicting thermal anomalies in Polish lakes during heatwaves, with special reference to the 2018 heatwave. Sci Total Environ 905:167121
Zhu S, Ji Q, Ptak M, Sojka M, Keramatfar A, Chau KW, Band SS (2023b) Daily water-level forecasting for multiple Polish lakes using multiple data‐driven models. Geogr J 189:357–369
Acknowledgements
The author expresses sincere gratitude to The Great Lakes Environmental Research Laboratory (GLERL), a facility within the National Oceanic and Atmospheric Administration (NOAA), for generously supplying data relevant to the Great Lakes basin for this study.
Funding
Open access funding provided by the Scientific and Technological Research Council of Türkiye (TÜBİTAK). The author has not disclosed any funding.
Author information
Contributions
Onur Kurt: Conceptualization, Methodology, Analysis, Software, Writing the original draft.
Ethics declarations
Conflict of interest
The author declares that he has no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Additional information
Communicated by Hassan Babaie.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Kurt, O. Model-based prediction of water levels for the Great Lakes: a comparative analysis. Earth Sci Inform (2024). https://doi.org/10.1007/s12145-024-01341-3
DOI: https://doi.org/10.1007/s12145-024-01341-3