Value and limitations of machine learning in high-frequency nutrient data for gap-filling, forecasting, and transport process interpretation

Barcala, Victoria; Rozemeijer, Joachim; Ouwerkerk, Kevin; Gerner, Laurens; Osté, Leonard

doi:10.1007/s10661-023-11519-9

Value and limitations of machine learning in high-frequency nutrient data for gap-filling, forecasting, and transport process interpretation

Research
Open access
Published: 27 June 2023

Volume 195, article number 892, (2023)
Cite this article

Download PDF

You have full access to this open access article

Environmental Monitoring and Assessment Aims and scope Submit manuscript

Value and limitations of machine learning in high-frequency nutrient data for gap-filling, forecasting, and transport process interpretation

Download PDF

Victoria Barcala¹,
Joachim Rozemeijer²,
Kevin Ouwerkerk²,
Laurens Gerner³ &
…
Leonard Osté¹

1260 Accesses
1 Citation
Explore all metrics

Abstract

High-frequency monitoring of water quality in catchments brings along the challenge of post-processing large amounts of data. Moreover, monitoring stations are often remote and technical issues resulting in data gaps are common. Machine learning algorithms can be applied to fill these gaps, and to a certain extent, for predictions and interpretation. The objectives of this study were (1) to evaluate six different machine learning models for gap-filling in a high-frequency nitrate and total phosphorus concentration time series, (2) to showcase the potential added value (and limitations) of machine learning to interpret underlying processes, and (3) to study the limits of machine learning algorithms for predictions outside the training period. We used a 4-year high-frequency dataset from a ditch draining one intensive dairy farm in the east of The Netherlands. Continuous time series of precipitation, evapotranspiration, groundwater levels, discharge, turbidity, and nitrate or total phosphorus were used as predictors for total phosphorus and nitrate concentrations respectively. Our results showed that the random forest algorithm had the best performance to fill in data-gaps, with R² higher than 0.92 and short computation times. The feature importance helped understanding the changes in transport processes linked to water conservation measures and rain variability. Applying the machine learning model outside the training period resulted in a low performance, largely due to system changes (manure surplus and water conservation) which were not included as predictors. This study offers a valuable and novel example of how to use and interpret machine learning models for post-processing high-frequency water quality data.

Applications of Data-driven Models for Daily Discharge Estimation Based on Different Input Combinations

Article 23 April 2022

Applicability of machine learning techniques for multi-time step ahead runoff forecasting

Article 08 March 2022

Intelligent Data Analytics Approaches for Predicting Dissolved Oxygen Concentration in River: Extremely Randomized Tree Versus Random Forest, MLPNN and MLR

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Intensive agriculture is an important source of nutrients in surface waters (Bol et al., 2018; Van der Grift et al., 2016; Van der Salm et al., 2012; Withers et al., 2014). High total phosphorus (TP) or high nitrate (NO₃) concentrations are two of the parameters that can lead to a poor ecological quality status in surface waters. High NO₃ and TP concentrations are being pointed out as one of the main causes of biodiversity loss (Dise et al., 2011; Porter et al., 2013) and algae blooms (Withers & Haygarth, 2007). In many cases, the nutrients lost to surface water do not just originate from the freshly applied manure or fertilizer, but from the nutrient legacy accumulated in the soil which is transported into surface waters through natural of artificial drainage systems (Bieroza et al., 2019; Lucas et al., 2021; Sharpley et al., 2013). Realistic goals and appropriate mitigation measures are needed (Schoumans et al., 2014). Therefore, it is central for the water authorities to monitor the water quality of surface waters and quantify the effect that nutrient sources and system changes have in the transport processes and leaching of nutrients into larger water systems.

High-frequency monitoring of water quality data offers a detailed understanding of the processes involved in nutrient transport (Rode et al., 2016; Rozemeijer et al., 2010a, b). However, technicians often face the challenge to deal with large data volumes and missing data (Zhang et al., 2019). It is often the case that sensors and autoanalyzers have technical problems which can result in a significant number of gaps in the data. The post processing of the data, including identification of errors and filling missing values can be time consuming and the final result depends on the individual who does the post processing (Jones et al., 2021). Furthermore, the amount of data collected by high-frequency sensors can easily exceed the amount of data that can be treated manually (Dupas et al., 2015; Kirchner & Neal, 2013). Nevertheless, without complete data series, it is not possible to accurately calculate total annual loads and the value of high-frequency monitoring is reduced to the observation of specific events that might not be representative of the overall system’s response.

Machine learning algorithms build models based on sample (training) data in order to make numerical predictions (regression models) or categorical predictions (classification models). Machine learning algorithms, such as trees, rules, support vector machines, and artificial neural networks, offer an advantage to linear methods when treating nonlinear problems such as concentration-discharge relationships. Although machine learning algorithms are powerful tools to post-process high-frequency water quality data, their use is still below their potential in many fields of environmental sciences (Liu et al., 2022). The relative low acceptance of machine learning in some environmental sciences may lie in reluctance to shift from a process-based approach to a data-based approach and the tradeoff between interpretability, performance, and complexity (Liu et al., 2022; Visser et al., 2022). Gap-filling of continuous water quality datasets is a so far unexplored, yet potentially powerful application of machine learning. Machine learning algorithms have been successfully applied for gap-filling in medical datasets (Shah et al., 2014), eddy-covariance evapotranspiration and CO₂ flux data sets (Kang et al., 2019), soil moisture (Mao et al., 2019) and more recently also for daily streamflow time series (Arriagada et al., 2021). Most water quality applications of machine learning focus on predicting nutrient concentrations from catchment characteristics (e.g., Castrillo & García, 2020; Chen et al., 2020; Olson & Hawkins, 2012) or from other chemical parameters measured in conventional monitoring networks (e.g. Ha et al., 2020; Visser et al., 2022). Nevertheless, most of these studies focus on the forecasting performance and do not explore the limitations of using predictive data-based models (Tyralis & Papacharalampous, 2019).

The objectives of this study were as follows: (i) to evaluate six different machine learning models for gap-filling in a high-frequency NO₃ and TP concentration time series, (ii) to showcase the potential added value and limitations of machine learning to interpret underlying nutrient transport processes, and (iii) to study the limits of machine learning algorithms for making predictions outside the training period. As case study, we used 4 years of high-frequency data from a ditch draining one dairy farm in the east of The Netherlands where the nutrient transport from the soil to the surface water were previously investigated (Barcala et al., 2020). We applied open source and popular data-science software such as WEKA (Frank et al., 2017) and R (R Core Team, 2020) for the data post-processing and model implementation. This study offers a valuable and novel example of how to use and interpret machine learning models for post-processing high-frequency water quality data.

Materials and methods

Field site and time series description

The data was collected from a dairy farm near Winterswijk, the Netherlands (52.00131 N, 6.76112 E). The nutrient routes from the soil to the surface water were previously studied by Barcala et al. (2020). Manure is applied in the fields for fertilization between March and August. After measuring the crop productivity, the annual soil nutrient surpluses are calculated by the farmer. The topsoil is high in organic matter and has a 0.26 phosphorus saturation degree, meaning it can hardly retain more P. The water extractable P content (Pw) was on average 11.2 mg/kg in the topsoil, 1.5 mg/l just below the tillage zone (40–50 cm depth), and 0.1 mg/kg at 70–80 cm depth (Barcala et al., 2020). The average P in the topsoil is 2.630 kg/ha and the N in the topsoil is 565 kg/ha (Barcala et al., 2020). Below the topsoil, there is a Fe- and Al-rich sand layer. The farm is artificially drained by a main ditch that collects the water of the whole farm and runs parallel to the road in front of the farm. A secondary ditch runs perpendicular to the main ditch into the fields behind the farmyard. The northeast part of the fields has subsurface drainpipes draining into the most upstream part of the main ditch. The terrain is flat and surface runoff only occasionally contributed to the ditch discharge. Therefore, nutrients are transported mainly from the soil to the main ditch via lateral groundwater flow and the tile drains. During the summer months, the groundwater level falls below the ditch level and the main ditch falls dry. During this study, farmers were particularly affected by the extreme drought of 2018. Water shortage is a stress factor for crop growth and climate change is causing greater rainfall variability (Greve et al., 2021; Masson-Delmotte et al., 2021). To adapt against droughts, the farmer implemented different water conservation measures to control the groundwater level in the field before the start of the last drainage season. An adjustable weir was placed in the main ditch in front of the farm, and an adjustable pipe was installed in the side ditch behind the farmyard (Fig. 1).

At the end of the main ditch, we installed a v-notch weir and a high-frequency monitoring station was operative from 17 February 2018 to 7 June 2021. Every 15 min, TP (Phosphax Sigma autoanalyzer, Hach), turbidity (Solitax Sensor, Hach), and NO₃ (Nitratax Sensor, Hach) were measured. About 80% of the total-N was in the form of NO₃ in exploratory laboratory analysis. Furthermore, every 15 min, the discharge from the v-notch weir was calculated using a pressure gauge upstream from the weir and groundwater levels at the farm were monitored with a pressure gauge installed in a groundwater piezometer. As meteorological data may contribute to the prediction of the missing values, hourly rainfall and daily evapotranspiration data were downloaded from a meteorological station 12 km from the farm that belongs to the Dutch Royal Meteorological Institute Network (https://www.knmi.nl/nederland-nu/klimatologie, station 283). The hourly rain, and daily grass reference evapotranspiration, were linearly interpolated to have one value every 15 min using the approxfun function in R. All the times were taken to Dutch wintertime (GMT + 1). Time lags may vary depending on pre-event conditions such as groundwater levels, soil moisture, the location of the source (soil vs ditch sediment) and the magnitude of the event. Time lags were not included in the calculations but were estimated to be under 3 h. All the time series were quality checked, discharge measurements were checked based on manual measurements, and concentrations measurements were controlled based on laboratory measurements taken on routine visits every approximately 4 weeks. More detailed information about the field site characteristics and the high-frequency monitoring station can be found in Barcala et al. (2020).

Using the 2017–2018 values as a reference, the groundwater levels were on average 25 cm higher in 2020–2021 when the farmer implemented water retention measures (Table S1). In the year 2018 (2017–2018 season), TP correlated with turbidity (0.70) and NO₃ correlated strongly with discharge and groundwater level (0.91, 0.86) as was already discussed in Barcala et al. (2020). However, these correlations became weaker in the following years, especially between TP and turbidity (Fig. S2). NO₃ concentrations showed a similar temporal pattern to the groundwater levels but were shortly diluted during rain events (Figs. 3 and S3). Before starting with the selection of the machine learning models, we did some basic exploration of the available data available in the supplementary material. The N and P application to the fields are limited by the national Action Plans for the EU Nitrate Directive (Schroder et al., 2007). The manure applied targets at a 0 kg/ha P surplus. However, the crop growth can be limited by the water availability; in years with low rainfall less P was taken up, which resulted in a positive surplus. The average yearly N surplus was 142 kg/ha N, which falls just below the national average (160 kg/ha). Table 1 gives a quantitative summary of the drainage seasons.

Table 1 Summary of the nutrient surplus, first and last day of drainage, seasonal rainfall, evapotranspiration, and discharge

Full size table

Data analysis

To calculate and compare accurate annual nutrient loads leaving the catchment, we needed to fill the missing NO₃ and TP values of the high-frequency dataset. Table 1 shows the duration of each drainage season and the percentage of NO₃ and TP missing data. To fill in missing data, we evaluated six machine learning algorithms and compared them to filling in the gaps with the mean. The measured data was split and 60% was used for training (calibration) and 40% for testing (validation). The data was randomly split three times to improve the statistical representation and robustness of the results by using three different seeds. Seeds ensure that the results are reproducible, dividing the data in the same way each time. We opted for a wide pre-selection of machine learning algorithms because one cannot know beforehand which will perform best for a specific problem. The performance depends on the available dataset and the defined problem, in our case the accuracy in filling missing data in high-frequency nutrient concentration time series. The Waikato Environment for Knowledge Analysis (WEKA) was used to preprocess the data and for evaluation of all Machine Learning models. WEKA is open software widely used for data mining programmed in Java (Frank et al., 2017). If it is not stated otherwise, the default parameter settings in WEKA were used. R studio (R Core Team, 2020) was used for data visualization and pre- and post-processing of the data. Six algorithms were pre-selected following the criteria that they were well documented and accepted, able to predict a numeric class (regression), and capable of handling missing data.

The pre-selected algorithms use different principles in order to build the models. Zero Rules (ZR) predicts the mean of the numeric class, it is used as a benchmark to determine if other algorithms perform better than filling the missing values with the mean. Multivariate linear regression (MLR) finds the best fit for a line between multiple independent variables and the output is an explicit equation. Sequential minimal optimization regression (SMO) is similar to a support vector machine but can solve regression problems. SMO solves analytically the smallest possible optimization problem at every step using two Lagrange multipliers that obey a linear equality constraint (Platt, 2008). K-nearest neighbor (kNN), also called instance-based learner, generates a prediction by first finding k instances in the training dataset which are closest to the value that we want to predict (Aha et al., 1991). K was set to 1 and we used the Euclidean distance. M5 Rules (M5R) combines rules with trees, it generates list of rules for regression problems using the “separate-and-conquer” strategy, in each iteration a tree is built, and the “best” leaf is made into a rule (Leman, 1997). Random forest (RF) grows an ensemble of trees and takes the average of the trees for the regression problem (Breiman, 2001), 100 trees were grown to maximal depth. Artificial neural networks (ANNs) are networks of linear classifiers (perceptrons), they implement a weighted decision given two hidden layers, we used 7 nodes or “neurons” in the hidden layer as this was equal to the number of nodes in the input layer (Wolpert, 1992).

To build the TP and NO₃ models, we used time series of precipitation, evapotranspiration, groundwater levels, discharge, turbidity, and NO₃ or TP, respectively. Seasonal changes as the manure surplus and the implementation of water retention measures are not included as predictors. To evaluate and select the best model, we used the coefficient of determination (R²), the mean absolute error (MAE), and the root mean square error (RMSE) between the measured and the predicted values of the test subset. The models were done for each drainage season (2017–2018, 2018–2019, 2019–2020, 2020–2021) and for all the seasons together (2017–2021). The seasons are defined as the time during the year when there is water discharge in the ditch. Separate models were preferred to one single model to evaluate the model response to different yearly features that are not considered as predictors, such as year-to-year variations in total rainfall, nutrient surpluses, and the implementation of water conservation measures in the 2020–2021 drainage season.

To study the performance of predicting nutrient concentrations we evaluated two scenarios. First, the 2018–2019 model was used to predict the 2019–2020 measurements, and second, the 2019–2020 model was used to predict the 2020–2021 measurements. Both predictions used the input variables (rain, evapotranspiration, groundwater, discharge, turbidity, and NO₃ or TP) of the season we wanted to predict. The first scenario represents the prediction with no system changes and the second represents the prediction with changes (water retention measures). This way we evaluate if the model can predict system changes outside the training window. Both predictions were also done with the all seasons’ (2017–2021) model to assess if the model could represent system changes inside the training window. Although the retention measures are not incorporated into the model, the groundwater levels measured were and they were higher on the last season.

The feature importance (also called permutation variable importance metrics) allows to weight the influence of each input variable in the prediction, improving the interpretability of the results. The feature importance is calculated as the percentual increase in predictive error of not including one variable as compared to the out-of-bag rate with all other variables intact (Breiman, 2001). The feature importance was used to interpret which variables are most relevant for the prediction outcomes. An extra random variable was included in the feature importance calculations as a benchmark. If any variable was equally or less important than the random variable, then it would not contribute for the prediction. The importance values are related to the output magnitude of the predicted variable; therefore, the feature importance was normalized to 1 to facilitate the comparison. Without this normalization step, the NO₃ predictors would have a higher feature importance values than those for TP. The 2017–2018 season is only used for gap filling and not for future predictions or for variable feature importance calculations because it is not a full drainage season (it starts in half February). Lastly, after filling the missing data with the best performing model, the total loads of NO₃ and TP were calculated for each season by multiplying the concentrations by the discharge. Total loads were also calculated for the predictive models.

Results

Filling in missing data

The TP autoanalyzer had a larger amount of missing data, grouped in 2 to 4 large gaps per season. Besides 2018, the amount values missing from the NO₃ sensor were very low and concentrated at the beginning of the season. The summary of the R², RMSE, and MAE for three test subsets for the different NO₃ and TP season models are shown in Fig. 2 (values and computation times are shown in Tables S2 and S3). For each drainage season, the random forest model had the best fit (R² ~ 0.99 for NO₃ and 0.96 for TP) with low computation times and was therefore selected to fill in the missing values. Figure 3 shows the complete measured and modeled time series together with the input variables for the 2019–2020 and 2020–2021 seasons (seasons 2017–2018 and 2018–2019 are in Fig. S3). Random forest gave consistently very good results for each seasons’ model, while other algorithms showed larger differences in performance from season to season. Following random forest, k-nearest neighbor, and M5 Rules had a good performance for all seasons’ (R² ~ 0.84 and 0.81 for TP and 0.73 and 0.92 for NO₃, respectively), yet they had poorer results for TP in the 2019–2020 season (R² ~ 0.59 and 0.65, respectively). M5 Rules gave, for little extra computation time, a very good performance in the all seasons’ model and has the advantage that the output of the model are explicit rules that could be interpreted. However, as the problem was complex, more than 190 rules were obtained, which makes interpretation very difficult. Artificial neural networks came in fourth place with a lower performance in the all seasons’ model (R² ~ 0.67 for NO₃ and 0.50 for TP). Sequential minimal optimization obtained even lower results than artificial neural networks (R² ~ 0.38 NO₃ and 0.19 TP) and the longer times needed to build and validate the model are a mayor disadvantage. For example, for the all seasons’ model, the sequential minimal optimization model took 46 h computing time to train and test the NO₃ data series while random forest took only 2 min. Multivariable linear regression offers the benefit of having an explicit equation as output, but the trade-off is a lower performance (R² ~ 0.39 for NO₃ and 0.21 for TP; MAE > mean). Only in 2018 when NO₃ was strongly correlated to discharge and groundwater levels the results for the multivariable linear regression were very good (R² ~ 0.89). Nevertheless, the correlation was almost the same as doing a simple one variable linear regression with the discharge and this relationship was not maintained through the years (Fig. S2).

Future predictions

First, we compared the predictive performance between the 2019–2020 measured data and the prediction of the same season using the 2018–2019 and the all seasons’ random forest models (Fig. 4). This prediction illustrates the random forest performance outside the training period without system changes (besides manure surplus). The R² using the TP predictive 2018–2019 model was only 0.41 (MAE 0.02 and RMSE 0.04), fewer and lower TP peaks were obtained but the baseline concentrations were reproduced, except for some baseline overestimations in March. For NO₃, the largest difference in concentrations was from the end of November to January, when there is an increase in the measured NO₃ values that were not predicted by the model, still the R² was 0.75 (MAE 3.02 and RMSE 4.40), as the model performs well outside this window. The NO₃ load using 2018–2019 model was 723 kg while the measured load was 885 kg. TP exported using 2018–2019 model was 4.1 kg while the measured load was 3.3 kg.

Second, we compared the predictive performance between the 2020–2021 measurements and the prediction of the same season using the 2019–2020 and the all seasons’ random forest models (Fig. 5). The predictions represent how suitable is random forest to capture system changes (water conservation). The performance of the 2019–2020 model TP predictions was poor (0.09 R², 0.034 MAE, and 0.057 RMSE) while the all seasons’ model performed well (0.96 R², 0.001 MAE, and 0.003 RMSE). In the case of NO₃, the R² of the 2019–2020 model with the measured values was 0.44 (4.53 MAE and 5.15 RMSE) while with the all seasons’ model it was 0.99 (0.050 MAE and 0.070 RMSE). The distribution of the error of the all seasons’ model was always around zero and the differences with the measurements were mainly in the outliers. Despite the very good results for gap-filling within the training period, the predictive performance of random forest outside the training period was poor. The total TP load using the 2019–2020 model predictions was 5.57 kg while the measured load was 6.55 kg. On the other hand, the 2019–2020 model overestimated the NO₃ load at 760 kg, while the measured load was 534 kg.

During the first three seasons the total nutrient loads appears to have increased with the rainfall. A change in this trend is observed for the last season after the water conservation measures were implemented. Although the predicted loads differ from the measured loads for 2020–2021, the change in trend is to some extent captured by the model (Fig. 6). The dryer years with lower leaching may have resulted in accumulation of nutrients in the topsoil that were later released in the wetter years. All measured variables had between four and a hundred times the importance of the random variable introduced (Fig. 7). Turbidity had the highest feature importance in most models (except the 2018–219 and 2020–2021 NO₃ models).

Discussion

Using machine learning models to fill in missing data

The first objective of this study was to evaluate six different machine learning models for gap-filling in a high-frequency NO₃ and TP concentration time series. Random forest had the best performance for both NO₃ and TP with a constantly high R² (> 0.92) and low MAE and RMSE for every randomly selected testing set and application period. The short computing times are another advantage for using random forest for gap filling. The random forest gap-filling model could reproduce short-term trends in the time series as the TP peaks after rain events, NO₃ dilution after rain events, and NO₃increase with increase in the groundwater levels. The good results of the 2017–2021 model show that the random forest algorithm can also largely incorporate system changes within the training period, such as the introduction of water conservation measures and different soil nutrient surpluses. We observed that the NO₃ models performed systematically slightly better than the TP models. This may be caused by the larger proportion of missing values in our TP time series. Kang et al. (2019) and Zhang and Thorburn (2022) also reported a reduction in model performance when the amount of missing data is larger as this reduces the size of the training set. Another reason could be the relatively smooth behavior of NO₃ concentration dynamics compared to the spikier TP patterns.

Other comparative studies have also found random forest to perform better than linear regression and other machine learning algorithms in problems related to water quality (Castrillo & García, 2020; Ha et al., 2020; Shen et al., 2020; Visser et al., 2022). Methods such as artificial neural networks that here had a low performance have shown very good results in other studies (Astuti et al., 2020; Chen et al., 2020; Daliakopoulos & Ioannis, 2016; Dastorani et al., 2010; Kim et al., 2020; Najah et al., 2009, 2013). Nevertheless, in other comparative studies, they have also underperformed random forest (Bedi et al., 2020; Chen et al., 2020; Kim et al., 2020; Qiao et al., 2021; Visser et al., 2022). The low performance of multivariable linear regression can be explained by the nonlinear relationships between hydrological variables and concentrations. Multivariable linear regression has shown low performance in other nonlinear problems such as epidemiological studies (Shah et al., 2014). As there is no one-fits-all model, we recommend the used approach of evaluating different algorithms and seasonal performance to select the most robust model.

Conventional approaches for dealing with missing data included not considering the missing data or substituting the missing data with the mean (Tyralis & Papacharalampous, 2017). Not including the missing data makes calculations of total annual loads uncertain, especially because short, extreme rain events can account for a large portion of the yearly nutrient load export (Rozemeijer & Van der Velde, 2014). Substituting the missing data with the mean was done as a benchmark (Zero Rules model) and it underperformed all other methods having the highest MAE and RMSE. It has been already shown that statistical gap filling methods usually underperform machine learning methods (Zhang & Thorburn, 2022). Another gap-filling method includes stepwise linear regression; this approach has exhibited a good treatment of the missing data (Rozemeijer et al. (2010a, b), R² 0.74), but it requires a more a complex and time-consuming analysis of the time series than machine learning models as random forest. In our previous publication (Barcala et al., 2020), TP was correlated with NO₃ for gap-filling in the 2018–2019 season; this lead to a lower load estimations, 0.96 kg for TP and 282 kg for NO₃ compared with loads obtained with the new method, 1.33 kg for TP and 344 kg for NO₃. Random forest, and other machine learning algorithms, are underused tools in water quality studies. We encourage their application for gap-filling because of the good performance, short calculation times, and the availability of open source packages in R and user-friendly software like WEKA. Beside gap-filling, algorithms like random forest could also be useful for similar applications as real-time anomaly detection in sensors and autoanalyzers which could support system maintenance, for example by comparing the incoming data with one-step ahead predictions to detect anomalies and trigger a warning message.

Process interpretation

Our second objective was to show potential added value of machine learning to interpret underlying nutrient transport processes. In most cases, the high number of trees in random forest models makes their physical interpretation difficult or impossible (Tyralis & Papacharalampous, 2019). The feature importance represents the information gain of including each variable in the model and could be indicative for the influence of variables on physical processes. As a first observation, all predictor variables in our data set contributed to the information gain for all drainage seasons. This was indicated by the higher relative feature importance values compared to the random variable introduced (Breiman, 2001; Doshi-Velez & Kim, 2017). Therefore, it is not advised in this case to remove variables as all of them contributed to the prediction. Besides the feature importance, other approaches for process interpretation include to evaluate the variable coefficients obtained by multivariable linear regression for process interpretation together with the results of “less transparent” models as random forest (Visser et al., 2022). However, we would not recommend this approach for case studies as this one where results obtained with multivariate linear regression are poor. Instead, a similar sensitivity analysis to the variable feature importance can be done for other machine learning algorithms by training the models without one input variable at a time and evaluating the impact on the model’s results.

For NO₃, the feature importance values were quite different between the 2019–2020 and 2020–2021 seasons. In the 2019–2020 season, turbidity was the most important predictor, although groundwater levels, evapotranspiration and discharge also contributed significantly. In the 2020–2021 season, groundwater levels, evapotranspiration, and discharge were the most important variables, but turbidity did not rank high. The connection between groundwater levels, discharge, and NO₃ losses was described before by Rozemeijer and Broers (2007) and for this field site by Barcala et al. (2020). With higher groundwater levels, a larger relative contribution of shallow NO₃-rich groundwater flow routes (including tube drain discharge) towards surface water increases the NO₃ concentrations while rain events dilute the concentrations. The NO₃ concentrations reached up to 25 mg/L NO₃-N after an increase in the groundwater level in mid-November 2019. During the second part of the drainage season, the mineral N residue in the soil was depleted and the NO₃ concentrations decrease to around 10 mg/L. Turbidity was also high in the second part of the season and splitting the trees by turbidity resulted in a positive information gain. In this case, the predictive power of turbidity does not seem to have a direct process-based explanation.

For TP, turbidity has the highest relative feature importance values in all models. This relation is directly linked to the role of sediment transport in TP concentration dynamics. As described by Barcala et al. (2020) and Baken et al. (2015), iron and phosphorus from groundwater form iron(hydr)oxides which precipitate at the ditch bottom. During steady hydrological conditions, a P-rich sediment layer builds up. This sediment is transported during the next discharge event, causing a peak in both turbidity and TP. The data gaps in 2020–2021 coincided with the highest turbidity peaks, and although it is likely, it is not possible to asses if there was an underestimation of TP peaks during this period. Furthermore, almost no high TP peaks were predicted with the 2019–2020 model when compared with the 2020–2021 measured series. Shen et al. (2020) observed that high values were underestimated when using random forest to predict N and P concentrations in streams. The high skewness of the data was offered as explanation. Surprisingly, the TP export increased in the last season (2020–2021) despite the negative P surplus. The groundwater level increased in importance in the 2020–2021 model. The risk of mobilization of phosphate (and heavy metals) with the introduction of water conservation has been proposed theoretically before (Rozemeijer & Griffioen, 2004; Schoumans & Groenendijk, 2000), but to the best of our knowledge, it has not been directly measured yet. The topsoil had higher P and lower Al and Fe content, therefore watering the topsoil may have increased the risk of P leaching. In a recent data-based model of NO₃ leaching from agricultural soils across the Netherlands comparable trends were found (Spijker et al., 2021), the TP concentrations were inversely correlated with NO₃ emissions and TP was important for the prediction of NO₃ concentrations. They hypothesized that the high TP concentrations were a proxy for high groundwater levels (which were not included in that model), and that areas with high groundwater levels had higher denitrification rates and therefore lower NO₃ concentrations. In addition, Skidmore et al. (2022) has recently shown that extreme rain events increase the TP loads from agriculture.

Overall, machine learning can support process interpretation but justification of findings by other methods is needed, as the feature importance can be related to indirect links. Finally, turbidity was the variable that represented the largest information gain. Arriagada et al. (2021) and Fox et al. (2017) showed that random forest performance increased with the number of input variables used. Sensors are cheaper than autoanalyzers, require less maintenance and do not use reagents. The sensor data can then be trained to fill in data gaps in more complex equipment as TP autoanalyzers. This reason largely justify adding sensors (such as for turbidity, conductivity, dissolved oxygen, or temperature) next to autoanalyzers at high-frequency monitoring stations. Moreover, if a TP autoanalyzer is removed, the combination of continued cheap sensor measurements, low-frequency conventional TP sampling, and the previously trained RF model could still produce accurate continuous TP concentration time series for the site.

Using machine models for forecasting

The third objective of this study was to show the limits of machine learning algorithms for making predictions outside the training period. As reported by Tyralis and Papacharalampous (2019), the most important limitation to data-based models is that they should not be generalized to predict new processes or changes in unaccounted variables that were not covered by the training data set. Moreover, Kang et al. (2019), observed that interannual variations in nutrient loads are caused by year-to-year system changes for example in manure application, crop rotation, and cultivated area percentage, which were not fully captured by their random forest models. One disadvantage of random forest is that a small change in the data set, caused for example by different manure surpluses, can lead to a large change in the structure of the optimal decision tree. Unfortunately, the soil nutrient surplus is calculated at the end of the season; therefore, it is not possible to use it as a predictor for result forecasting.

Although for each season the all seasons’ model and the model of the season had similar R², MAE, and RMSE, the trees of each model were built splitting by different variables. This is why although the fitting was good for testing sets contained in the training period, it did not show such a good predictive performance outside that period. For example, the high feature importance of the variable turbidity to predict NO₃ in the 2019–2020 season is likely because during the first half of the season NO₃ was high and turbidity was low while in the second half the opposite happened, NO₃ was low and turbidity was high. Whereas in the previous (2018–2019) season, this negative correlation did not occur the importance of turbidity in the prediction was low. When we see the feature importance of the variable turbidity for the whole period (2018–2021), it is larger than for the 2018–2019 and 2020–2021 seasons. Therefore, the longer the data set used to build the model, the more likely it is that more processes that directly or indirectly affect the prediction are taken into account.

Despite both predictive models underperform the gap-filling results, the 2018–2019 model does a better job reproducing the 2019–2020 data than the 2019–2020 for the 2020–2021 data. Especially the NO₃ prediction with the 2019–2020 gives quite fair results. The relatively low N surplus in 2020–2021 may explain the differences in NO₃ loads obtained with the predictive model. In the last season, water retention measures were introduced and the nutrient N and P surpluses were lower, both changes were not directly introduced in the model as predictors but indirectly through the groundwater levels and the water quality data. Therefore, the groundwater level measurements were not enough to explain the system changes outside the training window. Nevertheless, the all seasons’ model performed very well for each of the individual seasons, including the 2020–2021 season. Random forest can cover the system changes as long as they occur within the training period. The presented example shows that historical trends are no guarantee for future performance and that emerging processes may not be accurately predicted by the model. This highlights the need for cautious interpretations of machine learning model predictions and for keeping the models up to date using longer data sets to improve the robustness of the models.

It is important to notice that the issue regarding the uncertainty of predictions outside the training period is not likely to be caused by overfitting. Overfitting occurs when the performance of the model is good in the training set but not in the testing set. The training set was 60% of the data and it was randomly divided three times. The resulting random forest models were robust, with good results on all different validation sets and application periods. Moreover, we used a high-quality dataset with seven variables and about 18,000 instances in the last two seasons. For further studies, we recommend evaluating ways to include low-frequency annual system changes in data-based models and quantifying the impact of the amount of missing data for the model performance.

Conclusions

Random forest was the best out of six machine learning algorithms to fill in missing data, with an R² higher than 0.92 for all test sets. Random forest could effectively reproduce nonlinear processes as concentration-discharge relationships and represent system changes that were considered in the training set.
Machine learning may support process interpretation, but justification of findings by other methods is needed. Accounting for changes in the groundwater levels was not enough to accurately predict system changes as the water conservation caused changes in the nutrient processes. After water conservation, higher groundwater levels resulted in more TP leaving the farm despite the negative P surplus and was represented by an increase of the relative feature importance of the groundwater level variable. This effect was likely related to higher desorption from the topsoil layers. For NO₃, the variable feature importance values were caused by indirect links and were not directly related to processes.
The incorporation of subsidiary sensors can pay-off in monitoring stations with more sensitive autoanalyzers. Here, the turbidity showed the largest information gain in predictions.
Random forest predictions outside their training period can be uncertain. Keeping the machine learning models up to date with newly retrieved data increases the reliability of the predictions.
Similar to gap-filling, random forest can be used for anomaly detection in monitoring stations.

The supplemental material includes the full data series, basic statistical exportation of the data series, model results including time calculations, plots of the modeled and measured data for 2017–2018 and 2018–2019 seasons, plots of the cumulative load for each season, scatter plots of measured vs modeled data with Random Forest for every season.

Data availability

The raw data used in this study is available at https://github.com/victoriabarcala/Huppel.

Abbreviations

ANN:: Artificial neural network
kNN:: K-nearest neighbor
M5R:: M5-Rules
MAE:: Mean average error
MLR:: Multivariable linear regression
N:: Nitrogen
NO₃ :: Nitrate
P:: Phosphorus
R ² :: Coefficient of determination
RF:: Random forest
RMSE:: Root mean square error
SMO:: Sequential minimal optimization
TP:: Total phosphorus
ZR:: Zero Rules

References

Aha, D., Kilbert, D., & Albert, M. (1991). Instance-based learning algorithms. Machine Learning, 6, 37–66.
Article Google Scholar
Arriagada, P., Karelovic, B., & Link, O. (2021). Automatic gap-filling of daily streamflow time series in data-scarce regions using a machine learning algorithm. Journal of Hydrology, 598(May), 126454. https://doi.org/10.1016/j.jhydrol.2021.126454
Astuti, A. D., Aris, A., Salim, M. R., Azman, S., Salmiati, & Said, M. I. M. (2020). Artificial intelligence approach to predicting river water quality: A review. Journal of Environmental Treatment Techniques, 8(3), 1093–1100.
Google Scholar
Baken, S., Verbeeck, M., Verheyen, D., Diels, J., & Smolders, E. (2015). Phosphorus losses from agricultural land to natural waters are reduced by immobilization in iron-rich sediments of drainage ditches. Water Research, 71, 160–170. https://doi.org/10.1016/j.watres.2015.01.008
Article CAS Google Scholar
Barcala, V., Rozemeijer, J., Osté, L., Van Der Grift, B., Gerner, L., & Behrends, T. (2020). Processes controlling the flux of legacy phosphorus to surface waters at the farm scale. Environmental Research Letters, 16(1). https://doi.org/10.1088/1748-9326/abcdd4
Bedi, S., Samal, A., Ray, C., & Snow, D. (2020). Comparative evaluation of machine learning models for groundwater quality assessment. In Environmental Monitoring and Assessment (Vol. 192, Issue 12). https://doi.org/10.1007/s10661-020-08695-3
Bieroza, M., Bergström, L., Ulén, B., Djodjic, F., Tonderski, K., Heeb, A., Svensson, J., & Malgeryd, J. (2019). Hydrologic extremes and legacy sources can override efforts to mitigate nutrient and sediment losses at the catchment scale. Journal of Environmental Quality, 48(5), 1314–1324. https://doi.org/10.2134/jeq2019.02.0063
Article CAS Google Scholar
Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5–32. https://doi.org/10.1023/A:1010933404324
Bol, R., Gruau, G., Mellander, P. E., Dupas, R., Bechmann, M., Skarbøvik, E., Bieroza, M., Djodjic, F., Glendell, M., Jordan, P., Van der Grift, B., Rode, M., Smolders, E., Verbeeck, M., Gu, S., Klumpp, E., Pohle, I., Fresne, M., & Gascuel-Odoux, C. (2018). Challenges of reducing phosphorus based water eutrophication in the agricultural landscapes of Northwest Europe. Frontiers in Marine Science, 5(AUG), 1–16. https://doi.org/10.3389/fmars.2018.00276
Castrillo, M., & García, Á. L. (2020). Estimation of high frequency nutrient concentrations from water quality surrogates using machine learning methods. Water Research, 172. https://doi.org/10.1016/j.watres.2020.115490
Chen, K., Chen, H., Zhou, C., Huang, Y., Qi, X., Shen, R., Liu, F., Zuo, M., Zou, X., Wang, J., Zhang, Y., Chen, D., Chen, X., Deng, Y., & Ren, H. (2020). Comparative analysis of surface water quality prediction performance and identification of key water parameters using different machine learning models based on big data. Water Research, 171, 115454. https://doi.org/10.1016/j.watres.2019.115454
Daliakopoulos, I. N., & Ioannis, K. T. (2016). Comparison of an artificial neural network and a conceptual rainfall–runoff model in the simulation. Hydrological Sciecne Journal, 61, 2763–2774. https://doi.org/10.1080/02626667.2016.1154151
Dastorani, M., Moghadamnia, A., Piri, J., & Rico-Ramirez, M. (2010). Application of ANN and ANFIS models for recosntructing missing flow data. Environmental Monitoring and Assessment, 166, 421–434.
Article Google Scholar
Dise, N. B., Ashmore, M., Belyazid, S., Bleeker, A., Bobbink, R., Vries, W. De, Erisman, J. W., Spranger, T., Stevens, C. J., & Berg, L. Van Den. (2011). Nitrogen deposition as a threat to European Terrestrial Biodiversity. In The European Nitrogen Assessment (Issue 2011). CAmbridge University Press.
Doshi-Velez, F., & Kim, B. (2017). Towards a rigorous science of interpretable machine learning. ArXiv Preprint ArXiv:1702.08608., Ml, 1–13.
Dupas, R., Tavenard, R., Fovet, O., Gilliet, N., Grimaldi, C., & Gascuel-Odoux, C. (2015). Identifying seasonal patterns of phosphorus storm dynamics with dynamic time warping. Water Resources Research, 51, 8868–8882. https://doi.org/10.1002/2015WR017338.Received
Article Google Scholar
Fox, E. W., Hill, R. A., Leibowitz, S. ., & Weber, M. H. (2017). Assessing the accuracy and stability of variable selection methods for random forest modeling in ecology. Environ Monit Assess, 316(189). https://doi.org/10.1007/s10661-017-6025-0
Frank, E., Hall, M. A., & Witten, I. H. (2017). The WEKA workbench. Data Mining, 553–571. https://doi.org/10.1016/b978-0-12-804291-5.00024-6
Greve, P., Brunner, L., Weiland, F. C. S., Visser, R. D., Greve, P., & Bisselink, B. (2021). Estimating regionalized hydrological impacts of climate change over Europe by performance-based weighting of CORDEX projections. November. https://doi.org/10.3389/frwa.2021.713537
Ha, N. T., Nguyen, H. Q., Truong, N. C. Q., Le, T. L., Thai, V. N., & Pham, T. L. (2020). Estimation of nitrogen and phosphorus concentrations from water quality surrogates using machine learning in the Tri An Reservoir, Vietnam. In Environmental Monitoring and Assessment 192(12). https://doi.org/10.1007/s10661-020-08731-2
Jones, A. S., Jones, T. L., Logan, N., & Horsburgh, J. S. (2021). Toward Automating Post Processing of Aquatic Sensor Data., 435, 1–63.
Google Scholar
Kang, M., Ichii, K., Kim, J., Indrawati, Y. M., Park, J., Moon, M., Lim, J. H., & Chun, J. H. (2019). New gap-filling strategies for long-period flux data gaps using a data-driven approach. Atmosphere, 10(10), 1–18. https://doi.org/10.3390/atmos10100568
Article Google Scholar
Kim, Y., Johnson, M. S., Knox, S. H., Black, T. A., Dalmagro, H. J., Kang, M., Kim, J., & Baldocchi, D. (2020). Gap-filling approaches for eddy covariance methane fluxes: A comparison of three machine learning algorithms and a traditional method with principal component analysis. Global Change Biology, 26(3), 1499–1518. https://doi.org/10.1111/gcb.14845
Article Google Scholar
Kirchner, J. W., & Neal, C. (2013). Universal fractal scaling in stream chemistry and its implications for solute transport and water quality trend detection. PNAS, 110(30). https://doi.org/10.1073/pnas.1304328110
Leman, M. (1997). Lecture notes in artificial intelligence. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 1317.
Liu, X., Lu, D., Zhang, A., Liu, Q., & Jiang, G. (2022). Data-driven machine learning in environmental pollution: Gains and problems. Environmental Science & Technology. https://doi.org/10.1021/acs.est.1c06157
Article Google Scholar
Lucas, E. R., Toor, G. S., & Mcgrath, J. M. (2021). Agronomic and environmental phosphorus decline in coastal plain soils after cessation of manure application. Agriculture, Ecosystems and Environment, 311(January), 107337. https://doi.org/10.1016/j.agee.2021.107337
Mao, H., Kathuria, D., Duffield, N., & Mohanty, B. P. (2019). Gap filling of high-resolution soil moisture for SMAP / Sentinel-1: A two-layer machine learning-based framework. Water Resources Research, 1, 6986–7009. https://doi.org/10.1029/2019WR024902
Article Google Scholar
Masson-Delmotte, V., Zhai, P., Pirani, A., Connors, S. L., Péan, C., Berger, S., Caud, N., Chen, Y., Goldfarb, L., Gomis, M. I., Huang, M., Leitzell, K., Lonnoy, E., Matthews, J. B. R., Maycock, T. K., Waterfield, T., Yelekçi, O., Yu, R. A., & Zhou, B. (2021). IPCC, 2021: Climate Change 2021: The Physical Science Basis. Contribution of Working Group I to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change. https://www.ipcc.ch/report/ar6/wg1/
Najah, A., El-Shafie, A., Karim, O. A., & El-Shafie, A. H. (2013). Application of artificial neural networks for water quality prediction. Neural Computing & Applications, 22(1), 187–201. https://doi.org/10.1007/s00521-012-0940-3
Article Google Scholar
Najah, A., Elshafie, A., Karim, O. A., & Jaffar, O. (2009). Prediction of Johor River water quality parameters using artificial neural networks. European Journal of Scientific Research, 28(3), 422–435.
Google Scholar
Olson, J. R., & Hawkins, C. P. (2012). Predicting natural base-flow stream water chemistry in the western United States. Water Resources Research, 48(2), 1–19. https://doi.org/10.1029/2011WR011088
Article CAS Google Scholar
Platt, J. (2008). Fast training support vector machines using parallel sequential minimal optimization. Proceedings of 2008 3rd International Conference on Intelligent System and Knowledge Engineering, ISKE 2008, 997–1001. https://doi.org/10.1109/ISKE.2008.4731075
Porter, E. M., Bowman, W. D., Clark, C. M., Compton, J. E., Pardo, L. H., & Soong, J. L. (2013). Interactive effects of anthropogenic nitrogen enrichment and climate change on terrestrial and aquatic biodiversity. Biogeochemistry, 93–120. https://doi.org/10.1007/s10533-012-9803-3
Qiao, Z., Sun, S., Jiang, Q., Xiao, L., Wang, Y., & Yan, H. (2021). Retrieval of total phosphorus concentration in the surface water of miyun reservoir based on remote sensing data and machine learning algorithms. Remote Sensing, 13(22). https://doi.org/10.3390/rs13224662
R Core Team. (2020). R: A language and environment for statistical computing (4.0.5). R Foundation for Statistical Computing.
Rode, M., Wade, A. J., Cohen, M. J., Hensley, R. T., Bowes, M. J., Kirchner, J. W., Arhonditsis, G. B., Jordan, P., Kronvang, B., Halliday, S. J., Skeffington, R. A., Rozemeijer, J. C., Aubert, A. H., Rinke, K., & Jomaa, S. (2016). Sensors in the Stream: The High-Frequency Wave of the Present. Environmental Science and Technology, 50(19), 10297–10307. https://doi.org/10.1021/acs.est.6b02155
Article CAS Google Scholar
Rozemeijer, J. C., & Broers, H. P. (2007). The groundwater contribution to surface water contamination in a region with intensive agricultural land use (Noord-Brabant, The Netherlands). Environmental Pollution, 148(3), 695–706. https://doi.org/10.1016/j.envpol.2007.01.028
Article CAS Google Scholar
Rozemeijer, J., & Griffioen, J. (2004). Effecten Van Waterconservering Op De Waterkwaliteit in Noord-Brabant En Limburg., H2O(20), 30–33.
Google Scholar
Rozemeijer, J. C., & Van der Velde, Y. (2014). Temporal variability in groundwater and surface water quality in humid agricultural catchments; Driving processes and consequences for regional water quality monitoring. Fundamental and Applied Limnology, 184(3), 195–209. https://doi.org/10.1127/1863-9135/2014/0565
Article CAS Google Scholar
Rozemeijer, J., Van der Velde, Y., De Jonge, H., Van Geer, F., Broers, H. P., & Bierkens, M. (2010a). Application and evaluation of a new passive sampler for measuring average solute concentrations in a catchment scale water quality monitoring study. Environmental Science and Technology, 44(4), 1353–1359. https://doi.org/10.1021/es903068h
Article CAS Google Scholar
Rozemeijer, J. C., Van der Velde, Y., Van Geer, F. C., De Rooij, G. H., Torfs, P. J. J. F., & Broers, H. P. (2010b). Improving load estimates for NO3 and P in surface waters by characterizing the concentration response to rainfall events. Environmental Science and Technology, 44(16), 6305–6312. https://doi.org/10.1021/es101252e
Article CAS Google Scholar
Schoumans, O. F., Chardon, W. J., Bechmann, M. E., Gascuel-Odoux, C., Hofman, G., Kronvang, B., Rubæk, G. H., Ulén, B., & Dorioz, J. M. (2014). Mitigation options to reduce phosphorus losses from the agricultural sector and improve surface water quality: A review. Science of the Total Environment, 468–469, 1255–1266. https://doi.org/10.1016/j.scitotenv.2013.08.061
Article CAS Google Scholar
Schoumans, O. F., & Groenendijk, P. (2000). Modeling Soil Phosphorus Levels and Phosphorus Leaching from Agricultural Land in the Netherlands. Journal of Environmental Quality, 29(1), 111–116. https://doi.org/10.2134/jeq2000.00472425002900010014x
Article CAS Google Scholar
Shen, L. Q., Amatulli, G., Sethi, T., Raymond, P., & Domisch, S. (2020). Estimating nitrogen and phosphorus concentrations in streams and rivers, within a machine learning framework. In Scientific Data 7(1). https://doi.org/10.1038/s41597-020-0478-7
Schroder, J. J., Aarts, H. F. M., Van Middelkoop, J. C., Schils, R. L. M., Velthof, G. L., Fraters, B., & Willems, W. J. (2007). Permissible manure and fertilizer use in dairy farming systems on sandy soils in The Netherlands to comply with the Nitrates Directive target. European Jounal of Agronomy, 27, 102–114. https://doi.org/10.1016/j.eja.2007.02.008
Article Google Scholar
Shah, A. D., Bartlett, J. W., Carpenter, J., Nicholas, O., & Hemingway, H. (2014). Comparison of random forest and parametric imputation models for imputing missing data using MICE: A CALIBER study. American Journal of Epidemiology, 179(6), 764–774. https://doi.org/10.1093/aje/kwt312
Article Google Scholar
Sharpley, A., Jarvie, H. P., Buda, A., May, L., Spears, B., & Kleinman, P. (2013). Phosphorus legacy: Overcoming the effects of past management practices to mitigate future water quality impairment. Journal of Environmental Quality, 42(5), 1308–1326. https://doi.org/10.2134/jeq2013.03.0098
Article CAS Google Scholar
Skidmore, M., Andarge, T., & Foltz, J. (2022). Climate change and water pollution: the impact of extreme rain on nutrient runoff in Wisconsin. Agricultural and Applied Economics Association. https://doi.org/10.22004/ag.econ.322113
Spijker, J., Fraters, D., & Vrijhoef, A. (2021). A machine learning based modelling framework to predict nitrate leaching from agricultural soils across the Netherlands. Environmental Research Communications, 3.
Tyralis, H., & Papacharalampous, G. (2017). Variable selection in time series forecasting using random forests. Algorithms, 10(4). https://doi.org/10.3390/a10040114
Tyralis, H., & Papacharalampous, G. (2019). A brief review of random forests for water scientists and practitioners and their recent history. Water.
Van der Grift, B., Broers, H. P., Berendrecht, W. L., Rozemeijer, J. C., Osté, L. A., & Griffioen, J. (2016). High-frequency monitoring reveals nutrient sources and transport processes in an agriculture-dominated lowland water system. Hydrology and Earth System Sciences Discussions, 12(8), 8337–8380. https://doi.org/10.5194/hessd-12-8337-2015
Article Google Scholar
Van der Salm, C., Van den Toorn, A., Chardon, W. J., & Koopmans, G. F. (2012). Water and nutrient transport on a heavy clay soil in a fluvial plain in the Netherlands. Journal of Environment Quality, 41, 229–241.
Article Google Scholar
Visser, H., Evers, N., Bontsema, A., Rost, J., Niet, A. De, Vethman, P., Mylius, S., Linden, A. Van Der, Roovaart, J. Van Den, & Gaalen, F. Van. (2022). What drives the ecological quality of surface waters ? A review of 11 predictive modeling tools. Water Research, 208, 117851. https://doi.org/10.1016/j.watres.2021.117851
Withers, P. J. A., & Haygarth, P. M. (2007). Agriculture, phosphorus and eutrophication: A European perspective. Soil Use and Management, 23(SUPPL. 1), 1–4. https://doi.org/10.1111/j.1475-2743.2007.00116.x
Article Google Scholar
Withers, P. J. A., Neal, C., Jarvie, H. P., & Doody, D. G. (2014). Agriculture and eutrophication: Where do we go from here? Sustainability (switzerland), 6(9), 5853–5875. https://doi.org/10.3390/su6095853
Article Google Scholar
Wolpert, D. H. (1992). Stacked generalization. Neural Networks, 5(2), 241–259.
Article Google Scholar
Zhang, Y., & Thorburn, P. J. (2022). Handling missing data in near real-time environmental monitoring: A system and a review of selected methods. Future Generation Computer Systems, 128, 63–72. https://doi.org/10.1016/j.future.2021.09.033
Article Google Scholar
Zhang, Y. F., Thorburn, P. J., Xiang, W., & Fitch, P. (2019). SSIM - a deep learning approach for recovering missing time series sensor data. IEEE Internet of Things Journal, 6(4), 6618–6628. https://doi.org/10.1109/JIOT.2019.2909038
Article Google Scholar

Download references

Acknowledgements

We would like to acknowledge the Farmer, Water Board Rhine and IJssel, Thilo Behrends and the P-TRAP group for the discussions. The project is closely related to the activities of Vruchtbare Kringloop Achterhoek and Liemers (VKA), a partnership between LTO Noord, Water Board Rijn en IJssel, ForFarmers, Vitens, FrieslandCampina, Rabobank and the Province of Gelderland that focuses on circular agriculture and sustainable water and soil management.

Funding

This project has received funding from the European Union’s Horizon 2020 research and innovation program under the Marie Sklodowska-Curie grant agreement no. 813438.

Author information

Authors and Affiliations

Unit Inland Water Systems, Daltonlaan 600, 3584 BK, Utrecht, The Netherlands
Victoria Barcala & Leonard Osté
Unit Subsurface and Groundwater Systems, Daltonlaan 600, 3584 BK, Utrecht, The Netherlands
Joachim Rozemeijer & Kevin Ouwerkerk
Water Board Rijn and IJssel, Liemersweg 2, 7006 GG, Doetinchem, The Netherlands
Laurens Gerner

Authors

Victoria Barcala
View author publications
You can also search for this author in PubMed Google Scholar
Joachim Rozemeijer
View author publications
You can also search for this author in PubMed Google Scholar
Kevin Ouwerkerk
View author publications
You can also search for this author in PubMed Google Scholar
Laurens Gerner
View author publications
You can also search for this author in PubMed Google Scholar
Leonard Osté
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

The authors confirm their contribution to the paper as follows: study conception and design: Laurens Gerner, Joachim Rozemeijer, and Leonard Osté; data collection: Joachim Rozemeijer and Victoria Barcala; analysis and interpretation of results: Victoria Barcala, Joachim Rozemeijer, Kevin Ouwerkerk, and Leonard Osté; draft manuscript preparation: Victoria Barcala. All authors reviewed the results and approved the final version of the manuscript.

Corresponding author

Correspondence to Victoria Barcala.

Ethics declarations

Competing interests

The authors declare no competing interests.

Conflict of interest

The authors declare no conflict of interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 464 KB)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Barcala, V., Rozemeijer, J., Ouwerkerk, K. et al. Value and limitations of machine learning in high-frequency nutrient data for gap-filling, forecasting, and transport process interpretation. Environ Monit Assess 195, 892 (2023). https://doi.org/10.1007/s10661-023-11519-9

Download citation

Received: 25 October 2022
Accepted: 13 June 2023
Published: 27 June 2023
DOI: https://doi.org/10.1007/s10661-023-11519-9

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Value and limitations of machine learning in high-frequency nutrient data for gap-filling, forecasting, and transport process interpretation

Abstract

Similar content being viewed by others

Applications of Data-driven Models for Daily Discharge Estimation Based on Different Input Combinations

Applicability of machine learning techniques for multi-time step ahead runoff forecasting

Intelligent Data Analytics Approaches for Predicting Dissolved Oxygen Concentration in River: Extremely Randomized Tree Versus Random Forest, MLPNN and MLR

Introduction