Response of streamflow and nutrient loads in a small temperate catchment subject to land use change

The aim of this study was to quantify the effect of land use change (LUC) implemented to meet nutrient load targets for a freshwater lake in New Zealand. We used the Soil and Water Assessment Tool (SWAT) model in combination with a non-parametric statistical test to determine whether afforestation of 15% of a subcatchment area was adequate to meet assigned nutrient load targets. A regional management authority set nutrient load targets of reduction in total nitrogen (TN) by 0.9 t yr−1 and reduction in total phosphorus (TP) by 0.05 t yr−1 to avoid eutrophication in the receiving waters of a freshwater lake. The load reduction was designed to be achieved through 200 ha of LUC from pasture to trees. Analysis of nutrient loads before, during, and following LUC shows that a 15% increase in forest cover decreased the annual flow (7.2%), TP load (33.3%), and TN load (13.1%). As flow and water quality observations were discrete and at irregular intervals, we used a parametric test and the SWAT model as different lines of evidence to demonstrate the effect of afforestation on flow and water quality. Policymakers concerned with decisions about LUC to improve the quality of receiving waters can benefit from applying our findings and using a statistical and numerical modelling framework to evaluate the adequacy of land use change to support improvements in water quality. Supplementary Information The online version contains supplementary material available at 10.1007/s10661-023-11828-z.


Introduction
Increased human population and consumption patterns have created global demand for agricultural products, which has often adversely affected ecosystem services and biodiversity (MEA, 2005;Nelson et al., 2010).Landscape disturbance associated with agricultural production leads to increased mobilisation of sediment and nutrients, with many mitigation actions designed to minimise or offset the effects of disturbance and avoid impaired water quality (Mello et al., 2020).These actions often include land use change (LUC), many involving afforestation, and altered management practices.An understanding of the hydrological connectivity between areas of sediment and nutrient mobilisation and receiving waters within catchments is important to predict the impacts of LUC and other management practices on receiving water environments (Leibowitz et al., 2018).
Hydrological models are valuable tools to quantify discharge and nutrient loads that affect the water quality and ecological status of receiving waters and to assess temporal changes associated with LUC (Milly et al., 2008).Ability to capture these features can help identify nutrient 'hotspots' and 'hot moments' (Harms & Grimm, 2008) and support informed policy decisions about prioritising areas within catchments and sub-catchments for intensive management and LUC.Confidence in assessments of LUC impacts on flow and nutrient loads is often based on demonstrating 'good' fit of the model output to observations, using statistical metrics for calibration and validation periods (Chen et al., 2019).However, one of the challenges in modelling catchment processes can be a paucity of high-quality data.For 'data-poor' catchments, non-parametric tests provide a useful statistical tool to compensate for the effects of limited data, particularly when data are non-normally distributed (Mohebbi & Akbariyeh, 2021) or there are missing data (Naddeo et al., 2013).Non-parametric percentile estimation techniques can complement conventional methods that rely on statistical assessment of model calibration and validation periods to describe the distribution of flow and changes in water quality variables.
Declines in water quality have been widely attributed to the effects of agricultural development, deforestation, urbanisation, and land use change.Distributed hydrological models, which include input parameters in relation to land use, have been applied to assess LUC impacts on runoff and changes in nutrient loads (El-Khoury et al., 2015).The Soil and Water Assessment Tool (SWAT; Arnold et al., 1995) has been used to predict the quality and quantity of water and assess the environmental impacts of LUC.SWAT accounts for spatial heterogeneity of land use and the effectiveness of erosion control measures (Arnold et al., 1995) as well as land use change by updating an initial prescribed LUC distribution through the simulation period (Moriasi et al., 2019).SWAT is the most widely used hydrological model globally (Tan et al., 2020) and is supported by an online literature database of over 4500 articles from 1993 to 2023 (e.g.www.card.iastate.edu/swat_articles/).A review of SWAT model applications (Gassman et al., 2007) citing more than 250 journal articles indicated that model performance was mostly adequate to make assessments about LUC and non-point source (NPS) pollution and land use-climate change impact (Douglas-Mankin et al., 2010).Khoi et al. (2022) used the SWAT model to assess water quantity and quality (e.g.sediment, total nitrogen, and total phosphorus) responses to changes in different land use types; agricultural, bush, forestland, and urban areas.This study highlighted an inverse relationship between forest cover and water quantity.The SWAT model has also been used to assess LUC effects on water quantity as a result conversion of forest to oil palm plantation (Tan et al., 2015).Other studies (Hoghooghi et al., 2021) have used SWAT to assess the effect of Brassica carinata plantations on flow, sediment, and nutrients loads in an experimental watershed in the upper Suwannee River Basin, South-Central Georgia.Plantations covering 12% of the watershed resulted in reductions in total sediment, mineral phosphorus, and nitrate loads ranging from 3.8 to 14.0%.A study using SWAT in the Big Sunflower River Watershed in the Mississippi Delta, USA, assessed the effect of spatiotemporal changes in land use classes, such as forest, cropland, corn, and soybean (Ni et al., 2021), while Nguyen et al. (2019) used SWAT to generate scenarios of flow and pollutant loads arising from land use intensification.
Agriculture has intensified in New Zealand over recent decades, resulting in increased NPS nutrient discharge that has impaired the water quality of streams (McDowell et al., 2014;Snelder et al., 2018) and lakes (Abell et al., 2020a, b).Declining water quality ranks as the foremost environmental concern in surveys of the New Zealand population (Hughey et al., 2019).In response to these concerns, the New Zealand government has implemented a National Policy Statement for Freshwater Management (NPS-FM) (Ministry for the Environment, 2020).The NPS-FM directs regional and local councils to set limits for concentrations of sediment, phosphorus, nitrogen, and microbial contaminants in waterbodies (Journeaux et al., 2017).Changes in land use or land management are the primary levers used by councils to achieve compliance with the NPS-FM (McDowell et al., 2014;Snelder et al., 2018).For example, for New Zealand's largest lake, Taupō (area = 616 km 2 ; Central Volcanic Plateau, North Island), market-based environmental policies involving nitrogen trading have been established (Duhon et al., 2015), linked to LUC from agriculture to forest to reduce NPS loads.In other areas, regional plans and policies include directives for catchment management to meet nitrogen and phosphorus load targets for lakes (Abell et al., 2011), many involving afforestation.However, implementing interventions can be challenging due to a diversity of land uses, active LUC, and climate variability in New Zealand (Me et al., 2018), as well as difficulty in change detection (e.g. for nutrient loads).
To implement effective management policies to reduce the impacts of nutrients, assessments are required to identify the relative contributions of nonpoint and point sources of nutrients (Abell et al., 2020a, b;Roygard et al., 2012).Elliott et al. (2005) used the SPAtially Referenced Regression on Watershed (SPARROW) model and the National (New Zealand) River Water Quality Network dataset to indicate that non-point loads exceed 96% of the total nutrient load across New Zealand; NPS pollution is therefore one of the most challenging environmental problems for New Zealand (Snelder et al., 2018).
Our study was designed for sub-catchments where flow and water quality data are typically collected infrequently; a situation common to many other catchments where LUC (afforestation) is being implemented to meet water quality targets.The study focused on a lake catchment in New Zealand where there were infrequent discrete observations of flow and water quality, which is typical of many catchments globally.The regional council action plan for the lake indicated that 200 ha LUC from pasture to trees would be adequate to avoid eutrophication of the downstream receiving waters of Lake Ōkareka.Building upon findings from prior research (Farley et al., 2005;Vertessy, 2001;Zhang et al., 2001;Zhang et al., 2003), our hypothesis was centred around the notion that there would be a significant difference in the average streamflow and nutrient loads between pre-and post-LUC values.The hypothesis is consistent with the concept that when an area undergoes afforestation, the dense canopy and extensive root network of trees and vegetation increase rainwater interception, reducing surface runoff and subsequent streamflow, as well as supporting nutrient retention.We tested the extent of LUC required to meet the nutrient load targets for the lake.The objectives of this study were to (1) test for significant changes in flow and nutrients measured pre-and post-LUC, (2) implement a non-parametric percentile estimation technique to examine shifts in the frequency domain between observations and model output, (3) assess the ability of the SWAT model to reproduce observed streamflow and nutrient loads arising from LUC, and (4) assess the resulting changes in seasonal variability of water quantity and quality.First, we analysed a time series of observations of responses of flow and nutrients to LUC at the sub-catchment scale within the Lake Ōkareka catchment.Next, we simulated streamflow and catchment nutrient export in a sub-catchment of the lake using SWAT.Last, we combined results from non-parametric tests and the SWAT model to describe the changes in flow and water quality variables.

Study area
The Ōkareka catchment (Fig. 1) is located in the Rotorua District of the North Island of New Zealand.The geology of the area is characterised by volcanic and sedimentary deposits (Healy, 1964) with a series of rhyolitic domes and lava flows flanking the west, south, and east of the lake (Nairn, 2002).The catchment area is 1500 ha and drains to monomictic Lake Ōkareka.The lake is important recreationally and culturally.Elevations in the catchment range from 335 to 685 m above mean sea level.The mean annual precipitation in the area is about 1300 mm, with considerable interannual variation.Annual mean air temperature, relative humidity, and wind speed measured 15 km from the lake at Rotorua airport are 13.4 °C, 82%, and 3.6 m s −1 , respectively (National Climate Database; cliflo.niwa.co.nz/).The modelling study was conducted on the Millar sub-catchment (area = 384 ha) because of the availability of relatively long-term monitoring data (Fig. 1).Millar stream is the main surface water inflow to the lake and this sub-catchment covers 26% of the total catchment area (Fig. 1).Summary of the land use information in the Millar stream sub-catchment indicates that compared with a 2003 baseline, 17 ha (4.4% of Millar sub-catchment area) had been planted in pine (Pinus radiata) or native trees by 2011, 53 ha (13.8%) by 2013, and 57 ha (15%) by 2015.Recent studies of the lake have classified its trophic state as mesotrophic (Trolle et al., 2010) after it was previously assessed to be oligotrophic (McColl, 1972).Mesotrophic refers to a water body with moderate nutrient levels, supporting balanced plant growth and diverse aquatic life, and oligotrophic denotes waters with low nutrient levels, resulting in clear, pristine conditions and limited biological productivity (Burns et al., 2009).Millar sub-catchment has moderately steep topography, and pasture and native forest cover typified by a climate of cold winters and mild to cool summers.After 2010, there has been significant afforestation including Pinus radiata and native forest.

Testing for temporal changes in observed flow and water quality variables
We used parametric, cumulative deviation (CDT), and non-parametric percentile estimation to describe distributions of streamflow and nutrient loads as we had discrete and instantaneous observations of discharge, sediment, and nutrient concentrations (Mohebbi & Akbariyeh, 2021;Naddeo et al., 2013).We split data based on the proportion of observations below and above a given value in a selected percentile distribution (McCluskey & Lalkhen, 2007).Flow and loads of ammonium (NH 4 -N), nitrate (NO 3 -N), total nitrogen (TN), and total phosphorus (TP) for each monitoring day, and 5-day total rainfall, normalised by the respective maximum values were examined pre-and post-LUC.In addition, we used descriptive statistics such as the mean, standard deviation (SD), minimum, and maximum values to compare Q, NH 4 -N, NO 3 -N, TP, and TN for pre-and post-LUC periods.
Tests for homogeneity of observations were based on the adjusted partial sum ( S * k ), which is the cumulative deviation from the mean (ӯ) for n observations of a series y 1 , y 2 , y 3,… y n : The rescaled adjusted partial sum, S * * k , was calcu- lated by dividing the adjusted partial sum ( S * k ) by the standard deviation, D y : Departures from homogeneity are defined by Ơ, the maximum S * * k value in the data series.A critical value of a variable is defined by the ratio of Ơ to √n (Buishand, 1982) where n is the number of observations for NH 4 -N, NO 3 -N, TN, TP, or 5-day mean rainfall.

Soil and Water Assessment Tool (SWAT) model
SWAT is a semi-distributed catchment model that requires static geospatial information (terrain topography, soil, and land use) and dynamic (meteorological) data.Using a digital elevation map (DEM), the model delineates a catchment into multiple sub-catchments, which may be split further into homogeneous Hydrologic Response Units (HRUs) with unique combinations of land use, soil, and slope (Neitsch et al., 2011).Nutrient and sediment transformations and losses are modelled separately for each HRU, with predictions accumulated to obtain the total for each sub-catchment and then routed to the associated reach and catchment outlet through the channel network (Neitsch et al., 2011).Daily observed precipitation, temperature, wind speed, solar radiation, and relative humidity are used as hydrological data to run the model in a daily time step (Neitsch et al., 2011).The climate data is from 2002 to 2021.
SWAT contains a nutrient routing module that simulates total nitrogen (TN) and total phosphorus (TP) concentration (Gassman et al., 2007), including constituents of organic nitrogen (ORGN), ammonium (NH 4 -N), and nitrate (NO 3 -N) (Gassman et al., 2007).TP is defined as the sum of organic phosphorus (1) (ORGP) and mineral phosphorus (MINP) with these fractions commonly taken to be the particulate and dissolved phases of phosphorus (Me et al., 2018).

SWAT inputs
A digital elevation model with 25-m horizontal resolution was used to generate the stream network and sub-catchments of Lake Ōkareka.  1, Fig. 2).Land use in the Ōkareka catchment is a mix of forest (native, pine), pasture, rangeland, medium density residential area, and water bodies (Lake Ōkareka).Daily rainfall, temperature, solar radiation, relative humidity, and wind speed model inputs were obtained from Rotorua Airport Automatic Weather Station (The National Climatic Database [New Zealand]; cliflo.niwa.co.nz/).Observed streamflow and concentrations of NH 4 -N, NO 3 -N, TN, and TP were measured instantaneously from Millar Road stream gauge station (Fig. 1) and collected by the Bay of Plenty Regional Council (BoPRC).Most of the measurements were collected monthly, which partly reflects the paucity of comprehensive flow and water quality data.The Millar Road stream gauge station had only limited sediment concentration data, which was deemed not adequate to support model calibration and validation sediment concentration.Detailed information on model input and data sources is presented in Table 1.
Changes in land use have been implemented in the Ōkareka catchment to meet nutrient load targets for the lake (Burns et al., 2009) use change were accounted for through the land use update (LUP) routine in the SWAT model (Moriasi et al., 2019).LUP sets a land use change update at user-specified day.

SWAT performance assessment
A standalone SWAT Calibration and Uncertainty Program (Arnold et al., 2012) was used to optimise  SWAT model parameters.Selection and configuration of parameters for streamflow and water quality followed the recommendations of Abbaspour et al. (2015).We chose the SUFI-2 routine in the SWAT-CUP program to perform a global sensitivity analysis for identification of sensitive SWAT model parameters (Abbaspour et al., 2004).This routine is computationally efficient in limiting the number of iterations to quantify the effect on model outputs of changes in an input parameter (Yang et al., 2008).Our choice of the range of flow and nutrient-related parameter ranges was consistent with previous studies in the wider region (Me et al., 2018).After identification of model parameter ranges, sensitive parameters, and an autocalibration procedure in SWAT-CUP, we checked and fine-tuned model parameter values using manual adjustment of parameters for a calibration period In our model setup, calibration and validation were based on measurements of flow and nutrient concentration during pre-afforestation period (2002)(2003)(2004)(2005)(2006)(2007)(2008)(2009)(2010) as there was no visible land use change in the catchment.Since reforestation, as a land use change, occurs gradually over time rather than instantaneously, calibrating the model during this period could lead to uncertainties and errors, as the system is in a state of transition.On the other hand, choosing data sets prior to land use change provides a more stable and consistent baseline for model calibration, helping to avoid overfitting and providing more confidence in the model predictions.In addition, our choice of pre-afforestation data (from 2002 to 2010) for model training is related to several other reasons.First, we set the model in view of future LUC scenario development and assess the versatility of the model under dynamic catchment and climatic factors; next, according to Moriasi et al. (2019), SWAT model has a well-established and considerably versatile system that represents transient LUC in the modelling environment.Therefore, SWAT calibrated and validated with pre reforestation data could be used to predict post reforestation streamflow and water quality response; finally, it appears that the region receives high proportion of flow from base flow stores (Me et al., 2015) and has only moderate seasonality in both pre-and post-afforestation periods.
We used coefficient of determination (R 2 ), Nash-Sutcliffe Efficiency (NSE), and percent bias (PBIAS) as statistical evaluation criteria for calibration and validation (Moriasi et al., 2007).Equations for each of these measures are given in Table S1.NSE (range: − ∞ to 1.0) gives the relative magnitude of the residual variance compared to the variance in the measured data (Nash & Sutcliffe, 1970).PBIAS describes the tendency of simulated output data to be consistently smaller or larger than the observed data in the mean with values < 25% indicating satisfactory model performance.Moriasi et al. (2019) distinguished model performance ratings of very good, good, satisfactory, and unsatisfactory based on values of these metrics (see Table S1).

Flow-weighted mean concentrations
Flow-weighted mean nutrient concentration was used to reduce the bias from sparse, discrete observations for the Millar sub-catchment (Meals et al., 2013).We used a nondimensional sensitivity analysis (Chiew, 2006;Sankarasubramanian et al., 2001) to assess the sensitivity of flow to changes in rainfall (i.e.ratio of relative change in streamflow to relative change in rainfall) and sensitivity of nutrient loading to variations in flow.The elasticity of each nutrient variable was quantified from the ratio of relative changes in nutrient load to flow.

Results
Observations of changes pre-and post-land use change Landsat aerial photographs of the Lake Ōkareka catchment were used to delineate afforested areas, shown as yellow polygons in the Millar sub-catchment (Figs. 1 and 3, Table S2).Comparison of pre-and post-LUC observed streamflow (mm d −1 ) and nutrient (NH 4 -N, NO 3 -N, TN, and TP) loads (kg d −1 ) show a post-LUC decrease in mean, minimum, and maximum values across all variables, except for a small increase in minimum Q and mean and minimum TP (Table 2; see also Figs S1-S2).There was a significant decrease in post-LUC mean streamflow (p < 0.05), NH 4 -N (p < 0.01), NO 3 -N (p < 0.01), and TN (p < 0.05) but not TP (p < 0.05).Changes in mean values pre-and post-LUC were also assessed using a cumulative deviation t-statistic test (CDT) (Ơ, Table S3).CDT test results showed a significant post-LUC decrease (p < 0.05) in streamflow and loads of most of the nutrients (NH 4 -N, NO 3 -N, and TN) except for TP (p > 0.05).
Non-parametric analysis was used to examine changes in streamflow and nutrient load observations pre-and post-LUC using percentile distributions.Post-LUC streamflow and water quality were mostly lower except for TP and for all percentile streamflow classes, there was a post-LUC decrease in Q, NH 4 -N, NO 3 -N, and TN (Fig. 4).TP observations had a similar post-LUC decrease for Q < 10% (i.e. the lowest 10%) and Q ≥ 90% (the highest 10%), but an increase for Q between 10 and 85%.

SWAT model calibration and performance
We used a sensitivity analysis to provide insights into the contributions of SWAT model parameters and input data to streamflow and nutrient loads (Table 3).Sensitive flow parameters included initial Soil Conservation Services (SCS), runoff curve number for moisture condition (CN2), threshold depth of water in the shallow aquifer required for return flow (GWQMN), maximum canopy storage (CANMX), baseflow alpha factor (ALPHA_BF), aquifer percolation coefficient (RCHRG_DP), and effective hydraulic conductivity in main channel alluvium (CH_K2).Average slope steepness (HRU_SLP), a terrain parameter, varies with elevation in the catchment and affects lateral flow within the kinematic storage model in SWAT (Sloan & Moore, 1984).Lateral flow entering the stream reach is also affected by lateral flow travel time (LAT_TTIME) and slope length for lateral subsurface flow (SLSOIL).
Sensitive parameters for TN included organic nitrogen in the base flow (LAT_ORGN), organic N enrichment ratio (ERORGN), nitrate-nitrogen concentration in the shallow aquifer (SHALLST_N), and reach rate constant for hydrolysis of organic nitrogen to ammonium-nitrogen (BC3).For organic phosphorus, the sensitive model parameters were baseflow (LAT_ORGP), soluble phosphorus concentration in the groundwater (GWSOLP), and organic P enrichment ratio (ERORGP).Parameters varied in the number of variables they influenced, e.g.being specific to one variable (e.g.TP), a subset of the variables (e.g.NH 4 -N and, therefore also TN), or a combination of several variables.
SWAT-CUP produced an optimised calibration for which summary statistics (R 2 , NSE, and PBIAS) are shown in Table 4. Based on R 2 values (Moriasi et al., 2007), the goodness-of-fit of calibration and validation performance varied from very good (Q) to unsatisfactory during calibration (NO 3 -N, TN, and TP) and validation (NH 4 -N, NO 3 -N, TN, and TP), although R 2 values for nutrient constituents are in the upper range of values summarised for a number of studies by Arhonditsis and Brett (2004).Additionally, according to PBIAS model performance rating criteria (Legates & McCabe, 1999), flow and nutrient variables are generally acceptably captured during calibration and validation but with slight overestimates and poor performance for the calibration of NH 4 -N and TP.
Streamflow (mm d −1 ) and nutrient load (kg d −1 ) observations and model output for Millar gauging station were split into pre-LUC (2002)(2003)(2004)(2005)(2006)(2007)(2008)(2009)(2010) and post-LUC (post-2010), which included periods of calibration (2002)(2003)(2004)(2005) and validation (2006)(2007)(2008)(2009)(2010), Table 4 (Fig. 5).Modelled loads of NO 3 -N and TN mostly reflected seasonal variations of low flow in summer and high flow in winter, while modelled TP load also showed a similarly high level of flow dependence.Loads of NO 3 -N accounted for a high fraction of the TN (Fig. 6).The median values for model simulations were generally comparable with those of observations (Table 4).Graphical comparison of observed and predicted streamflow and NH 4 -N, NO 3 -N, TN, and TP loads for each monitoring day for Millar stream (Fig. 5) shows high similarity of modelled and observed datasets for most of the variables other than NH 4 -N, with simulated values generally exceeding the observed values.
All nutrient variables showed satisfactory performance during calibration except for TP (Moriasi et al., 2007).Visual observations of observed and predicted streamflow and water quality data for each monitoring day indicate that model simulations are satisfactory for all variables, other than TP, which could be resulted from unavailability of sediment data for calibration.
Referring to previous other studies, Moriasi et al. (2007) proposed performance categories for 'monthly time-step' model results and judged as satisfactory if NSE is greater than 0.50 and PBIAS < 25% for streamflow and suggested 'appropriate adjustments' in their criteria for annual or daily evaluation time steps.Whether this type of classification is reasonable or not needs further discussion.The four-level classification (very good, good, satisfactory, and unsatisfactory) introduced by Moriasi et al. (2007) was originally grounded in statistical analyses of diverse outcomes from hydrological model simulations.However, studies indicated that the model performance assessment criteria employed by different models lack uniformity (Lin et al., 2017).Gupta et al. (2009) argued that in order to maximise NSE, the variability has to be underestimated which turns out to be an important feature of watersheds.McCuen et al. (2006) indicated that NSE is a single-valued measure which is overly sensitive to extreme values (Legates & McCabe, 1999) that can be influenced by several factors such as sample size, outliers, and magnitude bias and the NSE values should be accompanied by additional metrics.In this regard, we have also used R 2 and PBIAS metrics to support the results, which are in the range of very good and satisfactory, respectively.
In view of other model performance metrics, our comparison of model output against measurements of flow indicated satisfactory (R 2 = 0.73) and good (R 2 = 0.73) values during calibration and validation, respectively (Moriasi et al., 2015).For both calibration and validation, a satisfactory PBIAS value < −25% has been achieved.From Moriasi et al. (2007), calibration and validation of nutrient variables with PBIAS < ±70% are considered satisfactory.As such, except TP during calibration, we found that all the nutrient variables ranged from satisfactory to very good performance both during calibration and validation.For analyses of other permeance metrics flow applied in our study, a NSE value of 0.5 for the daily value could be regarded as acceptable.
We also reviewed other relevant studies that have used SWAT model.A review of model performance (Moriasi et al., 2007)   based on the results of five experimental watersheds in USA, reported PBIAS values from 2.9 to −91.7% for calibration and 2.7 to −155.6% for validation.
Changes in model output pre-and post-land use change Changes in streamflow, loads, and flow-weighted mean concentrations (FWMC) of nutrients from the SWAT model output were used to assess effects of the afforested in the Millar stream sub-catchment (Table 5).We conducted a comparison between the results generated by the SWAT model for two conditions of land use: one assuming the land use did not change post-2010 (reference simulation) and another where the existing land use changes since 2011 were taken into account (post-LUC simulation).Post-LUC, there was a 7.2% reduction in mean annual streamflow, while nutrient loads showed a reduction, from  Vol:. ( 1234567890) 13.1% for TN to 33.3% for TP.The change in nutrient concentrations was considerably smaller, with a reduction from 6.4% for TN to 28.1% for TP.
The elasticity values used to test for sensitivity of nutrient loads to changes in flow ranged from 0.27 (NH 4 -N) to 4.61 (TP) loads (Table 5), i.e. a 1% change in flow produced a 4.61% change in TP.For nutrients, the FWMC elasticity values ranged from 0.88 (TN) to 3.89 (TP).
Table 3 Sensitive SWAT model parameters, ranges, units, and calibrated values for streamflow (Q), total nitrogen (TN), and total phosphorus (TP) as indicated in SWAT-CUP.Note: The 'v' before the parameter indicates replacement of exist-ing value by the calibrated values, the 'a' denotes addition of the calibrated value to the model default, and 'r' is for relative change, a relative change was applied by multiplying the existing value by ( 1 Note: Superscripts a, b, c, and d denote very good, good, satisfactory, and unsatisfactory performances, respectively (Moriasi et al., 2007;Moriasi et al., 2015).These studies do not provide a clear guideline on the performance category for NH 4 -N and NO 3 -N For S LUC , there was a significant reduction in annual streamflow (p < 0.01), and nutrient loads (p < 0.001, Fig. 7) and concentrations (p < 0.001; NO 3 -N, TN, and TP) except for NH 4 -N (p > 0.05).Model output of monthly average streamflow and nutrient loads and concentrations from 2011 to 2021 showed marked reduction (p < 0.001) between the reference and the post-LUC cases, except for NH 4 -N concentrations, which showed no significant change (p > 0.05, Fig. 8).We also calculated the ratios of variables post-and pre-LUC from SWAT model time series of mean monthly flow and nutrient loads and concentrations.There was a statistically significant post-LUC reduction from the reference simulation for each variable in the SWAT simulations (Fig. 9).

Discussion
Nutrient runoff from agricultural land represents a high proportion of the total nutrient yield from many catchments in New Zealand and has resulted in eutrophication of a high proportion of lakes (Abell et al., 2020a, b).As a result, water quality restoration programs involving LUC have been set up with an underlying premise that afforestation can reduce nutrient loads and address water quality problems (McDowell et al., 2020).We tested the adequacy of afforestation in a lake sub-catchment using analysis of observed data that included a non-parametric statistical analysis and a numerical modelling approach using the SWAT model.The analyses included evaluation of changes in stream discharge and nutrient concentrations before, during, and following LUC.Our hypothesis that LUC would decrease flow and nutrient loads was confirmed from results of non-parametric analyses and SWAT model outputs, with mean annual streamflow decreasing by 7.2%, nutrient loads decreasing from 2.0 to 33.3% (depending on nutrient species), and concentrations decreasing from 6.4 to 28.1% in response to LUC.

Modelling streamflow and nutrient loads: land use change effects
Our study was conducted in a lake catchment with temporally sparse observations and discontinuous flow measurements.This situation is typical  of numerous small to medium-sized catchments globally.We used non-parametric percentile distributions to assess pre-LUC and post-LUC observations and complement the analysis with SWAT model outputs, providing dual lines of evidence about the effects of afforestation and counter the situation common to many catchments where data are less than optimal.The non-parametric technique can be applied to other gauged and poorly gauged catchments where afforestation has been prescribed to meet water quality targets.The use of instantaneous flow measurements instead of daily (24-h) averages may increase model uncertainty, but such instantaneous measurements are common in monitoring programs which have budgetary and personnel constraints.The error term in our statistical comparisons of instantaneous measured flow against daily modelled values therefore includes a component of flow variability within each day, but the comparison of flow was still 'very good' (Moriasi et al., 2007) in our study.A study by Abell and Hamilton (2013) intensively monitored two streams for short periods in a catchment adjacent to Lake Ōkareka.Their study indicated that under baseflow, there was little flow variation within each day, but greater variability for stormflows, which varied between the streams and with recent rainfall history.Consideration should be given to continuous streamflow monitoring to better quantify contaminant loads in streams as we found that variations in flow are the dominant driver of daily load estimates compared with variations in nutrient concentrations.
Non-parametric analysis of observations and model predictions between reference simulation and post-LUC periods indicated that the model generally reproduced observed changes accurately.Model simulations showed a 7.2% reduction in mean annual streamflow corresponding to a 15% conversion of pasture to forest.Our findings are comparable to previous studies (Farley et al., 2005;Zhang et al., 2001) that have considered the effect of afforestation on changes in catchment streamflow conditions, largely due to increased evapotranspiration.A study based on analysis of measured data for over 250 catchments worldwide (Zhang et al., 2001) indicated that under similar climatic conditions, pasture crops use less water than trees, leading to greater runoff.Zhang et al. (2003) conducted a study in the Murray-Darling Basin in Australia to predict the effects of afforestation on the annual flow regime.They found that afforestation with blue gum plantations, representing 16% and 21% of sub-catchment area for two tributaries in the Goulburn Broken catchment, resulted in 8% and 14% mean annual streamflow reductions, respectively.They highlighted that the rate of reduction in mean annual streamflow initially accelerated leading up to canopy closure, as expected from changes in interception and evapotranspiration driven by leaf area, surface albedo, and rooting characteristics of plants (Vertessy, 2001).
The streamflow reduction may equilibrate as quickly as 5 years, with substantial changes within 2 to 4 years of afforestation (Farley et al., 2005).Our study showed that the rate of reduction in mean annual streamflow increased over time, from 3.2% in 2011 to 15.8% in 2021.Pine trees (Pinus radiata) have been used extensively for afforestation in the Lake Ōkareka catchment and more widely in New Zealand.This species has a high growth rate, high rainfall interception, and large soil moisture storage capacity due to a deep root system, as well as high evapotranspiration rates.Evaporation of intercepted precipitation is low in pastoral lands, and a 20 to 40% increase can be achieved with LUC to pine plantations (Duncan, 1995;Le Maitre, 1999), which in turn reduces streamflow and nutrient export to receiving waters (Cannell, 1999).Trends in river water quality have been routinely analysed and reported in New Zealand (Snelder et al., 2021) and model-based assessment of the reference and current water quality status of 1031 lakes indicates a general trend of increased nutrient concentrations associated with land use intensification (Abell et al., 2020a, b).However, no direct attribution of water quality changes to land use was given.Our assessment of the water quality response to a 15% conversion of pasture to pine in a small (sub) catchment using a dual statistical and numerical model approach indicates a post-LUC reduction in nutrient loads ranging from 13.1% (TN) to 33.3% (TP).Responses of nutrient concentrations in our study were somewhat equivocal, however, with the majority of the nutrient load reduction due to a decrease in flow.The modelled decrease in annual nutrient export between the post-LUC case and a hypothetical baseline case showed load reductions ranging from 8.5% (NH 4 -N) in 2012 to 25.4% (TN) in 2021.Furthermore, as for streamflow, the annual reduction in nutrient loading increased through time.This finding is likely attributed to the initial rapid increase in forest canopy cover and the concurrent reduction in streamflow (Zhang et al., 2003).
Although direct comparison between afforestation canopy effects and nutrient load responses is difficult because of climate variability (e.g.interannual variability in rainfall), it is important to note that the TN and TP loads in post-LUC years were well below those of the first year of LUC (i.e. 2011).The initial slower response in the first year may be attributable not only to lags as trees mature but also a damping effect from a legacy of decades of pastoral land use, with soils likely to have had higher levels of inorganic nutrients.If an area remains under pine plantation (e.g. after 2-3 decades), the initial decreases in stream nutrient concentrations may begin to slow (Hughes & Quinn, 2019).Most pine plantations in New Zealand are harvested after about 25 years, however, which leads to a temporary spike (~1-2 years) in nutrient Fig. 8 Eleven years (2011 to 2021) of monthly average SWAT model output for streamflow (mm), load (kg), and concentrations (g m −3 ) of ammonium (NH 4 -N), nitrate (NO 3 -N), total nitrogen (TN), and total phosphorus (TP) for post-LUC simulations and pre-LUC reference simulations loads due to reduced evapotranspiration and abundant degradable organic matter (e.g.slash and other plant material (Hamilton, 2005).We included analysis of covariance (ANCOVA) of monthly data to assess the effects of precipitation variability on nutrient loads and to predict the relative contribution of flow and Fig. 9 Ratio of post-LUC to reference simulation (pre-LUC) monthly mean flow and nutrient loads and concentrations from SWAT model output.Nutrients include ammonium (NH 4 -N), nitrate (NO 3 -N), total nitrogen (TN), and total phosphorus (TP).Asterisks for each variable show statistical significance of differences in loads and concentrations: * is p < 0.05, ** is p < 0.01, and *** is p < 0.001 concentration on nutrient loads.Flow contributed more to NO 3 -N and TN load reductions than concentration while for NH 4 -N loads, which contribute a smaller component of TN than NO 3 -N, were nearly equally apportioned between flow and concentration.

Land cover effects on nutrient cycling
We examined the potential impacts of land cover change on catchment hydrological and water quality responses.Changes in land use from agriculture to forest alter nutrient dynamics through changes in pools, availability, and cycling of material.Afforestation tends to generate a greater proportion of nutrients in organic form, particularly organic nitrogen, which can be relatively refractory or resistant to rapid decomposition.This characteristic of organic nitrogen in forested ecosystems can lead to slower nutrient turnover rates compared to agricultural systems.The accumulation of organic matter on the forest floor contributes to a build-up of nutrients over time, promoting nutrient retention and storage within the ecosystem.
Afforestation also alters nutrient cycling patterns within the soil and the availability oof nutrients for plants.Forests often develop a well-defined litter layer and organic-rich soil, which foster a complex and diverse soil microbial community.This soil microbial community promotes nutrient cycling and sustains the long-term fertility of forest ecosystems.A study by Łaszewski et al. (2021) revealed pronounced effects of agriculture compared with forestry on seasonal variations on NO 3 -N in a receiving stream.Wang et al. (2017) showed high potential for N 2 O emissions, denitrification activity, and denitrifier abundances in rural farmland soils compared turfgrass soils.Tong et al. (2019) observed substantial soil organic carbon and TN losses when woodland and grassland were converted to cultivated land types.Similarly, Kooch and Noghre (2020) demonstrated due to variations in soil organic matter while in the tropics, Kuma et al. (2022) showed high rates of nutrient loss in agricultural catchments.These findings collectively reveal that land cover plays a crucial role in nutrient cycling but, as shown in our study, delineation to individual subcatchment or stream scale may be required to identify causal relationships.

Catchment management implications
The New Zealand government is implementing freshwater restoration programs, supported by national policies and regional implementation of action plans.The regional council action plan target to reduce the potential for degradation of water quality of Lake Ōkareka was a 0.9 t yr −1 reduction in TN and 0.05 t yr −1 reduction in TP, estimated to be achieved through a 200 ha LUC from pasture to trees (EBoP, 2004).We conducted this research to assess the validity of land use change in Lake Ōkareka according to nutrient load targets in an action plan but also to assess resilience to greater variability of rainfall, which may be expected with climate change (Millar et al., 2007).The conversion of low intensity pastoral land use to forest in a sub-catchment (Millar) representing 15% of the total lake catchment area, with reduction in nutrient loads of 13.1% (TN) to 33.3% (TP) in the subcatchment, appears to validate the approach taken for Lake Ōkareka.Several other lakes in the region are subject to action plans, with reductions in nutrient export vital to enhance lake water quality (Monaghan et al., 2021).
Our study provides policy support and scientific evidence for a model catchment, being suitable for others where similar catchment actions are planned at national, regional, and local levels.Nationally, the Rotorua Regional Council is required to demonstrate that it complies with the National Policy Statement for Freshwater Management (Ministry for the Environment, 2020) that requires regional councils to set nutrient concentrations and exceedance criteria so that diffuse nutrient pollution does not adversely impact receiving waters.Our findings can support the regional council in assessments if further conversion of pasture to trees is adequate to avoid eutrophication of the downstream receiving waters of Lake Ōkareka and thereby serve as a model for other catchments in New Zealand where water quality of receiving waters is impaired.
At regional level, our study can help to inform policy makers in the implementation of the Regional Water and Land Plan (Rule 11) which seeks to address the export (diffuse discharge) of nutrients from land use activities (Foster & Kivell, 2009).In addition, this research provides important insights on the extent of land use change required and the latency of catchment response to land 1418 Page 20 of 24 Vol:.( 1234567890) use change.This information is especially pertinent where large land use change investments are being made to support water quality goals.Water quality constituents have different responses to rainfall, which drives much of the short-term variability in concentrations.The particulate P component of TP is closely related to surface runoff and sediment erosion events while dissolved N varies mostly with changes in nitrate due to changes in groundwater delivery during storm events (Abell et al., 2013).Our study had some limitations, one of which was inadequate observed data to support modelling of suspended sediments.A substantial component of the phosphorus load could be expected to be in particulate form although there can be interactions with dissolved phosphorus through adsorption and desorption, depending on phosphorus isotherm kinetics of the sediment (Peryer-Fursdon et al., 2015).Future monitoring of the stream with continuous streamflow and nutrient and suspended sediment measurements would help to separate short-term changes (e.g. from storm events) from long-term changes due to effects of land use change and improve quantification of loads of particulate and dissolved phosphorus.Measured data and calibration of the model on paired subcatchments (e.g.control and test cases) would also allow identification of LUC or climate effects when one catchment is subjected to LUC treatment while the other remains as a control.The paired catchment should be located in close proximity to ensure similar characteristics of climate, soils, area, slope, aspect, and vegetation.

Conclusions
Conversion of pastoral areas to forestry is purported to improve water quality of receiving waters but few studies have used different approaches to quantify the water quality changes.Validation is often complicated by sparse monitoring data that makes it difficult to detect changes due to mitigation actions.This situation is common for numerous small to medium-sized catchments globally and complicates the delivery of evidence-based science to inform policy for land use change.We used a dual approach involving statistical analysis of discrete observations and dynamic simulation modelling to provide independent lines of evidence to assess the effect of land use change on water quantity and quality.This approach is valuable for assessing whether actions designed to meet nutrient load targets for a catchment will be attained and over what time frame, including the lag times for land use change to impact hydrology and water quality.The numerical modelling approach can offer insights into how a future climate might impact the targets and ultimately the water quality of receiving waters.To assess how water quality has varied during different flow regimes and to understand future trends under different climatic conditions, further research and policy directions would be needed to have a network of stream gauges with continuous flow and water quality measurements representative of the spatiotemporal variations.In addition, TP is closely related to surface runoff and sediment erosion events.However, in sparsely monitored catchments where there is not sufficient sediment data, this connection is not adequately represented.Therefore, future research and monitoring efforts should focus on obtaining continuous sediment data to improve our understanding and modelling of catchment TP dynamics.(lakes380.com)for funding through an Endeavour Fund grant from the Ministry of Business, Innovation and Employment.We thank the Bay of Plenty Regional Council (BoPRC) and Rotorua District Council for providing soil, land use, climate, streamflow, and water quality data.We also thank Niroy Sumeran (BoPRC) and Wang Me (University of Waikato) and Max Mackay, Michele Hosking, and Penny MacCormick (BoPRC) for providing data to support this study.Gebiaw T. Ayele acknowledges Solomon S. Demissie for his unwavering support.

Fig. 1
Fig. 1 Location map of the study area including Millar sub-catchment in the Lake Ōkareka catchment and the gauging station.Inset: Map of New Zealand showing location of Lake Ōkareka . We inferred spatiotemporal changes in land use with Google Earth Engine based on Landsat 7 Enhanced Thematic Mapper from 2003 to 2013 and Landsat 8 Operational Land Imager from February 2013.Images for years 2003, 2011, 2013, and 2015 were verified with BoPRC observations, focusing on identification of areas of afforestation on pastoral land.Water quality and quantity responses to land 1418 Page 6 of 24 Vol:.(1234567890)

Fig. 2
Fig. 2 Digital elevation (a) and land use (b) in the Lake Ōkareka catchment of 2002-2005, with 2000  to 2001 used as a model 'warmup' period.Streamflow was calibrated first, followed sequentially for concentrations of TP, TN, NO 3 -N, and NH 4 -N, and several iterations of this procedure were used.Observed streamflow and concentration data were collected for each monitoring day in a single event and sporadic in nature.The observed flow and water quality data (NH 4 -N, NO 3 -N, TN, and TP) were partitioned into pre-and post-LUC categories and assigned for calibration, validation, and land use change impact analyses.Data were of unequal length with three distinct phases of pre-LUC calibration (2002 to 2005) and validation (2006 to 2010) and post-LUC (post-2010).

Fig. 3
Fig. 3 Landsat 7 (2011) and Landsat 8 (post 2013) aerial photographs of Lake Ōkareka showing post land use change in a 2011, b 2013, c 2015, and d 2022.All images are taken on the

Fig. 5
Fig. 5 Scatterplots of observed and predicted streamflow and loads (a) of NH 4 -N (b), NO 3 -N (c), TN (d), and TP (e) for Millar stream for each monitoring day

Fig. 7
Fig. 7 SWAT simulation output showing percent relative change in annual total streamflow (black solid lines) and loads and annual mean concentrations of ammonium (NH 4 -N), nitrate (NO 3 -N), total nitrogen (TN), and total phosphorus (TP) between reference simulation pre-LUC and post-LUC.Red and
org.nz/about-lcdb).Land use and soil data were available at 25-m resolution (Table

Table 1
Data description and source to configure the SWAT model for Lake Ōkareka.NH 4 -N, NO 3 -N, TN, and TP are ammonium, nitrate, total nitrogen, and total phosphorus, respectively