Introduction

Computational process models of wastewater treatment plants (WWTPs) have long been utilized for the benefits they afford1. Such benefits include the investigation of alternative design and operational scenarios in pursuit of improved efficiency or performance, and insight into system behaviour under specific scenarios2. Through modelling, the high demands on time and resources that such investigations would accrue through physical experimentation can be overcome with relative ease, without compromising confidence in the results. For example, a model can allow a process engineer to identify the size of anoxic tank required to enable sufficient denitrification in a system without building many different sized anoxic tanks.

While complex models can provide insights into the simulated system that would otherwise be obscured by simplified models, this only holds true if the physical system is accurately depicted. The calibration of biological WWTP models is typically the most demanding phase of model development in terms of the time, effort and financial resources required to collect the necessary data2,3,4,5. In fact, several protocols such as BIOMATH, STOWA, WERF and HSG have been developed to guide the calibration of activated sludge (AS) models6,7,8,9,10. While each of these approaches offers distinct differences, they share common demands for large amounts of data that must be yielded by way of site surveys, intensive sampling and respirometric/titrimetric batch tests, all of which place high demands on resources (see 3 for a critical review).

As environmental models, biological wastewater treatment (WWT) process models are inherently complex. By their very nature, they are high-dimensional and non-linear due to the vast array of kinetics and stoichiometry they include. For instance, even the popular Activated Sludge Model (ASM) in its earliest form consisted of five stoichiometric parameters describing the biochemical reactions, 14 kinetic parameters and 13 differential equations1. It is no surprise then that WWT models become over-parameterised with regard to the available observations when these models are further developed to provide a more detailed representation of the underlying processes or to model emerging technologies and contaminants11,12,13,14.

Increased complexity is unavoidable when multiple technologies are combined, as in the more recent integrated fixed-film activated sludge (IFAS) solution. This technology combines suspended biomass colonies with attached biofilm colonies to draw on the advantages of each, increasing functional biomass and therefore treatment capacity at a reduced plant footprint compared to conventional AS or biofiltration systems15,16,17,18. The main limitation of this technology, however, is the substantial increase in energy demand for aeration compared to AS19, which necessitates continued modelling efforts towards optimizing its efficiency.

While IFAS has been the subject of much modelling work to date15,20,21,22,23, the calibration procedure has received little attention in contrast to AS models, despite the added intricacy. Brockmann et al. 24 attempted to calibrate an IFAS system by following the good biofilm reactor modelling practice (GBRMP) proposed by Boltz et al.25; however, they found this inadequate for the purpose and proposed the need for a protocol specific to this technology. A major challenge the authors highlighted was the over-parameterization associated with the hybridization of the two components.

Automatic calibration has received increasing interest over the last few decades10,26,27,28,29. While manual calibration by way of experimentally determined parameter value estimates is conventional, the process remains laborious and resource-heavy, may introduce errors and lacks objectivity3,30. Furthermore, it is ineffective in estimating the many unmeasurable parameters found in WWT models31. Automatic calibration refers to the use of an optimization algorithm to estimate the parameter values, and there are increasing examples of this approach being taken in the modelling of wastewater treatment and waste collection systems26,30,32,33,34,35.

Kim et al. 32 demonstrated the feasibility of using a genetic algorithm (GA) to calibrate the first ASM (ASM1), while a study by Zeferino et al. 26 into the model planning of a regional wastewater system utilized a Particle Swarm Optimization (PSO) algorithm to calibrate their model. Ye 34 also reported the use of an immune algorithm (IA) and a hybridized form of the IA and PSO (IPSO) to calibrate an ASM. In developing the numerical optimal approaching procedure (NOAP) for systematically calibrating the third of the ASM series (ASM3), Zhu et al. 10 employed a GA to successfully automate the parameter estimation of two AS system types. More recently, Du et al. 35 employed an improved cuckoo search (ICS) algorithm to calibrate up to 7 sensitive parameters in ASM1, yielding accurate simulations.

In the current work a classical optimization technique is employed: the Nelder–Mead (NM) simplex method36. While the last few decades have seen an accelerating growth of novel optimization algorithms (see 37 for a review), the NM simplex approach remains relevant because it is robust, easy to implement and understand, and a favoured algorithm for multi-dimensional, unconstrained optimization problems without derivatives38. Despite being a vintage approach to optimization, it has recently been shown to remain competitive with more “intelligent” algorithms such as the PSO algorithm39.

As a preceding step to the parameter estimation, global sensitivity analyses (GSAs) were employed to deal with the issue of over-parameterisation in the IFAS model and reduce the optimization problem to a manageable number of parameters. Regardless of whether a manual or automatic calibration approach is taken, GSAs can mitigate over-parameterisation by identifying the parameters most influential to the model output, which warrant the most focus and resource allocation, while leaving less influential parameters at default values9,10,40,41,42,43,44,45. This was a primary concern of Gernaey et al.2, who argued that the automated calibration approach may be stifled by a lack of identifiability in parameters, which may lead to only minor adjustments in a considerable number of secondary parameters.

The use of GSA during calibration affords additional benefits to the process. While GSAs can identify the most critical parameters for estimation, they are also able to check for collinearity of parameters by investigating higher-order interactions43,45,46. Both capabilities are important because non-influence and collinearity with other parameters are the two main sources of practical parameter non-identifiability13. Furthermore, because GSA allows the modeller to apportion the uncertainty that is propagated through the model due to each parameter and their interactions45, it may be coupled with a Monte Carlo based uncertainty analysis to quantify the uncertainty for each factor, instilling confidence in the model47,48,49.

The objectives of this study are therefore to identify the kinetic and stoichiometric parameters most influential to the model, to estimate the values of any identified parameters by way of an optimization algorithm, and to validate the NM simplex algorithm as remaining an appropriate and effective tool for this role. Finally, the uncertainty of the model outputs relating to the identified influential parameters is also assessed by way of uncertainty analysis.

Methodology

Data collection

Due to travel limitations relating to the COVID-19 pandemic, access to the modelled system was prevented. Under these circumstances historic data was instead required for the purpose of calibration and validation. Fortunately, a pilot-scale version of the modelled WWT system had been operated in Rishikesh, Uttar Pradesh, India over the course of 2015 as a research platform for this new technology. During this time it was subject to intense investigation that resulted in a series of publications50,51,52,53,54,55. Most notable was a study into the effects of DO stress on the system51, which provided a detailed account of influent and effluent data alongside the operational strategy of three DO concentrations (0.5, 2.5, 4.5 mg/L). For the purpose of the present work, data from the 2.5 mg/L DO regime was used as observed data for the calibration, while the remaining two regimes were utilized in the validation phase as independent data. The investigation from the calibration regime was reported in greater detail in an earlier publication50.

Subject system to be modelled

The system itself consisted of an aerated reactor (20 m3) with a 6 m2 footprint and a separate circular settling tank with a sloping bottom, totalling a volume of 4.2 m3 with a surface loading area of 1.25 m2. The aeration tank contained 64 rectangular panels of loop-knitted polypropylene fabric (Biotextile Cleartec®, Jager, Germany) that occupied ~ 0.5% of the reactor volume in order to facilitate the proliferation of attached biomass. Further details regarding dimensions are provided in other works51,55.

Site conditions for the model calibration reflected the conditions of DO regime 2 (i.e. 2.5 mg/L) as reported by Singh et al. 51. These include a flow rate (Q) of 1.8 m3/h to give a hydraulic retention time (HRT) of 11.1 h, a bulk liquid temperature of 26 °C and a DO concentration within the bulk liquid of the HySAF reactor of 2.5 mg/L. The waste activated sludge (WAS) rate was set to 2.2 m3/day with a maintained recycle activated sludge (RAS) rate of 3.7 m3/h (2.5 Q) to ensure a mixed liquor suspended solids (MLSS) concentration of 2000 mg/L in the reactor and a sludge retention time (SRT) of 11 days as reported by Singh et al.51. Operational characteristics of DO regimes 1 and 3, as given by Singh et al. 51, were used for validation purposes following the calibration.

Model development

The commercial modelling software GPS-X™ Version 8.0 (Hydromantis Environmental Software Solutions, Inc.) was used to model the system. This software, while robust and feature-rich, was chosen particularly for the recent integration of the Python coding language in its latest installment, which facilitated the use of external Python libraries to perform subsequent analyses in this study.

As shown in Fig. 1, the developed model comprised of the following objects:

  • wastewater (WW) influent from the sewer,

  • aerobic IFAS reactor,

  • settlement tank (secondary).

Figure 1

Modelled system diagram.

Figure 1 also displays several key parameters that needed to be defined within the model objects including the mixed liquor suspended solids (MLSS) concentration, the recycle activated sludge (RAS) stream, the waste activated sludge (WAS) stream and media content within the IFAS reactor.

Wastewater influent from the sewer

The influent model is commonly regarded as the most important element during model calibration3. For the purpose of this study the “CODSTATES” model was chosen from the Comprehensive Model Library (MANTIS2LIB) in GPS-X. This model is normally recommended following a full influent characterization including manual calculation of the state variables56; this was not possible here because the influent data were taken from a previous publication51. Nevertheless, this influent model was found to produce good agreement with the composite variables following an iterative process with only a few changes to the default values, namely adjusted inputs of 626 mg/L, 44.2 mg/L, 33.7 mg/L and 0.63 mgVSS/mgTSS for total COD, TKN, ammonia nitrogen (NH3) and the VSS/TSS ratio respectively, as shown in Table S1 (see Supplementary Material). These changes returned 367.9 mg/L for TSS (4.3% deviation) and 322.9 mg/L (5.4% deviation) for total carbonaceous BOD, both of which remained well within the reported ranges shown in Table 1.

Table 1 Influent, effluent and operational characteristics of HySAF pilot plant in Rishikesh, India used for calibration. Value range provided where present in literature 51.

Aerobic IFAS reactor

The comprehensive model (MANTIS2) was used for this study. This state-of-the-art model was developed by Hydromantis to extend the ASM2d and the Anaerobic Digestion Model No. 1 with greater versatility56. It was chosen for this study primarily for its comprehensive approach to total nitrogen (TN) removal, which is a key priority in global water management57. The hybrid system object was selected, which combines the suspended growth model discussed above with the GPS-X biofilm model56. Kinetic and stoichiometric parameters are not differentiated between colony types (suspended or attached) in this object. User-defined inputs can be found in Table S1 and correspond to the values given by Singh et al.51.

Secondary clarifier

To represent the clarifier, a one-dimensional, non-reactive model (SIMPLE1D) was used. This model is preferred when biological reactions in the clarifier can be ignored56. It divides the sludge blanket into 10 equal layers, considers only vertical flow and assumes that all incoming solids are distributed instantaneously and uniformly across the surface of the feed layer56. User-defined inputs can be found in Table S1 and correspond to the values given by Singh et al.51.

Model calibration

Following adjustments of the user-defined values, the steady-state model was run to determine the accuracy of its representation of the published data. The model outputs assessed were BOD, TN and TSS. At this stage the model results were not in good agreement with the observed results, as shown in Fig. S1 (Supplementary Material), so the calibration phase commenced.

Sensitivity and uncertainty analysis

Sensitivity analyses (SA) are often used by modellers to identify the relative influence that different factors, such as parameters and their interactions, have on the output of a model45. This can aid the modeller in several ways, for instance to simplify models, prioritise parameters for calibration, identify errors in the model and apportion uncertainty between model factors58. Parameters that are found to have little or no influence can be considered negligible and can either be left at default values, given an arbitrary value within the parameter range, or even removed from the model as a form of model reduction, thus simplifying the model and aiding interpretation without compromising accuracy. The parameters of greatest influence, i.e. those to which the model output is most sensitive, are identified as the parameters that require the most experimental focus during calibration in determining their true values. As the influence of a parameter increases, so does its potential to contribute error; such parameters therefore require further analysis to quantify the uncertainty they invoke and, where possible, minimize it.

A key source of uncertainty in bio-models is derived from the multiple kinetic and stoichiometric parameters that compose the model. These are often difficult, if not impossible to measure due to experimental limitations, and kinetic parameters especially can demonstrate large variability between treatment plants59. Despite this, often only a small number of input parameters account for the majority of the uncertainty or variability in model outputs43. Once these influential factors have been identified and their share of the uncertainty has been apportioned by the SA, an uncertainty analysis (UA) can be performed to quantify this uncertainty held by each input by observing its propagation through the model. Together SA and UA provide a way to achieve some transparency into the reliability of the model adding (or removing) credibility to conclusions drawn58.

For this study, the Python coding platform (Ver 3.7) was used for both the UA and SA, with both SAs utilizing the SALib library60. The developed code can be found in the supplementary material (Python Scripts S1–S3).

Method of Morris

The Method of Morris (MoM) is similar to the more commonly used local sensitivity analysis (LSA) in that it only deviates one parameter at a time; however, it conforms to the GSA definition because the parameter value is started from different points in the possible input space multiple (\(r\)) times and the results are averaged. While other GSA approaches such as the Sobol method are able to provide a more detailed analysis, they are more computationally expensive. For example, an analysis with 14 parameters and a sample size of 6,000 would require \(6.5\times {10}^{5}\) evaluations61. In contrast to the Sobol method, the MoM is able to detect first-order effects at only a 100th of the computational expense, and at a 10th of the cost when investigating second-order effects62,63,64,65.

A more efficient approach that is commonly employed is to first use the MoM to screen for non-influential parameters that can be exempted from the Sobol analysis, in order to reduce the number of model evaluations needed to obtain adequate decomposition61,66,67. Brockmann and Morgenroth 68 demonstrated the MoM to be just as effective as variance-based methods in distinguishing between influential and non-influential parameters, which was similarly observed by Herman et al.61. However, in both cases it was recommended to follow up with the more computationally-demanding methods to gain sufficient variance information or to better rank the more sensitive parameters. For the purpose of this study the MoM was used as an initial screening of the 68 kinetic and stoichiometric parameters that constitute the model, to identify the 10 most influential parameters with regard to each model output (BOD, TN, TSS). These parameters were then subjected to further examination under the Sobol analysis. The screened parameters of the investigated model can be found in Table S2.

The MoM uses approximations of the first order partial derivatives of a model called elementary effects (EE) to characterise model sensitivity69. In order to determine the EE for each input factor, the differentiation of the model output \(y\) is calculated in relation to each input factor \({x}_{i}\) as according to the following equation69:

$${EE}_{i}= \frac{\partial y}{\partial {x}_{i}}= \frac{y\left({x}_{1}, {x}_{2},\dots , {x}_{i}+ \Delta , \dots ,{x}_{k}\right)-y\left(x\right)}{\Delta },$$
(1)

where \({x}_{i}\) is the \(i\)th factor of the model, \(\Delta \) is the set perturbation factor by which the base value of \({x}_{i}\) is deviated, \(y\left(x\right)\) is the model output evaluated at the nominal values of the model factors, and \(y\left({x}_{1}, {x}_{2},\dots , {x}_{i}+ \Delta , \dots ,{x}_{k}\right)\) is the model output corresponding to the predetermined deviation \(\Delta \) in \({x}_{i}\).

The finite distribution of the EE, \({F}_{i}\), for each factor is obtained by performing \(r\) calculations of the EE at independent, randomly sampled points in the input space. This method, as described by Morris 62, provides \(r\) observations of \({F}_{i}\) for \(k\) factors at a cost of \(r(k+1)\) model evaluations.

The mean of the EE, \({\mu }_{i}\), is determined using the following equation69:

$${\mu }_{i}=\frac{1}{r}\sum_{t=1}^{r}{EE}_{it}.$$
(2)

The standard deviation, \({\sigma }_{i}\), is calculated as follows69:

$${\sigma }_{i}=\sqrt{\frac{1}{r-1}\sum_{t=1}^{r}{{(EE}_{it}-{\mu }_{i})}^{2}}.$$
(3)

If the model is non-monotonic, elementary effects of opposite signs may occur and cancel one another, such that the \({F}_{i}\) distribution misrepresents an influential parameter as non-influential. Campolongo et al. 63 provided a revision to the method that replaced the mean of the elementary effects, \(\mu \), with the absolute mean of the EE, \({\mu }^{*}\). This prevents the cancellation of effects of opposite signs and provides an index of the magnitude of a parameter's influence, by which its overall influence on the output can be ranked. While \({\mu }^{*}\) is widely used for screening influential parameters, it is primarily a measure of non-influence63. In contrast, the standard deviation of the elementary effects, \({\sigma }_{i}\), captures the variance and detects the influence of factor interactions or non-linearity.

In order to calculate the absolute mean of the EE, \({{\mu }_{i}}^{*},\) the following equation was used69:

$${{\mu }_{i}}^{*}= \frac{1}{r}\sum_{t=1}^{r}|{EE}_{it}|.$$
(4)

Parameters can then be ranked using the following equation69:

$${B}_{i}= \sqrt{{{\mu }_{i}}^{*2}+ {{\sigma }_{i}}^{2}}.$$
(5)

By deriving both the absolute mean and standard deviation of the EE, the effects of investigated parameters can be categorized as follows69:

  1. Negligible (low average, low standard deviation).

  2. Linear and additive (high average, low standard deviation).

  3. Non-linear and/or interactions with other parameters (high standard deviation).
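
As a minimal illustration of Eqs. (2)–(5) (separate from the study's actual scripts, which are provided as Python Scripts S1–S3), the sketch below computes \({\mu }_{i}\), \({{\mu }_{i}}^{*}\), \({\sigma }_{i}\) and the ranking score \(B\) from a hypothetical, randomly generated matrix of elementary effects:

```python
import numpy as np

# Hypothetical matrix of elementary effects: r trajectories (rows) by k parameters
# (columns); a real analysis would obtain EE_it from Eq. (1) via model evaluations.
rng = np.random.default_rng(0)
r, k = 100, 68
ee = rng.normal(size=(r, k))

mu = ee.mean(axis=0)                # Eq. (2): mean of the EE
sigma = ee.std(axis=0, ddof=1)      # Eq. (3): standard deviation of the EE
mu_star = np.abs(ee).mean(axis=0)   # Eq. (4): absolute mean, robust to sign cancellation
b = np.sqrt(mu_star**2 + sigma**2)  # Eq. (5): overall ranking score

ranking = np.argsort(b)[::-1]       # parameter indices, most to least influential
```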

Results from the Morris method are commonly presented graphically as bar charts and Morris plots66,70. The Morris plot takes both the \({\mu }^{*}\) and \(\sigma \) of the elementary effects (EE) and plots them against each other in a two-dimensional graph. By doing so, parameters with a low mean and low standard deviation are plotted in the bottom left-hand corner and can be considered non-influential69,70; conversely, the parameters of influence are plotted towards the top right-hand corner.

Drawbacks of MoM include its tendency to provide only qualitative information by ranking input factors, without quantifying the influence of each factor on the output66, as well as its inability to correctly rank the most influential parameters despite being highly effective at separating influential from non-influential parameters43,61. Where this method is only to be used as a screening method to reduce computational expense of a further variance-based method such as the Sobol analysis, these limitations become less important.

For this study a sample size (n) of 100 was used with the MoM analysis. Evidence on the adequacy of smaller sample sizes is mixed. Herman et al. 61 compared the efficacy of the MoM in differentiating between influential and non-influential parameters across a range of sample sizes and found it just as effective at 20 n as at 100 n, even in highly-parameterized models, with little need for higher sample sizes. In contrast, Cosenza et al. 71 showed that at lower sample sizes (10–20 n) the method was unable to converge and neither predicted the correct number nor designation of influential/non-influential factors in a wastewater biofilm model when compared to variance-based methods.
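
For context, a minimal sketch of such a screening run using the SALib library60 is given below. The parameter names, default values and bounds are hypothetical stand-ins for the 68 parameters of Table S2, and run_gpsx_model is a placeholder for a wrapper around a GPS-X steady-state simulation:

```python
import numpy as np
from SALib.sample.morris import sample as morris_sample
from SALib.analyze.morris import analyze as morris_analyze

# Hypothetical defaults for three screened parameters; bounds assumed at +/- 50%.
defaults = {'X52': 0.666, 'X53': 0.54, 'X16': 0.62}
problem = {
    'num_vars': len(defaults),
    'names': list(defaults),
    'bounds': [[0.5 * v, 1.5 * v] for v in defaults.values()],
}

def run_gpsx_model(x):
    """Placeholder response surface standing in for a GPS-X simulation."""
    return 40.0 * x[0] + 30.0 * x[1] - 5.0 * x[2]

X = morris_sample(problem, N=100, num_levels=4)   # r = 100 trajectories
Y = np.array([run_gpsx_model(x) for x in X])      # r(k + 1) model evaluations

Si = morris_analyze(problem, X, Y, num_levels=4)
for name, mu_star, sigma in zip(Si['names'], Si['mu_star'], Si['sigma']):
    print(f'{name}: mu* = {mu_star:.2f}, sigma = {sigma:.2f}')
```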

Sobol method

The Sobol method derives the sensitivity indices by attributing variance to single model inputs as first-order Sobol indices (\({S}_{i}\)), and variance due to interactions between multiple parameters as higher-order Sobol indices45. A second-order effect (\({S}_{ij}\)) is characterised by an interaction between two parameters, a third-order effect by an interaction between three parameters, and so on. When \({\sum }_{i}{S}_{i}\ne 1\), the presence of interactions is indicated, as well as their influence relative to \({S}_{i}\)45. Total-order Sobol indices (\({S}_{Ti}\)) are determined as the sum of apportioned variance for any parameter and its interactions. In linear models the sum of \({S}_{Ti}\) should equal 1, while in non-linear models the sum should exceed 1 (Ref.45). When \({S}_{Ti}\) is observed to be substantially higher than \({S}_{i}\) for any given factor, it is indicative of a higher-order interaction occurring with other parameters. Over-parameterised models can therefore be reduced by discounting parameters that demonstrate a low \({S}_{Ti}\), as these can be assumed to hold negligible influence due to the inclusiveness of this index66.

In this study the Sobol indices (SI) were calculated through random sampling (n = 10,000) by way of Monte Carlo simulation (MCS). Four steps were taken to apply the Sobol procedure as described by Ref.72: (1) the uncertainty ranges for each input parameter were defined as ± 50% of the default value in GPS-X; (2) Sobol sampling was used to sample the range of each parameter; (3) uncertainty was propagated through the model by repeated simulations for each combination of the input parameters within their ranges; (4) the acquired data was post-processed to calculate first-order SI, second-order SI, and total-order SI as follows73:

$$\text{First-order SI},\, {S}_{i}=\frac{{V}_{i}}{V},$$
(6)
$$\text{Second-order SI},\, {S}_{ij}=\frac{{V}_{ij}}{V},$$
(7)
$$\text{Total-order SI},\, {S}_{Ti}= {S}_{i}+ {\sum }_{j\ne i}{S}_{ij},$$
(8)

where the partial variance \({V}_{i}=V[E\left(Y|{X}_{i}\right)]\) is the variance of the conditional expectation, held over the unconditional variance \(V\), with \({X}_{i}\) representing the input parameters and \(Y\) the model output or objective function73. The total contribution of variance of each input parameter (\(p\) in total) and their interactions can thus be determined by the following decomposition:

$$V\left(Y\right)= {\sum }_{i=1}^{p}{V}_{i}+ {\sum }_{i=1}^{p-1}{\sum }_{j=i+1}^{p}{V}_{ij}+\dots + {V}_{1,\dots ,p}.$$
(9)

The role of the Sobol method in automatic calibration is to test for possible collinearity and reduce the calibration effort of a potentially over-parameterized model by differentiating between parameters of influence and non-influence. While this may be apparent, it is appropriate to define the level of influence required to warrant calibration and signify collinearity. In this study a threshold of 0.05 was defined as used in previous work42.
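
As with the screening step, a minimal sketch of the four-step Sobol procedure using SALib60 is shown below; the problem definition and model are the hypothetical placeholders introduced in the Morris sketch, with a reduced sample size for brevity:

```python
import numpy as np
from SALib.sample import saltelli
from SALib.analyze import sobol

# Illustrative problem definition (hypothetical parameters; bounds at +/- 50%
# of assumed defaults, per step 1 of the procedure above).
defaults = {'X52': 0.666, 'X53': 0.54, 'X16': 0.62}
problem = {'num_vars': 3, 'names': list(defaults),
           'bounds': [[0.5 * v, 1.5 * v] for v in defaults.values()]}

def run_gpsx_model(x):
    """Placeholder response surface standing in for a GPS-X simulation."""
    return 40.0 * x[0] + 30.0 * x[1] - 5.0 * x[2]

# Steps 2-3: Saltelli sampling and propagation; generates N(2k + 2) model runs.
X = saltelli.sample(problem, 1024, calc_second_order=True)
Y = np.array([run_gpsx_model(x) for x in X])

# Step 4: decomposition into first-, second- and total-order indices.
Si = sobol.analyze(problem, Y, calc_second_order=True)
influential = [n for n, st in zip(problem['names'], Si['ST']) if st > 0.05]
print('S_i :', Si['S1'])
print('S_Ti:', Si['ST'])
print('Parameters above the 0.05 threshold:', influential)
```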

Distinguishing between influential/non-influential parameters

Following each of the GSAs, it was necessary to distinguish between influential and non-influential parameters in order to reduce the model for a more focused calibration. Following the MoM analysis, the output values for each parameter were first normalised within the 0–1 range. By normalizing the data, the capacity of the MoM to identify influential parameters could then be compared against that of the more detailed Sobol analysis once thresholds of influence had been defined74,75. With regards to the MoM, both Ramin et al. 74 and Valverde-Pérez et al. 76 applied a threshold of 0.1 \({\mu }^{*}\), while Hsieh et al. 75 used thresholds of 0.1 \({\mu }^{*}\) for the MoM and 0.05 for the Sobol analysis to good effect. Previous work by Zhang et al. 42 also proposed an influence threshold of 0.05 for the Sobol analysis, this value requiring an influential parameter to account for at least 5% of the variance. In line with these past definitions, a threshold of 0.1 \({\mu }^{*}\) was used for the MoM and 0.05 for the Sobol analysis.

Data was normalised using the following equation:

$${z}_{i}= \frac{({x}_{i}-{x}_{min})}{({x}_{max}-{x}_{min})},$$
(10)

where \({z}_{i}\) is the \(i\)th normalized value and \({x}_{i}\in ({x}_{1}, \dots , {x}_{n})\).
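
A minimal sketch of this step, with hypothetical \({\mu }^{*}\) scores in place of the study's actual MoM outputs:

```python
import numpy as np

def min_max_normalize(x):
    """Eq. (10): rescale a data set to the 0-1 range."""
    x = np.asarray(x, dtype=float)
    return (x - x.min()) / (x.max() - x.min())

# Hypothetical mu* scores for four screened parameters.
mu_star = np.array([0.90, 0.05, 0.30, 0.02])
z = min_max_normalize(mu_star)
influential = z >= 0.1  # MoM influence threshold adopted in this study
print(z, influential)
```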

Parameter estimation for calibration

A computational optimization exercise was performed for the purpose of estimating the kinetic and stoichiometric parameters of greatest influence by means of the maximum likelihood function. Further details of the maximum likelihood function utilized can be found in Hydromantis56.

With process models typically being of a non-linear nature, analytical determination of their optimized values is often not possible. To combat this, a derivative-free optimization method, namely the Nelder–Mead (NM) simplex method 77, was employed. This method was first proposed by Spendley et al. 78 before being refined by Nelder and Mead 36 to the core form that has now been used for over half a century due to its many advantages. It can be considered a direct search method of steepest descent exploration, owing to its system of searching the factor space. The method makes use of a polyhedron of N + 1 vertices (N = number of input variables) that systematically reflects, expands, contracts and shrinks to explore the factor space for a minimum36,79. On every iteration the function values are taken at the vertices, the highest value is discarded and a new point is sought in the general direction of a negative gradient.

The NM method offers multiple advantages over derivative-based alternatives due to its robust approach to optimization. Its simplicity to implement and understand, without the need for any derivative information, makes it an attractive option for engineers. While derivative-based methods tend to be faster in returning a result79, they can also be vulnerable to noise in the function values, to which the NM method demonstrates a high tolerance. Additionally, the NM method thrives in more complex terrain of the objective function due to its capacity to rapidly adjust its shape, size and orientation based on local contours79. While it has faced some criticism from mathematicians because its convergence cannot generally be proven80, it has stood the test of time with engineers through its many examples of successful optimization, particularly in parameter estimation81,82.

In estimating the values of the kinetic and stoichiometric parameters as part of the calibration, the current task was defined as a multi-objective, constrained optimization problem with the purpose of minimizing the error between the target values of the objectives (effluent BOD, TN and TSS) and the model outputs for these parameters. In this way the maximum likelihood function can be maximized, as required to determine the optimal parameter estimates, by applying a minimizer to its negative56. The error distribution was considered normal and the termination criterion was set as no further significant change in parameters.
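
A minimal sketch of this estimation step is given below, assuming SciPy's implementation of the NM simplex method; simulate_effluent is a hypothetical placeholder for a GPS-X steady-state run, and, under the normal-error assumption, maximizing the likelihood reduces to minimizing the sum of squared errors:

```python
import numpy as np
from scipy.optimize import minimize

# Observed effluent concentrations (BOD, TN, TSS in mg/L) from the pilot plant51.
targets = np.array([28.4, 14.2, 38.7])

def simulate_effluent(theta):
    """Placeholder for a GPS-X run returning modelled effluent [BOD, TN, TSS]
    for candidate values of the influential parameters (X52, X53, X16, X12)."""
    x52, x53, x16, x12 = theta
    return np.array([40 * x52, 30 * x53 - 5 * x12, 45 * x52 + 5 * x16])

def objective(theta):
    # Under a normal error model, maximizing the likelihood is equivalent to
    # minimizing the sum of squared errors between simulated and target values.
    residuals = simulate_effluent(theta) - targets
    return np.sum(residuals**2)

theta0 = np.array([0.666, 0.54, 0.62, 0.375])  # hypothetical default values
result = minimize(objective, theta0, method='Nelder-Mead',
                  options={'xatol': 1e-4, 'fatol': 1e-4})
print(result.x)  # estimated parameter values
```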

Model validation

Model validation is a necessary part of model development. As defined by Schlesinger et al. 83, “model validation is the substantiation that a computerized model within its domain of applicability possesses a satisfactory range of accuracy consistent with the intended application of the model”. Put more simply, it seeks to assess whether a model is of an acceptable accuracy for its intended purpose84.

In the present work, data derived from the alternative DO regimes (0.5 mg/L and 4.5 mg/L) of the previous experiment were used, as shown in Table 2 (Ref.51). This was considered sufficient in terms of technical rigour because separate influent and effluent data were provided, conforming to the need for independent data to be used in validation85. A similar approach to calibration and validation has been used in previous work86. This also provided a means for validity to be assessed at two separate points, as opposed to the more traditional single-point validation85.

Table 2 Influent, effluent and operational characteristics of HySAF pilot plant in Rishikesh, India used for validation. Value range provided where present in literature51.

Uncertainty analysis

Following model calibration, an uncertainty analysis was carried out to quantify the uncertainty surrounding the determined values of parameters deemed to be influential48. For this a Monte Carlo simulation (MCS) was used. MCS is commonly used for this purpose due to its robustness, as it accounts for non-linearity in the model and for correlation where specified, while being more accessible in terms of mathematical intensity compared to alternative techniques87. Furthermore, it naturally provides a graphical representation of the output distribution, which can aid identification of skewed or non-normal distributions in the measured output87. Farrance and Frenkel 87 provide a detailed comparison between this approach and older standards such as the guide to the expression of uncertainty in measurement (GUM) method.

In applying the MCS, a probability distribution is first prescribed for the input parameters being investigated. A key source of uncertainty in the model output is a lack of information regarding the precise value that an input should hold. Where there is variability in this value, for example between WWTPs, the value can be represented by a probability distribution. When this distribution is known, the uncertainty can be reduced, as a more accurate estimate of the true value can be assigned in lieu of precise knowledge, while the uncertainty held in this estimation can be better evaluated.

Unfortunately, it is often the case that the probability distribution is unknown, and in place of such information expert judgement needs to be applied88. For example, where estimates can be made regarding the likely range of the parameter but no information is available regarding the distribution, a uniform distribution can be assigned. When information is available regarding the occurrence of values around the mean and experimental bias is known not to be influencing results, a normal distribution can be used88. A uniform PDF reflects an equal probability of obtaining a value anywhere between the defined upper and lower limits. This PDF is usually chosen when little is known of the distribution beyond its limits87, and it provides the most conservative estimate of uncertainty since it leads to the largest value.

Once a distribution has been assigned to each of the investigated model inputs, based on limits and shape chosen to ensure capture of the real value, pseudo-random samples are taken from within each of these distributions and the model is evaluated at these sampled input parameters. The corresponding functional output is then reported and can be represented graphically as the probability distribution function (PDF) of the investigated outputs. The larger the number of samples taken, the more accurately the standard uncertainty in the inputs can be represented and propagated through the system. However, as shown by Farrance and Frenkel87, the difference in variability between sample sizes reduces considerably at higher orders of magnitude: while a sample size of 1 × 103 was observed to return a similar estimation of the standard uncertainty in the parameter, a sample size of 1 × 105 offered improved numerical precision and consistency.

Following determination of the PDF, the required information can be calculated from the output data by way of simple mathematical procedures. The mean of the returned values, \(\overline{x }\), gives the calculated estimate of the parameter's true output value, while the standard deviation, \(S\), indicates its standard uncertainty87. Accurate calculation of \(S\) depends on the assigned shape of the parameter's PDF, with each shape having its own equation:

$$\text{Standard deviation for a uniform distribution}, {S}_{uni}= \frac{a}{\sqrt{3}},$$
(11)
$$\text{Standard deviation for a normal distribution},\, {S}_{norm}= \sqrt{\frac{\sum {({x}_{i}-\overline{x })}^{2}}{n-1}},$$
(12)
$${\text{Standard deviation for a triangular distribution}, S}_{tri}= \frac{a}{\sqrt{6}},$$
(13)

where \(a\) is the ± limit from \(\overline{x }\) or the half-width of the distribution. The appropriate PDF assignment for each investigated parameter will depend on the nature of the data in terms of its limits and where the values are likely to fall.

A coverage factor, \(k\), of 1.96 is used in order to derive the expanded uncertainty, \({U}_{e}\), from the combined standard uncertainty, \({U}_{c}\), within a 95% coverage interval as follows87:

$${U}_{e}=1.96\left({U}_{c}\right)\text{ for a }95\text{\% coverage interval}.$$
(14)
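
A minimal sketch of this procedure is given below, reusing the hypothetical simulate_effluent placeholder from the parameter-estimation sketch and the uniform literature-derived ranges later reported in Table 6:

```python
import numpy as np

def simulate_effluent(theta):
    """Placeholder for a GPS-X run returning effluent [BOD, TN, TSS]."""
    x52, x53, x16, x12 = theta
    return np.array([40 * x52, 30 * x53 - 5 * x12, 45 * x52 + 5 * x16])

rng = np.random.default_rng(42)
n = 100_000  # larger samples improve precision of the standard uncertainty87

# Uniform PDFs over literature-derived ranges for the influential parameters.
ranges = {'X52': (0.63, 0.69), 'X53': (0.52, 0.57),
          'X16': (0.23, 0.70), 'X12': (0.375, 0.48)}
samples = np.column_stack([rng.uniform(lo, hi, n) for lo, hi in ranges.values()])

# Propagate the samples through the model and summarise the output PDF.
bod = np.array([simulate_effluent(theta)[0] for theta in samples])
u_c = bod.std(ddof=1)  # combined standard uncertainty, S
U_e = 1.96 * u_c       # Eq. (14): expanded uncertainty, 95% coverage
print(f'BOD = {bod.mean():.1f} +/- {U_e:.2f} mg/L (95% coverage interval)')
```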

Results

Model calibration and validation

Sensitivity screening phase

The screening phase of the calibration process aimed to distinguish between influential and non-influential parameters. As shown in Fig. 2a, three parameters were suggested as influential on the BOD model: X52 (aerobic heterotrophic yield on soluble substrate), X16 (aerobic heterotrophic decay rate), and X53 (anoxic heterotrophic yield on soluble substrate), ranked by μ* scores in this order. Figure 2b displays the presence of a monotonic relationship between X52 and the model output, implying a strong correlation and a focal parameter for calibration. The dominant influence of this parameter is unsurprising as it reflects the critical role that ordinary heterotrophic organisms (OHOs) take in the degradation of waste organic matter by way of receiving the organic compound's electrons and utilizing the carbon for cell synthesis89. The importance of accurately portraying the actual proliferation of OHOs in both aerobic and anoxic environments of any given wastewater system has long been recognised90.

Figure 2

The 15 parameters of greatest influence on the BOD output as demonstrated by the MoM, displayed by (a) a horizontal bar chart with influence threshold 0.1 and (b) a Morris plot (σ/μ*).

The TN removal model was found to be sensitive to a greater number of parameters than the BOD removal model, with 6 parameters breaching the threshold influence (0.1) following normalization, as shown in Fig. 3a. Amongst the influential parameters, X53 and X52 induced the greatest sensitivity, in that order, with the former demonstrating the greatest linearity as shown in Fig. 3b. These two parameters have previously been identified as strongly influential on AS models 90, and their value as key parameters in IFAS models is also supported here. TN removal models of other bioreactors such as the membrane bioreactor (MBR) have also been found to be most sensitive to the anoxic heterotrophic yield parameter, which further emphasizes its critical role in effective simulation91. Other parameters suggested to be influential on TN removal included, in order, X12 (reduction factor for denitrification on nitrite-N), X15 (oxygen inhibition coefficient for denitrification), X61 (unbiodegradable fraction from cell decay) and X27 (maximum growth rate for nitrite oxidizer).

Figure 3

The 15 parameters of greatest influence on the TN output as demonstrated by the MoM, displayed by (a) a horizontal bar chart with influence threshold 0.1 and (b) a Morris plot (σ/μ*).

In terms of TSS removal, Fig. 4a shows four parameters were suggested as influential including X52, X61, X16 and X53 in this order. Figure 4b displays a strong asymmetry between the parameters identified as influential with X52 as a clear outlier. This is not surprising as heterotrophic bacteria are the dominant type in AS systems 92 and have been shown to positively influence settling velocities of AS flocs more than nitrifiers93.

Figure 4

The 15 parameters of greatest influence on the TSS output as demonstrated by the MoM, displayed by (a) a horizontal bar chart with influence threshold 0.1 and (b) a Morris plot (σ/μ*).

The MoM identified multiple parameters to be influential across more than one model, which warranted greater insight into their relative effect on each. Table 3 shows that X52 and X53 were influential on each of the three models, while X61 and X16 were influential on two. Besides identifying the parameters with a broader influence, Table 3 also identifies parameters with conflicting polarity, as was apparent for X52. In this instance, any adjustment of this parameter to reduce error between actual and simulated output during calibration of the BOD and TSS models would increase error for the TN model, and vice versa. As such, X52 was shown to have an inverse monotonic relationship with the TN model output, in contrast to its direct monotonic relationship with the other models. The polarity of the other influential parameters was not seen to differ across models.

Table 3 Models sensitive to parameters identified as influential by MoM with polarity.

Variance-based sensitivity analysis

In comparison to the MoM, the Sobol analysis was seen to be more selective of influential parameters at the given threshold (SI > 0.05). Table 4 identifies only four parameters that meet the influence criterion designated by Zhang et al.42. Of these parameters only X52 was observed to significantly influence all three models, while the remaining parameters only influenced one model each. In terms of variance, X52 accounted for 92.4% of the variance in the BOD model, 27.3% in the TN model and 92.8% in the TSS model. In each case this was mainly attributed to the first-order effects, as shown in Fig. 5. X53 and X12 were only influential on the TN model and accounted for 54.3% and 13.1% of the variance respectively, again due mainly to the first-order effects. Finally, X16 only had significant influence on the BOD model and accounted for 8.0% of the total variance, while its first-order effects accounted for only 5.0%.

Table 4 Parameters identified as influential by the Sobol analysis (> 0.05) with observed Sobol indices for first and total order effects.
Figure 5

Top 10 parameters ranked according to influence for (a) BOD model, (b) TN model, (c) TSS model with influence threshold displayed (0.05).

The Sobol analysis also identified a minimal presence of higher-order effects for each of the parameters. Despite the presence of multiple interactions between parameters, as shown by the connections in Fig. 6, these remained below the given threshold (SI < 0.05). This was supported by the absence of substantial white rings around each parameter that would indicate a greater influence of higher-order effects relative to the first order. This suggests that non-linearity is not a significant factor in the present model, which may be attributed to its steady-state nature, as well as the lack of wide factor variability ranges and additional recycle streams that have otherwise been proposed as causal reasons for increased detectability of parameter interactions94.

Figure 6

Second-order interactions and relative influence of the first and total order effects of investigated parameters on (a) BOD, (b) TN, and (c) TSS models.

What is also apparent from the Sobol analysis is the greater interactivity of factors in the TN model compared to the BOD model, as shown in Fig. 6. While no interactions were found to be significant in terms of the threshold, X12 was observed to have a relatively greater symmetry of influence across interacting parameters than the alternatives, including the more influential parameters. This may be due to its role in governing the prominent pathway for TN removal, which would be expected to exert broad influence95. In contrast, the BOD model demonstrates only one interaction of relative importance, relating the yield and decay of aerobic heterotrophs that are central to BOD removal96.

Parameter estimation for calibration

The Nelder–Mead algorithm was used to estimate appropriate values for the identified parameters of influence. Estimated values are reported in Table 5. The stoichiometric parameters, X52 and X53, required only minor adjustments from default, within ± 0.07 mgCOD/mgCOD for each. This was to be expected given the greater observed sensitivity of the model to these parameters, while still conforming to previously advised values determined experimentally97. While the kinetic parameters, X16 and X12, showed greater deviation from default, with ± 0.11 1/day and ± 0.09 (–) respectively, the disparity remained marginal. Previous work by Li et al. 98 found the physical properties of the biofilm to be of greater influence than kinetic parameters when modelling IFAS systems, which may explain the need for only minimal adjustment.

Table 5 Default and estimated values for identified influential parameters.

In contrast, Shaw et al. 99 were required to adjust two non-physical parameters beyond reasonable limits in their study (± 3.2 units) to sufficiently simulate simultaneous nitrification denitrification (SND) behaviour in an IFAS system. In their study, the anoxic oxygen half-saturation coefficient for heterotrophs and the maximum specific hydrolysis rate required considerable deviation in order to capture NO3 dynamics, which still could not be fully achieved. This highlights the difficulty of simulating nitrification dynamics effectively; previous work has stressed the importance of correctly characterising the nitrifiers when modelling IFAS systems98.

Other work has suggested that only minor adjustments to default parameters are necessary6,40,100. Hulsbeek et al. 6 suggested that any substantial deviation from default parameter values tends to be indicative of a misrepresentation of the actual operational parameters, which deserve particular attention. Work by Meijer et al. 40 demonstrated the greater importance of operational parameters compared to kinetic parameters in terms of influencing ASM outputs, while Schraa et al. 100 found successful calibration of an IFAS model in GPS-X to require only minor adjustments.

The use of the estimated values derived by the algorithm brought model outputs into good agreement with the pilot plant, as shown in Fig. 7. The model yielded effluent concentrations of 24.8 mg/L, 12.9 mg/L and 29.5 mg/L for BOD, TN and TSS respectively, compared with observed average effluent concentrations of the HySAF reported as 28.4 mg/L, 14.2 mg/L and 38.7 mg/L51. While the model had a tendency to underpredict the HySAF system, this was considered to be within an acceptable limit of deviation, at ± 3.6 mg/L, ± 1.3 mg/L, and ± 9.5 mg/L for BOD, TN and TSS respectively.

Figure 7

Achieved accuracy of the parameter estimation phase for (a) BOD model, (b) TN model, and (c) TSS model.

Model validation

The accuracy of the calibration was validated at alternative DO concentrations. As shown in Fig. 8, the validation showed the calibrated model to be in good agreement with reported results. In terms of TN removal, the model was found to offer adequate representation of pilot results at both the calibration and higher DO levels. Singh et al. 51 had found both these DO concentrations to maintain a similar removal efficiency, ensuring effluent TN concentrations within 10–20 mg/L. However, at the 0.5 mg/L DO concentration the model overpredicted the performance of the pilot plant, achieving an effluent concentration of 32.4 mg/L compared to the reported 40.4 mg/L. One reason for this may be the wider range of influent TN reported for the lower regime in the pilot study, fluctuating between 32 and 62 mg/L compared to 33–55 mg/L at the calibration level51.

Figure 8

Accuracy of model calibration and validation with regards to effluent BOD, TN and TSS.

With regards to the model's capacity to predict BOD removal of the pilot system, a similar trend was observed whereby the model was found to be in good agreement at the calibration and higher DO regimes, within a 3.6 mg/L difference. However, at the lower DO regime the model was again seen to overpredict the system's performance, by 6.8 mg/L. This was also observed to be true for TSS, with a more accurate representation of plant performance at the higher DO setting. Across DO regimes, TSS saw the greatest relative disparity compared to the other two effluent parameters.

The overprediction of the model at the lowest setting may reflect the difficulty of understanding IFAS behaviour at such low DO concentrations. Low DO availability coupled with high carbon loading is known to be a favourable environment for the proliferation of filamentous bacteria, which is detrimental to sludge settleability101,102. Organics removal would also be affected adversely under such conditions, with this process being primarily achieved through sludge wastage as well as through metabolism by aerobic heterotrophs103. In terms of TN removal, such a low oxygen level is substantially below the concentrations required to achieve nitrification in IFAS98,104. However, recent work has suggested that ammonia may still be reduced through unconventional pathways under such DO-limited conditions105,106. While this may be true, the rate of ammonia reduction in the present system remains overpredicted at this DO setting.

Overall, this validation has demonstrated that the model provides a capable simulation at DO levels suitable for IFAS system operation, but may not be appropriate for predicting behaviour under extreme DO stress.

Uncertainty analysis

Parameters of interest were the four parameters whose total order effect reached the influence threshold of 0.05 during the Sobol analyses42. These parameters were the two stoichiometric parameters, aerobic and anoxic heterotrophic yield on soluble substrate, as well as the two kinetic parameters, heterotrophic decay rate and reduction factor for denitrification on NO2.

In order to determine the standard uncertainty of each of the influential parameters, including the appropriate PDF that would form the basis of the uncertainty analysis, the literature was consulted as summarized in Table 6. Based on these literature values, the ranges of uncertainty for each parameter were taken as 0.63–0.69 mgCOD/mgCOD for X52, 0.52–0.57 mgCOD/mgCOD for X53, 0.23–0.7 d−1 for X16 and 0.375–0.48 for X12, assuming equal reduction factors for NO2 and NO3.

Table 6 Literature values for kinetic and stoichiometric parameters deemed influential (> 0.05).

Based on these derived ranges, the combined standard uncertainty could be calculated for each of the three models, as shown in Fig. 9. A uniform distribution was assumed for each parameter as there was no reason to believe the values would be more probable in any particular region within the defined limits87.

Figure 9

PDFs of combined standard uncertainty analysis and central value (solid line) with regards to (a) BOD model, (b) TN model and (c) TSS model.

Following this, the model delivered an effluent BOD concentration of 24.8 ± 1.31 mg/L (95% coverage interval) based on a combined standard uncertainty of 0.67 mg/L. The effluent TN concentration was reported as 12.9 ± 4.23 mg/L (95% coverage interval) based on a combined standard uncertainty of 2.16 mg/L. Finally, the effluent TSS concentration could be reported as 29.5 ± 0.64 mg/L (95% coverage interval).

The TN model was shown to carry the greatest uncertainty (± 4.23 mg/L), reflecting the sensitivity of the model to a broader range of both influential and non-influential parameters, as shown in Fig. 5b, each of which propagates a degree of uncertainty through the model. This uncertainty may be reduced further through deeper investigation of the relative contribution of biochemical processes from the attached and suspended colonies, which has been suggested as critical in IFAS models125. This will vary greatly from system to system, with the relative contribution of each likely to be governed by many factors including biofilm thickness, aerobic mixing, shear rate, MLSS concentration and temperature, to name a few126,127,128.

Conclusion

This work has demonstrated the plausibility of combining GSAs with the NM simplex algorithm as a means of calibrating a steady-state biological wastewater treatment model. Simulation of the investigated system while operated at different DO set points provided suitable validation, while also identifying limitations of the model to predict system behaviour under conditions of extreme DO stress.

Of the investigated parameters, only four were found to significantly influence the model including aerobic heterotrophic yield on soluble substrate, anoxic heterotrophic yield on soluble substrate, aerobic heterotrophic decay rate and reduction factor for denitrification on NO2. Of these, the stoichiometric parameters were shown to be most influential but in all cases influence was attributed mainly to the first order effects with no considerable collinearity detected.

Parameter estimation by the NM algorithm showed that only minor adjustments to the influential parameters were required for the model to predict actual system outputs with sufficient accuracy.

Finally, parameter uncertainty was observed to be minimal for the BOD and TSS models, however the TN model demonstrated greater uncertainty that may warrant further work to support the drawn conclusions.

It is hoped that the results of this calibration will inform the future development of steady-state IFAS models. The efficacy of this approach in calibrating dynamic WWT models should also be investigated.