Experimental sandbox tracer tests to characterize a two-facies aquifer via an ensemble smoother

Todaro, Valeria; D’Oria, Marco; Zanini, Andrea; Gómez-Hernández, J. Jaime; Tanda, Maria Giovanna

doi:10.1007/s10040-023-02662-1

Experimental sandbox tracer tests to characterize a two-facies aquifer via an ensemble smoother

Essais de traçage dans un bac à sable expérimental en vue de caractériser un aquifère à deux faciès grâce à un filtre d’ensemble

Ensayos experimentales con trazadores en un tanque de laboratorio para caracterizar un acuífero de dos facies mediante un suavizador de conjuntos

通过集合平滑法的表征双相水系含水层的试验砂箱示踪试验

Testes experimentais de traçadores em caixa de areia para caracterizar um aquífero de dois fácies por meio de um suavizador em conjunto

Paper
Open access
Published: 18 July 2023

Volume 31, pages 1665–1678, (2023)
Cite this article

Download PDF

You have full access to this open access article

Hydrogeology Journal Aims and scope Submit manuscript

Experimental sandbox tracer tests to characterize a two-facies aquifer via an ensemble smoother

Download PDF

Valeria Todaro ORCID: orcid.org/0000-0002-9313-6999¹,
Marco D’Oria¹,
Andrea Zanini¹,
J. Jaime Gómez-Hernández² &
…
Maria Giovanna Tanda¹

1271 Accesses
Explore all metrics

Abstract

Estimating aquifer properties and their spatial variability is the most challenging part of groundwater flow and transport simulations. In this work, an ensemble Kalman-based method, the ensemble smoother with multiple data assimilation (ES-MDA), is applied to infer the characteristics of a binary field by means of tracer test data collected in an experimental sandbox. Two different approaches are compared: the first one aims at estimating the hydraulic conductivity over the whole field assuming that the rest of the hydraulic and transport parameters are known by applying the standard ES-MDA method; the second one couples the ES-MDA with a truncated Gaussian model to simultaneously estimate the spatial distribution of two geological lithotypes and their main hydraulic and transport properties. Both procedures are tested following a fully parameterized approach and a pilot point approach. A synthetic case that mimics the sandbox experiment was developed to test the capability of the proposed methods and find out their optimal configurations to be used for the real case. The results show that the ES-MDA coupled with a truncated Gaussian model outperforms the standard ES-MDA and it reproduces well the binary field and the aquifer properties also in the presence of large measurement errors. The fully parametrized and pilot point approaches lead to comparable solutions, with less computation time required by the pilot point approach.

Résumé

L’estimation des propriétés d’un aquifère et de leurs variations dans l’espace est la partie la plus difficile des simulations relatives au flux et au transport dans les eaux souterraines. Dans le présent travail, une méthode basée sur l’ensemble de Kalman, le filtre d’ensemble avec assimilation de données multiples (ES-MDA), est appliqué pour déduire les caractéristiques d’un champ binaire au moyen des données d’un essai de traçage recueillies dans un bac à sable expérimental. Deux approches différentes sont comparées: la première vise une estimation de la conductivité hydraulique dans l'ensemble du champ, le reste des paramètres hydrauliques et de transfert étant supposé connu par l’application de la méthode standard ES-MDA; la seconde couple le ES-MDA avec un modèle Gaussien tronqué pour estimer dans le même temps la distribution spatiale des deux lithotypes géologiques et leurs principales propriétés d’écoulement et de transport. Les deux procédures sont testées en suivant une approche totalement paramétrée et une approche par point pilote. Un cas synthétique qui reproduit l’expérience du bac à sable a été développé pour tester la capacité des méthodes proposées et déterminer leurs configurations optimales à utiliser dans le cas réel. Les résultats montrent que le ES-MDA couplé avec un modèle Gaussien tronqué surpasse le standard ES-MDA et qu’il reproduit convenablement le champ binaire et les propriétés de l’aquifère, même en présence d’erreurs de mesure importantes. L’approche totalement paramétrés et des points pilote conduit à des solutions comparables, avec moins de temps de calcul requis par l’approche par le point pilote.

Resumen

La estimación de las propiedades de los acuíferos y su variabilidad espacial es el reto más importante en las simulaciones de flujo y transporte de aguas subterráneas. En este trabajo, se usa un método basado en el filtro de Kalman de conjuntos, el suavizador de conjuntos con asimilación repetida de los datos (ES-MDA), para inferir las características de un campo binario a partir de datos de ensayos de trazadores tomados en un tanque de laboratorio. Se comparan dos enfoques diferentes: el primero pretende estimar la conductividad hidráulica en todo el campo asumiendo que el resto de los parámetros hidráulicos y de transporte son conocidos aplicando el método ES-MDA estándar; el segundo acopla el ES-MDA con un modelo gaussiano truncado para estimar simultáneamente la distribución espacial de dos litologías geológicas y sus principales propiedades hidráulicas y de transporte. Ambos procedimientos se prueban siguiendo un enfoque totalmente parametrizado y un enfoque de puntos piloto. Se desarrolló un caso sintético que imita el experimento del tanque de laboratorio para probar el rendimiento de los métodos propuestos y determinar sus configuraciones óptimas para su uso en el caso real. Los resultados muestran que el ES-MDA acoplado con un modelo gaussiano truncado supera al ES-MDA estándar y reproduce bien el campo binario y las propiedades del acuífero incluso en presencia de importantes errores de medida. El enfoque totalmente parametrizado y el de puntos piloto conducen a soluciones comparables, con menos tiempo de cálculo requerido por el enfoque de puntos piloto.

摘要

估计含水层性质及其空间变异性是地下水流和运移模拟中最具挑战性的部分。本研究应用基于集合卡尔曼的方法, 即具有多重数据同化的集合平滑法 (ES-MDA), 通过在试验砂箱中采集示踪试验数据来推断双场的特征。比较了两种不同的方法: 第一种旨在通过应用标准ES-MDA方法, 假定其余的水力和运移参数已知, 估计整个场的水力传导率; 第二种则将ES-MDA与截断高斯模型耦合, 同时估计两种地质岩性的空间分布及其主要的水力和运移特性。两种程序都是通过完全参数化方法和试点方法进行测试的。开发了一个模仿砂箱试验的综合案例, 以测试所提出方法的性能并找到其用于真实案例的最佳配置。结果表明, ES-MDA与截断高斯模型耦合的方法优于标准ES-MDA, 即使存在大量测量误差, 也能很好地再现双场和含水层的特性。完全参数化和试点方法都可以得到可比较的解, 试点方法所需的计算时间更少。

Resumo

Estimar as propriedades dos aquíferos e sua variabilidade espacial é a parte mais desafiadora das simulações de fluxo e transporte de águas subterrâneas. Neste trabalho, um método baseado em filtro de Kalman em conjunto, o suavizados em conjunto com múltipla assimilação de dados (ES-MDA), é aplicado para inferir as características de um campo binário por meio de dados de teste de traçadores coletados em uma caixa de areia experimental. Duas abordagens diferentes são comparadas: a primeira visa estimar a condutividade hidráulica em todo o campo assumindo que o restante dos parâmetros hidráulicos e de transporte são conhecidos pela aplicação do método padrão ES-MDA; o segundo acopla o ES-MDA com um modelo gaussiano truncado para estimar simultaneamente a distribuição espacial de dois litotipos geológicos e suas principais propriedades hidráulicas e de transporte. Ambos os procedimentos são testados seguindo uma abordagem totalmente parametrizada e uma abordagem de ponto piloto. Um caso sintético que mimetiza o experimento em caixa de areia foi desenvolvido para testar a capacidade dos métodos propostos e descobrir suas configurações ótimas para serem usadas no caso real. Os resultados mostram que o ES-MDA acoplado a um modelo gaussiano truncado supera o ES-MDA padrão e reproduz bem o campo binário e as propriedades do aquífero também na presença de grandes erros de medição. A abordagem dos pontos piloto e totalmente parametrizados leva a soluções comparáveis, com menos tempo de computação exigido pela abordagem do ponto piloto.

Identification of non-Gaussian parameters in heterogeneous aquifers by a modified probability conditioning method through hydraulic-head assimilation

Article 13 October 2020

Joint inversion of physical and geochemical parameters in groundwater models by sequential ensemble-based optimal design

Article 21 February 2018

Impact of model complexity and multi-scale data integration on the estimation of hydrogeological parameters in a dual-porosity aquifer

Article 05 March 2018

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Aquifer characterization is fundamental for developing effective engineering strategies and applications such as the planning of groundwater extraction or recharge systems and the assessment of spatiotemporal evolution of subsurface contaminants. However, suitable methods of identifying aquifer hydraulic and transport properties are still under investigation within the scientific community. Many of the associated parameters cannot be estimated directly and need to be inferred through inverse modelling. Several approaches have been developed to address the inverse problem; extensive reviews have been carried out by Zimmerman et al. (1998), Vrugt et al. (2008) and Zhou et al. (2014).

In the past years, ensemble Kalman filter-based methods gained increasing interest due to their efficiency and flexibility in data assimilation for large nonlinear models. In fact, aquifer characterization by inverse modelling using gradient-based optimization approaches needs massive computational effort for moderately large systems. The ensemble Kalman filter-based methods save considerable computing time compared to full Bayesian approaches. Hendricks Franssen and Kinzelbach (2009) studied the ensemble Kalman filter (EnKF) with a Monte-Carlo type inverse modelling technique, and the sequential self-calibration method (Gómez-Hernández et al. 2003), for inverse modelling of groundwater flow systems, to conclude that the two methods give similar results; although, the EnKF computational cost is 80 times lower than that required by the sequential self-calibration method.

Since the introduction of the ensemble Kalman filter (EnKF) by Evensen (1994), it has been widely used for data assimilation and inverse modelling in several fields, including aquifer characterization. Chen and Zhang (2006) applied the EnKF to continuously update the hydraulic conductivity values in both two-dimensional (2D) and three-dimensional (3D) synthetic models by assimilating dynamic pressure head observations. Camporese et al. (2011) applied the EnKF to infer the spatial distribution of hydraulic conductivity from electrical resistivity tomography data of a 3D synthetic tracer test experiment. Tong et al. (2013) used the EnKF in a synthetic 2D aquifer to identify the hydraulic conductivity distribution by assimilating solute concentration measurements. Xu and Gómez-Hernández (2018) used the restart normal-score EnKF for the simultaneous identification of a contaminant source and the spatially variable hydraulic conductivity in an aquifer. The method has been applied in synthetic aquifers by assimilating in time piezometric heads and concentrations from observation wells.

Many variants of the EnKF have been developed over time such as the ensemble smoother (ES) proposed by van Leeuwen and Evensen (1996). Unlike the EnKF, which sequentially assimilates data over time, the ES incorporates all available information into a single global update step. Bailey and Baù (2012) used the ES to estimate spatially variable hydraulic conductivity within a synthetic 3D tilted v-shaped catchment system by assimilating water-table elevation and streamflow data. Crestani et al. (2013) compared the capabilities of the EnKF and the ES to estimate the hydraulic conductivity assimilating observed concentrations. The two approaches have been tested in a 2D synthetic aquifer where a tracer test is simulated. The authors conclude that EnKF always outperforms the ES due to iterative assimilations of information, which helps to handle nonlinear and non-Gaussian conditions. With the aim to improve the ES performance for highly nonlinear applications, Emerick and Reynolds (2012, 2013) proposed the ensemble smoother with multiple data assimilation (ES-MDA), which iteratively assimilates the same data multiple times. The good performance of ES-MDA for inverse modelling have been confirmed in different studies (Todaro et al. 2019, 2021, 2022; Lam et al. 2020; D’Oria et al. 2021; Godoy et al. 2022). Xu et al. (2021) compared the ensemble smoother with multiple data assimilation (ES-MDA) and the restart EnKF for the simultaneous identification of a contaminant source and hydraulic conductivity using both piezometric heads and concentrations on a synthetic aquifer. The results showed that the ES-MDA performs better than the restart EnKF when using enough iterations, needing almost the same computational time.

While these studies have demonstrated the capability of ensemble-based methods to determine aquifer parameters, their application to real sites is still limited due to the complexity of field data collection. Chen et al. (2013) employed the parameter space EnKF and some variants of ES to characterize the hydraulic conductivity field of an aquifer by assimilating experimental tracer data obtained from the Integrated Field Research Challenge site in U.S. Department of Energy's Hanford 300 Area. Crestani et al. (2015) inferred the vertical hydraulic conductivity distribution of an alluvial unconfined aquifer located in northern Italy by assimilating electrical resistivity tomography data through an EnKF method. Panzeri et al. (2015) coupled a modified ensemble Kalman filter algorithm with stochastic moment equations to estimate aquifer transmissivities by assimilating drawdown data recorded during cross-hole pumping tests in a heterogeneous alluvial aquifer located in Germany. Zovi et al. (2017) combined the normal score EnKF with multiple point geostatistics to retrieve the spatially distributed hydraulic conductivity and the storage coefficients of the same aquifer investigated in Crestani et al. (2015) by assimilating measured groundwater levels. Duan et al. (2022) integrated the transition probability geostatistics and the discrete cosine transform with the ES-MDA to map the hydraulic conductivity of a real karst aquifer using the data from a multiple cross-hole pumping test.

In this work, the ES-MDA is employed to infer the properties of a binary aquifer from concentration data obtained via an experimental tracer test. The data are collected in a laboratory sandbox that mimics a vertical cross-section of an unconfined aquifer. Glass beads of two different diameters reproduce a heterogeneous binary field, and fluorescein sodium salt is used as a tracer. The groundwater flow and transport processes are modeled with MODFLOW (Harbaugh 2005) and MT3DMS (Zheng and Wang 1999), respectively. The software package genES-MDA (Todaro et al. 2022) was used to apply the ES-MDA procedure.

Different approaches are tested to estimate the aquifer parameters. First, the binary pattern of the true field is assumed unknown, and the ES-MDA is applied to directly estimate the hydraulic conductivity field. Another common approach to model subsurface characteristics is the conceptualization of the field using lithotypes or facies; constant properties are assigned to each lithotype. Kalman-based methods are optimal when working with Gaussian distributed parameters, and they are not suitable for estimating categorical variables such as geological lithotypes. To handle the categorical parameter estimation through ES-MDA, it can be coupled with a truncated Gaussian model (Matheron et al. 1987). The main key of the truncated Gaussian model is the definition of the proportion of facies and their spatial distribution. The application of this approach for inverse modelling was first introduced by Wen et al. (2002) as an extension of the self-calibrating approach (Capilla et al. 1998; Wen et al. 1999; Franssen and Gómez-Hernández 2002). Then, the truncated Gaussian models were linked to ensemble-based methods in history-matching problems (Liu and Oliver 2005; Zhao et al. 2008; Agbalaka and Oliver 2008; Zhang et al. 2015), but they have only been applied to synthetic cases and considering the properties of the facies known. In this work, the ES-MDA is coupled with a truncated Gaussian model (ES-MDA-T) to simultaneously estimate the pattern of the aquifer and the hydraulic and transport parameters of each facies.

Both ES-MDA and ES-MDA-T are applied using a fully parameterized approach and a pilot point approach. In the fully parameterized approach, parameters are estimated at each grid cell or element in the model domain. This approach can provide a high level of detail and accuracy, but it can be computationally intensive. In order to reduce the computational demands, a pilot point approach (RamaRao et al. 1995; Gómez-Hernández et al. 1997), which is commonly used for inverse modeling in groundwater (see, e.g. Alcolea et al. 2006; Li et al. 2013; Cao et al. 2018; Keller et al. 2021), can be employed. The pilot point approach involves selecting a few key locations in the model domain where parameters are estimated, and then, using interpolation methods, parameter values at other locations are inferred.

To validate the methods proposed, a synthetic case that reproduces the sandbox experiment is developed and used to test different configurations of the inverse procedures. Then the experimental tracer test data are used to infer the characteristics of the sandbox field.

The paper is organized as follows: the next section presents the ES-MDA procedure and its link with a truncated Gaussian model; section ‘Description of the experiment’ is followed by a summary of all the tests performed using synthetic and experimental data, and the results; and lastly, there is section ‘Discussion and conclusions’.

Methods

The methods applied in this work to solve the inverse problem are based on the ensemble smoother with multiple data assimilation (ES-MDA) technique, in some cases coupled with a Gaussian truncated model. The ES-MDA is a stochastic iterative approach that allows the estimation of a set of unknown parameters and their uncertainty from available observations on the state of the system. The description of the ES-MDA scheme is presented in detail by Emerick and Reynolds (2013), Evensen (2018) and Todaro et al. (2022); here, an overview of the method is given. Then, the link between the truncated Gaussian model and the ES-MDA is presented.

ES-MDA

The ES-MDA is used to solve the inverse problem aimed at estimating the aquifer parameters assimilating observed tracer breakthrough curve data. The vector $\mathbf{X}$ $\in {\mathbb{R}}^{{N}_{\mathrm{P}}}$ contains the ${N}_{\mathrm{P}}$ aquifer parameters to be estimated, while the vector of observations $\mathbf{D}$ $\in {\mathbb{R}}^{m}$ contains the $m$ tracer concentrations measured at sparse sampling locations in the aquifer at different times. The method requires the relationship between parameters and observations to be known; this is given by a forward model that simulates the flow (MODFLOW) and transport processes (MT3DMS). The procedure consists of an initialization phase and then proceeds with an iterative loop made up of two steps.

Initialization phase

The first ingredient of the ES-MDA is the definition of an initial ensemble of parameters. The size of the ensemble, ${N}_{\mathrm{e}}$, should be large enough to be a statistical representation of the problem at hand and as small as possible with the aim to limit the computational burden. The characteristics of the initial ensemble of parameters allow one to take into account prior information, when available.

Another preliminary step is to specify an ensemble of measurement errors ${\varvec{\upvarepsilon}}\in {\mathbb{R}}^{m \times {N}_{\mathrm{e}}}$, which are usually assumed to be uncorrelated and drawn from a Gaussian distribution with zero mean and given standard deviation. The ES-MDA also requires defining a priori the number of iterations to be performed, $N$, and a vector of coefficients ${{\varvec{\upalpha}}=(\alpha }_{1}, {\alpha }_{2}, \dots , {\alpha }_{N})$ that apply to the measurement errors and control the parameter change from one step to another. A gradual decrease of ${\varvec{\upalpha}}$ during the iterative process improves the performance of the method as it gradually reduces the measurement errors helping to avoid overfitting in the first updates. Several schemes can be used to define the set of α, but they must satisfy the condition:

$$\sum\nolimits_{i=1}^{N}\frac{1}{{\alpha }_{i}}=1$$

(1)

After the initialization step, the procedure follows with two iterative steps.

Forecast step

At each iteration $i$, the state of the system coinciding with the available observations, ${\mathbf{Y}}_{j,i}$, are obtained for each realization $j$ of the ensemble of parameters, ${\mathbf{X}}_{j,i}$, by means of the forward model:

$${\mathbf{Y}}_{j,i}=\mathrm{g}\left({\mathbf{X}}_{j,i}\right)$$

(2)

where the operator $\mathrm{g}\left(\bullet \right)$ includes the forward model run and the functions used to extract the predictions at the same spacetime locations where observations were collected.

Update step

During the update step, the ensemble of parameters is updated following the equation:

$${\mathbf{X}}_{i+1}={\mathbf{X}}_{i}+\frac{{\mathbf{C}}_{\mathbf{X}\mathbf{Y}}^{i}}{{\mathbf{C}}_{\mathbf{Y}\mathbf{Y}}^{i}+{\alpha }_{i}\mathbf{R}}\times \left(\mathbf{D}+\sqrt{{\alpha }_{i}}{\varvec{\upvarepsilon}}-{\mathbf{Y}}_{i}\right)$$

(3)

where ${\mathbf{C}}_{\mathbf{X}\mathbf{Y}}^{i}\in {\mathbb{R}}^{{N}_{\mathrm{P}}\times m}$, ${\mathbf{C}}_{\mathbf{Y}\mathbf{Y}}^{i}\in {\mathbb{R}}^{m\times m}$ and $\mathbf{R}\in {\mathbb{R}}^{m\times m}$ are the cross-covariance between parameters and predictions, the auto-covariance of the predictions and the auto-covariance of the measurement errors, respectively. $\mathbf{R}$ is a diagonal matrix containing the error variances, ${\mathbf{C}}_{\mathbf{X}\mathbf{Y}}^{i}$ and ${\mathbf{C}}_{\mathbf{Y}\mathbf{Y}}^{i}$ are computed from the parameters and predictions ensembles as:

$${\mathbf{C}}_{\mathbf{X}\mathbf{Y}}^{i}=\frac{1}{{N}_{\mathrm{e}} -1}{\sum }_{j=1}^{{N}_{\mathrm{e}}}\left({\mathbf{X}}_{j,i}-{\overline{\mathbf{X}} }_{i}\right){\left({\mathbf{Y}}_{j,i}-{\overline{\mathbf{Y}} }_{i}\right)}^{\mathrm{T}}$$

(4)

$${\mathbf{C}}_{\mathbf{Y}\mathbf{Y}}^{i}=\frac{1}{{N}_{\mathrm{e}} -1}{\sum }_{j=1}^{{N}_{\mathrm{e}}}\left({\mathbf{Y}}_{j,i}-{\overline{\mathbf{Y}} }_{i}\right){\left({\mathbf{Y}}_{j,i}-{\overline{\mathbf{Y}} }_{i}\right)}^{\mathrm{T}}$$

(5)

where ${\overline{\mathbf{X}} }_{i}$ and ${\overline{\mathbf{Y}} }_{i}$ are the ensemble means of parameters and predictions, respectively.

The procedure repeats until the last iteration after making ${{\mathbf{X}}_{i}=\mathbf{X}}_{i-1}$.

To avoid the appearance of negative values during the update phase, which can be inconsistent for some types of parameters, a space transformation can be applied. In this study, the ensemble of parameters is transformed in the logarithmic space before the update and back-transformed into its physical space after the update.

The total number of runs of the forward model, ${n}_{t}$, required by ES-MDA depends on the ensemble size and the number of iterations: ${n}_{t}=N\cdot {N}_{\mathrm{e}}$. Therefore, it is crucial to use a small ensemble and at the same time avoid undersampling problems, leading to divergence and the appearance of long spurious correlations. Covariance localization approaches can be applied to mitigate the appearance of long-range spurious correlations and to increase the rank of the covariance matrices, which are rank deficient when the number of parameters is higher than the ensemble size (Hamill et al. 2001; Anderson 2007; Chen and Oliver 2010). Covariance localization is applied by element-wise multiplication of the original covariance matrices with chosen correlation matrices reducing the correlations between points for increasing distances and cutting off spurious long-range correlations beyond a specific distance. In this study, the Gaspari and Cohn (1999) fifth-order correlation function is used to compute the elements of the correlation matrices. Furthermore, to avoid overshooting, a linear relaxation can be applied to the ensemble of parameters at the end of each update step:

$${\widehat{\mathbf{X}}}_{i+1}=(1-w){\mathbf{X}}_{i+1}+w{\mathbf{X}}_{i}$$

(6)

where $w$ is a relaxation coefficient between 0 and 1. The localization distance and the coefficient of the linear relaxation are important tuning parameters in the ES-MDA algorithm that ensure accurate and stable estimates of the state variables. The localization distance could be chosen based on prior information or empirical estimates of the spatial correlation of the system being modeled, such as through variogram analysis, while the coefficient of the linear relaxation should be chosen so as to prevent overcorrection and damping of the ensemble spread. In this study, a combination of a trial-and-error approach and prior knowledge was used to choose these parameters.

The ES-MDA-T: linking the ES-MDA and a truncated Gaussian model

In this paper, the ES-MDA is coupled with the truncated Gaussian model with the aim of characterizing a binary field. The spatial distribution of the two facies and their main properties are simultaneously estimated. The proportion of the two lithotypes is assumed known. The truncated Gaussian model consists of thresholding a Gaussian random function using a threshold defined to match the proportions of the facies.

In this case, the vector of unknown parameters is $\mathbf{X}=\left({\mathbf{P}}_{1},{\mathbf{P}}_{2}, \mathbf{F}\right)$, where ${\mathbf{P}}_{1}$ and ${\mathbf{P}}_{2}$ are the properties of the two facies to be estimated (e.g. hydraulic conductivities, longitudinal dispersivities of the two facies, etc.), while $\mathbf{F}=({F}_{1}, {F}_{2}, \dots , {F}_{{N}_{\mathrm{C}}})$ is the vector containing the categorical variables for each cell of the discretized domain. Therefore, the number of unknown parameters ${N}_{\mathrm{P}}$ is equal to the number of aquifer parameters to be estimated (for the two facies) plus the total number of cells of the model grid (${N}_{\mathrm{C}}$), corresponding to the categorical variables. The categorical variable can take the values 1 or 2. The cells with categorical value 1 assume the properties ${\mathbf{P}}_{1}$, while the cells with categorical value is 2 assume properties ${\mathbf{P}}_{2}$. The vector of observations $\mathbf{D}$ is the same as in the previous section.

This procedure allows for estimating categorical variables using the ES-MDA. The modification to the ES-MDA loop to incorporate the truncated Gaussian model into the process is described in the following.

Initialization phase

The initial ensemble of parameters is generated using random values drawn from a Gaussian distribution for the realizations of the aquifer parameters (${\mathbf{P}}_{1}$ and ${\mathbf{P}}_{2}$) and random Gaussian fields for the definition of the spatial distribution of the facies; let this initial ensemble of Gaussian random fields be called $\widetilde{\mathbf{F}}=({\widetilde{F}}_{1}, {\widetilde{F}}_{2}, \dots , {\widetilde{F}}_{{N}_{\mathrm{C}}})$. The proportion between facies must be defined a priori on the basis of available information and expert knowledge. The ensemble of measurement errors and the vector of coefficients ${\varvec{\upalpha}}$ are defined as described in the previous section. The following iterative loop is the same as the standard ES-MDA method, but an additional step named the “truncation step” is introduced before the forecast step.

Truncation step

The truncation step consists of thresholding the Gaussian field to obtain a binary field; this step allows one to transform the continuous variables estimated by ES-MDA into categorical variables. At each iteration $i,$ and for each realization of parameters $j$, the categorical field ${\mathbf{F}}_{i,j}$ is obtained by truncating the Gaussian field ${\widetilde{\mathbf{F}}}_{i,j}$ using as threshold the percentile corresponding to the facies proportion. Then, the continuous Gaussian field is transformed into a categorical one by replacing each value of ${\widetilde{\mathbf{F}}}_{i,j}$ with 1, if it is below or equal to the threshold, and 2 otherwise. It must be noted that the threshold is different for each realization and it changes at each iteration in such a way as to guarantee the selected proportion between the two facies, which is fixed. Figure 1 shows an example of a Gaussian field and the associated binary field after truncation.

Forecast step

At each iteration $i$, the predictions ${\mathbf{Y}}_{j,i}$ are obtained for each realization $j$ of the ensemble of parameters, ${\mathbf{X}}_{j,i}=\left({\mathbf{P}}_{1},{\mathbf{P}}_{2}, \mathbf{F}\right)$,

$${\mathbf{Y}}_{j,i}=\mathrm{g}\left({\mathbf{X}}_{j,i}\right)$$

(7)

At each cell of the grid, ${\mathbf{P}}_{1}$ parameters are assigned if the categorical value of the cell is 1, and ${\mathbf{P}}_{2}$ are assigned if the categorical value of the cell is 2.

Update step

During the update step, the ensemble of parameters, ${\widetilde{\mathbf{X}}}_{j,i}=\left({\mathbf{P}}_{1},{\mathbf{P}}_{2}, \widetilde{\mathbf{F}}\right)$ is updated in the continuous space:

$${\widetilde{\mathbf{X}}}_{i+1}={\widetilde{\mathbf{X}}}_{i}+\frac{{\mathbf{C}}_{\mathbf{X}\mathbf{Y}}^{i}}{{\mathbf{C}}_{\mathbf{Y}\mathbf{Y}}^{i}+{\alpha }_{i}\mathbf{R}}\times \left(\mathbf{D}+\sqrt{{\alpha }_{i}}{\varvec{\upvarepsilon}}-{\mathbf{Y}}_{i}\right)$$

(8)

The key difference with the standard implementation is that the updates are applied to the underlying continuous Gaussian fields and not to the conductivity fields, which, in this case, are binary. Then the procedure repeats from the truncation step until the end of the iterative process.

Fully-parameterized and pilot point approaches

The ES-MDA and the ES-MDA-T inverse procedures are applied following a fully parameterized approach and a pilot point approach. In the first case, the aquifer parameters (ES-MDA) and the categorical variables (ES-MDA-T) are estimated at each cell of the grid of the numerical model. In the second case, aquifer parameters and categorical variables are estimated at some pilot points and then interpolated into the rest of the domain. The number and location of pilot points have to be chosen in order to ensure that they capture the relevant heterogeneity of the sandbox field. In this work, the pilot points are evenly distributed in space with a distance approximately equal to five times the spatial resolution of the model grid. After each update step, the full map of parameters is obtained by assigning to each cell of the model grid the value of the closest pilot point. This type of interpolation will result in a blocky distribution of the conductivities.

Description of the experiment

Sandbox experiment

The tracer test used to collect the observations to solve the inverse problem is performed in a laboratory sandbox set up in the hydraulic laboratory of the University of Parma, Italy (Fig. 2). The experimental device mimics a vertical cross-section of an unconfined heterogeneous aquifer and the device has been used to develop several laboratory tests (Citarella et al. 2015; Cupola et al. 2015; Chen et al. 2018, 2021, 2023; Todaro et al. 2021). The sandbox has a width of 120 cm, a height of 73 cm, and a thickness of 14 cm; it is made of polymethyl methacrylate (PMMA) plates with a thickness of 2 cm for the lateral sides and 3 cm for the bottom lid. The sandbox is divided into three parts along the longitudinal direction: an upstream tank, a central chamber that contains the porous medium and a downstream tank. The flow into the experimental device is governed by the water levels at the upstream and downstream tanks, which are equal to 62.5 and 61 cm above the horizontal bottom of the tank, respectively. The porous media consists of glass beads with two different diameters of about 1 and 4 mm, which reproduce a binary field. Fluorescein sodium salt is employed as the conservative tracer. The PMMA walls of the device allow one to collect concentration data by interpreting pictures taken during the experiment. The picture RGB values are converted to concentrations through an image processing technique (Citarella et al. 2015).

At the beginning of the experiment (initial condition) the porous medium within the sandbox had a homogeneous concentration of fluorescein sodium salt of 25 mg/L. The tracer test performed had a duration of 4,000 s; during this time the fluorescein concentration progressively decreases within the sandbox as clean water enters the system. The experiment ended when tracer concentration was zero everywhere. Pictures are collected with a time step of 5 s and converted in concentration data at each pixel of the images. Breakthrough curves recorded at 64 monitoring points with a discretization time of 75 s are used as observations in the inverse procedure. For example, Fig. 3 shows the concentration fields obtained from the analysis of the images in four instants from the beginning of the test.

During the experiment, the flow stopped accidentally for about 855 s after 985 s since the start of the test. This was likely caused by a clogged drain in the downstream tank leading to a rise in the downstream water level and resulting in no gradient in the sandbox during the clogging period. The model that simulates the experiment also takes into account the period of no-flow considering a transient boundary condition downstream. Groundwater flow and mass transport were modeled with MODFLOW 2005 and with MT3DMS, respectively. The domain is discretized into a uniform grid of 1-cm resolution (70 layers, 97 columns and 1 row).

Synthetic experiment

Before applying the different methods to the sandbox experiment, a synthetic case was performed with the aim of evaluating the capability of the proposed approaches to infer the aquifer characteristics and find the optimal configuration of the inverse algorithm. The synthetic case mimics the sandbox experiment; the reference hydraulic conductivity field is depicted in Fig. 4. It is a binary field characterized by two different facies, named Facies 1 and Facies 2. The same numerical model developed to simulate the experimental test is used as a forward model; the reference solution is obtained using the parameters reported in Table 1. The observed concentrations used for the inverse modeling are collected at the same spacetime locations considered for the experimental data: 64 breakthrough curves discretized in 54 time steps.

Table 1 Transport and hydraulic parameters of the numerical model used to set up the synthetic case study. HK is the horizontal hydraulic conductivity, VK is the vertical hydraulic conductivity, n is the porosity, Ss is the specific storage coefficient, α_L is the longitudinal dispersivity and α_T is the transverse dispersivity

Full size table

Applications

Several tests were performed to compare the different approaches and the different settings of the ES-MDA algorithm. The analyses were initially performed using the dataset of the synthetic experiment and then the laboratory data. For all tests, the observations used in the inverse modeling are the breakthrough curves collected at the 64 monitoring points depicted in Fig. 4 and discretized in 54 time steps (the number of observations m is 3,456). The first analyses employ the ES-MDA to estimate directly the hydraulic conductivity field (${\mathrm{HK}}_{1}$, ${\mathrm{HK}}_{2}$, …, ${\mathrm{HK}}_{{N}_{\mathrm{P}}}$), assuming the other parameters are known. Then, the ES-MDA-T method is used incorporating the extra information that the medium is binary and, therefore, it focuses on identifying the spatial distribution of the two facies (${F}_{1}$, ${F}_{2}$, …, ${F}_{{N}_{\mathrm{C}}}$) and five aquifer parameters: the hydraulic conductivity (HK_F1 and HK_F2) and longitudinal transport dispersivity (α_L,F1 and α_L,F2) of the two facies, as well as the ratio between the transversal and longitudinal dispersivity (α_T/α_L), which is equal for the two facies. The porosity is fixed and homogeneous over the entire domain, as it is determined by the packing method rather than the size of the glass beads. Previous studies demonstrated that the porosity of uniformly random-packed spherical beads is about 0.37 (Beavers et al. 1973; Citarella et al. 2015). The proportion between the two facies is assumed known for all tests and equal to 76% for Facies 1 and 24% for Facies 2.

Both the ES-MDA and the ES-MDA-T methods are applied using a fully parameterized approach and a pilot point method aimed at reducing the number of unknown parameters. For the fully parameterized approach, the parameters are estimated at each cell of the model grid (${N}_{\mathrm{P}}$ = 6,790 for the ES-MDA, ${N}_{\mathrm{P}}$ = 6,795 for the ES-MDA-T); for the pilot point method, the parameters are estimated at 266 points uniformly distributed over the field (${N}_{\mathrm{P}}$ = 266 for ES-MDA, ${N}_{\mathrm{P}}$ = 271 for ES-MDA-T). The ensemble size considered (number of realizations of parameters) is 1,000 for the fully parametrized approach and 100 for the pilot point one.

For all tests, the measurement error ε is normally distributed with zero mean and variance 5⋅10^–2 mg/L. ES-MDA and ES-MDA-T are run with six iterations and decreasing α = [364; 121.3; 40.4; 13.5; 4.5; 1.5]. Covariance localization is applied considering the aforementioned tapering functions that gradually reduce the correlations based on the spatial distance between parameters and predictions and between predictions themselves. The covariance matrices are modified by applying scale factors from 1 (no-correction) to 0 (zero correlation) for distances from 0 to 120 cm. The linear relaxation is applied with the coefficient w = 0.2 for the synthetic data and with w = 0.25 for the experimental data. The specific configuration of each test and the results are presented in the following.

Synthetic case study

Test 1.1: ES-MDA for estimation of the hydraulic conductivity field following a fully parameterized approach

The ES-MDA is applied for the estimation of the hydraulic conductivity field in each cell of the model domain considering the remaining hydraulic and transport parameters are known. Two tests were performed starting from different types of an initial ensemble of parameters, all other variables being equal. The size of the ensemble is 1,000 for both tests. The first test (test 1.1A) uses homogeneous fields as the initial ensemble of parameters; the hydraulic conductivity of each realization is constant and selected randomly from a uniform distribution, HK ∈ U[0.1,9]. For the second test (test 1.1B), the initial ensemble of parameters is made up of Gaussian fields generated using isotropic exponential variograms with random parameters: the mean μ, variance σ and range of correlation h are drawn from the following uniform distributions: μ ∈ U[0.5,0.7], σ ∈ U[60,365], h ∈ U[0.05,2]. Figure 5 shows some realizations of the initial ensembles of parameters and the results of the inverse procedure for both tests. The results refer to the mean field obtained from the ensemble at the last iteration of the ES-MDA procedure. A map of the coefficient of variations is also reported. The RMSE between the observed and predicted concentrations is 2.7 and 2.1 mg/L for test 1.1A and test 1.1B, respectively.

Test 1.2: ES-MDA for estimation of the hydraulic conductivity using a pilot point approach

The ES-MDA is applied for the estimation of the hydraulic conductivity field at 266 pilot points (Fig. 6). The ensemble size is 100; the initial realizations of parameters are a subset of the ensemble used in test 1.1B, from which the values at the pilot point locations were selected.

Figure 6 shows the mean field obtained from the ensemble at the last iteration and the map of the coefficient of variation computed from the ensemble. The RMSE between observed and predicted concentrations is 2.1 mg/L.

Test 1.3 ES-MDA-T for characterization of the binary field following a fully parameterized approach

The ES-MDA is linked with the truncated Gaussian model to simultaneously estimate the spatial distribution of the two facies and their main properties. In test 1.3, the fully parameterized approach is adopted. The ensemble size is 1,000 and the initial realizations of the hydraulic and transport parameters of the two facies are drawn from Gaussian distributions with different mean and variance defined as follows: HK_F1 ∈ $\mathcal{N}$[0.7,0.01]; HK_F2 ∈ $\mathcal{N}$[7,1]; α_L,F1 ∈ $\mathcal{N}$[0.1,4·10^–3]; α_L,F2 ∈ $\mathcal{N}$[0.2,4·10^–3]; α_T /α_L ∈ $\mathcal{N}$[0.05, 1·10^–3]. The initial realizations of the field $\widetilde{\mathbf{F}}$ are Gaussian random fields with zero mean and standard deviation of 1 generated using isotropic Gaussian variograms with correlation range randomly selected from a uniform distribution h ∈ U[10,60].

Figure 7 shows the solution in terms of the probability field derived from the ensemble of realizations. Each cell is assigned a value between 0 and 1, which represents the probability of that cell being associated with Facies 2. Since the field is binary, values close to 1 indicate a high probability of having Facies 2, while values close to 0 indicate a high probability of having Facies 1. For most cells, the true categorical variable is estimated in almost all realizations; the lowest agreement between the ensemble realizations, denoted by probabilities around 0.5, is found at the interface between the two materials. Table 2 summarizes the actual and estimated hydraulic and transport parameters with their uncertainty; the approach shows very good performance in reproducing all the properties of the two facies. The RMSE between the observed and predicted concentrations is equal to 1.53 mg/L.

Table 2 Test 1.3: Actual and estimated parameters of the two facies. The ensemble mean and the 95% uncertainty interval are reported

Full size table

Test 1.4 ES-MDA-T for characterization of the binary field using the pilot point approach

Test 1.3 is repeated using the pilot point method. The initial ensemble of parameters is made up of 100 realizations that are a subset of the initial ensemble of test 1.3: the values at the pilot point locations are extracted from the fully parametrized field. Figure 8 shows the resulting probability field. Most of the true categorical parameters are well reproduced by all the ensemble realizations in almost the whole field. The hydraulic and transport parameters of the two facies are reported in Table 3. The RMSE between the observations and predictions is equal to 2.2 mg/L.

Table 3 Test 1.4: Actual and estimated parameters of the two facies. The ensemble mean and the 95% uncertainty interval are reported

Full size table

Experimental case study

The results obtained from the synthetic case analyses pointed out that the best performance is achieved using the ES-MDA-T approach. Therefore, tests performed using data from the sandbox experiment only consider the approach that links ES-MDA with the truncated Gaussian model.

Test 2.1: ES-MDA-T for characterization of the binary field following a fully parameterized approach

The ES-MDA-T method is applied for the simultaneous estimation of the spatial distribution of the two facies and their properties using the experimental data. The same settings and ensembles of the synthetic test 1.3 are used. Figure 9 shows the estimated probability field. The estimated hydraulic and transport parameters with their uncertainty are reported in Table 4. The RMSE between the observed and predicted concentrations is equal to 3.2 mg/L.

Table 4 Test 2.1: Estimated parameters of the two facies. The ensemble mean and the 95% uncertainty interval are reported

Full size table

Test 2.2: ES-MDA-T for characterization of the binary field using a pilot point approach

In the second experimental case study, the same configuration of test 1.4 is employed. The inverse problem is solved by means of the pilot point method. The results are depicted in Fig. 10; only a portion of the field is well reproduced. In particular, the ES-MDA-T approach fails to correctly estimate the actual categorical variable for the upper part of the field where no observation data are available. Table 5 reports the estimated hydraulic and transport parameters with their uncertainty. The RMSE between the observed and predicted concentrations is equal to 2.9 mg/L.

Table 5 Test 2.2: Estimated parameters of the two facies. The ensemble mean and the 95% uncertainty interval are reported

Full size table

Test 2.3: ES-MDA-T for characterization of a three-facies field following a parameterized approach

The third test is performed considering that the experimental field is made of three materials. This follows the assumption that there is a mixing zone at the interface between the two facies made up of 1 mm or 4 mm diameter glass beads. The mixing zone, named Facies 3, has different characteristics from Facies 1 and Facies 2. The ES-MD-T is applied to estimate the spatial distribution of the three facies and their properties. The vector of unknown parameters is $\mathbf{X}=\left({\mathbf{P}}_{1},{\mathbf{P}}_{2}, {\mathbf{P}}_{3}, \mathbf{F}\right)$, where ${\mathbf{P}}_{1}$, ${\mathbf{P}}_{2}$ and ${\mathbf{P}}_{3}$ are the vectors containing the parameters of the three facies, $\mathbf{F}$ is the vector of categorical variables for each cell of the discretized domain. The categorical variable can take the values 1, 2 or 3. The cells with categorical value 1 assume the properties ${\mathbf{P}}_{1}$, the cells with categorical value 2 assume the properties ${\mathbf{P}}_{2}$, and when the value is 3, the parameters ${\mathbf{P}}_{3}$ are used. The vector of observations $\mathbf{D}$ is the same as the previous experimental tests. The truncation of the Gaussian field is performed defining two thresholds that match the proportions between the three facies. It is assumed that the 75% of the field is occupied by Facies 1, 17% by Facies 2 and 8% by Facies 3. The two thresholds are equal to the 75th-percentile (Thr₁) and 83th-percentile (Thr₂) of the values of the Gaussian field to be truncated. The continuous variables of the Gaussian function are transformed into categorical variables replacing each value with 1 if it is below Thr₁, 2 if it is above Thr₂ and 3 otherwise. The same settings and initial ensemble of test 2.1 are employed, but two additional parameters are estimated: the hydraulic conductivity (HK_F3) and longitudinal dispersivity (α_L,F3) of Facies 3. The initial ensemble of the properties of Facies 3 are drawn from Gaussian distributions with different mean and variance defined as follows: HK_F3 ∈ $\mathcal{N}$[5, 0.5] and α_L,F3 ∈ $\mathcal{N}$[0.4,0.05].

Figure 11 reports the results of the inverse procedure in terms of the best estimate represented in each cell by the mode of the ensemble of parameters. Table 6 reports the estimated hydraulic and transport parameters with their uncertainty for the three facies. The RMSE between the observed and predicted concentrations is 3.2 mg/L.

Table 6 Test 1.3: Estimated parameters of the three facies. The ensemble mean and the 95% uncertainty interval are reported

Full size table

Discussion and conclusions

The ensemble smoother with multiple data assimilation was applied to characterize a binary aquifer by means of experimental tracer test data. First, the ES-MDA was used to infer the hydraulic conductivity field unaware that it was binary and assuming the rest of the hydraulic and transport parameters are known. Then, the ES-MDA was linked with a truncated Gaussian model for the simultaneous estimation of categorical variables, which reproduce the two-facies field, and the properties of each facies. A fully parameterized approach and a pilot point approach were considered in the analysis. Initially, a synthetic case study that mimics the laboratory experiment was developed to test the capability of the proposed procedures. It is noteworthy that the observation errors assumed for the synthetic case are large and comparable with the experimental ones used for the laboratory case study. Test 1.1 aimed at the estimation of the hydraulic conductivity field using a fully parameterized approach by assimilating observed breakthrough curves. The effect of the characteristics of the initial ensemble of parameters was investigated: test 1.1A was performed considering an initial ensemble consisting of random homogeneous fields, while test 1.1B considers Gaussian fields generated using exponential variograms with random parameters. The results show that test 1.1B performs better in terms of RMSE between the observed and predicted concentrations. Test 1.1A estimates a smoother field than test 1.1B; the portions of the field with low and high permeability are fairly identified, but not the actual distribution of the two facies. In contrast, test 1.1B better reproduces the complexity of the actual field and the discontinuities between the two facies. In both tests, the coefficient of variation is higher at the top and bottom of the field, due to the absence of observation points in these portions of the aquifer. Therefore, the initialization of the procedure, in terms of the initial ensemble of parameters, affects the solution. This is stressed by the ensemble size; the smaller it is, the more the solution may depend on the initial configuration. The size of the ensemble should be related to the number of parameters to be estimated; in this case, the number of realizations is equal to 1,000, and the number of parameters to be estimated is 6,795. This may lead to rank-deficient covariance matrices, since they are computed from the ensembles, and therefore covariance localization techniques must be used to mitigate this effect. Hence, the use of a well-defined initial ensemble of parameters and covariance localization techniques allows for reducing the number of realizations. This leads to a remarkable reduction of the computational burden. For instance, a gradient-based optimization method needs to run the forward model at least as many times as the number of parameters, per iteration. The ES-MDA requires running the forward model a number of times equal to the ensemble size, for each iteration. In this case, the ES-MDA computational effort is 85% less than that required by a gradient-based method.

With the aim to further reduce the computational time, test 1.2 was performed similarly to test 1.1, following a pilot point approach. The hydraulic conductivity is estimated in 266 pilot points applying the ES-MDA method with an ensemble size equal to 100. The performance metrics are comparable with those obtained with the fully parameterized approach. The main features of the binary field are reproduced, but the results are less accurate than the previous test.

The following tests were performed taking into account the binary distribution of the sandbox field. ES-MDA was coupled with a truncated Gaussian model to simultaneously estimate the distribution of the facies and their main hydraulic and transport parameters. ES-MDA-T was first applied following the fully parametrized approach (test 1.3). The results reproduce well the actual field with high accuracy as well as the true parameters of each facies. This test was repeated following the pilot point approach (test 1.4), reaching good results, but with a larger error in the reproduction of the breakthrough curves compared with test 1.3. Both test 1.3 and test 1.4 performed better than test 1.1 and test 1.2, suggesting that the ESMDA-T approach is better than the standard ES-MDA one in reproducing the characteristics of a binary field. In addition, ES-MDA-T has the advantage of being able to simultaneously estimate more properties of the aquifer. This estimation could be computationally prohibitive with the ES-MDA approach, as each aquifer parameter has to be estimated at each cell (or each pilot point), resulting in a huge number of parameters to be estimated. Furthermore, the estimation of different aquifer parameters for each cell can lead to equifinality problems and unreliable solutions. Test 2.1 and test 2.2 were performed using the experimental sandbox data using the optimal configuration obtained from the synthetic cases (test 1.3 and test 1.4). The ES-MDA-T method achieves good results, but with lower performance than that obtained for the synthetic cases. This could be due to epistemic errors in both the experimental data and the forward model structure. ES-MDA-T allows one to reproduce the main features of the field and the estimated hydraulic and transport parameters for the two facies are reasonable and close to those expected.

In general, the results obtained following the pilot point approach are comparable with those achieved by the fully parametrized one, but they show less detailed and less accurate representation of the parameter field. However, compared to the fully parameterized approach, the pilot point method significantly reduces the required computational time by up to 90%. It is noteworthy that the number and location of the pilot points, as well as the interpolation method adopted, may affect the final results of the inverse procedure. Further investigations will focus on varying the pilot point configuration with the aim to find the optimal one considering the trade-off between the computational cost and the accuracy of the results. The last test (test 2.3) was performed assuming that the actual field is made up of three materials, where the third material is generated by a mixing zone at the interface between the two facies. The ES-MDA-T approach is applied using a simple truncated Gaussian model that ensures that the three facies occur in the same sequential order; the results are comparable with those obtained for test 2.1.

For more complex fields, truncated pluri-Gaussian methods can be implemented to reproduce more than two facies and different types of contacts between them. Future work will investigate this aspect by carrying out new laboratory experiments on more complex fields.

References

Agbalaka CC, Oliver DS (2008) Application of the EnKF and localization to automatic history matching of facies distribution and production data. Math Geosci 40:353–374. https://doi.org/10.1007/s11004-008-9155-7
Article Google Scholar
Alcolea A, Carrera J, Medina A (2006) Pilot points method incorporating prior information for solving the groundwater flow inverse problem. Adv Water Resour 29:1678–1689. https://doi.org/10.1016/j.advwatres.2005.12.009
Article Google Scholar
Anderson JL (2007) Exploring the need for localization in ensemble data assimilation using a hierarchical ensemble filter. Phys Nonlinear Phenom 230:99–111. https://doi.org/10.1016/j.physd.2006.02.011
Article Google Scholar
Bailey RT, Baù D (2012) Estimating geostatistical parameters and spatially-variable hydraulic conductivity within a catchment system using an ensemble smoother. Hydrol Earth Syst Sci 16:287–304. https://doi.org/10.5194/hess-16-287-2012
Article Google Scholar
Beavers GS, Sparrow EM, Rodenz DE (1973) Influence of bed size on the flow characteristics and porosity of randomly packed beds of spheres. J Appl Mech 40:655–660. https://doi.org/10.1115/1.3423067
Article Google Scholar
Camporese M, Cassiani G, Deiana R, Salandin P (2011) Assessment of local hydraulic properties from electrical resistivity tomography monitoring of a three-dimensional synthetic tracer test experiment. Water Resour Res 47. https://doi.org/10.1029/2011wr010528
Cao Z, Li L, Chen K (2018) Bridging iterative ensemble smoother and multiple-point geostatistics for better flow and transport modeling.J Hydrol.565:411–421. https://doi.org/10.1016/j.jhydrol.2018.08.023
Article Google Scholar
Capilla JE, Gómez-Hernández JJ, Sahuquillo A (1998) Stochastic simulation of transmissivity fields conditional to both transmissivity and piezometric head data: 3. application to the Culebra formation at the Waste Isolation Pilot Plant (WIPP), New Mexico, USA. J applicationHydrol 207:254–269. https://doi.org/10.1016/S0022-1694(98)00138-3
Chen X, Hammond GE, Murray CJ, Rockhold ML, Vermeul VR, Zachara JM (2013) Application of ensemble-based data assimilation techniques for aquifer characterization using tracer data at Hanford 300 area: tracer data assimilation at Hanford 300 area. Water Resour Res 49:7064–7076. https://doi.org/10.1002/2012WR013285
Article Google Scholar
Chen Y, Oliver DS (2010) Cross-covariances and localization for EnKF in multiphase flow data assimilation. Comput Geosci 14:579–601. https://doi.org/10.1007/s10596-009-9174-6
Article Google Scholar
Chen Y, Zhang D (2006) Data assimilation for transient flow in geologic formations via ensemble Kalman filter. Adv Water Resour 29:1107–1122. https://doi.org/10.1016/j.advwatres.2005.09.007
Article Google Scholar
Chen Z, Gómez-Hernández JJ, Xu T, Zanini A (2018) Joint identification of contaminant source and aquifer geometry in a sandbox experiment with the restart ensemble Kalman filter. J Hydrol 564:1074–1084. https://doi.org/10.1016/j.jhydrol.2018.07.073
Article Google Scholar
Chen Z, Xu T, Gómez-Hernández JJ, Zanini A (2021) Contaminant spill in a sandbox with non-Gaussian conductivities: simultaneous identification by the restart normal-score ensemble Kalman filter. Math Geosci 53:1587–1615. https://doi.org/10.1007/s11004-021-09928-y
Article Google Scholar
Chen Z, Xu T, Gómez-Hernández JJ, Zanini A, Zhou Q (2023) Reconstructing the release history of a contaminant source with different precision via the ensemble smoother with multiple data assimilation. J Contam Hydrol 252:104115. https://doi.org/10.1016/j.jconhyd.2022.104115
Article Google Scholar
Citarella D, Cupola F, Tanda MG, Zanini A (2015) Evaluation of dispersivity coefficients by means of a laboratory image analysis. J Contam Hydrol 172:10–23. https://doi.org/10.1016/j.jconhyd.2014.11.001
Article Google Scholar
Crestani E, Camporese M, Baú D, Salandin P (2013) Ensemble Kalman filter versus ensemble smoother for assessing hydraulic conductivity via tracer test data assimilation. Hydrol Earth Syst Sci 17:1517–1531. https://doi.org/10.5194/hess-17-1517-2013
Article Google Scholar
Crestani E, Camporese M, Salandin P (2015) Assessment of hydraulic conductivity distributions through assimilation of travel time data from ERT-monitored tracer tests. Adv Water Resour 84:23–36. https://doi.org/10.1016/j.advwatres.2015.07.022
Article Google Scholar
Cupola F, Tanda MG, Zanini A (2015) Laboratory sandbox validation of pollutant source location methods. Stoch Environ Res Risk Assess 29:169–182. https://doi.org/10.1007/s00477-014-0869-4
Article Google Scholar
D’Oria M, Mignosa P, Tanda MG, Todaro V (2021) Estimation of levee breach discharge hydrographs: comparison of inverse approaches. Hydrol Sci J 67(1). https://doi.org/10.1080/02626667.2021.1996580
Duan X, Deng Y, Chu X, Peng X, Su H, Yang H (2022) Identification of hydraulic conductivity field of a karst aquifer by using transition probability geostatistics and discrete cosine transform with an ensemble method. Hydrol Process 36(11). https://doi.org/10.1002/hyp.14755
Emerick AA, Reynolds AC (2012) History matching time-lapse seismic data using the ensemble Kalman filter with multiple data assimilations. Comput Geosci 16:639–659. https://doi.org/10.1007/s10596-012-9275-5
Article Google Scholar
Emerick AA, Reynolds AC (2013) Ensemble smoother with multiple data assimilation. Comput Geosci 55:3–15. https://doi.org/10.1016/j.cageo.2012.03.011
Article Google Scholar
Evensen G (1994) Sequential data assimilation with a nonlinear quasi-geostrophic model using Monte Carlo methods to forecast error statistics. J Geophys Res 99:10143. https://doi.org/10.1029/94JC00572
Article Google Scholar
Evensen G (2018) Analysis of iterative ensemble smoothers for solving inverse problems. Comput Geosci 223(22):885–908. https://doi.org/10.1007/S10596-018-9731-Y
Article Google Scholar
Franssen HJH, Kinzelbach W (2009) Ensemble Kalman filtering versus sequential self-calibration for inverse modelling of dynamic groundwater flow systems. J Hydrol 365:261–274. https://doi.org/10.1016/j.jhydrol.2008.11.033
Article Google Scholar
Franssen HJWMH, Gómez-Hernández JJ (2002) 3D inverse modelling of groundwater flow at a fractured site using a stochastic continuum model with multiple statistical populations. Stoch Environ Res Risk Assess SERRA 16:155–174. https://doi.org/10.1007/s00477-002-0091-7
Article Google Scholar
Gaspari G, Cohn SE (1999) Construction of correlation functions in two and three dimensions. Q J R Meteorol Soc 125:723–757. https://doi.org/10.1002/qj.49712555417
Article Google Scholar
Godoy VA, Napa-García GF, Gómez-Hernández JJ (2022) Ensemble smoother with multiple data assimilation as a tool for curve fitting and parameter uncertainty characterization: example applications to fit nonlinear sorption isotherms. Math Geosci 54:807–825. https://doi.org/10.1007/s11004-021-09981-7
Article Google Scholar
Gómez-Hernández JJ, Sahuquillo A, Capilla JE (1997) Stochastic simulation of transmissivity fields conditional to both transmissivity and piezometric data: I. theory. J Hydrol 203:162–174. https://doi.org/10.1016/S0022-1694(97)00098-X
Article Google Scholar
Gómez-Hernández JJ, Franssen H-JWMH, Sahuquillo A (2003) Stochastic conditional inverse modeling of subsurface mass transport: a brief review and the self-calibrating method. Stoch Environ Res Risk Assess SERRA 17:319–328. https://doi.org/10.1007/s00477-003-0153-5
Article Google Scholar
Hamill TM, Whitaker JS, Snyder C (2001) Distance-dependent filtering of background error covariance estimates in an ensemble Kalman filter. Mon Weather Rev 129:2776–2790. https://doi.org/10.1175/1520-0493(2001)129/3c2776:DDFOBE/3e2.0.CO;2
Article Google Scholar
Harbaugh AW (2005) MODFLOW-2005: the U.S. Geological Survey modular ground-water model: the ground-water flow process. US Geol Surv Techniques Methods 6-A15. https://doi.org/10.3133/tm6A16
Keller J, Franssen H-JH, Nowak W (2021) Investigating the pilot point ensemble Kalman filter for geostatistical inversion and data assimilation. Adv Water Resour 155:104010. https://doi.org/10.1016/j.advwatres.2021.104010
Article Google Scholar
Lam D ‐T, Renard P, Straubhaar J, Kerrou J (2020) Multiresolution approach to condition categorical multiple‐point realizations to dynamic data with iterative ensemble smoothing. Water Resour Res 56. https://doi.org/10.1029/2019WR025875
Li L, Srinivasan S, Zhou H, Gómez-Hernández JJ (2013) A pilot point guided pattern matching approach to integrate dynamic data into geological modeling. Adv Water Resour 62:125–138. https://doi.org/10.1016/j.advwatres.2013.10.008
Article Google Scholar
Liu N, Oliver DS (2005) Ensemble Kalman filter for automatic history matching of geologic facies. J Pet Sci Eng 47:147–161. https://doi.org/10.1016/j.petrol.2005.03.006
Article Google Scholar
Matheron G, Beucher H, de Fouquet C, Galli A, Guerillot D, Ravenne C (1987) Conditional simulation of the geometry of fluvio-deltaic reservoirs. In: All days. SPE-16753-MS, SPE, Tulsa, OK
Panzeri M, Riva M, Guadagnini A, Neuman SP (2015) EnKF coupled with groundwater flow moment equations applied to Lauswiesen aquifer, Germany. J Hydrol 521:205–216. https://doi.org/10.1016/j.jhydrol.2014.11.057
Article Google Scholar
RamaRao BS, LaVenue AM, De Marsily G, Marietta MG (1995) Pilot point methodology for automated calibration of an ensemble of conditionally simulated transmissivity fields: 1. theory and computational experiments. Water Resour Res 31:475–493. https://doi.org/10.1029/94WR02258
Article Google Scholar
Todaro V, D’Oria M, Tanda MG, Gómez-Hernández JJ (2019) Ensemble smoother with multiple data assimilation for reverse flow routing. Comput Geosci 131:32–40. https://doi.org/10.1016/j.cageo.2019.06.002
Article Google Scholar
Todaro V, D’Oria M, Tanda MG, Gómez-Hernández JJ (2021) Ensemble smoother with multiple data assimilation to simultaneously estimate the source location and the release history of a contaminant spill in an aquifer. J Hydrol 598, Art. no. 126215. https://doi.org/10.1016/j.jhydrol.2021.126215
Todaro V, D’Oria M, Tanda MG, Gómez-Hernández JJ (2022) genES-MDA: a generic open-source software package to solve inverse problems via the ensemble smoother with multiple data assimilation. Comput Geosci 167(C), Art. no. 105210. https://doi.org/10.1016/j.cageo.2022.105210
Tong J, Hu BX, Yang J (2013) Data assimilation methods for estimating a heterogeneous conductivity field by assimilating transient solute transport data via ensemble Kalman filter: data assimilation methods for transient solute transport modeling. Hydrol Process 27:3873–3884. https://doi.org/10.1002/hyp.9523
Article Google Scholar
van Leeuwen PJ, Evensen G (1996) Data assimilation and inverse methods in terms of a probabilistic formulation. Mon Weather Rev 124:2898–2913. https://doi.org/10.1175/1520-0493(1996)124/3c2898:DAAIMI/3e2.0.CO;2
Article Google Scholar
Vrugt JA, Stauffer PH, Wöhling T, Robinson BA, Vesselinov VV (2008) Inverse modeling of subsurface flow and transport properties: a review with new developments. Vadose Zone J 7:843–864. https://doi.org/10.2136/vzj2007.0078
Article Google Scholar
Wen XH, Capilla JE, Deutsch CV, Gómez-Hernández JJ, Cullick AS (1999) A program to create permeability fields that honor single-phase flow rate and pressure data. Comput Geosci 25:217–230. https://doi.org/10.1016/S0098-3004(98)00126-5
Article Google Scholar
Wen X-H, Tran TT, Behrens RA, Gómez-Hernández JJ (2002) Production data integration in sand/shale reservoirs using sequential self-calibration and geomorphing: a comparison. SPE Reserv Eval Eng 5:255–265. https://doi.org/10.2118/78139-PA
Article Google Scholar
Xu T, Gómez-Hernández JJ (2018) Simultaneous identification of a contaminant source and hydraulic conductivity via the restart normal-score ensemble Kalman filter. Adv Water Resour 112:106–123. https://doi.org/10.1016/j.advwatres.2017.12.011
Article Google Scholar
Xu T, Gómez-Hernández JJ, Chen Z, Lu C (2021) A comparison between ES-MDA and restart EnKF for the purpose of the simultaneous identification of a contaminant source and hydraulic conductivity. J Hydrol 595:125681. https://doi.org/10.1016/j.jhydrol.2020.125681
Article Google Scholar
Zhang Y, Oliver DS, Chen Y, Skaug HJ (2015) Data assimilation by use of the iterative ensemble smoother for 2D facies models. SPE J 20:169–185. https://doi.org/10.2118/170248-PA
Article Google Scholar
Zhao Y, Reynolds AC, Li G (2008) Generating facies maps by assimilating production data and seismic data with the ensemble Kalman filter. In: All days. SPE-113990-MS, SPE, Tulsa, OK
Zheng C, Wang PP (1999) MT3DMS: a modular three-dimensional multispecies transport model for simulation of advection, dispersion, and chemical reactions of contaminants in groundwater systems: documentation and user’s guide. SERDP-99-1, US Army Corps of Engineers, Washington, DC
Zhou H, Gómez-Hernández JJ, Li L (2014) Inverse methods in hydrogeology: evolution and recent trends. Adv Water Resour 63:22–37. https://doi.org/10.1016/j.advwatres.2013.10.014
Article Google Scholar
Zimmerman DA, de Marsily G, Gotway CA, Marietta MG, Axness CL, Beauheim RL, Bras RL, Carrera J, Dagan G, Davies PB, Gallegos DP, Galli A, Gómez-Hernández J, Grindrod P, Gutjahr AL, Kitanidis PK, Lavenue AM, McLaughlin D, Neuman SP, RamaRao BS, Ravenne C, Rubin Y (1998) A comparison of seven geostatistically based inverse approaches to estimate transmissivities for modeling advective transport by groundwater flow. Water Resour Res 34:1373–1413. https://doi.org/10.1029/98wr00003
Article Google Scholar
Zovi F, Camporese M, Hendricks Franssen H-J, Huisman JA, Salandin P (2017) Identification of high-permeability subsurface structures with multiple point geostatistics and normal score ensemble Kalman filter. J Hydrol 548:208–224. https://doi.org/10.1016/j.jhydrol.2017.02.056
Article Google Scholar

Download references

Funding

Open access funding provided by Università degli Studi di Parma within the CRUI-CARE Agreement. Valeria Todaro acknowledges financial support from PNRR MUR project ECS_00000033_ECOSISTER. J. Jaime Gómez-Hernández acknowledges grant PID2019-109131RB-I00 funded by MCIN/AEI/10.13039/50110001103.

Author information

Authors and Affiliations

Department of Engineering and Architecture, University of Parma, 43124, Parma, Italy
Valeria Todaro, Marco D’Oria, Andrea Zanini & Maria Giovanna Tanda
Institute for Water and Environmental Engineering, Universitat Politècnica de València, 46022, València, Spain
J. Jaime Gómez-Hernández

Authors

Valeria Todaro
View author publications
You can also search for this author in PubMed Google Scholar
Marco D’Oria
View author publications
You can also search for this author in PubMed Google Scholar
Andrea Zanini
View author publications
You can also search for this author in PubMed Google Scholar
J. Jaime Gómez-Hernández
View author publications
You can also search for this author in PubMed Google Scholar
Maria Giovanna Tanda
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Valeria Todaro.

Ethics declarations

Conflicts of Interest

On behalf of all authors, the corresponding author states that there is no conflict of interest.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Published in the special issue “Geostatistics and hydrogeology”.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Todaro, V., D’Oria, M., Zanini, A. et al. Experimental sandbox tracer tests to characterize a two-facies aquifer via an ensemble smoother. Hydrogeol J 31, 1665–1678 (2023). https://doi.org/10.1007/s10040-023-02662-1

Download citation

Received: 05 December 2022
Accepted: 12 June 2023
Published: 18 July 2023
Issue Date: September 2023
DOI: https://doi.org/10.1007/s10040-023-02662-1

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Experimental sandbox tracer tests to characterize a two-facies aquifer via an ensemble smoother

Abstract

Résumé

Resumen

摘要

Resumo

Similar content being viewed by others

Identification of non-Gaussian parameters in heterogeneous aquifers by a modified probability conditioning method through hydraulic-head assimilation

Joint inversion of physical and geochemical parameters in groundwater models by sequential ensemble-based optimal design

Impact of model complexity and multi-scale data integration on the estimation of hydrogeological parameters in a dual-porosity aquifer

Introduction

Methods

ES-MDA

Initialization phase

Forecast step

Update step

The ES-MDA-T: linking the ES-MDA and a truncated Gaussian model

Initialization phase

Truncation step

Forecast step

Update step

Fully-parameterized and pilot point approaches

Description of the experiment

Sandbox experiment

Synthetic experiment

Applications

Synthetic case study

Test 1.1: ES-MDA for estimation of the hydraulic conductivity field following a fully parameterized approach

Test 1.2: ES-MDA for estimation of the hydraulic conductivity using a pilot point approach

Test 1.3 ES-MDA-T for characterization of the binary field following a fully parameterized approach

Test 1.4 ES-MDA-T for characterization of the binary field using the pilot point approach

Experimental case study

Test 2.1: ES-MDA-T for characterization of the binary field following a fully parameterized approach

Test 2.2: ES-MDA-T for characterization of the binary field using a pilot point approach

Test 2.3: ES-MDA-T for characterization of a three-facies field following a parameterized approach

Discussion and conclusions

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflicts of Interest

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation