Introduction

Large-scale injection of CO2 into geologic formations is considered a potential mitigation method to reduce greenhouse gas emissions. The safety and effectiveness of geologic carbon sequestration (GCS) are achieved when injected CO2 remains contained within the storage reservoir. Trapping mechanisms reducing the mobile CO2 that could impact health, safety or the environment include structural trapping, residual phase trapping, solubility trapping, and mineral trapping (IPCC 2005). Despite these trapping mechanisms, it is possible in some cases that CO2 could unexpectedly leak upwards due to (1) the large amount of CO2 injected and (2) the buoyant nature of CO2 (Oldenburg et al. 2008). On the other hand, it is important to recognize that CO2 is non-hazardous unless concentrations are above certain levels. The key to public acceptance and success of GCS is to address leakage concerns, and to demonstrate that leakage risks are acceptably small.

Similar to what has been used for the risk assessment of nuclear waste repositories, general probabilistic theory and a features, events, and processes (FEP) scenario approach (Savage et al. 2004; Wildenborg et al. 2004, 2005) have been used to evaluate risks related to GCS. The FEP approach includes identifying all relevant FEPs, defining scenarios, and modeling environmental impacts and consequences. In this approach, the probability that certain FEPs will occur is usually assigned as input. Bowden and Rigg (2004) used a performance index to quantitatively characterize risks, where the likelihood and duration of each risk is determined by an expert panel and entered as input to each risk event. To date there has not been a quantitative risk evaluation based on geology and CO2 plume characteristics for a given CO2 storage site.

Three potential pathways may be available for CO2 to escape from the storage formation to regions with vulnerable resources (e.g., a drinking water aquifer) (Espie 2004; Pruess 2008; Zweigel et al. 2004): (1) leakage through the caprock, (2) leakage through subvertical faults or fracture zones, and (3) leakage through abandoned wells. The focus of this study is on the second pathway—leakage through faults or fracture zones. The assessment of risks includes evaluation of both the probability that the CO2 plume will reach drinking water or another vulnerable resource through a conduit, and the impact of the leakage, which requires knowledge of CO2 flux, concentration, and total amount. The focus of our study is on the estimation of leakage probability (P leak).

This work is designed to fit into the certification framework (CF) for geological CO2 storage. The overall objective of the CF is to develop a simple framework for evaluating leakage risk for certifying operation and abandonment of geological CO2 storage sites. In the CF, the term “compartment” is defined as a region containing vulnerable resources that could potentially be impacted such as potable groundwater and the near-surface environment. The locations of compartments are abstract in the sense that they may include disconnected pieces (Oldenburg et al. 2008). The same concept is used in this work as well.

The fundamental uncertainties addressed by the approach described in this paper are schematically presented in Fig. 1. Starting from the injection horizon, CO2 enters the storage region and migrates as controlled by pressure, buoyancy, permeability, and capillary effects. Because plume migration distances and the presence of faults and fractures may only be estimated roughly from site characterization data prior to actual injection, the first unknown is whether the CO2 plume will intersect a conductive fault that could serve as a conduit for CO2 to leak through the sealing formation. Probabilistic approaches are needed to quantify the likelihood of this event based on the size of the CO2 plume, which is highly uncertain given the uncertain properties of the deep storage reservoir. The second unknown is whether the conductive fault(s) intersected by the CO2 plume are connected to other conductive faults in such a way that a connected flow path is formed to allow CO2 to leak to a compartment where there could be a potential impact. The connectivity of the conduits is related to the geometric characteristics of the system of conduits (i.e., distribution of conduits) between the storage reservoir and the compartment. For a site (which includes the storage formation and the geological formation above it) to be selected for GCS, some fault and fracture distribution data are expected to be available. However, the information on the conduit system is usually limited and highly uncertain. Therefore, it is a challenge to predict (1) if the conduits are connected, and if so, (2) the probability that a CO2 plume will encounter the connected pathways. The objective of this work is to provide a framework for estimating the likelihood that CO2 will intersect a conductive conduit network that allows leakage to occur. The amount of leakage and the impact of leakage on the compartment are not within the scope of this study.

Fig. 1
figure 1

Schematic geologic cross section (not to scale) showing CO2 injection well, CO2 plume, sealing formation, overlying formations, and potable ground water, along with conductive faults that may or may not intersect with each other as indicated by the question marks

In the remainder of this paper, we develop the approach and demonstrate its applicability for a simplified geologic formation. The extension to a realistic CO2 sequestration system is outlined in the conclusion section.

Approach

The proposed approach includes four steps: (1) estimate a critical value α c for parameter α, which is related to the density of conduits (faults and fractures) in the conduit length distribution model, such that if α = α c , there is a 50% probability that the system is connected between the storage formation and a compartment; (2) use numerical simulations to estimate the probability that the CO2 plume will encounter the connected conduits for a system with α that is above or slightly less than α c for various distributions of conduits, system size and CO2 plume sizes; (3) construct fuzzy rules that relate information about the conduit system and CO2 plume size to leakage probability based on simulation results from the previous step; and (4) using the fuzzy rules developed in the previous step, predict the probability that CO2 will leak from the storage formation to a compartment through connected conduits for a given system. Each of these steps will be described in detail in the subsequent subsections.

One of the main advantages of the proposed approach is that conduit network generation (Step 2), which is the computationally intensive task, only needs to be done once. Once rules are generated (Step 3), P leak can be easily determined in Step 4 for any given system with a known or estimated conduit distribution.

While the approach can be expanded to be applicable to realistic geologic formations, the four steps are illustrated for a simplified system with the following assumptions:

  • The system is a square, two-dimensional (2D) cross section with sides of length L. The compartment where impacts occur is located at the top of the system and is of size L.

  • Faults/fractures are represented by line segments in 2D.

  • Faults/fractures are randomly positioned and oriented.

  • Only conductive faults/fractures are considered.

  • Faults/fractures follow a power-law length distribution.

  • The details of the proposed approach are discussed next.

Calculation of critical α

In this part, we first justify and describe the power-law fault length distribution we choose for the subsequent analysis. Then we will introduce the percolation threshold, which is related to the connectivity of the fault system. Finally, we provide the relationship between the percolation threshold and the critical value of α (the coefficient in the power-law length distribution).

Distribution of fault length

Extensive studies have been done to characterize fault systems. The power-law distribution is the most widely used model to describe the fault-length distribution (Gudmundsson 1987; Scholz and Cowie 1990; Segall and Pollard 1983). Other statistical descriptions used to characterize fault length include the lognormal distribution (Priest and Hudson 1981; Rouleau and Gale 1985), the exponential distribution (Carbotte and Macdonald 1994; Cowie et al. 1993, 1995; Dershowitz and Einstein 1988), the stretched exponential distribution (Laherrere and Sornette 1998), and the Gamma distribution (Davy 1993; Kagan 1997; Main 1996; Sornette and Sornette 1999). A detailed review of fault characterization distributions can be found in Bonnet et al. (2001). Based on the arguments of Bonnet et al. (2001), we will use a power-law function to describe the fault-length distribution.

The power-law distribution of a fault system is given by:

$$ n(l) = \alpha \;l^{ - a} $$
(1)

where n(l)dl is the number of faults having a length in the interval [l, l + dl], α is a coefficient of proportionality that reflects fault density and depends on the system size L (assuming a square system with sides of length L), and a is an exponent, which typically varies between one and three. It is apparent from Eq. 1 that the power-law distribution contains no characteristic length. This is the key argument for using power laws to describe fault growth processes (Bonnet et al. 2001).

Percolation threshold

Percolation theory (Stauffer and Aharony 1992) has been applied to study the connectivity of fault systems. In percolation theory, a percolation parameter p is used as an average measure of the geometrical properties, generally related to the density of faults, which also provides information on the connectivity of the system. For a 2D system (size L) with a total number of N faults of constant length l, the percolation parameter p is defined as:

$$ p = Nl^{2} /L^{2} . $$
(2)

The percolation threshold p c is defined as the critical p value below which the fault system is not connected (on average), whereas when p is above the critical value p c , the system is on average connected. In other words, 50% of the systems at the percolation threshold are connected. The percolation threshold p c can be obtained from excluded area arguments (Balberg et al. 1984) or numerical simulation (Robinson 1983). For a power-law length distribution (Eq. 1) of a fault network, Bour and Davy (1997) demonstrated that the percolation threshold p c (L) does not vary significantly with L. For any value of a, the computed values of p c (L) are around 5.6 in two dimensions. Therefore, 5.6 will be used as the first approximation for p c in the next step.

The purpose of the first step is to find a critical parameter α c at percolation threshold, where α c is related to the critical density of conduits at which the network is on average connected for a given system size.

Determining critical parameter α c

Studies on the connectivity of faults include both faults with constant length (Balberg et al. 1984, 1991; Gueguen and Dienes 1989; Stauffer and Aharony 1992) and faults with a power-law distribution (Bour and Davy 1997, 1998; Renshaw 1999). Here, we will adopt some results from the study of Bour and Davy (1997) to find the critical density of a fault system.

Bour and Davy (1997) presented an analytical expression for the percolation threshold for a fault system following a power-law length distribution. The analysis is based on the relative contribution of small and large faults for defining the percolation threshold, e.g., the percolation threshold should be the sum of two terms describing the behavior of “small” and “large” faults:

$$ p_{c} (L) = \int\limits_{{l_{\min } }}^{L} {\frac{{n(l)l^{2} }}{{L^{2} }}} {\text{d}}l + \int\limits_{L}^{{l_{\max } }} {n(l)} {\text{d}}l. $$
(3)

where L is the system size, l min and l max are the smallest and largest conduit lengths considered in the system, and n(l) is the probability density function (pdf) of fault length distribution (in our case, the power-law distribution). If l max is less than L, the second term on the right-hand side of Eq. 3 drops out and the first term integrates to l max instead of L.

By inserting Eq. 1 for n(l) into Eq. 3, and setting p c to 5.6, an expression for the critical fault density α c is obtained

$$ \int\limits_{{l_{\min } }}^{L} {\frac{{\alpha_{c} l^{2 - a} }}{{L^{2} }}} {\text{d}}l + \int\limits_{L}^{{l_{\max } }} {\alpha_{c} l^{ - a} } {\text{d}}l = 5.6. $$
(4)

As shown in Figure (9) of Bour and Davy (1997), p c varies very slightly around 5.6 for different values of a and L, with the largest discrepancy of p c  ≈ 7. In this case, the calculated α c using the above equations underestimates α c by about 25%, which is conservative. We assume p c (L) = 5.6 provides a good first approximation for a finite system with a power-law distribution of conduit length. Corrections for p c (L) due to finite-size effect and different values of the exponent a in the pdf of fault length distribution will not be made in this analytical formulation. However, the overestimation of α c caused by these effects will be considered in the numerical simulation by performing simulations for systems with α less than α c , until the resulting P leak is considered small enough to be ignored.

Renshaw (1999) demonstrated that the connectivity of power-law length distribution networks is insensitive to the lower cutoff length l min as long as this length is sufficiently small. However, it is sensitive to the higher cutoff length l max. This effect will also be considered in the numerical simulation.

The length scales in both Eqs. 1 and 4 can be normalized by the smallest fault size l min: l s  = l/l min, L s  = L/l min, and l max s  = l max/l min. By defining \( \alpha_{s} = \alpha \,{\kern 1pt} l_{\min }^{ - a + 1} , \) we obtain:

$$ n(l){\kern 1pt} {\text{d}}l = \alpha {\kern 1pt} l^{ - a} {\text{d}}l = \alpha {\kern 1pt} l_{\min }^{ - a + 1} \frac{{l^{ - a} }}{{l_{\min }^{ - a} }}d\frac{l}{{l_{\min } }} = \alpha_{s} l_{s}^{ - a} {\text{d}}l_{s} . $$
(5)

For a ≠ 3, and if L ≥ l max,

$$ p_{c} (L) = \int\limits_{{l_{\min } }}^{{l_{\max } }} {\frac{{n(l)l^{2} }}{{L^{2} }}} {\text{d}}l = \int\limits_{1}^{{l_{\max s} }} {\frac{{\alpha_{sc} l_{s}^{2 - a} }}{{L_{s}^{2} }}} {\text{d}}l_{s} = \frac{{\alpha_{sc} }}{{L_{s}^{2} }}\left[ {\frac{1}{a - 3} - \frac{{l_{\max s}^{3 - a} }}{a - 3}} \right] = 5.6. $$
(6)

Where α sc is the critical value of α s when the system is at the percolation threshold p c whereas if L < l max,

$$ p_{c} (L) = \int\limits_{1}^{{L_{s} }} {\frac{{\alpha_{sc} l_{s}^{2 - a} }}{{L_{s}^{2} }}} {\text{d}}l_{s} + \int\limits_{{L_{s} }}^{{l_{\max s} }} {\alpha_{sc} l_{s}^{ - a} } {\text{d}}l_{s} = \alpha_{sc} \left[ {\frac{1}{a - 3}\frac{1}{{L_{s}^{2} }} + \left( {\frac{1}{3 - a} + \frac{1}{a - 1}} \right)L_{s}^{1 - a} + \frac{1}{1 - a}l_{\max s}^{1 - a} } \right] = 5.6. $$
(7)

For a = 3, and if L ≥ l max

$$ p_{c} (L) = \alpha_{sc} \frac{1}{{L_{s}^{2} }}\ln l_{\max s} = 5.6 $$
(8)

whereas if L < l max,

$$ p_{c} (L) = \alpha_{sc} \left[ {\frac{1}{{L_{s}^{2} }}\ln L_{s} + \frac{1}{a - 1}L_{s}^{ - 2} + \frac{1}{1 - a}l_{\max s}^{1 - a} } \right] = 5.6. $$
(9)

The expression for α sc shows that a larger system size L s corresponds to a larger α sc ; a larger l max s corresponds to a smaller α sc ; and a larger exponent a (larger portion of smaller faults) corresponds to a larger α sc .

For a given system, we can calculate the critical parameter α sc and compare it to the actual parameter α s . If the actual density is much smaller than the critical value, we can conclude that the system is not connected and the CO2 plume will not be able to leak out through the fault system. For systems with α s around or above its critical value, the steps described in “Conduit network generation to determine P leak ” and “Construct fuzzy rules for calculating P leak ” need to be performed.

The above formulation works for a square system and for conduits with random orientations. For an anisotropic system—a system with different horizontal and vertical connectivity on average—the percolation theory still holds, but with a modified expression for the percolation threshold (Masihi et al. 2006).

Conduit network generation to determine P leak

The purpose of this step is to provide a basis to form fuzzy rules for P leak for systems of different size, fault geometries, and CO2 plume size through numerical generation of fault networks using a limited number of parameters. We assume that vulnerable resources are located at the top of the system. This means the probability that a connected pathway (connected also to the top of the system), encounters a compartment is 1. In this case, P leak depends on only two unknowns, i.e., whether the system is connected (U1), and whether a connected pathway intersects the CO2 plume (U2). U2 depends on both the number of connections and the size of the CO2 plume.

Two types of uncertainties are considered. The first stems from our lack of knowledge of the system, specifically, parameters used to describe fault-length distribution and reservoir properties used to estimate CO2 plume size. This uncertainty will be considered by using fuzzy-rule-based modeling to propagate the uncertainty of the input parameters to the estimation of P leak. We vary system size and fault-length distribution parameters to generate fault networks and estimate P leak for various CO2 plume sizes. The second type of uncertainty is the uncertainty in the generation of the discrete fault network itself. Even for systems with the same parameters (e.g., system size and fault distribution), the generated network could have very different connectivities. To consider this uncertainty on the evaluation of the leakage probability P leak, multiple realizations of random discrete fault networks using the same parameter set are generated. The average and 95% confidence interval are used to interpret the results.

The parameters varied in the fault network generation and P leak calculations are the normalized system size L s , the normalized maximum fault length l max s , the exponent a, the ratio of r = α s /α sc (representing the system’s actual fault density compared to that at the percolation threshold), and the normalized plume size M s , which is the CO2 plume size divided by the smallest fault size l min.

The total number of faults that exist in the system is obtained by integrating the left side of Eq. 1:

$$ N = \int\limits_{1}^{{l_{\max s} }} {\alpha_{s} l_{s}^{ - a} } {\text{d}}l_{s} = \frac{{\alpha_{s} }}{1 - a}\left( {l_{\max s}^{1 - a} - 1} \right). $$
(10)

To generate a fault network, three parameters for characterizing the geometry of individual faults need to be determined, namely location, orientation, and length. In our simulation, we locate the center of conduits randomly in the system. Fault orientation is also random, e.g., uniformly sampled from all directions. Fault length (l) is sampled from its power-law distribution. Note that hydraulic properties of the faults are not needed in this study, as no leakage flux is determined, and because fault network connectivity is estimated using geometrical parameters of the hydraulically conductive faults.

Once the fault network is generated using a given set of L s , l max s , a, and r = α s /α sc , to calculate P leak, we need to (1) remove the unconnected faults (starting from the top of the system) and examine if a connected pathway can be established when the bottom of the system is reached, and (2) calculate the probability (P inter) that the CO2 plume encounters this/these connected pathway(s). The first step will be explained in detail in the illustrative example. In the second step, for systems where no connected pathway is established, P inter = 0; for systems where a connected pathway is found, an “effective connection” concept and a moving average method are used to calculate P inter. All connections encountered within M s /L s (i.e., the normalized CO2 plume size divided by the normalized system size) are counted as a single effective connection. With each of the assumed plume sizes, P inter is obtained by averaging effective connections for the CO2 plume at different locations. In other words, a moving average is performed by moving the CO2 plume along the caprock and checking if it encounters a connected pathway. If it does, we assign a number of one, and if it does not, we assign a number of zero. The final averaged number is P inter—the probability that the CO2 plume encounters a conductive fault (or fault zone) that is connected to other conductive conduits and serves as a pathway for CO2 to escape from the reservoir and migrate to compartments. As discussed earlier, P leak depends not only on P inter, but also on the probability (P con) that a connected pathway exists in the system. To consider P con, for each parameter set, we average P inter from all realizations (including the ones with P inter = 0 and the ones obtained using moving average method) to obtain P leak. In this way, P con is implicitly considered in the P leak calculation.

As a result of this procedure, the following statement can be made for each parameter set:

$$ {\mathbf{IF}}\,L_{s } = L_{1}\,{\mathbf{AND}}\,l_{{{ \max }s}} = l_{1}\,{\mathbf{AND}}\,a = a_{1}\,{\mathbf{AND}}\,r = r_{ 1}\,{\mathbf{AND}}\,M_{s} = M_{1},$$
$$ {\mathbf{THEN}}\; P_{\text{leak}} ,{\text{ is }}b. $$

Here, L 1 , l 1 , a 1 , r 1 (r 1  ≥ 1), and M 1 are the numerical values of the varying parameters in the simulation, covering all likely values. b is the calculated P leak for each parameter set. Up to this point, both input variables and output variable are crisp numbers rather than fuzzy numbers.

Construct fuzzy rules for calculating P leak

Fuzzy logic is viewed as a system of concepts, principles, and methods for dealing with models of reasoning that are approximate rather than exact (Novak and Perfilieva 2000). Due to its strength in dealing with uncertainty, ambiguity, and imprecision that one often encounters in modeling natural systems, fuzzy logic has been successfully applied to earth sciences including areas of surface and subsurface hydrology (e.g., Bardossy 1996; Bardossy et al. 2005; Bardossy and Disse 1993; Dou et al. 1999; Hundecha et al. 2001), water resources management and risk assessment (e.g., Kumar et al. 2006; Panigrahi and Mujumdar 2000; Shrestha et al. 1996; Uricchio et al. 2004), and soil science (e.g., Bardossy and Lehmann 1998; Mays et al. 1997; McBratney and Degruijter 1992; McBratney and Odeh 1997; Odeh et al. 1992).

Fuzzy-rule based modeling represents a complex system with imprecise, vague, and uncertain information by means of fuzzy rules. A fuzzy rule (the ith rule) consists of a set of arguments C i,k (kth argument, k = 1,…, K) in the form of fuzzy sets with membership functions \( \mu_{{C_{i,\,k} }} \), and a consequence B i , which is also in the form of a fuzzy set with a membership function \( \mu_{{B_{i} }} \). The membership function \( \mu_{{C_{i,\,k} }} \)expresses the grade or degree of membership of element x in C i,k . A simple fuzzy rule statement reads as follows:

$$ {\mathbf{IF}}\,x_{1}\,{\text{ is }}C_{i,1}\,{\mathbf{AND}}\,x_{2} {\text{ is }}C_{i,2}\,{\mathbf{AND}}\,\ldots\,{\mathbf{AND}}\,x_{K} {\text{ is }}C_{i,K} , {\mathbf{THEN}}\,B_{i}.$$

The use of fuzzy sets (instead of crisp numbers) in the fuzzy statements allows the rules to be used conveniently for both descriptive and quantitative purposes. In addition, these fuzzy rules can be partially or simultaneously fulfilled. This means a rule can have partial applicability, or it is possible to have a few partially applicable rules combined.

The degree of fulfillment (DOF) ν is used to quantify the truth grade corresponding to the fulfillment of the conditions of a rule (the ith rule) for given premises (x 1,…, x K ).

$$ \nu (x_{1}\,{\mathbf{AND}}\,x_{2}\,{\mathbf{AND}}\,\ldots\,{\mathbf{AND}}\,x_{K} ) = \mu_{{C_{i,1} }} (x_{1} )\mu_{{C_{i,2} }} (x_{2} ) \cdots \mu_{{C_{i,K} }} (x_{K} ). $$
(11)

Fuzzy rules can be developed using expert opinions, existing data, and qualitative information. Alternatively, fuzzy rules can be generated through numerical simulations (Bardossy and Disse 1993; Bardossy and Duckstein 1995; Dou et al. 1999). In our case, we use results from Step 2 as the training set to construct fuzzy rules. Since the training set is from numerical simulations covering the feasible input parameter (fault network parameters and plume size), we know the input data structure (e.g., exponent a between 1 and 3, we can define a few fuzzy numbers evenly distributed between 1 and 3). We use the weighted counting algorithm proposed by Bardossy and Duckstein (1995) to construct fuzzy rules. In this method, the rule premises are defined explicitly, and responses to the rule are defined using all simulation data sets. The training set Γ is written as:

$$ \Upgamma = \left\{ {(x_{1} (s), \ldots ,x_{K} (s),b(s));\quad s = 1, \ldots ,S} \right\} $$
(12)

where b refers to the consequence, in our case it is P leak. And S is the total number of data (in our case, the number of parameter set in the simulation).

The algorithm can be described as follows:

Define the membership functions of the premises. There are five arguments (K = 5) in our case: system size L s , largest fault size l max s , exponent a, ratio r = α s /α sc , and ratio M s /L s (for convenience, we use the ratio of plume size over system size to represent relative plume size). If triangular membership functions are used for C i,k (the kth argument of the ith rule), define the fuzzy number (\( c_{i,k}^{ - } \), \( c_{i,k}^{1} \), \( c_{i,k}^{ + } \)) as shown in Fig. 2.

Fig. 2
figure 2

Triangular membership functions for a fuzzy number

Calculate the DOF ν i of each rule for each premise vector [x 1(s),…, x k (s)] corresponding to the training set.

Select a number ε > 0 such that only responses with a DOF of at least ε will be considered in the construction of the rule response. The corresponding response is assumed to be a triangular fuzzy number (\( \beta_{i}^{ - } ,\;\beta_{i}^{1} ,\;\beta_{i}^{ + } \)), where

$$ \beta_{i}^{ - } = \mathop {\min }\limits_{{\nu_{i} (s) > \varepsilon }} b(s) $$
(13)
$$ \beta_{i}^{1} = \frac{{\sum\nolimits_{{\nu_{i} (s) > \varepsilon }} {\nu_{i} (s)b(s)} }}{{\sum\nolimits_{{\nu_{i} (s) > \varepsilon }} {\nu_{i} (s)} }} $$
(14)
$$ \beta_{i}^{ + } = \mathop {\max }\limits_{{\nu_{i} (s) > \varepsilon }} b(s). $$
(15)

The resulting fuzzy rules have the following format:

IF\( L_{s} = (L_{si}^{ - } ,L_{si}^{1} ,L_{si}^{ + } ) \)AND\( l_{\max s} = (l_{\max \,s\,i}^{ - } ,l_{\max \,s\,i}^{1} ,l_{\max \,s\,i}^{ + } ) \)AND\( a = (a_{i}^{ - } ,a_{i}^{1} ,a_{i}^{ + } ) \)AND\( \alpha_{s} /\alpha_{sc} = (r_{i}^{ - } ,r_{i}^{1} ,r_{i}^{ + } ) \)AND\( M_{s} /L_{s} = (m_{i}^{ - } ,m_{i}^{1} ,m_{i}^{ + } ), \)THEN the probability that a CO2 plume escapes through a connected conduit system is \( P_{\text{leak}} = (\beta_{i}^{ - } ,\beta_{i}^{1} ,\beta_{i}^{ + } ). \)

Calculate P leak for a given system

For a given system, the first step is to calculate α sc and to compare it to α s . If the two are the same, P leak should be about 0.5. If the latter is much smaller, P leak = 0. We will address how much smaller α s has to be so P leak can be ignored in the example problem. For systems with an α s around or above α sc , the above fuzzy rules are used to infer P leak. There two commonly used inference models, referred to as Mamdani-type models (Mamdani and Assilian 1975) and the Takagi–Sugeno-type models (Takagi and Sugeno 1985). However, the Takagi–Sugeno-type system uses a single spike as the output membership function, where the output (consequence) is calculated as the weighted average of a few data points rather than integrating over the domain of the output fuzzy set. The method does not provide the uncertainty range of the output, which is what we need for our prediction result. The Mamdani-type inference system uses a maximum combination method to aggregate fuzzy rules. The method tolerates disagreement between rules, but it does not increase the membership function of the response if two rules give the same results. Furthermore, it does not give higher weights to rules with crisper answers, therefore it could make the response very vague.

We use the normalized sum combination method proposed by Bardossy and Duckstein (1995) to aggregate fuzzy rules. This method has the advantage of assigning more weight to rules with crisper answers than fuzzy answers (e.g., less weight on less certain answers). It is calculated as:

$$ \mu_{B} (x) = \frac{{\sum\nolimits_{i = 1}^{I} {\nu_{i} \tau_{i} \mu_{{B_{i} (x)}} } }}{{\max_{u} \sum\nolimits_{i = 1}^{I} {\nu_{i} \tau_{i} \mu_{{B_{i} (u)}} } }} $$
(16)

where

$$ \frac{1}{{\tau_{i} }} = \int\limits_{ - \infty }^{ + \infty } {\mu_{{B_{i} }} } (x){\text{d}}x. $$
(17)

The division by the maximum of the summation is to ensure the resulting membership function is not >1. Moreover, if the centroid is used as the defuzzification method, the fuzzy mean can be simply calculated as:

$$ M(B) = \frac{{\sum\nolimits_{i = 1}^{I} {\nu_{i} M(B_{i} )} }}{{\sum\nolimits_{i = 1}^{I} {\nu_{i} } }}. $$
(18)

Illustrative example

In the following example, we establish rules and predict P leak for systems with a normalized system size L s between 50 and 200, and a normalized largest fault size l max s between 50 and 200. The exponent a in Eq. 1 is only considered for values between 1.1 and 3. The first step is to use Eqs. 69 to calculate the critical parameter α sc and to compare it to the actual value α s . Only systems around the percolation threshold (including a little less) or above are considered to be possible to have connected pathways.

Next, we determine how many fault networks with different sets of L s , l max s , a, and α s /α sc are needed to construct robust fuzzy rules. We sample these parameters uniformly over the admissible range, making sure that no excessive extrapolations are needed when subsequently generating fuzzy rules. The admissible range for each of the parameters is listed in Table 1. The total number of parameter sets evaluated to generate fault networks is about 1,800. For each parameter combination, 100 discrete fault networks are generated, and for each realization, different plume sizes are considered to calculate P inter. Then the leakage probability P leak is obtained by averaging P inter (for the same plume size) from the 100 realizations.

Table 1 Parameters used in fault network generation

We use a fracture network generation code modified based on the one that was used by Liu et al. (2002) and Zhang et al. (2004). Figure 3 shows the power-law fault-length distribution (solid line) that is used to generate fault networks, and the actual generated fault-length distribution (symbols) for a system with a = 1.5, L s  = l max s  = 200, and r = 1. As expected, the double-log plot shows a good fit for small faults with high frequency. For big faults with low frequency, since the number of faults can only be an integer, which is discrete, the samples are spread out around the analytical length distribution. The fault length distribution generated in the network is considered to coincide well with the specified power-law distribution. Figure 4 is an example of a discrete fault network for a system with L s  = 100, l max s  = 200, a = 1.1, and r = 1.1. Figure 4a shows all generated faults. Starting with the faults intersecting the top of the system, we gradually find and plot only the faults that are connected (i.e., the unconnected faults are removed); the resulting network of connected faults is shown in Fig. 4b. The removal of unconnected faults is started at the top, so that the calculation of effective connections, which are identified at the bottom of the system, where CO2 resides, remains zero until connectivity across the entire system is achieved. Based on this plot, we can easily find P leak using the moving average method. Note that while there are three conduits connected to the CO2 storage formation at the bottom of the system, two of the three connections are very close to each other. Such a cluster does not increase the leakage probability significantly compared to a single connection at that location. The simple moving average method described in “Conduit network generation to determine P leak ” properly accounts for this effect. For a normalized plume size M s  = 10, and a moving unit of M s /10 = 1, we obtain P leak = 0.21, which is smaller than the probability one might expect based on the total number of connections alone, but slightly larger than if there were only two, clearly separated faults.

Fig. 3
figure 3

Power law fault length distribution (solid line) and sample length distribution (symbols) for a system with a = 1.5, L s  = l max s  = 200, and α s /α sc

Fig. 4
figure 4

A discrete fault network for a system with a = 1.1, L s  = 100, l max s  = 200, α s /α sc  = 1.1 (a) with all fractures and (b) with unconnected faults sequentially removed from the top to the bottom

Figure 5 shows a fault network with a = 3.0 and r = 1.25 (L s and l max s are the same as those used to generate Fig. 4). With increasing a, a larger portion of the total fault population consists of small faults. Consequently, the total number of faults and the critical value α cs also increases. In this example, despite the very densely distributed small faults in the domain, the system is not connected (see Fig. 5b). Figure 6 shows a network generated using exactly the same system parameters as the network shown in Fig. 5, but for this realization the system is connected. Figures 4, 5 and 6 demonstrate that with increasing a, the portion of small faults in the entire distribution increases significantly, gradually making overall connectivity (should it exist) dominated by small rather than large faults.

Fig. 5
figure 5

A discrete fault network for a system with a = 3.0, L s  = 100, l max s  = 200, α s /α sc  = 1.25 (a) with all fractures and (b) with unconnected faults sequentially removed from the top to the bottom

Fig. 6
figure 6

A discrete fault network for a system with a = 3.0, L s  = 100, l max s  = 200, α s /α sc  = 1.25 (a) with all fractures and (b) with unconnected faults sequentially removed from the top to the bottom

In Fig. 7, we plot P leak and its 95% confidence interval as a function of CO2 plume size for a system with a = 1.5, l max s  = 100, L s  = 100, and different ratios r = α s /α sc . The value of P leak at M s  = L s (plume size equals system size) is equal to the probability that the system is connected. If the theoretical percolation threshold were accurate, P leak is expected to be around 0.5 for r = 1. As it turns out, this is not always the case, as demonstrated in Fig. 7 where P leak is only about 0.25. There are two reasons for this. First, the theoretically derived percolation threshold of 5.6 used in the calculation needs to be corrected to account for differences between the systems studied here and that used to develop the theory. Specifically, there are differences in (a) the exponent a, (b) the system size, leading to a finite-size effect, and (c) the cut-off fault size for the largest fault. Secondly, we generated the faults by randomly locating the centers of the fault within the system domain. Because no faults are generated if their center points are outside the model domain, the fault density near the domain edge is somewhat reduced. Therefore, the estimated P leak (r = 1, M s  = L s ) tends to be smaller than 0.5. The second reason is an artifact of the fault network generation, which could be eliminated by generating additional faults with the center points outside the model domain. Such a revision, however, is only justified if geological evidence suggests that fault density indeed does not show an edge effect near lithological or tectonic boundaries.

Fig. 7
figure 7

Pleak as a function of CO2 plume size for a system with a = 1.5, lmax s  = 100, L s  = 100, and different values r = α s /α sc . The error bars indicate 95% confidence intervals

The original values we considered for r are in the range between 1 and 2.5. However, theoretically P leak (r = 1) at percolation threshold should be about 0.5, which is not small enough for us to ignore the likelihood of CO2 leakage. This is also demonstrated by simulation results (e.g., Fig. 7). We gradually add new simulations with r smaller than 1 to the set of fault networks. These additional simulation results indicate that if r = 0.75, P leak at (r = 0.75, M s  = L s ) is smaller than 0.05. If this is considered to be acceptably small, we change the lower bound for generating fault network, also for generating fuzzy rules to r = 0.75. If r = 2.5, we consider P leak to be high enough for us to conclude the likelihood of CO2 leakage to compartments is significant and close to certain. Thus, this value is used as the upper bound for fuzzy rule generation.

As we mentioned in the previous section, Bour and Davy (1997) showed in their Fig. 9 that p c varies between 5 and 7 for systems that have a size between 10 and 100, with exponent a between 1.8 and 3.2. This means that the needed correction for α sc could be 25% or even higher. To confirm this assumption, we show results for r = 1.25 in Fig. 7, the expected connectivity is approximately 0.4, with the 95% confidence interval between 0.3 and 0.5. This supports the validity of the assumption.

The 95% interval appears to be bigger for r between 1 and 2, because for fault networks near the percolation threshold, the chance that a system is or is not connected depends on the random presence or absence of a few critical faults and is thus highly random.

Table 2 is a list of α sc value calculated using Eqs. 69, as a function of l max s for both a = 1.5 and a = 3.0, L s  = 100. When l max s increases from 50 to 100, α sc is reduced by about 65% for a = 1.5, because large faults dominate the connectivity; on the other hand, for a = 3.0, small faults dominate the connectivity, and therefore the reduction of α sc is much less, only 15%.

Table 2 α sc values for different a and l max s

For each parameter set, 100 realizations with different seed numbers are created to consider the uncertainty in fault network generation. To construct fuzzy rules, the averaged results from these 100 realizations are used. Now we have a database with a total of S (~21,000) sets: \( \{ (x_{1} (s),x_{2} (s),x_{3} (s),x_{4} (s),x_{5} (s),b(s));\quad s = 1, \ldots S\} , \) with x 1(s) representing exponent a, x 2(s) representing normalized system size, x 3(s) representing normalized largest fault size, x 4(s) representing ratio r = α s /α sc , x 5(s) representing m = M s /L s , and b(s) representing P leak. The first step in generating fuzzy rules is to define rule structure. For each argument, we define its fuzzy numbers as listed in Table 3.

Table 3 Fuzzy numbers used in the premises

With the structure listed in Table 3, there will be a total of 1,620 rules (a combination of 5 arguments: 5 × 3 × 3 × 6 × 6). For each parameter set, the DOF of each rule is calculated. The ones that have a DOF value larger than ε = 0.2 are kept for calculating the consequences using Eqs. 1315.

To demonstrate how to predict P leak for a given system with uncertainty, we use a simple example that only considers one premise. We consider a system with the exponent a = 1.5, both normalized system size and largest fault are 100, and the normalized plume size is 40 (e.g., 40% of the system size). The ratio r = α s /α sc is considered to be uncertainty, estimated to be greater than but close to 1, somewhere between 1 and 1.25.

Corresponding rules (with the same a, L s , L max s , and M s ) are used to predict P leak. If we assume r can be represented by two fuzzy numbers as shown in Fig. 8, with a membership function of 0.5 for both, the two rules will have a DOF of 0.5, while all the other rules will have a DOF of zero. These two rules are applied to estimate P leak:

  • Rule 1:

    $$ \begin{aligned} {\text{IF }}a = (1.1,1.5,2.0){\text{ AND }}L_{s} = & (50,100,200){\text{ AND }}l_{{ \max\,s}} = (50,100,200) \\ {\text{AND }}r = & (0.75,1.0,1.25){\text{ AND }}r_{\text{p}} = (0.2,0.4,0.6) \\ {\text{THEN}} P_{\text{leak}} = (0.01,0.12,0.18) & \\ \end{aligned} $$
  • Rule 2:

    $$ \begin{gathered} {\text{IF }}a = (1.1,1.5,2.0){\text{ AND }}L_{\text{s}} = (50,100,200){\text{ AND }}l_{{\max\,s}} = (50,100,200) \hfill \\ {\text{ AND }}r = (1.0,1.25,1.5){\text{ AND }}r_{\text{p}} = (0.2,0.4,0.6) \hfill \\ {\text{THEN}} P_{\text{leak}} = (0.1,0.3,0.55) \hfill \\ \end{gathered} $$
Fig. 8
figure 8

The membership functions for the two fuzzy numbers that are used to represent “r is greater but close to 1”

The individual responses from the two rules are shown as thin black lines in Fig. 9. The area under the membership function curve is a measure of how uncertain the estimated P leak is: the larger the area, the more uncertain it is. In this case, the area for rule 1 is 0.085 and that for rule 2 is 0.27. Rule 1 has a crisper result than Rule 2; therefore, it has a larger weight (the weight is inverse to the area) in the final combined results. The final Pleak and its membership function are shown as thick red lines. The defuzzified P leak value (using the centroid method) is 0.21.

Fig. 9
figure 9

Membership functions of P leak (thick red line) for a system (a = 1.5, L s  = l max s  = 100, M s  = 40) with “r greater but close to 1”. The two individual rule consequences are shown as thin black lines

We apply the fuzzy rules to the same system as shown in Fig. 7. The predicted P leak using fuzzy rules are also fuzzy numbers. After defuzzification, we plot them in Fig. 10. Although the results are consistent with results from the Monte Carlo simulation, they are not identical because by using fuzzy-rule based modeling, we have considered the uncertainty in the input parameters, whereas the multiple realizations for the same parameter set only considered the randomness of equally probable fault networks. When we talk about r being approximately 1.5, we have implicitly include the possibilities that r could be larger or smaller than 1.5. As a result, the estimated P leak contained uncertainty in r. Since the combinations of the rules are weighted by both the DOF of a rule and the area of the membership function of the rule outcome, the defuzzified P leak could be smaller or larger than the Monte Carlo simulation, which did not consider the input parameter uncertainty. However, there are two exceptions in our prediction. When r = 0.75, the predicted P leak values using fuzzy rules are always larger than the Monte Carlo simulation results, and when r = 2.5, the predicted P leak values are always smaller than the Monte Carlo simulation results (see Figs. 7, 10). This is because we use a triangular fuzzy number (0.75, 0.75, 1.0) for the statement “r is likely to be 0.75 or slightly above.” The uncertainty in this number means a possibility that r is higher than 0.75, but not lower. This effect makes P leak at r ≈ 0.75 always higher than P leak at r = 0.75. Similarly, because we define a triangular membership function (2.0, 2.5, 2.5) for the statement “r is close to 2.5 or slightly smaller”, the uncertainty in r means a possibility that r is smaller than 2.5, but not larger, and the P leak values predicted using fuzzy rules are always smaller.

Fig. 10
figure 10

Fuzzy-rule based prediction of P leak as a function of r p (CO2 plume size divided by system size) for a system with a = 1.5, l max s  = 100, L s  = 100, and different values of r = α s /α sc

Conclusions and practical considerations

In this paper we presented a method to estimate a limiting factor controlling the probability of CO2 leakage through a fault or fracture system, namely the probability (P leak) of the plume intersecting a connected network of faults or fractures that also intersects a vulnerable resource. The proposed method includes (1) the estimation of the connectivity of the fault system using percolation theory; (2) the estimation (for a limited number of systems) of the probability that CO2 plumes with different sizes encounter a connected system of conduits; (3) the construction of fuzzy rules for calculating P leak considering uncertainty in the input parameters; and (4) predictive estimation of P leak for a given system. The method was designed to fit in the CF, where the risk associated with CO2 leakage is the product of the leakage likelihood and the impact of that leakage event. The rules that are generated from this study will be stored within the CF model. When a site needs to be evaluated as a potential geological CO2 sequestration site, step (4) needs to be performed, and a leakage likelihood will be passed on for the final calculation of risk.

The study was done for a two-dimensional system. Representing an inherently three-dimensional system with a two-dimensional model yields conservative leakage probability estimates, because the third dimension is implicitly assumed to be connected. Nevertheless, the concept and methods described in this study can also be applied to more realistic three-dimensional fault-systems. Analyzing fault systems in three dimensions requires modifying (1) the percolation threshold value, (2) the expression of the percolation threshold (Bour and Davy 1998), (3) generating three-dimensional fault networks, (4) evaluating leakage probability using realistic plume configurations, and (5) recreating fuzzy rules.

The assumptions for the approach include a square system and randomly oriented conduits. However, for a real site, the system is unlikely to be square, and faults have preferential orientations. Both situations lead to a preferential connection in one direction (the short direction), and less likely connectivity in the other direction. This effect is referred to as anisotropy, which can be accounted for following the method proposed by Masihi et al. (2007).

The main computational effort resides in the numerical generation of the fault networks and finding the connected pathway(s). In our case, we generated about 1,800 × 100 (for each system parameter there are 100 realizations) fault networks. The varying of plume size is not included in this number since once the connected pathway is found, there is not much computational effort involved in calculating the P inter for various plume sizes. However, fault network generation only needs to be done once to provide the basis for constructing the fuzzy rules; predictive simulations are then performed very efficiently using these fuzzy rules. After we include the plume size in the input parameter set and average results of the 100 realizations, we have about 21,000 datasets to generate rules. If needed, additional networks can be added to the database to extend the input parameter ranges. By using fuzzy-rule based modeling, we can predict P leak for systems that have characteristics different from the ones we have in the database (obtained from fault network generation), as well as the uncertainty of P leak, by propagating the uncertainty in the input parameters. The randomness in the fault network generation is considered by generating multiple realizations for the same system.

Brine leakage through a fault or fracture system from the reservoir may also lead to environmental impact. Although we focus the application of the proposed method to evaluate the probability of CO2 leakage, the approach can also be applied to estimate brine leakage probability. An effective brine plume size, analogous to the CO2 plume size, could be defined as the size of the pressure perturbation above some cut-off value. Using the proposed method, a relationship can be established between the leakage probability and the region of pressure perturbation.

If GCS becomes a viable mitigation option to address CO2 emissions, a large number of sites with different amounts of characterization data and degrees of uncertainty will need to be evaluated. The proposed method provides a tool for a preliminary evaluation of leakage likelihood through fault systems. In the future, this simplified model will be refined to better represent the dimensionality and fault distributions at actual sites under evaluation.