# Stochastic Reconstruction of an Oolitic Limestone by Generative Adversarial Networks

- 156 Downloads

## Abstract

Stochastic image reconstruction is a key part of modern digital rock physics and material analysis that aims to create representative samples of microstructures for upsampling, upscaling and uncertainty quantification. We present new results of a method of three-dimensional stochastic image reconstruction based on generative adversarial neural networks (GANs). GANs are a family of unsupervised learning methods that require no a priori inference of the probability distribution associated with the training data. Thanks to the use of two convolutional neural networks, the discriminator and the generator, in the training phase, and only the generator in the simulation phase, GANs allow the sampling of large and realistic volumetric images. We apply a GAN-based workflow of training and image generation to an oolitic Ketton limestone micro-CT unsegmented gray-level dataset. Minkowski functionals calculated as a function of the segmentation threshold are compared between simulated and acquired images. Flow simulations are run on the segmented images, and effective permeability and velocity distributions of simulated flow are also compared. Results show that GANs allow a fast and accurate reconstruction of the evaluated image dataset. We discuss the performance of GANs in relation to other simulation techniques and stress the benefits resulting from the use of convolutional neural networks . We address a number of challenges involved in GANs, in particular the representation of the probability distribution associated with the training data.

## Keywords

Stochastic modeling Neural networks Porous media Computer tomography## 1 Introduction

The microstructural characteristics of porous media play an important role in the understanding of numerous scientific and engineering applications such as the recovery of hydrocarbons from subsurface reservoirs (Blunt et al. 2013), sequestration of \(\text {CO}_2\) (Singh et al. 2017) or the design of new batteries (Siddique et al. 2012). Modern microcomputer tomographic (micro-CT) methods have enabled the acquisition of high-resolution three-dimensional images at the scale of individual pores. Increased resolution comes at the cost of longer image acquisition time and limited sample size. Individual samples allow numerical and experimental assessment of the effective properties of the porous media, but give no insight into the variance of key microstructural properties. Therefore, an efficient method to generate representative volumetric models of porous media that allow the assessment of the effective properties is required. The generated images serve as an input to a digital rock physics workflow to represent the computational domain for numerical estimation of key physical properties (Berg et al. 2017).

Statistical methods aim at reconstructing porous media based on spatial statistical properties such as two-point pore–grain correlation functions. Quiblier (1984) has presented an extensive overview of the early literature of porous media reconstruction and provided an extension of the method of Joshi (1974) by reconstructing three-dimensional porous media based on the empirical covariance function and probability density function obtained from two-dimensional thin sections. Other statistical methods such as simulated annealing (Yeong and Torquato 1998; Jiao et al. 2008) allow high-quality three-dimensional reconstruction and incorporation of numerous statistical descriptors of porous media. Pant (2016) introduced a multi-scale simulated annealing algorithm allowing simulation of three-dimensional porous media at much lower computational cost than previous methods.

Methods to incorporate higher-order multi-point statistical (MPS) properties of porous media have been developed. These MPS functions are implicitly defined by two- or three-dimensional training images. Simulation algorithms based on multi-point statistics are therefore considered as training image-based algorithms. MPS simulation was originally developed in the context of generating realistic geological structures (Guardiano and Srivastava 1993; Caers 2001; Mariethoz and Caers 2014). With the advent of micron-resolution X-ray tomography (micro-CT imaging) (Flannery et al. 1987), which provides training images, MPS simulation techniques have been successfully applied to the stochastic reconstruction of three-dimensional porous media (Okabe and Blunt 2004, 2005, 2007).

Tahmasebi et al. (2012) and Tahmasebi and Sahimi (2012, 2013) have introduced a patch-based approach where sub-domains are simulated along a pre-defined path and populated based on a cross-correlation distance criterion (CCSIM). This approach is similar to the image quilting algorithm by Efros and Freeman (2001) and Mariethoz and Caers (2014) but corrects mismatching patches in overlapping or neighboring domains. Tahmasebi et al. (2017) present a method for fast reconstruction of granular porous media from a single two- or three-dimensional training image using a method closely related to CCSIM. They obtain significant speedup in computational time by incorporating a fast Fourier transform and a multi-scale approach. A graph-based approach is used to resolve non-physical regions at the boundaries of simulated patches of grains.

Object-based methods describe the material domain by locating geometrical bodies of random size at locations provided by a spatial point process. The so-called Boolean model is a particular case where the randomly placed bodies, typically spheres, are allowed to overlap (Matheron 1975; Chiu et al. 2013). Object-based methods may also allow interaction of particles to be incorporated. They have successfully been used to describe complex and heterogeneous materials (Torquato 2013).

Process models reconstruct the pore and grain structure of materials by mimicking how they were formed. Øren and Bakke (2003) have created reconstructions of sandstones by reproducing the natural processes of sedimentation, compaction and diagenesis.

This contribution presents a training image-based method of image reconstruction using a class of deep generative methods called generative adversarial networks (GANs) first introduced by Goodfellow et al. (2014). Recently, Mosser et al. (2017) have shown that GANs allow the reconstruction of three-dimensional porous media based on segmented volumetric images. Their study applied GANs to three segmented images of rock samples. They showed that GANs represent a computationally efficient method for the fast generation of large volumetric images that capture the statistical and morphological features, as well as the effective permeability.

We expand on the work of Mosser et al. (2017) and investigate the ability of generative adversarial networks to create stochastic reconstructions of an unsegmented micro-CT scan of a larger oolitic Ketton limestone sample. We evaluate the four Minkowski functionals for the three-dimensional datasets as a function of the gray-level threshold. In addition to the numerical evaluation of permeability as shown by Mosser et al. (2017), we compare velocity distributions of the original porous medium and samples obtained from the GAN. We also provide details of the convolution approach used by GANs. Furthermore we evaluate the reconstruction process within the trained generative function and highlight the parametric and differentiable nature of the obtained generative function. We evaluate the computational cost of GAN-based image simulation with reported values of computational run time for a variety of other reconstruction methods of equal reconstruction quality. We also investigate how the image representation evolves along the different layers of the GAN network, and discuss the benefits that can be derived from the differentiable nature of the parameterization used by GANs.

## 2 Generative Adversarial Networks

Generative adversarial networks are a deep learning method for generating samples from arbitrary probability distributions (Goodfellow et al. 2014; Goodfellow 2017). GANs do not impose any a priori model on the probability density function and are therefore also referred to as an implicit method. Without the need to specify an explicit model, GANs provide efficient sampling methods for high-dimensional and intractable density functions.

In the case of CT images of porous media, we can define an image *x* to be a sample of a real, unknown probability density function (pdf) of images \(p_\mathrm{data}\) of which we have acquired a number of samples which serve as training images. In our example, the training set is comprised of 5832 sub-domains (\(64^3\) voxel) of the original micro-CT image. Sub-domains are extracted without any overlap, and each training image represents the originally acquired dataset.

*D*that tries to distinguish between samples from the training set and synthetic images created by the generator. The generator

*G*is defined by its parameters \(\mathbf {\theta }\) and performs a mapping from a random prior \(\mathbf {z}\) to the image domain:

*d*is the dimensionality of the random prior.

*x*being a sample of the true data distribution \(p_\mathrm{data}\):

In a second step, we train the generator to maximize its ability to “fool” the discriminator into misclassifying the images provided by the generator as real images. This is performed by computing the binary cross-entropy of the output of the discriminator on a mini-batch sampled from the generator \(G_{\mathbf {\theta }}(\mathbf {z})\) and requiring that the created labels be close to one, thereby computing \(H(\mathbf {1}, \mathbf {y}')\). The parameters of the generator are then modified to optimize \(H(\mathbf {1}, \mathbf {y}')\) by applying stochastic gradient descent while keeping the parameters of the discriminator constant.

Training of these networks is often challenging due to the competing objective functions of the generator and discriminator. Recently, new objective functions and training heuristics have greatly improved the training process of GANs (Arjovsky et al. 2017; Berthelot et al. 2017).

GANs follow a different training scheme from other stochastic reconstruction methods (Sect. 1). There are two phases in GAN-based reconstruction: training and generation. Training is expensive, requiring modern graphics processing units (GPU) and for three-dimensional datasets large GPU memory. Parallelization of the training process across numerous GPUs reduces time for training the network. Nevertheless, finding a set of hyper-parameters, that is, a network architecture (number of filters, types, order of layers and activation functions) that leads to the desired quality can require significant trial and error.

## 3 Dataset

The sample used in this study is an oolitic limestone of Jurassic age (169–176 million years). The spherical to ellipsoidal grains consist of 99.1% calcite and 0.9% quartz (Menke et al. 2017). Inter- and intra-granular porosity can be observed, as well as significant amounts of unresolved sub-resolution microporosity. This is characterized by the various shades of gray in individual grains, where the interaction of sub-resolution porosity with X-rays penetrating the sample during imaging leads to an increase in intermediate gray-level values (Fig. 1). The sample was imaged using a Zeiss XRM 510 with a voxel size of 27.8 \(\upmu \)m. The size of the image domain after resampling to 8 bit resolution is \(900^3\) voxels. We subdivide the original image into a training set of non-overlapping 5832 images at a size of \(64^3\) voxels. We define a sequential randomized pass over the full training set as an epoch. Evaluation of the effective properties is performed at larger image sizes than the training images to judge whether the GAN is able to generalize to larger domains. To evaluate the reconstruction quality of the GAN model, we randomly extract 64 images at a size of \(200^3\) voxels with no overlap from the original training image (Fig. 1) which we refer to as the validation set. A synthetic validation set was created by sampling 64 images at a size of \(200^3\) voxels from the trained GAN model. To perform numerical computation of the effective permeability as well as measure the two-point correlation function, all images of the synthetic and original Ketton validation set were segmented using Otsu thresholding (Otsu 1975). Minkowski functionals were evaluated for the unsegmented validation sets.

### 3.1 Neural Network Architecture and Training

*m*,

*n*and

*o*are of size one, which results in an image of \(64^3\) voxels. For image generation,

*m*,

*n*and

*o*may be of any integer size. The main difference to the SGAN architecture of Jetchev et al. (2016) is therefore that at training time the input random vector has a spatial dimension of one and the output of the discriminator is a single scalar value.

The generator consists of a series of three-dimensional transposed convolutions. In each layer, the number of weight kernels is reduced by a factor of \(\frac{1}{2}\). Before the final transposed convolution, we add an additional convolutional layer (Fig. 3). Each layer in the network except the last is followed by a batch normalization (Ioffe and Szegedy 2015) and a leaky rectified linear unit (LeakyReLU) activation function. The final transposed convolution in the generator is followed by a hyperbolic tangent activation function (Tanh) (LeCun et al. 1998). A representation of each activation function used in the network is shown in Fig. 4.

We represent the discriminator as a convolutional classification network with binary output using as input the real samples of the \(64^3\) voxel training set (Label 1) and synthetic realizations of equal size created by the generator (Label 0). Each layer in the network consists of a three-dimensional convolution operation followed by batch normalization and a LeakyReLU activation function. The final convolutional layer outputs a single value between 0 and 1 (Sigmoid activation) which corresponds to the probability that the input image belongs to the original training set or in other words that it is a real image.

We distinguish two sets of parameters for training: The set of weights of a network comprises the adjustable parameters of the filter kernels for convolutional and neurons for linear network layers. The so-called hyper-parameters define the network architecture and training scheme, e.g., the number of filters per layer, the number of convolutional layers or learning rates. A chosen set of hyper-parameters defines different networks with their own weights (parameters) which are adapted using a mini-batch gradient descent method at training time.

In total, 8 models have been trained on the Ketton image dataset. The main hyper-parameters that were varied for each model are the number of filters in the generator and discriminator, \(N_\mathrm{GF}\) and \(N_\mathrm{DF}\), respectively, as well as the number of convolutional layers before the final transposed convolution in the generator. The dimensionality of the latent random vector \(\mathbf {z}\) was kept constant at a size of \(512\times 1 \times 1 \times 1\). Learning was performed by stochastic gradient descent using the ADAM optimizer with momentum constants \(\beta _1=0.5\), \(\beta _2=0.999\) and a constant learning rate of \(2 \times 10^{-4}\). Network training was performed on eight NVIDIA K40 GPUs using a mini-batch size of 64 images and the total run time of each training run is 8 h.

To train the pair of networks \(G_{\mathbf {\theta }}(\mathbf {z})\) and \(D_{\mathbf {\omega }}(\mathbf {x})\), we make use of two heuristic stabilization methods. First, Gaussian noise \((\mu =0, \sigma =0.1)\) is added to the input of the discriminator which is annealed linearly over the first 300 epochs of training. A theoretical analysis of why adding Gaussian noise helps to stabilize GAN training was performed by Kaae Sønderby et al. (2016). In addition, we make use of a second stabilization method called label switching. Label switching represents a heuristic stabilization method with the aim of weakening the discriminator during the early stages of training. This heuristic stabilization method is performed by training the discriminator every N steps for one step with switched labels of the input real and generator simulated images; a real image is expected to be labeled as false and generated images as real. This corresponds to switching the expected labels of the input image mini-batches in Eq. (5).

Architecture of the generator and discriminator networks

Layer | Type | Filters | Kernel | Stride | Padding | Batch normalization | Activation |
---|---|---|---|---|---|---|---|

Generator | |||||||

1 | ConvTransp3D | 512 | \(4 \times 4 \times 4\) | 1 | 0 | Yes | LeakyReLU |

2 | ConvTransp3D | 256 | \(4 \times 4 \times 4\) | 2 | 1 | Yes | LeakyReLU |

3 | ConvTransp3D | 128 | \(4 \times 4 \times 4\) | 2 | 1 | Yes | LeakyReLU |

4 | ConvTransp3D | 64 | \(4 \times 4 \times 4\) | 2 | 1 | Yes | LeakyReLU |

5 | Conv3D | 64 | \(3 \times 3 \times 3\) | 1 | 1 | Yes | LeakyReLU |

6 | ConvTransp3D | 1 | \(4 \times 4 \times 4\) | 2 | 1 | No | Tanh |

Discriminator | |||||||

1 | Conv3D | 64 | \(4 \times 4 \times 4\) | 2 | 1 | No | LeakyReLU |

2 | Conv3D | 128 | \(4 \times 4 \times 4\) | 2 | 1 | Yes | LeakyReLU |

3 | Conv3D | 256 | \(4 \times 4 \times 4\) | 2 | 1 | Yes | LeakyReLU |

4 | Conv3D | 512 | \(4 \times 4 \times 4\) | 2 | 1 | Yes | LeakyReLU |

5 | Conv3D | 1 | \(4 \times 4 \times 4\) | 1 | 0 | No | Sigmoid |

### 3.2 Two-Point Probability Functions

Good agreement between the real and synthetic microstructures can be observed for the radial averaged two-point probability function (Fig. 7). For both the radial averaged and directional estimates of \(S_2(\mathbf {r})\), a tight clustering around the mean can be observed, whereas the real porous medium shows a larger degree of variation around the mean.

### 3.3 Minkowski Functionals

To evaluate the ability of the trained GAN model to capture the morphological properties of the studied Ketton limestone, we compute four integral geometric properties that are closely related to the set of Minkowski functionals as a function of the image gray value.

*n*-dimensional body we can define \(n+1\) Minkowski functionals to characterize morphological descriptor of the grain–pore body structures (Mecke 2000). The Minkowski functional of zeroth order is equivalent to the porosity of a porous medium and defined as:

*V*to the bulk volume of the porous medium.

Our analysis of the GAN-based models shows excellent agreement for the porosity \(\phi (\rho )\), specific surface area \(S_V(\rho )\) and integral of mean curvature \(\kappa _V(\rho )\) as a function of the threshold value \(\rho \). For these three properties, a low error is introduced when applying global thresholding. The fourth property, the specific Euler characteristic, \(\chi _V(\rho )\), shows an error of \(20\%\) in the range of global thresholding values with good agreement outside this range. This implies that care must be taken when segmenting an image—real or generated—to preserve the connectivity of the pore space. As for the covariances, we also observe that the scatter produced by the GAN simulations is less than the scatter of the training set.

### 3.4 Permeability and Velocity Distributions

To validate GAN-based model generation for uncertainty evaluation and numerical computations, it is key that the generated samples capture the relevant physical properties of the porous media that the model was trained on. The permeability and, moreover, the local velocity distributions represent the key properties of the porous medium (Menke et al. 2017).

Our results (Figs. 9, 10) show that the GAN model generates stochastic reconstructions that capture the average permeability of the original training image at a scale of \(200^3\) voxels, with the majority of samples closely centered around the average effective permeability of the Ketton subsets.

The velocity distributions of the numerical simulations performed on the Ketton validation dataset and generated realizations were normalized by the average cell-centered velocity following the approach of Alhashmi et al. (2016) and a histogram with 256 logarithmically spaced bins in a range from \(10^{-4}\) to \(10^2\) for each simulation was obtained.

Figure 11 shows the per-bin arithmetic average of the bin frequencies and a bounding region of one standard deviation \(\mu \pm \sigma \) as the shaded area. Due to the high range of velocities spanning six orders of magnitude, the x-axis is represented in logarithmic scaling.

*n*and

*m*are the sample sizes, respectively, and \(c(\alpha )=\sqrt{-\frac{1}{2}\ln (\frac{\alpha }{2})}\). All tests were performed at a significance level of \(\alpha =0.05\) for the per-bin average velocity distributions presented in Fig. 11 (dashed curves).

Results of the two-sample Kolmogorov–Smirnov test for equality of velocity distributions computed on the image dataset and generated realizations

Direction | \(D_{n,m}\) | \(D_{0.05}\) |
---|---|---|

x | 0.09 | 0.12 |

y | 0.09 | |

z | 0.07 |

For all three directions, the null hypothesis can be accepted at the 5% significance level based on the \(D_{0.05}\) statistic, giving evidence to the visual similarity between the velocity distributions of the real Ketton images and their synthetic counterparts (Table 2).

## 4 Discussion

We have presented the results of training a generative adversarial network on a micro-CT image of the oolitic Ketton limestone. The image morphological properties were evaluated as a function of the image threshold level and it was shown that the generated images capture the textural features of the original training image. Two-point statistics and effective properties computed on segmented representations of the individual sub-domains have also shown excellent agreement between the realizations generated by the GAN model and subsets of the Ketton image. Nevertheless there remain a number of open questions that need to be addressed.

The predicted statistical and morphological properties have shown a tight bound around the average behavior of the training image. This indicates that there is less variation in the generated samples than in the training samples. This behavior can have a number of origins.

The training images can be regarded as samples of the unknown multivariate pdf \(p_\mathrm{real}(\mathbf {x})\), which is likely to be multimodal. The original formulation of the GAN objective function (Goodfellow et al. 2014) has been shown to lead to unimodal pdfs, even if the training set pdf itself is multimodal (Goodfellow 2017). The behavior of a generator to represent multimodal pdfs by a pdf with fewer modes is called mode collapse (Goodfellow 2017). This behavior may occur due to the fact that there is no incentive for diversity in GAN training.

Visually the images generated by the presented GAN model are nearly indistinguishable from their real counterparts (Fig. 5). Minkowski functionals and statistical parameters allow us to perform an evaluation of the reconstruction quality. Nevertheless, this does not rule out the fact that the generator may be memorizing the training set, show mode collapse behavior or output a low diversity of synthetic samples. A generator showing one or more of these behaviors will falsely indicate low errors in the Minkowski functionals, statistical and effective properties.

The smooth transition between the starting image \(G_{\mathbf {\theta }}(\mathbf {z}_\mathrm{start})\) and the endpoint \(G_{\mathbf {\theta }}(\mathbf {z}_\mathrm{end})\) shown in Fig. 12 indicates that the generator has not memorized the training set and has instead learned a lower-dimensional representation \(\mathbf {z}\) that results in meaningful features of the pore–grain microstructure. Definition of GAN training objectives compatible with high-diversity samples showing no mode collapse and stable training remains an open problem. Che et al. (2016) have presented a summary of recent advances to counteract mode collapse and have proposed a regularization method to improve GAN output variety. Reformulations of the GAN training criterion (Eq. 4) based on the Wasserstein distance (WGAN-GP) (Gulrajani et al. 2017) and other training approaches to GANs such as EBGAN (Zhao et al. 2016) or DRAGAN (Kodali et al. 2017) show the ability to model multimodal densities and allow stable training.

While the input and output to the GAN generator and discriminator are well defined, the interior mechanics of the neural network that result in high-quality reconstructions are not well understood. Rather than treating GANs as a black-box mechanism, it is of interest to evaluate the behavior of the generator and discriminator in more detail. In Fig. 13, we have extracted the generator’s output after each layer’s activation function (following the convolution operation and batch normalization).

Based on the consecutive upsampling of the noise prior \(\mathbf {z}\) by each transposed convolution in the generator, we observe a multi-scale feature representation of the final image. Early layers, where the spatial dimensions of the images are small, can be related to global features in the generator output. The final layers create highly detailed representations of the structural features of the reconstructed images. This view of the generator’s behavior also helps identify deficiencies in the network’s architecture. In layers 3 and 4, we see repeated noise that appears to be following a grid like structure. This is due to the transposed convolutional operation and in parts is diminished by the additional convolution operation prior to the last upsampling operation. This could be alleviated by the use of other convolution-based upsampling layers such as the sub-pixel convolution operation (Shi et al. 2016) or interpolation upsampling (nearest neighbor, bilinear, trilinear).

The discriminator’s role is simply to label images as real or “fake,” but it also is a critical component in the ability of the generator to learn features in the original image space. The discriminator, in order to distinguish GAN-generated from real training images, needs to learn a unique set of features that distinguish real samples from fake ones. As such, for future work, it may be of interest to use a GAN trained discriminator for classification or feature extraction (Arora and Zhang 2017).

Nevertheless, we can perform a similar operation as for the generator and inspect some of the features learned by the discriminator. Figure 14 shows a set of 5 learned filters applied to an image of the Ketton training set. At shallow layers, we find that the discriminator has learned to identify the pore space (layer 1, second row) as well as a number of edges. Deeper layers in the network represent more abstract features, and after layer 2, no original feature of the pore space is distinguishable.

The computational effort to perform image reconstruction using GANs can be split into two parts: training time and generation time. The training time is the total time required to find a set of parameters of the generator that allows generation at sufficient image quality. We define generation time as the total time required to initialize a neural network and the associated parameters obtained during the training phase and the generation of the images by passing a latent random vector \(\mathbf {z}\) through the generator to obtain an image \(\mathbf {x}\sim G_{\theta }(\mathbf {z})\). To create one realization from a GAN, it is necessary to train the generator–discriminator pairing only once; therefore, training time is a fixed computational cost. Once trained, the generator can simply be reused for each new realization.

Comparison of reported computational run times of recent reconstruction methods

Authors | Method | Size \([\text {voxels}^3]\) | Run time (\(\times \,1\)) (h) | Run time (\(\times \,100\)) (h) |
---|---|---|---|---|

Computational run time comparison | ||||

Pant (2016) | Simulated annealing | \(300^3\) | 22–47 | 2200 |

Čapek et al. (2009) | Simulated annealing | \(320^3\) | 160–400 | 16,000 |

Tahmasebi et al. (2017) | Patch-based | \(1000^2\times 300\) | 0.1 | 11 |

Okabe and Blunt (2004) | MPS | \(150^3\) | 12 | 1200 |

Current work | GAN | \(450^3\) | 8 | 8.1 |

The main limitations in computational effort come from two factors: the training time and available GPU memory. In the future, we expect the training time to decrease, due to greater performance of GPUs and development of novel GAN training methods that allow faster convergence. Furthermore GAN-based image synthesis for large spatial domains requires large amounts of GPU memory, for example reconstruction with \(450^3\) voxel requires more than 10 gigabytes of GPU memory.

Most methods reported in Table 3 incur a high computational cost per generated realization, with the exception of the method of Tahmasebi et al. (2017). We refer to these methods as proportional cost methods as the computational cost scales linearly with the number of created realizations. Training-based methods such as the presented GAN-based approach have a high initial computational cost due to the required training phase. Our method, once training is completed, has a very small generation time per realization. It is possible to determine an amortization time, when the use of one approach, considering all other factors equal, becomes beneficial.

Figure 15 presents a schematic comparison of the computational cost induced by different methods as a function of the number of realizations at a fixed image size. The amortization time, where the two curves intersect, corresponds to the number of realizations at which training-based methods, such as GANs, become faster.

## 5 Conclusions

We have presented a method to reconstruct microstructures of porous media based on gray-scale image representations of volumetric porous media. By creating a GAN-based model of an oolitic Ketton limestone, we have shown that GANs can learn to represent the statistical and effective properties of segmented representations of the pore space as well as their Minkowski functionals as a function of the image gray level. In addition to the effective permeability which is associated with a global average of the velocity field, we show that the pore-scale velocity statistical distributions have been recovered by the synthetic GAN-based models. We highlight the roles of the discriminator and generator function of the GAN and show that the GAN learns a multi-scale representation of the pore space based on inference from a latent random prior. Large hyper-parameter searches involved in the deep neural network architectures and learning instabilities make the training of GANs difficult. The high computational cost involved in training GANs is made good use of for applications when very large or many stochastic reconstructions are required. The differentiable nature of the generative network parameterised by the latent random vector provides a powerful framework in the context of gradient-based optimization and inversion techniques. Future work will focus on creating GAN-based methodologies that ensure a valid representation of the underlying data distribution allowing application of GANs for uncertainty quantification and inversion of effective material properties.

## Notes

### Acknowledgements

The authors would like to thank Hannah Menke (Imperial College London) for providing the Ketton image dataset. The authors thank H.J. Vogel of UFZ—Helmhotz Center for Environmental Research—for making the software library available for public use. O. Dubrule would like to thank Total S.A. for seconding him as a visiting professor at Imperial College London.

## References

- Alhashmi, Z., Blunt, M., Bijeljic, B.: The impact of pore structure heterogeneity, transport, and reaction conditions on fluid-fluid reaction rate studied on images of pore space. Transp. Porous Media
**115**(2), 215–237 (2016)CrossRefGoogle Scholar - Arjovsky, M., Chintala, S., Bottou, L.: Wasserstein GAN. ArXiv e-prints (2017). arXiv:1701.07875
- Arora, S., Zhang, Y.: Do GANs actually learn the distribution? An empirical study. ArXiv e-prints (2017). arXiv:1706.08224
- Berg, C.F., Lopez, O., Berland, H.: Industrial applications of digital rock technology. J. Pet. Sci. Eng.
**157**(Supplement C), 131–147 (2017). https://doi.org/10.1016/j.petrol.2017.06.074. http://www.sciencedirect.com/science/article/pii/S0920410517305600 - Berryman, J.G.: Relationship between specific surface area and spatial correlation functions for anisotropic porous media. J. Math. Phys.
**28**(1), 244–245 (1987)CrossRefGoogle Scholar - Berthelot, D., Schumm, T., Metz, L.: BEGAN: boundary equilibrium generative adversarial networks. ArXiv e-prints (2017). arXiv:1703.10717
- Blunt, M.J., Bijeljic, B., Dong, H., Gharbi, O., Iglauer, S., Mostaghimi, P., Paluszny, A., Pentland, C.: Pore-scale imaging and modelling. Adv. Water Resour.
**51**, 197–216 (2013)CrossRefGoogle Scholar - Caers, J.: Geostatistical reservoir modelling using statistical pattern recognition. J. Pet. Sci. Eng.
**29**(3–4), 177–188 (2001)CrossRefGoogle Scholar - Čapek, P., Hejtmánek, V., Brabec, L., Zikánová, A., Kočiřík, M.: Stochastic reconstruction of particulate media using simulated annealing: improving pore connectivity. Transp. Porous Media
**76**(2), 179–198 (2009). https://doi.org/10.1007/s11242-008-9242-8 - Che, T., Li, Y., Jacob, A.P., Bengio, Y., Li, W.: Mode regularized generative adversarial networks. ArXiv e-prints (2016). arXiv:1612.02136
- Chiu, S.N., Stoyan, D., Kendall, W.S., Mecke, J.: Stochastic Geometry and its Applications. Wiley, Hoboken (2013)CrossRefGoogle Scholar
- Dumoulin, V., Visin, F.: A guide to convolution arithmetic for deep learning. ArXiv e-prints (2016). arXiv:1603.07285
- Efros, A.A., Freeman, W.T.: Image quilting for texture synthesis and transfer. In: Proceedings of the 28th Annual Conference on Computer Graphics and Interactive Techniques, pp. 341–346. ACM (2001)Google Scholar
- Flannery, B.P., Deckman, H.W., Roberge, W.G.: D’AMICO, K.L.: Three-dimensional X-ray microtomography. Science
**237**(4821), 1439–1444 (1987)CrossRefGoogle Scholar - Goodfellow, I.: NIPS 2016 tutorial: generative adversarial networks. ArXiv e-prints (2017). arXiv:1701.00160
- Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., Courville, A., Bengio, Y.: Generative adversarial nets. In: Advances in Neural Information Processing Systems, pp. 2672–2680 (2014). http://papers.nips.cc/paper/5423-generative-adversarial-nets.pdf
- Guardiano, F.B., Srivastava, R.M.: Multivariate Geostatistics: Beyond Bivariate Moments, pp. 133–144. Springer, Netherlands (1993). https://doi.org/10.1007/978-94-011-1739-5_12
- Gulrajani, I., Ahmed, F., Arjovsky, M., Dumoulin, V., Courville, A.: Improved training of wasserstein GANs. ArXiv e-prints (2017). arXiv:1704.00028
- Ioffe, S., Szegedy, C.: Batch normalization: accelerating deep network training by reducing internal covariate shift. ArXiv e-prints (2015). arXiv:1502.03167
- Jetchev, N., Bergmann, U., Vollgraf, R.: Texture synthesis with spatial generative adversarial networks. ArXiv e-prints (2016). arXiv:1611.08207
- Jiao, Y., Stillinger, F., Torquato, S.: A superior descriptor of random textures and its predictive capacity. Proc. Nat. Acad. Sci.
**106**(42), 17634–17639 (2009)CrossRefGoogle Scholar - Jiao, Y., Stillinger, F.H., Torquato, S.: Modeling heterogeneous materials via two-point correlation functions. II. Algorithmic details and applications. Phys. Rev. E
**77**, 031,135 (2008). https://doi.org/10.1103/PhysRevE.77.031135 - Joshi, M.: A Class of Stochastic Models for Porous Media. University of Kansas, Chemical and Petroleum Engineering (1974). https://books.google.co.uk/books?id=_zu8OwAACAAJ
- Kaae Sønderby, C., Caballero, J., Theis, L., Shi, W., Huszár, F.: Amortised MAP inference for image super-resolution. ArXiv e-prints (2016). arXiv:1610.04490
- Kodali, N., Abernethy, J., Hays, J., Kira, Z.: On convergence and stability of GANs. ArXiv e-prints (2017). arXiv:1705.07215
- LeCun, Y., Bottou, L., Orr, G.B., Müller, K.R.: Efficient backprop. In: Neural Networks: Tricks of the Trade, pp. 9–50. Springer (1998)Google Scholar
- Mariethoz, G., Caers, J.: Multiple-Point Geostatistics: Stochastic Modeling with Training Images. Wiley, Hoboken (2014)CrossRefGoogle Scholar
- Matheron, G.: Random Sets and Integral Geometry. Wiley series in probability and mathematical statistics: probability and mathematical statistics. Wiley (1975). https://books.google.co.uk/books?id=bgzvAAAAMAAJ
- Mecke, K.R.: Additivity, convexity, and beyond: applications of Minkowski functionals in statistical physics. In: Statistical Physics and Spatial Statistics, pp. 111–184. Springer (2000)Google Scholar
- Menke, H., Bijeljic, B., Blunt, M.: Dynamic reservoir-condition microtomography of reactive transport in complex carbonates: effect of initial pore structure and initial brine pH. Geochim. Cosmochim. Acta
**204**, 267–285 (2017)CrossRefGoogle Scholar - Mosser, L., Dubrule, O., Blunt, M.J.: Reconstruction of three-dimensional porous media using generative adversarial neural networks. Phys. Rev. E
**96**, 043,309 (2017). https://doi.org/10.1103/PhysRevE.96.043309 - Mosser, L., Dubrule, O., Blunt, M.J.: Conditioning of three-dimensional generative adversarial networks for pore and reservoir-scale models. ArXiv e-prints (2018). arXiv:1802.05622
- Mostaghimi, P., Blunt, M.J., Bijeljic, B.: Computations of absolute permeability on micro-CT images. Math. Geosci.
**45**(1), 103–125 (2013). https://doi.org/10.1007/s11004-012-9431-4 - Okabe, H., Blunt, M.J.: Prediction of permeability for porous media reconstructed using multiple-point statistics. Phys. Rev. E
**70**, 066,135 (2004). https://doi.org/10.1103/PhysRevE.70.066135 - Okabe, H., Blunt, M.J.: Pore space reconstruction using multiple-point statistics. J. Pet. Sci. Eng.
**46**(1–2), 121–137 (2005). https://doi.org/10.1016/j.petrol.2004.08.002 - Okabe, H., Blunt, M.J.: Pore space reconstruction of vuggy carbonates using microtomography and multiple-point statistics. Water Resour. Res.
**43**(12), 3–7 (2007). https://doi.org/10.1029/2006WR005680 - Øren, P.E., Bakke, S.: Reconstruction of berea sandstone and pore-scale modelling of wettability effects. J. Pet. Sci. Eng.
**39**(3), 177–199 (2003). https://doi.org/10.1016/S0920-4105(03)00062-7 - Otsu, N.: A threshold selection method from gray-level histograms. Automatica
**11**(285–296), 23–27 (1975)Google Scholar - Pant, L.M.: Stochastic characterization and reconstruction of porous media. Ph.D. thesis, University of Alberta (2016)Google Scholar
- Quiblier, J.A.: A new three-dimensional modeling technique for studying porous media. J. Colloid Interface Sci.
**98**(1), 84–102 (1984)CrossRefGoogle Scholar - Radford, A., Metz, L., Chintala, S.: Unsupervised representation learning with deep convolutional generative adversarial networks. ArXiv e-prints (2015). arXiv:1511.06434
- Schmähling, J.: Statistical characterization of technical surface microstructure. Ph.D. thesis, Ruprecht-Karls-Universität, Heidelberg (2006)Google Scholar
- Shi, W., Caballero, J., Huszár, F., Totz, J., Aitken, A.P., Bishop, R., Rueckert, D., Wang, Z.: Real-time single image and video super-resolution using an efficient sub-pixel convolutional neural network. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1874–1883 (2016)Google Scholar
- Siddique, N., Salehi, A., Liu, F.: Stochastic reconstruction and electrical transport studies of porous cathode of li-ion batteries. J. Power Sources
**217**(Supplement C), 437–443 (2012). https://doi.org/10.1016/j.jpowsour.2012.05.121. http://www.sciencedirect.com/science/article/pii/S0378775312010166 - Singh, K., Menke, H., Andrew, M., Lin, Q., Rau, C., Blunt, M.J., Bijeljic, B.: Dynamics of snap-off and pore-filling events during two-phase fluid flow in permeable media. Sci. Rep.
**7**(1), 5192 (2017)CrossRefGoogle Scholar - Tahmasebi, P., Hezarkhani, A., Sahimi, M.: Multiple-point geostatistical modeling based on the cross-correlation functions. Comput. Geosci.
**16**(3), 779–797 (2012)CrossRefGoogle Scholar - Tahmasebi, P., Sahimi, M.: Reconstruction of three-dimensional porous media using a single thin section. Phys. Rev. E
**85**(6), 066,709 (2012)Google Scholar - Tahmasebi, P., Sahimi, M.: Cross-correlation function for accurate reconstruction of heterogeneous media. Phys. Rev. Lett.
**110**(7), 078,002 (2013)Google Scholar - Tahmasebi, P., Sahimi, M., Andrade, J.E.: Image-based modeling of granular porous media. Geophys. Res. Lett.
**44**(10), 4738–4746 (2017). https://doi.org/10.1002/2017GL073938 - Torquato, S.: Random Heterogeneous Materials: Microstructure and Macroscopic Properties, vol. 16. Springer, New York (2013). https://doi.org/10.1007/978-1-4757-6355-3
- Torquato, S., Lado, F.: Characterisation of the microstructure of distributions of rigid rods and discs in a matrix. J. Phys. A Math. Gen.
**18**(1), 141 (1985). https://doi.org/10.1088/0305-4470/18/1/025 - Vogel, H.J.: Quantim (2008). http://www.quantim.ufz.de
- Vogel, H.J., Weller, U., Schlüter, S.: Quantification of soil structure based on Minkowski functions. Comput. Geosci.
**36**(10), 1236–1245 (2010). https://doi.org/10.1016/j.cageo.2010.03.007 - Yeh, R.A., Chen, C., Yian Lim, T., Schwing, A.G., Hasegawa-Johnson, M., Do, M.N.: Semantic image inpainting with deep generative models. ArXiv e-prints (2016). arXiv:1607.07539
- Yeong, C.L.Y., Torquato, S.: Reconstructing random media. Phys. Rev. E
**57**, 495–506 (1998). https://doi.org/10.1103/PhysRevE.57.495 - Zachary, C.E., Torquato, S.: Improved reconstructions of random media using dilation and erosion processes. Phys. Rev. E
**84**(5), 056,102 (2011)Google Scholar - Zhao, J., Mathieu, M., LeCun, Y.: Energy-based generative adversarial network. ArXiv e-prints (2016). arXiv:1609.03126

## Copyright information

**Open Access**This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.