1 Introduction

Geostatistical simulation is the most commonly used approach to quantify spatial geological uncertainty in mineral deposits, with the usual goal being better informed engineering and business decisions (Dimitrakopoulos 2011). Such a framework requires transferring the spatial uncertainty provided by the set of simulations to engineering or economic uncertainty; a typical example would be to exploit the characterization of the spatial uncertainty for drilling scheme selection. For instance, determining efficient drilling density can be achieved with stochastic simulations (Boucher et al. 2005) that may require a relatively large number of realizations to assess the expected performance of drilling schemes. The applicability of such a study depends on the availability of efficient simulation algorithms. Although methods for simulating individual attributes are generally efficient and easily accessible (Deutsch and Journel 1992; Goovaerts 1997; Remy et al. 2008; Dimitrakopoulos and Luo 2004; Godoy 2002; Chiles and Delfiner 2012), existing methods for jointly modeling multivariable deposits are in practice limited, particularly for deposits with more than two attributes of interest. For example, a realistic model of an iron deposit must account for silica, alumina, phosphorous and organic matters in addition to iron and must reproduce the joint local variability of the attributes of interest.

Approaches for joint simulation include methods based on the model of linear coregionalization, or LMC (Goovaerts 1997), and conditioning of simulated correlated fields (Chiles and Delfiner 2012), extension of the conditional univariate LU decomposition method of David (1987) to joint simulation (Myers 1988), combination of the LU vector simulation and sequential simulation for large joint simulation of two variables (Verly 1993), and direct co-simulation (Soares 2001). Any method relying on an LMC becomes cumbersome for more than three variables; the major difficulty being the inference of the related coregionalization model. Another approach is to simulate the variables in turn conditional to the previously simulated variables. For instance, in the case of an iron ore deposit one would first simulate the iron content, then simulate the alumina conditional to the previously simulated iron grade, and so on (Almeida and Journel 1994). In such a case, simplified models of coregionalization such as the Markov model 1 and Markov model 2 (Journel 1999) are used, whereas only the secondary collocated data is considered. An alternative to direct joint simulation is to use service variables that are uncorrelated (orthogonal) such that each service variable can be independently simulated and then recombined into the original variables. The back-transformation of simulated service variables restores the histograms, variograms and cross-variograms of the data.

The most common and longstanding type of transform is the change of basis through a principal components type of transform (David et al. 1984; David 1988). The service variables are then a linear combination of the original variables and, by construction, collocated attributes have zero covariance. However, PCA-based transformation does not ensure that the service variables are uncorrelated when not collocated, that is any lag greater than zero (Wackernagel et al. 1989; Goovaerts 1993), the only exception being a set of attributes that display proportional spatial correlation (intrinsic model of coregionalization). Leuangthong and Deutsch (2003) introduce the step-wise transform, a multivariate non-linear Gaussian transform, to generate lag-zero uncorrelated service variables. The step-wise transform provides, in one step, both the orthogonalization and the normal-score transform needed for all Gaussian-based simulation algorithms such as sequential Gaussian simulation. The main drawbacks of the step-wise transformation is the need for a large number of drill-hole data when dealing with several geological attributes; at the same time, being a global and location-independent transformation it suffers from an inherent deconstruction of local spatial connectivity of ranges of grade values of interest. Mueller and Ferreira (2012) present the U-WEDGE transformation approach for multivariate simulation.

A significant improvement of the PCA approach to joint simulation is presented by Desbarats and Dimitrakopoulos (2000) based on the minimum/maximum autocorrelation factors (MAF), a factorization method originally developed for remote-sensing applications (Switzer and Green 1984). It can be shown that MAF generates completely decorrelated factors at all lags, if the variogram model of the related variables follows the linear model of coregionalization with two structures. In comparison, the PCA approach provides such a guarantee for only one structure. Other types of coregionalization may still leave some correlation between attributes. Note that the MAF approach does not actually require modeling the coregionalization. Joint simulations of mineral deposits based on MAF are shown to be effective, relatively efficient, flexible and practical (Boucher 2003; Dimitrakopoulos and Fonseca 2003; Bandarian et al. 2008; Lopes et al. 2011; Rondon 2012). The efficiency of joint simulation with MAF is further enhanced when the simulation is done directly on a block-support scale (Godoy 2002).

For a mineral deposit, the target support or mining block or selective mining unit (Switzer and Parker 1975) is on a larger scale than the available drilling data (typically considered as point-support), hence a change of support process is required. The current practice consists of simulating the entire deposit on point-support followed by a post-processing step where simulated point values within a mining block are averaged. This approach makes no change of support assumptions. It can become cumbersome in practical terms when dealing with large deposits represented with several millions or tenths of millions of mining blocks and in some cases large mineral deposits represented by a billion or more mining blocks; an example of the latter size is the Escondida copper deposit, Chile, represented with about three billion mining blocks and several attributes per block (Baird 2011). The simulation of point values and re-blocking process has severe computational drawbacks as it needs to process, store and manage large sets of data and generate files that are often two orders of magnitude greater than the target size. For instance, take a model with 50 million blocks where the blocks are discretized into 125 points (5×5×5); the output model of the ore body will be about a 200-Mb file versus a 2.5-Gb file if all the points were to be kept in memory. For such an ore body model, a 32-bit operating system cannot efficiently work at the point-scale simulation but may accommodate simulation at the block-support scale. An alternative simulation method to simulate directly at the block-support size is proposed by Godoy (2002) and extended to the joint simulation of several correlated variables using MAF by Boucher and Dimitrakopoulos (2009, 2007), in an algorithm termed dbmafsim. The method minimizes the information stored by retaining only the block values, a procedure that significantly speeds up the simulation process while reducing the size of the output files for more efficient data storage and management. The application of MAF in mineral deposits has been shown in several studies (Dimitrakopoulos and Fonseca 2003; Lopes et al. 2011) to work particularly well and reproduce the statistical, spatial continuity and cross continuity, as well as scatter-plots, in spite of being based on simulating independent service variables.

This paper presents a full-field application of the dbmafsim algorithm at a multivariate iron ore deposit that demonstrates the key aspects and capabilities of the method for applied joint simulation of multi-element mineral deposits. In the following sections, the joint-simulation method is reviewed for completeness. Then, the application at the Yandi Central 1 iron ore deposit in Western Australia is detailed and includes the joint simulation of iron content, silica, alumina, phosphorus and loss on ignition (organic material), directly at the block scale. Conclusions follow.

2 Block-Support Simulation of Multiple Correlated Variables—A Recall

2.1 Algorithm Overview

The dbmafsim algorithm is an extension of the sequential Gaussian simulation (sgsim) algorithm. The simulation is performed on orthogonalized service variables where the simulation path visits the blocks instead of individual points and the conditioning is based on previously simulated blocks instead of previously simulated points. The orthogonal service variables are obtained with the minimum/maximum autocorrelation factorization. For each service variable a block value is drawn by simulating its discretizing points using the LU simulation algorithm delivering speed while using a unique neighborhood search.

Note that non-linear transformation and back-transformation of the data to and from a Gaussian distribution, such as the affine normal-score transform, may not be defined on block-support. The back-transformation must be then done on the simulated points and not on the simulated blocks, requiring an extra step not present in sequential simulation algorithms. In addition to averaging the simulated service variables for further conditioning (the previously simulated blocks), each group of points are back-rotated from the MAF service variables to the Gaussian space and then back-transformed to their original distribution and finally averaged into block values. The block values generated are then the actual simulated attributes at that block and is not used any further in the simulation process. It is the block values derived from the service variables that are used for further conditioning the LU system of equations for the remaining blocks on the simulation path.

The dbmafsim algorithm is:

  1. 1.

    Transform the data Z(u) to the normal-score data Y(u).

  2. 2.

    Transform Y(u) with the MAF transformation to orthogonal factors M(u).

  3. 3.

    Define a random path visiting each block.

  4. 4.

    For each block v, simulate the discretizing N points \(m^{k}_{s} (u_{i} )\), i=1,…,N, with an LU decomposition for each service variable k=1,…,K through Eq. (12):

    1. a.

      Average the point-support service variables \(m^{k}_{s} (u_{i} )\), i=1,…,N, at the block-support for further conditioning.

    2. b.

      Back-transform \(m^{k}_{s} (u_{i} )\), i=1,…,N, to \(z^{k}_{s} (v )\), k=1,…,K, and output the block values.

  5. 5.

    Repeat until all blocks are simulated.

The orthogonalization with MAF and the block-support simulation with the LU algorithm are explained next.

2.2 Orthogonalization with Minimum/Maximum Autocorrelation Factors

Consider a stationary vector random function (RF) Z(u)={Z 1(u),…,Z K(u)} transformed into its Gaussian, equivalent Y(u); through Y(u)=Φ(Z(u)) or when the transformation is done independently for each variable

(1)

The resulting vector RF Y(u) is composed of K Gaussian RFs that are assumed to be multi-Gaussian. The MAF service variables are derived as a new vector RF, M(u)= {M 1(u),…,M K(u)}, where the K RFs are independent and obtained from the multi-Gaussian vector RF Y(u) using the coefficients A such that

(2)

Note that the MAF service variables, M(u), are a linear function of Y(u), which are a non-linear transformation of the original data Z(u) such that

(3)

and conversely,

(4)

The orthogonalization coefficients matrix A is generated from

(5)

with

(6)

where B is the variance/covariance matrix of Y(u) and Γ Y (h) is the variogram matrix at lag h. The above derivation of A is equivalent to performing two successive principal components (PCA) decompositions (Desbarats and Dimitrakopoulos 2000; Rondon 2012).

Finally, the block values are computed with

(7)

with u i , i=1,…,N, being a group of points discretizing the block at location v.

2.3 Simulating the Service Variables with the Cholesky Decomposition

The advantages of working with orthogonal service variables are that (a) only the direct covariance models, C 1(h),…,C k (h), are necessary, and (b) the simulation can be done on each service variables independently. Consider a block at location v discretized with a vector of N points of the kth service variable \({\mathbf{m}}^{k}_{s}= \{m^{k} (u_{1} ),\dots ,m^{k} (u_{N} ) \}\) with u i v, and with a neighborhood made of data points and previously simulated blocks \({\mathbf{m}}^{k}_{d}\). The vector \({\mathbf{m}}^{k}_{s}\) can be simulated through the Cholesky decomposition

(8)

where

(9)

\({\mathbf{C}}^{k}_{d}\) is the covariance matrix of conditioning data comprised of the drill-hole data and the previously simulated blocks for the service variable M k(u), \({\mathbf{C}}^{k}_{s}\) is the point-support covariance matrix between the point discretizing a block, and \({\mathbf{C}}^{k}_{sd}\) is the matrix of point and point-to-block covariance between the discretizing points and the known values (drill holes and previously simulated blocks). All covariance values are derived from the model C k (h) or its regularization (Journel and Huijbregts 1978) when a block-support data is considered.

More specifically, the submatrix for the conditioning data is

(10)

The pp index refers to the point-to-point covariance between drill-hole data, pb to the point-to-block covariance between the drill-hole data and the previously simulated blocks, and bb the block-to-block covariance between previously simulated blocks.

The submatrix between the conditioning data and the vector of point-support data discretizing the block is

(11)

where \({\mathbf{C}}^{k,pp}_{sd}\) and \({\mathbf{C}}^{k,pb}_{sd}\) are the covariance matrix of the discretizing points with the point-scale drill-hole data and with the previously simulated blocks, respectively. There is one such system of equations per service variable; note that searching for neighborhood data (drill holes and previously simulated blocks) is done only once per block location for all service variables.

The group of nodes for service variable k is simulated with

(12)

with

(13)

The variables \({\mathbf{m}}^{k}_{p}\) and \({\mathbf{m}}^{k}_{b}\) are respectively the service variable vectors of the drill holes and vector of the previously simulated blocks within the search neighborhood. The vector w s is made up of standard Gaussian deviates. Finally, the block value to be used for further conditioning is the arithmetic average of \({\mathbf{m}}^{k}_{s}\) and the final block values are computed with Eq. (7).

2.4 Algorithmic Considerations

The group simulation with the LU algorithm of Eq. (12), coupled with the reduced searches for neighboring data, means there are N times fewer searches being done than on a point-wise simulation algorithm, contributing to an overall increase in speed and efficiency. See Dimitrakopoulos and Luo (2004), and Benndorf and Dimitrakopoulos (2007) for guidelines in choosing optimal block discretization and search parameters. When the number of discretizing nodes used in blocks is suitably chosen, the simulation with LU systems is substantially faster than the solving co-kriging systems of equations of variables (Dimitrakopoulos and Luo 2004).

3 Application at Yandi Central 1, Iron Ore Deposit, WA

The Yandi Central 1 iron ore deposit is a part of the larger Yandicoogina-Marillana detrital channel deposits in Western Australia. The mine is located approximately 120 km north-west of Newman, in Hamersley province. BHP Billiton started operations at Yandi in 1992 with estimated resources of about 1500 million tonnes of high-grade iron ore. The iron ore is derived from the erosion of the banded iron formation of Hamersley province and trapped in paleochannel incisions within the Weeli Wolli Formation that formed the surface approximately 40 to 50 million years ago (Stone et al. 2002). The deposit is composed of cemented masses of concretionary iron oxides, largely goethite (Hall and Kneeshaw 1990; Stone et al. 2002). The main ore zone of Yandi Central contains a consistently high level of iron, together with high and low levels of silica and generally low levels of alumina.

3.1 Study Area and Data

The section of the channel at Yandi Central 1 considered in this study is 4.1-km long by 500-m wide. This study is concerned with a section of the main ore zone located above the water table, a section 30-m thick on average. The part of the deposit modeled is discretized into 40,698 blocks, each 25 m×25 m×2 m, filling a volume of about 50.87 million cubic meters. This is an economically important zone within the deposit because of its volume, high iron content and low silica, alumina and phosphorus content. The study area is covered by 961 drill holes (Fig. 1), representing 7126 two-meter long composites of iron content (Fe), silica (SiO2), alumina (Al2O3), phosphorus (P) and loss on ignition (LOI), all present in the available composites. Drilling density is approximately on a regular 50-m grid, with the exception of a densely drilled section in the middle of the ore body. In this study, the main ore zone is divided into the external and internal zones shown in Fig. 1. The statistics for the data in the two zones are given in Table 1. Note that the internal zone differs from the larger external zone by having higher Fe and Al2O3 content and slightly lower SiO2 content. The SiO2 in the external zone also displays higher variability. The correlations and rank correlations between the elements in the data set are shown in Table 2. Three strongly correlated elements are present: Fe, SiO2 and Al2O3. This has an impact when the two correlations differ significantly, as is the case with the Fe–Al2O3 and SiO2–Al2O3 correlations.

Fig. 1
figure 1

Drill holes available for the Yandi Central 1 deposit and division into external and internal zones

Table 1 Summary statistics for the data in the external and internal zones
Table 2 Pearson’s correlation (left) and rank correlation (right) between variables

3.2 Minimum/Maximum Autocorrelation Factors

The variance/covariance matrix Γ(h) in Eq. (5) is computed using data pairs located at a minimum of 110 m and a maximum of 155 m apart in the horizontal plane, but with no more than 2 m of vertical separation. This distance is, on average, just larger than the range of the first spherical structure and much shorter than the second one for the five variables. The vertical constraint of 2 m is necessary to avoid the various local vertical trends generating an artificial increase in variance. The factor coefficients A are presented in Table 3. In both zones Fe has the highest coefficient and is always important in the determination of all MAF service variables, followed by SiO2. P is the least important element, but it is also the one having the least correlation with the others elements. The two most important factors, MAF1 and MAF2 (representing almost 50 % of the variance), are essentially a combination of Fe and SiO2 with the addition of LOI for MAF2. Examples of experimental and model variograms of the MAF service variables for the external zone are shown in Fig. 2. It is interesting to note that the range of a factor seems to be inversely proportional to its contribution to the variance. With slightly less than 15 % of the global variance, MAF5 has the longest range at 680 and 770 m for the external and the internal zones, respectively. On the other hand, MAF1 has a range of 160 and 200 m for the two zones, but represents around 25 % of the global variance. Representative examples of cross-variograms are shown in Fig. 3 and, as expected, the decorrelation between the service variables is excellent.

Fig. 2
figure 2

Experimental and model variograms for the five MAF service variables for the external zone. Note the increase of the range from the first to the fifth service variable

Fig. 3
figure 3

Selected experimental cross-variograms between MAF service variables for the external zone

Table 3 Coefficients A of MAF for the external zone

3.3 Joint Conditional Simulation at the External and Internal Zones

The external and internal zones are separately simulated 20 times using the dbmafsim method and subsequently merged. Each realization contains 40,698 blocks, each 25 m×25 m×2 m, with simulated Fe, P, SiO2, Al2O3 and LOI content. For comparison, at the point-support scale this corresponds to 2,122,752 points for the external zone and 481,920 points in the internal zone per realization. In addition to visual inspection, the simulations are subsequently validated by: (i) quantile–quantile plots between drill-hole data and simulated point-support values; (ii) comparison of drill-hole data and simulation values scatter plots at point-support; (iii) block variogram validation with their respective regularized (block-support) variogram models; and (iv) assessment of the vertical profiles of the realizations at block-support. Note that the point-support quantile–quantile plots and scatter plots were generated for illustrative purpose only since the simulation algorithm outputs result at block scale.

One joint simulation for the five elements conditional to the drill-hole data (Fig. 1) is shown in Fig. 4. The correlation between Fe, SiO2 and Al2O3 is apparent from the realizations; the low levels of SiO2 and Al2O3 are present where there is a high level of Fe. The preservation of the negative spatial correlation between Fe and SiO2 is clearly visible in Fig. 5 where the locations of the first-decile block values of Fe match the locations of the 9th-decile block values of SiO2 and vice versa.

Fig. 4
figure 4

Representation of a joint simulation for each element. Right: Full block model. Left: Selected sections. The block size is 25 m×25 m×2 m. Note the strong vertical trend of the LOI variable

Fig. 5
figure 5

Locations of blocks that have values within the lower decile (left) and the upper decile (right) for Fe and SiO2. Note the strong inverse correlation between the two elements

For validation purposes, the point-scale values for the external zone of the first simulation, a total of 2,122,752 points, were kept to compare with the statistics from the drill-hole data (3873 values). As shown in Fig. 6, the simulated points do reproduce the distributions of the drill-hole data. Only the high values of Al2O3 significantly deviate from the original distribution. The point-scale scatter plots between selected elements for the data (left) and the first simulation (right) are shown in Fig. 7. The shape of the clouds of points remains somewhat preserved; the coefficient of correlation is lower for the simulations than for the drill-hole data. It is noteworthy that the rank correlation is always well preserved. This indicates that the loss of correlation is due to the independent forward and back normal-score transformation applied to the data and not to the MAF orthogonalization. Recall that the normal-score transformation is a rank-preserving transformation; this point is explored in more detail when discussing the reproduction of cross-variograms. An important feature of iron ore deposit in general and of this deposit in particular is the vertical trends of the elements. The trends are present in the drill-hole data variograms (Fig. 2) and visually in the simulations (Fig. 4). The vertical profiles between the point-scale drill-hole data and those of two simulations on block-support can be directly compared in Fig. 8. The general shape of the profiles is well reproduced in all cases but with an apparent increase in variability for the simulation profiles.

Fig. 6
figure 6

QQ-plot between data and simulation #1 on point-support. Only the Al2O3 attribute significantly diverges from the data distribution

Fig. 7
figure 7

Scatter plot between Fe–SiO2, SiO2–Al2O3 and Fe–Al2O3 for the data (left) and simulation #1 on point-support (right). Note that the rank correlation is systematically better reproduced than Pearson’s correlation. The strong negative correlation for Fe–Al2O3 is not well reproduced

Fig. 8
figure 8

Vertical profile of each element for the point scale data (black line) and two simulations at block scale (grey line)

Validation of variogram and cross-variogram reproduction is performed in the data space at the block-support scale. For each element of the external zone the regularized data variogram and cross-variogram models at block-support (25 m×25 m×2 m) are compared with their corresponding variograms from the 20 simulations (Fig. 9). The direct variograms do reproduce the spatial correlation of the data. The various vertical trends can be clearly seen in the vertical experimental variogram calculated from the simulations. Note that none of the variogram models of Fig. 9 were considered during the simulation; only the direct variogram of the MAF service variables were used. Figure 10 shows the cross-variograms between Fe–SiO2, SiO2–LOI, Fe–LOI, Al2O3–P and SiO2–Al2O3. Of particular importance is the correlation between Fe and silica. This correlation is very well preserved because the standard and the rank correlations are similar. On the other hand, the block cross-variogram between SiO2 and Al2O3 is not reproduced since normal-score back-transformation destroyed the correlation built from the MAF service variables (recall Fig. 7 for the scatter plot). The impact of the back-transformation on the spatial correlation of Al2O3 can be seen by looking at the reproduction of the normal score of the block data in Fig. 11. The variograms of the simulated normal-score blocks do match with the regularized model of the normal-score data. Since normal-score transformations are rank transformations, they contribute to the poor reproduction of the cross-variograms for the pairs of elements with the largest difference between their rank correlation and their standard correlation. Overall, the simulations reproduce spatial features of the original data that have not been explicitly modeled. This observation has also been made in point simulations using MAF (Dimitrakopoulos and Fonseca 2003; Bandarian et al. 2008; Lopes et al. 2011).

Fig. 9
figure 9

Block variogram for the external zone calculated for the 20 joint simulations. The block-scale model is obtained by fitting variogram models on the point-scale data and upscaling them to a 25 m×25 m×2 m support. Note that only the variogram of the MAF service variables were used to generate the simulations

Fig. 10
figure 10

Cross-variogram at block scale for the external zone calculated for the 20 joint simulations. The block-scale model is obtained by fitting variogram models on the point-scale data and upscaling them to a 25 m×25 m×2 m support. Note that only the variogram of the MAF service variables were used to generate the simulations

Fig. 11
figure 11

Cross-variograms for SiO2–Al2O3 and SiO2–LOI in the normal score space on block-support. These two pairs of elements showed marginal to poor variogram reproduction in the data space (Fig. 10) but display good reproduction here in the normal score space

4 Conclusions

This paper has shown the practical aspects of an efficient framework for the joint simulation of correlated variables based on minimum/maximum autocorrelation factors and directly generating realizations at the block-support scale. The MAF factors are used to transform a set of geological attributes to independent service variables that are then simulated independently. Orthogonalization with MAF combined with the simulation directly at the block-support scale has been shown to be well suited for multi-element deposits.

The practical aspects of the dbmafsim algorithm were shown in an application at the Yandi iron ore deposit in Western Australia and the joint simulation of five cross-correlated elements, namely Fe, SiO2, Al2O3, P and LOI. Two main advantages from the application presented herein are worth noting. First, the use of dbmafsim only requires fitting 10 independent variograms to the MAF service variables instead of two sets of 15 permissible variograms and cross-variograms needed for a full model of coregionalization. This is a major practical aspect of the approach that facilitates the routine simulation of multi-element deposits. Second, the direct simulation at block-support is computationally efficient as well as outputs manageable file sizes that can easily be transferred and visualized on a desktop. In addition, it has been shown that the block-support simulations reproduce the direct- and cross-variograms of the elements even though only the direct variograms of the service variables have to be modeled. Note that in the case where the rank correlation differs significantly from the standard correlation, the normal-score transformation seems to limit the reproduction of cross-variograms in the data space, although this is a less important issue compared with the positive applied contributions of the method: the implementation of the direct sequential simulation algorithm (Journel 1999; Soares 2001; Bandarian et al. 2008) where the normal-score transformation is not used may then be an alternative.