Huang et al. (2016a) claimed that they proposed a “spatial hidden Markov chain (SHMC) model” based on the Markov chain random field (MCRF) theory proposed by Li (2007). However, we found that their so-called SHMC model is exactly the same as the colocated MCRF cosimulation (coMCRF) model with one auxiliary categorical data set. The coMCRF model was first presented on GIScience 2012 conference (Li and Zhang 2012) and then presented on GeoComputation 2013 conference (Li and Zhang 2013a). It was mentioned in Li and Zhang (2013b), described and used in the research article Li et al. (2013), and then further described and used in the research article Li et al. (2015). However, Huang et al. (2016a) seemingly ignored the coMCRF model in all of these published online materials and journal articles. Obviously, Huang et al. reinterpreted the coMCRF model and claimed that they proposed a SHMC model in Huang et al. (2016a). In addition, Huang et al. (2016a) performed an inaccurate case study to support their claim for article publication by apparently using computer programs for other models; they made many inaccurate statements about geostatistics, Markov chain models, and Markov random fields, which are misleading. This is not their first problematic article related to the MCRF model. It is our obligation to clarify the misunderstandings and point out the mistakes.

Introduction

In “Introduction” section, Huang et al. (2016a) stated “Spatial Markov chain (SMC) models have also been widely adopted in petroleum reservoir to characterize the spatial heterogeneity of categorical variables through the conditional probabilities (transition probabilities) from different directions (Carle and Fogg 1997; Weissmann and Fogg 1999)”. However, Carle and Fogg (1997) introduced continuous-lag Markov chain models, which were actually a joint modelling method of spatial transition probabilities for providing transition probability parameters to transition probability-based indicator geostatistics (TPROGS) (Carle and Fogg 1996). Weissmann and Fogg (1999) was focused on hydrofacies modelling using TPROGS, in which what SMC models were adopted by them? Then, Huang et al. (2016a) grouped both the coupled Markov chain (CMC) model proposed by Elfeki and Dekking (2001) and the MCRF model proposed by Li (2007) into SMC models. Thus, they regarded the CMC model, the MCRF model, and the TPROGS model all as SMC models, no matter whether these models are kriging models or nonlinear Markov chain models. We don’t think it is proper to call all of these three models “SMC models”, although they all use transition probabilities, while neither Carle and Fogg nor Elfeki and Dekking call their model a “spatial Markov chain” model. Indeed, Li (2007) used the term “spatial Markov chain” to refer to the Markov chain in a specific MCRF and also used the name of “spatial Markov chain model” for specific MCRF models with different numbers of nearest neighbours and spatial dimensions (i.e. one to six nearest neighbours in cardinal directions, and one to three dimensions). However, considering that this term was not sufficiently unique to represent “single-chain-based multidimensional Markov chain model” (Li and Zhang 2008), Li (2007) and our later papers used the name of “Markov chain random field” to differentiate the new Markov chain model from other Markov chain models for spatial data. Once the name of a model was decided by the model proposer(s), it is not respectful for others to rename it without sufficient reasons and scientific progress.

Huang et al. (2016a) stated “Most traditional geostatistical models, like Markov chain random field (MCRF), use well data only and make prediction based on SMC, which results in a relatively low prediction accuracy (Huang et al. 2016b)”. This is not true! First, traditional geostatistical models such as cokriging and indicator cokriging that make use of auxiliary data have existed for decades. The MCRF model is not a traditional geostatistical model. It even has not been well developed into a software system for release. Second, Huang et al. (2016b) did not use a proper MCRF model to make prediction, but just played a trick to mislead readers by mistakenly claiming that they derived the MCRF model from the conventional Markov random field model. Third, coMCRF model, which may use one or multiple auxiliary data sets, has already been proposed (Li et al. 2013, 2015). Huang et al. (2016a) stated “Huang et al. (2016c) introduced a beta-transformed Bayesian updating model to boost the classification accuracy of category random field”. We don’t think that Huang et al. (2016c) used the proper simulation algorithms and computer programs to perform simulations, except that they just wanted to quickly claim new geostatistical models even without understanding the differences between spatial models and non-spatial models. They further stated: “Auxiliary information, however, has not been taken into consideration”. If this was true, then what had Li et al. (2013) and Li et al. (2015) done? Didn’t these two papers present the coMCRF model for incorporating auxiliary information into the MCRF model? Didn’t these two papers conduct cosimulations for soil class map updating and land cover post-classification, respectively? After the proposition of the MCRF model as a geostatistical model, its extension for collaborative modelling is a natural thing. In addition, both the MCRF model and the coMCRF model are not limited to two dimensions theoretically.

Model

The second section of the paper is “Review of Markov models”. In this section, both Markov mesh models and spatial Markov chain models were reviewed. Let’s see how Huang et al. (2016a) made their reviews.

For Markov mesh models, Huang et al. (2016a) wrote their model formula as a conditional probability

$$\Pr \left( {f_{s} |f_{\eta_{s}} } \right)\, = \,\Pr \left( {f_{s} |f_{s - 1} ,f_{{s_{1} }} ,f_{{s_{2} }} , \ldots, f_{{s_{l} }} } \right),$$
(1)

and stated that “where s 1, s 2, …, s l is its nearest known locations of current cell s in different directions; s−1 is always the start cell of the Markov chain to the unknown cell s, which is to be estimated”. It is widely known that Markov mesh models are unilateral lattice models. Here Huang et al. (2016a) suddenly redefined them on surrounding nearest known locations as a geostatistical model (i.e. a model defined on spatial sample data). The purpose of their making such a misleading definition appeared later at the end of this section, where Huang et al. (2016a) stated that “A spatial Markov chain model may be viewed as a special case of Markov mesh models, whereas a Markov mesh model is an extension of spatial Markov chain models, called a generalized spatial Markov chain model”. Such a statement is absolutely irrational. Where did Huang et al. find such knowledge? Markov mesh models have existed for half century since 1960s (Abend et al. 1965). How could they generalize the MCRF model proposed in 2007? Here Huang et al. (2016a) turned so-called SMC models, including the CMC model of Elfeki and Dekking (2001), the MCRF model of Li (2007), and the TPROGS method according to their classification, all into special cases of Markov mesh models. Huang et al. (2016a) “generalized” all of these non-unilateral models into unilateral lattice models. Couldn’t Huang et al. understand the differences between unilateral models and non-unilateral models and the differences between lattice models and geostatistical models?

For spatial Markov chain models, Huang et al. (2016a) first presented a “generalized form” of the CMC model

$$\Pr \left( {f_{s} |f_{{s - 1}} ,f_{{s_{1} }} , \ldots, f_{{s_{l} }} } \right) = \frac{{p_{{f_{{s - 1}} f_{s} }} p_{{f_{{s_{1} }} f_{s} }}^{1} \ldots p_{{f_{{s_{l} }} f_{s} }}^{l} }}{{\sum\nolimits_{{f_{s} }} {p_{{f_{{s - 1}} f_{s} }} p_{{f_{{s_{1} }} f_{s} }}^{1} \ldots p_{{f_{{s_{l} }} f_{s} }}^{l} } }}$$
(2)

by mimicking the generalized form of the MCRF model simplified using the conditional independence assumption. While the CMC model of Elfeki and Dekking (2001) is composed of two 1-D Markov chains, Huang et al. (2016a) made it composed of l + 1 1-D Markov chains. What scientific problem did Huang et al. (2016a) attempt to solve by writing the CMC model like this? Then, Huang et al. (2016a) stated that “Using the conditional independence assumption, Li (2007) gives the general expression of the conditional probability formula in Eq. (1) at any location s as”

$$\Pr \left( {f_{s} |f_{s - 1} ,f_{{s_{1} }} , \ldots, f_{{s_{l} }} } \right)\, = \,\frac{{p_{{f_{s - 1} f_{s} }} p_{{f_{s} f_{s_1} }}^{1} \ldots p_{{f_{s} f_{s_l} }}^{l} }}{{\sum\nolimits_{{f_{s} }} {p_{{f_{s - 1} f_{s} }} p_{{f_{s} f_{s_1} }}^{1} \ldots p_{{f_{s} f_{s_l} }}^{l} } }}.$$
(3)

How simple is it? Could Huang et al. use the conditional independence assumption to get such a model without the derivation process of using Bayes’ theorem for factorizing the spatial conditional probability distribution function over a neighbourhood? How could one use such a model to conduct spatial simulation directly conditional on sample data without first defining the conditional probability function on sparse sample data?

The third section of the paper is “Spatial hidden Markov chain model”. Huang et al. (2016a) stated “The SHMC is an extension of SMC”. Which SMC? The CMC model or the TPROGS model? They stated “A spatial Markov chain … is characterized by its states and conditional probabilities through Eq. (2) or (3)…”. Thus, both the CMC model and the MCRF model were included, but the TPROGS model was not. They defined a local conditional probability distribution for their SHMC model as

$$\Pr \left( {f_{s} |f_{s - 1} ,f_{{s_{1} }} , \ldots, f_{{s_{l} }} ,w_{s} } \right)\, = \,\frac{{\Pr \left( {w_{s} |f_{s} } \right)\Pr \left( {f_{s} |f_{s - 1} ,f_{{s_{1} }} , \ldots ,f_{{s_{l} }} } \right)}}{{\sum\nolimits_{{f_{s} }} {\Pr \left( {w_{s} |f_{s} } \right)\Pr \left( {f_{s} |f_{s - 1} ,f_{{s_{1} }} , \ldots ,f_{{s_{l} }} } \right)} }}$$
(4)

But when they defined their prior conditional probabilities, they used the formula

$$\Pr \left( {f_{s} |f_{{s - 1}} ,f_{{s_{1} }} , \ldots, f_{{s_{l} }} } \right) = \frac{{p_{{f_{{s - 1}} f_{s} }} p_{{f_{s} f_{{s_{1} }} }}^{{(1)}} \ldots p_{{f_{s} f_{{s_{l} }} }}^{{(l)}} }}{{\sum\nolimits_{{f_{s} }} {p_{{f_{{s - 1}} f_{s} }} p_{{f_{s} f_{{s_{1} }} }}^{{(1)}} \ldots p_{{f_{s} f_{{s_{l} }} }}^{{(l)}} } }},$$
(5)

which is exactly the simplified MCRF model based on the conditional independence assumption provided in Li (2007), the same as above Eq. (3). Both a generalized MCRF model equation and specific MCRF models with nearest data only in cardinal directions were provided in Li (2007, p. 327–329, see “MCRF-based SMC models”). Note that the superscripts used in the MCRF model for directions were unnecessary because the subscript labels were already sufficient; we, therefore, didn’t use the superscripts anymore in MCRF model equations in our later publications after 2009.

In the above Eq. (4), their \({ \Pr }(w_{s} |f_{s} )\), that is, their emission probability, was called cross-field transition probability in Li et al. (2013, 2015). Because the coMCRF model may incorporate multiple auxiliary data sets, it may have multiple cross-field transition probability terms, which represent the cross-field relationships between the primary categorical field and multiple different auxiliary fields. Here by combining the above Eqs. (4) and (5) and using a cross-field transition probability (or their emission probability) expression \(b_{{f_{s} w_{s} }}\) for the conditional probability \({ \Pr }(w_{s} |f_{s} )\), the complete form of their so-called SHMC model is

$$\Pr \left( {f_{s} |f_{s - 1} ,f_{{s_{1} }} , \ldots ,f_{{s_{l} }} ,w_{s} } \right) = \frac{{b_{{f_{s} w_{s} }} p_{{f_{s - 1} f_{s} }} p_{{f_{s} f_{{s_{1} }} }} \ldots p_{{f_{s} f_{{s_{l} }} }} }}{{\mathop \sum \nolimits_{{f_{s} }} \left[ {b_{{f_{s} w_{s} }} p_{{f_{s - 1} f_{s} }} p_{{f_{s} f_{{s_{1} }} }} \ldots p_{{f_{s} f_{{s_{l} }} }} } \right]}} .$$
(6)

One can see that their SHMC model [i.e. Eq. (6)] is exactly the same as the colocated coMCRF model with one auxiliary data set presented in Li et al. (2013) and Li et al. (2015). For the sake of clarity, the colocated coMCRF model with one auxiliary data set in Li et al. (2013) is copied here:

$$p[i_{0} ({\mathbf{u}}_{0} )|i_{1} ({\mathbf{u}}_{1} ), \ldots ,i_{m} ({\mathbf{u}}_{m} );r_{0} ({\mathbf{u}}_{0} )]\, = \,\frac{{b_{{i_{0} r_{0} }} p_{{i_{1} i_{0} }} \left( {{\mathbf{h}}_{{10}} } \right)\prod\nolimits_{{g = 2}}^{m} {p_{{i_{0} i_{g} }} \left( {{\mathbf{h}}_{{0g}} } \right)} }}{{\sum\nolimits_{{f_{0} = 1}}^{n} {\left[ {b_{{f_{0} r_{0} }} p_{{i_{1} f_{0} }} \left( {{\mathbf{h}}_{{10}} } \right)\prod\nolimits_{{g = 2}}^{m} {p_{{f_{0} i_{g} }} \left( {{\mathbf{h}}_{{0g}} } \right)} } \right]} }}.$$
(7)

The colocated coMCRF model with one auxiliary data set used in Li et al. (2015) is also copied here:

$$p[i_{0} ({\mathbf{u}}_{0} )|i_{1} ({\mathbf{u}}_{1} ), \ldots ,i_{m} ({\mathbf{u}}_{m} );r_{0} ({\mathbf{u}}_{0} )]\, = \,\frac{{q_{{i_{0} r_{0} }} p_{{i_{1} i_{0} }} \left( {{\mathbf{h}}_{{10}} } \right)\prod\nolimits_{{g = 2}}^{m} {p_{{i_{0} i_{g} }} \left( {{\mathbf{h}}_{{0g}} } \right)} }}{{\sum\nolimits_{{f_{0} = 1}}^{n} {\left[ {q_{{f_{0} r_{0} }} p_{{i_{1} f_{0} }} \left( {{\mathbf{h}}_{{10}} } \right)\prod\nolimits_{{g = 2}}^{m} {p_{{f_{0} i_{g} }} \left( {{\mathbf{h}}_{{0g}} } \right)} } \right]} }}.$$
(8)

Please note the differences in symbols: the neighbourhood size for the MCRF model was denoted as m rather than l + 1, and the cross-field transition probability term was denoted as \(b_{{i_{0} r_{0} }}\) and \(q_{{i_{0} r_{0} }}\) in Li et al. (2013) and Li et al. (2015), respectively. Readers who have interests in the MCRF approach (i.e. Markov chain geostatistics) may read Li (2007), Li and Zhang (2008), and Li et al. (2013, 2015) for details.

Now it is clear that both claiming the CMC model, the MCRF model, and the TPROGS model all as SMC models and messing up these models with Markov mesh models through wrong statements were just tricks played by Huang ret al. (2016a).

Case study

The fourth section of Huang et al. (2016a) is “Case study”. Let’s see how they did their simulation. They estimated experimental transiograms from 509 data points and fitted all of them with exponential models. They did not use our transiogram joint modelling methods (Li and Zhang 2010), nor use the transition rate method for continuous-lag Markov chain models (also a transition probability joint modelling method) suggested by Carle and Fogg (1997). How did they guarantee their transiogram models (see Huang et al. 2016a, figure 7) sum to unity in each row? Without satisfying this condition, how could they properly implement their simulations using transiogram models?

What are unbelievable are their simulated results provided in the figure 10 in Huang et al. (2016a) (see below Fig. 1). Huang et al. (2016a) claimed that these results were generated by the “SMC” defined by Eq. (5), which is the simplified MCRF model based on the conditional independence assumption. These results, however, have no way to be the results produced by the simplified MCRF model. The authors also pointed out some problems about the results, as they stated “It is obvious that conditional data have played a role in controlling the distribution of lithofacies near the wells. However, the further counterparts are fragmented and random in the grid”. We are afraid that the true problem is not fragmentation or randomness. The true problem with the results should be that small classes were strongly underestimated. These results are very similar to the results generated by the unconditional CMC model (see Elfeki and Dekking 2001). This means that Huang et al. (2016a) used the unconditional CMC model to conduct the simulations. It is not surprising because the CMC model used in case studies was always unconditional (to sample data ahead) in their earlier publications (see Liang et al. 2014, Huang et al. 2016b).

Fig. 1
figure 1

Four stochastic simulation results based on three wells using SMC (from Huang et al. 2016a, figure 10)

Conclusions

Huang et al. (2016a) claimed “We have presented a SHMC model for geological facies modelling. This combines spatial Markov chain theory and Bayes estimation. We have adopted the specification of earlier published hidden Markov models. SHMC is based on neighbourhood and cliques and has a solid theoretical foundation”. These claims are wrong. What Huang et al. (2016a) presented and claimed as a new model is exactly the colocated coMCRF model with one auxiliary data set proposed and published by Li and his coauthors. What they did was just ignoring the existence of the coMCRF model and giving it a different interpretation. They did not solve any scientific issues. More surprisingly, in their case study, Huang et al. (2016a) actually implemented the CMC model without conditioning to borehole data ahead, but claimed they implemented the MCRF model.

Apparently by misrepresentation, Huang and his coauthors not only claimed the coMCRF model as their new model but also caused some confusion on the whole MCRF approach. According to their logic, cokriging could be called “hidden kriging” and reproposed as a new model, and indicator cokriging could be called “hidden indicator kriging” and proposed as a new model. In summary, Huang et al. (2016a) have mislead their readers. It’s our responsibility to clarify their errors and imporper claims.