Abstract
Spatial data are widely used in many scientific and engineering fields, such as remote sensing, environment monitoring, weather forecast and mineral exploitation. However, direct measurements of such spatial data sometimes are difficult to achieve due to the expensive cost of equipment or current limited technology, so stochastic reconstruction or simulation of spatial data are necessary based on the principles of statistics. As a typical statistical modeling method, multiple-point statistics (MPS) has been successfully used for stochastic reconstruction by reproducing the features from training images (TIs) to the reconstructed regions. However, because these features mostly have intrinsic nonlinear relations, the traditional MPS methods using linear dimensionality reduction are not suitable to deal with the nonlinear situation. In this paper a new method using locally linear embedding (LLE) and MPS is proposed to resolve this issue. As a classical nonlinear method of dimensionality reduction in manifold learning, LLE is combined with MPS to reduce redundant data of TIs so that the subsequent reconstruction can be faster and more accurate. The tests are performed in both 2D and 3D reconstructions, showing that the reconstructions can reproduce the structural features of TIs and the proposed method has its advantages in reconstruction speed and quality over typical methods using linear dimensionality reduction.
Similar content being viewed by others
References
Arpat GB, Caers J (2007) Conditional simulation with patterns. Math Geol 39(2):177–203
Bezdek JC (1974) Numerical taxonomy with fuzzy set. J of Math Biol 1(1):57–71
Bierkens M (2006) Designing a monitoring network for detecting groundwater pollution with stochastic simulation and a cost model. Stoch Env Res Risk Assess 20(5):335–351
Cheng K, Hou J, Liou J, Wu Y, Chiang J (2011) Stochastic simulation of bivariate gamma distribution: a frequency-factor based approach. Stoch Env Res Risk Assess 25(2):107–122
Cheon YJ, Kim DJ (2009) Natural facial expression recognition using differential-AAM and manifold learning. Pattern Recogn 42:1340–1350
Comunian A, Renard P, Straubhaar J (2012) 3D multiple-point statistics simulation using 2D training images. Comput Geosci 40:49–65
Dujardin B (2007) Sensitivity analysis of FILTERSIM and histogram reproduction. M.S. thesis, Stanford University, Stanford
Dunn JC (1974) Well separated clusters and optimal fuzzy partitions. J Cybern 4(1):95–104
Ester M, Kriegel HP, Sander J, Xu XW (1996) A density-based algorithm for discovering clusters in large spatial databases. Knowledge Discovery and Data Mining(KDD’96) AAAI Press, Menlo Park
Goovaerts P (1997) Geostatistics for natural resources evaluation. Oxford University Press, New York
Guardiano F, Srivastava RM (1993) Multivariate geostatistics: Beyond bivariate moments. In: Soares A (ed) Geostatistics-Troia. Kluwer, Dordrecht, pp 133–144
Han Y, Xu Z, Ma Z, Huang Z (2013) Image classification with manifold learning for out-of-sample data. Sig Process 93:2169–2177
Honarkhah M (2011) Stochastic simulation of patterns using distance-based pattern modeling. Ph.D. Dissertation, Stanford University, Stanford
Honarkhah M, Caers J (2010) Stochastic simulation of patterns using distance-based pattern modeling. Math Geosci 42:487–517
Hubbard SS, Chen JS, Peterson J, Majer EL, Williams KH, Swift DJ, Mailloux B, Rubin Y (2001) Hydrogeological characterization of the South Oyster Bacterial Transport Site using geophysical data. Water Resour Res 37(10):2431–2456
Jordan DW, Smith P (2007) Nonlinear ordinary differential equations, 4th edn. Oxford University Press, New York
Journel AG (1974) Geostatistics for conditional simulation of ore bodies. Econ Geol 69(5):673–687
Journel A, Huijbregts CJ (1978) Mining geostatistics. Academic Press, New York
Kaufman L, Rousseeuw PJ (1990) Finding groups in data: an introduction to cluster analysis. John Wiley & Sons, New York
Ketchen DJ, Shook CL (1996) The application of cluster analysis in strategic management research: an analysis and critique. Strateg Manag J 17(6):441–458
Levina E, Bickel PJ (2005) Maximum likelihood estimation of intrinsic dimension. In: Proceedings of neural information processing systems (NIPS 2005), pp 777–784
Mariethoz G, Renard P (2010) Reconstruction of incomplete data sets or images using direct sampling. Math Geosci 42:245–268
Matheron G (1973) The intrinsic random functions and their applications. Adv Appl Probab 5(3):439–468
Orsenigo C, Vercellis C (2013) A comparative study of nonlinear manifold learning methods for cancer microarray data classification. Expert Syst Appl 40:2189–2197
Parra A, Ortiz JM (2011) Adapting a texture synthesis algorithm for conditional multiple point geostatistical simulation. Stoch Env Res Risk Assess 25:1101–1111
Peled SH, Mazumdar S (2004) On coresets for k-means and k-median clustering. In: Proceeding of the thirty-sixth annual ACM symposium on Theory of Computing, ACM, New York, pp 291–300
Renard P, Gómez-Hernández J, Ezzedine S (2005) Characterization of porous and fractured media. In: Encyclopedia of hydrological sciences. Wiley, New York
Rezaee H, Asghari O, Koneshloo M, Ortiz JM (2014) Multiple-point geostatistical simulation of dykes: application at Sungun porphyry copper system, Iran. Stoch Environ Res Risk Assess 28(7):1913–1927
Rezaee H, Marcotte D, Tahmasebi P, Saucier A (2015) Multiple-point geostatistical simulation using enriched pattern databases. Stoch Env Res Risk Assess 29(3):893–913
Rousseeuw PJ (1987) Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. Comput Appl Math 20:53–65
Roweis S, Saul L (2000) Nonlinear dimensionality reduction by locally linear embedding. Science 290(22):2323–2326
Sattari TM, Apaydin H, Ozturk F (2009) Operation analysis of Eleviyan irrigation reservoir dam by optimization and stochastic simulation. Stoch Env Res Risk Assess 23(8):1187–1201
Shannon CE (1948) A mathematical theory of communication. Bell Syst Tech J 27:379–423
Strebelle SB (2000) Sequential Simulation drawing structures from training images. Ph.D. Dissertation, Stanford University, Stanford
Strebelle SB (2002) Conditional simulation of complex geological structures using multiple-point statistics. Math Geol 34(1):1–21
Tenenbaum BJ, Silva V, Langford CJ (2000) A global geometric framework for nonlinear dimensionality reduction. Science 290(22):2319–2323
Vladimir EC (2002) Why so many clustering algorithms—a position paper. ACM SIGKDD Explor Newslett 4:65–75
Wang G, Gertner GZ, Anderson AB (2004) Spatial-variability-based algorithms for scaling-up spatial data and uncertainties. IEEE Trans Geosci Remote Sens 42(9):2004–2015
Xie XL, Beni G (1991) A validity measure for fuzzy clustering. IEEE Trans Pattern Anal Mach Intel 13(8):841–847
Zhang TF (2006) Filter-based training pattern classification for spatial pattern simulation. Ph.D. Dissertation, Stanford University, Stanford
Zhang TF, Switzer P, Journel A (2006) Filter-based classification of training image patterns for spatial simulation. Math Geol 38:63–80
Zhang T, Du Y, Huang T, Li X (2015a) GPU-accelerated 3D reconstruction of porous media using multiple-point statistics. Comput Geosci 19:79–98
Zhang T, Du Y, Huang T, Li X (2015b) Reconstruction of porous media using multiple-point statistics with data conditioning. Stoch Env Res Risk Assess 29(3):727–738
Acknowledgments
We really appreciate the reviewers for the valuable comments and suggestions on our manuscript. This work is supported by National Program on Key Basic Research Project of China (973 Program, No. 2011CB707305), the National Science and Technology Major Project (No. 2011ZX05009-006), CAS Strategic Priority Research Program (XDB10030402), Talented People Introduction Foundation of Shanghai University of Electric Power (No. K2012-004, K2013-019, K2014-020), and the Excellent University Young Teachers Training Program of Shanghai Municipal Education Commission (ZZsdl13015).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Zhang, T., Du, Y., Li, B. et al. Stochastic reconstruction of spatial data using LLE and MPS. Stoch Environ Res Risk Assess 31, 243–256 (2017). https://doi.org/10.1007/s00477-015-1197-z
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00477-015-1197-z