Abstract
This paper introduces a method to generate conditional categorical simulations, given an ensemble of partially conditioned (or unconditional) categorical simulations derived from any simulation process. The proposed conditioning method relies on implicit functions (signed distance functions) for representing the categorical spatial variable of interest. Thus, the conditioning problem is reformulated in terms of signed distance functions. The proposed approach combines aspects of principal component analysis and Gibbs sampling to achieve the conditioning of the unconditional categorical realizations to the data. It is applied to synthetic and real-world datasets and compared to the traditional sequential indicator simulation. It appears that the proposed simulation technique is an effective method to generate conditional categorical simulations from a set of unconditional categorical simulations.
Similar content being viewed by others
References
Armstrong M, Galli A, Beucher H, Loc’h G, Renard D, Doligez B, Eschard R, Geffroy F (2011) Plurigaussian simulations in geosciences. Springer, Berlin
Arpat GB, Caers J (2007) Conditional simulation with patterns. Math Geol 39(2):177–203
Bogaert P, Gengler S (2018) Bayesian maximum entropy and data fusion for processing qualitative data: theory and application for crowdsourced cropland occurrences in ethiopia. Stoch Env Res Risk Assess 32(3):815–831
Borg I, Groenen P (2007) Modern multidimensional scaling: theory and applications. Springer Series in Statistics. Springer, New York
Breunig MM, Kriegel H-P, Ng RT, Sander J (2000) Lof: identifying density-based local outliers. In: Proceedings of the 2000 ACM SIGMOD international conference on management of data, pp 93–104
Buhmann M (2003) Radial basis functions: theory and implementations. Cambridge monographs on applied and computational mathematics. Cambridge University Press, Cambridge
Chiles J-P, Delfiner P (2012) Geostatistics: modeling spatial uncertainty. Wiley, New York
Daly C (2005) Higher Order models using entropy. Markov random fields and sequential simulation. Springer, Dordrecht, pp 215–224
Davies E (2012) Chapter 9—binary shape analysis. In: Davies E (ed) Computer and machine vision, 4th edn. Academic Press, Boston, pp 229–265
Deutsch C (2002) Geostatistical reservoir modelling. Oxford University Press, Oxford
Deutsch CV (1998) Cleaning categorical variable (lithofacies) realizations with maximum a-posteriori selection. Comput Geosci 24(6):551–562
Deutsch CV (2006) A sequential indicator simulation program for categorical variables with point and block data: blocksis. Comput Geosci 32(10):1669–1681
Elfeki A, Dekking M (2001) A markov chain model for subsurface characterization: theory and applications. Math Geol 33(5):569–589
Emery X (2007) Simulation of geological domains using the plurigaussian model: new developments and computer programs. Comput Geosci 33(9):1189–1201
Goovaerts P (1998) Geostatistics for natural resources evaluation. Oxford University Press, Oxford
Grevera GJ (2007) Distance transform algorithms and their implementation and evaluation. Springer, New York, pp 33–60
Honarkhah M, Caers J (2010) Stochastic simulation of patterns using distance-based pattern modeling. Math Geosci 42(5):487–517
Horrace WC (2005) Some results on the multivariate truncated normal distribution. J Multivar Anal 94(1):209–221
Hubert M, Debruyne M, Rousseeuw PJ (2018) Minimum covariance determinant and extensions. WIREs Comput Stat 10(3):e1421
Journel AG (1983) Nonparametric estimation of spatial distributions. J Int Assoc Math Geol 15(3):445–468
Lantuejoul C (2002) Geostatistical simulation: models and algorithms. Springer, New York
Li W (2007) Markov chain random fields for estimation of categorical variables. Math Geol 39(3):321–335
Li Y, Ghosh SK (2015) Efficient sampling methods for truncated multivariate normal and student-t distributions subject to linear inequality constraints. J Stat Theory Pract 9(4):712–732
Liu FT, Ting KM, Zhou Z (2008) Isolation forest. In: 2008 Eighth IEEE international conference on data mining, pp 413–422
Mariethoz G, Caers J (2014) Multiple-point geostatistics: stochastic modeling with training images. Willey, New York
Mariethoz G, Renard P, Straubhaar J (2010) The direct sampling method to perform multiple-point geostatistical simulations. Water Resour Res 46(11):1–14
Osher S, Fedkiw R (2002) Level set methods and dynamic implicit surfaces. Applied mathematical sciences. Springer, New York
R Core Team (2020) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org/. Accessed 17 July 2020
Safa M, Soltani-Mohammadi S (2018) Distance function modeling in optimally locating additional boreholes. Spatial Stat 23:17–35
Scheidt C, Li L, Caers J (2018) Quantifying uncertainty in subsurface systems. Geophysical monograph series. Wiley, New York
Schölkopf B, Platt JC, Shawe-Taylor J, Smola AJ, Williamson RC (2001) Estimating the support of a high-dimensional distribution. Neural Comput 13(7):1443–1471
Strebelle S (2002) Conditional simulation of complex geological structures using multiple-point statistics. Math Geol 34(1):1–21
Tjelmeland H, Besag J (1998) Markov random fields with higher-order interactions. Scand J Stat 25(3):415–433
Vanderbei R (2013) Linear programming: foundations and extensions. International series in operations research & management science. Springer, New York
Yang L, Hyde D, Grujic O, Scheidt C, Caers J (2019) Assessing and visualizing uncertainty of 3D geological surfaces using level sets with stochastic motion. Comput Geosci 122:54–67
Zhang T, Switzer P, Journel A (2006) Filter-based classification of training image patterns for spatial simulation. Math Geol 38(1):63–80
Zhou Y, Zhang W, Zhu J, Xu Z (2016) Feature-driven topology optimization method with signed distance function. Comput Methods Appl Mech Eng 310:1–32
Žukovič M, Hristopulos DT (2009) Classification of missing values in spatial data using spin models. Phys Rev E 80:011116
Acknowledgements
We gratefully acknowledge the funding provided by BHP to support this research. We are grateful to the anonymous reviewers for their helpful and constructive comments that greatly helped improve the manuscript.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Fouedjio, F., Scheidt, C., Yang, L. et al. Conditional simulation of categorical spatial variables using Gibbs sampling of a truncated multivariate normal distribution subject to linear inequality constraints. Stoch Environ Res Risk Assess 35, 457–480 (2021). https://doi.org/10.1007/s00477-020-01925-7
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00477-020-01925-7