Abstract
This paper presents a statistical model for stationary ergodic point processes, estimated from a single realization observed in a square window. With existing approaches in stochastic geometry, it is very difficult to model processes with complex geometries formed by a large number of particles. Inspired by recent works on gradient descent algorithms for sampling maximum-entropy models, we describe a model that allows for fast sampling of new configurations reproducing the statistics of the given observation. Starting from an initial random configuration, its particles are moved according to the gradient of an energy, in order to match a set of prescribed moments (functionals). Our moments are defined via a phase harmonic operator on the wavelet transform of point patterns. They allow one to capture multi-scale interactions between the particles, while controlling explicitly the number of moments by the scales of the structures to model. We present numerical experiments on point processes with various geometric structures, and assess the quality of the model by spectral and topological data analysis.
Similar content being viewed by others
Data availability of data and material
For the sake of transparency, we are ready to make available the data used to produce the results in our paper.
Code availability
For the sake of transparency, we are ready to make available the code used to produce the results in our paper.
Notes
More precisely, where the wavelet norm is non negligible
In our case we use the periodic metric.
References
Baccelli, F., Woo, J.O.: On the entropy and mutual information of point processes. In: 2016 IEEE International Symposium on Information Theory (ISIT), pp. 695–699. IEEE (2016)
Baddeley, A., Jammalamadaka, A., Nair, G.: Multitype point process analysis of spines on the dendrite network of a neuron. J. Roy. Stat. Soc. Ser. C Appl. Stat. 673–694 (2014)
Bartlett, M.S.: The spectral analysis of two-dimensional point processes. Biometrika 51(3/4), 299–311 (1964)
Boissonnat, J.D., Chazal, F., Yvinec, M.: Geometric and Topological Inference, vol. 57. Cambridge University Press, Cambridge (2018)
Brémaud, P.: Mathematical Principles of Signal Processing: Fourier and Wavelet Analysis. Springer Science & Business Media (2002)
Brochard, A., Błaszczyszyn, B., Mallat, S., Zhang. S.: Particle gradient descent model for point process generation (2020). arXiv preprint arXiv:201014928
Brumwell, X., Sinz, P., Kim, K.J., Qi, Y., Hirn, M.: Steerable Wavelet Scattering for 3D Atomic Systems with Application to Li-Si Energy Prediction (2018). arXiv preprint arXiv:181202320
Bruna, J., Mallat, S.: Multiscale sparse microcanonical models. Math. Stat. Learn. 1(3), 257–315 (2019)
Chazal, F., Michel, B.: An introduction to topological data analysis: fundamental and practical aspects for data scientists (2017). arXiv preprint arXiv:171004019
Chenouard, N., Unser, M.: 3D steerable wavelets and monogenic analysis for bioimaging. In: 2011 IEEE International Symposium on Biomedical Imaging: From Nano to Macro, pp. 2132–2135 (2011)
Chiu, S.N., Stoyan, D., Kendall, W.S., Mecke, J.: Stochastic Geometry and Its Applications. John Wiley & Sons (2013)
Daley, D.J., Vere-Jones, D.: An Introduction to the Theory of Point Processes, vol. II, Probability and Its Applications, vol 2. Springer New York, New York, NY (2008)
Dereudre, D.: Introduction to the theory of gibbs point processes. In: Stochastic Geometry, pp. 181–229. Springer (2019)
Diggle, P.J., Eglen, S.J., Troy, J.B.: Modelling the bivariate spatial distribution of amacrine cells. In: Case Studies in Spatial Point Process Modeling, pp 215–233. Springer (2006)
Ducasse, L., Pumir, A.: Intermittent particle distribution in synthetic free-surface turbulent flows. Phys. Rev. E 77(6), 066304 (2008)
Ducasse, L., Pumir, A.: Inertial particle collisions in turbulent synthetic flows: quantifying the sling effect. Phys. Rev. E 80(6), 066312 (2009)
Efron, B., Tibshirani, R.J.: An Introduction to the Bootstrap. CRC Press (1994)
Fasy BT, Kim J, Lecci F, Maria, C., Rouvreau, V.: TDA: statistical tools for topological data analysis (2014). Software available at Fasy, B.T., Kim, J., Lecci, F., Maria, C., Rouvreau, V.: TDA: statistical tools for topological data analysis (2014). https://cran.r-project.org/package=TDA
Gatys, L., Ecker, A.S., Bethge, M.: Texture synthesis using convolutional neural networks. Adv. Neural. Inf. Process. Syst. 28, 262–270 (2015)
Geman, S., Geman, D.: Stochastic relaxation, gibbs distributions, and the bayesian restoration of images. IEEE Trans. Pattern Anal. Mach. Intell. 6(6), 721–741 (1984)
Illian, J., Penttinen, A., Stoyan, H., Stoyan, D.: Statistical Analysis and Modelling of Spatial Point Patterns, vol. 70. John Wiley & Sons (2008)
Jaynes, E.T.: Information theory and statistical mechanics. Phys. Rev. 106(4), 620 (1957)
Koňasová, K., Dvořák, J.: Stochastic reconstruction for inhomogeneous point patterns. Methodol. Comput. Appl. Prob. 23(2), 527–547 (2021)
Liu, D.C., Nocedal, J.: On the limited memory bfgs method for large scale optimization. Math. Program. 45(1), 503–528 (1989)
Mallat, S.: A Wavelet Tour of Signal Processing: The Sparse Way, 3rd edn. Academic Press (2001)
Mallat, S., Zhang, S., Rochette, G.: Phase harmonic correlations and convolutional neural networks. Inf. Infer. J. IMA 9(3), 721–747 (2020)
Matsuda, K., Onishi, R.: Turbulent enhancement of radar reflectivity factor for polydisperse cloud droplets. Atmos. Chem. Phys. 19(3), 1785–1799 (2019)
Molchanov, I., Zuyev, S.: Steepest descent algorithms in a space of measures. Stat. Comput. 12(2), 115–123 (2002)
Müller, R., Schuhmacher, D., Mateu, J.: Metrics and barycenters for point pattern data. Stat. Comput. 30(4), 953–972 (2020)
Oujia, T., Matsuda, K., Schneider, K.: Divergence and convergence of inertial particles in high-reynolds-number turbulence. J. Fluid Mech. 905 (2020)
Paszke, A., Gross, S., Massa, F., Lerer, A., Bradbury, J., Chanan, G., Killeen, T., Lin, Z., Gimelshein, N., Antiga, L., et al.: Pytorch: an imperative style, high-performance deep learning library. Adv. Neural. Inf. Process. Syst. 32, 8026–8037 (2019)
Portilla, J., Simoncelli, E.P.: A parametric texture model based on joint statistics of complex wavelet coefficients. Int. J. Comput. Vision 40(1), 49–70 (2000)
Rabin, J., Peyré, G., Delon, J., Bernot, M.: Wasserstein barycenter and its application to texture mixing. In: International Conference on Scale Space and Variational Methods in Computer Vision, Springer, pp. 435–446 (2011)
Rajala, T., Redenbach, C., Särkkä, A., Sormani, M.: A review on anisotropy analysis of spatial point patterns. Spatial Stat. 28, 141–168 (2018)
Schneider, K., Ziuber, J., Farge, M., Azzalini, A.: Coherent vortex extraction and simulation of 2d isotropic turbulence. J. Turbul. 7(44), N44 (2006)
Skare, Ø., Møller, J., Vedel Jensen, E.B.: Bayesian analysis of spatial point processes in the neighbourhood of voronoi networks. Stat. Comput. 17(4), 369–379 (2007)
Stoica, R.S., Martinez, V.J., Mateu, J., Saar, E.: Detection of cosmic filaments using the candy model. Astron. Astrophys. 434(2), 423–432 (2005)
Stoyan, D., Stoyan, H.: Fractals, random shapes and point fields: methods of geometrical statistics, vol 302. Wiley-Blackwell (1994)
Tempel, E., Stoica, R.S., Kipper, R., Saar, E.: Bisous model-detecting filamentary patterns in point processes. Astron. Comput. 16, 17–25 (2016)
Torquato, S.: Random Heterogeneous Materials: Microstructure and Macroscopic Properties. Springer, New York (2002)
Tscheschel, A., Stoyan, D.: Statistical reconstruction of random point patterns. Comput. Stat. Data Anal. 51(2), 859–871 (2006)
Wadhwa, R.R., Williamson, D.F., Dhawan, A., Scott, J.G.: TDAstats: R pipeline for computing persistent homology in topological data analysis. J. Open Source Softw. 3(28), 860 (2018)
Wiegand, T., Moloney, K.A.: Handbook of Spatial Point-Pattern Analysis in Ecology. CRC Press (2013)
Zhang, G., Stillinger, F.H., Torquato, S.: Ground states of stealthy hyperuniform potentials: I. entropically favored configurations. Phys. Rev. E 92(2), 22119 (2015a)
Zhang, G., Stillinger, F.H., Torquato, S. Ground states of stealthy hyperuniform potentials. ii. stacked-slider phases. Phys. Rev. E 92(2), 022120 (2015b)
Zhang, S., Mallat, S.: Maximum entropy models from phase harmonic covariances. Appl. Comput. Harmon. Anal. 53, 199–230 (2021)
Funding
This work was partly supported by the PRAIRIE 3IA Institute of the French ANR-19-P3IA-0001 program. Bartłomiej Błaszczyszyn’s work was partly supported by the European Research Council (ERC-NEMO-788851). Sixin Zhang was supported by the European Research Council (ERC FACTORY-CoG-6681839). Part of this work was done when Sixin Zhang was a postdoctoral researcher at ENS Paris, France.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
Appendix A: Proof of Theorem 1
In order to prove Theorem 1, we need to formally define Eq. (9). Recall that in this section and in what follows, \(W_s\) is interpreted as endowed with the addition and scalar multiplication modulo \(W_s\).
For \(\mu \in \mathbb M^s\) and any \(x \in \text {Supp}(\mu )\), we define the following functions:
The function \(K_{x}^{\mu }\) can be complex valued. However, as our energy function is the square Euclidean norm, it is equivalent to consider that \(K_{x}^{\mu }\) has values in \(\mathbb R^{2d}\). Moreover, we assume in what follows that the function K is such that for all \(\mu \in \mathbb M^s\) and all \(x \in \text {Supp}(\mu )\), \(K_x^\mu \) is differentiable. We can then define from chain rule, for any \(\mu \in \mathbb M^s\) and any \(x \in W_s\)
and
where Jac[f] denotes the Jacobian matrix of the function f. When \(x \in \text {Supp}(\mu )\), the chain-rule gives \(Jac[E_x^\mu ](0) = (\nabla _xK(\mu ))^t(K(\mu )-K(\bar{\phi }))\). We can now give the proof of Theorem 1.
Proof
We are going to show that if \(\Phi _n\) follows a distribution invariant to T, then \(\Phi _{n+1} = G_{\bar{\phi }} (\Phi _n)\) also follows a distribution that is invariant to T. The gradient descent procedure thus produces a sequences of measures \(\Phi _n\) that are all invariant to T because the initial random measure \(\Phi _0\) is invariant to T.
Denote \(G_{\bar{\phi }} (\mu )\) by the measure configuration transported from \(\mu \), by performing one gradient-descent step on the energy \(E_{\bar{\phi }}\). More precisely, for \(\mu = \sum _i \delta _{x_i}\), we define for a fixed \(\gamma >0\), the gradient-descent step by
For any transform \(T x = Ax + b\) on \( W_s\), where A is an orthogonal matrix A with entries in \(\{-1,0,1\}\), and \(b\in W_s\). As A is a linear transformation on the torus \( W_s\), \(\forall x , y \in W_s\), \(A(x+y) = Ax + Ay\). We shall first prove that
Let \(y_i = T x_i\), then by definition,
and
We are going to show that for each i-th particle, \(y_i - \gamma \nabla _{y_i} E_{\bar{\phi }} (T_{\#} \mu ) = T( x_i - \gamma \nabla _{x_i} E_{\bar{\phi }} (\mu ) )\). This implies that (20) is correct. The key is to show that
which will imply that \(\forall i\),
To show (21), we recall that by the definitions in Sect. 3.1,
with \(Jac(K_{y_i}^{T_{\#} \mu })(0) = Jac(K_{y_i}^{T_{\#} \mu } \circ A \circ A^{-1})(0) = Jac(K\circ h_{y_i}^{T_{\#}\mu } \circ A)(0)A^{-1}.\) Furthermore, \(\forall x \in W_s\),
where we used the fact that T is affine, and the invariance of K w.r.t. T. The equality in (21) follows directly from (22), (23), (24) and the fact that \(A^{-1} = A^t\). From (21), we conclude that (20) holds.
Based on (20), it remains to show that for any Borel set \(\Gamma \) on \( W_s\), \( P ( \Phi _n \in T_{\#}^{-1} \Gamma ) = P ( \Phi _n \in \Gamma )\). This can be shown by induction, since by assumption it holds at \(n=0\): assume now that this statement holds at \(n \ge 0\), then we have,
The second last equality in (26) is due to the invariance of \(\Phi _n\), i.e.
\(\square \)
Appendix B: Fourier spectrum and power spectrum
We define the discrete Fourier transform (DFT) \(F_m(\mu )\) of a counting measure \(\mu =\sum _u\delta _{x_u}\in \mathbb M^s\) on the (square) window \([-s,s[^2\) at integer frequency \(m \in \mathbb Z^2 \) by
Observe, \(F_m(\mu )\) at frequency \(m=(0,0)\) specifies the number of points of the measure \(\mu \) on \( W_s\). The empirical Fourier spectrum (or power spectrum ) is often defined by taking the square modulus of the Fourier coefficients \(F_m(\mu )\); \(U_m(\mu ):=|F_m(\mu )|^2 \). Note that \(|F_m(\mu )|^2\), and consequently \(U_m(\mu )\) is invariant with respect to (circular) translations of \(\mu \) on \(W_s\). By selecting the frequencies in a limited range \(m \in \Gamma _F \subset \mathbb Z^2\), one obtains a translation-invariant Fourier spectrum. As we shall focus on isotropic point processes, we further reduce the variance of our statistics by averaging Fourier coefficients along frequency orientations. More precisely, let us define \({\tilde{\Gamma }}_F := \{ \lfloor |m| \rfloor , \, m \in \Gamma _F \}\). For each \(k\in \tilde{\Gamma }_F\), we define \({\tilde{U}}_k(\mu ) := \frac{1}{\#k}\sum _{\begin{array}{c} m \in \Gamma _F \\ \lfloor |m| \rfloor = k \end{array}} U_m(\mu )\), where \(\#k\) denotes the cardinal of \(\{ m \in \Gamma _F : \lfloor |m| \rfloor = k \}\). The radial power spectrum P(k) is the expectation of \({\tilde{U}}_k(\mu ) \) for \(k \in N = \{ 1,2,3,\ldots \} \) when \(\mu \) follows some distribution, divided by the intensity of the process (estimated over 10 realizations).
Appendix C: Relaxing the assumptions on the data
In this paper, in order to present our model in a simple setting, strong theoretical assumptions have been made on the data. However, in real world applications, the data will most likely not satisfy these assumptions. This sections presents ideas on how to adapt our model in such cases.
Non-periodic boundaries Recall that our descriptor, defined in (14), applies periodic boundary correction to point patterns in a square window. If the structure of the observed pattern is not periodic, one can modify the descriptor by applying non-periodic integrals in (14) over some smaller window. In particular, we suggest a scale-dependent reduction of the integration window, pertinent when the wavelet \(\psi \) has a compact (or approximately compact) spatial support. Specifically, we consider a new descriptor \({\tilde{K}}\) by considering the integrals in (14) with \(i=(\lambda ,k,\lambda ',k', \tau ') \in \Gamma _H\) over smaller windows \(W_{s_i}\subset W_s\), such that boundary effects are negligible. Our current software can also handle such non-periodic boundary conditions.
More general observation windows In this paper, we considered that the observed pattern lies in a square observation window. If this is not the case, one could use a similar idea to the non-periodic case: embed the observation window in a square window and considering integrals in (14) over the observation window.
Non stationary process In Koňasová and Dvořák (2021), the authors focus on building a model for non stationary point processes inspired by Tscheschel and Stoyan (2006). Similarly, one might adapt our method to model non stationary processes. This could be done by modifying two aspects of the method. First, the initial distribution \(\Phi _0\) (cf. Sect. 3.1) could be chosen as a non stationary Poisson point process, estimating the intensity with a kernel estimator, such as in Koňasová and Dvořák (2021). In addition, as pointed out in Koňasová and Dvořák (2021), the descriptor should be adapted not to be translation invariant. This could be done, for example, by applying local integrals over patches of the observation window in (14) (this requires some notion of “local stationarity” of the process). Another method that may be useful in this scenario is the regularization proposed in Brochard et al. (2020), where a regularization term is added to the energy. This term consists of the (Sliced Wasserstein) distance (Rabin et al. 2011) between the initial configuration and the current configuration (the one being optimized). By adding this regularization term to the energy, the points of the configuration are forced not to move too far away from the initial configuration, which could help preserve the non stationarity of the initial distribution in the distribution of the model.
Processes in other dimensions While we focus in this paper on planar point processes, our approach can readily be extended to any dimensions. To model point processes in other dimensions such as 1d or 3d, one can consider similar type of wavelets proposed in the literature (Chenouard and Unser 2011; Brumwell et al. 2018).
Appendix D: List of important parameters of our model
In Table 6, we discuss the main parameters of our model in three categories. The first two categories are the parameters that are relatively standard to consider in most existing methods such as Tscheschel and Stoyan (2006). The third category is more specific to our model, which involves the discretization step, and the final blurring step.
Rights and permissions
About this article
Cite this article
Brochard, A., Błaszczyszyn, B., Zhang, S. et al. Particle gradient descent model for point process generation. Stat Comput 32, 49 (2022). https://doi.org/10.1007/s11222-022-10099-x
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11222-022-10099-x