当前位置: X-MOL 学术J. Am. Stat. Assoc. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Highly Scalable Bayesian Geostatistical Modeling via Meshed Gaussian Processes on Partitioned Domains
Journal of the American Statistical Association ( IF 3.0 ) Pub Date : 2020-11-24 , DOI: 10.1080/01621459.2020.1833889
Michele Peruzzi 1, 2 , Sudipto Banerjee 3 , Andrew O Finley 1
Affiliation  

Abstract

We introduce a class of scalable Bayesian hierarchical models for the analysis of massive geostatistical datasets. The underlying idea combines ideas on high-dimensional geostatistics by partitioning the spatial domain and modeling the regions in the partition using a sparsity-inducing directed acyclic graph (DAG). We extend the model over the DAG to a well-defined spatial process, which we call the meshed Gaussian process (MGP). A major contribution is the development of an MGPs on tessellated domains, accompanied by a Gibbs sampler for the efficient recovery of spatial random effects. In particular, the cubic MGP (Q-MGP) can harness high-performance computing resources by executing all large-scale operations in parallel within the Gibbs sampler, improving mixing and computing time compared to sequential updating schemes. Unlike some existing models for large spatial data, a Q-MGP facilitates massive caching of expensive matrix operations, making it particularly apt in dealing with spatiotemporal remote-sensing data. We compare Q-MGPs with large synthetic and real world data against state-of-the-art methods. We also illustrate using Normalized Difference Vegetation Index data from the Serengeti park region to recover latent multivariate spatiotemporal random effects at millions of locations. The source code is available at github.com/mkln/meshgp. Supplementary materials for this article are available online.



中文翻译:

通过分区域上的网格高斯过程进行高度可扩展的贝叶斯地统计建模

摘要

我们引入了一类可扩展的贝叶斯层次模型,用于分析海量地统计数据集。其基本思想通过对空间域进行分区并使用稀疏诱导有向无环图 (DAG) 对分区中的区域进行建模,从而结合了高维地质统计学的思想。我们将 DAG 上的模型扩展到定义明确的空间过程,我们称之为网格高斯过程 (MGP)。一个主要贡献是在棋盘格域上开发了 MGP,并配有 Gibbs 采样器,用于有效恢复空间随机效应。特别是,三次 MGP(Q-MGP)可以通过在 Gibbs 采样器中并行执行所有大规模操作来利用高性能计算资源,与顺序更新方案相比,改进了混合和计算时间。与一些现有的大型空间数据模型不同,Q-MGP 有助于对昂贵的矩阵运算进行大规模缓存,使其特别适合处理时空遥感数据。我们将 Q-MGP 与大型合成和真实世界数据与最先进的方法进行比较。我们还说明了使用来自塞伦盖蒂公园地区的归一化差异植被指数数据来恢复数百万个位置的潜在多元时空随机效应。源代码可在 github.com/mkln/meshgp 获得。本文的补充材料可在线获取。我们将 Q-MGP 与大型合成和真实世界数据与最先进的方法进行比较。我们还说明了使用来自塞伦盖蒂公园地区的归一化差异植被指数数据来恢复数百万个位置的潜在多元时空随机效应。源代码可在 github.com/mkln/meshgp 获得。本文的补充材料可在线获取。我们将 Q-MGP 与大型合成和真实世界数据与最先进的方法进行比较。我们还说明了使用来自塞伦盖蒂公园地区的归一化差异植被指数数据来恢复数百万个位置的潜在多元时空随机效应。源代码可在 github.com/mkln/meshgp 获得。本文的补充材料可在线获取。

更新日期:2020-11-24
down
wechat
bug