Exact sampling of the unobserved covariates in Bayesian spline models for measurement error problems.,Statistics and Computing

当前位置： X-MOL 学术 › Stat. Comput. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Exact sampling of the unobserved covariates in Bayesian spline models for measurement error problems.
Statistics and Computing ( IF 1.6 ) Pub Date : 2015-06-14 , DOI: 10.1007/s11222-015-9572-7
Anindya Bhadra ₁ , Raymond J Carroll ₂

Affiliation

In truncated polynomial spline or B-spline models where the covariates are measured with error, a fully Bayesian approach to model fitting requires the covariates and model parameters to be sampled at every Markov chain Monte Carlo iteration. Sampling the unobserved covariates poses a major computational problem and usually Gibbs sampling is not possible. This forces the practitioner to use a Metropolis–Hastings step which might suffer from unacceptable performance due to poor mixing and might require careful tuning. In this article we show for the cases of truncated polynomial spline or B-spline models of degree equal to one, the complete conditional distribution of the covariates measured with error is available explicitly as a mixture of double-truncated normals, thereby enabling a Gibbs sampling scheme. We demonstrate via a simulation study that our technique performs favorably in terms of computational efficiency and statistical performance. Our results indicate up to 62 and 54 % increase in mean integrated squared error efficiency when compared to existing alternatives while using truncated polynomial splines and B-splines respectively. Furthermore, there is evidence that the gain in efficiency increases with the measurement error variance, indicating the proposed method is a particularly valuable tool for challenging applications that present high measurement error. We conclude with a demonstration on a nutritional epidemiology data set from the NIH-AARP study and by pointing out some possible extensions of the current work.

中文翻译：

贝叶斯样条模型中未观察到的协变量的精确采样，用于测量误差问题。

在对协变量进行误差测量的截断多项式样条或B样条模型中，采用完全贝叶斯模型拟合的方法需要在每次马尔可夫链蒙特卡洛迭代中对协变量和模型参数进行采样。对未观察到的协变量进行采样带来了主要的计算问题，通常无法进行吉布斯采样。这会迫使从业人员使用“大都市-迈斯特”步骤，由于混音效果差，可能会导致性能不佳，并且可能需要仔细调整。在本文中，我们显示了对于阶数等于1的截断多项式样条或B样条模型的情况，通过双截断正态的混合可以明确地获得带有误差的协变量的完整条件分布，从而可以进行Gibbs采样方案。我们通过模拟研究证明，我们的技术在计算效率和统计性能方面表现出色。我们的结果表明，与分别使用截断多项式样条和B样条的现有替代方法相比，平均积分平方误差效率分别提高了62％和54％。此外，有证据表明效率的增益随测量误差的变化而增加，这表明所提出的方法对于挑战具有高测量误差的应用特别有用。最后，我们通过对NIH-AARP研究的营养流行病学数据集进行了演示，并指出了当前工作的一些可能扩展。我们的结果表明，与分别使用截断多项式样条和B样条的现有替代方法相比，平均积分平方误差效率分别提高了62％和54％。此外，有证据表明效率的增益随测量误差的变化而增加，这表明该方法对于挑战具有高测量误差的应用特别有用。最后，我们通过对NIH-AARP研究的营养流行病学数据集进行了演示，并指出了当前工作的一些可能扩展。我们的结果表明，与分别使用截断多项式样条和B样条的现有替代方法相比，平均积分平方误差效率分别提高了62％和54％。此外，有证据表明效率的增益随测量误差的变化而增加，这表明所提出的方法对于挑战具有高测量误差的应用特别有用。最后，我们通过对NIH-AARP研究的营养流行病学数据集进行了演示，并指出了当前工作的一些可能扩展。有证据表明，效率增益随测量误差的变化而增加，这表明所提出的方法对于具有高测量误差的具有挑战性的应用特别有用。最后，我们通过对NIH-AARP研究的营养流行病学数据集进行了演示，并指出了当前工作的一些可能扩展。有证据表明，效率增益随测量误差的变化而增加，这表明所提出的方法对于挑战具有高测量误差的应用特别有用。最后，我们通过对NIH-AARP研究的营养流行病学数据集进行了演示，并指出了当前工作的一些可能扩展。

更新日期：2015-06-14

点击分享查看原文

点击收藏

公开下载

阅读更多本刊最新论文本刊介绍/投稿指南11