当前位置: X-MOL 学术Stat. Med. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Optimal multiwave sampling for regression modeling in two‐phase designs
Statistics in Medicine ( IF 2 ) Pub Date : 2020-10-05 , DOI: 10.1002/sim.8760
Tong Chen 1 , Thomas Lumley 1
Affiliation  

Two‐phase designs involve measuring extra variables on a subset of the cohort where some variables are already measured. The goal of two‐phase designs is to choose a subsample of individuals from the cohort and analyse that subsample efficiently. It is of interest to obtain an optimal design that gives the most efficient estimates of regression parameters. In this article, we propose a multiwave sampling design to approximate the optimal design for design‐based estimators. Influence functions are used to compute the optimal sampling allocations. We propose to use informative priors on regression parameters to derive the wave‐1 sampling probabilities because any prespecified sampling probabilities may be far from optimal and decrease the design efficiency. The posterior distributions of the regression parameters derived from the current wave will then be used as priors for the next wave. Generalized raking is used in the final statistical analysis. We show that a two‐wave sampling with reasonable informative priors will end up with a highly efficient estimation for the parameter of interest and be close to the underlying optimal design.

中文翻译:

两阶段设计中回归建模的最佳多波采样

两阶段设计涉及测量已经测量了一些变量的队列子集的额外变量。两阶段设计的目标是从队列中选择个体的子样本并有效地分析该子样本。获得能够给出回归参数最有效估计的最佳设计是很有意义的。在本文中,我们提出了一种多波采样设计来近似基于设计的估计器的最优设计。影响函数用于计算最佳抽样分配。我们建议使用回归参数的信息先验来推导第一波采样概率,因为任何预先指定的采样概率可能远非最优并降低设计效率。从当前波导出的回归参数的后验分布将被用作下一波的先验分布。最终的统计分析采用广义倾斜法。我们表明,具有合理信息先验的双波采样最终将对感兴趣的参数进行高效估计,并接近潜在的最优设计。
更新日期:2020-10-05
down
wechat
bug