当前位置: X-MOL 学术Annu. Rev. Stat. Appl. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Missing Data Assumptions
Annual Review of Statistics and Its Application ( IF 7.4 ) Pub Date : 2021-03-08 , DOI: 10.1146/annurev-statistics-040720-031104
Roderick J. Little 1
Affiliation  

I review assumptions about the missing-data mechanisms that underlie methods for the statistical analysis of data with missing values. I describe Rubin's original definition of missing at random (MAR), its motivation and criticisms, and his sufficient conditions for ignoring the missingness mechanism for likelihood-based, Bayesian, and frequentist inference. Related definitions, including missing completely at random, always MAR, always missing completely at random, and partially MAR, are also covered. I present a formal argument for weakening Rubin's sufficient conditions for frequentist maximum likelihood inference with precision based on the observed information. Some simple examples of MAR are described, together with an example where the missingness mechanism can be ignored even though MAR does not hold. Alternative approaches to statistical inference based on the likelihood function are reviewed, along with non-likelihood frequentist approaches, including weighted generalized estimating equations. Connections with the causal inference literature are also discussed. Finally, alternatives to Rubin's MAR definition are discussed, including informative missingness, informative censoring, and coarsening at random. The intent is to provide a relatively nontechnical discussion, although some of the underlying issues are challenging and touch on fundamental questions of statistical inference.

中文翻译:


缺少数据假设

我回顾了有关缺失数据机制的假设,这些假设构成了对具有缺失值的数据进行统计分析的方法的基础。我描述了鲁宾对随机缺失(MAR)的最初定义,动机和批评,以及他忽略基于似然,贝叶斯和频繁推断的缺失机制的充分条件。还涵盖了相关定义,包括完全随机丢失,始终为MAR,始终完全随机丢失和为部分MAR。我提出一个正式的论据,以削弱鲁宾基于观察到的信息进行精确度最高的频繁似然推理的充分条件。描述了一些简单的MAR示例,以及一个即使MAR不成立也可以忽略缺失机制的示例。回顾了基于似然函数的统计推断的替代方法,以及非似然的频繁性方法,包括加权广义估计方程。还讨论了与因果推理文献的联系。最后,讨论了鲁宾的MAR定义的替代方案,包括信息缺失,信息审查和随机粗化。目的是提供一个相对非技术性的讨论,尽管其中一些基本问题具有挑战性,并涉及统计推断的基本问题。讨论了鲁宾的MAR定义的替代方案,包括信息缺失,信息审查和随机粗化。目的是提供一个相对非技术性的讨论,尽管其中一些基本问题具有挑战性,并涉及统计推断的基本问题。讨论了鲁宾的MAR定义的替代方案,包括信息缺失,信息审查和随机粗化。目的是提供一个相对非技术性的讨论,尽管其中一些基本问题具有挑战性,并涉及统计推断的基本问题。

更新日期:2021-03-09
down
wechat
bug