Double-slicing assisted sufficient dimension reduction for high-dimensional censored data,Annals of Statistics

当前位置： X-MOL 学术 › Ann. Stat. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Double-slicing assisted sufficient dimension reduction for high-dimensional censored data
Annals of Statistics ( IF 3.2 ) Pub Date : 2020-08-01 , DOI: 10.1214/19-aos1880
Shanshan Ding , Wei Qian , Lan Wang

This paper provides a unified framework and an efficient algorithm for analyzing high-dimensional survival data under weak modeling assumptions. In particular, it imposes neither parametric distributional assumption nor linear regression assumption. It only assumes that the survival time T depends on a high-dimensional covariate vector X through low-dimensional linear combinations of covariates ΓX. The censoring time is allowed to be conditionally independent of the survival time given the covariates. This general framework includes many popular parametric and semiparametric survival regression models as special cases. The proposed algorithm produces a number of practically useful outputs with theoretical guarantees, including a consistent estimate of the sufficient dimension reduction subspace of T |X, a uniformly consistent Kaplan-Meier type estimator of the conditional distribution function of T and a consistent estimator of the conditional quantile survival time. Our asymptotic results significantly extend the classical theory of sufficient dimension reduction for censored data (particularly that of Li et al. 1999) and the celebrated nonparametric Kaplan-Meier estimator to the setting where the number of covariates p diverges exponentially fast with the sample size n. We demonstrate the promising performance of the proposed new estimators through simulations and a real data example.

中文翻译：

双切片辅助高维删失数据的充分降维

本文为弱建模假设下的高维生存数据分析提供了统一的框架和高效的算法。特别是，它既不强加参数分布假设，也不强加线性回归假设。它仅假设生存时间 T 依赖于高维协变量向量 X 通过协变量 ΓX 的低维线性组合。在给定协变量的情况下，允许审查时间有条件地独立于生存时间。这个通用框架包括许多流行的参数和半参数生存回归模型作为特例。所提出的算法产生了许多具有理论保证的实际有用的输出，包括对 T |X 的足够降维子空间的一致估计，T 的条件分布函数的一致一致 Kaplan-Meier 型估计量和条件分位数生存时间的一致估计量。我们的渐近结果显着扩展了截尾数据充分降维的经典理论（尤其是 Li 等人 1999 年的理论）和著名的非参数 Kaplan-Meier 估计量，以证明协变量 p 的数量随着样本大小 n 呈指数级快速发散. 我们通过模拟和真实数据示例证明了所提出的新估计器的有希望的性能。我们的渐近结果显着扩展了截尾数据充分降维的经典理论（尤其是 Li 等人 1999 年的理论）和著名的非参数 Kaplan-Meier 估计量，以证明协变量 p 的数量随着样本大小 n 呈指数级快速发散. 我们通过模拟和真实数据示例展示了所提出的新估计器的有希望的性能。我们的渐近结果显着扩展了截尾数据充分降维的经典理论（尤其是 Li 等人 1999 年的理论）和著名的非参数 Kaplan-Meier 估计量，以证明协变量 p 的数量随着样本大小 n 呈指数级快速发散. 我们通过模拟和真实数据示例展示了所提出的新估计器的有希望的性能。

更新日期：2020-08-01

点击分享查看原文

点击收藏

阅读更多本刊最新论文