Abstract
The case-cohort design has been widely used to reduce the cost of covariate measurements in large cohort studies. In many such studies, the number of covariates is very large, and the goal of the research is to identify active covariates which have great influence on response. Since the introduction of sure independence screening, screening procedures have achieved great success in terms of effectively reducing the dimensionality and identifying active covariates. However, commonly used screening methods are based on marginal correlation or its variants, they may fail to identify hidden active variables which are jointly important but are weakly correlated with the response. Moreover, these screening methods are mainly proposed for data under the simple random sampling and can not be directly applied to case-cohort data. In this paper, we consider the ultrahigh-dimensional survival data under the case-cohort design, and propose a conditional screening method by incorporating some important prior known information of active variables. This method can effectively detect hidden active variables. Furthermore, it possesses the sure screening property under some mild regularity conditions and does not require any complicated numerical optimization. We evaluate the finite sample performance of the proposed method via extensive simulation studies and further illustrate the new approach through a real data set from patients with breast cancer.
Similar content being viewed by others
References
Andersen PK, Gill RD (1982) Cox’s regression model for counting processes: a large sample study. Ann Statis 10:1100–1120
Barlow WE (1994) Robust variance estimation for the case-cohort design. Biometrics 50:1064–1072
Barut E, Fan J, Verhasselt A (2016) Conditional sure independence screening. J Am Stat Assoc 111:1266–1277
Borgan O, Langholz B, Samuelsen SO, Goldstein L, Pogoda J (2000) Exposure stratified case-cohort designs. Lifetime Data Anal 6:39–58
Bresolw NE, Wellner JA (2007) Weighted likelihood for semiparametric models and two-phase stratified samples, with application to cox regression. Scand J Stat 34:86–102
Candes E, Tao T (2007) The Dantzig selector: Statistical estimation when $p$ is much larger than $n$. Ann Stat 35:2313–2351
Chang J, Tang CY, Wu Y (2013) Marginal empirical likelihood and sure independence feature screening. Ann Stat 41:2123–2148
Chen K (2001) Generalized case-cohort sampling. J R Stat Soc B 63:791–809
Chen K, Lo SH (1999) Case-cohort and case-control analysis with Cox’s model. Biometrika 86:755–764
Cox DR (1972) Regression models and life-tables. J R Stat Soc B 34:187–220
Cui H, Li R, Zhong W (2015) Model-free feature screening for ultrahigh dimensional discriminant analysis. J Am Stat Assoc 110:630–641
Fan J, Feng Y, Song R (2011) Nonparametric independence screening in sparse ultra-high-dimensional additive models. J Am Stat Assoc 106:544–557
Fan J, Feng Y, Wu Y (2010) High-dimensional variable selection for Cox’s proportional hazards model. In: Borrowing strength: theory powering applications: a Festschrift for Lawrence D. Brown, Institute of Mathematical Statistics 6:70–86
Fan J, Li R (2001) Variable selection via nonconcave penalized likelihood and its oracle properties. J Am Stat Assoc 96:1348–1360
Fan J, Lv J (2008) Sure independence screening for ultrahigh dimensional feature space. J R Stat Soc B 70:849–911
Fan J, Ma Y, Dai W (2014) Nonparametric independence screening in sparse ultra-high-dimensional varying coefficient models. J Am Stat Assoc 109:1270–1284
Fan J, Samworth R, Wu Y (2009) Ultrahigh dimensional feature selection: beyond the linear model. J Mach Learn Res 10:2013–2038
Fan J, Song R (2010) Sure independence screening in generalized linear models with NP-dimensionality. Ann Stat 38:3567–3604
Fleming TR, Harrington DP (1991) Counting processes and survival analysis. Wiley, New York
Gorst-Rasmussen A, Scheike T (2013) Independent screening for single-index hazard rate models with ultrahigh dimensional features. J R Stat Soc B 75:217–245
He X, Wang L, Hong HG (2013) Quantile-adaptive model-free variable screening for high-dimensional heterogeneous data. Ann Stat 41:342–369
Hong HG, Kang J, Li Y (2018) Conditional screening for ultra-high dimensional covariates with survival outcomes. Lifetime Data Anal 24:45–71
Hong HG, Wang L, He X (2016) A data-driven approach to conditional screening of high-dimensional variables. Stat 5:200–212
Hu Q, Lin L (2017) Conditional sure independence screening by conditional marginal empirical likelihood. Ann Inst Stat Math 69:63–96
Kalbfleisch JD, Lawless JF (1988) Likelihood analysis of multi-state models for disease incidence and mortality. Stat Med 7:149–160
Kang S, Cai J (2009) Marginal hazards model for case-cohort studies with multiple disease outcomes. Biometrika 96:887–901
Keogh RH, White IR (2013) Using full-cohort data in nested case-control and case-cohort studies by multiple imputation. Stat Med 32:4021–4043
Kim S, Ahn WK (2019) Bi-level variable selection for case-cohort studies with group variables. Stat Methods Med Res 28:3404–3414
Kim S, Cai J, Lu W (2013) More efficient estimators for case-cohort studies. Biometrika 100:695–708
Kulich M, Lin D (2004) Improving the efficiency of relative-risk estimation in case-cohort studies. J Am Stat Assoc 99:832–844
Li G, Peng H, Zhang J, Zhu L (2012a) Robust rank correlation based screening. Ann Stat 40:1846–1877
Li R, Zhong W, Zhu L (2012b) Feature screening via distance correlation learning. J Am Stat Assoc 107:1129–1139
Lin DY, Wei LJ (1989) The robust inference for the Cox proportional hazards model. J Am Stat Assoc 84:1074–1078
Lin Y, Liu X, Hao M (2018) Model-free feature screening for high-dimensional survival data. Sci China Math 61:1617–1636
Liu Y, Chen XL (2018) Quantile screening for ultra-high-dimensional heterogeneous data conditional on some variables. J Stat Comput Sim 88:329–342
Liu J, Li R, Wu R (2014) Feature selection for varying coefficient models with ultrahigh-dimensional covariates. J Am Stat Assoc 109:266–274
Liu Y, Wang Q (2018) Model-free feature screening for ultrahigh-dimensional data conditional on some variables. Ann Inst Stat Math 70:283–301
Liu Y, Zhang J, Zhao X (2018) A new nonparametric screening method for ultrahigh-dimensional survival data. Comput Stat Data Anal 119:74–85
Lu J, Lin L (2020) Model-free conditional screening via conditional distance correlation. Stat Pap 61:225–244
Mai Q, Zou H (2015) The fused Kolmogorov filter: a nonparametric model-free screening method. Ann Stat 43:1471–1497
Marti H, Chavance M (2011) Multiple imputation analysis of case-cohort studies. Stat Med 30:1595–1607
Ni A, Cai J, Zeng D (2016) Variable selection for case-cohort studies with failure time outcome. Biometrika 103:547–562
Pan W, Wang X, Xiao W, Zhu H (2019) A generic sure independence screening procedure. J Am Stat Assoc 114:928–937
Prentice RL (1986) A case-cohort design for epidemiologic cohort studies and disease prevention trials. Biometrika 73:1–11
Scheike TH, Martinussen T (2004) Maximum likelihood estimation for Cox’s regression model under case-cohort sampling. Scand J Stat 31:283–293
Self SG, Prentice R (1988) Asymptotic distribution theory and efficiency results for case-cohort studies. Ann Stat 16:64–81
Song R, Lu W, Ma S, Jeng XJ (2014) Censored rank independence screening for high-dimensional survival data. Biometrika 101:799–814
Tibshirani R (1996) Regression shrinkage and selection via the lasso. J R Stat Soc B 58:267–288
Tibshirani R (2009) Univariate shrinkage in the Cox model for high dimensional data. Stat Appl Genet Mol 8:1–18
Uno H, Cai T, Pencina MJ, D’Agostino RB, Wei LJ (2011) On the C-statistics for evaluating overall adequacy of risk prediction procedures with censored survival data. Stat Med 30:1105–1117
van de Vijver MJ, He YD, van Veer LJ, Dai H, Hart AA, Voskuil DW, Schreiber GJ, Peterse JL, Roberts C, Marton MJ (2002) A gene-expression signature as a predictor of survival in breast cancer. New Engl J Med 347:1999–2009
van der Vaart AW, Wellner JA (1996) Weak convergence and empirical processes. Springer, New York
van Veer LJ, Dai H, van De Vijver MJ, He YD, Hart AA, Mao M, Peterse HL, van der Kooy K, Marton MJ, Witteveen AT, Schreiber GJ, Kerkhoven RM, Roberts C, Linsley PS, Bernards R, Friend SH (2002) Gene expression profiling predicts clinical outcome of breast cancer. Nature 415:530–536
Wu Y, Yin G (2015) Conditional quantile screening in ultrahigh-dimensional heterogeneous data. Biometrika 102:65–76
Yeung KY, Bumgarner RE, Raftery AE (2005) Bayesian model averaging: development of an improved multi-class, gene selection and classification tool for microarray data. Bioinformatics 21:2394–2402
Zeng D, Lin DY (2014) Efficient estimation of semiparametric transformation models for two-phase cohort studies. J Am Stat Assoc 109:371–383
Zhang CH (2010) Nearly unbiased variable selection under minimax concave penalty. Ann Stat 38:894–942
Zhang J, Liu Y, Wu Y (2017) Correlation rank screening for ultrahigh–dimensional survival data. Comput Stat Data Anal 108:121–132
Zhang J, Yin G, Liu Y, Wu Y (2018) Censored cumulative residual independent screening for ultrahigh-dimensional survival data. Lifetime Data Anal 24:273–292
Zhao SD, Li Y (2012) Principled sure independence screening for Cox models with ultra-high-dimensional covariates. J Mult Anal 105:397–411
Zhou T, Zhu L (2017) Model-free feature screening for ultrahigh dimensional censored regression. Stat Comput 27:947–961
Zhu LP, Li L, Li R, Zhu LX (2011) Model-free feature screening for ultrahigh-dimensional data. J Am Stat Assoc 106:1464–1475
Zou H (2006) The adaptive Lasso and its oracle properties. J Am Stat Assoc 101:1418–1429
Acknowledgements
This work is funded in part by the U.S. National Institute of Health Grants (P01CA142538, P42ES031007, P30ES010126), the National Natural Science Foundation of China grants (Nos. 11971362, 11901581, 11771366).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that we have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Appendix
Appendix
1.1 Appendix A: regularity conditions
Let \(S_T(t|\mathbf{Z}_i)=\exp \{-\varLambda _0(t)\exp (\varvec{\alpha }^{\mathrm{T}}\mathbf{Z}_i)\}\) and \(S_C(t|\mathbf{Z}_i)=P(C_i>t|\mathbf{Z}_i)\) denote the survival functions of \(T_i\) and \(C_i\), \(F_T(t|\mathbf{Z}_i)=1-S_T(t|\mathbf{Z}_i)\), \(\varLambda _0(t)=\int _0^t\lambda _0(s)\mathrm{d}s\) denotes the cumulative baseline hazard function. For any vector \({\varvec{\nu }}=(\nu _1,\ldots ,\nu _p)\in R^p\), let \(\Vert {\varvec{\nu }}\Vert _d=\root d \of {\sum _{j=1}^p|\nu _j|^d}\) be the \(L_d\) norm. For any random variables \(\zeta : \varOmega \rightarrow R^d\), \(\zeta _1: \varOmega \rightarrow R^{d_1}\), \(\zeta _2: \varOmega \rightarrow R^{d_2}\) and \(\eta : \varOmega \rightarrow R^p\), the conditional linear expectation of \(\zeta \) given \(\eta \) is defined as \(E^{*}(\zeta |\eta )=E(\zeta )+B^{T}\{\eta -E(\eta )\}\), where \(B=\mathrm{argmin}_{D\in R^d\times R^p}E[\{\zeta -E(\zeta )-D^{\mathrm{T}}(\eta -E(\eta ))\}^2|\eta ]\). The conditional linear covariance between \(\zeta _1\) and \(\zeta _2\) given \(\eta \) is defined as \(Cov^{*}(\zeta _1,\zeta _2|\eta )=E^{*}[\{\zeta _1-E^{*}(\zeta _1|\eta )\} \{\zeta _2-E^{*}(\zeta _2|\eta )\}|\eta ]\). The properties of \(E^{*}(\zeta |\eta )\) and \(Cov^{*}(\zeta _1,\zeta _2|\eta )\) are presented in “Appendix B”. The regularity conditions listed below are imposed throughout our discussions.
-
C1.
For each \(j\notin {\mathcal {C}}\) and \(k\in {\mathcal {C}}\bigcup \{j\}\), there exists a neighborhood \({\mathcal {B}}_j\) of \(({\varvec{\beta }}_{{\mathcal {C}},j}^{0},{\beta }_j^{0})^{\mathrm{T}}\) such that
$$\begin{aligned} \sup _{t\in [0,\tau ],({\varvec{\beta }}_{{\mathcal {C}},j},{\beta _j})^{\mathrm{T}}\in {\mathcal {B}}_j}\Vert S_{j,k}^{(l)}(\varvec{\beta }_{{\mathcal {C}},j},\beta _j,t)- s_{j,k}^{(l)}(\varvec{\beta }_{{\mathcal {C}},j},\beta _j,t)\Vert _2\rightarrow 0 \end{aligned}$$in probability as \(n\rightarrow \infty \) (\(l=0,1\)), \(s_{j,k}^{(0)}(\varvec{\beta }_{{\mathcal {C}},j},\beta _j,t)\) is bounded away from zero on \({\mathcal {B}}_j\times [0,\tau ]\), \(s_{j,k}^{(l)}(\varvec{\beta }_{{\mathcal {C}},j},\beta _j,t)\) are bounded on \({\mathcal {B}}_j\times [0,\tau ]\).
-
C2.
For all \(j=1,\ldots ,p\), \(\int _0^{\tau }\lambda _{j,0}(t)\mathrm {d}t< \infty \) and \(E\{Y(\tau )\}>0\).
-
C3.
The covariates \(Z_{j}\) (\(j=1,\ldots , p\)) are independent of time and bounded by a constant \(L_0\). Furthermore, \(E(Z_j)=0\) for all \(j\in \{1,\ldots , p\}\).
-
C4.
All \(Z_{j}\), \(j\in {\mathcal {A}}_{-{\mathcal {C}}}\) are independent of all \(Z_{j}\), \(j\notin {\mathcal {A}}_{-{\mathcal {C}}}\) given \(\mathbf{Z}_{{\mathcal {C}}}\).
-
C5.
There exists a constant \(L_1\) such that \(\Vert \varvec{\alpha }\Vert _1<L_1\) and \(\Vert ({\varvec{\beta }}_{{\mathcal {C}},j},{\beta }_j)^{\mathrm{T}}\Vert _1<L_1\).
-
C6.
There exist constants \(c_1>0\) and \(0<\kappa <1/2\) such that \(\min _{j\in {\mathcal {A}}_{-{\mathcal {C}}}}|E[Cov^{*}(Z_{j},P(\delta =1|\mathbf{Z})|\mathbf{Z}_{{\mathcal {C}}})]| \ge c_1 n^{-\kappa }\).
-
C7.
There exists a constant \(L>0\) such that \(n^{-1}\Vert \mathbf{U}_j(\widehat{\varvec{\beta }}_{{\mathcal {C}},j},\widehat{\beta }_j)-\mathbf{U}_j(\varvec{\beta }_{{\mathcal {C}},j}^{0},\beta _j^{0})\Vert _2 \ge L\Vert (\widehat{\varvec{\beta }}_{{\mathcal {C}},j},\widehat{\beta }_j)^{\mathrm{T}} -({\varvec{\beta }}_{{\mathcal {C}},j}^{0},{\beta }_j^{0})^{\mathrm{T}}\Vert _2\) for all \(j\notin {\mathcal {C}}\).
-
C8.
Let \({\tilde{n}}=\sum _{i=1}^n\xi _i\) denote the sample size of subcohort, then \({\tilde{n}}/n\) converges to the constant \(\pi \in (0,1)\).
Conditions C1 and C2 are common assumptions in survival analysis (Andersen and Gill 1982; Fleming and Harrington 1991). Condition C3 assumes the covariates are bounded, similar condition also used in Hong et al. (2018). Condition C4 is similar to the partial orthogonality assumption of the covariates. Condition C5 controls the total effect size of the covariates, it is reasonable under the sparsity principle. Condition C6 is a typical assumption which has been widely used in the literature of feature screening, such as condition 3 in Fan and Lv (2008), condition 2 in Li et al. (2012b), condition 2 in Song et al. (2014), conditions 2 and 5 in Wu and Yin (2015), etc. Condition C7 is a mild assumption which holds in many situations. Condition C8 is a common assumption on the case-cohort design.
1.2 Appendix B: lemmas and theoretic proofs
Let \({\varvec{\beta }}_{{\mathcal {C}},0}\) be the solution of the equations \(\mathbf{u}_{{\mathcal {C}}}(\varvec{\beta }_{{\mathcal {C}}})=[u_{j,k}(\varvec{\beta }_{{\mathcal {C}}},0), k\in {\mathcal {C}}]^{\mathrm{T}}=\mathbf{0}_{q}\). Define \(\mathbf{v}_{j}(\varvec{\beta }_{{\mathcal {C}},j},\beta _j)=u_{j,j}(\varvec{\beta }_{{\mathcal {C}},j},\beta _j)- \sum _{k\in {\mathcal {C}}}b_k u_{j,k}(\varvec{\beta }_{{\mathcal {C}},j},\beta _j)\), where vector \(\mathbf{b}_{{\mathcal {C}}}=[b_k, k\in {\mathcal {C}}]^{\mathrm{T}}\) such that \(E^{*}[Z_j|\mathbf{Z}_{{\mathcal {C}}}]=\sum _{k\in {\mathcal {C}}} b_kZ_k\). As a preparation, we first introduce some lemmas.
Lemma 4
Let \(\varvec{\xi }=(\xi _1,\ldots ,\xi _n)\) be a random vector containing \({\tilde{n}}\) ones and \(n-{\tilde{n}}\) zeros, with each permutation equally likely. Let \(B_i (t)\) \((i = 1, \ldots , n)\) be independent and identically distributed real-valued random processes on \([0,\tau ]\) with \(E\{B_i(t)\}=\mu _B(t)\), \(var(B_i(\tau ))< \infty \). Let \(B(t)=\{B_1(t),\ldots ,B_n(t)\}\) be independent of \(\xi \). Suppose that almost all paths of \(B_i(t)\) have finite variation. Then, \(n^{-1/2}\sum _{i=1}^n\xi _i\{B_i(t)-\mu _B(t)\}\) converges weakly in \(l^\infty [0,\tau ]\) to a zero-mean Gaussian process and therefore \(n^{-1/2}\sum _{i=1}^n\xi _i\{B_i(t)-\mu _B(t)\}\) converges in probability to zero uniformly in t.
This Lemma is the same as Lemma A1 of Kang and Cai (2009).
Lemma 5
Given that \(\xi \) is independent of \(\varDelta \) and Y(t), \(n^{1/2}\{\widehat{\pi }^{-1}(t)-\pi ^{-1}\}\) converges weakly to a zero-mean Gaussian process.
This lemma is extracted from lemma A3 of Ni et al. (2016).
Lemma 6
For independent random variables \(Y_1,\ldots , Y_n\) with bounded ranges \([-M,M]\) and zero mean,
for \(V\ge Var(Y_1+\ldots +Y_n)\).
This lemma is extracted from lemma 2.2.9 of van der Vaart and Wellner (1996).
Lemma 7
Let \(\zeta \), \(\zeta _1\), \(\zeta _2\) and \(\eta \) be any four random variables in the probability space \((\varOmega , {\mathcal {F}},P)\), the following properties hold for the conditional linear expectation \(E^*(\cdot |\eta )\) given \(\eta \):
-
1.
\(E^*(\zeta |\eta )=E(\zeta )+Cov(\zeta ,\eta )Var(\eta )^{-1}\{\eta -E(\eta )\}\);
-
2.
\(E^*(\eta |\eta )=\eta \);
-
3.
For any matrices \(A_1\) and \(A_2\), \(E^*(A_1\zeta _1+A_2\zeta _2|\eta )=A_1E^*(\zeta _1|\eta )+A_2E^*(\zeta _2|\eta )\);
-
4.
\(E^*[E^*(\zeta |\eta )]=E[E^*(\zeta |\eta )]=E[\zeta ]\).
This lemma is extracted from proposition 2 of Hong et al. (2018).
Lemma 8
The conditional linear covariance has the following properties:
-
1.
\(Cov^*(\zeta _1,\zeta _2|\eta )=0\Longleftrightarrow E^*(\zeta _1\zeta _2|\eta )=E^*(\zeta _1|\eta )E^*(\zeta _2|\eta )\);
-
2.
\(E[Cov^*(\zeta _1,\zeta _2|\eta )]=Cov(\zeta _1,\zeta _2)- Cov(\zeta _1,\eta )Var(\eta )^{-1}Cov(\eta ,\zeta _2)\);
-
3.
For any increasing function \(h(\cdot ): R \rightarrow R\) and random variable \(\xi : \varOmega \rightarrow R\), we have \(Cov^*(h(\xi ),\xi |\eta )\ge 0\).
This lemma is extracted from proposition 3 of Hong et al. (2018).
1.2.1 Proof of Lemma 1
Proof
We first relate \(\beta _j^{0}\) to \(E[Cov^{*}\{Z_{j},P(\delta =1|\mathbf{Z})|\mathbf{Z}_{{\mathcal {C}}}\}]\), then by condition C6, we relate it to \(\alpha _j\). For any \(j\notin {\mathcal {C}}\) and \(k\in {\mathcal {C}}\), straightforward calculations entail that \(s_k^{l}(t)=E\{Z_k^l\lambda _0(t)\exp (\varvec{\alpha }^{\mathrm{T}}\mathbf{Z})S_TS_C\}\) and \(s_{j,k}^{(l)}(\varvec{\beta }_{{\mathcal {C}},j},\beta _j,t)=E\{Z_k^l\exp (\mathbf{Z}_{{\mathcal {C}}}^{\mathrm{T}}\varvec{\beta }_{{\mathcal {C}},j}+Z_j\beta _j )S_TS_C\}\) \((l=0,1,2)\), then
By the definition, we have
where
and
By the definition of \((\varvec{\beta }_{{\mathcal {C}},j}^{0},\beta _j^{0})\), we have \(\mathbf{u}_j(\varvec{\beta }_{{\mathcal {C}},j}^{0},\beta _j^{0})=\mathbf{0}_{q+1}\), then \(u_{j,k}(\varvec{\beta }_{{\mathcal {C}},j}^{0},\beta _j^{0})=0\) for any \(k\in {\mathcal {C}}\cup \{j\}\), \(\mathbf{v}_{j}(\varvec{\beta }_{{\mathcal {C}},j}^{0},\beta _j^{0})=u_{j,j}(\varvec{\beta }_{{\mathcal {C}},j}^{0},\beta _j^{0})- \sum _{k\in {\mathcal {C}}}b_k u_{j,k}(\varvec{\beta }_{{\mathcal {C}},j}^{0},\beta _j^{0})=0\), \(F_{2j}(\varvec{\beta }_{{\mathcal {C}},j}^{0},\beta _j^{0})=F_{1j}(\varvec{\beta }_{{\mathcal {C}},j}^{0},\beta _j^{0}) =E[Cov^{*}\{Z_{j},P(\delta =1|\mathbf{Z})|\mathbf{Z}_{{\mathcal {C}}}\}]\). When \(\alpha _j=0\), \(E[Cov^{*}\{Z_{j},P(\delta =1|\mathbf{Z})|\mathbf{Z}_{{\mathcal {C}}}\}]=0\), thus \(F_{2j}(\varvec{\beta }_{{\mathcal {C}},j}^{0},\beta _j^{0})=0\). Because of \(F_{2j}(\varvec{\beta }_{{\mathcal {C}},0},0)=0\), \(\mathbf{v}_{j}(\varvec{\beta }_{{\mathcal {C}},0},0)=E[Cov^{*}\{Z_{j},P(\delta =1|\mathbf{Z})|\mathbf{Z}_{{\mathcal {C}}}\}] -F_{2j}(\varvec{\beta }_{{\mathcal {C}},0},0)=\mathbf{0}_{q+1}\). By the uniqueness of the solution of \(\mathbf{v}_{j}(\varvec{\beta }_{{\mathcal {C}}},\beta )\), we have \(\beta _j^{0}=0\).
When \(\alpha _j\ne 0\), by condition C6, we have \(F_{2j}(\varvec{\beta }_{{\mathcal {C}},j}^{0},\beta _j^{0}) =E[Cov^{*}\{Z_{j},P(\delta =1|\mathbf{Z})|\mathbf{Z}_{{\mathcal {C}}}\}]\ge c_1n^{-\kappa }\). This implies that \(F_{2j}(\varvec{\beta }_{{\mathcal {C}},j}^{0},\beta _j^{0})\) and \(E[Cov^{*}\{Z_{j},P(\delta =1|\mathbf{Z})|\mathbf{Z}_{{\mathcal {C}}}\}]\) are both nonzero and have the same signs since they are equal. Specifically, \(P(\delta =1|\mathbf{Z})\) is the probability of occurrence of the event and \(S_TS_C=P(X>t|\mathbf{Z})\) represents the probability at risk at time t. For any t, we have
By lemma 8, \(Cov^*\{Z_j,P(\delta =1|\mathbf{Z})|\mathbf{Z}_{{\mathcal {C}}}\}\) and \(Cov^*(Z_j,S_TS_C|\mathbf{Z}_{{\mathcal {C}}})\) have the opposite signs unless they are zero. This further implies that
and \(E[Cov^{*}\{Z_{j},P(\delta =1|\mathbf{Z})|\mathbf{Z}_{{\mathcal {C}}}\}]\) have opposite signs unless they are equal to zero. So \(F_{2j}(\varvec{\beta }_{{\mathcal {C}},0},0)\ne F_{2j}(\varvec{\beta }_{{\mathcal {C}},j}^{0},\beta _j^{0})\), therefore, \(\beta _j^{0}\ne 0\). \(\square \)
1.2.2 Proof of Lemma 2
Proof
By lemma 1, for any \(j\in {\mathcal {A}}_{-{\mathcal {C}}}\), we have \(\beta _j^{0}\ne 0\). By Taylor expansion, there exists \(\widetilde{\beta }_j\in (0, \beta _j^{0})\) such that
By the proof of lemma 1, \(\mathbf{v}_{j}(\varvec{\beta }_{{\mathcal {C}},j}^{0},\beta _j^{0})=E[Cov^{*}\{Z_{j},P(\delta =1|\mathbf{Z})|\mathbf{Z}_{{\mathcal {C}}}\}] -F_{2j}(\varvec{\beta }_{{\mathcal {C}},j}^{0},\beta _j^{0}).\) Given \(\varvec{\beta }_{{\mathcal {C}},j}^{0}\), consider \(F_{2j}(\varvec{\beta }_{{\mathcal {C}},j}^{0},\beta _j)\) as a function of \(\beta _j\), then
where
By condition C3, \(|Z_j|\le L_0\), then \(\sup _{\beta _j}|H_{j}(\varvec{\beta }_{{\mathcal {C}},j}^{0},\beta _j,t)| \le 2L_0^2\). So
By the proof in lemma 1, \(F_{2j}(\varvec{\beta }_{{\mathcal {C}},j}^{0},0)\) and \(E[Cov^{*}\{Z_{j},P(\delta =1|\mathbf{Z})|\mathbf{Z}_{{\mathcal {C}}}\}]\) have opposite signs, combining it with condition C6,
So
Taking \(c_2=0.5L_0^{-2}c_1\), we have
which completes the proof. \(\square \)
1.2.3 Proof of Lemma 3
Proof
Denote \(\bar{\mathbf{U}}_j(\varvec{\beta }_{{\mathcal {C}},j},\beta _j)=n^{-1}\mathbf{U}_j(\varvec{\beta }_{{\mathcal {C}},j},\beta _j)\). By the definition of \((\widehat{\varvec{\beta }}_{{\mathcal {C}},j},\widehat{\beta }_j)^{\mathrm{T}}\), we have
For any \(j\notin {\mathcal {C}}\) and \(k\in {\mathcal {C}}\cup \{j\}\), using the similar method of Lin and Wei (1989), by lemmas 4 and 5, we can obtain that
where \(\mathbf{W}_{i,j}(\varvec{\beta }_{{\mathcal {C}},j}^{0},\beta _j^{0})\) \((i=1,\ldots , n)\) are independent, \(E\{\mathbf{W}_{i,j}(\varvec{\beta }_{{\mathcal {C}},j}^{0},\beta _j^{0})\}=\mathbf{0}\) and \(\mathbf{W}_{i,j}(\varvec{\beta }_{{\mathcal {C}},j}^{0},\beta _j^{0})= [{W}_{i,j,k}(\varvec{\beta }_{{\mathcal {C}},j}^{0},\beta _j^{0}), k\in {\mathcal {C}}\cup \{j\}]^{\mathrm{T}}\) with
Let \(E_n\) denote the empirical measure, we can write
For any given i, j, k, by conditions C1, C3, C5, there exists a constant \(L_2\) such that \(|{W}_{i,j,k}(\varvec{\beta }_{{\mathcal {C}},j}^{0},\beta _j^{0})|\le L_2\), by the fact that \(E[{W}_{i,j,k}(\varvec{\beta }_{{\mathcal {C}},j}^{0},\beta _j^{0})]=0\), we have \(Var[{W}_{i,j,k}(\varvec{\beta }_{{\mathcal {C}},j}^{0},\beta _j^{0})]= E[|{W}_{i,j,k}(\varvec{\beta }_{{\mathcal {C}},j}^{0},\beta _j^{0})|^2]\le L_2^2\). By lemma 6, for any \(t>0\), \(j\notin {\mathcal {C}}\) and \(k\in {\mathcal {C}}\cup \{j\}\), we have
By Bonferroni inequality, we have
As \(\Vert \bar{\mathbf{U}}_j(\varvec{\beta }_{{\mathcal {C}},j}^{0},\beta _j^{0})-E_n[\mathbf{W}_{i,j}(\varvec{\beta }_{{\mathcal {C}},j}^{0},\beta _j^{0})]\Vert _2=o_p(1)\), for any \(\epsilon _1>0\) and \(\epsilon _2>0\), there exists \(N_1\) such that for any \(n>N_1\), we have
Taking \(t=\frac{c_2Ln^{1-\kappa }}{2(q+1)}>0\), then \(\frac{t(q+1)}{n}=\frac{c_2Ln^{-\kappa }}{2}\). By Triangle inequality and Bonferroni inequality, we have
Taking \(N=\max \{(L_2/3)^{1/\kappa },N_1\}\), then for any \(n>N\), \(n^{-\kappa }<3/L_2\), so we have
where \(c_3=\frac{c_2^2L^2}{8L_2^2(q+1)^2+4c_2L(q+1)}\). By condition C7, we have
Then we have
where \(a=|{\mathcal {A}}_{-{\mathcal {C}}}|=\sum _{j\notin {\mathcal {C}}}I(\alpha _j\ne 0)\) is the size of \(|{\mathcal {A}}_{-{\mathcal {C}}}|\). \(\square \)
1.2.4 Proof of Theorem 1
Proof
By the definition of \(\widehat{{\mathcal {A}}}_{-{\mathcal {C}}}\) and condition C7, there exists a positive constant \(c_4\) such that
Following lemma 2, for any \(j\in {\mathcal {A}}_{-{\mathcal {C}}}\), we have \(|\beta _j^{0}-\widehat{\beta }_j|\ge |\beta _j^{0}|-|\widehat{\beta }_j|\ge c_2n^{-\kappa }-|\widehat{\beta }_j|\). Suppose \(\min _{j\in {\mathcal {A}}_{-{\mathcal {C}}}}|\widehat{\beta }_j|<n^{-1/2}c_4\gamma \), then \(\max _{j\in {\mathcal {A}}_{-{\mathcal {C}}}}|\beta _j^{0}-\widehat{\beta }_j|\ge c_2n^{-\kappa }-n^{-1/2}c_4\gamma \). If we have \(\gamma <c_2(n^{-\kappa }-\epsilon _1)n^{1/2}/(2c_4)\), we can obtain
Then \(P({\mathcal {A}}_{-{\mathcal {C}}}\subseteq \widehat{{\mathcal {A}}}_{-{\mathcal {C}}}) \ge 1-2a(q+1)\exp (-c_3n^{1-2\kappa })-a\epsilon _2\). Let \(n\rightarrow \infty \), for any \(\epsilon _2>0\), we have \(\lim _{n\rightarrow \infty }P({\mathcal {A}}_{-{\mathcal {C}}}\subseteq \widehat{{\mathcal {A}}}_{-{\mathcal {C}}}) \ge 1-a\epsilon _2\), the right side of the above equation does not depend on n any more. Taking \(\epsilon _2\rightarrow 0\), we have \(\lim _{n\rightarrow \infty }P({\mathcal {A}}_{-{\mathcal {C}}}\subseteq \widehat{{\mathcal {A}}}_{-{\mathcal {C}}})=1\). \(\square \)
1.2.5 Proof of Theorem 2
Proof
For any \(j\in {\mathcal {A}}_{-{\mathcal {C}}}\), we have \(\alpha _j\ne 0\). From lemma 1, we know that \(|\beta _j^{0}|> 0\). Similarily, we have \(|\beta _j^{0}|=0\) if \(j\notin {\mathcal {A}}_{-{\mathcal {C}}}\). As \(\widehat{\beta }_j\) is a consistent estimator of \(\beta _j^{0}\) and \( M_{{\mathcal {C}},j}=|\widehat{\beta }_j|/\widehat{\sigma }_j\), we can easily conclude that \(P(\max _{j\notin {\mathcal {A}}_{-{\mathcal {C}}}}M_{{\mathcal {C}},j}<\min _{j\in {\mathcal {A}}_{-{\mathcal {C}}}}M_{{\mathcal {C}},j})\rightarrow 1\) when \(n\rightarrow \infty \), which completes the proof of theorem 2. \(\square \)
Rights and permissions
About this article
Cite this article
Zhang, J., Zhou, H., Liu, Y. et al. Conditional screening for ultrahigh-dimensional survival data in case-cohort studies. Lifetime Data Anal 27, 632–661 (2021). https://doi.org/10.1007/s10985-021-09531-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10985-021-09531-7