Abstract
The proportional hazards (PH) model is arguably one of the most popular models used to analyze time to event data arising from clinical trials and longitudinal studies. In many such studies, the event time is not directly observed but is known relative to periodic examination times; i.e., practitioners observe either current status or interval-censored data. The analysis of data of this structure is often fraught with many difficulties since the event time of interest is unobserved. Further exacerbating this issue, in some such studies the observed data also consists of instantaneous failures; i.e., the event times for several study units coincide exactly with the time at which the study begins. In light of these difficulties, this work focuses on developing a mixture model, under the PH assumptions, which can be used to analyze interval-censored data subject to instantaneous failures. To allow for modeling flexibility, two methods of estimating the unknown cumulative baseline hazard function are proposed; a fully parametric and a monotone spline representation are considered. Through a novel data augmentation procedure involving latent Poisson random variables, an expectation–maximization (EM) algorithm is developed to complete model fitting. The resulting EM algorithm is easy to implement and is computationally efficient. Moreover, through extensive simulation studies the proposed approach is shown to provide both reliable estimation and inference. The motivation for this work arises from a randomized clinical trial aimed at assessing the effectiveness of a new peanut allergen treatment in attaining sustained unresponsiveness in children.
Similar content being viewed by others
References
Betensky RA, Lindsey JC, Ryan LM, Wand M (2002) A local likelihood proportional hazards model for interval censored data. Stat Med 21(2):263–275. https://doi.org/10.1002/sim.993
Cai B, Lin X, Wang L (2011) Bayesian proportional hazards model for current status data with monotone splines. Comput Stat Data Anal 55(9):2644–2651. https://doi.org/10.1016/j.csda.2011.03.013
Cai T, Betensky RA (2003) Hazard regression for interval-censored data with penalized spline. Biometrics 59(3):570–579. https://doi.org/10.1111/1541-0420.00067
Chaudhari M, Kim EH, Withana Gamage PW, McMahan CS, Kosorok MR (2018) Study design with staggered sampling times for evaluating sustained unresponsiveness to peanut sublingual immunotherapy. Stat Med 37:3944–3958
Chen CM, Lai CC, Cheng KC, Weng SF, Liu WL, Shen HN (2015) Effect of end-stage renal disease on long-term survival after a first-ever mechanical ventilation: a population-based study. Crit Care 19(1):354. https://doi.org/10.1186/s13054-015-1071-x
Cox R et al (1972) Regression models and life tables (with discussion). J R Stat Soc 34:187–220
Dorey FJ, Little RJ, Schenker N (1993) Multiple imputation for threshold-crossing data with interval censoring. Stat Med 12(17):1589–1603
Finkelstein DM (1986) A proportional hazards model for interval-censored failure time data. Biometrics 42:845–854
Goetghebeur E, Ryan L (2000) Semiparametric regression analysis of interval-censored data. Biometrics 56(4):1139–1144. https://doi.org/10.1111/j.0006-341X.2000.01139.x
Goggins WB, Finkelstein DM, Schoenfeld DA, Zaslavsky AM (1998) A markov chain monte carlo em algorithm for analyzing interval-censored data under the cox proportional hazards model. Biometrics 54:1498–1507
Groeneboom P, Wellner JA (1992) Information bounds and nonparametric maximum likelihood estimation, vol 19. Springer, Berlin
Kale B, Muralidharan K (2002) Optimal estimating equations in mixture distributions accommodating instantaneous or early failures. Qual Control Appl Stat 47(6):677–680
Knopik L (2011) Model for instantaneous failures. Sci Probl Mach Oper Maint 46(2):37–45
Lamborn KR, Yung WA, Chang SM, Wen PY, Cloughesy TF, DeAngelis LM, Robins HI, Lieberman FS, Fine HA, Fink KL et al (2008) Progression-free survival: an important end point in evaluating therapy for recurrent high-grade gliomas. Neuro-oncology 10(2):162–170. https://doi.org/10.1215/15228517-2007-062
Li J, Ma S (2013) Survival analysis in medicine and genetics. CRC Press, Boca Raton
Liem MS, van der Graaf Y, van Steensel CJ, Boelhouwer RU, Clevers GJ, Meijer WS, Stassen LP, Vente JP, Weidema WF, Schrijvers AJ et al (1997) Comparison of conventional anterior surgery and laparoscopic surgery for inguinal-hernia repair. N Engl J Med 336(22):1541–1547. https://doi.org/10.1056/NEJM199705293362201
Lin X, Wang L (2010) A semiparametric probit model for case 2 interval-censored failure time data. Stat Med 29(9):972–981. https://doi.org/10.1002/sim.3832
Liu C, Yang W, Devidas M, Cheng C, Pei D, Smith C, Carroll WL, Raetz EA, Bowman WP, Larsen EC et al (2016) Clinical and genetic risk factors for acute pancreatitis in patients with acute lymphoblastic leukemia. J Clin Oncol 34(18):2133–2140. https://doi.org/10.1200/JCO.2015.64.5812
Liu H, Shen Y (2009) A semiparametric regression cure model for interval-censored data. J Am Stat Assoc 104(487):1168–1178. https://doi.org/10.1198/jasa.2009.tm07494
Lu M, Li CS (2017) Penalized estimation for proportional hazards models with current status data. Stat Med 36(30):4893–4907
Lu M, McMahan CS (2018) A partially linear proportional hazards model for current status data. Biometrics 74:1240–1249
Matsuzaki A, Nagatoshi Y, Inada H, Nakayama H, Yanai F, Ayukawa H, Kawakami K, Moritake H, Suminoe A, Okamura J (2005) Prognostic factors for relapsed childhood acute lymphoblastic leukemia: impact of allogeneic stem cell transplantation-a report from the kyushu-yamaguchi children’s cancer study group. Pediatric Blood Cancer 45(2):111–120. https://doi.org/10.1002/pbc.20363
McMahan CS, Wang L, Tebbs JM (2013) Regression analysis for current status data using the em algorithm. Stat Med 32(25):4452–4466. https://doi.org/10.1002/sim.5863
Muralidharan K (1999) Tests for the mixing proportion in the mixture of a degene-rate and exponential distribution. J Ind Stat Assoc 37:105–119
Muralidharan K, Lathika P (2006) Analysis of instantaneous and early failures in weibull distribution. Metrika 64(3):305–316. https://doi.org/10.1007/s00184-006-0050-2
Murthy DP, Xie M, Jiang R (2004) Weibull models, vol 505. Wiley, Hoboken
NCT01373242 (2017) Sublingual immunotherapy for peanut allergy and induction of tolerance (slit-tlc): Nct01373242. http://clinicaltrials.gov/show/NCT01373242 NLM Identifier: NCT01373242
Odell P, Anderson K, Agostino R (1992) Maximum likelihood estimation for interval-censored data using a weibull-based accelerated failure time model. Biometrics. https://doi.org/10.2307/2532360
Pan W (1999) Extending the iterative convex minorant algorithm to the cox model for interval-censored data. J Comput Graph Stat 8(1):109–120
Pan W (2000) A multiple imputation approach to cox regression with interval-censored data. Biometrics 56(1):199–203. https://doi.org/10.1111/j.0006-341X.2000.00199.x
Pham H, Lai CD (2007) On recent generalizations of the weibull distribution. IEEE Trans Reliab 56(3):454–458. https://doi.org/10.1109/TR.2007.903352
Ramsay JO (1988) Monotone regression splines in action. Stat Sci. https://doi.org/10.1214/ss/1177012761
Satten GA (1996) Rank-based inference in the proportional hazards model for interval censored data. Biometrika 83(2):355–370. https://doi.org/10.1093/biomet/83.2.355
Sun J (2007) The statistical analysis of interval-censored failure time data. Springer, Berlin
Wang L, Dunson DB (2011) Semiparametric bayes’ proportional odds models for current status data with underreporting. Biometrics 67(3):1111–1118. https://doi.org/10.1111/j.1541-0420.2010.01532.x
Wang L, McMahan CS, Hudgens MG, Qureshi ZP (2016) A flexible, computationally efficient method for fitting the proportional hazards model to interval-censored data. Biometrics 72(1):222–231. https://doi.org/10.1111/biom.12389
Wang N, Wang L, McMahan CS (2015) Regression analysis of bivariate current status data under the gamma-frailty proportional hazards model using the em algorithm. Comput Stat Data Anal 83:140–150. https://doi.org/10.1016/j.csda.2014.10.013
Zhang Y, Hua L, Huang J (2010) A spline-based semiparametric maximum likelihood estimation method for the cox model with interval-censored data. Scand J Stat 37(2):338–354. https://doi.org/10.1111/j.1467-9469.2009.00680.x
Zhang Z, Sun J (2010) Interval censoring. Stat Methods Med Res 19(1):53–70. https://doi.org/10.1177/0962280209105023
Acknowledgements
This research was partially supported by the Wallace Foundation and National Institutes of Health grants AI121351 and UL1 TR001111.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Withana Gamage, P.W., Chaudari, M., McMahan, C.S. et al. An extended proportional hazards model for interval-censored data subject to instantaneous failures. Lifetime Data Anal 26, 158–182 (2020). https://doi.org/10.1007/s10985-019-09467-z
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10985-019-09467-z