Abstract
The paper presents a novel approach to the classical two-sample problem with right-censored data. The result is an efficient procedure for testing the equality of two survival curves. It generalizes, in a natural manner, a well-known standard: the log-rank test. Under the null hypothesis, the new test statistic has an asymptotic chi-square distribution with one degree of freedom, and the corresponding test is consistent against a wide range of alternatives. To control the actual Type I error rate when sample sizes are finite, a permutation approach is employed for inference. An extensive simulation study shows that the new test procedure improves upon classical solutions and popular recent developments in the field. An analysis of real datasets is included. A routine, written in R, is attached as Supplementary Material.
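To make the ingredients named in the abstract concrete, the following is a minimal sketch of a permutation version of the classical log-rank test. It is written in Python rather than the R of the supplementary routine, and it is not the new statistic of the paper, only the standard that the paper generalizes. The function names and the toy data are illustrative assumptions, and the statistic is the usual squared standardized log-rank statistic with the hypergeometric variance.

```python
import random


def logrank_statistic(times, events, groups):
    """Squared standardized log-rank statistic for groups labelled 0 and 1.

    times  : observed times (event or censoring)
    events : 1 if the time is an observed event, 0 if right-censored
    groups : 0/1 group labels
    """
    o_minus_e = 0.0  # observed minus expected events in group 1
    variance = 0.0   # hypergeometric variance accumulated over event times
    for t in sorted({s for s, e in zip(times, events) if e}):
        n = sum(1 for s in times if s >= t)  # pooled number at risk
        n1 = sum(1 for s, g in zip(times, groups) if s >= t and g == 1)
        d = sum(1 for s, e in zip(times, events) if s == t and e)
        d1 = sum(1 for s, e, g in zip(times, events, groups)
                 if s == t and e and g == 1)
        o_minus_e += d1 - d * n1 / n
        if n > 1:
            variance += d * (n1 / n) * (1 - n1 / n) * (n - d) / (n - 1)
    return o_minus_e ** 2 / variance if variance > 0 else 0.0


def permutation_pvalue(times, events, groups, n_perm=999, seed=0):
    """Permutation p-value: reshuffle group labels, keep (time, event) pairs."""
    rng = random.Random(seed)
    observed = logrank_statistic(times, events, groups)
    labels = list(groups)
    exceed = 0
    for _ in range(n_perm):
        rng.shuffle(labels)
        if logrank_statistic(times, events, labels) >= observed:
            exceed += 1
    # Adding 1 to numerator and denominator counts the observed labelling.
    return observed, (1 + exceed) / (n_perm + 1)


# Made-up toy data, for illustration only (not taken from the paper).
toy_times = [5, 8, 12, 14, 20, 3, 4, 6, 9, 11]
toy_events = [1, 0, 1, 1, 0, 1, 1, 1, 0, 1]
toy_groups = [0, 0, 0, 0, 0, 1, 1, 1, 1, 1]

stat, p_value = permutation_pvalue(toy_times, toy_events, toy_groups)
print(f"log-rank statistic = {stat:.3f}, permutation p-value = {p_value:.3f}")
```

Under the null hypothesis the statistic above is asymptotically chi-square with one degree of freedom, so for large samples its value could instead be compared with chi-square quantiles. The permutation p-value replaces that approximation with the reshuffling distribution of the same statistic.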
Change history
02 February 2021
Added Supplementary file.
Acknowledgements
The author is grateful to the Associate Editor and the Reviewer for comments that led to an improved presentation. He also thanks A. Callegaro, Y-M. Chang, V. Garès, J-J. Hsieh, and L. Salmaso for sending copies of their papers. The research has been supported by Grant 1407/M/IM/15, indirectly awarded by the Polish Ministry of Science and Higher Education. Calculations were carried out at the Wrocław Centre for Networking and Supercomputing (http://www.wcss.wroc.pl) under Grant No. 199. The cooperation of the Centre is gratefully acknowledged.
About this article
Cite this article
Wyłupek, G. A permutation test for the two-sample right-censored model. Ann Inst Stat Math 73, 1037–1061 (2021). https://doi.org/10.1007/s10463-020-00777-w
DOI: https://doi.org/10.1007/s10463-020-00777-w