Skip to main content
Log in

A permutation test for the two-sample right-censored model

  • Published:
Annals of the Institute of Statistical Mathematics Aims and scope Submit manuscript

This article has been updated

Abstract

The paper presents a novel approach to solve a classical two-sample problem with right-censored data. As a result, an efficient procedure for verifying equality of the two survival curves is developed. It generalizes, in a natural manner, a well-known standard, that is, the log-rank test. Under the null hypothesis, the new test statistic has an asymptotic Chi-square distribution with one degree of freedom, while the corresponding test is consistent for a wide range of the alternatives. On the other hand, to control the actual Type I error rate when sample sizes are finite, permutation approach is employed for the inference. An extensive simulation study shows that the new test procedure improves upon classical solutions and popular recent developments in the field. An analysis of the real datasets is included. A routine, written in R, is attached as Supplementary Material.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

Change history

  • 02 February 2021

    Added Supplementary file.

References

  • Arboretti, R., Fontana, R., Pesarin, F., Salmaso, L. (2018). Nonparametric combination tests for comparing two survival curves with informative and non-informative censoring. Statistical Methods in Medical Research, 27, 3739–3769.

    Article  MathSciNet  Google Scholar 

  • Arboretti, R. G., Bolzan, M., Campigotto, F., Corain, L., Salmaso, L. (2010). Combination-based permutation testing in survival analysis. Quaderni di Statistica, 12, 15–38.

    Google Scholar 

  • Behnen, K., Neuhaus, G. (1983). Galton’s test as a linear rank test with estimated scores and its local asymptotic efficiency. Annals of Statistics, 11, 588–599.

    Article  MathSciNet  MATH  Google Scholar 

  • Brendel, M., Janssen, A., Mayer, C.-D., Pauly, M. (2014). Weighted logrank permutation tests for randomly right censored life science data. Scandinavian Journal of Statistics, 41, 742–761.

    Article  MathSciNet  MATH  Google Scholar 

  • Callegaro, A., Spiessens, B. (2017). Testing treatment effect in randomized clinical trials with possible non-proportional hazards. Statistics in Biopharmaceutical Research, 9, 204–211.

    Article  Google Scholar 

  • Chang, Y.-M., Chen, C.-S., Shen, P.-S. (2012). A jackknife-based versatile test for two-sample problems with right-censored data. Journal of Applied Statistics, 39, 267–277.

    Article  MathSciNet  MATH  Google Scholar 

  • Chauvel, C., O’Quigley, J. (2014). Tests for comparing estimated survival functions. Biometrika, 101, 535–552.

    Article  MathSciNet  MATH  Google Scholar 

  • Chi, Y., Tsai, M.-H. (2001). Some versatile tests based on the simultaneous use of weighted logrank and weighted Kaplan–Meier statistics. Communications in Statistics: Simulation and Computation, 30, 743–759.

    Article  MathSciNet  MATH  Google Scholar 

  • Darilay, A. T., Naranjo, J. D. (2011). A pretest for using logrank or Wilcoxon in the two-sample problem. Computational Statistics and Data Analysis, 55, 2400–2409.

    Article  MathSciNet  MATH  Google Scholar 

  • Edmonson, J. H., Fleming, T. R., Decker, D. G., Malkasian, G. D., Jorgensen, E. O., Jefferies, J. A., Webb, M. J., Kvols, L. K. (1979). Different chemotherapeutic sensitivities and host factors affecting prognosis in advanced ovarian carcinoma versus minimal residual disease. Cancer Treatment Reports, 63, 241–247.

    Google Scholar 

  • Efron, B. (1967). The two-sample problem with censored data. Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, 4, 831–853.

    Google Scholar 

  • Efron, B. (1981). Censored data and the bootstrap. Journal of the American Statistical Association, 76, 312–319.

    Article  MathSciNet  MATH  Google Scholar 

  • Fleming, T. R., Harrington, D. P. (1991). Counting processes and survival analysis. New York: Wiley.

    MATH  Google Scholar 

  • Fleming, T. R., Harrington, D. P., O’Sullivan, M. (1987). Supremum versions of the log-rank and generalized Wilcoxon statistics. Journal of the American Statistical Association, 82, 312–320.

    Article  MathSciNet  MATH  Google Scholar 

  • Fleming, T. R., O’Fallon, J. R., O’Brien, P. C., Harrington, D. P. (1980). Modified Kolmogorov–Smirnov test procedures with application to arbitrarily right-censored data. Biometrics, 36, 607–625.

    Article  MathSciNet  MATH  Google Scholar 

  • Garès, V., Andrieu, S., Dupuy, J.-F., Savy, N. (2017). On the Fleming–Harrington test for late effects in prevention randomized controlled trials. Journal of Statistical Theory and Practice, 11, 418–435.

    Article  MathSciNet  MATH  Google Scholar 

  • Gastrointestinal Tumor Study Group. (1982). A comparison of combination chemotherapy and combined modality therapy for locally advanced gastric carcinoma. Cancer, 49, 1771–1777.

    Article  Google Scholar 

  • Gehan, E. A. (1965). A generalized Wilcoxon test for comparing arbitrarily singly censored samples. Biometrika, 52, 203–223.

    Article  MathSciNet  MATH  Google Scholar 

  • Gill, R. D. (1980). Censoring and stochastic integrals. Mathematical Centre Tracts 124. Amsterdam: Mathematisch Centrum. http://oai.cwi.nl/oai/asset/11499/11499A.pdf.

  • Harrington, D. P., Fleming, T. R. (1982). A class of rank test procedures for censored survival data. Biometrika, 69, 553–566.

    Article  MathSciNet  MATH  Google Scholar 

  • Hsieh, J.-J., Chen, H.-Y. (2017). A testing strategy for two crossing survival curves. Communications in Statistics-Simulation and Computation, 46, 6685–6696.

    Article  MathSciNet  MATH  Google Scholar 

  • Inglot, T., Ledwina, T. (2006). Towards data driven selection of a penalty function for data driven Neyman tests. Linear Algebra and Its Applications, 417, 124–133.

    Article  MathSciNet  MATH  Google Scholar 

  • Janic-Wróblewska, A., Ledwina, T. (2000). Data driven rank test for two-sample problem. Scandinavian Journal of Statistics, 27, 281–297.

    Article  MathSciNet  MATH  Google Scholar 

  • Kaplan, E. L., Meier, P. (1958). Nonparametric estimation from incomplete observations. Journal of the American Statistical Association, 53, 457–481.

    Article  MathSciNet  MATH  Google Scholar 

  • Koziol, J. A. (1978). A two sample Cramér–von Mises test for randomly censored data. Biometrical Journal, 20, 603–608.

    Article  MathSciNet  MATH  Google Scholar 

  • Koziol, J. A., Jia, Z. (2014). Weighted Lin–Wang tests for crossing hazards. Computational and Mathematical Methods in Medicine. https://doi.org/10.1155/2014/643457.

    Article  MathSciNet  MATH  Google Scholar 

  • Kraus, D. (2009). Adaptive Neyman’s smooth tests of homogeneity of two samples of survival data. Journal of Statistical Planning and Inference, 139, 3559–3569.

    Article  MathSciNet  MATH  Google Scholar 

  • Lee, J. W. (1996). Some versatile tests based on the simultaneous use of weighted log-rank statistics. Biometrics, 52, 721–725.

    Article  MATH  Google Scholar 

  • Lee, S.-H. (2007). On the versatility of the combination of the weighted log-rank statistics. Computational Statistics and Data Analysis, 51, 6557–6564.

    Article  MathSciNet  MATH  Google Scholar 

  • Lee, S.-H., Lee, E.-J., Omolo, B. O. (2008). Using integrated weighted survival difference for the two-sample censored data problem. Computational Statistics and Data Analysis, 52, 4410–4416.

    Article  MathSciNet  MATH  Google Scholar 

  • Letón, E., Zuluaga, P. (2005). Relationships among tests for censored data. Biometrical Journal, 47, 377–387.

    Article  MathSciNet  MATH  Google Scholar 

  • Li, G., Tiwari, R. C., Wells, M. T. (1996). Quantile comparison functions in two-sample problems, with application to comparisons of diagnostic markers. Journal of the American Statistical Association, 91, 689–698.

    Article  MathSciNet  MATH  Google Scholar 

  • Lin, Ch.-Y., Kosorok, M. R. (1999). A general class of function-indexed nonparametric tests for survival analysis. Annals of Statistics, 27, 1722–1744.

    Article  MathSciNet  MATH  Google Scholar 

  • Lin, X., Wang, H. (2004). A new testing approach for comparing the overall homogeneity of survival curves. Biometrical Journal, 46, 489–496.

    Article  MathSciNet  MATH  Google Scholar 

  • Liu, Y., Yin, G. (2017). Partitioned log-rank tests for the overall homogeneity of hazard rate functions. Lifetime Data Analysis, 23, 400–425.

    Article  MathSciNet  MATH  Google Scholar 

  • Lu, H. H. S., Wells, M. T., Tiwari, R. C. (1994). Inference for shift functions in the two-sample problem with right-censored data: With applications. Journal of the American Statistical Association, 89, 1017–1026.

    Article  MathSciNet  MATH  Google Scholar 

  • Mantel, N. (1966). Evaluation of survival data and two new rank order statistics arising in its consideration. Cancer Chemotherapy Reports, 50, 163–170.

    Google Scholar 

  • Martínez-Camblor, P. (2010). Comparing k-independent and right censored samples based on the likelihood ratio. Computational Statistics, 25, 363–374.

    Article  MathSciNet  MATH  Google Scholar 

  • Neuhaus, G. (2000). A method of constructing rank tests in survival analysis. Journal of Statistical Planning and Inference, 91, 481–497.

    Article  MathSciNet  MATH  Google Scholar 

  • O’Quigley, J. (2003). Khalamadze-type graphical evaluation of the proportional hazard assumption. Biometrika, 90, 577–584.

    Article  MathSciNet  MATH  Google Scholar 

  • Pepe, M. S., Fleming, T. R. (1989). Weighted Kaplan–Meier statistics: A class of distance tests for censored survival data. Biometrics, 45, 497–507.

    Article  MathSciNet  MATH  Google Scholar 

  • Pepe, M. S., Fleming, T. R. (1991). Weighted Kaplan–Meier statistics: Large sample and optimality considerations. Journal of the Royal Statistical Society, Series B, 53, 341–352.

    MathSciNet  MATH  Google Scholar 

  • Pesarin, F., Salmaso, L. (2010). Permutation tests for complex data: Theory, applications and software. Chichester: Wiley.

    Book  MATH  Google Scholar 

  • Peto, R., Peto, J. (1972). Asymptotically efficient rank invariant test procedures (with discussion). Journal of the Royal Statistical Society, Series A, 135, 185–206.

    Article  MATH  Google Scholar 

  • Prentice, R. L. (1978). Linear rank tests with right censored data. Biometrika, 65, 167–179.

    Article  MathSciNet  MATH  Google Scholar 

  • Qiu, P., Sheng, J. (2008). A two-stage procedure for comparing hazard rate functions. Journal of the Royal Statistical Society, Series B, 70, 191–208.

    MathSciNet  MATH  Google Scholar 

  • Schumacher, M. (1984). Two-sample tests of Cramér–von Mises- and Kolmogorov–Smirnov-type for randomly censored data. International Statistical Review, 52, 263–281.

    Article  MathSciNet  MATH  Google Scholar 

  • Tarone, R. E., Ware, J. (1977). On distribution-free test for equality of survival distributions. Biometrika, 64, 156–160.

    Article  MathSciNet  MATH  Google Scholar 

  • Wu, L., Gilbert, P. B. (2002). Flexible weighted log-rank tests optimal for detecting early and/or late survival differences. Biometrics, 58, 997–1004.

    Article  MathSciNet  MATH  Google Scholar 

  • Wyłupek, G. (2010). Data-driven k-sample tests. Technometrics, 52, 107–123.

    Article  MathSciNet  Google Scholar 

  • Yang, S., Prentice, R. (2005). Semiparametric analysis of short-term and long-term hazard ratios with two-sample survival data. Biometrika, 92, 1–17.

    Article  MathSciNet  MATH  Google Scholar 

  • Yang, S., Prentice, R. (2010). Improved logrank-type tests for survival data using adaptive weights. Biometrics, 66, 30–38.

    Article  MathSciNet  MATH  Google Scholar 

  • Zhang, J., Wu, Y. (2007). k-sample tests based on the likelihood ratio. Computational Statistics and Data Analysis, 51, 4682–4691.

    Article  MathSciNet  MATH  Google Scholar 

Download references

Acknowledgements

The author is grateful to the Associate Editor and Reviewer for the comments which led to the improvement in the presentation. He also thanks A. Callegaro, Y-M. Chang, V. Garès, J-J. Hsieh, and L. Salmaso for sending the copies of their papers. The research has been supported by the Grant 1407/M/IM/15 indirectly awarded by the Polish Ministry of Science and Higher Education. Calculations have been carried out in Wrocław Centre for Networking and Supercomputing (http://www.wcss.wroc.pl) under Grant No. 199. The cooperation of the Centre is gratefully acknowledged.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Grzegorz Wyłupek.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (pdf 672 KB)

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wyłupek, G. A permutation test for the two-sample right-censored model. Ann Inst Stat Math 73, 1037–1061 (2021). https://doi.org/10.1007/s10463-020-00777-w

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10463-020-00777-w

Keywords

Navigation