Skip to main content
Log in

To impute or to adapt? Model specification tests’ perspective

  • Regular Article
  • Published:
Statistical Papers Aims and scope Submit manuscript

Abstract

We study the problem of testing a wide range of statistical hypotheses under the assumption of the sample being randomly right-censored. As an alternative to the classical approach which assumes the modification of a test statistic for complete data, we propose a novel imputation procedure. The new approach, for the first time, is completely hypothesis free which means that it does not require any modification for the application of different statistical procedures. The competitive properties are demonstrated with several goodness-of-fit tests to exponentiality, as well as the most well known two-sample tests. Finally, concluding remarks about whether it is better to impute data or to adapt statistical procedures are provided.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

References

  • Allison J, Milošević B, Obradović M et al (2022) Distribution-free goodness-of-fit tests for the Pareto distribution based on a characterization. Comput Stat 37(1):403–418

    Article  MathSciNet  Google Scholar 

  • Balakrishnan N, Chimitova E, Vedernikova M (2015) An empirical analysis of some nonparametric goodness-of-fit tests for censored data. Commun Stat Simul Comput 44(4):1101–1115

    Article  MathSciNet  Google Scholar 

  • Barlow RE, Proschan F (1969) A note on tests for monotone failure rate based on incomplete data. Ann Math Stat 40(2):595–600

    Article  MathSciNet  Google Scholar 

  • Barnhart HX, Song J, Lyles RH (2005) Assay validation for left-censored data. Stat Med 24(21):3347–3360

    Article  MathSciNet  PubMed  Google Scholar 

  • Bartholomew DJ (1957) A problem in life testing. J Am Stat Assoc 52(279):350–355

    Article  Google Scholar 

  • Batsidis A, Economou P, Tzavelas G (2016) Tests of fit for a lognormal distribution. J Stat Comput Simul 86(2):215–235

    Article  MathSciNet  Google Scholar 

  • Bose A, Sen A (1999) The strong law of large numbers for Kaplan-Meier U-statistics. J Theor Probab 12(1):181–200

    Article  MathSciNet  Google Scholar 

  • Bose A, Sen A (2002) Asymptotic distribution of the Kaplan-Meier U-statistics. J Multivar Anal 83(1):84–123

    Article  MathSciNet  Google Scholar 

  • Bothma E, Allison JS, Cockeran M et al (2021) Characteristic function and Laplace transform-based tests for exponentiality in the presence of random right censoring. Stat 10(1):e394

    Article  MathSciNet  Google Scholar 

  • Cuparić M (2021) Asymptotic properties of inverse probability of censored weighted U-empirical process for right-censored data with applications. Statistics 55(5):1035–1057

    Article  MathSciNet  Google Scholar 

  • Cuparić M, Milošević B (2022) New characterization-based exponentiality tests for randomly censored data. TEST 31(2):461–487

    Article  MathSciNet  Google Scholar 

  • Cuparić M, Milošević B, Obradović M (2019) New \({L}^{2}\)-type exponentiality tests. SORT 43(1):25–50

    MathSciNet  Google Scholar 

  • Cuparić M, Milošević B, Obradović M (2022) New consistent exponentiality tests based on V-empirical Laplace transforms with comparison of efficiencies. Revi Real Acad Ciencias Exactas Físicas Nat Ser A 116(1):1–26

    MathSciNet  Google Scholar 

  • Datta S, Bandyopadhyay D, Satten GA (2010) Inverse probability of censoring weighted U-statistics for right-censored data with an application to testing hypotheses. Scand J Stat 37(4):680–700

    Article  MathSciNet  Google Scholar 

  • DeLisle RR, Sullo P, Grivas DA (2003) Network-level pavement performance prediction model incorporating censored data. Transp Res Rec 1853(1):72–79

    Article  Google Scholar 

  • Dobler D, Pauly M (2018) Bootstrap-and permutation-based inference for the Mann-Whitney effect for right-censored and tied data. TEST 27(3):639–658

    Article  MathSciNet  Google Scholar 

  • Emura T, Hsu JH (2020) Estimation of the Mann-Whitney effect in the two-sample problem under dependent censoring. Comput Stat Data Anal 150(106):990

    MathSciNet  Google Scholar 

  • Fernández T, Rivera N (2020) Kaplan-Meier V-and U-statistics. Electron J Stat 14(1):1872–1916

    Article  MathSciNet  Google Scholar 

  • Fortiana J, Grané A (2003) Goodness-of-fit tests based on maximum correlations and their orthogonal decompositions. J R Stat Soc Seri B 65(1):115–126

    Article  MathSciNet  Google Scholar 

  • Gehan EA (1965) A generalized wilcoxon test for comparing arbitrarily singly-censored samples. Biometrika 52(1–2):203–224

    Article  MathSciNet  CAS  PubMed  Google Scholar 

  • Kattumannil SK, Anisha P (2019) A simple non-parametric test for decreasing mean time to failure. Stat Pap 60(1):73–87

    Article  MathSciNet  Google Scholar 

  • Kim C, Park BU, Kim W et al (2003) Bezier curve smoothing of the Kaplan-Meier estimator. Ann Inst Stat Math 55(2):359–367

    Article  MathSciNet  Google Scholar 

  • Koziol JA, Green SB (1976) A Cramer-von Mises statistic for randomly censored data. Biometrika 63(3):465–474

    MathSciNet  Google Scholar 

  • Meeker WQ, Escobar LA (2014) Statistical methods for reliability data. Wiley, New York

    Google Scholar 

  • Milošević B, Obradović M (2016) New class of exponentiality tests based on U-empirical Laplace transform. Stat Pap 57(4):977–990

    Article  MathSciNet  Google Scholar 

  • Nikulin M, Haghighi F (2006) A chi-squared test for the generalized power Weibull family for the head-and-neck cancer censored data. J Math Sci 133(3):1333–1341

    Article  MathSciNet  Google Scholar 

  • Obradović M, Jovanović M, Milošević B (2015) Goodness-of-fit tests for Pareto distribution based on a characterization and their asymptotics. Statistics 49(5):1026–1041

    Article  MathSciNet  Google Scholar 

  • Perera M, Dwivedi AK (2020) Statistical issues and methods in designing and analyzing survival studies. Cancer Rep 3(4):e1176

    Article  Google Scholar 

  • Robins JM, Rotnitzky A (1992) Recovery of information and adjustment for dependent censoring using surrogate markers. In: AIDS Epidemiology. Springer, pp 297–331

  • Sahoo I, Hazra A (2021) Contamination mapping in bangladesh using a multivariate spatial bayesian model for left-censored data. arXiv preprint arXiv:2106.15730

  • Sprague LA, Oelsner GP, Argue DM (2017) Challenges with secondary use of multi-source water-quality data in the united states. Water Res 110:252–261

    Article  CAS  PubMed  Google Scholar 

  • Strzalkowska-Kominiak E, Grané A (2017) Goodness-of-fit test for randomly censored data based on maximum correlation. SORT 41(1):0119–0138

    MathSciNet  Google Scholar 

  • Van Buuren S (2018) Flexible imputation of missing data. CRC Press, Boca Raton

    Book  Google Scholar 

  • Weaver BP, Kaufeld K, Warr R (2020) Estimating correlations with censored data. Qual Eng 32(3):521–527

    Article  Google Scholar 

  • Wyłupek G (2021) A permutation test for the two-sample right-censored model. Ann Inst Stat Math 73:1037–1061

    Article  MathSciNet  Google Scholar 

Download references

Acknowledgements

The authors express deep gratitude to two anonymous referees whose comments led to the improvement of the paper and opened directions for future research.

Funding

The authors of this work are supported by the Ministry of Science, Technological Development and Innovations of the Republic of Serbia (451-03-47/2023-01/ 200104). The work is also supported by the COST action CA21163 - Text, functional and other high-dimensional data in econometrics: New models, methods, applications (HiTEc).

Author information

Authors and Affiliations

Authors

Contributions

M. Cuparić and B. Milošević have contributed equally to this work.

Corresponding author

Correspondence to Bojana Milošević.

Ethics declarations

Conflicts of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file 1 (pdf 328 KB)

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Cuparić, M., Milošević, B. To impute or to adapt? Model specification tests’ perspective. Stat Papers 65, 1021–1039 (2024). https://doi.org/10.1007/s00362-023-01421-4

Download citation

  • Received:

  • Revised:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00362-023-01421-4

Keywords

Navigation