Abstract
We study the problem of testing a wide range of statistical hypotheses under the assumption of the sample being randomly right-censored. As an alternative to the classical approach which assumes the modification of a test statistic for complete data, we propose a novel imputation procedure. The new approach, for the first time, is completely hypothesis free which means that it does not require any modification for the application of different statistical procedures. The competitive properties are demonstrated with several goodness-of-fit tests to exponentiality, as well as the most well known two-sample tests. Finally, concluding remarks about whether it is better to impute data or to adapt statistical procedures are provided.
Similar content being viewed by others
References
Allison J, Milošević B, Obradović M et al (2022) Distribution-free goodness-of-fit tests for the Pareto distribution based on a characterization. Comput Stat 37(1):403–418
Balakrishnan N, Chimitova E, Vedernikova M (2015) An empirical analysis of some nonparametric goodness-of-fit tests for censored data. Commun Stat Simul Comput 44(4):1101–1115
Barlow RE, Proschan F (1969) A note on tests for monotone failure rate based on incomplete data. Ann Math Stat 40(2):595–600
Barnhart HX, Song J, Lyles RH (2005) Assay validation for left-censored data. Stat Med 24(21):3347–3360
Bartholomew DJ (1957) A problem in life testing. J Am Stat Assoc 52(279):350–355
Batsidis A, Economou P, Tzavelas G (2016) Tests of fit for a lognormal distribution. J Stat Comput Simul 86(2):215–235
Bose A, Sen A (1999) The strong law of large numbers for Kaplan-Meier U-statistics. J Theor Probab 12(1):181–200
Bose A, Sen A (2002) Asymptotic distribution of the Kaplan-Meier U-statistics. J Multivar Anal 83(1):84–123
Bothma E, Allison JS, Cockeran M et al (2021) Characteristic function and Laplace transform-based tests for exponentiality in the presence of random right censoring. Stat 10(1):e394
Cuparić M (2021) Asymptotic properties of inverse probability of censored weighted U-empirical process for right-censored data with applications. Statistics 55(5):1035–1057
Cuparić M, Milošević B (2022) New characterization-based exponentiality tests for randomly censored data. TEST 31(2):461–487
Cuparić M, Milošević B, Obradović M (2019) New \({L}^{2}\)-type exponentiality tests. SORT 43(1):25–50
Cuparić M, Milošević B, Obradović M (2022) New consistent exponentiality tests based on V-empirical Laplace transforms with comparison of efficiencies. Revi Real Acad Ciencias Exactas Físicas Nat Ser A 116(1):1–26
Datta S, Bandyopadhyay D, Satten GA (2010) Inverse probability of censoring weighted U-statistics for right-censored data with an application to testing hypotheses. Scand J Stat 37(4):680–700
DeLisle RR, Sullo P, Grivas DA (2003) Network-level pavement performance prediction model incorporating censored data. Transp Res Rec 1853(1):72–79
Dobler D, Pauly M (2018) Bootstrap-and permutation-based inference for the Mann-Whitney effect for right-censored and tied data. TEST 27(3):639–658
Emura T, Hsu JH (2020) Estimation of the Mann-Whitney effect in the two-sample problem under dependent censoring. Comput Stat Data Anal 150(106):990
Fernández T, Rivera N (2020) Kaplan-Meier V-and U-statistics. Electron J Stat 14(1):1872–1916
Fortiana J, Grané A (2003) Goodness-of-fit tests based on maximum correlations and their orthogonal decompositions. J R Stat Soc Seri B 65(1):115–126
Gehan EA (1965) A generalized wilcoxon test for comparing arbitrarily singly-censored samples. Biometrika 52(1–2):203–224
Kattumannil SK, Anisha P (2019) A simple non-parametric test for decreasing mean time to failure. Stat Pap 60(1):73–87
Kim C, Park BU, Kim W et al (2003) Bezier curve smoothing of the Kaplan-Meier estimator. Ann Inst Stat Math 55(2):359–367
Koziol JA, Green SB (1976) A Cramer-von Mises statistic for randomly censored data. Biometrika 63(3):465–474
Meeker WQ, Escobar LA (2014) Statistical methods for reliability data. Wiley, New York
Milošević B, Obradović M (2016) New class of exponentiality tests based on U-empirical Laplace transform. Stat Pap 57(4):977–990
Nikulin M, Haghighi F (2006) A chi-squared test for the generalized power Weibull family for the head-and-neck cancer censored data. J Math Sci 133(3):1333–1341
Obradović M, Jovanović M, Milošević B (2015) Goodness-of-fit tests for Pareto distribution based on a characterization and their asymptotics. Statistics 49(5):1026–1041
Perera M, Dwivedi AK (2020) Statistical issues and methods in designing and analyzing survival studies. Cancer Rep 3(4):e1176
Robins JM, Rotnitzky A (1992) Recovery of information and adjustment for dependent censoring using surrogate markers. In: AIDS Epidemiology. Springer, pp 297–331
Sahoo I, Hazra A (2021) Contamination mapping in bangladesh using a multivariate spatial bayesian model for left-censored data. arXiv preprint arXiv:2106.15730
Sprague LA, Oelsner GP, Argue DM (2017) Challenges with secondary use of multi-source water-quality data in the united states. Water Res 110:252–261
Strzalkowska-Kominiak E, Grané A (2017) Goodness-of-fit test for randomly censored data based on maximum correlation. SORT 41(1):0119–0138
Van Buuren S (2018) Flexible imputation of missing data. CRC Press, Boca Raton
Weaver BP, Kaufeld K, Warr R (2020) Estimating correlations with censored data. Qual Eng 32(3):521–527
Wyłupek G (2021) A permutation test for the two-sample right-censored model. Ann Inst Stat Math 73:1037–1061
Acknowledgements
The authors express deep gratitude to two anonymous referees whose comments led to the improvement of the paper and opened directions for future research.
Funding
The authors of this work are supported by the Ministry of Science, Technological Development and Innovations of the Republic of Serbia (451-03-47/2023-01/ 200104). The work is also supported by the COST action CA21163 - Text, functional and other high-dimensional data in econometrics: New models, methods, applications (HiTEc).
Author information
Authors and Affiliations
Contributions
M. Cuparić and B. Milošević have contributed equally to this work.
Corresponding author
Ethics declarations
Conflicts of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Cuparić, M., Milošević, B. To impute or to adapt? Model specification tests’ perspective. Stat Papers 65, 1021–1039 (2024). https://doi.org/10.1007/s00362-023-01421-4
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00362-023-01421-4