To impute or to adapt? Model specification tests’ perspective

Cuparić, Marija; Milošević, Bojana

doi:10.1007/s00362-023-01421-4

To impute or to adapt? Model specification tests’ perspective

Regular Article
Published: 26 March 2023

Volume 65, pages 1021–1039, (2024)
Cite this article

Statistical Papers Aims and scope Submit manuscript

119 Accesses
Explore all metrics

Abstract

We study the problem of testing a wide range of statistical hypotheses under the assumption of the sample being randomly right-censored. As an alternative to the classical approach which assumes the modification of a test statistic for complete data, we propose a novel imputation procedure. The new approach, for the first time, is completely hypothesis free which means that it does not require any modification for the application of different statistical procedures. The competitive properties are demonstrated with several goodness-of-fit tests to exponentiality, as well as the most well known two-sample tests. Finally, concluding remarks about whether it is better to impute data or to adapt statistical procedures are provided.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Improving the Robustness of Parametric Imputation

Model checking in multiple imputation: an overview and case study

Article Open access 23 August 2017

Empirical likelihood-based inferences in varying coefficient models with missing data

Article 12 July 2015

References

Allison J, Milošević B, Obradović M et al (2022) Distribution-free goodness-of-fit tests for the Pareto distribution based on a characterization. Comput Stat 37(1):403–418
Article MathSciNet Google Scholar
Balakrishnan N, Chimitova E, Vedernikova M (2015) An empirical analysis of some nonparametric goodness-of-fit tests for censored data. Commun Stat Simul Comput 44(4):1101–1115
Article MathSciNet Google Scholar
Barlow RE, Proschan F (1969) A note on tests for monotone failure rate based on incomplete data. Ann Math Stat 40(2):595–600
Article MathSciNet Google Scholar
Barnhart HX, Song J, Lyles RH (2005) Assay validation for left-censored data. Stat Med 24(21):3347–3360
Article MathSciNet PubMed Google Scholar
Bartholomew DJ (1957) A problem in life testing. J Am Stat Assoc 52(279):350–355
Article Google Scholar
Batsidis A, Economou P, Tzavelas G (2016) Tests of fit for a lognormal distribution. J Stat Comput Simul 86(2):215–235
Article MathSciNet Google Scholar
Bose A, Sen A (1999) The strong law of large numbers for Kaplan-Meier U-statistics. J Theor Probab 12(1):181–200
Article MathSciNet Google Scholar
Bose A, Sen A (2002) Asymptotic distribution of the Kaplan-Meier U-statistics. J Multivar Anal 83(1):84–123
Article MathSciNet Google Scholar
Bothma E, Allison JS, Cockeran M et al (2021) Characteristic function and Laplace transform-based tests for exponentiality in the presence of random right censoring. Stat 10(1):e394
Article MathSciNet Google Scholar
Cuparić M (2021) Asymptotic properties of inverse probability of censored weighted U-empirical process for right-censored data with applications. Statistics 55(5):1035–1057
Article MathSciNet Google Scholar
Cuparić M, Milošević B (2022) New characterization-based exponentiality tests for randomly censored data. TEST 31(2):461–487
Article MathSciNet Google Scholar
Cuparić M, Milošević B, Obradović M (2019) New \({L}^{2}\)-type exponentiality tests. SORT 43(1):25–50
MathSciNet Google Scholar
Cuparić M, Milošević B, Obradović M (2022) New consistent exponentiality tests based on V-empirical Laplace transforms with comparison of efficiencies. Revi Real Acad Ciencias Exactas Físicas Nat Ser A 116(1):1–26
MathSciNet Google Scholar
Datta S, Bandyopadhyay D, Satten GA (2010) Inverse probability of censoring weighted U-statistics for right-censored data with an application to testing hypotheses. Scand J Stat 37(4):680–700
Article MathSciNet Google Scholar
DeLisle RR, Sullo P, Grivas DA (2003) Network-level pavement performance prediction model incorporating censored data. Transp Res Rec 1853(1):72–79
Article Google Scholar
Dobler D, Pauly M (2018) Bootstrap-and permutation-based inference for the Mann-Whitney effect for right-censored and tied data. TEST 27(3):639–658
Article MathSciNet Google Scholar
Emura T, Hsu JH (2020) Estimation of the Mann-Whitney effect in the two-sample problem under dependent censoring. Comput Stat Data Anal 150(106):990
MathSciNet Google Scholar
Fernández T, Rivera N (2020) Kaplan-Meier V-and U-statistics. Electron J Stat 14(1):1872–1916
Article MathSciNet Google Scholar
Fortiana J, Grané A (2003) Goodness-of-fit tests based on maximum correlations and their orthogonal decompositions. J R Stat Soc Seri B 65(1):115–126
Article MathSciNet Google Scholar
Gehan EA (1965) A generalized wilcoxon test for comparing arbitrarily singly-censored samples. Biometrika 52(1–2):203–224
Article MathSciNet CAS PubMed Google Scholar
Kattumannil SK, Anisha P (2019) A simple non-parametric test for decreasing mean time to failure. Stat Pap 60(1):73–87
Article MathSciNet Google Scholar
Kim C, Park BU, Kim W et al (2003) Bezier curve smoothing of the Kaplan-Meier estimator. Ann Inst Stat Math 55(2):359–367
Article MathSciNet Google Scholar
Koziol JA, Green SB (1976) A Cramer-von Mises statistic for randomly censored data. Biometrika 63(3):465–474
MathSciNet Google Scholar
Meeker WQ, Escobar LA (2014) Statistical methods for reliability data. Wiley, New York
Google Scholar
Milošević B, Obradović M (2016) New class of exponentiality tests based on U-empirical Laplace transform. Stat Pap 57(4):977–990
Article MathSciNet Google Scholar
Nikulin M, Haghighi F (2006) A chi-squared test for the generalized power Weibull family for the head-and-neck cancer censored data. J Math Sci 133(3):1333–1341
Article MathSciNet Google Scholar
Obradović M, Jovanović M, Milošević B (2015) Goodness-of-fit tests for Pareto distribution based on a characterization and their asymptotics. Statistics 49(5):1026–1041
Article MathSciNet Google Scholar
Perera M, Dwivedi AK (2020) Statistical issues and methods in designing and analyzing survival studies. Cancer Rep 3(4):e1176
Article Google Scholar
Robins JM, Rotnitzky A (1992) Recovery of information and adjustment for dependent censoring using surrogate markers. In: AIDS Epidemiology. Springer, pp 297–331
Sahoo I, Hazra A (2021) Contamination mapping in bangladesh using a multivariate spatial bayesian model for left-censored data. arXiv preprint arXiv:2106.15730
Sprague LA, Oelsner GP, Argue DM (2017) Challenges with secondary use of multi-source water-quality data in the united states. Water Res 110:252–261
Article CAS PubMed Google Scholar
Strzalkowska-Kominiak E, Grané A (2017) Goodness-of-fit test for randomly censored data based on maximum correlation. SORT 41(1):0119–0138
MathSciNet Google Scholar
Van Buuren S (2018) Flexible imputation of missing data. CRC Press, Boca Raton
Book Google Scholar
Weaver BP, Kaufeld K, Warr R (2020) Estimating correlations with censored data. Qual Eng 32(3):521–527
Article Google Scholar
Wyłupek G (2021) A permutation test for the two-sample right-censored model. Ann Inst Stat Math 73:1037–1061
Article MathSciNet Google Scholar

Download references

Acknowledgements

The authors express deep gratitude to two anonymous referees whose comments led to the improvement of the paper and opened directions for future research.

Funding

The authors of this work are supported by the Ministry of Science, Technological Development and Innovations of the Republic of Serbia (451-03-47/2023-01/ 200104). The work is also supported by the COST action CA21163 - Text, functional and other high-dimensional data in econometrics: New models, methods, applications (HiTEc).

Author information

M. Cuparić and B. Milošević have contributed equally to this work.

Authors and Affiliations

Faculty of Mathematics, University of Belgrade, Studentski trg 12-16, Belgrade, 11000, Serbia
Marija Cuparić & Bojana Milošević

Authors

Marija Cuparić
View author publications
You can also search for this author in PubMed Google Scholar
Bojana Milošević
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M. Cuparić and B. Milošević have contributed equally to this work.

Corresponding author

Correspondence to Bojana Milošević.

Ethics declarations

Conflicts of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file 1 (pdf 328 KB)

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Cuparić, M., Milošević, B. To impute or to adapt? Model specification tests’ perspective. Stat Papers 65, 1021–1039 (2024). https://doi.org/10.1007/s00362-023-01421-4

Download citation

Received: 26 July 2022
Revised: 12 January 2023
Published: 26 March 2023
Issue Date: April 2024
DOI: https://doi.org/10.1007/s00362-023-01421-4

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

To impute or to adapt? Model specification tests’ perspective

Abstract

Access this article

Similar content being viewed by others

Improving the Robustness of Parametric Imputation

Model checking in multiple imputation: an overview and case study

Empirical likelihood-based inferences in varying coefficient models with missing data

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflicts of interest

Additional information

Publisher's Note

Supplementary Information

Supplementary file 1 (pdf 328 KB)

Rights and permissions

About this article

Cite this article

Keywords

Navigation

To impute or to adapt? Model specification tests’ perspective

Abstract

Access this article

Similar content being viewed by others

Improving the Robustness of Parametric Imputation

Model checking in multiple imputation: an overview and case study

Empirical likelihood-based inferences in varying coefficient models with missing data

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflicts of interest

Additional information

Publisher's Note

Supplementary Information

Supplementary file 1 (pdf 328 KB)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation