Abstract
This work is motivated by the need to perform the appropriate “robust” analysis on right-censored survival data. As in other domains of application, modelling and analysis of data generated by medical and biological studies are often unstable due to the presence of outliers and model misspecification. Use of robust techniques is helpful in this respect, and has often been the default in such situations. However, a large contaminating set of observations can often mean that the group is generated systematically by a model which is different from the one to which the majority of the data are attributed, rather than being stray outliers. The method of weighted likelihood estimating equations might provide a solution to this problem, where the different roots obtained can indicate the presence of distinct parametric clusters, rather than providing a single robust fit which ignores the observations incompatible with the major fitted component. Efron’s (J. Am. Stat. Assoc. 83, 402, 414–425, 1988) head-and-neck cancer data provide an ideal scenario for the application of such a method. A recently developed variant of the weighted likelihood method provides a nice illustration of the presence of different clusters in Efron’s data, and highlights the benefits of the weighted likelihood method in relation to classical robust techniques.
Similar content being viewed by others
References
Basak, S, Basu, A and Jones, MC (2019). On the “optimal” density power divergence tuning parameter. Tech. Rep. ISRU/2019/1, Interdisciplinary Statistical Research Unit, Indian Statistical Institute. https://www.isical.ac.in/isru/tr19a.pdf.
Basu, A. and Lindsay, B.G. (2004). The iteratively reweighted estimating equation in minimum distance problems. Computational Statistics & Data Analysis 45, 2, 105–124.
Basu, A., Harris, I.R., Hjort, N.L. and Jones, M.C. (1998). Robust and efficient estimation by minimising a density power divergence. Biometrika 85, 3, 549–559.
Basu, S., Basu, A. and Jones, M.C. (2006). Robust and efficient parametric estimation for censored survival data. Ann. Inst. Stat. Math. 58, 2, 341–355.
Biswas, A., Roy, T., Majumder, S. and Basu, A. (2015). A new weighted likelihood approach. Stat 4, 1, 97–107.
Biswas, A., Guha Niyogi, P., Majumder, S., Bhandari, S.K., Ghosh, A. and Basu, A. (2018). Theoretical properties of a new weighted likelihood estimator for right censored data. Tech. Rep. ISRU/2018/2, Interdisciplinary Statistical Research Unit, Indian Statistical Institute. https://www.isical.ac.in/isru/tr18a.pdf.
Efron, B. (1988). Logistic regression, survival analysis, and the kaplan-Meier curve. J. Am. Stat. Assoc. 83, 402, 414–425.
Kaplan, E.L. and Meier, P. (1958). Nonparametric estimation from incomplete observations. J. Am. Stat. Assoc. 53, 282, 457–481.
Léger, C. and Romano, J.P. (1990). Bootstrap choice of tuning parameters. Ann. Inst. Stat. Math. 42, 4, 709–735.
Lindsay, B.G. (1994). Efficiency versus robustness: The case for minimum Hellinger distance and related methods. Ann. Stat. 22, 2, 1081–1114.
Majumder, S, Biswas, A, Roy, T, Bhandari, S and Basu, A (2016). Statistical inference based on a new weighted likelihood approach. arXiv:161007949.
Markatou, M., Basu, A. and Lindsay, B.G. (1997). Weighted likelihood estimating equations: the discrete case with applications to logistic regression. Journal of Statistical Planning and Inference 57, 2, 215–232.
Markatou, M., Basu, A. and Lindsay, B.G. (1998). Weighted likelihood equations with bootstrap root search. J. Am. Stat. Assoc. 93, 442, 740–750.
Wand, M.P. and Jones, M.C. (1994). Kernel Smoothing. CRC Press.
Warwick, J. and Jones, M.C. (2005). Choosing a robustness tuning parameter. J. Stat. Comput. Simul. 75, 7, 581–588.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Biswas, A., Majumder, S., Niyogi, P.G. et al. A Weighted Likelihood Approach to Problems in Survival Data. Sankhya B 83, 466–492 (2021). https://doi.org/10.1007/s13571-019-00214-w
Received:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13571-019-00214-w
Keywords
- Censored survival data
- Kaplan-Meier estimator
- Root selection
- Weibull distribution
- Weighted likelihood estimation