Using the so-called martingale difference correlation (MDC), we propose a novel censored-conditional-quantile screening approach for ultrahigh-dimensional survival data with heterogeneity (which is often present in such data). By incorporating a weighting scheme, this method is a natural extension of MDC-based conditional quantile screening, as considered in Shao and Zhang (2014), to handle ultrahigh-dimensional survival data. The proposed screening procedure has a sure-screening property under certain technical conditions and an excellent capability of detecting the nonlinear relationship between independent and censored dependent variables. Both simulation results and an analysis of real data demonstrate the effectiveness of the new censored conditional quantile-screening procedure.
This work was supported by the National Statistical Scientific Research Projects (Grant No. 2015LZ54). The authors thank the two anonymous reviewers for their constructive comments, which have led to a dramatic improvement of the earlier version of this article.
Bair E, Tibshirani R. Semi-supervised methods to predict patient survival from gene expression data. PLoS Biol, 2004, 2: 511–522CrossRefGoogle Scholar
Chu W, Li R, Reimherr M. Feature screening for time-varying coEfficient models with ultrahigh dimensional longitudinal data. Ann Appl Stat, 2016, 10: 596–617MathSciNetCrossRefMATHGoogle Scholar
Pan R, Wang H, Li R. Ultrahigh dimensional multi-class linear discriminant analysis by pairwise sure independence screening. J Amer Statist Assoc, 2016, 111: 169–179MathSciNetCrossRefGoogle Scholar
Rosenwald A, Wright G, Chan W C, et al. The use of molecular profiling to predict survival after chemotherapy for diffuse large-B-cell lymphoma. New Engl J Med, 2002, 346: 1937–1947CrossRefGoogle Scholar