Abstract
Support vector machine (SVM) is a popular method for classification, but there are few methods that utilize SVM for survival analysis in the literature because of the computational complexity. In this paper, we develop a novel \( {L_1} \) penalized SVM method for mining right-censored survival data (\( {L_1} \) SVMSURV). Our proposed method can simultaneously identify survival-associated prognostic factors and predict survival outcomes. It is easy to understand and efficient to use especially when applied to large datasets. Our method has been examined through both simulation and real data, and its performance is very good with limited experiments.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Cox DR (1972). Regression models and life-tables (with discussion). Journal of Royal Statistical Society, Series B 34:187–220.
Gui J, Li H (2005). Penalized Cox regression analysis in the high-dimensional and low-sample size settings, with applications to microarray gene expression data. Bioinformatics 21:3001–3008.
Heagerty PJ, Zheng Y (2005). Survival model predictive accuracy and ROC curves. Biometrics 61(1):92–105.
Jin Z, Lin DY, Wei LJ, Ying ZL (2003). Rank-based inference for the accelerated failure time model. Biometrika 90:341–353.
Kalbfleisch JD, Prentice RL (1980). The Statistical Analysis of Failure Time Data. New York: John Wiley.
Lin DW, Porter M, Montgomery B (2009). Treatment and survival outcomes in young men diagnosed with prostate cancer: a Population-based Cohort Study. Cancer 115(13):2863–2871.
Liu Z, Jiang F (2009). Gene identification and survival prediction with Lp penalty and novel similarity measure. International Journal of Data Mining and Bioinformatics 3(4):398–408.
Liu Z, Gartenhaus RB, Chen X, Howell C, Tan M (2009). Survival prediction and gene identification with penalized global AUC maximization. Journal of Computational Biology 16(12):1661–1670.
Ma S, Huang J (2007). Additive risk survival model with microarray data. BMC Bioinformatics 8:192.
Mangasarian OL (2006). Exact 1-norm support vector machines via unconstrained convex differentiable minimization. Journal of Machine Learning Research 7:1517–1530.
Sha N, Tadesse MG, Vannucci M (2006). Bayesian variable selection for the analysis of microarray data with censored outcomes. Bioinformatics 22(18):2262–2268.
Tibshirani R (1996). Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society, Series B 58(1):267–288.
Tibshirani R (1997). The lasso method for variable selection in the Cox model. Statistics in Medicine 16(4):385–395.
Van Houwelingen HC, et al. (2006). Cross-validated Cox regression on microarray gene expression data. Statistics in Medicine 25:3201–3216.
Wei LJ (1992). The accelerated failure time model: a useful alternative to the Cox regression model in survival analysis. Statistics in Medicine 11:1871–1879.
Ying ZL (1993). A large sample study of rank estimation for censored regression data. Annals of Statistics 21:76–99.
Acknowledgment
This work was partially supported by NIH Grant 1R03CA133899-01A210 and NSF CCF-0729080.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer Science+Business Media, LLC
About this paper
Cite this paper
Liu, Z., Chen, D., Tian, G., Tang, ML., Tan, M., Sheng, L. (2010). Efficient Support Vector Machine Method for Survival Prediction with SEER Data. In: Arabnia, H. (eds) Advances in Computational Biology. Advances in Experimental Medicine and Biology, vol 680. Springer, New York, NY. https://doi.org/10.1007/978-1-4419-5913-3_2
Download citation
DOI: https://doi.org/10.1007/978-1-4419-5913-3_2
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4419-5912-6
Online ISBN: 978-1-4419-5913-3
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)