Skip to main content

Efficient Support Vector Machine Method for Survival Prediction with SEER Data

  • Conference paper
  • First Online:
Advances in Computational Biology

Part of the book series: Advances in Experimental Medicine and Biology ((AEMB,volume 680))

Abstract

Support vector machine (SVM) is a popular method for classification, but there are few methods that utilize SVM for survival analysis in the literature because of the computational complexity. In this paper, we develop a novel \( {L_1} \) penalized SVM method for mining right-censored survival data (\( {L_1} \) SVMSURV). Our proposed method can simultaneously identify survival-associated prognostic factors and predict survival outcomes. It is easy to understand and efficient to use especially when applied to large datasets. Our method has been examined through both simulation and real data, and its performance is very good with limited experiments.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 219.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 279.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 379.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Cox DR (1972). Regression models and life-tables (with discussion). Journal of Royal Statistical Society, Series B 34:187–220.

    Google Scholar 

  2. Gui J, Li H (2005). Penalized Cox regression analysis in the high-dimensional and low-sample size settings, with applications to microarray gene expression data. Bioinformatics 21:3001–3008.

    Article  PubMed  CAS  Google Scholar 

  3. Heagerty PJ, Zheng Y (2005). Survival model predictive accuracy and ROC curves. Biometrics 61(1):92–105.

    Article  Google Scholar 

  4. Jin Z, Lin DY, Wei LJ, Ying ZL (2003). Rank-based inference for the accelerated failure time model. Biometrika 90:341–353.

    Article  Google Scholar 

  5. Kalbfleisch JD, Prentice RL (1980). The Statistical Analysis of Failure Time Data. New York: John Wiley.

    Google Scholar 

  6. Lin DW, Porter M, Montgomery B (2009). Treatment and survival outcomes in young men diagnosed with prostate cancer: a Population-based Cohort Study. Cancer 115(13):2863–2871.

    Article  PubMed  Google Scholar 

  7. Liu Z, Jiang F (2009). Gene identification and survival prediction with Lp penalty and novel similarity measure. International Journal of Data Mining and Bioinformatics 3(4):398–408.

    Article  PubMed  Google Scholar 

  8. Liu Z, Gartenhaus RB, Chen X, Howell C, Tan M (2009). Survival prediction and gene identification with penalized global AUC maximization. Journal of Computational Biology 16(12):1661–1670.

    Article  PubMed  CAS  Google Scholar 

  9. Ma S, Huang J (2007). Additive risk survival model with microarray data. BMC Bioinformatics 8:192.

    Article  PubMed  Google Scholar 

  10. Mangasarian OL (2006). Exact 1-norm support vector machines via unconstrained convex differentiable minimization. Journal of Machine Learning Research 7:1517–1530.

    Google Scholar 

  11. Sha N, Tadesse MG, Vannucci M (2006). Bayesian variable selection for the analysis of microarray data with censored outcomes. Bioinformatics 22(18):2262–2268.

    Article  PubMed  CAS  Google Scholar 

  12. Tibshirani R (1996). Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society, Series B 58(1):267–288.

    Google Scholar 

  13. Tibshirani R (1997). The lasso method for variable selection in the Cox model. Statistics in Medicine 16(4):385–395.

    Article  PubMed  CAS  Google Scholar 

  14. Van Houwelingen HC, et al. (2006). Cross-validated Cox regression on microarray gene expression data. Statistics in Medicine 25:3201–3216.

    Article  PubMed  Google Scholar 

  15. Wei LJ (1992). The accelerated failure time model: a useful alternative to the Cox regression model in survival analysis. Statistics in Medicine 11:1871–1879.

    Article  PubMed  CAS  Google Scholar 

  16. Ying ZL (1993). A large sample study of rank estimation for censored regression data. Annals of Statistics 21:76–99.

    Article  Google Scholar 

Download references

Acknowledgment

This work was partially supported by NIH Grant 1R03CA133899-01A210 and NSF CCF-0729080.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zhenqiu Liu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer Science+Business Media, LLC

About this paper

Cite this paper

Liu, Z., Chen, D., Tian, G., Tang, ML., Tan, M., Sheng, L. (2010). Efficient Support Vector Machine Method for Survival Prediction with SEER Data. In: Arabnia, H. (eds) Advances in Computational Biology. Advances in Experimental Medicine and Biology, vol 680. Springer, New York, NY. https://doi.org/10.1007/978-1-4419-5913-3_2

Download citation

Publish with us

Policies and ethics