Skip to main content

Case Based Reasoning with Bayesian Model Averaging: An Improved Method for Survival Analysis on Microarray Data

  • Conference paper
Case-Based Reasoning. Research and Development (ICCBR 2010)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6176))

Included in the following conference series:

Abstract

Microarray technology enables the simultaneous measurement of thousands of gene expressions, while often providing a limited set of samples. These datasets require data mining methods for classification, prediction, and clustering to be tailored to the peculiarity of this domain, marked by the so called ‘curse of dimensionality’. One main characteristic of these specialized algorithms is their intensive use of feature selection for improving their performance. One promising method for feature selection is Bayesian Model Averaging (BMA) to find an optimal subset of genes. This article presents BMA applied to gene selection for classification on two cancer gene expression datasets and for survival analysis on two cancer gene expression datasets, and explains how case based reasoning (CBR) can benefit from this model to provide, in a hybrid BMA-CBR classification or survival prediction method, an improved performance and more expansible model.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Annest, A., Bumgarner, R.E., Raftery, A.E., Yeung, K.Y.: Iterative Bayesian Model Averaging: a method for the application of survival analysis to high-dimensional microarray data. BMC Bioinformatics 10, 10–72 (2009)

    Article  Google Scholar 

  2. Jiangeng, L., Yanhua, D., Xiaogang, R.: A Novel Hybrid Approach to Selecting Marker Genes for Cancer Classification Using Gene Expression Data. In: The 1st International Conference on Bioinformatics and Biomedical Engineering ICBBE 2007, pp. 264–267 (2007)

    Google Scholar 

  3. Liu, H., Motoda, H.: Feature Selection for Knowledge Discovery and Data Mining. The Springer International Series in Engineering and Computer Science, vol. 454. Springer, Heidelberg (1998)

    MATH  Google Scholar 

  4. Liu, H., Motoda, H. (eds.) Computational Methods of Feature Selection. Hall/Crc, Data Mining and Knowledge Discovery Series. Chapman & Hall/CRC (2008)

    Google Scholar 

  5. Huang, T., Kecman, V., Kopriva, I.: Kernel Based Algorithms for Mining Huge Data Sets: Supervised, Semi-Supervised, and Unsupervised Learning. SCI, vol. 17. Springer, The Netherlands (2006)

    MATH  Google Scholar 

  6. Witten, I., Frank, R.: Data mining: Practical Machine Learning Tools and Techniques, 2nd edn. Morgan Kaufman Series in Data Management Systems. Elsevier, Inc., San Francisco (2005)

    MATH  Google Scholar 

  7. Kotsiantis, S.: Supervised Machine Learning: A Review of Classification Techniques. Informatica 31, 249–268 (2007)

    MATH  MathSciNet  Google Scholar 

  8. Cohen, J.: Bioinformatics – An Introduction for Computer Scientists. ACM Computing Surveys 36(2), 122–158 (2004)

    Article  Google Scholar 

  9. Piatetsky-Shapiro, G., Tamayo, P.: Microarray Data Mining: Facing the Challenges. ACM SIGKDD Explorations Newsletter 5(2), 1–5 (2003)

    Article  Google Scholar 

  10. Volinsky, C., Madigan, D., Raftery, A., Kronmal, R.: Bayesian Model Averaging in Proprtional Hazard Models: Assessing the Risk of a Stroke. Applied Statistics 46(4), 433–448 (1997)

    MATH  Google Scholar 

  11. Hosmer, D., Lemeshow, S., May, S.: Applied Survival Analysis: Regression Modeling of Time to Event Data, 2nd edn. Wiley Series in Probability and Statistics. Wiley Interscience, Hoboken (2008)

    MATH  Google Scholar 

  12. Raftery, A.: Bayesian Model Selection in Social Research (with Discussion). In: Marsden, P. (ed.) Sociological Methodology 1995, pp. 111–196. Blackwell, Cambridge (1995)

    Google Scholar 

  13. Hoeting, J., Madigan, D., Raftery, A., Volinsky, C.: Bayesian Model Averaging: A Tutorial. Statistical Science 14(4), 382–417 (1999)

    Article  MATH  MathSciNet  Google Scholar 

  14. Yeung, K., Bumgarner, R., Raftery, A.: Bayesian Model Averaging: Development of an Improved Multi-Class, Gene Selection and Classification Tool for Microarray Data. Bioinformatics 21(10), 2394–2402 (2005)

    Article  Google Scholar 

  15. Furnival, G., Wilson, R.: Regression by Leaps and Bounds. Technometrics 16, 499–511 (1974)

    Article  MATH  Google Scholar 

  16. Madigan, D., Raftery, A.: Model Selection and Accounting for Model Uncertainty in Graphical Models Using Occam’s Window. Journal of the American Statistical Association 89, 1335–1346 (1994)

    Article  Google Scholar 

  17. Raftery, A.: Approximate Bayes Factors and Accounting for Model Uncertainty in Generalised Linear Models. Biometrika 83(2), 251–266 (1996)

    Article  MATH  MathSciNet  Google Scholar 

  18. Cox, D.: Regression Models and Life Tables. J. Royal Stat. Soc. Series B 34, 187–220 (1972)

    MATH  Google Scholar 

  19. Kaplan, E., Meier, P.: Nonparametric Estimation from Incomplete Observations. Journal of the American Statistical Association 53, 457–481 (1958)

    Article  MATH  MathSciNet  Google Scholar 

  20. Beer, D., Kardia, S., Huang, C., Giordano, T., Levin, A., et al.: Gene-Expression Profiles Predict Survival of Patients with Lung Adenocarcinoma. Nature Medicine 8(8), 816–824 (2002)

    Google Scholar 

  21. Bair, E., Tibshirani, R.: Semi-Supervised Methods to Predict Patient Survival from Gene Expression Data. PLOS Biology 2(4), 511–522 (2004)

    Article  Google Scholar 

  22. Jurisica, I., Glasgow, J.: Applications of Case-Based Reasoning in Molecular Biology. AI Magazine 25(1), 85–95 (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Bichindaritz, I., Annest, A. (2010). Case Based Reasoning with Bayesian Model Averaging: An Improved Method for Survival Analysis on Microarray Data. In: Bichindaritz, I., Montani, S. (eds) Case-Based Reasoning. Research and Development. ICCBR 2010. Lecture Notes in Computer Science(), vol 6176. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14274-1_26

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-14274-1_26

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-14273-4

  • Online ISBN: 978-3-642-14274-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics