Abstract
Breast cancer is a heterogenous disease with a large variance in prognosis of patients. It is hard to identify patients who would need adjuvant chemotherapy to survive. Using microarray based technology and various feature selection techniques, a number of prognostic gene expression signatures have been proposed recently. It has been shown that these signatures outperform traditional clinical guidelines for estimating prognosis. This paper studies the applicability of state-of-the-art feature extraction methods together with feature selection methods to develop more powerful prognosis estimators. Feature selection is used to remove features not related with the clinical issue investigated. If the resulted dataset is still described by a high number of probes, feature extraction methods can be applied to further reduce the dimension of the data set. In addition we derived six new signatures using three independent data sets, containing in total 610 samples.
Additional information:
http://como.vub.ac.be/~jtaminau/CSBio2010/
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
van ’t Veer, L.J., Dai, H., van de Vijver, M.J., et al.: Gene expression profiling predicts clinical outcome of breast cancer. Nature 415(6871), 530–536 (2002)
Wang, Y., Klijn, J.G.M., Zhang, Y., et al.: Gene-expression profiles to predict distant metastasis of lymph-node-negative primary breast cancer. Lancet 365(9460), 671–679 (2005)
Sotiriou, C., Wirapati, P., Loi, S., et al.: Gene expression profiling in breast cancer: understanding the molecular basis of histologic grade to improve prognosis. J. Natl. Cancer Inst. 98(4), 262–272 (2006)
Korkola, J.E., Blaveri, E., DeVries, S., et al.: Identification of a robust gene signature that predicts breast cancer outcome in independent data sets. BMC Cancer 7, 61 (2007)
Haibe-Kains, B., Desmedt, C., Piette, F., et al.: Comparison of prognostic gene expression signatures for breast cancer. BMC Genomics 9, 394 (2008)
Scott, D., Thompson, J.: Probability density estimation in higher dimensions. In: Computer Science and Statistics: Proceedings of the Fifteenth Symposium on the Interface (1983)
Somorjai, R.L., Dolenko, B., Baumgartner, R.: Class prediction and discovery using gene microarray and proteomics mass spectroscopy data: curses, caveats, cautions. Bioinformatics 19(12), 1484–1491 (2003)
Bild, A.H., Yao, G., Chang, J.T., et al.: Oncogenic pathway signatures in human cancers as a guide to targeted therapies. Nature 439(7074), 353–357 (2006)
Desmedt, C., Piette, F., Loi, S., et al.: Strong time dependence of the 76-gene prognostic signature for node-negative breast cancer patients in the transbig multicenter independent validation series. Clin. Cancer Res. 13(11), 3207–3214 (2007)
Pawitan, Y., Bjöhle, J., Amler, L., et al.: Gene expression profiling spares early breast cancer patients from adjuvant therapy: derived and validated in two population-based cohorts. Breast Cancer Res. 7(6), R953–R964 (2005)
Miller, L.D., Smeds, J., George, J., et al.: An expression signature for p53 status in human breast cancer predicts mutation status, transcriptional effects, and patient survival. Proc. Natl. Acad. Sci. USA 102(38), 13550–13555 (2005)
Schmidt, M., Böhm, D., von Törne, C., et al.: The humoral immune system has a key prognostic impact in node-negative breast cancer. Cancer Res. 68(13), 5405–5413 (2008)
Turk, M., Pentland, A.: Eigenfaces for recognition. J. Cognitive Neurosci. 1(3), 71–86 (1991)
Chiang, S.S., Chang, C.I.: Unsupervised hyperspectral image analysis using independent component analysis. In: IEEE International Geoscience and Remote Sensing Symposium, vol. 1(7), pp. 3136–3138 (July 2000)
Robila, S.A., Varshney, P.K.: Target detection in hyperspectral images based on independent component analysis. In: SPIE AeroSense, Orlando, Florida, USA, vol. 1(7), pp. 3136–3138 (April 2002)
Liebermeister, W.: Linear modes of gene expression determined by independent component analysis. Bioinformatics (January 2002)
Luts, J., Poullet, J.B., Garcia-Gomez, J.M., et al.: Effect of feature extraction for brain tumor classification based on short echo time 1h mr spectra. Magn. Reson. Med. 60(2), 288–298 (2008)
Jolliffe, I.: Principal component analysis. Springer Series in Statistics (2002)
Hyvärinen, A., Oja, E.: Independent component analysis: algorithms and applications. Neural Netw. 13(4-5), 411–430 (2000)
Kong, W., Vanderburg, C.R., Gunshin, H., et al.: A review of independent component analysis application to microarray gene expression data. BioTechniques 45(5), 501–520 (2008)
Alter, O., Brown, P.O., Botstein, D.: Singular value decomposition for genome-wide expression data processing and modeling. Proc. Natl. Acad. Sci. USA 97(18), 10101–10106 (2000)
Sims, A.H., Smethurst, G.J., Hey, Y., et al.: The removal of multiplicative, systematic bias allows integration of breast cancer gene expression datasets - improving meta-analysis and prediction of prognosis. BMC Medical Genomics 1, 42 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Taminau, J. et al. (2010). Sequential Application of Feature Selection and Extraction for Predicting Breast Cancer Aggressiveness. In: Chan, J.H., Ong, YS., Cho, SB. (eds) Computational Systems-Biology and Bioinformatics. CSBio 2010. Communications in Computer and Information Science, vol 115. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16750-8_5
Download citation
DOI: https://doi.org/10.1007/978-3-642-16750-8_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16749-2
Online ISBN: 978-3-642-16750-8
eBook Packages: Computer ScienceComputer Science (R0)