Abstract
Cancer genome projects can provide different types of data on the genetic level, which is significant for cancer research and biological processes in computational methods. Thus, computational methods used to identify cancer subtypes should fully focus on integrating these multidimensional data (e.g., DNA methylation data, mRNA expression data, etc.). Sparse reduced-rank regression (Srrr) method, a state-of-the-art multiple response linear regression method, can easily deal with high dimensional statistical data. In this paper, we introduced Srrr method combining iCluster (Srrr-cluster) to discovery cancer subtypes. Firstly, we used Srrr to estimate the coefficient matrix and then cancer subtypes were clustered by iCluster. Finally, we used our Srrr-cluster method to analyze glioblastoma and breast cancer data. The results show that our Srrr-cluster method is effective for cancer subtype identification.
Keywords
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Pollack, J.R., et al.: Microarray analysis reveals a major direct role of DNA copy number alteration in the transcriptional program of human breast tumors. Proc. Natl. Acad. Sci. 99(20), 12963–12968 (2002)
Stratton, M.R., Campbell, P.J., Futreal, P.A.: The cancer genome. Nature 458(7239), 719–724 (2009)
Jones, P.A., Baylin, S.B.: The fundamental role of epigenetic events in cancer. Nat. Rev. Genet. 3(6), 415–428 (2002)
Hoadley, K.A., et al.: Multiplatform analysis of 12 cancer types reveals molecular classification within and across tissues of origin. Cell 158(4), 929–944 (2014)
Verhaak, R.G., et al.: Integrated genomic analysis identifies clinically relevant subtypes of glioblastoma characterized by abnormalities in PDGFRA, IDH1, EGFR, and NF1. Cancer Cell 17(1), 98–110 (2010)
Shen, R., Olshen, A.B., Ladanyi, M.: Integrative clustering of multiple genomic data types using a joint latent variable model with application to breast and lung cancer subtype analysis. Bioinformatics 25(22), 2906–2912 (2009)
Shen, R., et al.: Integrative subtype discovery in glioblastoma using iCluster. PLoS ONE 7(4), e35236 (2012)
Zha, H., et al.: Spectral relaxation for K-means clustering. Neural Inf. Process. Syst. 1057–1064 (2001)
Ding, C., He, X.F.: Cluster structure of K-means clustering via principal component analysis. Adv. Knowl. Discov. Data Min. Proc. 3056, 414–418 (2004)
Simon, N., et al.: A sparse-group lasso. J. Comput. Graph. Stat. 22(2), 231–245 (2013)
Huang, J., Breheny P., Ma S.: A selective review of group selection in high-dimensional models. Stat. Sci. 27(4) (2012)
Lin, D.D., et al.: Network-based investigation of genetic modules associated with functional brain networks in schizophrenia. In: 2013 IEEE International Conference on Bioinformatics and Biomedicine (Bibm) (2013)
Chen, L., Huang, J.Z.: Sparse reduced-rank regression for simultaneous dimension reduction and variable selection. J. Am. Stat. Assoc. 107(500), 1533–1545 (2012)
Ma, Z., Sun T.: Adaptive sparse reduced-rank regression (2014). arXiv preprint arXiv:1403.1922
Mo, Q., et al.: Pattern discovery and cancer gene identification in integrated cancer genomic data. Proc. Natl. Acad. Sci. USA 110(11), 4245–4250 (2013)
Rousseeuw, P.J.: Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J. Comput. Appl. Math. 20, 53–65 (1987)
Hosmer Jr, D.W., Lemeshow, S.: Applied Survival Analysis: Regression Modelling of Time to Event Data. European Orthodontic Society (1999)
Wang, B., et al.: Similarity network fusion for aggregating data types on a genomic scale. Nat. Methods 11(3), 333–337 (2014)
Acknowledgments
This work was supported by National Natural Science Foundation of China (31301101 and 61272339), the Anhui Provincial Natural Science Foundation (1408085QF106), the Specialized Research Fund for the Doctoral Program of Higher Education (20133401120011), and the Technology Foundation for Selected Overseas Chinese Scholars from Department of Human Resources and Social Security of Anhui Province (No. [2014]-243).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Ge, SG., Xia, JF., Wei, PJ., Zheng, CH. (2016). Srrr-cluster: Using Sparse Reduced-Rank Regression to Optimize iCluster. In: Huang, DS., Han, K., Hussain, A. (eds) Intelligent Computing Methodologies. ICIC 2016. Lecture Notes in Computer Science(), vol 9773. Springer, Cham. https://doi.org/10.1007/978-3-319-42297-8_10
Download citation
DOI: https://doi.org/10.1007/978-3-319-42297-8_10
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-42296-1
Online ISBN: 978-3-319-42297-8
eBook Packages: Computer ScienceComputer Science (R0)