Novel Multi-sample Scheme for Inferring Phylogenetic Markers from Whole Genome Tumor Profiles

Subramanian, Ayshwarya; Shackney, Stanley; Schwartz, Russell

doi:10.1007/978-3-642-30191-9_24

Ayshwarya Subramanian²³,
Stanley Shackney²⁴ &
Russell Schwartz^23,25

Part of the book series: Lecture Notes in Computer Science ((LNBI,volume 7292))

Included in the following conference series:

International Symposium on Bioinformatics Research and Applications

1065 Accesses

Abstract

Computational cancer phylogenetics seeks to enumerate the temporal sequence of aberrations in tumor evolution, thereby delineating the evolution of possible tumor progression pathways, molecular subtypes and mechanisms of action. We previously developed a pipeline for constructing phylogenies describing evolution between major recurring cell types computationally inferred from whole-genome tumor profiles. The accuracy and detail of the phylogenies, however, depends on the identification of accurate, high-resolution molecular markers of progression, i.e., reproducible regions of aberration that robustly differentiate different subtypes and stages of progression. Here we present a novel hidden Markov model (HMM) scheme for the problem of inferring such phylogenetically significant markers through joint segmentation and calling of multi-sample tumor data. Our method classifies sets of genome-wide DNA copy number measurements into a partitioning of samples into normal (diploid) or amplified at each probe. It differs from other similar HMM methods in its design specifically for the needs of tumor phylogenetics, by seeking to identify robust markers of progression conserved across a set of copy number profiles. We show an analysis of our method in comparison to other methods on both synthetic and real tumor data, which confirms its effectiveness for tumor phylogeny inference and suggests avenues for future advances.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Al-Kuraya, K., Schraml, P., Torhorst, J., et al.: Prognostic relevance of gene amplifications and coamplifications in breast cancer. Cancer Research 64(23), 8534–8540 (2004)
Article Google Scholar
Ashworth, A., de Bono, J.S.: Translating cancer research into targeted therapeutics. Nature (2010)
Google Scholar
Bamford, S., Dawson, E., Forbes, S., et al.: The COSMIC (catalogue of somatic mutations in cancer) database and website. Br. J. Cancer (2004)
Google Scholar
Beroukhim, R., Getz, G., Nghiemphu, L., et al.: Assessing the significance of chromosomal aberrations in cancer: Methodology and application to glioma. Proceedings of the National Academy of Sciences 104(50), 20007–20012 (2007)
Article Google Scholar
Eilers, P.H.C., de Menezes, R.: Quantile smoothing of array CGH data. Bioinformatics 21(7), 1146–1153 (2005)
Article Google Scholar
Felsenstein, J.: PHYLIP - Phylogeny Inference Package (Version 3.2). Cladistics 5, 164–166 (1989)
Google Scholar
Futreal, P.A., Coin, L., Marshall, M., et al.: A census of human cancer genes. Nat. Rev. Cancer 4(3), 177–183 (2004)
Article Google Scholar
Golub, T.R., Slonim, D.K., Tamayo, P., et al.: Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science 286, 531–537 (1999)
Article Google Scholar
Horlings, H.M., Bergamaschi, A., Nordgard, S.H., et al.: ESR1 gene amplification in breast cancer: a common phenomenon? Nat. Genet. (2008)
Google Scholar
Hsu, L., Self, S.G., Grove, D., et al.: Denoising array-based comparative genomic hybridization data using wavelets. Biostatistics 6(2), 211–226 (2005)
Article MATH Google Scholar
Kuhner, M.K., Felsenstein, J.: A simulation comparison of phylogeny algorithms under equal and unequal evolutionary rates. Molecular Biology and Evolution 11(3), 459–468 (1994)
Google Scholar
Miller, L.D., Smeds, J., George, J., et al.: An expression signature for p53 status in human breast cancer predicts mutation status, transcriptional effects, and patient survival. Proceedings of the National Academy of Sciences of the United States of America 102(38), 13550–13555 (2005)
Article Google Scholar
Mittendorf, E.A., Liu, Y., Tucker, S.L., et al.: A novel interaction between HER2/neu and cyclin E in breast cancer. Oncogene 29, 3896–3907 (2010)
Article Google Scholar
Moelans, C.B., de Weger, R.A., Monsuur, H.N., et al.: Molecular profiling of invasive breast cancer by multiplex ligation-dependent probe amplification-based copy number analysis of tumor suppressor and oncogenes. Mod. Pathol. (2010)
Google Scholar
Navin, N., Krasnitz, A., Rodgers, L., et al.: Inferring tumor progression from genomic heterogeneity. Genome Research 20, 68–80 (2010)
Article Google Scholar
Nowak, G., Hastie, T., Pollack, J.R., Tibshirani, R.: A fused lasso latent feature model for analyzing multi-sample aCGH data. Biostatistics 12(4), 776–791 (2011)
Article Google Scholar
Olshen, A.B., Venkatraman, E.S., Lucito, R., Wigler, M.: Circular binary segmentation for the analysis of array based DNA copy number data. Biostatistics 5(4), 557–572 (2004)
Article MATH Google Scholar
Perou, C.M., Sorlie, T., Eisen, M.B., et al.: Molecular portraits of human breast tumors. Nature 406, 747–752 (2000)
Article Google Scholar
Picard, F., Lebarbier, E., Hoebeke, M., Rigaill, G., Thiam, B., Robin, S.: Joint segmentation, calling, and normalization of multiple CGH profiles. Biostatistics 12(3), 413–428 (2011)
Article Google Scholar
Picard, F., Robin, S., Lavielle, M., et al.: A statistical approach for array CGH data analysis. BMC Bioinformatics 6 (2005)
Google Scholar
Pique-Regi, R., Ortega, A., Asgharzadeh, S.: Joint estimation of copy number variation and reference intensities on multiple DNA arrays using GADA. Bioinformatics 25(10), 1223–1230 (2009)
Article Google Scholar
Scaltriti, M., Eichhorn, P.J., Cortes, J., et al.: Cyclin E amplification/overexpression is a mechanism of trastuzumab resistance in HER2+ breast cancer patients. Proceedings of the National Academy of Sciences (2011)
Google Scholar
Schwartz, R., Shackney, S.: Applying unmixing to gene expression data for tumor phylogeny inference. BMC Bioinformatics 11, 42 (2010)
Article Google Scholar
Shah, S.P., Cheung, K.J., Johnson, N.A., et al.: Model-based clustering of array cgh data. Bioinformatics 25(12), i30–i38 (2009)
Article Google Scholar
Sorlie, T., Perrou, C.M., Tibshirani, R., et al.: Gene expression profiles of breast carcinomas distinguish tumor subclasses with clinical implications. Proc. Natl. Acad. Sci. USA 98, 10869–10864 (2001)
Google Scholar
Sotiriou, C., Neo, S.Y., McShane, L.M., et al.: Breast cancer classification and prognosis based on gene expression profiles from a population-based study. Proc. Natl. Acad. Sci. USA 100, 10393–10398 (2003)
Article Google Scholar
Subramanian, A., Shackney, S., Schwartz, R.: Inference of tumor phylogenies from genomic assays on heterogeneous samples. In: Proc. ACM-BCB 2011 (2011)
Google Scholar
Swafford, D.: PAUP*. Phylogenetic Analysis Using Parsimony (*and other methods). Version 4 (2002)
Google Scholar
Tolliver, D., Tsourakakis, C., Subramanian, A., et al.: Robust unmixing of tumor states in array comparative genomic hybridization data. Bioinformatics 26(12), i106–i114 (2010)
Article Google Scholar
van’t Veer, L.J., Dai, H., van de Vivjer, M., et al.: Gene expression profiling predicts clinical outcome of breast cancer. Nature 415, 530–536 (2002)
Article Google Scholar
Wang, K., Li, M., Hadley, D., et al.: Penncnv: An integrated hidden markov model designed for high-resolution copy number variation detection in whole-genome snp genotyping data. Genome Research 17(11), 1665–1674 (2007)
Article Google Scholar
Wiel, V.D., Mark, A., Brosens, R., et al.: Smoothing waves in array CGH tumor profiles. Bioinformatics 25(9), 1099–1104 (2009)
Article Google Scholar
Wu, L.Y., Chipman, H.A., Bull, S.B., Briollais, L., Wang, K.: A Bayesian segmentation approach to ascertain copy number variations at the population level. Bioinformatics 25(13), 1669–1679 (2009)
Article Google Scholar
Zhang, N.R., Senbabaoglu, Y., Li, J.Z.: Joint estimation of DNA copy number from multiple platforms. Bioinformatics 26(2), 153–160 (2010)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, USA, PA, 15213
Ayshwarya Subramanian & Russell Schwartz
Oncotherapeutics, Pittsburgh, PA, 15243, USA
Stanley Shackney
Lane Center for Computational Biology, Carnegie Mellon University, Pittsburgh, PA, 15213, USA
Russell Schwartz

Authors

Ayshwarya Subramanian
View author publications
You can also search for this author in PubMed Google Scholar
Stanley Shackney
View author publications
You can also search for this author in PubMed Google Scholar
Russell Schwartz
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Departments of Bioengineering and Electrical Engineering, University of Texas at Dallas, 75080, Richardson, TX, USA
Leonidas Bleris
Department of Computer Science and Engineering, University of Connecticut, 06269, Storrs, CT, USA
Ion Măndoiu
Department of Biological Sciences, Carnegie Mellon University, 15213, Pittsburgh, PA, USA
Russell Schwartz
School of Information Science and Engineering, Central South University, 410083, Changsha, China
Jianxin Wang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Subramanian, A., Shackney, S., Schwartz, R. (2012). Novel Multi-sample Scheme for Inferring Phylogenetic Markers from Whole Genome Tumor Profiles. In: Bleris, L., Măndoiu, I., Schwartz, R., Wang, J. (eds) Bioinformatics Research and Applications. ISBRA 2012. Lecture Notes in Computer Science(), vol 7292. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-30191-9_24

Download citation

DOI: https://doi.org/10.1007/978-3-642-30191-9_24
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-30190-2
Online ISBN: 978-3-642-30191-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics