PyClone: statistical inference of clonal population structure in cancer

Roth, Andrew; Khattra, Jaswinder; Yap, Damian; Wan, Adrian; Laks, Emma; Biele, Justina; Ha, Gavin; Aparicio, Samuel; Bouchard-Côté, Alexandre; Shah, Sohrab P

doi:10.1038/nmeth.2883

PyClone: statistical inference of clonal population structure in cancer

Brief Communication
Published: 16 March 2014

Volume 11, pages 396–398, (2014)
Cite this article

From

View current issue Submit your manuscript

Andrew Roth^1,2,
Jaswinder Khattra²,
Damian Yap²,
Adrian Wan²,
Emma Laks²,
Justina Biele²,
Gavin Ha^1,2,
Samuel Aparicio^2,3,
Alexandre Bouchard-Côté⁴ &
…
Sohrab P Shah^2,3

37k Accesses
614 Citations
34 Altmetric
1 Mention
Explore all metrics

Abstract

We introduce PyClone, a statistical model for inference of clonal population structures in cancers. PyClone is a Bayesian clustering method for grouping sets of deeply sequenced somatic mutations into putative clonal clusters while estimating their cellular prevalences and accounting for allelic imbalances introduced by segmental copy-number changes and normal-cell contamination. Single-cell sequencing validation demonstrates PyClone's accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

**Figure 1: Comparison of clustering performance for the mixture of normal-tissue data sets.**

**Figure 2: Joint analysis of multiple samples from high-grade serous ovarian cancer 2.**

PyClone-VI: scalable inference of clonal population structures using whole genome data

Article Open access 10 December 2020

Copy-number analysis and inference of subclonal populations in cancer genomes using Sclust

Article 24 May 2018

ReMixT: clone-specific genomic structure estimation in cancer

Article Open access 27 July 2017

References

Nowell, P.C. Science 194, 23–28 (1976).
Article CAS PubMed Google Scholar
Aparicio, S. & Caldas, C. N. Engl. J. Med. 368, 842–851 (2013).
Article CAS PubMed Google Scholar
Greaves, M. & Maley, C.C. Nature 481, 306–313 (2012).
Article CAS PubMed PubMed Central Google Scholar
Shah, S.P. et al. Nature 486, 395–399 (2012).
Article CAS PubMed Google Scholar
Ding, L. et al. Nature 481, 506–510 (2012).
Article CAS PubMed PubMed Central Google Scholar
Nik-Zainal, S. et al. Cell 149, 994–1007 (2012).
Article CAS PubMed PubMed Central Google Scholar
Carter, S.L. et al. Nat. Biotechnol. 30, 413–421 (2012).
Article CAS PubMed PubMed Central Google Scholar
Govindan, R. et al. Cell 150, 1121–1134 (2012).
Article CAS PubMed PubMed Central Google Scholar
Shah, S.P. et al. Nature 461, 809–813 (2009).
Article CAS PubMed Google Scholar
Gerlinger, M. et al. N. Engl. J. Med. 366, 883–892 (2012).
Article CAS PubMed PubMed Central Google Scholar
The 1000 Genomes Project Consortium. Nature 467, 1061–1073 (2010).
Harismendy, O. et al. Genome Biol. 12, R124 (2011).
Article CAS PubMed PubMed Central Google Scholar
Rosenberg, A. & Hirschberg, J. in Proc. 2007 Joint Conf. Empir. Methods Natural Lang. Process. Comput. Natural Lang. Learn. (EMNLP-CoNLL) Vol. 410, 420 (2007).
Google Scholar
Bashashati, A. et al. J. Pathol. 231, 21–34 (2013).
Article CAS PubMed PubMed Central Google Scholar
Forshew, T. et al. Sci. Transl. Med. 4, 136ra68 (2012).
Article PubMed Google Scholar
Dawson, S.J. et al. N. Engl. J. Med. 368, 1199–1209 (2013).
Article CAS PubMed Google Scholar
Sottoriva, A. et al. Proc. Natl. Acad. Sci. USA 110, 4009–4014 (2013).
Article CAS PubMed PubMed Central Google Scholar
Fritsch, A. & Ickstadt, K. Bayesian Anal. 4, 367–392 (2009).
Article Google Scholar
Ng, S.B. et al. Nature 461, 272–276 (2009).
Article CAS PubMed PubMed Central Google Scholar
Van Loo, P. et al. Proc. Natl. Acad. Sci. USA 107, 16910–16915 (2010).
Article CAS PubMed PubMed Central Google Scholar
Greenman, C.D. et al. Biostatistics 11, 164–175 (2010).
Article PubMed Google Scholar
Yau, C. et al. Genome Biol. 11, R92 (2010).
CAS PubMed PubMed Central Google Scholar
Untergasser, A. et al. Nucleic Acids Res. 40, e115 (2012).
Article CAS PubMed PubMed Central Google Scholar
Li, H. & Durbin, R. Bioinformatics 26, 589–595 (2010).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This work is funded by Canadian Institutes for Health Research (CIHR), Genome Canada, Genome British Columbia, Canadian Cancer Society Research Institute and Canadian Breast Cancer Foundation grants to S.P.S. and S.A. S.P.S. is supported by the Michael Smith Foundation for Health Research and is the Canada Research Chair (CRC) for Computational Cancer Genomics. S.A. is the CRC for Molecular Oncology. A.R. is supported by a CIHR Banting scholarship.

Author information

Authors and Affiliations

Bioinformatics Graduate Program, University of British Columbia, Vancouver, British Columbia, Canada
Andrew Roth & Gavin Ha
Department of Molecular Oncology, British Columbia Cancer Research Centre, Vancouver, British Columbia, Canada
Andrew Roth, Jaswinder Khattra, Damian Yap, Adrian Wan, Emma Laks, Justina Biele, Gavin Ha, Samuel Aparicio & Sohrab P Shah
Department of Pathology and Laboratory Medicine, University of British Columbia, Vancouver, British Columbia, Canada
Samuel Aparicio & Sohrab P Shah
Department of Statistics, University of British Columbia, Vancouver, British Columbia, Canada
Alexandre Bouchard-Côté

Authors

Andrew Roth
View author publications
You can also search for this author in PubMed Google Scholar
Jaswinder Khattra
View author publications
You can also search for this author in PubMed Google Scholar
Damian Yap
View author publications
You can also search for this author in PubMed Google Scholar
Adrian Wan
View author publications
You can also search for this author in PubMed Google Scholar
Emma Laks
View author publications
You can also search for this author in PubMed Google Scholar
Justina Biele
View author publications
You can also search for this author in PubMed Google Scholar
Gavin Ha
View author publications
You can also search for this author in PubMed Google Scholar
Samuel Aparicio
View author publications
You can also search for this author in PubMed Google Scholar
Alexandre Bouchard-Côté
View author publications
You can also search for this author in PubMed Google Scholar
Sohrab P Shah
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Project conception and oversight: S.P.S., S.A., A.R.; method development: A.R., A.B.-C., S.P.S.; implementation and benchmarking: A.R.; manuscript writing and editing, study design and execution: A.R., A.B.C., S.P.S., S.A.; single-cell sequencing: J.K., D.Y., A.W., E.L., J.B.; data analysis and interpretation: G.H.

Corresponding author

Correspondence to Sohrab P Shah.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Text and Figures

Supplementary Figures 1–14, Supplementary Results, Supplementary Discussion and Supplementary Note (PDF 5370 kb)

Supplementary Table 1

Allelic counts, IBBMM and PyClone PCN cellular prevalence estimates for mutations in high grade serous ovarian cancer case 2. Copy number predictions where inferred using PICNIC as described in the Online Methods. Cellular prevalences where computed by taking the mean of the post burnin trace for the cellular prevalences for the respective methods. The standard deviation of the cellular prevalence parameter estimated from the post burnin trace is also included. Cluster ids (last two columns) were predicted from the post burnin trace using the MPEAR clustering criteria as described in the Online Methods and Supplementary Note. Mutation ids list gene name, chromosome and chromosome coordinate. All coordinates are in the hg19 coordinate system. (XLS 50 kb)

Supplementary Table 2

Allelic counts, IBBMM and PyClone PCN cellular prevalence estimates for mutations in high grade serous ovarian cancer case 1. Copy number predictions where inferred using PICNIC as described in the Online Methods. Cellular prevalences where computed by taking the mean of the post burnin trace for the cellular prevalences for the respective methods. The standard deviation of the cellular prevalence parameter estimated from the post burnin trace is also included. Cluster ids (last two columns) were predicted from the post burnin trace using the MPEAR clustering criteria as described in the Online Methods and Supplementary Note. Mutation ids list gene name, chromosome and chromosome coordinate. All coordinates are in the hg19 coordinate system. (XLSX 40 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Roth, A., Khattra, J., Yap, D. et al. PyClone: statistical inference of clonal population structure in cancer. Nat Methods 11, 396–398 (2014). https://doi.org/10.1038/nmeth.2883

Download citation

Received: 18 November 2013
Accepted: 31 January 2014
Published: 16 March 2014
Issue Date: April 2014
DOI: https://doi.org/10.1038/nmeth.2883
Springer Nature America, Inc.

This article is cited by

CONIPHER: a computational framework for scalable phylogenetic reconstruction with error correction
- Kristiana Grigoriadis
- Ariana Huebner
- Nicholas McGranahan
Nature Protocols (2024)
Multiregion sampling of de novo metastatic prostate cancer reveals complex polyclonality and augments clinical genotyping
- Evan W. Warner
- Kim Van der Eecken
- Alexander W. Wyatt
Nature Cancer (2024)
Adoptive neoantigen-reactive T cell therapy: improvement strategies and current clinical researches
- Ruichen Huang
- Bi Zhao
- Wei Zhang
Biomarker Research (2023)
ACT-Discover: identifying karyotype heterogeneity in pancreatic cancer evolution using ctDNA
- Ariana Huebner
- James R. M. Black
- Rodrigo A. Toledo
Genome Medicine (2023)
The heterogeneity and clonal evolution analysis of the advanced prostate cancer with castration resistance
- Ao Liu
- Yi Gao
- Danfeng Xu
Journal of Translational Medicine (2023)

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

PyClone: statistical inference of clonal population structure in cancer

From

Abstract

Access this article

Similar content being viewed by others

PyClone-VI: scalable inference of clonal population structures using whole genome data

Copy-number analysis and inference of subclonal populations in cancer genomes using Sclust

ReMixT: clone-specific genomic structure estimation in cancer

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Supplementary information

Supplementary Text and Figures

Supplementary Table 1

Supplementary Table 2

Rights and permissions

About this article

Cite this article

This article is cited by

CONIPHER: a computational framework for scalable phylogenetic reconstruction with error correction

Multiregion sampling of de novo metastatic prostate cancer reveals complex polyclonality and augments clinical genotyping

Adoptive neoantigen-reactive T cell therapy: improvement strategies and current clinical researches

ACT-Discover: identifying karyotype heterogeneity in pancreatic cancer evolution using ctDNA

The heterogeneity and clonal evolution analysis of the advanced prostate cancer with castration resistance

Navigation

PyClone: statistical inference of clonal population structure in cancer

Abstract

Access this article

Similar content being viewed by others

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Navigation