Versatile Identification of Copy Number Variants with Canvas

  • Sergii Ivakhno
  • Eric RollerEmail author
Part of the Methods in Molecular Biology book series (MIMB, volume 1833)


Versatile and efficient variant calling tools are needed to analyze large-scale sequencing datasets. In particular, identification of copy number changes remains a challenging task due to their complexity, susceptibility to sequencing biases, variation in coverage data and dependence on genome-wide sample properties, such as tumor polyploidy, polyclonality in cancer samples, or frequency of de novo variation in germline genomes of pedigrees. The frequent need of core sequencing facilities to process samples from both normal and tumor sources favors multipurpose variant calling tools with functionality to process these diverse sets within a single software framework. This not only simplifies the overall bioinformatics workflow but also streamlines maintenance by shortening the software update cycle and requiring only limited staff training. Here we introduce Canvas, a tool for identification of copy number changes from diverse sequencing experiments including whole-genome matched tumor–normal, small pedigree, and single-sample normal resequencing, as well as whole-exome matched and unmatched tumor–normal studies. In addition to variant calling, Canvas infers genome-wide parameters such as cancer ploidy, purity, and heterogeneity. It provides fast and easy-to-run workflows that can scale to thousands of samples and can be easily incorporated into variant calling pipelines.

Key words

Copy number variation Small pedigree Somatic variation 


  1. 1.
    Navin NE (2010) Tracing the tumor lineage. Mol Oncol 4:267–283CrossRefPubMedPubMedCentralGoogle Scholar
  2. 2.
    Acuna-Hidalgo R et al (2016) New insights into the generation and role of de novo mutations in health and disease. Genome Biol 17:241CrossRefPubMedPubMedCentralGoogle Scholar
  3. 3.
    Teo SM et al (2012) Statistical challenges associated with detecting copy number variations with next-generation sequencing. Bioinformatics 28:2711–2718CrossRefPubMedGoogle Scholar
  4. 4.
    Abyzov A et al (2011) CNVnator: an approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing. Genome Res 21:974–984CrossRefPubMedPubMedCentralGoogle Scholar
  5. 5.
    Boeva V et al (2011) Control-FREEC: a tool for assessing copy number and allelic content using next generation sequencing data. Bioinformatics 28:423–425CrossRefPubMedPubMedCentralGoogle Scholar
  6. 6.
    Liu Y et al (2016) Joint detection of copy number variations in parent-offspring trios. Bioinformatics 32:1130–1137CrossRefPubMedGoogle Scholar
  7. 7.
    Roller E et al (2016) Canvas: versatile and scalable detection of copy number variants. Bioinformatics 32:2375–2377CrossRefPubMedGoogle Scholar
  8. 8.
    Ivakhno S, et al. (2018) Canvas SPW: calling de novo copy number variants in pedigrees.
  9. 9.
    Fryzlewicz P (2007) Unbalanced Haar technique for nonparametric function estimation. J Am Stat Assoc 102:1318–1327CrossRefGoogle Scholar
  10. 10.
    Ivakhno S et al (2017) tHapMix: simulating tumour samples through haplotype mixtures. Bioinformatics 33:280–282CrossRefPubMedGoogle Scholar
  11. 11.
    Eberle M et al (2016) A reference dataset of 5.4 million human variants validated by genetic inheritance from sequencing a three-generation 17-member pedigree. Genome Res 27:157–164CrossRefPubMedGoogle Scholar
  12. 12.
    Kim S, et al. (2017) Strelka2: fast and accurate variant calling for clinical sequencing applications. bioRxiv.
  13. 13.
    Raczy C et al (2013) Isaac: ultra-fast whole-genome secondary analysis on Illumina sequencing platforms. Bioinformatics 29:2041–2043CrossRefPubMedGoogle Scholar
  14. 14.
    Saunders C et al (2012) Strelka: accurate somatic small-variant calling from sequenced tumor–normal sample pairs. Bioinformatics 28:1811–1817CrossRefPubMedGoogle Scholar
  15. 15.
    Chen X et al (2016) Manta: rapid detection of structural variants and indels for germline and cancer sequencing applications. Bioinformatics 32:1220–1222CrossRefPubMedGoogle Scholar

Copyright information

© Springer International Publishing AG, part of Springer Nature 2018

Authors and Affiliations

  1. 1.Illumina Cambridge Ltd.Chesterford Research ParkEssexUK
  2. 2.Illumina Inc.5200 Illumina WaySan DiegoUSA

Personalised recommendations