Next-generation sequencing (NGS) panel analysis on DNA from formalin-fixed paraffin-embedded (FFPE) tissue is increasingly used to also identify actionable copy number gains (gene amplifications) in addition to sequence variants. While guidelines for the reporting of sequence variants are available, guidance with respect to reporting copy number gains from gene-panel NGS data is limited. Here, we report on Dutch consensus recommendations obtained in the context of the national Predictive Analysis for THerapy (PATH) project, which aims to optimize and harmonize routine diagnostics in molecular pathology. We briefly discuss two common approaches to detect gene copy number gains from NGS data, i.e., the relative coverage and B-allele frequencies. In addition, we provide recommendations for reporting gene copy gains for clinical purposes. In addition to general QC metrics associated with NGS in routine diagnostics, it is recommended to include clinically relevant quantitative parameters of copy number gains in the clinical report, such as (i) relative coverage and estimated copy numbers in neoplastic cells, (ii) statistical scores to show significance (e.g., z-scores), and (iii) the sensitivity of the assay and restrictions of NGS-based detection of copy number gains. Collectively, this information can guide clinical and analytical decisions such as the reliable detection of high-level gene amplifications and the requirement for additional in situ assays in case of borderline results or limited sensitivity.
Diagnostic NGS gene panels allow parallel detection of high-level gene amplifications associated with targeted therapy. In the Netherlands, eight molecular pathology laboratories currently have included copy number analyses in their routine NGS work-up and one laboratory is in the midst of the validation procedure (Supplementary Table 1) and [1,2,3]. Due to the lack of standard procedures and guidelines, the method of determining, interpreting, and reporting these potential clinically actionable gains vary among the different laboratories (Supplementary Table 1). Guidelines to report the clinically relevant sequence variants in cancer (i.e., small indels and single-nucleotide variants) are extensive [4, 5], but are scarce with respect to somatic copy number gains . The Predictive Analysis for THerapy (PATH) project is a national initiative to optimize and harmonize the routine diagnostics in molecular pathology across the Netherlands. A national meeting was arranged, resulting in recommendations for the interpretation and reporting of copy number gains using gene panel NGS data and to support adequate interpretation and clinical decision-making. Here we focus on high-level copy number gains, also described as high-level gene amplifications, that can entail therapeutically targetable aberrations (reviewed in ). High-level copy number gains can be more reliably detected than copy number losses or low-level copy number gains when applying gene (hot spot) panels of limited size.
Detection of copy number gains from gene panel NGS data
With an increased number of genomic DNA (gDNA) template molecules available for targeted sequencing, gene copy number gains will be reflected by an increased abundance of these genomic segments in the libraries and consequently result in an increased number of sequencing reads covering the respective (part of a) gene. To identify these coverage outliers in gene panel NGS data, several approaches have been described [8,9,10,11]. For details on methods and software used in routine diagnostics in the Netherlands, see Supplementary Table 1. Note that at this moment “best practices” are yet to be determined and the choice of method and software is generally based on in house availability and validation.
Generally, these approaches start with sample normalization to correct for differences in total reads, which is especially required in the context of formalin-fixed paraffin-embedded (FFPE) tissue analyses with variable input quality and quantity of gDNA. It is of importance that the applied normalization method is not affected by the presence of high-level copy number gains. For example, the median coverage is more stable relative to average or summed coverage in the presence of high-level copy number gains. In addition, the genomic locations covered by the gene panel should be sufficiently spread throughout the genome covering multiple loci, to allow appropriate normalization in the presence of a high-level amplified gene. Subsequently, the obtained normalized read count per amplicon and/or per gene can be compared to a reference pool of (control) samples to estimate the copy number. From this comparison, statistical measures such as relative coverage (also often referred to as “fold change”) and z-scores can be deduced (Fig. 1). The choice for an internal or external reference pool (that is, reference samples are analyzed within the same or a different batch) and the minimal size of this reference pool depends on diagnostic batch size, batch content, and the stability of the assay and should therefore be determined during validation. One should keep in mind that aneuploidy in tumors can affect normalization and the quantification might be an underestimation or overestimation depending on the nature and extend of aneuploidy and the number of genomic regions that are included in the gene panel. Therefore, the gene panel should also include genes/loci that are not expected to be affected by CNV in the malignancies of interest.
The z-score is a significance score commonly used in the detection of copy number variation from coverage data. This score represents the number of standard deviations that the obtained coverage is above or below the mean of a reference group of values. Therefore, it greatly depends on testing conditions, including the number of data points for a given locus and the analytical noise that causes variations in control samples or between duplicate analyses of the same sample. In a similar manner, the confidence interval is a statistical measure representing the variation in a reference group of samples and reflects whether the measurement likely lies within the reference group interval. Consequently, while z-scores and other measures like confidence intervals are essential to distinguish analytical noise from actual copy number variation and thus reflect the statistical significance of a copy number variation, they are also greatly influenced by the standard deviation that depends on laboratory- and test-specific conditions (see also Fig. 1e, f). In the following paragraphs, the z-score is used as a representative for any significance score that can be used to distinguish noise from actual copy number gains.
By applying validated thresholds, significant coverage gains can be identified that reflect gene amplifications. Note that the relative coverage of individual amplicons within an amplified gene can differ. Generally, poorly performing amplicons with a low absolute coverage in control samples tend to obtain a lower relative coverage in case of high-level amplifications, likely due to technical saturation. The required total number of amplicons per gene depends on the technical variability and should be determined during validation. Distribution of amplicons throughout the gene locus is preferred to prevent false positive calls from partial gene amplifications, while keeping in mind frequently deleted regions like the exons 2 to 7 in EGFRvIII .
A second commonly used approach to detect copy number variation using NGS data is based on variant allele frequencies of germline (single-nucleotide) polymorphisms (SNPs) at the gene loci. These so-called B allele frequencies (BAFs) were initially described for SNP-based array analysis . For heterozygous SNPs, the variant allele frequency in gDNA from normal tissue approaches 50%. In the presence of copy number gains, the variant allele frequency is increased when the B/minor allele is amplified and likewise decreased with amplification of the A/major allele (Fig. 2).
The relative coverage and BAF approaches are complementary. For example, the BAF approach requires coverage information to discriminate copy number gains from copy number losses. With sufficient “SNP-density” the BAF approach can be more sensitive to detect low copy number aberrations (such as gene deletions or duplications), while the relative coverage approach is more reliable in the quantitative assessment of higher-level copy number gains (Fig. 3).
Regardless of the applied method, it is recommended to use positive and negative control samples in which the copy number gains are confirmed by alternative approaches such as fluorescence in situ hybridization (FISH), SNP-array analysis, or multiplex ligation-dependent probe amplification (MLPA). After validation, positive and negative control samples should also be analyzed on a regular basis, to ensure stability of the assay. Analytical cutoff values should be established that translate into reliable and significant copy number gains, preferably for all individual genes of interest. Since analysis of gDNA of limited input quantity and/or quality may result in suboptimal coverage and subsequently lead to false positive calls, the use of minimal coverage thresholds is also recommended.
Clinically relevant measures of gene amplification
Currently, the clinical relevance of gene amplifications is largely based on molecular analyses by in situ approaches such as FISH. The presence of gene copy number gains in single neoplastic nuclei has been correlated with clinical responses towards drugs targeting the product of the amplified gene. However, the above-described, NGS-based measurements are obtained from the total gDNA template molecules in the sample and as such represent a mixture of tumor-derived and non-neoplastic gDNA from stromal and inflammatory cells. The measured gain is thus determined by both the neoplastic cell percentage and the actual allele copy number (Fig. 4a). To relate the NGS detected gains to FISH detected gains, the calculated number of alleles could be corrected for the estimated percentage of neoplastic nuclei in the area from which the gDNA was isolated. While we realize the estimation of this percentage is error-prone [14, 15], it can be supported by the variant allele frequencies (VAF) of somatic variants in other genes and it allows estimation of the number of gene copies in the “order of magnitude” required to asses clinical relevance. As mentioned above, the estimation of the actual copy number gains may be biased in highly aneuploid tumors. For clinical decision making, high copy number gains are most relevant for which the estimation of allele copy number will be less affected compared to low copy number alterations.
It is important to assess the clinical relevance of an estimated copy number on a case-by-case basis. For example, for the tyrosine kinase inhibitor (TKI) crizotinib a FISH established cutoff of 10 copies of the MET gene in lung adenocarcinoma has been suggested , while only very limited data on the response related to low-level (4–9 copies) and high-level (≥ 10 copies) amplification are presently available [17, 18]. For capmatinib targeting MET in EGFR mutated lung adenocarcinoma with disease progression on EGFR-TKI treatment, gains of ≥ 6 copies have been related to response to combined MET and EGFR targeting . For example, in case an estimated copy number gain of 30 copies of MET in the tumor cells is detected by gene panel NGS analysis, TKI treatment could directly be considered, while an estimated number more closely to 6 copies warrants subsequent FISH analysis for clinical guidance. Depending on the results of the internal validation, a confirmatory FISH analysis can be performed in a subset of cases. It is important to realize that these cutoffs differ per therapeutic agent, gene, and tumor type. To determine the optimal cutoff for treatment responses, several trials are ongoing; e.g., FGFR1 amplification levels predicting clinical benefit for inhibitors targeting FGFR1 appear drug and tumor dependent .
Note that in FISH analyses typically only 50–100 selected nuclei are analyzed. While this enables specific investigation of neopastlic nuclei, which is hampered by admixture of non-neoplastic cells in NGS-based analysis, it creates a potential bias that this selection is not representative for the whole neoplastic area (for instance, the selection can favor nuclei with multiple signals). For NGS-based approaches, the data represents an average of a much larger number of cells (e.g., 20 ng represents ~3300 nuclei). To ultimately evaluate the clinical use of FISH- and NGS-based diagnostics, both should be included in clinical trials focusing on treatment response.
Clinical reporting of NGS-based copy number gain analysis
The clinical report that describes the results of the NGS-based analysis of copy number gains should include the interpretation, e.g., “amplification detected of gene X,” “no indications for the presence of amplifications,” “inconclusive,” or “additional testing required.” In addition, it is essential to include quantitative information as well as assay limitations.
Quantitative information includes the relative coverage that reflects the total number of alleles present in the gDNA sample. Relative to BAFs, this measure provides a more linear read-out of allele copy number. For loci with a significant copy number change, based on the minimal validated relative coverage and z-score or confidence interval, it is recommended to report the relative coverage and the estimated number of copies in the neoplastic cells by taking the estimated neoplastic cell percentage into account.
Awareness of assay limitations is critical for routine diagnostics, as illustrated by the necessity to include the “reportable range” for NGS-based detection of sequence variants, like the fraction of the targeted genomic regions for which calls of an acceptable quality can be generated . For the report of copy number gain detection, it is essential to specify the estimated sensitivity as the minimum amount of gene copies that can reliably be measured as a copy number gain. Note that generally these NGS-based analyses are not sufficiently sensitive to reliably exclude the presence of any copy number gain in case of low neoplastic cell percentages. Therefore, the threshold for copy number gains also depends on the percentage of neoplastic cells (Fig. 4b). We suggest a sentence such as “Based on the estimated neoplastic cell percentage, this assay allows detection of copy number gains of > XX copies.” This sensitivity is also crucial to decide on the necessity to perform additional analyses.
High-level copy number gains (gene amplifications) of clinically targetable genes can be detected using NGS gene (hot spot) panels in which multiple independent genomic regions are included. Tables 1 and 2 summarize the technical considerations and biological phenomena impacting on the detection of these copy number gains. NGS gene panel-based analysis has the advantage that high-level copy number gains in multiple genes can be measured simultaneously in addition to sequence variant detection. As such, independent assays to determine copy number gains may not be needed for a subset of tumors, improving turn-around times, cost-efficiency, and tissue management. A limitation of an NGS-based approach is the difficulty to detect low-level copy number gains and/or high-level amplifications in specimens with low neoplastic cell percentages. For these cases, in situ analyses including FISH are recommended to either exclude or confirm the presence of copy number gains. We recommend reporting relative coverage and the estimated copy numbers in neoplastic cells for loci that reach a minimal validated relative coverage and significance score, as these parameters represent both quantitative and clinically relevant measures. Clinical validity of the identified copy number gain needs to be interpreted on a case-by-case basis.
Sie D, Snijders PJ, Meijer GA, Doeleman MW, van Moorsel MI, van Essen HF, Eijk PP, Grunberg K, van Grieken NC, Thunnissen E, Verheul HM, Smit EF, Ylstra B, Heideman DA (2014) Performance of amplicon-based next generation DNA sequencing for diagnostic gene mutation profiling in oncopathology. Cell Oncol (Dordr) 37(5):353–361. https://doi.org/10.1007/s13402-014-0196-2
Dubbink HJ, Atmodimedjo PN, van Marion R, Krol NMG, Riegman PHJ, Kros JM, van den Bent MJ, Dinjens WNM (2016) Diagnostic detection of allelic losses and imbalances by next-generation sequencing: 1p/19q co-deletion analysis of gliomas. J Mol Diagn 18(5):775–786. https://doi.org/10.1016/j.jmoldx.2016.06.002
van Riet J, Krol NMG, Atmodimedjo PN, Brosens E, van IWFJ, Jansen M, Martens JWM, Looijenga LH, Jenster G, Dubbink HJ, Dinjens WNM, van de Werken HJG (2018) SNPitty: an intuitive web application for interactive B-allele frequency and copy number visualization of next-generation sequencing data. J Mol Diagn 20(2):166–176. https://doi.org/10.1016/j.jmoldx.2017.11.011
Deans ZC, Costa JL, Cree I, Dequeker E, Edsjo A, Henderson S, Hummel M, Ligtenberg MJ, Loddo M, Machado JC, Marchetti A, Marquis K, Mason J, Normanno N, Rouleau E, Schuuring E, Snelson KM, Thunnissen E, Tops B, Williams G, van Krieken H, Hall JA, ASBL IQNP (2017) Integration of next-generation sequencing in clinical diagnostic molecular pathology laboratories for analysis of solid tumours; an expert opinion on behalf of IQN Path ASBL. Virchows Arch 470(1):5–20. https://doi.org/10.1007/s00428-016-2025-7
Tack V, Spans L, Schuuring E, Keppens C, Zwaenepoel K, Pauwels P, Van Houdt J, Dequeker EMC (2018) Describing the reportable range is important for reliable treatment decisions: A Multiple Laboratory Study for Molecular Tumor Profiling Using Next-Generation Sequencing. J Mol Diagn 20(6):743–753. https://doi.org/10.1016/j.jmoldx.2018.06.006
Li MM, Datto M, Duncavage EJ, Kulkarni S, Lindeman NI, Roy S, Tsimberidou AM, Vnencak-Jones CL, Wolff DJ, Younes A, Nikiforova MN (2017) Standards and guidelines for the interpretation and reporting of sequence variants in cancer: A Joint Consensus Recommendation of the Association for Molecular Pathology, American Society of Clinical Oncology, and College of American Pathologists. J Mol Diagn 19(1):4–23. https://doi.org/10.1016/j.jmoldx.2016.10.002
Du Z, Lovly CM (2018) Mechanisms of receptor tyrosine kinase activation in cancer. Mol Cancer 17(1):58. https://doi.org/10.1186/s12943-018-0782-4
Budczies J, Pfarr N, Stenzinger A, Treue D, Endris V, Ismaeel F, Bangemann N, Blohmer JU, Dietel M, Loibl S, Klauschen F, Weichert W, Denkert C (2016) Ioncopy: a novel method for calling copy number alterations in amplicon sequencing data including significance assessment. Oncotarget 7(11):13236–13247. https://doi.org/10.18632/oncotarget.7451
Hoogstraat M, Hinrichs JW, Besselink NJ, Radersma-van Loon JH, de Voijs CM, Peeters T, Nijman IJ, de Weger RA, Voest EE, Willems SM, Cuppen E, Koudijs MJ (2015) Simultaneous detection of clinically relevant mutations and amplifications for routine cancer pathology. J Mol Diagn 17(1):10–18. https://doi.org/10.1016/j.jmoldx.2014.09.004
Shen W, Szankasi P, Sederberg M, Schumacher J, Frizzell KA, Gee EP, Patel JL, South ST, Xu X, Kelley TW (2016) Concurrent detection of targeted copy number variants and mutations using a myeloid malignancy next generation sequencing panel allows comprehensive genetic analysis using a single testing strategy. Br J Haematol 173(1):49–58. https://doi.org/10.1111/bjh.13921
Singh RR, Patel KP, Routbort MJ, Aldape K, Lu X, Manekia J, Abraham R, Reddy NG, Barkoh BA, Veliyathu J, Medeiros LJ, Luthra R (2014) Clinical massively parallel next-generation sequencing analysis of 409 cancer-related genes for mutations and copy number variations in solid tumours. Br J Cancer 111(10):2014–2023. https://doi.org/10.1038/bjc.2014.518
Wikstrand CJ, Hale LP, Batra SK, Hill ML, Humphrey PA, Kurpad SN, McLendon RE, Moscatello D, Pegram CN, Reist CJ et al (1995) Monoclonal antibodies against EGFRvIII are tumor specific and react with breast and lung carcinomas and malignant gliomas. Cancer Res 55(14):3140–3148
Popova T, Manie E, Stoppa-Lyonnet D, Rigaill G, Barillot E, Stern MH (2009) Genome alteration print (GAP): a tool to visualize and mine complex cancer genomic profiles obtained by SNP arrays. Genome Biol 10(11):R128. https://doi.org/10.1186/gb-2009-10-11-r128
Smits AJ, Kummer JA, de Bruin PC, Bol M, van den Tweel JG, Seldenrijk KA, Willems SM, Offerhaus GJ, de Weger RA, van Diest PJ, Vink A (2014) The estimation of tumor cell percentage for molecular testing by pathologists is not accurate. Mod Pathol 27(2):168–174. https://doi.org/10.1038/modpathol.2013.134
Dufraing K, De Hertogh G, Tack V, Keppens C, Dequeker EMC, van Krieken JH (2018) External quality assessment identifies training needs to determine the neoplastic cell content for biomarker testing. J Mol Diagn 20(4):455–464. https://doi.org/10.1016/j.jmoldx.2018.03.003
Noonan SA, Berry L, Lu X, Gao D, Baron AE, Chesnut P, Sheren J, Aisner DL, Merrick D, Doebele RC, Varella-Garcia M, Camidge DR (2016) Identifying the appropriate FISH criteria for defining MET copy number-driven lung adenocarcinoma through oncogene overlap analysis. J Thorac Oncol 11(8):1293–1304. https://doi.org/10.1016/j.jtho.2016.04.033
Drilon A, Cappuzzo F, Ou SI, Camidge DR (2017) Targeting MET in lung cancer: will expectations finally be MET? J Thorac Oncol 12(1):15–26. https://doi.org/10.1016/j.jtho.2016.10.014
Bubendorf L, Dafni U, Schobel M, Finn SP, Tischler V, Sejda A, Marchetti A, Thunnissen E, Verbeken EK, Warth A, Sansano I, Cheney R, Speel EJM, Nonaka D, Monkhorst K, Hager H, Martorell M, Savic S, Kerr KM, Tan Q, Tsourti Z, Geiger TR, Kammler R, Schulze K, Das-Gupta A, Shames D, Peters S, Stahel RA, Lungscape C (2017) Prevalence and clinical association of MET gene overexpression and amplification in patients with NSCLC: results from the European Thoracic Oncology Platform (ETOP) Lungscape project. Lung Cancer 111:143–149. https://doi.org/10.1016/j.lungcan.2017.07.021
Wu YL, Zhang L, Kim DW, Liu X, Lee DH, Yang JC, Ahn MJ, Vansteenkiste JF, Su WC, Felip E, Chia V, Glaser S, Pultar P, Zhao S, Peng B, Akimov M, Tan DSW (2018) Phase Ib/II study of capmatinib (INC280) plus gefitinib after failure of epidermal growth factor receptor (EGFR) inhibitor therapy in patients with EGFR-mutated, MET factor-dysregulated non-small-cell lung cancer. J Clin Oncol 36(31):3101–3109. https://doi.org/10.1200/JCO.2018.77.7326
Porta R, Borea R, Coelho A, Khan S, Araujo A, Reclusa P, Franchina T, Van Der Steen N, Van Dam P, Ferri J, Sirera R, Naing A, Hong D, Rolfo C (2017) FGFR a promising druggable target in cancer: molecular biology and new drugs. Crit Rev Oncol Hematol 113:256–267. https://doi.org/10.1016/j.critrevonc.2017.02.018
We thank all technicians, clinical scientists, bioinformaticians, trainees and pathologists that have contributed to the implementation of NGS-based detection of copy number gains in the departments and/or to the development of these practice recommendations through comments and suggestions. In particular, we thank Rubayte Rahman, Daoud Sie, Els Verhoeven, and Guido Roemen. This work is part of the research programme Personalised Medicine which is financed by the Netherlands Organisation for Health Research and Development (ZonMw, project number 846001001).
All authors are involved in routine diagnostics in Dutch Pathology laboratories of 12 different centers and are registered as, or in training for, clinical scientist in molecular pathology by the Dutch Society of Pathology (NVVP). All authors are as such involved in the national Predictive Analysis for THerapy (PATH) project and have contributed to the discussions resulting in the recommendations described in this manuscript. AEij and MLig conceived and designed the manuscript set-up. AEij, BTop, ESch, and MLig wrote the initial manuscript; AvdBer, AvdBru, WDin, HDub, AtEls, WGeu, PGro, FGro, DHei, MHui, CHui, JJeu, LKem, EKor, LKro, WdLen, CvNoe, ESpe, MVog, and TWez edited and reviewed the manuscript. All authors gave final approval for publication. Author MLig takes full responsibility for the work as a whole, including the study design, access to data, and the decision to submit and publish the manuscript.
Conflict of interest
The authors declare that they have no conflict of interest.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This article is part of the Topical Collection on Quality in Pathology
Electronic supplementary material
An overview of the approaches for copy number analyses in routine gene panel NGS using for predictive testing of FFPE specimen work-up of nine hospital-based molecular pathology laboratories in the Netherlands (April 2018). (XLSX 12 kb)
About this article
Cite this article
Eijkelenboom, A., Tops, B.B.J., van den Berg, A. et al. Recommendations for the clinical interpretation and reporting of copy number gains using gene panel NGS analysis in routine diagnostics. Virchows Arch 474, 673–680 (2019). https://doi.org/10.1007/s00428-019-02555-3
- Copy number gain
- Targeted therapy
- Routine diagnostics
- Molecular pathology