Low-pass whole genome sequencing of circulating tumor cells to evaluate chromosomal instability in triple-negative breast cancer

Di Cosimo, Serena; Silvestri, Marco; De Marco, Cinzia; Calzoni, Alessia; De Santis, Maria Carmen; Carnevale, Maria Grazia; Reduzzi, Carolina; Cristofanilli, Massimo; Cappelletti, Vera

doi:10.1038/s41598-024-71378-3

Low-pass whole genome sequencing of circulating tumor cells to evaluate chromosomal instability in triple-negative breast cancer

Article
Open access
Published: 03 September 2024

Volume 14, article number 20479, (2024)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Low-pass whole genome sequencing of circulating tumor cells to evaluate chromosomal instability in triple-negative breast cancer

Download PDF

Serena Di Cosimo¹,
Marco Silvestri^1,2,
Cinzia De Marco¹,
Alessia Calzoni^2,3,
Maria Carmen De Santis^4,5,
Maria Grazia Carnevale^4,5,
Carolina Reduzzi⁶,
Massimo Cristofanilli⁶ &
…
Vera Cappelletti¹

114 Accesses
Explore all metrics

Abstract

Chromosomal Instability (CIN) is a common and evolving feature in breast cancer. Large-scale Transitions (LSTs), defined as chromosomal breakages leading to gains or losses of at least 10 Mb, have recently emerged as a metric of CIN due to their standardized definition across platforms. Herein, we report the feasibility of using low-pass Whole Genome Sequencing to assess LSTs, copy number alterations (CNAs) and their relationship in individual circulating tumor cells (CTCs) of triple-negative breast cancer (TNBC) patients. Initial assessment of LSTs in breast cancer cell lines consistently showed wide-ranging values (median 22, range 4–33, mean 21), indicating heterogeneous CIN. Subsequent analysis of CTCs revealed LST values (median 3, range 0–18, mean 5), particularly low during treatment, suggesting temporal changes in CIN levels. CNAs averaged 30 (range 5–49), with loss being predominant. As expected, CTCs with higher LSTs values exhibited increased CNAs. A CNA-based classifier of individual patient-derived CTCs, developed using machine learning, identified genes associated with both DNA proliferation and repair, such as RB1, MYC, and EXO1, as significant predictors of CIN. The model demonstrated a high predictive accuracy with an Area Under the Curve (AUC) of 0.89. Overall, these findings suggest that sequencing CTCs holds the potential to facilitate CIN evaluation and provide insights into its dynamic nature over time, with potential implications for monitoring TNBC progression through iterative assessments.

Copy number alterations analysis of primary tumor tissue and circulating tumor cells from patients with early-stage triple negative breast cancer

Article Open access 27 January 2022

Interrogating breast cancer heterogeneity using single and pooled circulating tumor cell analysis

Article Open access 05 July 2022

Binary classification of copy number alteration profiles in liquid biopsy with potential clinical impact in advanced NSCLC

Article Open access 09 August 2024

Introduction

Breast cancer is a global health issue with approximately two and a half million new cases diagnosed annually worldwide¹. Despite advances in screening, detection, and treatment, breast cancer remains the leading cause of cancer-related deaths among women¹. The triple-negative (TNBC) subtype has the worst prognosis, emphasizing the need for improved care for both localized and metastatic patients².

Chromosomal Instability (CIN) refers to the increased acquisition or loss of whole or fragmented chromosomes, and represents the most common form of genome instability in breast cancer³. Thus, improving our ability to assess CIN could offer promising insights into tumor progression and optimize patient care. Standard methods for evaluating CIN, such as DNA image cytometry and fluorescence in situ hybridization (FISH), are seldom used in the clinics due to their labor-intensive procedures and lack of high-throughput capabilities⁴. Alternative approaches including CIN70⁵ and HET70⁶ signatures, based on the expression of genes associated with aneuploidy and karyotype heterogeneity, or comparative genomic hybridization⁷ have also been utilized, showing that increased CIN is associated with metastatic potential and dismal prognosis^5,6,7. However, bulk analytical methods give a broad view of CIN without distinguishing between ongoing or past events that may not have continued. In addition, DNA image cytometry, FISH, and transcriptomic analysis face challenges in capturing the inherent cell-to-cell heterogeneity of CIN as they rely on pooled DNA samples⁴.

Single-cell sequencing (scDNAseq) is emerging as a promising approach to tackle the above listed challenges by providing accurate and quantitative CIN measures that are amenable to clinical use⁸. scDNAseq can provide insights into the underlying aberrant molecular pathways driving CIN, with DNA repair genes being prominent candidates⁸. Additionally, scDNAseq overcomes limitations and confounding factors associated with the use of bulk tissue, such as surrounding stromal tissue, tumor heterogeneity, and limited sample availability⁸. Importantly, scDNAseq can be applied to circulating tumor cells (CTCs), which are emerging as a significant resource for timely breast cancer molecular characterization⁹. Unlike invasive tumor tissue biopsy that is prone to sampling error, CTCs allow dynamic and repeatable assessment, representing the ideal source for longitudinal measuring of an evolving feature such as CIN¹⁰.

In this study, we leveraged our expertise in CTC genotyping by next-generation sequencing¹¹ to analyze CIN and underlying molecular alterations in TNBC patients. Specifically, we challenged low-pass Whole Genome Sequencing (lp-WGS) to determine the number of Large-Scale Transitions (LSTs) defined as contiguous regions of chromosomal breakage spanning at least 10 Mb¹². The LST metric was chosen for its frequent use as a biomarker of CIN^8,13. First, we tested the consistency of LST measurements using lp-WGS in a panel of breast cancer cell lines. Next, we extended our analyses to individual patient-derived CTCs collected at different clinical time-points, i.e., baseline, treatment, follow-up, and relapse. Finally, we developed a streamlined model for assessing CIN based on CTC copy number alterations (CNAs) within a specific set of genes.

Results

As part of technical feasibility, we initially evaluated LSTs as a means of CIN evaluation in breast cancer cell lines undergoing whole genome amplification and lp-WGS at the single-cell level. The analyses were conducted on MDA-MB-453, MDA-MB-361, BT474, BT549, and ZR-75 cell lines in replicates as reported in Table 1. We observed a wide range of LSTs (median 22, range 4–33), reflecting the heterogeneous nature of CIN both within individual cells and across different cell lines (Fig. 1a).

Table 1 Reproducibility of LST values determined by lp-WGS in breast cancer cell lines.

Full size table

Notably, LSTs values were significantly and reproducibly determined for the tested cell lines (Table 1).

We next analyzed clinical samples from 12 patients with histologically confirmed TNBC, successfully profiling (> 400,000 reads) a total of 35 CTCs collected at various time points throughout the disease trajectory (Table 2).

Table 2 Triple negative breast cancer patient and CTC characteristics.

Full size table

LSTs in CTCs showed heterogeneity (median 3, range 0–18), with values lower than those observed in cell lines, especially during treatment (median 2, range 0–13). Median LSTs in CTCs from patients with and without metastases were 2 and 3.5, respectively; 3 in germ-line BRCA mutation carriers. The distribution of LSTs values displayed a bimodal shape (Fig. 1b). However, its limited extent prevented definition of a clear threshold, prompting the use of the median number of LSTs to classify CTCs as either LST-low (number of LSTs < 3) or high (number of LSTs ≥ 3).

We next analyzed the CTC CNA profile. The mean number of CNAs per CTC was 30 (range 5–49), with deletions outnumbering amplifications at 401:291 (Supplementary Fig. 3). The most frequently lost or gained chromosomal regions and the corresponding genes are reported in Fig. 2.

Recurrent alterations involved 9p and 9q, containing ABL1, NOTCH1, and CDKN2A; 10, containing MAPK8 and GATA3; and 22q, containing BCR, as expected and consistently with literature on genes involved in TNBC oncogenesis¹⁴. We also analyzed CNA with respect to LSTs. Compared to CTCs classified as LST-low, those with higher values had a numerical increase in CNAs overall, median CNAs in CTCs with high and low LSTs 22 and 13, p = 0.08, and a prevalence of copy number losses, particularly in homologous recombination deficiency (HDR) related genes, with 59% (13/22) of CTCs classified as LST-high and 31% (4/13) of the LST-low showing RAD51, BLM, or WNR copy loss, p = 0.05. Oncogenic signaling pathways analysis showed that CTCs classified as LST-high were enriched for CNAs—either gains or losses—affecting NRF2, TP53, and TGF-beta signaling (Supplementary Fig. 1).

However, the question remained as to which factors most strongly influence LSTs. Therefore, we used a Random Forrest (RF) non parametric machine learning method to develop a CNA-based classifier of patient-derived CTCs with and without LSTs (Supplementary Fig. 2).

A total of 39 covariates were included in the model, consisting of CNAs of established HDR related¹⁵ and TNBC driver¹⁶ genes (Supplementary Table 1). RB1, MYC, and EXO1 emerged as the most relevant predictors of CIN among all covariates, with variable importance index (VIMP) indicating that the prediction error rate would increase by up to 30% if the CNAs of these genes were randomly permuted in the model (Fig. 3a).

Strikingly, the RF model yielded an AUC of 0.89 indicating that the analysis of CNAs in a few genes might be sufficient to achieve reliable classification of CIN (Fig. 3b).

Discussion

Chromosomal instability is increasingly recognized as a cancer hallmark, crucial in initiation, progression, and metastasis, with implications for optimizing care^3,17. However, its regular assessment is hindered by its dynamic nature and limitations in currently available tools⁴. Hence, there is a critical need to develop CIN biomarkers that are easily and reliably assessable to inform and guide clinical management, including in breast cancer patients. To the best of our knowledge, several studies have assessed the CNA of CTCs, but none have tackled CIN analysis ^18,19,20. In this study, we analyzed lp-WGS data to evaluate LSTs and CNAs in individual CTCs from women with TNBC, and to build a predictive classifier of CIN at the single-cell level achieving an AUC of 0.89. While our study is preliminary, we are the first to report a cost-effective sequencing assay such as lp-WGS for assessing LSTs in CTCs, the utilization of distinctive genetic features to evaluate complex phenomena, and ultimately, the development of a performing predictive model based on CNAs interactions. Additionally, we incorporated the assessment of CIN, a dynamic variable on CTCs, whose analysis can be repeated over time through a minimally invasive blood draw. These findings not only pave the way to a novel analytical approach for assessing CIN but also provide significant contributions to the field.

The distributions of LSTs values, both in breast cancer cell lines and individual CTCs, confirm the significant heterogeneity of CIN. This observation is consistent with existing literature, which suggests that the CIN underlying mechanisms leading to dysfunctional chromosome duplication and segregation can vary²¹. Interestingly, the LSTs values observed in CTCs, particularly those from recurrent patients, were not as elevated as expected. These findings align with prior research indicating low karyotypic variance during disease progression across various cancer types including the breast²². To reconcile this observation with the well-documented prevalence of CIN in cancer, the theory of the CIN paradox posits that tumors typically exhibit intermediate levels of CIN as excessively high levels are detrimental, while insufficient levels do not guarantee an advantage in terms of proliferation and survival²³. In addition, the low LST values observed in recurrent breast cancer patients may be influenced by the number of CTCs analyzed potentially affecting the prevalence of CIN. This raises the question of deriving individuals' features from their single-cell data. To the best of our knowledge, few previous work estimated the required sample size, i.e., the number of cells to profile, to infer CIN from scDNAseq data²⁴. Regarding CTCs, while some have suggested diagnosing cancer with CIN based on the presence of only one²⁵ to at least 3 unstable CTCs²⁶, it is uncertain if this also applies to breast cancer. Therefore, further research is needed.

Several studies have characterized CNAs in TNBC tissue using high-resolution genomic data¹⁶. Consistent with these findings, CTC CNAs more frequently showed deletions than amplifications. Despite potential limitations of lp-WGS compared to higher resolution next-generation sequencing, we report that CTC chromosomal gains and losses occurred in regions where breast cancer-related genes are generally found, supporting that our findings were unlikely to be due to random sequencing dropout or due to amplification bias. For instance, CDKN2A and NOTCH1 were identified in loss regions^14,16. It is also not surprising that CTCs with high LSTs were more frequently characterized by the loss of HDR related genes. However, whether this is the cause of LSTs or if, conversely, the loss of these genes is the consequence, we cannot ascertain. The fact remains that DNA repair genes alone do not fully explain CTC CIN. As already reported for tumor tissue, other factors such as mitotic errors, replication stress, telomere crisis, and breakage fusion bridge cycles²¹, among others, may also be at play. Therefore, we hypothesized that the simultaneous analysis of copy number changes in a set of selected genes could help define CTCs with and without LSTs. To this end, we utilized, for the first time in this context, the RF learning model which allowed us to examine the impact of different potential predictors in creating a predictive model²⁷. Our findings indicate that RB1, EXO1, and MYC are the most significant predictors among all covariates for identifying LSTs, with a variable importance index exceeding 30%. These results align with preclinical evidence suggesting that the loss of G1/S control resulting from RB1 pathway inactivation, coupled with MYC-induced mitogen addition and DNA damage, leads to chromatid breaks and chromatid cohesion defects in mitotic cells²⁸. These aberrations ultimately contribute to aneuploidy in the offspring cell population. Furthermore, LSTs represent a subset of chromosomal rearrangements, particularly evident when double-strand breaks are repaired through non-homologous end joining, as observed in BRCA-deficient environments¹². Aligned with this, alterations of BRCA1 and BRCA2 demonstrated substantial predictive value within the developed classifier.

This study and its methods have several strengths, as the classifier presented here represents a resource for a deeper understanding of the origins and diversity of CIN. Our results focus attention on a narrow group of genes involved in fundamental cellular processes for maintaining genomic integrity. Additionally, our results support the broader application of CIN measures in clinical diagnostics, as sequencing techniques, which have been rarely used due to technical difficulties, are becoming more widespread and affordable every day. Finally, this work focuses on targets that may lead to potentially applicable therapies, beyond those traditionally suggested based on platinum²¹ and taxane²⁹ for the most unstable tumors.

Despite these strengths, this study and the methods used also have weakness that should be noted. First, the number of LSTs is only one functional measure of CIN, and other measures exist, including telomere allele imbalance and loss of heterozygosis. Second, data on the single-cell nature of copy number or LST burden in single tumor cells in a large cohort are lacking, and technical limitations require that the data generated to date be interpreted with caution. Finally, RF cannot produce hypothesis testing results, such as relative risks, odds ratios, or p-values, as in classical regression methods, and its use is for model exploration. Hence, the data presented herein merit confirmation.

In conclusion, our study demonstrates the feasibility of low-resolution lp-WGS for assessing both LSTs and CNAs in TNBC CTCs at a single-cell level. As a proof-of-concept study, we developed a classifier of LSTs based on CNAs of genes involved both in HDR and replication process. Future research with larger sample sizes will be necessary to evaluate the clinical application of this assay, which lays the groundwork for leveraging CIN in precision oncology efforts.

Materials and methods

Sample processing

For spiking experiments, five cell lines broadly representative of breast cancer, expressing (+) or lacking (−) the estrogen receptor (ER), and showing Human Epidermal Growth Factor Receptor 2 amplified (HER2+) or normal (HER2−) status were purchased from the American Type Culture Collection (ATCC, Manassas, VA, USA). ZR75-1 (ER+/HER2−), MDA-MB-453 (ER−/HER2+), MDA-MB-361 (ER+/HER2+), and BT-549 (ER−/HER2−) were cultured in DMEM/F-12 (Lonza, Swizerland) medium supplemented with 10% fetal bovine serum, BT474 (ER+/HER2+) in Dulbecco’s Modified Eagle’s Medium (DMEM) (Sigma, Darmstadt, Germany). All culture media were supplemented with antibiotic–antimycotic Solution (100 ×) (Sigma, Darmstadt, Germany), 10% fetal bovine serum (FBS) (Sigma, Darmstadt, Germany) and L-glutamine (2 mM) (Invitrogen GmbH, USA), and tested negative for mycoplasma contamination. Single cells were manually captured under an inverted microscope using a p10 micropipette and directly spiked into healthy donor blood. Spiked-in samples were processed following the same protocols used for clinical samples.

Peripheral blood was collected from study patients in K2EDTA tubes (10 ml) and processed within 1 h of draw using the Parsortix platform (Angle plc, Guildford, UK) for size-based enrichment. Following enrichment, cells were harvested according to manufacturer’s instructions and fixed with 2% paraformaldehyde for 20 min at room temperature.

Cell isolation, amplification and sequencing

Enriched patient samples were processed using the DEPArray system (Menarini Silicon Biosystems, Bologna, IT)¹¹. Individual cells were sorted based on morphological characteristics, DNA content, and fluorescence labeling against epithelial (CK, EpCAM, EGFR) and leukocyte (CD45, CD14, CD16) markers, as previously reported¹¹. Subsequently, white blood cells expressing only leukocyte markers and single CTCs expressing either only epithelial markers or lacking any marker were recovered for downstream molecular analyses. WGA was performed on single cells using the Ampli1™ WGA kit version 02 (Menarini Silicon Biosystems, Bologna, IT) as per manufacturer instructions. For single cells derived from blood (CTCs and WBC), the quality of the WGA product was determined using the Ampli1™ QC Kit (Menarini Silicon Biosystems, Bologna, IT). A genomic integrity index (GII) was allocated for each sample scored from 0 to 4. Only single cells with sufficiently good quality DNA as determined by a GII ≥ 2 were selected for downstream analysis.

Low-pass whole genome sequencing and bioinformatics

Ampli1™ low-pass kit for Illumina (Menarini Silicon Biosystems, Bologna, IT) was used for preparing low-pass Whole Genome Sequencing (lpWGS) libraries from single cells. Forhigh-throughput processing, the manufacturer procedure was implemented in a fully automated workflow on Ion Torrent Ion S5-system (ThermoFisher, Waltham, MA, USA). Ampli1™ low-pass libraries were normalized and sequenced by Ion530 chip. The obtained FASTQ files were quality checked and aligned to the hg19 human reference sequence using tmap aligner tool on Torrent_Suite 5.10.0. and alignment (BAM) files were generated. All samples with < 400.000 reads were excluded from the analyses.

BAM files underwent quality filtering using qualimap³⁰ and were processed using two separate pipelines for CIN and CNAs. Each chromosomal break between contiguous regions of at least 10 Mb was tabulated to calculate the number of large-scale transitions (LSTs) per CTC genome. Copy number alterations were identified using QDNAseq software (version 11.0) according to the following settings: minMapq = 37, window = 500 kb. “Gain” and “loss” calls were filtered out by residual (> 4 standard deviation, SD) and black list regions reported in ENCODE database. Segmented copy number data of each sample were extracted starting from log2Ratio value. For the purpose of CNA profile, chromosome 19 was not considered due to its biased deletion associated with the high CG base percentage. Samples were classified as aberrant if they exhibited either ≥ 1 genomic regions with amplification/deletion greater than 12.5 Mb, or if the cumulative amplification/deletion of different genomic regions exceeded 37.5 Mb. OncoKb database was interrogated to evaluate biological and clinical relevant CNAs in CTCs (access date: March 2024).

Statistics

Biological analyses relied on canonical oncogenic signaling pathways, as previously defined³¹ and processed using custom functions from the maftools R package³², alongside Gene Ontology (GO) biological process terms and KEGG pathways via the ClusterProfiler Bioconductor package. CIN predictor was developed using the SMOTE method³³ to address sample imbalance between presence and absence of LSTs. Classification was performed using the random forest algorithm on 39 genes³⁴ with bootstrap re-sampling used to estimate standard errors and confidence intervals. The discriminatory capability of the CIN classifier was assessed using ROC curves and expressed by AUC values. Analyses of association were conducted using t-test for continuous variables, and Fisher test for categorical variables. All analyses were performed using R software (www.R-project.org), statistical significance was set at a p-value < 0.05.

Conference presentation

These results have been presented in part at the Molecular Analysis for Precision Oncology (MAP) Congress, Amsterdam, Netherlands, Oct 14–16, 2022.

Data availability

Raw sequencing data are available from the corresponding author upon request.

References

Arnold, M. et al. Current and future burden of breast cancer: Global statistics for 2020 and 2040. Breast 66, 15–23 (2022).
Article PubMed PubMed Central Google Scholar
Howard, F. M. & Olopade, O. I. Epidemiology of triple-negative breast cancer: A review. Cancer J. 27, 8–16 (2021).
Article CAS PubMed Google Scholar
Hanahan, D. & Weinberg, R. A. Hallmarks of cancer: New dimensions. Cancer Discov. 12, 31–46 (2022).
Article CAS PubMed Google Scholar
Lynch, A. R. et al. A survey of chromosomal instability measures across mechanistic models. Proc. Natl. Acad. Sci. USA 121, e2309621121 (2024).
Article CAS PubMed Google Scholar
Carter, S. L., Eklund, A. C., Kohane, I. S., Harris, L. N. & Szallasi, Z. A signature of chromosomal instability inferred from gene expression profiles predicts clinical outcome in multiple human cancers. Nat. Genet. 38, 1043–1048 (2006).
Article CAS PubMed Google Scholar
Sheltzer, J. M. A transcriptional and metabolic signature of primary aneuploidy is present in chromosomically unstable cancer cells and informs clinical prognosis. Cancer Res. 73, 6401–6412 (2013).
Article CAS PubMed PubMed Central Google Scholar
Climent, J., Garcia, J. L., Mao, J. H., Arsuaga, J. & Perez-Losada, J. Characterization of breast cancer by array comparative genomic hybridization. Biochem. Cell. Biol. 85, 497–508 (2007).
Article CAS PubMed Google Scholar
Greene, S. B. et al. Chromosomal instability estimation based on next generation sequencing and single cell genome wide copy number variation analysis. PLoS One 11, e0165089 (2016).
Article PubMed PubMed Central Google Scholar
Alix-Panabières, C. & Pantel, K. Challenges in circulating tumour cell research. Nat. Rev. Cancer 14, 623–631 (2014).
Article PubMed Google Scholar
Hiley, C. et al. Deciphering intratumor heterogeneity and temporal acquisition of driver events to refine precision medicine. Genome Biol. 15, 453 (2014).
Article PubMed PubMed Central Google Scholar
Silvestri, M. et al. Copy number alterations analysis of primary tumor tissue and circulating tumor cells from patients with early-stage triple negative breast cancer. Sci. Rep. 12, 1470 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Popova, T. et al. Ploidy and large-scale genomic instability consistently identify basal-like carcinomas with BRCA1/2 inactivation. Cancer Res. 72, 5454–5462 (2012).
Article CAS PubMed Google Scholar
Schonhoft, J. D. et al. Morphology-predicted large-scale transition number in circulating tumor cells identifies a chromosomal instability biomarker associated with poor outcome in castration-resistant prostate cancer. Cancer Res. 80, 4892–4903 (2020).
Article CAS PubMed PubMed Central Google Scholar
Li, Z. et al. Comprehensive identification and characterization of somatic copy number alterations in triple-negative breast cancer. Int. J. Oncol. 56, 522–530 (2020).
CAS PubMed Google Scholar
Matis, T. S. et al. Current gene panel s account for nearly all homologous recombination repair-associated multiple-case breast cancer families. NPJ Breast Cancer 7, 109 (2021).
Article CAS PubMed PubMed Central Google Scholar
Bareche, Y. et al. Unravelling triple-negative breast cancer molecular heterogeneity using an integrative multiomic analysis. Ann. Oncol. 29, 895–902 (2018).
Article CAS PubMed PubMed Central Google Scholar
Eccleston, A. Targeting cancers with chromosome instability. Nat. Rev. Drug. Discov. 21, 556 (2022).
Article CAS PubMed Google Scholar
Rossi, T. et al. Single-cell NGS-based analysis of copy number alterations reveals new insights in circulating tumor cells persistence in early-stage breast cancer. Cancers 12(9), 2490. https://doi.org/10.3390/CANCERS12092490 (2020).
Article CAS PubMed PubMed Central Google Scholar
Rothé, F. et al. Interrogating breast cancer heterogeneity using single and pooled circulating tumor cell analysis. NPJ Breast Cancer 8(1), 1–8. https://doi.org/10.1038/s41523-022-00445-7 (2022).
Article CAS Google Scholar
Fernandez-Garcia, D. et al. Shallow WGS of individual CTCs identifies actionable targets for informing treatment decisions in metastatic breast cancer. Br. J. Cancer 127(10), 1858–1864. https://doi.org/10.1038/s41416-022-01962-9 (2022).
Article CAS PubMed PubMed Central Google Scholar
Drews, R. M. et al. A pan-cancer compendium of chromosomal instability. Nature 606, 976–983 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Gao, R. et al. Punctuated copy number evolution and clonal stasis in triple-negative breast cancer. Nat. Genet. 48, 1119–1130 (2016).
Article CAS PubMed PubMed Central Google Scholar
Birkbak, N. J. et al. Paradoxical relationship between chromosomal instability and survival outcome in cancer. Cancer Res. 71, 3447–3452 (2011).
Article CAS PubMed PubMed Central Google Scholar
Lynch, A. R., Arp, N. L., Zhou, A. S., Weaver, B. A. & Burkard, M. E. Quantifying chromosomal instability from intratumoral karyotype diversity using agent-based modeling and Bayesan inference. eLife 11, e69799 (2022).
Article CAS PubMed PubMed Central Google Scholar
Malihi, P. D. et al. Single-cell circulating tumor cell analysis reveals genomic instability as a distinctive feature of aggressive prostate cancer. Clin. Cancer Res. 26, 4143–4153 (2020).
Article CAS PubMed PubMed Central Google Scholar
Xu, Y. et al. Detection of circulating tumor cells using negative enrichment immunofluorescence and an in situ hybridization system in pancreatic cancer. Int. J. Mol. Sci. 18, 622 (2017).
Article PubMed PubMed Central Google Scholar
Breiman, L. Random forests. Mach. Learn. 45, 5–32 (2001).
Article Google Scholar
van Harn, T. et al. Loss of Rb proteins causes genomic instability in the absence of mitogenic signaling. Genes Dev. 24, 1377–1388 (2010).
Article PubMed PubMed Central Google Scholar
Scribano, C. M. et al. Chromosomal instability sensitizes patient breast tumors to multipolar divisions induced by paclitaxel. Sci. Transl. Med. 13, 610 (2021).
Article Google Scholar
Okonechnikov, K., Conesa, A. & García-Alcalde, F. Qualimap 2: advanced multi-sample quality control for high-throughput sequencing data. Bioinformatics 32, 292–294 (2016).
Article CAS PubMed Google Scholar
Sanchez-Vega, F. et al. Oncogenic signaling pathways in the cancer genome atlas. Cell 173, 321–337 (2018).
Article CAS PubMed PubMed Central Google Scholar
Mayakonda, A., Lin, D. C., Assenov, Y., Plass, C. & Koeffler, H. P. Maftools: Efficient and comprehensive analysis of somatic variants in cancer. Genome Res. 28, 1747–1756 (2018).
Article CAS PubMed PubMed Central Google Scholar
Chawla, N. V., Bowyer, K. W., Hall, L. O. & Kegelmeyer, W. P. SMOTE: synthetic minority over-sampling technique. J. Artif. Intell. Res. 16, 321–357 (2002).
Article Google Scholar
Ishwaran, H., Kogalur, U. B., Blackstone, E. H. & Lauer, S. Random survival forests. Ann. Appl. Stat. 2, 841–860 (2008).
Article MathSciNet Google Scholar

Download references

Acknowledgements

We acknowledge the skilful technical support by Patrizia Miodini and Rosita Motta for CTC enrichment.

Author information

Authors and Affiliations

Department of Advanced Diagnostics, Fondazione IRCCS Istituto Nazionale Dei Tumori Di Milano, Via Venezian 1, 20100, Milan, Italy
Serena Di Cosimo, Marco Silvestri, Cinzia De Marco & Vera Cappelletti
Isinnova S.R.L, Brescia, Italy
Marco Silvestri & Alessia Calzoni
Department of Information Engineering, University of Brescia, Brescia, Italy
Alessia Calzoni
Department of Radiation Oncology, Fondazione IRCCS Istituto Nazionale Dei Tumori Di Milano, Milan, Italy
Maria Carmen De Santis & Maria Grazia Carnevale
Breast Unit, Fondazione IRCCS Istituto Nazionale Dei Tumori Di Milano, Milan, Italy
Maria Carmen De Santis & Maria Grazia Carnevale
Division of Hematology-Oncology, Weill Cornell Medicine, New York, NY, USA
Carolina Reduzzi & Massimo Cristofanilli

Authors

Serena Di Cosimo
View author publications
You can also search for this author in PubMed Google Scholar
Marco Silvestri
View author publications
You can also search for this author in PubMed Google Scholar
Cinzia De Marco
View author publications
You can also search for this author in PubMed Google Scholar
Alessia Calzoni
View author publications
You can also search for this author in PubMed Google Scholar
Maria Carmen De Santis
View author publications
You can also search for this author in PubMed Google Scholar
Maria Grazia Carnevale
View author publications
You can also search for this author in PubMed Google Scholar
Carolina Reduzzi
View author publications
You can also search for this author in PubMed Google Scholar
Massimo Cristofanilli
View author publications
You can also search for this author in PubMed Google Scholar
Vera Cappelletti
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceptualization: S.D.C., M.S., V.C.; Sample collection and processing: M.C.D.S., M.G.C., C.D.M., C.R.; Data curation and analysis: M.S., V.C., C.R., A.C., S.D.C.; Writing: S.D.C.; V.C., Supervision: S.D.C., V.C., M.C. All authors have read and agreed to the published version of the manuscript.

Corresponding author

Correspondence to Marco Silvestri.

Ethics declarations

Competing interests

The authors declare no competing interests.

Ethical approval

The study was conducted according to the guidelines of the Declaration of Helsinki, and approved by the Ethics Committee of Fondazione IRCCS Istituto Nazionale dei Tumori di Milano (INT 196/14).

Informed consent

Informed consent was obtained from all study participants.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information 1.

Supplementary Information 2.

Supplementary Information 3.

Supplementary Information 4.

Supplementary Information 5.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Di Cosimo, S., Silvestri, M., De Marco, C. et al. Low-pass whole genome sequencing of circulating tumor cells to evaluate chromosomal instability in triple-negative breast cancer. Sci Rep 14, 20479 (2024). https://doi.org/10.1038/s41598-024-71378-3

Download citation

Received: 24 May 2024
Accepted: 27 August 2024
Published: 03 September 2024
DOI: https://doi.org/10.1038/s41598-024-71378-3
Springer Nature Limited

Low-pass whole genome sequencing of circulating tumor cells to evaluate chromosomal instability in triple-negative breast cancer

Abstract

Similar content being viewed by others

Copy number alterations analysis of primary tumor tissue and circulating tumor cells from patients with early-stage triple negative breast cancer

Interrogating breast cancer heterogeneity using single and pooled circulating tumor cell analysis

Binary classification of copy number alteration profiles in liquid biopsy with potential clinical impact in advanced NSCLC

Introduction

Results

Discussion

Materials and methods

Sample processing

Cell isolation, amplification and sequencing

Low-pass whole genome sequencing and bioinformatics

Statistics

Conference presentation

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Ethical approval

Informed consent

Additional information

Publisher's note

Supplementary Information

Supplementary Information 1.

Supplementary Information 2.

Supplementary Information 3.

Supplementary Information 4.

Supplementary Information 5.

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Low-pass whole genome sequencing of circulating tumor cells to evaluate chromosomal instability in triple-negative breast cancer

Abstract

Similar content being viewed by others

Introduction

Results

Discussion

Materials and methods

Sample processing

Cell isolation, amplification and sequencing

Low-pass whole genome sequencing and bioinformatics

Statistics

Conference presentation

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Ethical approval

Informed consent

Additional information

Publisher's note

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation