The detection and implication of genome instability in cancer
- 4.1k Downloads
Genomic instability is a hallmark of cancer that leads to an increase in genetic alterations, thus enabling the acquisition of additional capabilities required for tumorigenesis and progression. Substantial heterogeneity in the amount and type of instability (nucleotide, microsatellite, or chromosomal) exists both within and between cancer types, with epithelial tumors typically displaying a greater degree of instability than hematological cancers. While high-throughput sequencing studies offer a comprehensive record of the genetic alterations within a tumor, detecting the rate of instability or cell-to-cell viability using this and most other available methods remains a challenge. Here, we discuss the different levels of genomic instability occurring in human cancers and touch on the current methods and limitations of detecting instability. We have applied one such approach to the surveying of public tumor data to provide a cursory view of genome instability across numerous tumor types.
KeywordsGenomic instability Cancer CIN MSI Nucleotide instability
Cancer is a disease characterized and fuelled by dynamic genomic changes. The vast number of structural abnormalities present in cancer genomes is largely attributed to genomic instability, a transient or persistent state that increases the spontaneous mutation rate, leading to gross genetic alterations such as rearrangements and changes in chromosome number (aneuploidy). Genomic instability is therefore a driving force of tumorigenesis in that continuous modification of tumor cell genomes promotes the acquisition of further DNA alterations, clonal evolution, and tumor heterogeneity . It is a feature of almost all cancers and has been observed in a range of malignant stages, from pre-neoplastic lesions prior to acquired TP53 mutations to advanced cases [2, 3, 4]. Numerous theories regarding the source of genome instability have been proposed. These theories, which include the mutator phenotype, DNA damage-induced replication stress, telomere dysfunction, and mitotic checkpoint failure [5, 6, 7, 8, 9, 10, 11], vary principally in their supposition of how early in tumorigenesis instability occurs, mechanisms leading to sequence level alteration, and whether instability initiates tumorigenesis or is merely a consequence of malignant transformation. While these mechanisms may all contribute to instability phenotypes to some extent in cancer in general, their prevalence varies across tumors derived from distinct cell types or in response to different carcinogens or selective pressures.
Genomic instability refers to a variety of DNA alterations, encompassing single nucleotide to whole chromosome changes, and is typically subdivided into three categories based on the level of genetic disruption. Nucleotide instability (NIN) is characterized by an increased frequency of base substitutions, deletions, and insertions of one or a few nucleotides; microsatellite instability (MIN or MSI) is the result of defects in mismatch repair genes which leads to the expansion and contraction of short nucleotide repeats called microsatellites; chromosomal instability (CIN) is the most prevalent form of genomic instability and leads to changes in both chromosome number and structure . While instability is a characteristic of almost all human cancers, cancer genomes vary considerably in both the amount and type of genomic instability they harbor. Importantly, the instability phenotype has implications in patient prognosis as well as patient management, specifically with the choice of therapeutic agents [13, 14, 15].
Currently, detection of genome instability can be achieved using a variety of technologies, ranging from single-cell approaches to high-throughput multicellular techniques, each capable of detecting different levels of genomic changes. However, at present, no assay is capable of reliably measuring the rate (cell-to-cell variability) of small chromosomal changes such as deletions, amplification, and inversions within a population of cells. There is therefore a great need for sensitive, high-resolution techniques capable of detecting genomic instability over time as this would afford critical insights into the mechanisms that underlie genomic instability and the role of instability in tumorigenesis. In this review, we discuss the different levels of genomic instability and various methods of and limitations to detecting instability and describe global trends in genome instability across numerous tumor types.
2 Levels of genomic instability
2.1 Nucleotide instability
2.2 Microsatellite instability
Microsatellites are repetitive DNA sequences comprising 1–6 bp located throughout the genome [20, 21, 22]. Within the population, microsatellite size is highly variable; however, each individual possesses unique microsatellites of a set length. MSI results from defects in DNA mismatch repair (MMR), specifically alterations of the MLH1, MSH2, MSH6, and PMS2 genes, which causes deletions or random insertion and expansion of microsatellites and a hypermutable phenotype (Fig. 1b). MSI is a characteristic feature of a number of cancers, including gastric, endometrial, ovarian, lung, and colorectal cancer (CRC), where it was first described and has been studied most extensively [23, 24, 25, 26, 27, 28]. MSI occurs in approximately 15 % of CRC, which typically arise in the proximal colon, posses a normal karyotype, and are associated with a better prognosis than non-MSI tumors.
MSI occurs in both hereditary (Lynch syndrome) and sporadic forms of colon cancer, although via distinct mechanisms . Hereditary non-polyposis colorectal cancer (Lynch syndrome) is characterized by inactivating germline mutations to MSH2, MSH6, PMS2, or MLH1, whereas sporadic CRC with MSI is associated with hypermethylation and loss of expression of MLH1 [27, 28, 29, 30, 31, 32]. The majority of sporadic CRC with MSI arise in a background of extensive aberrant promoter methylation—referred to as the CpG island methylator phenotype (CIMP) [33, 34, 35]. CIMP tumors develop and progress by methylating the promoters of tumor suppressor genes such as p16, IGF-2, and MLH1 and possess clinical features distinct from non-CIMP tumors [33, 36, 37, 38].
2.3 Chromosomal instability
Despite the prominence and fundamental importance of CIN to cancer biology, the molecular mechanisms underlying CIN in sporadic cancers remain poorly understood. This is due primarily to the fact that disruption of countless genes can give rise to CIN, including, but not limited to, those involved in chromosome condensation and segregation (STAG2) , telomere dysfunction (TRF1 and Tankyrase) , as well as DNA damage (ATM) [44, 45] and spindle checkpoint genes (BUB1, Mad2) [46, 47, 48], highlighting the heterogeneous nature of CIN in sporadic cancers. Attempts to explain the presence and molecular basis of CIN in sporadic cancers have led to the development of three prevailing theories: the mutator hypothesis, the oncogene-induced DNA damage model, and instability due to telomere erosion, which are reviewed in [5, 10, 49].
The advent of sequencing technologies has led to the recent discovery of an intriguing form of genome chaos and CIN, whereby only one or a few distinct chromosomes in a cancer cell are characterized by the presence of upwards of hundreds of complex genomic rearrangements . These distinct chromosomal rearrangements were proposed by Stephens et al. to have developed through chromosome shattering (“thripsis” in Greek) or incomplete fragmentation and the inaccurate stitching together of chromosomes in a single stochastic event in a process termed “chromothripsis,” an event in contrast to the widely accepted notion of gradual accumulation of cancer genome rearrangements. Chromothripsis has been proposed to occur in ∼2–3 % of a wide spectrum of cancers (with a higher incidence in bone cancers), where chromosome-specific massive rearrangements have been described [11, 50]. The mechanisms underlying chromothripsis, and its clinical implications, have been recently reviewed by Forment et al. .
2.4 Interplay between instability types
While all levels of instability can co-occur within the same cell, and work in concert to disrupt a single gene, protein complex, or pathway, in colorectal and endometrial cancers, an inverse relationship between CIN and MIN has been observed [52, 53]. Although both types of instability appear to occur early in tumor development and increase with tumor progression, cancers with an MMR deficiency tend to be diploid and exhibit normal rates of gross chromosomal changes, whereas MMR-proficient tumors are typically aneuploid and display increased rates of chromosomal alterations . Moreover, the fusion of MIN and CIN cells results in CIN, but not MIN, suggesting that CIN is a dominant phenotype that may result from gain-of-function alterations rather than gene inactivation [3, 46].
3 Methods for the detection and analysis of genome instability
Currently available methods of detecting genome instability
Rate and state
Whole and segmental CIN, aneuploidy
Whole and segmental CIN, translocations, insertions, deletions, and mutations
Whole and segmental CIN
Whole and segmental CIN, SNP, UPD, LOH
Whole and segmental CIN, translocations, insertions, deletions, and mutations
MSI, mitochondrial instability
3.1 Single-cell approaches
Karyotyping is the visualization of a cell’s entire complement of chromosomes, or karyotype. Assessment of a cell karyotype enables the identification of abnormalities in chromosome number (aneuploidy) and large structural rearrangements like inversions and translocations [55, 56]. Traditionally, metaphase chromosomes are stained with a DNA-binding dye, such as Giemsa stain, which is taken up readily by gene-poor A,T-rich genomic regions and results in a chromosome-specific banding pattern that can be used to differentiate chromosomes and identify abnormalities. The use of multicolored fluorescence in situ hybridization (FISH) probes has greatly facilitated the assessment of CIN and is referred to as spectral karyotyping (SKY). The SKY technique results in coloring, or painting, of each chromosome with a different colored fluorophore, readily enabling the identification of chromosomes and rearrangements [55, 57]. Although excellent for detecting global CIN changes, even the most advanced FISH strategies cannot accurately measure somatic mutations throughout the genome. While karyotyping is one of the few techniques available that enable the identification of alterations within a single cell, and the only one capable of profiling both clonal and non-clonal chromosomal alterations , like most other methods, it offers only a static picture of the state of chromosomal alterations with no information regarding the extent of variability between cells. Furthermore, it is labor-intensive and metaphase spreads from even short-term cultures can acquire culturing artifacts that induce additional genomic changes. Despite these limitations, karyotyping remains the most reliable method to detect non-clonal chromosomal aberrations and assess genomic variability among cells.
Advances in next-generation sequencing and whole-genome amplification technologies have enabled the advent of single-cell sequencing, which offers promising insight into understanding genomic instability as it provides not only a comprehensive look at the state of genomic alterations of a tumor cell but also cell-to-cell heterogeneity. Because single-cell sequencing relies on gene amplification, sequence bias and adequate genome coverage remain major challenges. However, new amplification methodologies such as multiple annealing and looping-based amplification cycles, which enable over 90 % genome coverage and can accurately detect mutations and copy number variations , are in development and have the potential to greatly improve single-cell sequencing. Although many obstacles remain before single-cell sequencing can be routinely implemented as a standard procedure for detecting genome instability, it has the ability to provide an unprecedented view of genomic instability.
3.2 Multicellular approaches
Flow cytometry, which measures cells in suspension as they pass through a laser, scatter light, and emit fluorescence, can be used to approximate cellular aneuploidy. This strategy estimates cell ploidy based on DNA content (which correlates to the intensity of fluorescence) and the stage of cells in the cell cycle. Comparison of the estimated ploidy in the G0/G1 fraction of malignant and normal cells allows a gross estimate of genome instability in cancer cells [60, 61]. While flow cytometry is extremely accurate in its ability to estimate ploidy, it provides no information regarding NIN, MSI, or the segmental or whole-chromosome aberration components of CIN.
Array comparative genomic hybridization (aCGH) offers the ability to quantitatively detect and visualize whole and segmental chromosomal alterations such as gains, losses, amplifications, and LOH [62, 63]. Briefly, reference genomic DNA and test DNA are differentially labeled, pooled, and hybridized onto arrays comprising BAC, cDNA, or oligonucleotides, and imbalances are visualized as differences in fluorescence intensity. The advent of SNP arrays offered improved resolution, enabling more precise mapping of copy number alterations and the detection of uniparental disomy (copy neutral loss of heterozygosity) as well as the ability to distinguish alleles at specific polymorphic sites [64, 65, 66]. However, neither aCGH nor SNP arrays are able to detect translocations, inversions, or somatic mutations.
PCR is the gold standard for detecting MSI. PCR is used to amplify known microsatellite regions, and the lengths of the short tandem repeats (PCR products) are compared in tumor and normal DNA to determine the state of MSI [37, 67, 68]. This approach is therefore limited to assessing MSI. PCR is also used routinely for the analysis of mitochondrial instability. The ability to isolate mtDNA from total DNA using mitochondrial-specific primers rather than through centrifugation not only reduced tissue requirements but also enabled the use of archival paraffin-embedded tissues, greatly expanding the number samples available for analysis . Commonly used markers of mitochondrial genome instability detected by PCR and followed by direct sequencing include point mutations, insertions, deletions, and length changes in homopolymeric or dimeric nucleotide tracts. Competitive PCR, in which a competitor DNA fragment is added to the DNA sample, can be used to determine mitochondrial DNA copy number by determining the ratio between the intensities of the control and the sample PCR product band .
Sequencing studies have provided massive amounts of data on cancer genomes, revealing great diversity in the mutation frequency across tumor types and identifying novel rearrangements in epithelial cancers. As data from sequencing studies continue to emerge in the public domain, a large-scale pan-cancer comparison of genomic instability in different cancers will be feasible. Such an analysis may shed more light on the mechanistic differences of cancer development in different tissues, which itself will improve our understanding of cancer biology and our ability to develop rationally designed therapies. The interpretation of whole-genome sequencing data in the context of heterogeneous tumors, however, remains a considerable challenge to the application of such data to patient care.
The fundamental limitation of these multicellular approaches is that they provide only a snapshot of the state of alterations in a tumor sample and are incapable of defining the rate of chromosomal changes within a tumor—two features that define genomic instability. While single-cell approaches such as karyotyping or single-cell array-CGH allow for unbiased comparisons of variability in chromosomal alterations between cells, they are not amenable to automation and are therefore time-consuming and labor-intensive. Collection of repeated tumor biopsy samples and advances in single-cell profiling technologies will help generate more accurate metrics of genomic instability.
4 Pan-cancer trends in CIN
It is well established that vast genome instability exists at different levels and to different extents in various tumor types. In the last decade, several large-scale sequencing studies have been undertaken in an attempt to characterize recurrent alterations in cancer genomes [74, 75, 76, 77, 78, 79]. While thousands of mutations have been identified, these studies have shown that very few genes are recurrently mutated, deleted, or amplified at high frequencies within a tumor type. Of the handful of recurrently altered genes, TP53 is the most frequently altered gene in all tumor types, while the others (CDKN2A, PTEN, EGFR, and RAS) have roles in regulating growth and encode classical tumor suppressors and oncogenes [74, 76, 80, 81].
In general, epithelial tumors are thought to be more genomically unstable than hematologic and mesenchymal malignancies, in which a high proportion of cases are characterized by specific genetic rearrangements such as translocations . Interestingly, certain cancer types display characteristic instability phenotypes. For instance, BRCA-associated breast and ovarian cancers demonstrate high levels of CIN, whereas lung cancer in smokers and never smokers differs in the extent of segmental alterations and subsequently, genome instability [83, 84, 85, 86]. Moreover, specific subtypes of breast, ovarian, and lung cancers exhibit distinct patterns of alterations; the basal-like subtype of breast cancers (typically estrogen receptor-negative) have greater CIN than luminal subtypes, while type II high-grade serous ovarian carcinomas have greater CIN than type I serous ovarian cancers [87, 88]. In lung cancer, adenocarcinoma and squamous cell carcinoma demonstrate distinct patters of genomic alterations, and within lung adenocarcinoma, the magnoid subtype displays higher CIN than other adenocarcinoma subtypes [89, 90]. A review of genome sequencing studies revealed that epithelial-derived cancers such as breast, non-small cell lung, small-cell lung, melanoma, and prostate cancers have a greater number of somatic mutations than blood cancers including acute myeloid leukemia , which could suggest that epithelial cancers have greater nucleotide instability. However, specific environmental exposures, such as tobacco smoke, can have specific signatures in terms of epigenetic and genetic alterations in tumors, making it difficult to determine whether the mutations detected arose from nucleotide instability within a tumor or from carcinogen exposure . As more cancer genome sequence data become publicly available, it will be interesting to determine whether specific cancer types exhibit a mutator phenotype and harbor greater nucleotide instability than others.
Proportion of genome altered (log2 ratio ± 0.1) for various cancer types (n = 2,201)
Acute lymphoblastic leukemia
Genomic instability occurs early in tumorigenesis, increasing the spontaneous mutation rate and enabling the acquisition of DNA alterations that promote the hallmarks of cancer, thereby driving tumor development. While the molecular basis of instability is well understood in hereditary cancers, where it is linked to mutations in DNA repair genes, the basis of instability in sporadic cancers remains poorly defined. This limited understanding is due both to the genomic heterogeneity in different tumor types as well as within individual tumors and a lack of methods capable of capturing both the state and rate of instability, which are required to determine the true measure of instability. Genome sequencing studies have provided a wealth of information regarding the state of instability in a variety of cancers, highlighting the diversity in both the types and amounts of instability observed in tumor genomes. As the amount of starting materials for whole-genome sequencing experiments continues to decrease, single-cell sequencing will become feasible for solid tumors; with this will come an expanded understanding of which mechanisms of genomic instability are selected for and precisely how specific patterns of instability support tumor growth in unique systems. In combination with repeat biopsies and sequencing of multiple areas in a single tumor, detailed maps of how genomic instability changes over time will emerge, which can then be interpreted in the context of unique selective pressures in the tumor microenvironment (e.g., the immune system, chemotherapy) or correlated to specific clinical features (e.g., tumor progression). Genomic instability remains an important, yet poorly defined, mechanism by which tumors accelerate their own evolution and survival. At the same time, once uncovered, these same mechanisms will undoubtedly present to the researcher a host of novel therapeutic opportunities.
This work was supported by funds from the Canadian Institutes for Health Research (CIHR; MOP 86731, MOP 94867, MOP-110949), Canadian Cancer Society Research Institute (no. 700809), U.S Department of Defence (CDMRP W81XWH-10-1-0634), NCI Early Detection Research Network, and the Canary Foundation. LAP and KLT are supported by Vanier Canada Graduate Scholarships. EAV is supported by Frederick Banting and Charles Best Canada Graduate Scholarship from CIHR.
Conflict of interest
The authors declare that they have no conflict of interest.
- 14.Di Nicolantonio, F., Martini, M., Molinari, F., Sartore-Bianchi, A., Arena, S., Saletti, P., et al. (2008). Wild-type BRAF is required for response to panitumumab or cetuximab in metastatic colorectal cancer. Journal of Clinical Oncology: Official Journal of the American Society of Clinical Oncology, 26(35), 5705–5712.CrossRefGoogle Scholar
- 26.Kim, W. S., Park, C., Hong, S. K., Park, B. K., Kim, H. S., & Park, K. (2000). Microsatellite instability (MSI) in non-small cell lung cancer (NSCLC) is highly associated with transforming growth factor-beta type II receptor (TGF-beta RII) frameshift mutation. Anticancer Research, 20(3A), 1499–1502.PubMedGoogle Scholar
- 52.Abdel-Rahman, W. M., Katsura, K., Rens, W., Gorman, P. A., Sheer, D., Bicknell, D., et al. (2001). Spectral karyotyping suggests additional subsets of colorectal cancers characterized by pattern of chromosome rearrangement. Proceedings of the National Academy of Sciences of the United States of America, 98(5), 2538–2543.PubMedCrossRefGoogle Scholar
- 66.Gondek, L. P., Tiu, R., Haddad, A. S., O'Keefe, C. L., Sekeres, M. A., Theil, K. S., et al. (2007). Single nucleotide polymorphism arrays complement metaphase cytogenetics in detection of new chromosomal lesions in MDS. Leukemia: Official Journal of the Leukemia Society of America, Leukemia Research Fund, UK, 21(9), 2058–2061.CrossRefGoogle Scholar
- 78.Seo JS, Ju YS, Lee WC, Shin JY, Lee JK, Bleazard T, Lee J, Jung YJ, Kim JO, Yu SB et al. (2012). The transcriptional landscape and mutational profile of lung adenocarcinoma. Genome Research, 22, 2109–2119.Google Scholar
- 84.Huang, Y. T., Lin, X., Liu, Y., Chirieac, L. R., McGovern, R., Wain, J., et al. (2011). Cigarette smoking increases copy number alterations in nonsmall-cell lung cancer. Proceedings of the National Academy of Sciences of the United States of America, 108(39), 16345–16350.PubMedCrossRefGoogle Scholar
- 88.Fang, M., Toher, J., Morgan, M., Davison, J., Tannenbaum, S., & Claffey, K. (2011). Genomic differences between estrogen receptor (ER)-positive and ER-negative human breast carcinoma identified by single nucleotide polymorphism array comparative genome hybridization analysis. Cancer, 117(10), 2024–2034.PubMedCrossRefGoogle Scholar
Open Access This article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.