Unintended CRISPR-Cas9 editing outcomes: a review of the detection and prevalence of structural variants generated by gene-editing in human cells

Hunt, John Murray Topp; Samson, Christopher Allan; Rand, Alex du; Sheppard, Hilary M.

doi:10.1007/s00439-023-02561-1

Unintended CRISPR-Cas9 editing outcomes: a review of the detection and prevalence of structural variants generated by gene-editing in human cells

Review
Open access
Published: 24 April 2023

Volume 142, pages 705–720, (2023)
Cite this article

Download PDF

You have full access to this open access article

Human Genetics Aims and scope Submit manuscript

Unintended CRISPR-Cas9 editing outcomes: a review of the detection and prevalence of structural variants generated by gene-editing in human cells

Download PDF

5966 Accesses
17 Citations
16 Altmetric
1 Mention
Explore all metrics

Abstract

Genome editing using the clustered regularly interspaced short palindromic repeats (CRISPR) and CRISPR-associated protein (Cas) gene-editing system (CRISPR-Cas) is a valuable tool for fundamental and applied research applications. Significant improvements in editing efficacy have advanced genome editing strategies into phase 3 human clinical trials. However, recent studies suggest that our understanding of editing outcomes has lagged behind the developments made in generating the edits themselves. While many researchers have analyzed on- and off-target events through the lens of small insertions or deletions at predicted sites, screens for larger structural variants (SVs) and chromosomal abnormalities are not routinely performed. Full and comprehensive validation of on- and off-target effects is required to ensure reproducibility and to accurately assess the safety of future editing applications. Here we review SVs associated with CRISPR-editing in cells of human origin and highlight the methods used to detect and avoid them.

Recent advances in the delivery and applications of nonviral CRISPR/Cas9 gene editing

Article 29 March 2023

A survey of best practices for RNA-seq data analysis

Article Open access 26 January 2016

Opportunities and challenges in long-read sequencing data analysis

Article Open access 07 February 2020

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

The development of the clustered regularly interspaced short palindromic repeats (CRISPR) and CRISPR-associated (Cas) protein gene-editing system (CRISPR-Cas) in 2012 transformed our ability to treat genetic diseases by enabling targeted-modification of intracellular DNA. Monogenic diseases are the most common target of CRISPR gene therapy, and some, such as sickle cell disease or transfusion-dependent beta-thalassemia (TDT), have recently advanced into phase 3 clinical trials (clinical trial.gov: NCT03655678/NCT03745287/NCT05477563). However, the CRISPR-Cas system is not limited to simple hereditary diseases, and human clinical trials have begun for the treatment of other conditions, such as cancers (clinical-trial.gov: NCT04976218) or bacterial (clinical-trial.gov: NCT04191148) and viral infections (clinical-trial.gov: NCT05144386).

Gene editing using CRISPR-Cas9

The CRISPR-Cas system excels due to its capacity to provide inexpensive, accessible, and robust editing without a requirement for retroviral integration. Using this approach, gene-editing is typically facilitated by the CRISPR-associated protein 9 (Cas9), an endonuclease which cleaves double-stranded DNA at genomic loci defined by a 20-nucleotide guide RNA (gRNA) and a three-nucleotide protospacer adjacent motif (PAM) (Doudna and Charpentier 2014). In humans, double-stranded DNA breaks (DSBs) are primarily repaired by the error-prone non-homologous end joining (NHEJ) pathway, typically resulting in small (up to ten nucleotide) insertion and deletion events (INDELs) (Allen et al. 2018; Chang et al. 2017). INDELS can be targeted to promoter or protein-coding regions to directly disrupt gene activity and can be used to ablate gene expression. Specific DNA changes can be achieved by providing an appropriate DNA template to employ the endogenous homology repair or single-strand template repair pathways in a process referred to as homology directed repair (HDR) (Lee et al. 2022). Genome editing by HDR is the mainstay of personalized gene therapy as it can be used to precisely correct patient-specific disease-causing mutations or to generate genotypes of interest for disease modeling.

Gene editing can lead to the unintended generation of structural variants

The primary concern when considering the clinical application of any gene therapy is the potential for unintended genome alterations that may create genomic instability or interfere with regular gene function. Accordingly, it is important to be aware that the CRISPR-Cas system may generate undesired, genotoxic side effects. For example, CRISPR-Cas gRNAs may tolerate small DNA mismatches and cause DNA cleavage and thus INDELs at off-target sites (Han et al. 2020). Furthermore, the improper repair of any DSBs, either at on- or off-target sites has the potential to induce larger genomic aberrations (> 50 bp) known as structural variants (SVs) (Mahmoud et al. 2019). These structural variants can be broadly categorized into deletions, duplications, insertions, inversions, translocations, viral vector integrations, and more complex events such as chromothripsis (Fig. 1) (Mahmoud et al. 2019). While exogenous DNA insertions, such as template insertions and viral vector integrations, comprise a distinctive group of SVs compared to endogenous DNA insertions, for the purposes of this review, we have chosen to classify these insertions as part of the general insertion SV group. Given that recent studies indicate that SVs can play a key role in driving tumorigenesis (Alhafidz and Ailith 2022), the generation of SVs, however rare, should be a key consideration for any gene therapy. It is worth acknowledging that the oncogenic potential of SVs is dependent on the specific SV and the permissibility of the cell to transformation (Dubois et al. 2022). Additionally, while the analysis of SVs resulting from CRISPR-editing is not well reported in clinical trials, there have been no reports of cancer incidence associated with CRISPR-based therapies to date.

Despite the rapid advancements made with the CRISPR-Cas gene-editing system, our understanding of editing outcomes has lagged behind the developments made in generating the edits themselves. Although the potential for CRISPR-Cas9 editing to cause translocations and large deletions was documented in 2014 (Choi and Meyerson 2014; Maddalo et al. 2014), it was not until four years later that the full extent of the generation of on-target SVs began to be unraveled (Kosicki et al. 2018). Subsequently, on-target, mega-base-scale deletions, insertions, chromosomal truncations, copy-neutral loss of heterozygosity (CN-LOH), translocations, and chromothripsis events have all been described as a result of CRISPR-Cas editing in human cells (Boutin et al. 2021, 2022; Cullot et al. 2019; Kosicki et al. 2018; Leibowitz et al. 2021; Liu et al. 2021; Rayner et al. 2019; Turchiano et al. 2021; Weisheit et al. 2020; Yin et al. 2019; Zhang et al. 2021). Still, to date, the prevalence of SVs in CRISPR-Cas edited human cells is largely unknown. This delay in understanding is likely due to the most widely used on- and off-target editing analysis methods being limited in their ability to detect SVs (see section "Evidence for CRISPR-associated SVs"). If SVs remain undetected, they may alter either the reported editing frequency, cell genotype, or skew assay results by altering cellular function, and thus they have the potential to corrupt entire studies (Blondal et al. 2021; Rayner et al. 2019; Weisheit et al. 2020). Furthermore, the rapidly advancing cell-based CRISPR therapeutic strategies can require millions to billions of edited cells to be engrafted to a patient (Boutin et al. 2021). It is conceivable that within a pool of edited cells, SVs may confer a growth advantage or even promote an oncogenic state. Thus, a comprehensive understanding of SV outcomes in CRISPR-edited cell populations may be prudent to ensure the safety of future gene therapy approaches. As CRISPR-Cas editing strategies are highly diverse, it is likely that the predominant types of SVs and their frequencies will need to be assessed for each unique application.

Evidence for CRISPR-associated SVs

SVs in human cancer cells lines

Genome-edited cancer cell lines are desirable for the study of cancer biology and therapeutics. However, many cancer-derived cell lines exhibit high levels of chromosomal instability, in part due to aberrant DNA repair mechanisms, such as inactivation of the tumor suppressor protein, p53 (Rayner et al. 2019). Thus, it is feasible that the induction of DSBs in these cancer cell lines may further compromise genome integrity and cause widespread complex SVs.

In the well-studied HEK293T cancer cell line, kilobase-sized deletions and inversions have been detected at frequencies of ~ 3% (0.1–5 kilobase (kb)) (Yin et al. 2019; Zhang et al. 2021), and 0.05% (5–50 kb), following the induction of a single DSB (Yin et al. 2019). Distal chromosome arm truncations have been detected at frequencies of 10–25.5% in edited HEK293T cell clones, independent of the target loci (Boutin et al. 2021; Cullot et al. 2019). Intra-chromosomal translocations have also been detected, making up to 6.2–14% of editing outcomes in one study utilizing HEK293T cells (Liu et al. 2021; Yin et al. 2019; Zhang et al. 2021). Interestingly, chromosomal translocations occurred at similar frequency between both predicted off-target sites and low-level genome-wide DSB events, which suggests that translocations may be possible even in the absence of off-target DSBs (Yin et al. 2019; Zhang et al. 2021).

Similarly, widespread chromosomal instability, including intra-chromosomal translocations and distal chromosome arm truncations as a result of CRISPR-editing has been described in well-defined colorectal cancer cell lines (Przewrocka et al. 2020; Rayner et al. 2019). CRISPR-associated chromosome instability was more prominent in cancer cell lines, which exhibit aneuploidy (COLO320 and SW1463) than those with a more stable karyotype (HCC2998 and HTC116). Przewrocka et al. (2020), identified chromosomal truncations in CRISPR-edited HCT116 cells at rates of 2–7%. RNA sequencing of the affected clones showed that the most significantly downregulated genes were those located on the truncated chromosome arm distal to the target site, highlighting a functional consequence of these SVs (human cell studies are summarised in Table 1).

Table 1 Summary of papers that have identified SVs in human cells

Full size table

SVs in primary cells, immortalized primary cells, iPSCs and HSPCs

Primary cells and stem cells are useful for basic and translational research as they more closely reflect in vivo cells compared to cancer cell lines and they are the foundation of cell-based gene therapies. Kiosicki et al. (2018) reported extensive on-target SVs in single-DSB edited human cells (human telomerase reverse transcriptase (hTERT) immortalized retinal pigment epithelium cells (RPE1)), which included kilobase-sized deletions, insertions, and rearrangements (Kosicki et al. 2018). Subsequently, gene-editing related SVs have been reported in other primary and stem cells including hTERT-immortalized fibroblasts, induced pluripotent stem cells (iPSCs), and hemopoietic stem and progenitor cells (HSPCs) (Blondal et al. 2021; Boutin et al. 2021; Simkin et al. 2022; Turchiano et al. 2021; Weisheit et al. 2020; Wen et al. 2021).

Regarding induced pluripotent stem cells, one study indicated that up to 40% of iPSC clones had on-target SVs, including 0.5–4 kb deletions or CN-LOH of the entire chromosome arm distal to the target site (Weisheit et al. 2020). In this study, one of the target loci was the amyloid precursor protein (APP) encoding gene. Cortical neurons derived from iPSCs with SVs had APP expression reduced by ~ 50% compared to those without SVs. The authors noted that the reduction in APP expression may have significantly altered the disease modeling of the iPSC affected by SVs (Weisheit et al. 2020). Other recent studies also noted deletions over 100 bp (< 1.5%) (Wen et al. 2021), plasmid or mitochondrial DNA insertions, and CN-LOH as editing outcomes in iPSCs (Blondal et al. 2021; Simkin et al. 2022).

In hematopoietic stem and progenitor cells, kilobase-sized deletions or copy-neutral SVs seem to be the predominant on-target SVs and have been identified in up to 20% of edited cells (Boutin et al. 2021; Turchiano et al. 2021; Wen et al. 2021). An in-depth analysis of edited HSPCs by CAST-seq (a method described later in section "Methods to detect structural variants in CRISPR-edited cells") showed that large deletions, inversions, and chromosomal translocations with the homologous chromosome made up ~ 19.5% of total edited alleles (Turchiano et al. 2021). In comparison, inter-chromosomal translocations with off-target or genome-wide DSBs were only detected in approximately 0.5% of edited alleles. In a separate study, LOH of the target chromosome arm was detected at a frequency of ~ 1%, the majority being CN-LOH events (Boutin et al. 2021). Significantly, HSPC clones with CN-LOH exhibited abnormal methylation patterns and aberrant expression of two known tumor suppressor genes and one oncogene, again highlighting a previously underappreciated consequence of SVs (Boutin et al. 2021).

SVs in human embryonic stem cells, zygotes, and embryos

CRISPR-Cas9 genome editing has been reported to induce SVs in both mouse embryonic stem cells (ESC) and mouse embryos (Adikusuma et al. 2018; Kosicki et al. 2018; Owens et al. 2019). Recent studies suggest that this may also occur in genome edited human ESC and embryos (Alanis-Lobato et al. 2021; Bi et al. 2020; Zuccaro et al. 2020). In one study of CRISPR-edited human embryonic stem cells, SVs comprised up to 5.4% of detected editing events (Bi et al. 2020). Of these, 78–98% were deletions ranging from 31 to 5500 bp, although insertions and inversions were also detected. However, due to the limitations of the detection methods used in this study (IDM-seq and SNP genotyping; see section "Methods to detect structural variants in CRISPR-edited cells"), the frequency of chromosomal aberrations and CN-LOH in hESCs remains unknown. Recently, two studies in human embryos demonstrated segmental and whole chromosome losses from induced single DSBs (Alanis-Lobato et al. 2021; Zuccaro et al. 2020). Zuccaro et al. (2020) used SNP genotyping and qPCR to demonstrate that LOH of SNP sites was due to a loss of DNA from chromosome arm truncations, as opposed to homology directed repair that was reported in a previous study (Ma et al. 2017). Similarly, Alanis-Lobato et al. (2021) demonstrated that segmental deletions of 4 kb to at least 20 kb occurred in 16% of cells from embryos that were edited with a single DSB (Alanis-Lobato et al. 2021). A recent study in CRISPR-edited macaque embryos also identified large on-target deletions ranging from ~ 0.2 kb to ~ 5 kb, inversions, duplication, and de novo mutations at off-target sites (Schmidt et al. 2023).

HDR-enhancing techniques may increase the incidence of structural variants in CRISPR-edited cells

In general, gene-editing by HDR is currently limited by low editing efficacy and high variability of editing efficacy between loci and cell type, despite high on-target cutting efficiencies (Lee et al. 2022). As a result, positive selection or clonal isolation of edited cells may be required to attain experimentally useful HDR levels, which is not feasible for all editing applications. This has led to the development of HDR-enhancing techniques such as cell cycle synchronization or the chemical modulation of repair pathways (Lee et al. 2022). For example, the transient inhibition of the tumor suppressor protein, p53, has been reported to increase the efficiency of CRISPR-mediated HDR by up to 17-fold in hPSCs (Ihry et al. 2018), making it a desirable target for knockdown or inhibition (Schiroli et al. 2019). However, as p53 is a key regulator of DNA repair and growth arrest, a concern is that the induction of DSBs in p53-deficient cells may, in turn, increase the mutational burden and SV incidence in the edited cells (Mirgayazova et al. 2020). This has been explored in hTERT-immortalized fibroblasts (hTERT-fibs), where a significant increase in CRISPR-induced chromosomal truncations was reported in p53-inactivated hTERT-fibroblasts (7.7%) compared to their p53 intact counterparts (1.1%) (Cullot et al. 2019). Furthermore, in p53-depleted hTERT-RPE1 cells, those that had micro-nucleation post-editing led to granddaughter cells with chromothripsis in 72% of cases (Leibowitz et al. 2021). Although micro-nucleation was also observed in p53-competent cells at rates of up to 2.5% in hSPCs, 3% in primary foreskin fibroblasts and 7.5% in hTERT-RPE1 cells, the affected cells were ~ 50% less likely to undergo cellular division at the first cell cycle post editing (Leibowitz et al. 2021). Furthermore, in p53-active human hematopoietic stem cells, the percentage of alleles containing unbalanced rearrangements and translocations reduced over several cell cycles, likely due to negative selection pressure of these mutations (Turchiano et al. 2021). In light of this, further research is required to identify if temporary p53 inactivation can increase HDR efficiency, without increasing the incidence of long-term viable SVs in an edited population of cells. Furthermore, these data suggest that rigorous analysis for chromosomal aberrations should be standard when editing cell lines with reduced p53 capacity.

NHEJ-inhibitors (e.g., M3814 or NU7441), which inhibit key NHEJ repair pathway proteins, such as KU or DNA-PK4, have been shown to improve HDR-efficacy by four- to five-fold, independent of the target loci (Chu et al. 2015; Riesenberg et al. 2019). However, recently it has been shown that inhibition of NHEJ proteins results in increased incidence of large deletions, insertions, translations and chromosomal truncations as a result of CRISPR-Cas-mediated DNA cleavage (Do et al. 2012; Kosicki et al. 2022; Liu et al. 2021; Quan et al. 2022; Wen et al. 2021). Therefore, broad detection of editing outcomes, including on-target SVs, is essential for primary research and clinical therapies which incorporate the use of NHEJ-inhibitors.

Alternative CRISPR strategies may reduce the incidence of SVs

The conventional CRISPR-Cas system relies on introducing potentially genotoxic DSBs using the Cas9 nuclease, which can result in extensive DNA damage if not repaired correctly. High-fidelity Cas9 nuclease variants can reduce off-target effects, such as INDELs and translocations (Yin et al. 2019); however, they may not be able to prevent translocations between low-level DSBs or isolated SVs at the target site (Turchiano et al. 2021; Yin et al. 2022; Zhang et al. 2021). Cas9-nickases are catalytically modified to induce single-stranded DNA breaks, which may reduce the rate of off-target edits and on-target SVs associated with DSBs. For example, a single-nicking strategy significantly decreased the frequency of chromosomal truncations compared to a standard DSB strategy in a study using HEK293T cells (undetected versus 10% respectively) (Cullot et al. 2019). Similarly, switching from a Cas9 nuclease to a double Cas9-nickase strategy reduced translocation frequency from 2.7 to 0.5% in HEK293T cells, albeit at the expense of an up to 50% loss in editing efficiency (Yin et al. 2019). Both the Cas9-nickase-derived base editors (BE) and prime editors (PE) can also improve editing precision (Anzalone et al. 2020). Base editors introduce specific point changes at targeted sites and have recently demonstrated high efficiency with low rates of INDELS and SVs (Liao et al. 2023; Yin et al. 2022). For example, in HEK293 cells, both a cytosine (BE4max) and adenine (ABEmax) BE generated translocations at a frequency of 0.22% and 0.19% respectively compared to 1.93% with a regular Cas9 nuclease (Yin et al. 2022).

The Cas9TX variant, which fuses Cas9 nuclease with a 3′–5′ exonuclease can also reduce translocations by promoting end processing and thus decreasing re-cutting (Yin, Fang, et al., 2022; Yin et al. 2022). This was demonstrated in HEK293T cells where Cas9TX had a reduced translocation frequency of 0.42%, compared to 1.93% of regular Cas9 nuclease (Yin et al. 2022). Cas12 is another type of CRISPR enzyme that is emerging as a promising alternative to Cas9 for genome editing (Xin et al. 2022). As shown in a recent study, Cas12 can reduce the occurrence of translocations, large deletions, and viral vector integrations when compared to regular Cas9 (Xin et al. 2022). In HEK293 cells, Cas12f variants were found to reduce translocations by two to threefold compared to Cas9 (with translocation rates of approximately 1.17% versus 3.55%, respectively), albeit with a generally lower editing efficiency (Xin et al. 2022). As novel CRISPR-editing tools continue to emerge, it is important to evaluate their effects on the occurrence of SVs, such as translocations, in addition to their editing efficiency. This would provide valuable insights into the overall performance and safety of these new systems.

Retention of SVs post editing

One way to assess the impact of SVs is by tracking their prevalence over time (see section "Clonal expansion assays"). Numerous studies have found that the occurrence of SVs decreases after several cell cycles following editing, which may be due to inhibition in cell growth during cell cycle checkpoints (Boutin et al. 2021; Turchiano et al. 2021; Yin, Lu, et al. 2022). However, the persistence or clonal expansion of a particular edit, including SVs, could indicate a mutation that is either tolerated or potentially confers a positive selective growth advantage. For example, two months following the infusion of autologous edited T cells into mice, Wu et al. found that the translocation frequency in the cell population had reduced, but translocations still persisted (0.98% in ex vivo activated T cells versus 0.17% to 0.59% in T cells two months after transplantation) (Wu et al. 2022). In this case, the authors speculated that the retention of the remaining translocations was likely due to a passenger effect from general expansion of the T cells rather than translocation-driven selection. Nonetheless, despite a trend toward decreasing numbers of SV carrying cells demonstrated in previous studies (Boutin et al. 2021; Turchiano et al. 2021; Yin et al. 2022), these studies indicate that SVs can persist for many months (Wu et al. 2022).

Current methods used to analyze CRISPR edits

The most common CRISPR-edit analysis methods typically involve the generation of short amplicons (< 1 kb) by polymerase chain reaction (PCR) spanning the edited region of interest. Amplicons are then sequenced using either Sanger or next-generation sequencing (NGS). This short amplicon sequencing is favored as it is rapid, relatively inexpensive, and well supported with user-friendly bioinformatic tools for analysis (C Li et al. 2022). The key limitation of short-amplicon sequencing is that it can only detect mutations that are housed within the relatively small amplicon, which renders it unable to detect the majority of SVs (see Fig. 1). For example, in the case of a 1 kb amplicon, deletions (i.e. a unidirectional deletion of > 500 nucleotides), translocations, or other SVs may remove a primer amplification site(s) and prevent amplification and therefore detection. Additionally, insertions or duplicated regions may push primer annealing sites apart potentially reducing amplification efficiency with standard PCR cycle settings. Therefore, short amplicon analysis cannot confirm the absence of undesired edits as it cannot detect the vast majority of SVs.

One commonly used approach to identify off-target INDELs and certain types of SVs involves whole genome sequencing (WGS), specifically using short-read Illumina WGS (SR-WGS) (Li et al. 2019). Whole exome sequencing (WES) and total RNA sequencing (RNA-seq) are also used but have reduced scope compared to WGS as they only encompass the coding regions of the genome. A key limitation of SR-WGS is that the standard 30x genome coverage lacks the read depth to detect low-frequency mutations (INDELs or SVs) in pooled, heterozygous edited DNA. Increasing read depth would enhance the detection of rarer variants, but this comes with a significant increase in cost. Additionally, while currently available SV detection algorithms can detect high frequency SVs of all types, detection of SVs present at a frequency below 20% in pooled cell populations continues to be difficult to perform with adequate sensitivity, even at average sequencing depths exceeding 90x (Gong et al. 2021). An alternative solution is to analyze many individual clones isolated from the original mixed pool of edited cells. Although this would enable the detection of SVs (Alanis-Lobato et al. 2021; Schmidt et al. 2023; Simkin et al. 2022), this method can be expensive, labor-intensive, and would be difficult to achieve sufficient depth to identify rare variants, requiring the analysis of hundreds of cell clones, which makes it unsuitable for use in many settings.

Methods to detect structural variants in CRISPR-edited cells

Currently, there is no single method which can comprehensively detect all SVs present in heterogeneous, pooled, edited DNA in an unbiased genome-wide manner. However, many techniques ranging from cytogenetic analysis to novel NGS-library preparations have been developed to detect and characterize editing-associated SVs (as summarized in Table 2).

Table 2 Methods used or developed to detect SVs in edited human cells

Full size table

Cytogenetic analysis (FISH, aCGH, and SNP-analysis)

Fluorescence in situ hybridization (FISH) uses fluorescent DNA probes to label the presence (or absence) of complementary regions of a chromosome, which are viewed in interphase or metaphase cells. These probes can be designed to flank a locus of interest to identify a change in chromosome ploidy post-editing, with a standard resolution of 50 kb (Martin and Warburton 2015). Chromosome arm truncations can be visualized with FISH using probes complementary to the centromeric and telomeric regions of the target chromosome arm where loss of the telomeric probe is indicative of a truncation (Cullot et al. 2019). This method is valuable when analyzing aneuploid cell lines, where SVs could be masked by homologous chromosomes (Cullot et al. 2019).

Array comparative genomic hybridization (aCGH) enables localized or genome-wide screening of DNA copy-number imbalances based on the relative intensity of sample and control DNA fragments attached to an array of complimentary fluorescent probes (Cullot et al. 2019). aCGH can assay any locus that is represented on an array with a theoretical resolution up to 500 bp dependent on the frequency and size of the probes (Conrad et al. 2009). However, aCGH can only detect copy number changes and is unable to detect balanced chromosomal SVs (inversions, translocations, and CN-LOH) or the location of the copies. Furthermore, aCGH is intended to be used on heterozygous cells, so its application is limited to the analysis of clonal cell lines.

SVs can also be detected via analysis of the allelic ratios of native single-nucleotide polymorphisms (SNPs) (Alanis-Lobato et al. 2021; Leibowitz et al. 2021; Przewrocka et al. 2020; Simkin et al. 2022; Weisheit et al. 2020; Zuccaro et al. 2020). In edited cells, the loss of heterozygosity across multiple concurrent SNPs indicates a deletion, while a consistent 1:2 SNP ratio indicates a duplication. SNP genotyping assays may entail simple PCR and Sanger sequencing of known SNPs, or microarrays which encompass SNPs across the human genome. The LOH of SNPs can be tracked along a chromosome arm to determine the extent of large deletions or identify chromosomal truncations, where the LOH will extend to the telomere. However, SNP-analysis methods are also restricted to clonal cell lines and act as an indicator of SVs as they are not typically able to resolve the SV to base pair resolution.

Quantitative genotyping PCR

SVs which prevent PCR amplification can be detected by allele copy number analysis by real-time quantitative PCR (qPCR) (Boutin et al. 2021). In qPCR, intercalating fluorescent DNA dyes or fluorescently labeled oligo-probes are used during PCR to produce fluorescent amplicons. The fluorescence intensity, which is proportional to the quantity of amplicon at any moment, is measured after each PCR cycle. The cycle number where the fluorescence becomes detectable above background is called the quantification cycle (Cq). For allele copy number analysis, equimolar amounts of DNA from control and edited cell clones are run in parallel so that the Cq cycles can be compared. Cell clones with large deletions will have higher Cq values due to reduced allele copy number, while those with duplications may have lower Cq values due to an increase in allele copy number. Recently, quantitative genotyping PCR (qgPCR) was developed (Simkin et al. 2022; Weisheit et al. 2020, 2021). qgPCR is a combination workflow where the qPCR primers are designed to match standard genotyping PCR primers, so that both the genotype and allele copy number can be determined. For example, in the case of a heterozygous deletion which prevents amplification of one allele, a standard genotyping assay may indicate a homozygous genotype, but the qgPCR will have a higher Cq value, indicative of a deletion. This enables the detection of SVs that would not be detected by standard genotyping PCR due to the loss of a primer site. However, qgPCR acts primarily as an indicator of SVs and is not able to resolve the SV to base pair resolution.

Targeted amplicon sequencing, long-range sequencing technologies and IDM-seq

Standard amplicon sequencing enables the detection of edits that are contained entirely within the amplicon. Thus, the amplification of longer DNA fragments enables the detection of larger SVs, but is limited by the length of the amplification fragments that can be produced, and the requirement of the presence of both primer sites. Long-amplicon sequencing refers to the production and sequencing of large amplicons, typically 5–20 kb, which would enable the detection of kilobase-sized SVs.

Long-read sequencing technologies, such as Oxford Nanopore Technologies (ONT) or Pacific Biosciences (PacBio), provide alternative methods to observe SVs in long amplicons. For a more detailed description of these technologies including their pros and cons, we direct readers to a recent review (Logsdon et al. 2020). In brief, these technologies generate reads that are kilobases to megabases in length, so can cover entire amplicons and generate reads that are likely to contain unique SVs. In contrast, short-read sequencing methods may not capture or only partially capture SVs, making the mapping and analysis of pooled DNA challenging. Although long-read sequencing technologies generally have lower base-pair accuracy than Illumina sequencing, high accuracy is not essential for the visualization of large SVs. Moreover, PacBio sequencing can achieve high accuracy with sufficient sequencing passes of the same amplicon (Logsdon et al. 2020), and ONT sequencing accuracy is continually improving with new sequencing platforms and reagents (https://nanoporetech.com/accuracy). A benefit of higher sequencing accuracy is that it may facilitate the detection of both SVs and the accurate quantification of desired edits (such as HDR and INDELs) in a single assay (Bi et al. 2020). PacBio sequencing can achieve high base pair accuracy up to 15–20 kb, but its throughput is still limited and its cost per base sequenced is comparatively high among NGS technologies (Logsdon et al. 2020). Similarly, ONT technologies face similar limitations, although the exact extent may vary depending on the experimental setup.

While sequencing PCR amplicons can be used to identify the presence of SVs, the process of amplification and sequencing can introduce PCR and sequencing duplicates, hampering the accuracy of quantification. Individual DNA molecule sequencing (IDM-Seq) prevents this using unique molecular identifiers (UMIs) and either short- or long-range PCR to quantify the abundance of SVs over a target site (Bi et al. 2020). In IDM-seq, addition of a UMI is achieved by performing a round of primer extension using a single-stranded primer (containing a 10-12 nucleotide UMI sequence and 5′ universal primer sequence) that is specific to the locus of interest. Subsequent PCR amplification is then performed with a universal primer and a locus-specific reverse primer. Labeled amplicons can then be sequenced on NGS platforms. The addition of UMIs enables accurate quantification of allele frequencies from the bulk DNA, as each UMI group represents a single DNA molecule present at the initial labeling step (Bi et al. 2020).

SV capture techniques; LAM-HTGTS, PEM-seq and CAST-seq

SV capture techniques use linker-mediated amplification or single-primer PCR amplification across an on-target cleavage site to produce amplicons which may contain a SV boundary. These amplicons consist of a known sequence of DNA (bait DNA), followed by the prey DNA, which is either the reference or an aberrant DNA sequence (Fig. 2a). The nature of a SV can be resolved by mapping the prey DNA sequence to a reference genome. For example, in the case of a 5 kb deletion, the prey sequence will align to the reference genome 5 kb downstream of the bait sequence. If a translocation has occurred, the bait and prey DNA sequences will align to different chromosomes.

Several single-primer PCR amplification techniques have been developed, including linear amplification-mediated high-throughput genome-wide translocation sequencing (LAM-HTGTS) and primer extension-mediated sequencing (PEM-seq). LAM-HTGTS was developed to track translocations for the identification of off-target DSB sites. First, linear extension is performed with a biotinylated primer for ~ 80 cycles, followed by streptavidin-based isolation of amplicons (Fig. 2b) (Hu et al. 2016). Adapters are then ligated to the 3′ end of single-stranded amplicons and a nested PCR is performed using the adapter and a target-specific primer. In LAM-HTGTS, to improve the detection of variants, DNA molecules, which do not contain SVs, can be ablated via restriction digest using a rare-cutting restriction enzyme which cleaves unedited prey DNA. This step can be omitted in order to detect small repair events such as INDELs, and to allow quantification of the portion of fragments containing a SV. However, due to amplification and sequencing duplicates, the accuracy of quantification with LAM-HTGTS is limited.

PEM-seq is similar to LAM-HTGTS, but is able to accurately quantify allele frequency in pooled DNA due to the addition of UMIs before PCR amplification (Fig. 2c) (Yin et al. 2019). In PEM-seq, primer extension is conducted with a biotinylated primer for only one cycle, and amplicons are purified using streptavidin magnetic beads. Sequencing adapters with UMIs are then ligated onto the purified amplicons before a nested PCR, followed by NGS. This addition of a UMI means that each DNA molecule from the original pool of DNA before amplification is represented by a single UMI. This UMI can be utilized during analysis to accurately quantify variant frequency (both SV and INDELs) with high sensitivity (dependent on sequencing depth).

Chromosomal aberrations analysis by single targeted linker-mediated PCR sequencing (CAST-seq) is a method which can enrich and detect SVs using the bait/prey DNA system (Fig. 2d) (Turchiano et al. 2021). In CAST-seq bulk DNA is fragmented and then a linker is ligated to both ends. A subsequent PCR is performed using bait sequence specific and linker specific primers. ‘Decoy’ primers are also included which are specific to the reference genome within the prey sequence. Due to the presence of decoy primers, DNA fragments which retain the native prey DNA generate fragmented PCR products, rather than the full-length PCR products produced by those containing a SV. A second nested PCR is then performed using separate primers specific to the bait and linker DNA within the amplicons, allowing for amplification of fragments containing the locus of interest and a SV. A third PCR is performed to introduce a NGS barcode and adapter for sequencing. The CAST-seq method significantly enriches for SV containing sequences and has a detection threshold down to one SV per 10,000 cells. Furthermore, while the frequency of SVs in bulk DNA cannot be readily quantified, this can be achieved via a ddPCR calibration step using the same pooled DNA.

The benefits of using single-primer amplification for analysis of a targeted-DSB site are as follows: first, single-primer and linker-mediated PCR relies on only one loci-specific PCR primer for the initial amplification. Thus, they can detect SVs that cannot be detected by standard amplicon sequencing due to the removal of one primer binding site; second, the target primer can be placed either side of the target loci, which improves the detection capabilities; third, the amplification products are typically small, so can be robustly interrogated by NGS, even with pooled DNA. Finally, these techniques can identify SV junction points, including translocations, and if frequent these junctions may highlight off-target loci for further analysis.

While LAM-HTGTS, PEM-seq, and CAST-seq are useful for detecting SVs, these techniques have limitations associated with the use of PCR amplification. First, these rely on effective primer design, which may not be possible at all loci. Second, SVs where the specific (bait) primer site has been lost cannot be detected, as may be the case with large deletions. As these primers are typically located in close proximity to the target site (~ 200 bp), these techniques are best suited for the detection of insertions, inversions, translocations, and small deletions. However, large deletions which retain the specific (bait) primer site will be detected. While using a primer specific to a more distal site could improve the detection of larger deletions, this is limited by both the maximum size of amplicon that can be produced by PCR and the read length of the sequencing technique used. Finally, it should be noted that a ‘universal bait DSB’ strategy was introduced for LAM-HTGTS (Hu et al. 2016). This strategy may also be useful for PEM-seq and CAST-seq, eliminating the need for primer design at each targeted loci (more information can be found in the original paper) (Hu et al. 2016). However, the ‘universal bait DSB’ method is limited to the detection of chromosomal translocations.

Xdrop

Xdrop™ is an indirect sequence capture system that circumvents the requirement for targeted PCR amplification over the break site in order to capture SV sequences (Blondal et al. 2021; Madsen et al. 2020). Xdrop capture is achieved by encapsulating DNA fragments of up to 100 kb into double emulsion droplets (water/oil/water) along with PCR primers that amplify a 100–200 bp product over 5 kb distal to loci of interest. PCR is then performed on the encapsulated DNA, followed by staining with a DNA-intercalating fluorescent dye and flow-assisted sorting to isolate fluorescent droplets containing the region of interest. The sorted DNA is then amplified by multiple displacement amplification (MDA) which amplifies large DNA fragments without requiring specified primer sets (Lasken 2009). The amplified enriched DNA is then prepared for sequencing by NGS. A key advantage of the Xdrop method is that it enables the examination of longer DNA amplicons (restricted by the average length of DNA) than could be possible by traditional PCR. In addition, the distance of the selection PCR primers from the target site means that they are not restricted in their design and are less likely to be removed by large deletions.

PEAC-seq

Prime editor-assisted off-target characterization (PEAC-seq) is a technique that can detect off-target sites and translocations (Yu et al. 2022). It uses a Cas9 nuclease which is fused to the Moloney Murine Leukemia Virus (M-MLV) reverse transcriptase (RT) protein. The Cas9 creates a DSB at both on-target and off-target sites guided by the prime editing gRNA (pegRNA) sequence. The RT then introduces a "tag" sequence at the DSB site through reverse transcription of the RT template. Bulk edited DNA is fragmented using tagmentation with the Tn5 transposase, which incorporates adapter and UMI sequences to account for potential PCR and sequencing bias. Two separate PCR reactions are conducted to amplify both sides of the target site, each using a PCR primer that binds to the Tn5 and tag regions in either the forward or reverse direction. A subsequent PCR reaction is conducted on the products to add adapters for Illumina sequencing. Off-target sites are then identified by aligning the sequences surrounding the tag DNA to a reference genome. PEAC-seq can also be used to identify translocations at a known target site by substituting the tag specific primers with a site-specific forward primer. This reaction produces an amplicon that spans the target site into the candidate off-target region and can detect translocations from the target DNA with or without the tag sequence. PEAC-seq is however limited by the insertional efficiency of the PEAC-seq tag, which the authors note may vary between pegRNA and target loci.

Alternative SV detection methods (Strand-seq)

While this review focuses on SV detection methods that have already been used to evaluate CRISPR-editing, other technologies that could aid in SV detection have not yet been incorporated into a CRISPR analysis workflow (Mahmoud et al. 2019). For example, Strand-seq is a single-cell sequencing method initially developed to track sister-chromatid exchanges (Falconer et al. 2012) but has also been utilized to detect other SVs such as deletions, duplications, inversions, and translocations (Jeong et al. 2022; Sanders et al. 2019). Further details about Strand-seq can be found in the primary sources (Falconer et al. 2012; Jeong et al. 2022; Sanders et al. 2017, 2019). Briefly, by sequencing single-chromosome strands, Strand-seq enables the differentiation of sequences between parental chromosomes, enabling a more robust evaluation of SVs than is provided by other single-cell and whole genome sequencing approaches. If utilized on a group of CRISPR-edited cells, Strand-seq has the potential to detect SVs across the entire genome, with the threshold for detection depending on the number of cells sequenced.

Clonal expansion assays

Another method to measure the impact of CRISPR-induced SVs is by tracking the frequency of SVs over time or by performing assays to track the clonal expansion of edited cells. The SV detection methods mentioned above can be performed at subsequent time points during cell expansion to monitor the SV containing population of edited cells and to potentially identify the expansion of undesirable edits. However, this analysis will be limited to the types of SVs each method is able to detect. Alternative techniques are available to track the clonal expansion of edited cells within a heterogeneous population, although these are not necessarily specific for the detection of SVs (Sharma et al. 2021). One example is the TRACE-Seq method, which enables the introduction of the desired edit while also tracking the contribution of alleles and allele lineages (Sharma et al. 2021). This method involves generation of adeno-associated virus (AAV) libraries that have semi-randomized, silent mutations within the donor template, while also preserving the reading frame and capacity to induce the desired edit. Consequently, a pool of corrected cells with a diverse range of silent mutations is generated. The allelic contribution of the edited cells can be tracked by sequencing the target site using next-generation sequencing. If one allele increases in frequency or there is a significant change in allele contribution, it is indicative of clonal expansion of the cell containing those edits. Although TRACE-Seq is designed for AAV vectors, the same principles may be applied to any HDR approach by adding semi-randomized silent mutations to the donor template.

Conclusion

Ex vivo CRISPR-Cas9 gene therapies have already advanced into stage 2 and 3 clinical trials for a number of genetic diseases (Chen et al. 2021). In vivo gene therapies, which deliver CRISPR machinery directly into the body via adenoviral or lipid nanoparticle vectors, have also recently moved into human trials (Taha et al. 2022). Although unanticipated genotoxic events in the form of small INDELs at off-target sites are routinely evaluated, new data suggests that large on-target SVs are also consequential editing outcomes that require their own evaluation. This can be technically challenging given their diverse and complex nature. While most analysis methods have a relatively limited capacity to detect various SV classes, others, such as PEM-seq, CAST-seq and Xdrop can detect many SV types from bulk edited genomic DNA. Henceforth, it will be important to combine multiple modes of analysis to ensure the maximum detection of both small INDELs and large SVs.

To date, as described above, large deletions, insertions, inversions, rearrangements, chromosomal truncations, CN-LOH, translocations, and chromothripsis have all been described in various CRISPR-edited primary human cells and human cell lines. None of these would have been detected by “standard” genotyping analysis methods. Interestingly, themes regarding the dominant class of SV in each cell type have begun to emerge. As may be expected, aneuploid cancer cell lines with natural chromosomal instability are more prone to large chromosomal aberrations, such as truncations and translocations, compared to karyotypically stable cell lines (Rayner et al. 2019). This is likely due to a difference in regulation of key DNA repair and checkpoint proteins, such as the tumor suppressor protein p53. In genetically stable cells, moderate kilobase-sized deletions, insertions and rearrangements seem to be the prominent on-target SVs (Turchiano et al. 2021). However, CN-LOH of entire chromosome arms and low-level translocations have also been detected (Boutin et al. 2021; Leibowitz et al. 2021). The predisposition for “small”, or copy number neutral aberrations in primary cells is likely explained by the negative selection pressure of large genomic imbalances at cell cycle checkpoints (Mirgayazova et al. 2020). So, although chromosomal aberrations may be less frequent in genetically stable cells, long-term studies which track the frequency of SVs are warranted.

If not properly accounted for, on-target SVs may deleteriously impact the validity and safety of CRISPR-Cas9 research. There are now substantial precedents which indicate that standard short-amplicon analysis methods do not detect most SVs which may have significant downstream functional consequences (Boutin et al. 2021; Weisheit et al. 2020). Fortunately, as far as we are aware, no adverse events have occurred due to unintended on- or off-target CRISPR-editing in clinical trials to date. Nonetheless, it may be prudent to proceed with caution until the prevalence and the impact of large genomic aberrations are better understood. The newest generation of CRISPR-Cas tools has the potential to completely avoid DSBs and, hopefully, their associated genotoxic effects (Anzalone et al. 2020; Cullot et al. 2019; Yin et al. 2019). However, even with the development of these tools, a comprehensive understanding of all editing outcomes, from small INDELS to SVs will only serve to improve the safety of CRISPR-Cas therapies.

Finally, the aim of this review is to assist researchers who may be using the CRISPR system in a diverse range of applications. Since each application carries a varying risk of both generating SVs and risk incurred from SVs, it is the responsibility of the researcher to determine the level of concern on a case by case basis. We recommend that researchers attempt to quantify SVs when reporting editing efficacy from bulk edited cells and to determine allele copy number if working with clonal cells. This is especially important when working with cancer cells where cytogenic analysis may also be performed. In addition, functional assays could be included alongside genome editing results wherever feasible. CRISPR-based therapies and their gene targets are varied. Therefore, prior to any clinical trial, it is crucial that they undergo rigorous preclinical testing and that this process includes a robust analysis of SVs.

Data availability

Data sharing is not applicable to this article as no new data were created or analyzed in this study.

References

Adikusuma F, Piltz S, Corbett MA, Turvey M, McColl SR, Helbig KJ, Beard MR, Hughes J, Pomerantz RT, Thomas PQ (2018) Large deletions induced by Cas9 cleavage. Nature 2018 560:7717, 560(7717): E8–E9. https://doi.org/10.1038/s41586-018-0380-z
Alanis-Lobato G, Zohren J, McCarthy A, Fogarty NME, Kubikova N, Hardman E, Greco M, Wells D, Turner JMA, Niakan KK (2021) Frequent loss of heterozygosity in CRISPR-Cas9-edited early human embryos. Proceedings of the National Academy of Sciences of the United States of America, 118(22). https://doi.org/10.1073/pnas.2004832117
Alhafidz H, Ailith E (2022) Unravelling the tumour genome: The evolutionary and clinical impacts of structural variants in tumourigenesis. J Pathol 257(4): 479–493. https://doi.org/10.1002/path.5901
Allen F, Crepaldi L, Alsinet C, Strong AJ, Kleshchevnikov V, De Angeli P, Páleníková P, Khodak A, Kiselev V, Kosicki M, Bassett AR, Harding H, Galanty Y, Muñoz-Martínez F, Metzakopian E, Jackson SP, Parts L (2018) Predicting the mutations generated by repair of Cas9-induced double-strand breaks. Nat Biotechnol 2018 37:1, 37(1), 64–72. https://doi.org/10.1038/nbt.4317
Anzalone AV, Koblan LW, Liu DR (2020) Genome editing with CRISPR–Cas nucleases, base editors, transposases and prime editors. Nat Biotechnol 38(7):824–844. https://doi.org/10.1038/s41587-020-0561-9
Article CAS PubMed Google Scholar
Bi C, Wang L, Yuan B, Zhou X, Li Y, Wang S, Pang Y, Gao X, Huang Y, Li M (2020) Long-read individual-molecule sequencing reveals CRISPR-induced genetic heterogeneity in human ESCs. Genome Biol 21(1):1–14. https://doi.org/10.1186/S13059-020-02143-8/FIGURES/2
Article Google Scholar
Blondal T, Gamba C, Møller Jagd L, Su L, Demirov D, Guo S, Johnston CM, Riising EM, Wu X, Mikkelsen MJ, Szabova L, Mouritzen P (2021) Verification of CRISPR editing and finding transgenic inserts by Xdrop indirect sequence capture followed by short- and long-read sequencing. Methods 191:68–77. https://doi.org/10.1016/J.YMETH.2021.02.003
Article CAS PubMed Google Scholar
Boutin J, Cappellen D, Rosier J, Amintas S, Dabernat S, Bedel A, Moreau-Gaudry F (2022) ON-target adverse events of CRISPR-Cas9 nuclease: more chaotic than expected. CRISPR J 5(1):19–30. https://doi.org/10.1089/CRISPR.2021.0120/ASSET/IMAGES/LARGE/CRISPR.2021.0120_FIGURE2.JPEG
Article CAS PubMed Google Scholar
Boutin J, Rosier J, Cappellen D, Prat F, Toutain J, Pennamen P, Bouron J, Rooryck C, Merlio JP, Lamrissi-Garcia I, Cullot G, Amintas S, Guyonnet-Duperat V, Ged C, Blouin JM, Richard E, Dabernat S, Moreau-Gaudry F, Bedel A (2021) CRISPR-Cas9 globin editing can induce megabase-scale copy-neutral losses of heterozygosity in hematopoietic cells. Nature Communications 2021 12:1, 12(1), 1–12. https://doi.org/10.1038/s41467-021-25190-6
Chang HHY, Pannunzio NR, Adachi N, Lieber MR (2017) Non-homologous DNA end joining and alternative pathways to double-strand break repair. Nature Reviews Molecular Cell Biology 2017 18:8, 18(8):495–506. https://doi.org/10.1038/nrm.2017.48
Chen Y, Wen R, Yang Z, Chen Z (2021) Genome editing using CRISPR/Cas9 to treat hereditary hematological disorders. Gene Therapy 2021 29:5, 29(5): 207–216. https://doi.org/10.1038/s41434-021-00247-9
Choi PS, Meyerson M (2014) Targeted genomic rearrangements using CRISPR/Cas technology. Nat Commun 2014 5:1, 5(1), 1–6. https://doi.org/10.1038/ncomms4728
Chu VT, Weber T, Wefers B, Wurst W, Sander S, Rajewsky K, Kühn R (2015) Increasing the efficiency of homology-directed repair for CRISPR-Cas9-induced precise gene editing in mammalian cells. Nature Biotechnology 2015 33:5, 33(5): 543–548. https://doi.org/10.1038/nbt.3198
Conrad DF, Pinto D, Redon R, Feuk L, Gokcumen O, Zhang, Y, Aerts J, Andrews TD, Barnes C, Campbell P, Fitzgerald T, Hu M, Ihm CH, Kristiansson K, MacArthur DG, MacDonald JR, Onyiah I, Pang AWC, Robson S, Hurles ME (2009) Origins and functional impact of copy number variation in the human genome. Nature 2009 464:7289, 464(7289), 704–712. https://doi.org/10.1038/nature08516
Cullot G, Boutin J, Toutain J, Prat F, Pennamen P, Rooryck C, Teichmann M, Rousseau E, Lamrissi-Garcia I, Guyonnet-Duperat V, Bibeyran A, Lalanne M, Prouzet-Mauléon V, Turcq B, Ged C, Blouin J-M, Richard E, Dabernat S, Moreau-Gaudry F, Bedel A (2019) CRISPR-Cas9 genome editing induces megabase-scale chromosomal truncations. Nat Commun 10(1):1136. https://doi.org/10.1038/s41467-019-09006-2
Article CAS PubMed PubMed Central Google Scholar
Do TU, Ho B, Shih SJ, Vaughan A (2012) Zinc finger nuclease induced DNA double stranded breaks and rearrangements in MLL. Mutation Res/fundamental Mol Mech Mutagenesis 740(1–2):34–42. https://doi.org/10.1016/J.MRFMMM.2012.12.006
Article CAS Google Scholar
Doudna JA, Charpentier E (2014) The new frontier of genome engineering with CRISPR-Cas9. Science, 346(6213). https://doi.org/10.1126/SCIENCE.1258096/ASSET/2313E70A-5C58-4755-A0E6-2E64EE240A09/ASSETS/GRAPHIC/346_1258096_F6.JPEG
Dubois F, Sidiropoulos N, Weischenfeldt J (2022) Beroukhim R (2022) Structural variations in cancer and the 3D genome. Nat Rev Cancer 22(9):533–546. https://doi.org/10.1038/s41568-022-00488-9
Article CAS PubMed Google Scholar
Falconer E, Hills M, Naumann U, Poon SSS, Chavez EA, Sanders AD, Zhao Y, Hirst M, Lansdorp PM (2012) DNA template strand sequencing of single-cells maps genomic rearrangements at high resolution. Nature Methods 2012 9(11): 1107–1112. https://doi.org/10.1038/nmeth.2206
Geng K, Merino LG, Wedemann L, Martens A, Sobota M, Sanchez YP, Søndergaard JN, White RJ, Kutter C (2022) Target-enriched nanopore sequencing and de novo assembly reveals co-occurrences of complex on-target genomic rearrangements induced by CRISPR-Cas9 in human cells. Genome Res 32(10):1876–1891. https://doi.org/10.1101/GR.276901.122
Article PubMed PubMed Central Google Scholar
Gong T, Hayes VM, Chan EKF (2021) Detection of somatic structural variants from short-read next-generation sequencing data. Brief Bioinform 22(3):1–15. https://doi.org/10.1093/BIB/BBAA056
Article CAS Google Scholar
Han HA, Pang JKS, Soh BS (2020) Mitigating off-target effects in CRISPR/Cas9-mediated in vivo gene editing. J Mol Med 98(5):615–632. https://doi.org/10.1007/S00109-020-01893-Z/TABLES/2
Article CAS PubMed Google Scholar
Hu J, Meyers RM, Dong J, Panchakshari RA, Alt FW, Frock RL (2016) Detecting DNA double-stranded breaks in mammalian genomes by linear amplification–mediated high-throughput genome-wide translocation sequencing. Nature Protocols 2016 11(5): 853–871. https://doi.org/10.1038/nprot.2016.043
Ihry RJ, Worringer KA, Salick MR, Frias E, Ho D, Theriault K, Kommineni S, Chen J, Sondey M, Ye C, Randhawa R, Kulkarni T, Yang Z, McAllister G, Russ C, Reece-Hoyes J, Forrester W, Hoffman GR, Dolmetsch R, Kaykas A (2018) p53 inhibits CRISPR–Cas9 engineering in human pluripotent stem cells. Nat Med 2018 24(7): 939–946. https://doi.org/10.1038/s41591-018-0050-6
Jeong H, Grimes K, Rauwolf KK, Bruch PM, Rausch T, Hasenfeld P, Benito E, Roider T, Sabarinathan R, Porubsky D, Herbst SA, Erarslan-Uysal B, Jann JC, Marschall T, Nowak D, Bourquin JP, Kulozik AE, Dietrich S, Bornhauser B, Korbel JO (2022) Functional analysis of structural variants in single cells using Strand-seq. Nat Biotechnol 2022, 1–13. https://doi.org/10.1038/s41587-022-01551-4
Kosicki M, Tomberg K, Bradley A (2018) Repair of double-strand breaks induced by CRISPR–Cas9 leads to large deletions and complex rearrangements. Nat Biotechnol 36(8):765–771. https://doi.org/10.1038/nbt.4192
Article CAS PubMed PubMed Central Google Scholar
Kosicki M, Allen F, Steward F, Tomberg K, Pan Y, Bradley A (2022) Cas9-induced large deletions and small indels are controlled in a convergent fashion. Nature Communications 2022 13(1): 1–11. https://doi.org/10.1038/s41467-022-30480-8
Lasken RS (2009) Genomic DNA amplification by the multiple displacement amplification (MDA) method. Biochem Soc Trans 37(2):450–453. https://doi.org/10.1042/BST0370450
Article CAS PubMed Google Scholar
Lee ABC, Tan MH, Chai CLL (2022) Small-molecule enhancers of CRISPR-induced homology-directed repair in gene therapy: a medicinal chemist’s perspective. Drug Discovery Today 27(9):2510–2525. https://doi.org/10.1016/J.DRUDIS.2022.06.006
Article CAS PubMed Google Scholar
Leibowitz ML, Papathanasiou S, Doerfler PA, Blaine LJ, Sun L, Yao Y, Zhang CZ, Weiss MJ, Pellman D (2021) Chromothripsis as an on-target consequence of CRISPR–Cas9 genome editing Nat Genet 1–11. https://doi.org/10.1038/s41588-021-00838-7
Li C, Chu W, Gill RA, Sang S, Shi Y, Hu X, Yang Y, Zaman QU, Zhang B (2022) Computational tools and resources for CRISPR/Cas genome editing. Genomics Proteomics Bioinform. https://doi.org/10.1016/J.GPB.2022.02.006
Article Google Scholar
Li J, Hong S, Chen W, Zuo E, Yang H (2019) Advances in detecting and reducing off-target effects generated by CRISPR-mediated genome editing. In: J Genet Genomics (Vol. 46, Issue 11, pp. 513–521). Institute of Genetics and Developmental Biology. https://doi.org/10.1016/j.jgg.2019.11.002
Liao J, Chen S, Hsiao S, Jiang Y, Yang Y, Zhang Y, Wang X, Lai Y, Bauer DE, Wu Y (2023) Therapeutic adenine base editing of human hematopoietic stem cells. Nat Commun 2023 14(1); 1–11. https://doi.org/10.1038/s41467-022-35508-7
Liu M, Zhang W, Xin C, Yin J, Shang Y, Ai C, Li J, Meng FL, Hu J (2021) Global detection of DNA repair outcomes induced by CRISPR–Cas9. Nucl Acids Res 49(15):8732–8742. https://doi.org/10.1093/NAR/GKAB686
Article CAS PubMed PubMed Central Google Scholar
Logsdon GA, Vollger MR, Eichler EE (2020) Long-read human genome sequencing and its applications. Nat Rev Genet 2020 21(10), 597–614. https://doi.org/10.1038/s41576-020-0236-x
Ma H, Marti-Gutierrez N, Park SW, Wu J, Lee Y, Suzuki K, Koski A, Ji D, Hayama T, Ahmed R, Darby H, Van Dyken C, Li Y, Kang E, Park AR, Kim D, Kim ST, Gong J, Gu Y, Mitalipov S (2017) Correction of a pathogenic gene mutation in human embryos. Nature 2017 548:7668, 548(7668), 413–419. https://doi.org/10.1038/nature23305
Maddalo D, Manchado E, Concepcion CP, Bonetti C, Vidigal JA, Han YC, Ogrodowski P, Crippa A, Rekhtman N, Stanchina E, De Lowe SW, Ventura A (2014) In vivo engineering of oncogenic chromosomal rearrangements with the CRISPR/Cas9 system. Nature 2014 516(7531): 423–427. https://doi.org/10.1038/nature13902
Madsen EB, Höijer I, Kvist T, Ameur A, Mikkelsen MJ (2020) Xdrop: targeted sequencing of long DNA molecules from low input samples using droplet sorting. Hum Mutat 41(9):1671–1679. https://doi.org/10.1002/HUMU.24063
Article CAS PubMed PubMed Central Google Scholar
Mahmoud M, Gobet N, Cruz-Dávalos DI, Mounier N, Dessimoz C, Sedlazeck FJ (2019) Structural variant calling: the long and the short of it. Genome Biol 20(1):1–14. https://doi.org/10.1186/S13059-019-1828-7/TABLES/2
Article Google Scholar
Martin CL, Warburton D (2015) Detection of Chromosomal Aberrations in Clinical Practice: From Karyotype to Genome Sequence. https://doi.org/10.1146/Annurev-Genom-090413-025346, 16: 309–326
Mirgayazova, R., Khadiullina, R., Chasov, V., Mingaleeva, R., Miftakhova, R., Rizvanov, A., & Bulatov, E. (2020). Therapeutic Editing of the TP53 Gene: Is CRISPR/Cas9 an Option? Genes 2020, 11(6): 704. https://doi.org/10.3390/GENES11060704
Owens DDG, Caulder A, Frontera V, Harman JR, Allan AJ, Bucakci A, Greder L, Codner GF, Hublitz P, McHugh PJ, Teboul L, de Bruijn MFTR (2019) Microhomologies are prevalent at Cas9-induced larger deletions. Nucleic Acids Res 47(14):7402–7417. https://doi.org/10.1093/NAR/GKZ459
Article CAS PubMed PubMed Central Google Scholar
Przewrocka J, Rowan A, Rosenthal R, Kanu N, Swanton C (2020) Unintended on-target chromosomal instability following CRISPR/Cas9 single gene targeting. Ann Oncol 31(9):1270–1273. https://doi.org/10.1016/J.ANNONC.2020.04.480
Article CAS PubMed Google Scholar
Quan Z-J, Li S-A, Yang Z-X, Zhao J-J, Li G-H, Zhang F, Wen W, Cheng T, Zhang X-B (2022) GREPore-seq: A robust workflow to detect changes after gene editing through long-range PCR and nanopore sequencing. Genomics Proteomics Bioinformatics. https://doi.org/10.1016/J.GPB.2022.06.002
Article PubMed Google Scholar
Rayner E, Durin M-A, Thomas R, Moralli D, O’Cathail SM, Tomlinson I, Green CM, Lewis A (2019) CRISPR-Cas9 causes chromosomal instability and rearrangements in cancer cell lines, detectable by cytogenetic methods. CRISPR J 2(6):406–416. https://doi.org/10.1089/CRISPR.2019.0006/SUPPL_FILE/SUPP_FIG5.PDF
Article CAS PubMed PubMed Central Google Scholar
Riesenberg S, Chintalapati M, Macak D, Kanis P, Maricic T, Pääbo S (2019) Simultaneous precise editing of multiple genes in human cells. Nucleic Acids Research, 47(19): e116. https://doi.org/10.1093/nar/gkz669
Sanders AD, Falconer E, Hills M, Spierings DCJ, Lansdorp PM (2017) Single-cell template strand sequencing by Strand-seq enables the characterization of individual homologs. Nature Protocols 2017 12(6): 1151–1176. https://doi.org/10.1038/nprot.2017.029
Sanders AD, Meiers S, Ghareghani M, Porubsky D, Jeong H, van Vliet MACC, Rausch T, Richter-Pechańska P, Kunz JB, Jenni S, Bolognini D, Longo GMC, Raeder B, Kinanen V, Zimmermann J, Benes V, Schrappe M, Mardin BR, Kulozik AE, Korbel JO (2019) Single-cell analysis of structural variations and complex rearrangements with tri-channel processing. Nat Biotechnol 2019 38(3): 343–354. https://doi.org/10.1038/s41587-019-0366-x
Schiroli G, Conti A, Ferrari S, della Volpe L, Jacob A, Albano L. Beretta S, Calabria A, Vavassori V, Gasparini P, Salataj E, Ndiaye-Lobry D, Brombin C, Chaumeil J, Montini E, Merelli I, Genovese P, Naldini L, Di Micco R (2019) Precise Gene Editing Preserves Hematopoietic Stem Cell Function following Transient p53-Mediated DNA Damage Response. Cell Stem Cell, 24(4): 551-565.e8. https://doi.org/10.1016/J.STEM.2019.02.019
Schmidt JK, Kim YH, Strelchenko N, Gierczic SR, Pavelec D, Golos TG, Slukvin II (2023) Whole genome sequencing of CCR5 CRISPR-Cas9-edited Mauritian cynomolgus macaque blastomeres reveals large-scale deletions and off-target edits. Front Genome Editing 4:58. https://doi.org/10.3389/FGEED.2022.1031275
Article Google Scholar
Sharma R, Dever DP, Lee CM, Azizi A, Pan Y, Camarena J, Köhnke T, Bao G, Porteus MH, Majeti R (2021) The TRACE-Seq method tracks recombination alleles and identifies clonal reconstitution dynamics of gene targeted human hematopoietic stem cells. Nature Communications 2021 12(1): 1–12. https://doi.org/10.1038/s41467-020-20792-y
Simkin D, Papakis V, Bustos BI, Ambrosi CM, Ryan SJ, Baru V, Williams LA, Dempsey GT, McManus OB, Landers JE, Lubbe SJ, George AL, Kiskinis E (2022) Homozygous might be hemizygous: CRISPR/Cas9 editing in iPSCs results in detrimental on-target defects that escape standard quality controls. Stem Cell Reports 17(4):993–1008. https://doi.org/10.1016/J.STEMCR.2022.02.008
Article CAS PubMed PubMed Central Google Scholar
Taha EA, Lee J, Hotta A (2022) Delivery of CRISPR-Cas tools for in vivo genome editing therapy: trends and challenges. J Control Release 342:345–361. https://doi.org/10.1016/J.JCONREL.2022.01.013
Article CAS PubMed Google Scholar
Turchiano G, Andrieux G, Klermund J, Blattner G, Pennucci V, el Gaz M, Monaco G, Poddar S, Mussolino C, Cornu TI, Boerries M, Cathomen T (2021) Quantitative evaluation of chromosomal rearrangements in gene-edited human stem cells by CAST-Seq. Cell Stem Cell 28(6):1136-1147.e5. https://doi.org/10.1016/J.STEM.2021.02.002
Article CAS PubMed Google Scholar
Weisheit I, Kroeger JA, Malik R, Klimmt J, Crusius D, Dannert A, Dichgans M, Paquet D (2020) Detection of Deleterious On-Target Effects after HDR-Mediated CRISPR Editing. Cell Reports, 31(8): 107689. https://doi.org/10.1016/j.celrep.2020.107689
Weisheit I, Kroeger JA, Malik R, Wefers B, Lichtner P, Wurst W, Dichgans M, Paquet D (2021) Simple and reliable detection of CRISPR-induced on-target effects by qgPCR and SNP genotyping. Nat Protocols 2021 16(3): 1714–1739. https://doi.org/10.1038/s41596-020-00481-2
Wen W, Quan ZJ, Li SA, Yang ZX, Fu YW, Zhang F, Li GH, Zhao M, Yin MD, Xu J, Zhang JP, Cheng T, Zhang XB (2021) Effective control of large deletions after double-strand breaks by homology-directed repair and dsODN insertion. Genome Biol 22(1):1–22. https://doi.org/10.1186/S13059-021-02462-4/FIGURES/6
Article Google Scholar
Wu J, Zou Z, Liu Y, Liu X, Zhangding Z, Xu M, Hu J (2022) CRISPR/Cas9-induced structural variations expand in T lymphocytes in vivo. Nucleic Acids Res 50(19):11128–11137. https://doi.org/10.1093/NAR/GKAC887
Article CAS PubMed PubMed Central Google Scholar
Xin C, Yin J, Yuan S, Ou L, Liu M, Zhang W, Hu J (2022) Comprehensive assessment of miniature CRISPR-Cas12f nucleases for gene disruption. Nat Commun 2022 13(1): 1–10. https://doi.org/10.1038/s41467-022-33346-1
Yin J, Liu M, Liu Y, Wu J, Gan T, Zhang W, Li Y, Zhou Y, Hu J (2019) Optimizing genome editing strategy by primer-extension-mediated sequencing. Cell Discovery 2019 5(1): 1–11. https://doi.org/10.1038/s41421-019-0088-8
Yin J, Fang K, Gao Y, Ou L, Yuan S, Xin C, Wu W, Wu Ww, Hong J, Yang H, Hu J (2022a) Safeguarding genome integrity during gene-editing therapy in a mouse model of age-related macular degeneration. Nat Commun2022a 13(1): 1–8. https://doi.org/10.1038/s41467-022-35640-4
Yin J, Lu R, Xin C, Wang Y, Ling X, Li D, Zhang W, Liu M, Xie W, Kong L, Si W, Wei P, Xiao B, Lee HY, Liu T, Hu J (2022b) Cas9 exo-endonuclease eliminates chromosomal translocations during genome editing. Nature Communications 2022b 13(1): 1–14. https://doi.org/10.1038/s41467-022-28900-w
Yoo KW, Yadav MK, Song Q, Atala A, Lu B (2022) Targeting DNA polymerase to DNA double-strand breaks reduces DNA deletion size and increases templated insertions generated by CRISPR/Cas9. Nucl Acids Res 50(7):3944–3957. https://doi.org/10.1093/NAR/GKAC186
Article CAS PubMed PubMed Central Google Scholar
Yu Z, Lu Z, Li J, Wang Y, Wu P, Li Y, Zhou Y, Li B, Zhang H, Liu Y, Ma L (2022) PEAC-seq adopts Prime Editor to detect CRISPR off-target and DNA translocation. Nat Commun 2022 13(1): 1–13. https://doi.org/10.1038/s41467-022-35086-8
Zhang W, Yin J, Zhang-Ding Z, Xin C, Liu M, Wang Y, Ai C, Hu J (2021) In-depth assessment of the PAM compatibility and editing activities of Cas9 variants. Nucleic Acids Res 49(15):8785–8795. https://doi.org/10.1093/NAR/GKAB507
Article CAS PubMed PubMed Central Google Scholar
Zuccaro MV, Xu J, Mitchell C, Marin D, Zimmerman R, Rana B, Weinstein E, King RT, Palmerola KL, Smith ME, Tsang SH, Goland R, Jasin M, Lobo R, Treff N, Egli D (2020) Allele-Specific Chromosome Removal after Cas9 Cleavage in Human Embryos. Cell 183(6):1650-1664.e15. https://doi.org/10.1016/J.CELL.2020.10.025
Article CAS PubMed Google Scholar

Download references

Acknowledgements

All figures were created with BioRender.com.

Funding

Open Access funding enabled and organized by CAUL and its Member Institutions.

Author information

Authors and Affiliations

School of Biological Sciences, University of Auckland, Auckland, New Zealand
John Murray Topp Hunt, Christopher Allan Samson, Alex du Rand & Hilary M. Sheppard

Authors

John Murray Topp Hunt
View author publications
You can also search for this author in PubMed Google Scholar
Christopher Allan Samson
View author publications
You can also search for this author in PubMed Google Scholar
Alex du Rand
View author publications
You can also search for this author in PubMed Google Scholar
Hilary M. Sheppard
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Sheppard and Hunt devised the initial concept of the manuscript, which was subsequently drafted by Hunt. All authors contributed to the analysis of the relevant literature and to critical revisions. The final manuscript has been read and approved for publication by all authors.

Corresponding author

Correspondence to Hilary M. Sheppard.

Ethics declarations

Conflict of interest

The authors have no relevant financial or non-financial interests to disclose.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Hunt, J.M.T., Samson, C.A., Rand, A.d. et al. Unintended CRISPR-Cas9 editing outcomes: a review of the detection and prevalence of structural variants generated by gene-editing in human cells. Hum Genet 142, 705–720 (2023). https://doi.org/10.1007/s00439-023-02561-1

Download citation

Received: 01 February 2023
Accepted: 13 April 2023
Published: 24 April 2023
Issue Date: June 2023
DOI: https://doi.org/10.1007/s00439-023-02561-1

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Unintended CRISPR-Cas9 editing outcomes: a review of the detection and prevalence of structural variants generated by gene-editing in human cells

Abstract

Similar content being viewed by others

Recent advances in the delivery and applications of nonviral CRISPR/Cas9 gene editing

A survey of best practices for RNA-seq data analysis

Opportunities and challenges in long-read sequencing data analysis

Introduction

Gene editing using CRISPR-Cas9

Gene editing can lead to the unintended generation of structural variants

Evidence for CRISPR-associated SVs

SVs in human cancer cells lines

SVs in primary cells, immortalized primary cells, iPSCs and HSPCs

SVs in human embryonic stem cells, zygotes, and embryos

HDR-enhancing techniques may increase the incidence of structural variants in CRISPR-edited cells

Alternative CRISPR strategies may reduce the incidence of SVs

Retention of SVs post editing

Current methods used to analyze CRISPR edits

Methods to detect structural variants in CRISPR-edited cells

Cytogenetic analysis (FISH, aCGH, and SNP-analysis)

Quantitative genotyping PCR

Targeted amplicon sequencing, long-range sequencing technologies and IDM-seq

SV capture techniques; LAM-HTGTS, PEM-seq and CAST-seq

Xdrop

PEAC-seq

Alternative SV detection methods (Strand-seq)

Clonal expansion assays

Conclusion

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation