Structural rearrangements generate cell-specific, gene-independent CRISPR-Cas9 loss of fitness effects
Abstract
Background
CRISPR-Cas9 genome editing is widely used to study gene function, from basic biology to biomedical research. Structural rearrangements are a ubiquitous feature of cancer cells and their impact on the functional consequences of CRISPR-Cas9 gene-editing has not yet been assessed.
Results
Utilizing CRISPR-Cas9 knockout screens for 250 cancer cell lines, we demonstrate that targeting structurally rearranged regions, in particular tandem or interspersed amplifications, is highly detrimental to cellular fitness in a gene-independent manner. In contrast, amplifications caused by whole chromosomal duplication have little to no impact on fitness. This effect is cell line specific and dependent on the ploidy status. We devise a copy-number ratio metric that substantially improves the detection of gene-independent cell fitness effects in CRISPR-Cas9 screens. Furthermore, we develop a computational tool, called Crispy, to account for these effects on a single sample basis and provide corrected gene fitness effects.
Conclusion
Our analysis demonstrates the importance of structural rearrangements in mediating the effect of CRISPR-Cas9-induced DNA damage, with implications for the use of CRISPR-Cas9 gene-editing in cancer cells.
Keywords
CRISPR Structural rearrangements Ploidy Copy-number CrispyBackground
Genetic loss-of-function screens are used to systematically identify genes important for cellular fitness and genetic interactions in model organisms [1, 2]. Traditionally, these have been performed with RNA interference (RNAi) [3, 4, 5], although its application to mammalian cells has been hampered by incomplete protein depletion and off-target effects [6, 7]. The advent of CRISPR-Cas technologies facilitates gene editing of human cells by addressing many of the limitations of RNAi and increases capacity to identify genes essential for cellular fitness [8, 9, 10, 11, 12, 13]. In cancer cell lines, CRISPR-Cas9 dropout screens have been integrated with genomic data sets to propose novel therapeutic targets [3, 14, 15, 16]. Tumor cell genetic instability can induce synthetic-lethal dependencies on genes that otherwise have no impact on cellular fitness [17].
Gene copy-number changes, despite being rare and often detrimental in normal cells [18], are one of the most frequent types of genomic alterations in cancers [19]. They are of particular importance when analyzing CRISPR-Cas9 experiments because targeting genomic regions that are copy-number amplified induces DNA damage responses that lead to cell cycle arrest and cell death [20, 21]. The effect is gene-independent and ubiquitous across cancer types. This increases the false-positive rate of gene loss of fitness (LOF) detection when interpreting results using CRISPR-Cas9 reagents targeting amplified regions. We and others have developed computational methods to account for this systematic bias [22, 23]. Some of these approaches are guided by knowledge of gene copy-number values, which on average are proportional to the non-specific LOF effect of CRISPR-Cas9 targeting. Nonetheless, the strength of this association varies between cell lines and amplicons with similar copy-number and is completely absent in some cases [23]. This indicates that other cellular features besides copy-number influence non-specific CRISPR-Cas9 LOF effects.
Cancer cells undergo extensive genomic alterations [24, 25, 26] and the impact of these on response to CRISPR-Cas9 targeting is poorly understood. Here, we combined CRISPR-Cas9 screens with whole-genome sequencing (WGS) and DNA SNP6 copy-number arrays to investigate the impact of structural variation (SV) on CRISPR-Cas9 response. We find that LOF effects of CRISPR-Cas9 screens mediated by copy-number amplifications are associated with structural rearrangements such as tandem duplications, in stark contrast to amplifications arising from chromosomal duplication. Considering this, we show that gene copy-number ratios, normalized by chromosome copy-number, provide more accurate identification of gene independent LOF effects. Additionally, we provide a novel computational tool that takes these effects into consideration on a per sample basis to provide accurate measurements of CRISPR-Cas9 LOF effects.
Results
Increased cell ploidy buffers non-specific CRISPR-Cas9 LOF effects
We considered publicly available genome-wide CRISPR-Cas9 knockout screens (BROAD DepMap 18Q3 depmap.org/portal/) performed in 36 different tumor types comprising 250 cancer cell lines [22, 27], which have been previously genomically characterized for copy-number and gene expression [28, 29] (Additional file 1: Figure S1a, Additional file 2: Table S1). Gene-essentiality fold-change profiles were estimated for a total of 17,328 genes, each targeted on average by 3.8 single-guide RNAs (sgRNAs). For the majority of the cell lines, two technical replicates were performed and gene averaged log-fold change values had a mean Pearson correlation (R) of 0.79. Genes previously defined as essential for cellular viability [15] were robustly recapitulated in all samples (mean area under recall curve (AURC) = 0.86), and as previously described [21], non-detrimental genes displayed a small enrichment for positive fold changes (mean AURC = 0.42) (Additional file 1: Figure S1b and S1c).
CRISPR-Cas9 screens and cell ploidy effect. a Enrichment of non-specific CRISPR-Cas9 LOF effects in non-expressed genes (RNA-seq RPKM < 1) grouped by their copy-number profile, performed across 250 cell lines. For each copy-number group, the recall curve is drawn and the area under the recall curve (AURC) is reported. X-axis shows the ranked gene level CRISPR-Cas9 fold changes, from negative to positive. b Boxplots of AURCs as in a but performed in each cell line independently. Each dot represents the AURC of the given gene copy-number in a specific cell line. c Similar to b but cell lines are grouped according to their ploidy status. d AURC of non-expressed genes estimated per chromosome in each cell line independently. Chromosomes were grouped according to their estimated number of copies. Boxplots represent 1.5 of the interquartile range
Structural rearrangements are determinants of CRISPR-Cas9 LOF
Structural variation impacts CRISPR-Cas9 response. a–c Representative examples of the strongest associations between SVs and CRISPR-Cas9 LOF. Structural rearrangements are mapped in the upper panel, in the middle panel copy-number levels are represented, and in the lower panel, CRISPR-Cas9 gene level fold changes are shown. SVs are colored with tandem duplications defined with blue lines, deletions with red lines, inversions in green, and chromosome translocations in purple. Average mean values for copy-number (middle panel) and CRISPR-Cas9 fold changes (lower panel) for each copy-number segment are represented as gray lines
Copy-number ratio improves the identification of CRISPR-Cas9 LOF effects
Gene copy-number ratio association with CRISPR-Cas9 loss of fitness effect. a Diagram of the different genomic rearrangements captured by the gene copy-number ratio and their potential effect in CRISPR-Cas9 response. b FISH of MYC amplifications (green signal) in HCC1954 tetraploid cell line. In the left panel, representative metaphase (80% of cells) with high MYC amplifications and low copy-number ratio. In the right panel, detailed view of the chromosomes containing MYC signal (upper) and the corresponding derivative chromosomes identified by M-FISH (lower). c Similar to b, FISH of MYC amplifications (green signal) in NCI-H2087 diploid cell line. In the left panel, representative metaphase (90% of cells) with high MYC copy-number ratio. For both cell lines, 10 cells were analyzed in the FISH experiments. d High copy-number ratios are enriched for strong CRISPR-Cas9 LOF effects. Recall curves of non-expressed genes grouped by their copy-number ratio profile across CRISPR-Cas9 fold changes. e Matrix of gene and chromosome copies representing the mean gene-level CRISPR-Cas9 fold change of non-expressed genes in the respective group. f Representative example of an amplified non-expressed gene with strong LOF effects associated with gene copy-number ratio (right panel) and to a lesser extent with absolute gene copy-number (left panel). CRISPR-Cas9 fold changes in e and f are scaled (known essential genes mean fold change = − 1)
Consistent with our hypothesis, we confirmed that the stringent set of tandem duplications and deletions identified previously showed significantly higher copy-number ratios (median = 1.24) compared to deletions (median = 0.74) (Welch’s t test p value = 1.8e−3) (Additional file 1: Figure S2e). Higher copy-number ratios were significantly enriched for genes commonly amplified in tumors such as oncogenes, e.g., CCND1 and EGFR, conversely copy-number ratios smaller than one were enriched, but not significantly, for known tumor suppressors, e.g., CDKN2A and TP53 (Additional file 1: Figure S3a, Additional file 3: Table S2). We confirmed that high copy-number ratios represent strong focal tandem amplifications by performing fluorescence in situ hybridization (FISH) in two MYC amplified cell lines with distinct copy-number ratios (Fig. 3b, c). Moreover, chromosome copy-number estimations from SNP arrays were consistent with FISH karyotypes for the cell lines tested (Additional file 1: Figure S4a and S4b). Thus, gene copy-number ratio allows us to differentiate gene duplications that originate from whole chromosome/genome duplication from those arising from defined amplification events, such as tandem amplifications, which we hypothesize induces stronger CRISPR-Cas9 LOF effects.
Utilizing the copy-number ratio, we observed that non-expressed genes with copy-number ratios greater than 1 showed strikingly higher LOF effects (Fig. 3d). Ratios greater than or equal to 4 displayed among the strongest LOF effects captured in the screen, with non-expressed genes showing mean effects of 73.3% of those of known essential genes (Additional file 1: Figure S3b). The distribution of the copy-number ratios across all 250 cell lines was centered around 1, confirming that the vast majority of copy-number alterations originate from whole chromosome duplications (Additional file 1: Figure S3c). Notably, we observed thousands of occurrences of copy-number amplified genes with a copy-number ratio close to 1 which displayed no CRISPR-Cas9 LOF effects (Fig. 3e, Additional file 1: Figure S3c). As an example, among the frequently amplified and not expressed genes, neuronal differentiation 2 (NEUROD2) gene copy-number ratio recapitulated more clearly the LOF response than absolute copy number (Fig. 3f). Cell lines from multiple tumor types with 3, 4, 5 and 6 copies of NEUROD2 have very limited LOF effects, which could lead to incorrect correction of LOF effects in these cell lines if using approaches based on absolute copy number [22].
Taken together, our results indicate that non-specific LOF effects induced by targeting of copy-number amplified regions are enriched for tandem or interspersed duplicated regions, while copy-number amplifications originating from chromosome duplication have little to no effect. We cannot exclude that other complex structural rearrangements might also be captured by the gene copy-number ratio metric. Nonetheless, these observations have important implications for the analysis of CRISPR-Cas9 datasets, suggesting that correcting gene-independent LOF effects based on absolute copy number could, in many instances, lead to incorrect estimates.
Crispy is a single sample copy-number correction tool for CRISPR-Cas9 screens
Crispy single sample correction at copy-number segment level. a Crispy pipeline taking as inputs sgRNA CRISPR-Cas9 raw-counts and copy-number segmentation. Gaussian process regression (GPR) is performed at the segment level on per sample basis. The tool outputs corrected fold changes and copy-number ratios and facilitates plotting functions for QC assessment. b Representative example of GPR fit on AU565 cell line. Yellow area represents the standard deviation estimated by the GPR. c Copy-number ratio AURCs obtained by ranking the original uncorrected CRISPR-Cas9 fold changes, x-axis, and the Crispy corrected fold changes, y-axis. d Recall of known essential genes before and after fold change correction. e Comparison between CERES and Crispy copy-number ratio AURCs of corrected fold changes. f Correlation of CERES and Crispy corrected fold changes with original uncorrected fold changes. Scatter plot in c and e represent the median AURCs and error bars the 25th and 75th percentile
Next, we benchmarked our approach against another copy-number correction tool, CERES [22], which contrary to Crispy performs a sgRNA level correction across multiple samples. We observed that Crispy provides a small but consistent improvement, reducing the median AURCs across the different copy-number ratios (Fig. 4e, Additional file 1: Figure S5a and S5b). Unlike CERES, Crispy does not boost the recall of known essential genes compared to the original fold changes (Additional file 1: Figure S5c). These improvements are more prevalent in cell lines that display weaker correlation between replicates (Additional file 1: Figure S5d), likely due to CERES modeling sgRNAs LOF as a shared effect across multiple cell lines, and therefore borrowing information from other samples. Consistent with this, Crispy corrected fold changes were more similar to the original fold changes (Fig. 4f), indicating that it effectively preserves the underlying LOF effect while correcting for bias due to unbalanced structural alterations.
In summary, Crispy is an open-source Python tool that can be used to correct in a supervised way CRISPR-Cas9 gene independent copy-number LOF effects on per sample basis using segment level copy-number ratios.
Discussion
In this study, we demonstrate that copy-number amplifications lead to non-specific CRISPR-Cas9 LOF effects if originating from structural rearrangements, in particular tandem or interspersed duplications. In contrast, little or no impact is observed from gene amplifications associated with increased cell ploidy. Therefore, gene independent LOF effects seem to be cell line specific. We devised a gene copy-number ratio metric, normalized by chromosome copy-number, that improves the ability to classify which copy-number amplifications will result in a LOF bias in CRISPR-Cas9 experiments. Notably, our findings were recapitulated when considering only non-expressed genes, emphasizing that the LOF bias is not due to a potential biological function of the genes. Combining the copy-number ratios, WGS, and FISH experiments, we found that tandem duplications are amongst the most frequent SVs associated with CRISPR-Cas9 non-specific deleterious effects. Based on these observations, we developed a computational method, Crispy, to perform on a single sample basis the correction of CRISPR-Cas9 LOF effects due to targeting copy-number amplified regions. Crispy corrected fold changes retained high correlation with the original fold changes, preserved the ability to recall known essential genes, and improved on methods designed to correct these effects across different samples.
While this work furthers our understanding of the implication of SV in interpreting CRISPR-Cas9 screens several limitations remain. Due to the complexity of structural rearrangements we cannot exclude that other events might also play a role. For example, extrachromosomal DNAs (ecDNA) have been found to be widespread in cancer [32, 33] and are tandem duplicated rich DNA sequences, although we have no evidence of ecDNAs in the cell lines analyzed with FISH. Furthermore, we also observed that complex structural rearrangements involving multiple unbalanced chromosomal translocations overlap with some of the strongest LOF effects in the screens. This illustrates the complexity and the limitations of interpreting accurately the efficacy of CRISPR-Cas9 reagents that target a locus that is structurally rearranged.
Structural rearrangements are amongst the most common alterations in cancer [25] indicating that our findings are of general importance when designing and interpreting CRISPR-Cas9 experiments in cancer cells. Specifically, targeting genes that reside within tandem duplicated or highly rearranged and unbalanced regions, whether knocking out individual genes, performing genetic screens using a library of sgRNAs, or performing specific gene edits, will lead to strong non-specific LOF effects. While Crispy can correct for this effect in CRISPR-Cas9 sgRNA library screens, for many studies sufficient data to robustly train models to correct these effects is unlikely to be available, for example the common scenario of a single gene knockout in an individual cell line. In these instances, information about gene copy-number, cell line ploidy and ideally SV information should be incorporated to guide interpretation of LOF effects. Furthermore, the use of an orthogonal technology such as RNAi or CRISPR interference to corroborate results is advisable. For CRISPR-Cas9 sgRNA library screens where reliable copy number data are unavailable, an unsupervised LOF correction method such as CRISPRcleanR [23] can be used. We expect that the bias in CRISPR-Cas9 data described here is a general phenomenon and consequently will be observed in other cancer cell models such as patient-derived xenografts and organoids, and are potentially also present in other types of CRISPR-Cas-based systems that introduce DNA double-strand breaks [34, 35].
Conclusions
CRISPR-Cas9 targeting of structurally rearranged regions, in particular tandem or interspersed amplifications, is detrimental to cellular fitness in a gene independent manner. Targeting amplifications caused by whole chromosomal duplications has little to no impact on fitness. Crispy is a computational tool that accounts for this effect in screening data by correcting copy-number bias on a per sample basis. Our findings have implications for the design, analysis, and interpretation of CRISPR-Cas9-based methods for commonly used approaches such as gene knockouts, sgRNA library screens and targeted gene-editing.
Materials and methods
Processing of CRISPR-Cas9, SNP6, and RNA-seq samples
Publicly available CRISPR-Cas9, BROAD DepMap 18Q3, drop-out screens across 250 cell lines was utilized to assess the loss of fitness (LOF) impact of knocking-out 17,328 genes [22, 27]. Raw sequence counts of each sgRNA were downloaded and corrected by library size in each sample. Non-targeting plasmid control sample was used and sgRNAs with lower than 30 counts were discarded. Log2 sgRNA fold changes were estimated between samples and the plasmid control. Gene level estimates of the fold changes were calculated by averaging all mapping sgRNA fold changes. Single nucleotide polymorphism (SNP) array hybridization using the Affymetrix SNP6.0 platform was performed according to Affymetrix protocols. Segment copy-number variants were obtained using PICNIC [36] as previously described [29]. RNA-seq experiments for CRISPR-Cas9 profiled cell lines were assembled from multiple data-sets [37]. To minimize technical bias, all samples were processed with the same pipeline, iRAP [38], to obtain raw counts. Genes with Reads Per Kilobase per Million (RPKM) with zero counts were termed as non-expressed in the particular sample. Non-expressed genes were defined as those with a RPKM lower than 1.
Chromosome harvest and fluorescence in situ hybridization (FISH)
Metaphase chromosomes were harvested from the cancer cell lines after incubation with 0.05 g/ml of colcemide (Thermo-Fisher) for 2–3 h. Subsequently, cells were treated with a buffered hypotonic solution (0.4% KCl in 10 mM HEPES, pH 7.4) for 8–12 min at 37 °C and fixed with 4:1(v/v) methanol: glacial fixative. The human fosmid clone WI2-1694H13 was labeled with green-dUTP as described in [32]. Human 24 color FISH (M-FISH) probe preparation and slides treatments followed [39] with slight modifications. Freshly-prepared metaphase slides were immersed in acetone for 10 min and then baked at 62 C for 1 h. Slides were denatured in an alkaline denaturation solution (0.5 M NaOH, 1.5 M NaCl, Sigma-Aldrich) for 9–10 min. Metaphases were examined with a Zeiss AxioIamger D1 fluorescence microscope. FISH images were captured using the SmartCapture software (Digital Scientific UK) and karyotyped using the SmartType Karyotyper software (Digital Scientific UK). Ten metaphases for each sample were analyzed by M-FISH.
Whole-genome sequencing
DNA of 4 cancer cell lines and 4 EBV derived matched normal cell lines were obtained and sequenced with massively parallel Illumina sequencing technology (EGAD00001004124) and aligned to the human reference genome GRCh37 using Burrows-Wheeler Aligner (v0.5.9) [40]. Average sequence coverage was 43-fold for cancer cell lines and 42-fold for matched normal. Somatic structural rearrangements were identified by providing aligned bam files to BRASS (BReakpoint AnalySiS) (https://github.com/cancerit/BRASS/). BRASS calls structural variations via assembly of discordant paired-end reads.
Identification of structural rearrangements overlapping with copy-number segments was limited to tandem duplications and deletions identified with BRASS. To increase confidence in the SVs found, a BRASS assembly score was required for the tandem duplication or deletion to be considered. Then for each cell line, the tandem duplications and deletions were searched against all copy-number segments (identified with PICNIC) to find those SVs and segments for which start and end sites overlapped. A mismatch tolerance range of 10 Kbp was used for both start and end sites.
Crispy, single sample method to correct copy-number gene-independent effects in CRISPR-Cas9 screens
Crispy is a Python tool to model the copy-number impact on LOF effects in CRISPR-Cas9 on a per sample basis. For each sample the required inputs are (i) the sgRNA raw-counts together with their targeting genomic information and (ii) the copy-number segmentation output, typically a BED file containing the segment mapping chromosome, start and end genomic positions and absolute copy-number. Segments and sgRNAs are intersected using BEDtools (v2.27.1) [41] and pybedtools (v0.7.10) [42]. For each segment defined by the copy-number segmentation algorithm two metrics are computed: (i) a copy-number ratio, i.e., segment copy-number divided by the estimated chromosome copy-number; and (ii) a mean CRISPR-Cas9 fold change of all the sgRNAs that overlap with the segment. Gaussian Processes regression implemented on scikit-learn Python module (v0.19.1) [43] is used to model the non-linear effects between the segment copy-number ratio and the CRISPR-Cas9 fold changes. Specifically, a squared-exponential kernel (RBF) with a length scale (ө) hyperparameter varying between 1e−5 and 10 is used. A constant (σ) and noise (ψ) kernels are also added:
Gene copy-number ratios
Gene copy-number ratios, i.e., number of absolute gene copies divided by the number of copies of the respective chromosome, are calculated for all genes covered by the CRISPR screens and SNP6 arrays. Gene and chromosome absolute copy-number values are estimated by taking the copy-number weighted mean of all the mapping segments weighted by their size.
To verify that high copy-number ratios represent focal chromosome amplifications, we applied FISH and probed the location of the frequently and highly amplified oncogene MYC. We chose 2 cell lines (HCC1954 and NCI-H2087) with high MYC absolute copy-number (9 and 7, respectively) but discordant copy-number ratios (1.58 and 4.05, respectively) due to different ploidy. A control triploid cell line (LS1034) with diploid MYC and copy-number ratio of 1 was analyzed and corroborated our prediction that chromosome 8 is mostly diploid and contains 2 copies of MYC (Additional file 1: Figure S4a).
Code availability
Crispy is a Python module (https://github.com/EmanuelGoncalves/crispy), and its code and the source code for the analysis in this manuscript are distributed under the open-source 3-Clause BSD License. To facilitate usability and adaptation, Crispy can be easily installed through the commonly used PyPI repository (https://pypi.org/project/cy/) and instructions are provided.
Notes
Acknowledgements
We gratefully acknowledge Pedro Beltrao, Dominik Glodzik, Helen Davies, Simon Brunner and Aviad Tsherniak for helpful comments and discussions. We acknowledge the Wellcome Sanger Institute Sequencing Core Facility and Core IT Facility teams for general support.
Funding
Work in M.G. lab is supported by funding from Open Targets (OTAR015), CRUK (C44943/A22536), SU2C (SU2C-AACR-DT1213) and the Wellcome Trust (206194 and 098051).
Availability of data and materials
The CRISPR-Cas9 datasets, DepMap Broad 18Q3 public, supporting the conclusions of this article were downloaded through the portal https://depmap.org/portal/ and in the figshare repository, https://doi.org/10.6084/m9.figshare.6931364.v1 [27]. Genome-wide copy-number ratios estimated from SNP6 arrays for 250 cell lines and gene-level CRISPR-Cas9 Crispy corrected log2 fold changes are also available from figshare repository https://figshare.com/articles/Structural_rearrangements_generate_cell-specific_gene-independent_CRISPR-Cas9_loss_of_fitness_effects/7610918 [44] Other public datasets supporting the conclusions of this paper include: copy-number SNP6 DNA-arrays and whole-genome sequencing both are publically available through EGA repository, URLs are, respectively, https://ega-archive.org/datasets/EGAD00001004124 [45] and https://www.ebi.ac.uk/ega/studies/EGAS00001000978 [46]. Crispy and analysis code are available in the GitHub repository (https://github.com/EmanuelGoncalves/crispy, doi:10.5281/zenodo.2530755) [47], released under 3-Clause BSD License.
Authors’ contributions
EG, FI, KY, and MG conceived the work. EG and DA built the software. EG, DA, FI, and MG carried out validation. EG, DA FB, SL, FY, and MG performed the data analyses and curation. All of the authors wrote, read and approved the manuscript.
Ethics approval and consent to participate
Not applicable.
Competing interests
E.S. is an employee of GSK. M.G. has performed consultancy for Sanofi and receives research funding from AstraZeneca. This work was funded by Open Targets which is private-public initiative. All other authors declare no competing interests.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary material
References
- 1.Costanzo M, Baryshnikova A, Bellay J, Kim Y, Spear ED, Sevier CS, et al. The genetic landscape of a cell. Science. 2010;327:425–31.CrossRefGoogle Scholar
- 2.Berns K, Hijmans EM, Mullenders J, Brummelkamp TR, Velds A, Heimerikx M, et al. A large-scale RNAi screen in human cells identifies new components of the p53 pathway. Nature. 2004;428:431–7.CrossRefGoogle Scholar
- 3.Marcotte R, Sayad A, Brown KR, Sanchez-Garcia F, Reimand J, Haider M, et al. Functional genomic landscape of human breast cancer drivers, vulnerabilities, and resistance. Cell. 2016;164:293–309.CrossRefGoogle Scholar
- 4.McDonald ER 3rd, de Weck A, Schlabach MR, Billy E, Mavrakis KJ, Hoffman GR, et al. Project DRIVE: a compendium of cancer dependencies and synthetic lethal relationships uncovered by large-scale, deep RNAi screening. Cell. 2017;170:577–92 e10.CrossRefGoogle Scholar
- 5.Tsherniak A, Vazquez F, Montgomery PG, Weir BA, Kryukov G, Cowley GS, et al. Defining a cancer dependency map. Cell. 2017;170:564–76 e16.CrossRefGoogle Scholar
- 6.Jackson AL, Burchard J, Leake D, Reynolds A, Schelter J, Guo J, et al. Position-specific chemical modification of siRNAs reduces “off-target” transcript silencing. RNA. 2006;12:1197–205.CrossRefGoogle Scholar
- 7.Echeverri CJ, Beachy PA, Baum B, Boutros M, Buchholz F, Chanda SK, et al. Minimizing the risk of reporting false positives in large-scale RNAi screens. Nat Methods. 2006;3:777–9.CrossRefGoogle Scholar
- 8.Shalem O, Sanjana NE, Hartenian E, Shi X, Scott DA, Mikkelson T, et al. Genome-scale CRISPR-Cas9 knockout screening in human cells. Science. 2014;343:84–7.CrossRefGoogle Scholar
- 9.Wu X, Scott DA, Kriz AJ, Chiu AC, Hsu PD, Dadon DB, et al. Genome-wide binding of the CRISPR endonuclease Cas9 in mammalian cells. Nat Biotechnol. 2014;32:670–6.CrossRefGoogle Scholar
- 10.Wang T, Wei JJ, Sabatini DM, Lander ES. Genetic screens in human cells using the CRISPR-Cas9 system. Science. 2014;343:80–4.CrossRefGoogle Scholar
- 11.Koike-Yusa H, Li Y, Tan E-P, Velasco-Herrera MDC, Yusa K. Genome-wide recessive genetic screening in mammalian cells with a lentiviral CRISPR-guide RNA library. Nat Biotechnol. 2014;32:267–73.CrossRefGoogle Scholar
- 12.Morgens DW, Deans RM, Li A, Bassik MC. Systematic comparison of CRISPR/Cas9 and RNAi screens for essential genes. Nat Biotechnol. 2016;34:634–6.CrossRefGoogle Scholar
- 13.Evers B, Jastrzebski K, Heijmans JPM, Grernrum W, Beijersbergen RL, Bernards R. CRISPR knockout screening outperforms shRNA and CRISPRi in identifying essential genes. Nat Biotechnol. 2016;34:631–3.CrossRefGoogle Scholar
- 14.Tzelepis K, Koike-Yusa H, De Braekeleer E, Li Y, Metzakopian E, Dovey OM, et al. A CRISPR dropout screen identifies genetic vulnerabilities and therapeutic targets in acute myeloid leukemia. Cell Rep. 2016;17:1193–205.CrossRefGoogle Scholar
- 15.Hart T, Chandrashekhar M, Aregger M, Steinhart Z, Brown KR, MacLeod G, et al. High-resolution CRISPR screens reveal fitness genes and genotype-specific cancer liabilities. Cell. 2015;163:1515–26.CrossRefGoogle Scholar
- 16.Wang T, Yu H, Hughes NW, Liu B, Kendirli A, Klein K, et al. Gene essentiality profiling reveals gene networks and synthetic lethal interactions with oncogenic Ras. Cell. 2017;168:890–903 e15.CrossRefGoogle Scholar
- 17.Kaelin WG Jr. The concept of synthetic lethality in the context of anticancer therapy. Nat Rev Cancer. 2005;5:689–98.CrossRefGoogle Scholar
- 18.Itsara A, Cooper GM, Baker C, Girirajan S, Li J, Absher D, et al. Population analysis of large copy number variants and hotspots of human genetic disease. Am J Hum Genet. 2009;84:148–61.CrossRefGoogle Scholar
- 19.Beroukhim R, Mermel CH, Porter D, Wei G, Raychaudhuri S, Donovan J, et al. The landscape of somatic copy-number alteration across human cancers. Nature. Nature Publishing Group. 2010;463:899–905.Google Scholar
- 20.Aguirre AJ, Meyers RM, Weir BA, Vazquez F, Zhang C-Z, Ben-David U, et al. Genomic copy number dictates a gene-independent cell response to CRISPR/Cas9 targeting. Cancer Discov. American Association for Cancer Research. 2016;6:914–29.Google Scholar
- 21.Munoz DM, Cassiani PJ, Li L, Billy E, Korn JM, Jones MD, et al. CRISPR screens provide a comprehensive assessment of cancer vulnerabilities but generate false-positive hits for highly amplified genomic regions. Cancer Discov. 2016;6:900–13.CrossRefGoogle Scholar
- 22.Meyers RM, Bryan JG, McFarland JM, Weir BA, Sizemore AE, Xu H, et al. Computational correction of copy number effect improves specificity of CRISPR-Cas9 essentiality screens in cancer cells. Nat Genet. 2017;49:1779–84.CrossRefGoogle Scholar
- 23.Iorio F, Behan FM, Gonçalves E, Bhosle SG, Chen E, Shepherd R, et al. Unsupervised correction of gene-independent cell responses to CRISPR-Cas9 targeting. BMC Genomics. 2018;19:604.CrossRefGoogle Scholar
- 24.Sudmant PH, Rausch T, Gardner EJ, Handsaker RE, Abyzov A, Huddleston J, et al. An integrated map of structural variation in 2,504 human genomes. Nature. 2015;526:75–81.CrossRefGoogle Scholar
- 25.Li Y, Roberts N, Weischenfeldt J, Wala JA, Shapira O, Schumacher S, et al. Patterns of structural variation in human cancer [Internet]. bioRxiv. 2017 [cited 2017 Dec 14]. p. 181339. Available from: https://www.biorxiv.org/content/early/2017/08/27/181339
- 26.Glodzik D, Morganella S, Davies H, Simpson PT, Li Y, Zou X, et al. A somatic-mutational process recurrently duplicates germline susceptibility loci and tissue-specific super-enhancers in breast cancers. Nat Genet. 2017;49:341–8.CrossRefGoogle Scholar
- 27.DepMap B. DepMap Achilles 18Q3 public [Internet]. 2018. Available from: https://figshare.com/articles/DepMap_Achilles_18Q3_public/6931364 Google Scholar
- 28.Garnett MJ, Edelman EJ, Heidorn SJ, Greenman CD, Dastur A, Lau KW, et al. Systematic identification of genomic markers of drug sensitivity in cancer cells. Nature. 2012;483:570–5.CrossRefGoogle Scholar
- 29.Iorio F, Knijnenburg TA, Vis DJ, Bignell GR, Menden MP, Schubert M, et al. A landscape of pharmacogenomic interactions in cancer. Cell. 2016;166:740–54.CrossRefGoogle Scholar
- 30.Nik-Zainal S, Davies H, Staaf J, Ramakrishna M, Glodzik D, Zou X, et al. Landscape of somatic mutations in 560 breast cancer whole-genome sequences. Nature. 2016;534:47–54.CrossRefGoogle Scholar
- 31.McBride DJ, Etemadmoghadam D, Cooke SL, Alsop K, George J, Butler A, et al. Tandem duplication of chromosomal segments is common in ovarian and breast cancer genomes. J Pathol. 2012;227:446–55.CrossRefGoogle Scholar
- 32.Stephens PJ, Greenman CD, Fu B, Yang F, Bignell GR, Mudie LJ, et al. Massive genomic rearrangement acquired in a single catastrophic event during cancer development. Cell. 2011;144:27–40.CrossRefGoogle Scholar
- 33.Turner KM, Deshpande V, Beyter D, Koga T, Rusert J, Lee C, et al. Extrachromosomal oncogene amplification drives tumour evolution and genetic heterogeneity. Nature. 2017;543:122.CrossRefGoogle Scholar
- 34.Barrangou R, Doudna JA. Applications of CRISPR technologies in research and beyond. Nat Biotechnol. 2016;34:933–41.CrossRefGoogle Scholar
- 35.Knott GJ, Doudna JA. CRISPR-Cas guides the future of genetic engineering. Science. 2018;361:866–9.CrossRefGoogle Scholar
- 36.Greenman CD, Bignell G, Butler A, Edkins S, Hinton J, Beare D, et al. PICNIC: an algorithm to predict absolute allelic copy number variation with microarray cancer data. Biostatistics. 2010;11:164–75.CrossRefGoogle Scholar
- 37.Garcia-Alonso LM, Iorio F, Matchan A, Fonseca NA, Jaaks P, Peat G, et al. Transcription factor activities enhance markers of drug sensitivity in cancer. Cancer Res. 2017;canres.1679.2017.Google Scholar
- 38.Fonseca NA, Petryszak R, Marioni J, Brazma A. iRAP - an integrated RNA-seq analysis pipeline [Internet]. bioRxiv. 2014 [cited 2018 Feb 26]. p. 005991. Available from: http://biorxiv.org/content/early/2014/06/06/005991
- 39.Agu CA, Soares FAC, Alderton A, Patel M, Ansari R, Patel S, et al. Successful generation of human induced pluripotent stem cell lines from blood samples held at room temperature for up to 48 hr. Stem Cell Reports. 2015;5:660–71.CrossRefGoogle Scholar
- 40.Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25:1754–60.CrossRefGoogle Scholar
- 41.Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010;26:841–2.CrossRefGoogle Scholar
- 42.Dale RK, Pedersen BS, Quinlan AR. Pybedtools: a flexible Python library for manipulating genomic datasets and annotations. Bioinformatics. 2011;27:3423–4.CrossRefGoogle Scholar
- 43.Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, et al. Scikit-learn: machine learning in python. J Mach Learn Res. 2011;12:2825–30.Google Scholar
- 44.Glodzik D, Morganella S, Davies H, Simpson PT, Li Y, Zou X, et al. A somatic-mutational process recurrently duplicates germline susceptibility loci and tissue-specific super-enhancers in breast cancers. Dataset. Figshare. https://figshare.com/articles/Structural_rearrangements_generate_cell-specific_gene-independent_CRISPR-Cas9_loss_of_fitness_effects/7610918
- 45.Glodzik D, Morganella S, Davies H, Simpson PT, Li Y, Zou X, et al. A somatic-mutational process recurrently duplicates germline susceptibility loci and tissue-specific super-enhancers in breast cancers. Dataset. European Genome-Phenome Archive. https://ega-archive.org/datasets/EGAD00001004124.
- 46.Glodzik D, Morganella S, Davies H, Simpson PT, Li Y, Zou X, et al. A somatic-mutational process recurrently duplicates germline susceptibility loci and tissue-specific super-enhancers in breast cancers. Dataset. European Genome-Phenome Archive. https://www.ebi.ac.uk/ega/studies/EGAS00001000978.
- 47.Gonçalves E, Behan FM, Louzada S, Arnol D, Stronach EA, Yang F, Yusa K, Stegle O, Iorio F, Garnett MJ. Structural rearrangements generate cell-specific, geneindependent CRISPR-Cas9 loss of fitness effects. Software. Zenodo. . https://doi.org/10.5281/zenodo.2530755.
Copyright information
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.