Pan-cancer analysis of non-oncogene addiction to DNA repair

Bermúdez-Guzmán, Luis

doi:10.1038/s41598-021-02773-3

Pan-cancer analysis of non-oncogene addiction to DNA repair

Article
Open access
Published: 01 December 2021

Volume 11, article number 23264, (2021)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Pan-cancer analysis of non-oncogene addiction to DNA repair

Download PDF

Luis Bermúdez-Guzmán ORCID: orcid.org/0000-0002-0749-4010^1,2

2657 Accesses
5 Citations
Explore all metrics

Abstract

Cancer cells usually depend on the aberrant function of one or few driver genes to initiate and promote their malignancy, an attribute known as oncogene addiction. However, cancer cells might become dependent on the normal cellular functions of certain genes that are not oncogenes but ensure cell survival (non-oncogene addiction). The downregulation or silencing of DNA repair genes and the consequent genetic and epigenetic instability is key to promote malignancy, but the activation of the DNA-damage response (DDR) has been shown to become a type of non-oncogene addiction that critically supports tumour survival. In the present study, a systematic evaluation of DNA repair addiction at the pan-cancer level was performed using data derived from The Cancer Dependency Map and The Cancer Genome Atlas (TCGA). From 241 DDR genes, 59 were identified as commonly essential in cancer cell lines. However, large differences were observed in terms of dependency scores in 423 cell lines and transcriptomic alterations across 18 cancer types. Among these 59 commonly essential genes, 14 genes were exclusively associated with better overall patient survival and 19 with worse overall survival. Notably, a specific molecular signature among the latter, characterized by DDR genes like UBE2T, RFC4, POLQ, BRIP1, and H2AFX showing the weakest dependency scores, but significant upregulation was strongly associated with worse survival. The present study supports the existence and importance of non-oncogenic addiction to DNA repair in cancer and may facilitate the identification of prognostic biomarkers and therapeutic opportunities.

Integrating Multi-omics Data to Dissect Mechanisms of DNA repair Dysregulation in Breast Cancer

Article Open access 26 September 2016

DNA Damage Response Pathways in Cancer Predisposition and Metastasis

Low E2F2 activity is associated with high genomic instability and PARPi resistance

Article Open access 21 October 2020

Although tumours develop through a multistage process driven by the acquisition of genetic and epigenetic abnormalities, many of them become dependent on one or few genes to promote malignancy. As many of these genes were originally identified as oncogenes, this attribute was named oncogene addiction¹. However, it is known that tumours can also become dependent on the normal cellular functions of certain genes, which themselves are not classical oncogenes, an attribute known as non-oncogene addiction (NOA)².

In order to achieve uncontrolled proliferation, tumour cells rely upon the downregulation or epigenetic silencing of DNA Damage Repair (DDR) genes and the consequent increase in genetic and epigenetic instability³. Genome instability is indeed a fundamental hallmark of cancer⁴, possibly linked to oncogene-induced DNA damage⁵. Accordingly, it has been demonstrated that DDR genes alterations are prevalent in multiple human cancer types^6,7. Despite the need for this genomic instability, tumours require some degree of DNA repair proficiency to survive the damage induced by genotoxic stress, uncontrolled proliferation, and treatments. Thus, the activation of the DNA-damage response in cancer can be considered as a type of NOA that critically supports tumour survival⁸. NOA genes are especially associated with stress maintenance functions such as DNA double-stranded break repair, chromatid segregation, and DNA replication regulation⁹. Accordingly, several DNA repair genes have been shown to exhibit Copy Number Variation (CNV) gain in cancer, which is positively correlated with upregulation of DDR genes¹⁰.

Given the importance of both oncogene and non-oncogene addiction, different groups have developed a variety of approaches to identify and prioritize cancer dependencies and vulnerabilities to exploit them therapeutically. The Cancer Dependency Map was initially created by systematically identifying genetic dependencies using RNAi-based loss-of-function genetic screens in 501 cancer cell lines¹¹. Additionally, two large pan-cancer CRISPR-Cas9 screens were also independently performed with the same goal, containing data from over 1000 screens of more than 900 cell lines^12,13. Based on both approaches, a combined CRISPR-shRNA dependency score was later developed, providing a more sensitive measure to identify essential genes¹⁴. These loss-of-function genetic screens have allowed investigating cellular drug mechanism-of-action in cancer cell lines¹⁵ as well as the identification of potential therapeutic targets like the DNA repair helicase WRN, a synthetic lethal vulnerability and promising target in cancers with microsatellite instability¹⁶.

In this study, I integrated data derived from The Cancer Dependency Map and transcriptomic data derived from The Cancer Genome Atlas (TCGA) to characterize the non-oncogene addiction to DNA repair across the pan-cancer scale. Next, I explored the relevance of DDR genes in terms of patient survival and potential therapeutic alternatives. Following this approach, a molecular signature of overexpressed DDR genes showing the weakest dependency scores was strongly associated with worse survival. The fact that cancer cells overexpress these genes despite the weak dependence on them (as growth promoters), suggests that they may act as late promoters of cell survival. Thus, as opposed to traditional cancer drivers, this supports the existence and importance of non-oncogenic addiction to DNA repair in cancer. This approach may facilitate the identification of prognostic biomarkers and therapeutic opportunities by targeting this non-oncogene addiction.

Results

Efficacy and selectivity of DNA repair genes

One of the few comprehensive lists of human DDR genes was published in 2005¹⁷. In order to obtain an updated list, a literature search was done and the most comprehensive list found was used as a starting point⁶. This list comprised 276 DDR genes annotated in ten different DNA repair pathways, but genes involved in the nucleotide pool maintenance were excluded in this study as that is not truly a pathway. First, to identify the essentiality of these genes across different types of cancer cell lines, the efficacy and selectivity scores (θ = 0.6; CRISPR:shRNA = 60:40; see methods) were obtained from shinyDepMap¹⁴. These metrics are derived from a combined CRISPR-shRNA gene dependency score based on the Cancer Dependency Map dataset¹¹. Efficacy scores represent the degree to which loss of a particular gene reduces cell growth in sensitive lines while selectivity represents the degree to which its essentiality varies across lines. After removing genes that were not found in the shinyDepMap dataset, 241 DDR genes (Supplementary Table 1) were grouped in nine different DNA repair pathways and the efficacy scores for each pathway were compared (p = 0.02, Kruskal–Wallis rank-sum test, Fig. 1a). Although it seems that some pathways contain more genes with strongly negative efficacy scores, no statistically significant differences were found after Dunn’s Kruskal–Wallis Multiple Comparisons test. The same trend was observed for selectivity (p = 0.0004, Kruskal–Wallis rank-sum test, Fig. 1b).

Next, as some pathways showed different behaviour in terms of efficacy and selectivity, the association between both metrics was determined. Genes were also grouped based on pathway annotation but only NER showed a statistically significant positive correlation (R = 0.29, p = 0.045, Spearman’s correlation test, Fig. 2). Additionally, to identify genes that are commonly essential in cancer cell lines (negative selectivity and negative efficacy scores), an “essentiality threshold” was set based on previous findings analyzing the whole set of human genes¹⁴. Following this approach, genes with efficacy scores higher than − 0.5 and selectivity scores higher than 0.0 were excluded. From the 241 genes initially included, only 59 were classified as commonly essential following these criteria (quadrant III, Fig. 2).

Dependency scores of commonly essential DDR genes

To explore deeper the essentiality of these 59 DDR genes, the combined dependency scores (θ = 0.6; CRISPR:shRNA = 60:40) across 423 cancer cell lines were also obtained from the shinyDepMap dataset (Supplementary Table 2). Moreover, the lineage-dependent essentiality of 17 cell lineages was also analyzed. Only genes with at least one dependent lineage were included. Differences were found between these 59 genes in terms of dependency (p-value < 2.2e−16, Kruskal–Wallis rank-sum test, Fig. 3a). Only 43 of these genes had at least one dependent lineage (Fig. 3b), meaning that all cell lines for that lineage must have dependency scores lower than the essentiality threshold (near − 0.5) obtained by analyzing all human genes¹⁴. Notice that as expected, genes showing strongly negative dependency scores have more dependent lineages.

Transcriptomic alterations of commonly essential DDR genes

Since these 59 commonly essential DDR genes can also be essential for normal cells, the expression patterns of these genes were analyzed. For this purpose, the TACCO (Transcriptome Alterations in CanCer Omnibus) web server¹⁸ was used to obtain the log2 fold change (log2 FC) of these genes from paired normal-tumour samples. TACCO includes the mRNAs and miRNA expression levels (Transcripts Per Million) of 26 types of cancer from the Broad GDAC Firehose. Gene expression data were obtained for 18 types of cancer (Supplementary Table 3) after establishing two conditions: (1) only genes with a statistically significant log2 fold change (after p-value adjustment) were included and (2) only cancer types in which at least half of DDR genes meet condition 1 were included. Differences were also found between the 59 genes in terms of log2 FC, with some genes being downregulated and others highly upregulated (p-value < 2.2e−16, Kruskal–Wallis rank-sum test, Fig. 4a). Next, to have a better visualization of the expression levels of these 59 DDR genes across the 18 types of cancer, a heatmap was elaborated with the log2 FC of each gene (Fig. 4b). Three groups are observed among the cancer types with at least 10 genes being highly upregulated in most of them. The majority of these genes are part of several pathways or participate in Homologous Recombination. For many genes either there is no significant fold change (grey colour), or they are downregulated.

Since it was noticed that genes with strongly negative dependency scores (more important for cancer cell growth) were not always highly upregulated, all variables were analyzed together to further explore clustering patterns. The efficacy and selectivity scores were included as continuous variables. The dependency scores of the 59 genes (for the whole set of 423 cancer cell lines) and the log2 FC (for the 18 types of cancer) were transformed from continuous to categorical (ordinal) variables (see methods) (Supplementary Table 4). A distance matrix was created based on Gower’s similarity coefficient. Three clusters were generated using the Partitioning Around Medoids (PAM) and hclust algorithms. The t-Distributed Stochastic Neighbor Embedding (tSNE) algorithm (perplexity = 10, max. iteration = 1000) and Hierarchical clustering were used for dimensionality reduction. Figure 5 shows the three clusters generated based on the two clustering algorithms. Interestingly, cluster 1 consists of upregulated genes with strongly negative dependency and efficacy scores. On the contrary, cluster 3 consists of upregulated genes but they all have the weakest negative dependency and efficacy scores. Cluster 2 consists of genes with intermediate scores.

Commonly essential DDR genes survival analysis

To explore the clinical significance of the above findings, the GEPIA (Gene Expression Profiling Interactive Analysis) web server¹⁹ was used to perform survival analysis based on gene expression levels. GEPIA delivers fast and customizable functionalities using The Cancer Genome Atlas (TCGA) dataset²⁰. The genes from each cluster were analyzed to determine if their expression was associated with worse or better overall survival in each of the three cancer groups (A, B, C) from the heatmap above (group B was subdivided into B1 and B2 as it was much larger and heterogeneous than A and C). Almost all 59 genes were statistically significant associated with worse or better survival at least in one cancer group (Supplementary Table 5). However, only genes whose higher expression was associated with a statistically significant log-rank P-value and statistically significant hazard ratio (HR > 2/HR < 1) were selected for further analysis (Fig. 6a).

As several genes have opposite effects for two or more cancer groups, genes that were exclusively associated with better survival and those exclusively associated with worse survival were analyzed as separate molecular signatures. Fourteen genes were classified as “better survival” (four from cluster 1 and five from cluster 3) whereas nineteen genes were classified as “worse survival” (five from cluster 1 and ten from cluster 3). Despite this trend, no statistically significant differences were found between the number of genes contributed by clusters 1 and 3 to each signature (p = 0.68, Fisher's exact test; Fig. 6b). The “better survival” gene signature was analyzed in terms of overall survival and as expected, it was associated with better survival across the 18 cancer types included in this study (p = 7.4e−09; HR 0.76; pHR = 8.3e−09, n = 3244; Fig. 6c). On the contrary, the “worse survival” gene signature was associated with worse overall survival (p = 0; HR 2.0; pHR = 0, n = 3244; Fig. 6c). Notice that p values of cero must be understood as “below machine precision” (see “Methods”).

To further understand these molecular signatures, the GEPIA2 web server²¹ (which is also based on TCGA dataset) was used to identify genes with similar expression patterns across the 18 cancer types. For each signature (better/worse), the Pearson’s correlation coefficient of 100 genes was obtained (range 0.65–0.80 for the “better survival” and 0.79–0.84 for the “worse survival”) (Supplementary Table 6). Remarkably, when these genes were used as a multi-gene signature to perform survival analysis based on their expression levels in the 18 cancer types, a higher expression of the “better 100 signature” was found to be also associated with better survival (p = 8.5e−06; HR 0.81; pHR = 8.8e−06; n = 3244; Fig. 7A). On the contrary, a higher expression of the “worse 100 signature” was associated with worse survival (p = 0; HR 2.1; pHR = 0; n = 3244; Fig. 7D).

To further characterize the genes involved in each signature, the STRING web server²² was used to plot their protein–protein interaction networks. The stringApp²³, was used to import the STRING networks into Cytoscape²⁴ for network analysis. Overall, the worse survival signature network showed a much higher network density and centralization, as well as lower network heterogeneity (Fig. 7B,E). Additionally, the g:Profiler web server²⁵ was used for functional interpretation of these signatures according to their gene ontology (Fig. 7C,F). Interestingly, most of the genes from the “better survival 100 signature" are annotated in functions related to transcription regulation and mRNA metabolism. On the other hand, genes from the "worse survival 100 signature" are annotated in functions related to DNA replication and cell cycle (Supplementary Table 7).

Finally, considering that DDR genes have been good candidates for the synthetic lethality approach²⁶ and considering the huge impact of the worse survival signature in terms of overall survival, the SynLethDB²⁷ was used to search synthetic lethal partners for the 19 genes from this signature. SynLethDB harbours a large set of synthetic lethality gene pairs collected from a variety of sources, including biochemical assays, computational predictions, other related databases, and text mining. Sixteen genes presented at least one potential synthetic lethal partner (SLP) (statistic score ≥ 0.50). In 10 of these genes (ERCC3, BRIP1, POLQ, UBE2T, RFC2, RAD51D, MUS81, PCNA, RPA3, RFC4), NAE1 (NEDD8 Activating Enzyme E1 Subunit 1) was the SLP with the highest score, and in 6 of them, it was the only gene with a statistic score ≥ 0.50 (Fig. 8A). To further explore this finding, the correlation between the expression levels of these 6 genes and NAE1 was analyzed in GEPIA2. A statistically significant positive correlation was found in the 18 types of cancer (R = 0.52, p = 0, Spearman’s correlation test, Fig. 8B). In fact, higher expression of NAE1 was also found to be associated with a worse prognosis in all 18 cancer types (p = 9e−06.; HR 1.2; pHR = 9.2e−06; n = 3244; Fig. 8C).

Discussion

The analysis of cancer dependencies and vulnerabilities aims to identify and prioritize new potential therapeutic targets. In this work, a Pan-cancer analysis of non-oncogene addiction to DDR genes using data derived from the Cancer Dependency Map and TCGA was performed. Among the 59 commonly essential DDR genes, large differences were found between them in terms of dependency scores across the 423 cancer cell lines. However, it must be considered that dependency scores are measured for cells grown in culture and are unlikely to fully reflect the conditions of the in vivo tumour microenvironment. Thus, results must be interpreted with caution and should be further addressed through different approaches.

Since some of the common essential DDR genes in cancer cells may also be essential for normal cells, the implications of essentiality were analyzed at the transcriptomic level from paired normal-tumour samples and large differences were also found. Genes with strongly negative dependency scores (cluster 1), were upregulated in most cancer types suggesting that they might be essential for carcinogenesis. Some of them can even be considered as potential cancer drivers as previously suggested⁶. Despite this, it was surprising that higher expression of several genes from clusters 1 and 2 (e.g., PRPF19 and ATRIP) were strongly associated with better overall survival in some cancer groups. This was further confirmed when the 14 genes exclusively associated with better survival were considered a multi-gene signature and the higher expression of this signature was also associated with better survival across the 18 types of cancer. This is especially interesting considering that the lower expression and/or alterations in DDR genes have been associated with improved survival in some cancer types^28,29,30. Nevertheless, a couple of studies have also reported that active DNA repair and higher expression of several DDR genes (including genes from clusters 1 and 2 from this study) were associated with better survival in gastric and ovarian cancer^31,32. Taken together, this better survival signature might represent promising biomarkers for prognosis and further clinical studies should be performed to fully validate this.

Another surprising finding was that genes from cluster 3 (weakest dependency and efficacy scores) like UBE2T, POLQ, BRIP1, H2AFX, and RTEL1, were also highly upregulated. This could indicate that cancer cells do not depend on these genes for their growth but survival, especially considering the persistent DNA damage and replication stress in cancer cells². We can refer to this as the timing-dependent addiction, which means that although cancer cells have weak dependency on these genes (as growth promoters), these genes may act as late promoters of cell survival (given the evident overexpression). For instance, among genes from cluster 3, POLQ (Pol θ) might be one of the best examples of non-oncogene addiction to DNA repair (low efficacy and dependency scores but highly upregulated). In fact, it has been demonstrated that cancer cells with mutations in POLQ synthetic lethal (DDR) genes tend to become addicted to Theta Mediated End Joining (TMEJ) for survival, suggesting that POLQ becomes essential upon increased levels of endogenous and unrepaired DNA damage³³.

Remarkably, the present work also demonstrated that higher expression of UBE2T, RFC4, POLQ, BRIP1, and H2AFX is especially associated with worse overall survival (HR > 5) in several types of cancer (Group A and D). Additionally, when considered as part of the worse survival signature, these genes were associated with a worse prognosis in the 18 cancer types. This might be also linked to the correlation found between the expression levels of this signature and the "worse 100 signature", which is enriched with genes involved in DNA replication and cell cycle, probably suggesting a more aggressive phenotype. Previous findings have reported that genes involved in cell cycle phases and checkpoint genes had higher essentiality percentages both in embryonic stem cells and in cancer cell lines³⁴. Additionally, it has been shown that the amplification and overexpression of several DDR genes across the pan-cancer scale are associated with reduced mutation burden, cell line drug resistance, and poor prognosis³⁵. In contrast, a higher somatic tumour mutational burden has been associated with better overall survival in several types of cancer³⁶. Taken together, the results from the present study support the notion that non-oncogene addiction to several DDR genes could reduce the tumour mutational burden and confer treatment resistance, suggesting an explanation for the worse overall survival associated with this signature.

The results discussed above also support previous findings reporting the key role of the DDR genes from cluster 3 in cancer. For instance, studying more than 10,000 TCGA pan-cancer tumours, Wu and colleagues found that tumours with amplifications in DDR genes like UBE2T (Ubiquitin Conjugating Enzyme E2 T) exhibited significantly reduced mutation burden, temozolomide resistance, and worse patient survival. In fact, UBE2T and BRIP1 are also in the top 10 amplified DDR genes in their study³⁵. RFC4 (Replication factor C subunit 4) has been also identified as an upregulated DDR gene across the pan-cancer scale¹⁰. RFC4 protects colorectal cancer cells from X-ray-induced DNA damage and apoptosis through nonhomologous end joining (NHEJ)-mediated DNA repair³⁷ and has also been associated with poorer prognosis in this malignancy³⁸ and lung cancer³⁹.

DNA polymerase theta (Pol θ) (which is encoded by the POLQ gene) is important in the repair of genomic double-strand breaks (DSBs) from many sources⁴⁰. Upregulation of POLQ has been associated with poor clinical outcomes in breast and lung cancer^41,42,43,44. Following the same line, the present study highlights the importance of POLQ as a prognostic factor associated with worse survival in different types of cancer (especially Group A and D). Recent reports have demonstrated that POLQ inhibitors selectively kill HR-deficient tumour cells in vitro and in vivo^45,46. Additionally, it has been demonstrated that POLQ knockdown sensitizes several tumour cell lines to Ionizing Radiation and causes minimal effects on normal tissue radiosensitivity⁴⁷.

RFC2, CHEK1, PCNA, and POLE2 were also associated with worse overall survival in several types of cancer (Group A and D). As they belong to cluster 1 (strongly negative dependency and efficacy scores and high expression levels), these genes might be more necessary for tumour growth and promotion. Upregulation of PCNA has been reported in several types of cancer and its inhibition is gaining more interest as a novel strategy in many cancers^48,49,50. Similarly, the knockdown of POLE2 (DNA polymerase epsilon subunit 2) has been linked to reduced or suppressed tumorigenesis in different types of cancer^51,52,53. CHEK1 (also known as Chk1 or checkpoint kinase 1) is essential to maintain cell viability in cancer cells⁵⁴ and it has been shown that the combination of ATR and CHEK1 inhibitors results in cancer-specific synthetic lethality⁵⁵. Notably, CHEK1 inhibition is synthetically lethal with loss of B-Family DNA Polymerases like POLE2 in lung cancer and colorectal cancer cells⁵⁶. Taken together, inhibiting these genes from cluster 1 represents a promising therapeutic avenue in several types of cancer, especially focusing on the combined inhibition of CHEK1 and POLE2.

Finally, it was remarkable that NAE1 was identified as a synthetic lethal partner of ten genes from the worse survival signature. NAE1 is responsible for Neddylation, a post-translational modification that adds an ubiquitin-like protein (NEDD8) to multiple substrate proteins and it has been shown that its inhibition exerts anticancer effects mainly by triggering cell apoptosis, autophagy, and senescence^57,58. In fact, Neddylation is beginning to be considered as a key factor of the DNA repair process⁵⁹. As NAE1 inhibition has been proposed as a new approach to treating cancer⁶⁰, these results support the potential of the combined inhibition of NAE1 and several DDR genes. More research will be needed to validate the implication of the molecular signatures described in this study and to what extent they apply to different subtypes of cancer and the complexity of their genetic backgrounds.

Methodology

All data processing steps, and statistical analyses were performed in the RStudio 1.4.1717 statistical environment (https://www.rstudio.com/).

Efficacy, selectivity, and dependency of DNA repair genes

The list of DNA repair genes and their respective pathway annotation was obtained from one of the latest and more complete reports so far⁶. The efficacy and selectivity scores for the 241 genes (θ = 0.6; CRISPR:shRNA = 60%:40%; where θ is the mixing ratio of the two scores) were obtained from shinyDepMap¹⁴ (https://labsyspharm.shinyapps.io/depmap). These values are derived from a combined CRISPR-shRNA gene dependency score based on the Cancer Dependency Map dataset¹¹ (https://depmap.org/portal/). Additionally, for those genes with efficacy scores lower than -0.5 and selectivity scores lower than 0 (termed essential genes), the combined (θ = 0.6; CRISPR:shRNA = 60%:40%, where θ is the mixing ratio of the two scores) dependency scores across 423 cell lines were obtained. Likewise, the lineage-dependent essentiality of 17 cell lineages was also analyzed. For a lineage to be classified as dependent, all its cell lines must have dependency values that exceed the essentiality threshold obtained by analyzing all available genes.

Transcriptomic alterations of commonly essential DRGs

For transcriptomic analysis, the TACCO (Transcriptome Alterations in CanCer Omnibus) webserver¹⁸ (http://tacco.life.nctu.edu.tw/) was used to identify the patterns of expression of essential DNA repair genes (efficacy < − 0.5; selectivity < 0) in 18 cancer types. TACCO includes the mRNAs expression levels (Transcripts Per Million) for 26 cancer types from the Broad GDAC Firehose (https://gdac.broadinstitute.org/) (version stddata__2016_01_28). For this task, only the log2 fold change (log2_FC) of those genes with statistically significant (Benjamini-Hochberg) adjusted p values were included. For the heatmap elaboration, the ComplexHeatmap⁶¹ R package was used.

To find clustering patterns among DDR genes, all variables were analyzed together. The efficacy and selectivity scores were included for each gene as continuous variables. Based on the median dependency score of each gene (for the whole set of 423 cell lines), dependency scores were transformed from continuous to categorical: weakly negative (from 0 to − 0.5), moderate (from − 0.5 to − 1.0), and strongly negative (from − 1.0 to − 1.5). The log2 FC was also transformed to categorical: downregulated (from 0 to − 1.25), upregulated (from 0 to 1.25), and highly upregulated (from 1.25 to 2.5 +). A distance matrix was created based on Gower's similarity coefficient, as it is suggested for mixed-type variables. The clustering algorithm used was Partitioning Around Medoids (PAM) and the silhouette plot was used to determine the number of clusters. The t-Distributed Stochastic Neighbor Embedding (tSNE) algorithm was used for dimensionality reduction (perplexity = 10, max_iter = 1000). Hierarchical clustering was used as an additional algorithm for dimensionality reduction based on the same Gower's similarity coefficients calculated.

Survival analysis, protein networks, and gene ontology

The GEPIA (Gene Expression Profiling Interactive Analysis) webserver¹⁹ (http://gepia.cancer-pku.cn/) was used to compare the different gene clusters in terms of patient overall survival across the cancer groups included in this study. GEPIA is based on The Cancer Genome Atlas (TCGA)²⁰ and The Genotype-Tissue Expression (GTEx) project⁶². The thresholds for high/low gene expression levels between cohorts were adjusted to 50%. Log-rank P values, hazard ratios (HR), and 95% confidence intervals (CI) were obtained for each gene. Notably, as GEPIA calculations are run in R, some p values might be reported as cero since R does not report values lower than 2.220446e-16 (.Machine$double.eps). Therefore, p values of cero must be understood as “below machine precision”.

The GEPIA2 web server²¹ (http://gepia2.cancer-pku.cn/#index) was used to identify genes that have a similar expression pattern for each signature across all cancer types. The Search Tool for the Retrieval of Interacting Genes/Proteins database (STRING v11.0) web server²² (https://string-db.org/) was used to construct the protein–protein network associated with the better/worse 100 signature). The stringApp²³, a Cytoscape app that makes it easy to import STRING networks into Cytoscape²⁴ was used for network visualization and analysis. Additionally, the g:Profiler web server²⁵ (http://biit.cs.ut.ee/gprofiler/) was used to find statistically significant Gene Ontology terms, pathways, and other gene functions giving a list of DDR genes.

Synthetic lethal pair analysis

The SynLethDB web server²⁷ (http://synlethdb.sist.shanghaitech.edu.cn/v2/#/) was used to search synthetic lethal partners given a specific gene. SynLethDB harbours a large set of synthetic lethality gene pairs collected from a variety of sources, including biochemical assays, other related databases, computational predictions, and text mining.

References

Weinstein, I. B. & Joe, A. Oncogene addiction. Cancer Res. 68, 3077–3080 (2008).
CAS PubMed Google Scholar
Luo, J., Solimini, N. L. & Elledge, S. J. Principles of cancer therapy: Oncogene and non-oncogene addiction. Cell 136, 823–837 (2009).
CAS PubMed PubMed Central Google Scholar
Jeggo, P. A., Pearl, L. H. & Carr, A. M. DNA repair, genome stability and cancer: A historical perspective. Nat. Rev. Cancer 16, 35–42 (2016).
CAS PubMed Google Scholar
Hanahan, D. & Weinberg, R. A. Hallmarks of cancer: The next generation. Cell 144, 646–674 (2011).
CAS PubMed Google Scholar
Negrini, S., Gorgoulis, V. G. & Halazonetis, T. D. Genomic instability—An evolving hallmark of cancer. Nat. Rev. Mol. Cell Biol. 11, 220–228 (2010).
CAS PubMed Google Scholar
Knijnenburg, T. A. et al. Genomic and molecular landscape of DNA damage repair deficiency across The Cancer Genome Atlas. Cell Rep. 23, 239-254.e6 (2018).
CAS PubMed PubMed Central Google Scholar
Dietlein, F., Thelen, L. & Reinhardt, H. C. Cancer-specific defects in DNA repair pathways as targets for personalized therapeutic approaches. Trends Genet. 30, 326–339 (2014).
CAS PubMed Google Scholar
Solimini, N. L., Luo, J. & Elledge, S. J. Non-oncogene addiction and the stress phenotype of cancer cells. Cell 130, 986–988 (2007).
CAS PubMed Google Scholar
Hjaltelin, J. X. et al. Identification of hyper-rewired genomic stress non-oncogene addiction genes across 15 cancer types. NPJ Syst. Biol. Appl. 5, 27 (2019).
PubMed PubMed Central Google Scholar
Chae, Y. K. et al. Genomic landscape of DNA repair genes in cancer. Oncotarget 7, 23312–23321 (2016).
PubMed PubMed Central Google Scholar
Tsherniak, A. et al. Defining a Cancer Dependency Map. Cell 170, 564-576.e16 (2017).
CAS PubMed PubMed Central Google Scholar
Meyers, R. M. et al. Computational correction of copy number effect improves specificity of CRISPR–Cas9 essentiality screens in cancer cells. Nat. Genet. 49, 1779–1784 (2017).
CAS PubMed PubMed Central Google Scholar
Behan, F. M. et al. Prioritization of cancer therapeutic targets using CRISPR–Cas9 screens. Nature 568, 511–516 (2019).
ADS CAS PubMed Google Scholar
Shimada, K., Bachman, J. A., Muhlich, J. L. & Mitchison, T. J. shinyDepMap, a tool to identify targetable cancer genes and their functional connections from Cancer Dependency Map data. Elife 10, e57116 (2021).
CAS PubMed PubMed Central Google Scholar
Gonçalves, E. et al. Drug mechanism-of-action discovery through the integration of pharmacological and CRISPR screens. Mol. Syst. Biol. 16, e9405 (2020).
PubMed PubMed Central Google Scholar
Chan, E. M. et al. WRN helicase is a synthetic lethal target in microsatellite unstable cancers. Nature 568, 551–556 (2019).
ADS CAS PubMed PubMed Central Google Scholar
Wood, R. D., Mitchell, M. & Lindahl, T. Human DNA repair genes, 2005. Mutat. Res. Fundam. Mol. Mech. Mutagen. 577, 275–283 (2005).
CAS Google Scholar
Chou, P.-H. et al. TACCO, a database connecting transcriptome alterations, pathway alterations and clinical outcomes in cancers. Sci. Rep. 9, 3877 (2019).
ADS PubMed PubMed Central Google Scholar
Tang, Z. et al. GEPIA: A web server for cancer and normal gene expression profiling and interactive analyses. Nucleic Acids Res. 45, W98–W102 (2017).
CAS PubMed PubMed Central Google Scholar
The Cancer Genome Atlas Research Network et al. The Cancer Genome Atlas Pan-Cancer analysis project. Nat. Genet. 45, 1113–1120 (2013).
Google Scholar
Tang, Z., Kang, B., Li, C., Chen, T. & Zhang, Z. GEPIA2: An enhanced web server for large-scale expression profiling and interactive analysis. Nucleic Acids Res. 47, W556–W560 (2019).
CAS PubMed PubMed Central Google Scholar
Szklarczyk, D. et al. STRING v11: Protein–protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Res. 47, D607–D613 (2019).
CAS PubMed Google Scholar
Doncheva, N. T., Morris, J. H., Gorodkin, J. & Jensen, L. J. Cytoscape StringApp: Network analysis and visualization of proteomics data. J. Proteome Res. 18, 623–632 (2019).
CAS PubMed Google Scholar
Shannon, P. Cytoscape: A software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504 (2003).
CAS PubMed PubMed Central Google Scholar
Reimand, J. et al. g:Profiler—A web server for functional interpretation of gene lists (2016 update). Nucleic Acids Res. 44, W83–W89 (2016).
CAS PubMed PubMed Central Google Scholar
Nickoloff, J. A., Jones, D., Lee, S.-H., Williamson, E. A. & Hromas, R. Drugging the cancers addicted to DNA repair. JNCI J. Natl. Cancer Inst. 109, djx059 (2017).
Google Scholar
Guo, J., Liu, H. & Zheng, J. SynLethDB: Synthetic lethality database toward discovery of selective and sensitive anticancer drug targets. Nucleic Acids Res. 44, D1011–D1017 (2016).
CAS PubMed Google Scholar
Mo, X. Low expression of 12 DNA repair genes was associated with better disease-free survival in non-small cell lung cancer patients having adjuvant chemotherapy. Int. J. Radiat. Oncol. Biol. Phys. 98, 237 (2017).
Google Scholar
van der Doelen, M. J. et al. Impact of DNA damage repair defects on response to radium-223 and overall survival in metastatic castration-resistant prostate cancer. Eur. J. Cancer 136, 16–24 (2020).
PubMed Google Scholar
Teo, M. Y. et al. DNA damage response and repair gene alterations are associated with improved survival in patients with platinum-treated advanced urothelial carcinoma. Clin. Cancer Res. 23, 3610–3618 (2017).
CAS PubMed PubMed Central Google Scholar
Sun, H. et al. Identification of a prognostic signature associated with DNA repair genes in ovarian cancer. Front. Genet. 10, 839 (2019).
CAS PubMed PubMed Central Google Scholar
Jinjia, C. et al. The use of DNA repair genes as prognostic indicators of gastric cancer. J. Cancer 10, 4866–4875 (2019).
PubMed PubMed Central Google Scholar
Feng, W. et al. Genetic determinants of cellular addiction to DNA polymerase theta. Nat. Commun. 10, 4286 (2019).
ADS PubMed PubMed Central Google Scholar
Viner-Breuer, R., Yilmaz, A., Benvenisty, N. & Goldberg, M. The essentiality landscape of cell cycle related genes in human pluripotent and cancer cells. Cell Div. 14, 15 (2019).
CAS PubMed PubMed Central Google Scholar
Wu, Z. et al. Copy number amplification of DNA damage repair pathways potentiates therapeutic resistance in cancer. Theranostics 10, 3939–3951 (2020).
CAS PubMed PubMed Central Google Scholar
Samstein, R. M. et al. Tumor mutational load predicts survival after immunotherapy across multiple cancer types. Nat. Genet. 51, 202–206 (2019).
CAS PubMed PubMed Central Google Scholar
Wang, X.-C. et al. Genome-wide RNAi screening identifies RFC4 as a factor that mediates radioresistance in colorectal cancer by facilitating nonhomologous end joining repair. Clin. Cancer Res. 25, 4567–4579 (2019).
CAS PubMed Google Scholar
Xiang, J. et al. Levels of human replication factor C4, a clamp loader, correlate with tumor progression and predict the prognosis for colorectal cancer. J. Transl. Med. 12, 320 (2014).
PubMed PubMed Central Google Scholar
Chen, W., Zhu, S., Zhang, Y., Xiao, J. & Tian, D. Identification of key candidate tumor biomarkers in non-small-cell lung cancer by in silico analysis. Oncol. Lett. https://doi.org/10.3892/ol.2019.11169 (2019).
Article PubMed PubMed Central Google Scholar
Wood, R. D. & Doublié, S. DNA polymerase θ (POLQ), double-strand break repair, and cancer. DNA Repair 44, 22–32 (2016).
CAS PubMed PubMed Central Google Scholar
Lemee, F. et al. DNA polymerase up-regulation is associated with poor survival in breast cancer, perturbs DNA replication, and promotes genetic instability. Proc. Natl. Acad. Sci. 107, 13390–13395 (2010).
ADS CAS PubMed PubMed Central Google Scholar
Higgins, G. S. et al. Overexpression of POLQ confers a poor prognosis in early breast cancer patients. Oncotarget 1, 175–184 (2010).
PubMed PubMed Central Google Scholar
Shinmura, K. et al. POLQ overexpression is associated with an increased somatic mutation load and PLK4 overexpression in lung adenocarcinoma. Cancers 11, 722 (2019).
CAS PubMed Central Google Scholar
Allera-Moreau, C. et al. DNA replication stress response involving PLK1, CDC6, POLQ, RAD51 and CLASPIN upregulation prognoses the outcome of early/mid-stage non-small cell lung cancer patients. Oncogenesis 1, e30–e30 (2012).
CAS PubMed PubMed Central Google Scholar
Zhou, J. et al. A first-in-class polymerase theta inhibitor selectively targets homologous-recombination-deficient tumors. Nat. Cancer 2, 598–610 (2021).
PubMed PubMed Central Google Scholar
Zatreanu, D. et al. Polθ inhibitors elicit BRCA-gene synthetic lethality and target PARP inhibitor resistance. Nat. Commun. 12, 3636 (2021).
ADS CAS PubMed PubMed Central Google Scholar
Higgins, G. S. et al. A small interfering RNA screen of genes involved in DNA repair identifies tumor-specific radiosensitization by POLQ knockdown. Cancer Res. 70, 2984–2993 (2010).
CAS PubMed PubMed Central Google Scholar
Dillehay, K. L., Lu, S. & Dong, Z. Antitumor effects of a novel small molecule targeting PCNA chromatin association in prostate cancer. Mol. Cancer Ther. 13, 2817–2826 (2014).
CAS PubMed Google Scholar
Smith, S. J. et al. Molecular targeting of cancer-associated PCNA interactions in pancreatic ductal adenocarcinoma using a cell-penetrating peptide. Mol. Ther. Oncolytics 17, 250–256 (2020).
CAS PubMed PubMed Central Google Scholar
Søgaard, C. K. et al. Targeting the non-canonical roles of PCNA modifies and increases the response to targeted anti-cancer therapy. Oncotarget 10, 7185–7197 (2019).
PubMed PubMed Central Google Scholar
Li, J. et al. Knockdown of POLE2 expression suppresses lung adenocarcinoma cell malignant phenotypes in vitro. Oncol. Rep. https://doi.org/10.3892/or.2018.6659 (2018).
Article PubMed PubMed Central Google Scholar
Zhang, C. et al. Targeting POLE2 creates a novel vulnerability in renal cell carcinoma via modulating stanniocalcin 1. Front. Cell Dev. Biol. 9, 622344 (2021).
PubMed PubMed Central Google Scholar
Zhu, Y., Chen, G., Song, Y., Chen, Z. & Chen, X. POLE2 knockdown reduce tumorigenesis in esophageal squamous cells. Cancer Cell Int. 20, 388 (2020).
CAS PubMed PubMed Central Google Scholar
Cho, S. H., Toouli, C. D., Fujii, G. H., Crain, C. & Parry, D. Chk1 is essential for tumor cell viability following activation of the replication checkpoint. Cell Cycle 4, 131–139 (2005).
CAS PubMed Google Scholar
Sanjiv, K. et al. Cancer-specific synthetic lethality between ATR and CHK1 kinase activities. Cell Rep. 14, 298–309 (2016).
CAS PubMed Google Scholar
Rogers, R. F. et al. CHK1 inhibition is synthetically lethal with loss of B-family DNA polymerase function in human lung and colorectal cancer cells. Cancer Res. 80, 1735–1747 (2020).
CAS PubMed PubMed Central Google Scholar
Zhou, L., Jiang, Y., Luo, Q., Li, L. & Jia, L. Neddylation: A novel modulator of the tumor microenvironment. Mol. Cancer 18, 77 (2019).
PubMed PubMed Central Google Scholar
Chen, P. et al. Neddylation inhibition activates the extrinsic apoptosis pathway through ATF4–CHOP–DR5 axis in human esophageal cancer cells. Clin. Cancer Res. 22, 4145–4157 (2016).
CAS PubMed Google Scholar
Brown, J. S. & Jackson, S. P. Ubiquitylation, neddylation and the DNA damage response. Open Biol. 5, 150018 (2015).
PubMed PubMed Central Google Scholar
Soucy, T. A. et al. An inhibitor of NEDD8-activating enzyme as a new approach to treat cancer. Nature 458, 732–736 (2009).
ADS CAS PubMed Google Scholar
Gu, Z., Eils, R. & Schlesner, M. Complex heatmaps reveal patterns and correlations in multidimensional genomic data. Bioinformatics 32, 2847–2849 (2016).
CAS PubMed Google Scholar
Lonsdale, J. et al. The genotype-tissue expression (GTEx) project. Nat. Genet. 45, 580–585 (2013).
CAS Google Scholar

Download references

Acknowledgements

I thank Dr Pau Creixell (CRUK CI) for thoughtful and helpful suggestions. I also thank Gabriel Jiménez-Huezo for the graphic design of some of the figures in this paper.

Author information

Authors and Affiliations

Robotic Radiosurgery Center, International Cancer Center, San José, Costa Rica
Luis Bermúdez-Guzmán
Section of Genetics and Biotechnology, School of Biology, University of Costa Rica, San Pedro, San José, Costa Rica
Luis Bermúdez-Guzmán

Authors

Luis Bermúdez-Guzmán
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

The author confirms sole responsibility for the following: study conception and design, data collection, analysis and interpretation of results, and manuscript preparation.

Corresponding author

Correspondence to Luis Bermúdez-Guzmán.

Ethics declarations

Competing interests

The author declares no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Table 1.

Supplementary Table 2.

Supplementary Table 3.

Supplementary Table 4.

Supplementary Table 5.

Supplementary Table 6.

Supplementary Table 7.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Bermúdez-Guzmán, L. Pan-cancer analysis of non-oncogene addiction to DNA repair. Sci Rep 11, 23264 (2021). https://doi.org/10.1038/s41598-021-02773-3

Download citation

Received: 06 October 2021
Accepted: 23 November 2021
Published: 01 December 2021
DOI: https://doi.org/10.1038/s41598-021-02773-3
Springer Nature Limited

This article is cited by

The biological significance of cuproptosis-key gene MTF1 in pan-cancer and its inhibitory effects on ROS-mediated cell death of liver hepatocellular carcinoma
- Liying Song
- Rong Zeng
- Fanhua Kang
Discover Oncology (2023)

Pan-cancer analysis of non-oncogene addiction to DNA repair

Abstract

Similar content being viewed by others

Results

Efficacy and selectivity of DNA repair genes

Dependency scores of commonly essential DDR genes

Transcriptomic alterations of commonly essential DDR genes

Commonly essential DDR genes survival analysis

Discussion

Methodology

Efficacy, selectivity, and dependency of DNA repair genes

Transcriptomic alterations of commonly essential DRGs

Survival analysis, protein networks, and gene ontology

Synthetic lethal pair analysis

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Navigation