Introduction

Bladder urothelial carcinoma (BLCA) is a global health concern and ranks among the top 10 most prevalent types of cancer, with an annual incidence of 550,000 new cases and a mortality rate of 200,000 [1, 2]. Among the 80% of patients with non-muscle-invasive bladder cancer, about 50% of patients relapse after surgery [3, 4]. Due to the absence of specific biomarkers, a portion of severe BLCA patients still cannot receive effective treatment. Although BLCA can be treated surgically in the early stages, the prognosis of advanced BLCA is poor. Innovative therapies and prognostic models are necessary to enhance the prognosis of advanced BLCA.

Metal micronutrients, especially copper, iron, and zinc are essential for life [5]. Copper plays a crucial role as a coenzyme in fundamental enzymes, such as those involved in mitochondrial respiration, immune system function, and the elimination of free radicals [6,7,8]. A recent report showed that the level of copper in the serum and tumor tissues of cancer patients was significantly increased, and copper could be directly combined with the fatty components of the TCA cycle [9,10,11]. Tsvetkov et al. found a new mechanism of copper-induced cell death related to mitochondrial dysfunction [12], this new form of controlled cell death was named “cuproptosis”. Cuproptosis might happen when mitochondrial enzymes aggregate and lead to mitochondrial stress [13]. This indicates that cuproptosis could potentially serve as a latent target for immunotherapy in BLCA.

Long noncoding RNAs (lncRNAs) are a class of RNA molecules that exceed 200 nucleotides in length and lack protein-coding capacity [14]. As a new class of cellular regulatory molecules, lncRNAs interact with various molecules, depending on their subcellular distribution, to modulate gene transcription and kinase cascades [15]. Meanwhile, lncRNAs participate in mRNA expression and gene regulation, affecting various aspects of tumor cells [16].

In this paper, we investigated the potential involvement of crlncRNAs in the prognosis of BLCA patients. The results could aid in predicting the prognosis of BLCA patients and identifying potential drugs.

Materials and methods

Download and processing of TCGA data

We retrieved RNA sequencing (RNA-seq) transcription data, expression and mutation files, as well as clinical information of BLCA patients from the Cancer Genome Atlas (TCGA) database (https://portal.gdc.cancer.gov/). The data included tumor tissues of 412 BLCA samples and 19 normal tissue samples. All data were acquired and processed in accordance with the guidelines set forth by TCGA.

Selection and differential expression analysis of crlncRNAs

According to the research of Tsvetkov et al. [12], we selected 19 CRGs (Additional File TableS1) for analysis. Then, the correlation between CRGs and differentially expressed lncRNAs was analyzed. All 277 crlncRNAs met the criteria of Pearson’s correlation coefficient (|PearsonR|) > 0.5 and p < 0.001.

CrlncRNAs risk signature for BLCA prognosis

Univariate Cox regression analysis was performed on the obtained clinical and demographic data of BLCA patients to determine the lncRNA related to the overall survival (OS) rate of BLCA patients. On the basis of 10-fold cross-validation, Lasso regression was conducted to screen the lncRNAs that were really related to the survival of patients. Meanwhile, 8 optimal lncRNAs related to BLCA prognosis were screened and used to establish the risk model. Risk score \(={\sum }_{i=1}^{n}\text{E}\text{x}\text{p} \left(\text{l}\text{n}\text{c}\text{R}\text{N}\text{A}\right) \times \text{c}\text{o}\text{e}\text{f} \left(\text{l}\text{n}\text{c}\text{R}\text{N}\text{A}\right)\). Where coef (lncRNA) was the regression coefficient and \(\text{E}\text{x}\text{p}\)(lncRNA) was the expression level of crlncRNAs.

We randomly divided 277 BLCA samples into train group and test group in 1:1 ratio, and classified BLCA patients into high-risk and low-risk groups [17, 18]. The prognostic characteristics were analyzed using the Chi-square test, Receiver operating characteristic (ROC) curves analysis, and Kaplan-Meier survival analysis. We further made the nomogram with crlncRNAs risk score, established clinical risk factors, and predicted the survival time of patients. Then, the concordance index (C-index) and calibration curves of a nomogram model were built to investigate the prediction power of the nomogram. Finally, we carried out a stratified analysis to assess whether this feature maintained its predictive power in patient subgroups (phases I – II and III – IV).

Principal component analysis, gene ontology and functional enrichment analysis

Principal component analysis (PCA) was performed to describe the expression model of crlncRNAs. At the same time, the relationship between the three variables of the sample was visualized with 3D scatter plots. Then, the analysis of differentially expressed genes (DEGs) was performed [(Log2 fold change (FC) > 1, fdrFilter (FDR) < 0.05]. The Gene Ontology (GO) was used to elucidate the DEGs in terms of relevant cellular components (CC), biological processes (BP), and molecular functions (MF). Gene Set Enrichment Analysis (GSEA) was used to conduct differential Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways, with a false discovery rate (FDR) less than 0.25.

Investigation of tumor immune function and mutation burden

We used Single-sample GSEA (ssGSEA) enrichment score to describe which genes were coordinately upregulated or downregulated within a sample. Meanwhile, we obtained the somatic mutation file and calculated the tumor mutation burden (TMB) score for each BLCA patient. Kaplan Meier analysis was used to evaluate the impact of TMB on BLCA patients’ OS, and a t-test was used to compare the difference between high-risk groups and low-risk groups. Then, the Tumor Immune Dysfunction and Exclusion (TIDE) model was utilized to forecast immune response by simulating tumor immune escape mechanisms (http://tide.dfci.harvard.edu) [19, 20].

Drug sensitivity prediction

Inhibitory concentrations (IC50) represented the semiinhibitory concentration of the measured antagonist [20]. To assess drug candidates for BLCA treatment in clinical trials, we utilized pRRophetic to calculate the half-maximal IC50 values [13, 20, 21].

Statistical analysis

We utilized the R software (version 4.1.3) for conducting scientific statistical analyses. The “limma” package was used to combine RNA-seq transcriptome and somatic mutation data. Pearson correlation test was used to obtain crlncRNAs. Then, cox regression, survival, univariate and multivariate analysis were conducted. Besides, in order to evaluate the predictive performance of the risk model, time-dependent ROC curve analysis was performed via “survivalROC” R package. The GO and KEGG were analyzed using the “clusterProfiler” package in R software. Wilcoxon rank-sum test and chi-square test were used to compare the difference between two groups. The OS time of the two groups was estimated using Kaplan-Meier analysis with a log-rank test, and statistical significance was indicated by p < 0.05 for all analyses.

Results

Identification of crIncRNA

The design route for analysis can be seen in Fig. 1. We identified 16,876 lncRNAs in the TCGA_BLCA dataset. In total, we obtained 19 CRGs. Based on Pearson analysis, 277 crlncRNAs were found. We made sankey plot and heat map to show the association between CRGs and crlncRNAs (Fig. 2A, B). The 403 patients were divided into the train group (n = 202) and the test group (n = 201), and the clinical information on BLCA was presented in Table 1. The results showed that there was no difference between the train and test groups in all clinical features.

Table 1 Characteristic of bladder urothelial carcinoma patients

Construction and validation of the crlncRNAs risk model

Univariate Cox regression analysis identified 16 lncRNAs as prognostic factors for the survival of BLCA patients (Fig. 3A). To avoid overfitting, LASSO regression method was performed (Fig. 3B, C). Then, multiple Cox regression analysis was conducted, and we selected eight crlncRNAs prognostic markers to establish the risk model. Risk score = EXPAC073534.2 × (-0.268) + EXPLINC01648 × (-1.317) + EXPAC108449.2 × (0.321) + EXPTRG−AS1 × (-1.102) + EXPLINC02886 × (-0.702) + EXPAL590428.1 × (0.686) + EXPSH3RF3−AS1 × (0.306) + EXPAL117344.2 × (-1.047). Then, BLCA patients were categorized into low-risk and high-risk groups based on the median value of their risk score. The risk-score distribution plot revealed a negative correlation between survival times and increasing risk-scores (Fig. 4A-I). The analysis results showed that the high-risk group had worse survival (Fig. 4J-O).

Evaluation of the risk model

We utilized univariate and multivariate Cox regression analyses to assess the prognostic model’s predictive value. The age, risk score and stage were identified as significant independent prognostic factors (Fig. 5A, B). The areas under the l-year, 3 years and 5 years ROC curves (AUC) were 0.683, 0.712, and 0.744 respectively (Fig. 5C). The c-index and ROC curve results demonstrate that the accuracy of the prognosis model surpasses other clinical factors (Fig. 5D, E).

Construction of nomogram

A nomogram model was developed to accurately predict the overall survival of patients with BLCA (Fig. 6A). The calibration plots indicated good conformity with the prediction of this nomogram (Fig. 6B). Afterwards, the 3D scatter diagram revealed distinct aggregation features of PCA in both low- and high-risk groups (Fig. 6C, D, E, F).

Functional and pathway analysis

We further conducted the GO and KEGG enrichment analyses to find the biological functions and pathway analysis of DEGs. A total of 608 DEGs were obtained between two groups (Additional File TableS2). In the biological process category, the genes were mainly associated with signaling receptor activator activity, receptor ligand activity and glycosaminoglycan binding. In the cellular component category, it was mainly enriched in the collagen − containing extracellular matrix, endoplasmic reticulum lumen and intermediate filament cytoskeleton. In the molecule function category, it was epidermis development, external encapsulating structure organization and extracellular structure organization (Fig. 7A, B, C). Genes in the KEGG [22,23,24] category were enriched in the PI3K − Akt signaling pathway, focal adhesion and human papillomavirus infection (Fig. 7D, E, F).

Immunity analysis of the risk score and tumor mutational burden

Figure 8 A displays the immune response heatmap generated by the ssGSEA algorithm. According to ssGSEA analysis of TCGA-BLCA data, the correlation between immune cell populations and related functions showed significant differences in IFN response types, T cell functions, antigen-presenting cell (APC) functions, human leukocyte antigen (HLA), chemokine receptor (CCR), and inflammation between the two groups. We collected somatic mutation data in BLCA and TIDE data (http://tide.dfci.harvard.edu/), and then calculated the corresponding TMB and TIDE scores (Fig. 8B, C). The results indicated that the high-risk group in BLCA had a greater burden of mutations compared to the low-risk group. We categorized BLCA patients into two groups, “High-TMB” and “Low-TMB,“ in order to conduct survival analyses. The fifteen most mutated genes were TP53, TTN, KMT2D, MUC16, ARID1A, KDM6A, PIK3CA, SYNE1, RYR2, KMT2C, HMCN1, FAT4, RB1, MACF1 and FLG. TP53 was the gene with the highest mutation frequency (Fig. 8D, E). The results indicated that the low-TMB group exhibited a lower survival rate compared to the high-TMB group in BLCA (Fig. 8F). Patients with lower scores and higher tumor mutational burden (TMB) exhibited the most favorable prognosis among the four groups (Fig. 8G).

Drug sensitivity analysis

We aimed to investigate the potential of crlncRNAs as prognostic markers for personalized treatment of BLCA by examining the correlation between drug risk scores and IC50 values in BLCA therapy. As shown in Fig. 9, the sensitivity of 24 anticancer drugs was significantly different in two groups (p < 0.05). These drugs have the potential to be used in the future for treating BLCA. The relationship between the risk scores and IC50 was shown in Additional File FigureS1.

Discussion

Bladder urothelial carcinoma (BLCA) is a malignant tumor with a dismay outcome [25]. Due to complex pathological subtypes, genomic differences and drug resistance, the overall chemotherapy effect of BLCA is not very ideal [26]. It has been reported that an elevated level of copper in tumor patients can stimulate the growth of new blood vessels, facilitate the progression of tumors, and promote their spread to other parts of the body. Recently, Tsvetkov et al. have reported on the phenomenon of cuproptosis, which has been shown to overcome malignant cell resistance to chemotherapy and facilitate the removal of defective cells [12]. In addition, compared with normal cells, copper ionophores have inherent selectivity in inducing cancer cell clusters [20]. Therefore, cuproptosis may be a promising approach for treating BLCA in the future. Additionally, lncRNA exerts biological effects on the development and treatment of various cancers [20, 27, 28]. LncRNAs have been identified as significant prognostic factors in BLCA and may serve as promising molecular targets for its treatment [29,30,31]. The aim of this article is to identify potential prognostic markers by examining the possible interaction between lncRNA and cuproptosis.

In our research, we identified 16 crlncRNAs that can be used as prognostic markers for predicting the overall survival of patients with BLCA. Among them, eight were selected to construct a prognostic model. First, 19 CRGs and 277 crlncRNAs were identified. Then, Lasso regression and COX regression analyses were performed to identify the prognostic crlncRNAs. Different types of predictive lncRNA signatures have been reported in previous studies for patients with BCAL [4]. The highest AUC of the crlncRNAs signature in 5 years was 0.653 in the study by Zhang. In our study, the AUC in 5 years is 0.744, which demonstrates this crlncRNAs marker has strong predictive power. What’s more, we further identified the crlncRNAs clinical variables, immune cell infiltration and immunotherapy, and drug sensitivity of BLCA.

Based on Cox and Lasso regression analyses, we identified eight lncRNAs that are associated with prognosis: AC073534.2, LINC01648, AC108449.2, TRG-AS1, LINC02886, AL590428.1, SH3RF3-AS1 and AL117344.2. Ding et al. found that AC073534.2 was a prognostic biomarker for acute myeloid leukaemia [32]. LINC01648 was associated with glycosylated hemoglobin [33]. Sun et al. found that the expression level of AC108449.2 was correlated with the OS of BLCA patients. The expression level of AC108449.2 positively correlates with the risk score in BLCA [34]. TRG-AS1 has been reported to play an important role in many tumors [35,36,37,38]. One study showed that in pterygium fibroblasts, the expression of AL590428.1 was decreased [39]. SH3RF3-AS1 was found to be significant expression in hepatocellular carcinoma with cirrhosis [40]. However, there are few reports on LINC02886 and AL117344.2. Thus, it is imperative to further elucidate their mechanisms in future research.

Then, the accuracy of crlncRNAs prognostic model was verified. Based on the Kaplan Meier method, our study revealed that the high-risk group had a lower overall survival rate compared to the low-risk group. The ROC curves demonstrated that the prognostic model based on crlncRNAs exhibited high accuracy in predicting 1-, 3-, and 5-year survival rates, with all AUC values exceeding 0.65. The application of PCA also revealed the distinction between the two groups. By combining crlncRNAs prognostic model with clinical information, a nomogram was developed to predict prognosis and metastasis in BLCA patients using PFS and C-index.

The GO and KEGG analysis revealed that the differentially expressed crlncRNAs prognostic marker was significantly enriched in the PI3K-Akt signaling pathway, focal adhesion, and human papillomavirus infection. The PI3K/AKT signaling pathway governs cellular survival and proliferation. Abnormal activation of this pathway is typically linked to the progression of cancer and resistance to tumor therapies [13, 41]. Thus, further studies are needed to explore the relationship between cuproptosis and the PI3K/AKT signaling pathway. Waterfall plot revealed that TP53 was mutated more frequently in BLCA patients. Chemical damage induced by certain mutagens may be responsible for TP53 mutations [42]. We believed that elevated levels of copper in patients with bladder cancer may trigger TP53 mutations, which could be associated with the occurrence of cuproptosis. We utilized pRRophetic to predict the response of drugs in treating BLCA [21]. There were significant variations in IC50 values among the 24 drugs for high-risk and low-risk patients.

Cuproptosis is a novel form of programmed cell death that has the potential to become a significant therapeutic target in cancer treatment. Our study identified several crlncRNAs that play a significant role in cancer progression and treatment through various biological mechanisms. However, the current study had several limitations. First, we built and validated the prediction model with only a single retrospective BLCA data source. Second, there are still many unexplored areas between cuproptosis and lncRNAs. Thus, more validation is needed through preclinical studies. Third, since our scoring model requires the detection of the relevant genes, its applicability in clinical practice may lead to an increase in the cost-related burden for BLCA.

Conclusion

In summary, we identified a risk model based on crlncRNAs expression to predict survival of patients with BLCA. Our study provided a novel therapeutic approach for personalized treatment and improved immunotherapy response in patients with BLCA. Meanwhile, the usefulness of nomogram in predicting BLCA patient survival needs to be explored in future studies.

Fig. 1
figure 1

Flowchart of the present research. TCGA, The Cancer Genome Atlas; lncRNAs, long noncoding RNAs; PCC, pearson’s correlation coefficient; OS: overall survival; GO, gene ontology; KEGG, Kyoto Encyclopedia of Genes and Genomes

Fig. 2
figure 2

(A) Sankey diagram of coexpression between 19 CRGs and 277 crlncRNAs. (B) correlation 19 CRGs and 8 prognostic crlncRNAs. CRGs, cuproptosis-related genes; crlncRNAs, cuproptosis-related long noncoding RNAs

Fig. 3
figure 3

Identification of crlncRNAs with prognostic value in BLCA patients. (A) univariate Cox regression analysis for identifying the prognostic crlncRNAs. (BC) Lasso–Cox regression analysis was conducted to construct prognostic prediction models. BLCA, bladder urothelial carcinoma; crlncRNAs, cuproptosis-related long noncoding RNAs

Fig. 4
figure 4

Prognosis capability of the model in the three patient sets. (A-C)Risk score distribution, survival status and heatmap for BLCA patients in high-risk and low-risk groups in the entire cohort. (D-F Risk score distribution, survival status and heatmap for BLCA patients in high-risk and low-risk groups in the test cohort. (G-I Risk score distribution, survival status and heatmap for BLCA patients in high-risk and low-risk groups in the train cohort.(J-K) Stage I-II and Stage III-IV survival analysis in the high- and low-risk groups. (L) Kaplan–Meier curves of progression-free survival. (M-O) Kaplan–Meier curves for overall survival analysis in the entire, test and train cohort. BLCA, bladder urothelial carcinoma

Fig. 5
figure 5

Independent prognostic analysis of BLCA OS. (A-B) Forest plot for univariate and multivariate Cox regression analysis. (C) Time-ROC curves predicted 1, 3, and 5-year of OS for BLCA patients. (D-E) ROC curves and C-index curves showed the predictive accuracy of the risk model was superior to other clinical factors. BLCA, bladder urothelial carcinoma; OS, overall survival; ROC, receiver operating characteristic

Fig. 6
figure 6

Nomogram for survival prediction (A) Nomogram was constructed based on crlncRNAs prognostic markers (B) Calibration plots for 1-, 3-, and 5-years survival predictions. (C-F) PCA between the high- and low-risk groups based on the (C) all genes, (D)CRGs, (E) crlncRNAs, and (F) crlncRNAs prognostic marker. PCA indicated differences in both risk groups and clusters. PCA, principal component analysis; crlncRNAs, cuproptosis-related long noncoding RNAs; CRGs cuproptosis-related genes; OS: overall survival

Fig. 7
figure 7

GO and KEGG pathway enrichment analysis. (A-C) Barplot, bubble chart and circle diagram of the GO enrichment terms. (D-F) Barplot, bubble chart and circle diagram of the KEGG enrichment terms. GO, gene ontology; KEGG, Kyoto Encyclopedia of Genes and Genomes

Fig. 8
figure 8

Immunological landscape in BLCA patients and relationship between TMB and risk scores. (A) Heatmap of the tumor-infiltrating lymphocytes among two groups in BLCA. (B-C) Analysis of TMB differences and TIDE prediction score. (D-E) Waterfall plot of top 15 mutation genes in the high-risk and low-risk group in BLCA. (F) Overall survival analysis curves of the high- and low-TMB groups. (G) Overall survival curve combined with TMB risk in BLCA. TMB, tumor mutation burden; TIDE, tumor immune dysfunction and exclusion; BLCA, bladder urothelial carcinoma

Fig. 9
figure 9

Drug sensitivity (IC50) correlated with high- and low-risk patients in bladder urothelial carcinoma. IC50, half-maximal inhibitory concentration prediction