circRNA profiles in GC patients with different prognoses
The flowchart for circRNA screening in this study is shown in Fig. 1a. We performed RNA sequencing to characterize the circRNA expression profiles of GC patients with different RFS. A total of 23,995 circRNAs were detected in the discovery set, among which 9443 circRNAs were included in the circBase database and annotated (Fig. 1b). Consistent with previous studies, these circRNAs were distributed in all human chromosomes, and their splicing length was mostly less than 1000 nucleotides (Fig. 1c, d). Differential analysis was performed on the circRNA expression profile of GC patients with different prognoses. A total of 259 differentially expressed circRNAs were identified with the threshold of pval < 0.05, of which 192 circRNAs were upregulated, and 67 circRNAs were downregulated in GC patients with good prognoses (Fig. 1e).
Screening and validation of prognostic circRNAs
Then, we evaluated whether circRNAs could be used as prognostic biomarkers in GC patients. The top 20 up- and downregulated circRNAs with extremely fold change obtained by RNA sequencing were included in the preliminary screening (Fig. 2a). Considering the contingency of RNA sequencing samples, 12 circRNAs with significant expression differences in at least 4 of 5 pairs of samples were selected for further validation. The circRNA ID, genome position, spliced length, and gene symbols of these 12 candidate circRNAs are shown in Table 1.
Table 1 The information of 12 candidate circRNAs The relative expression levels of 12 circRNAs were detected by qRT-PCR assay in the training set (n = 136). According to the median expression level of each circRNA, GC patients were divided into high and low expression groups. Kaplan–Meier survival analysis showed that four circRNAs (hsa_circ_0005092, hsa_circ_0002647, hsa_circ_0008197 and hsa_circ_0105599) were significantly associated with RFS in GC patients and were upregulated in patients with good prognoses, which was consistent with the sequencing results (Fig. 2b–e, Additional file 1: Figure S1). Among them, hsa_circ_0005092 and hsa_circ_0002647 were confirmed to be independent prognostic factors of RFS in GC patients by univariate and multivariate Cox regression analysis (Table 2).
Table 2 Univariate and multivariate Cox regression analysis of prognostic factors for RFS in the training set To further verify the correlations of hsa_circ_0005092 and hsa_circ_0002647 with postoperative recurrence in GC patients, a validation set (n = 167) was applied. Survival analysis showed that GC patients with high expression level of hsa_circ_0005092 and hsa_circ_0002647 had longer RFS than those with low expression (Fig. 2f-g). Regression analysis also confirmed the prognostic value of hsa_circ_0005092 and hsa_circ_0002647 (Table 3). Additionally, the circularity of the two circRNAs was verified by Sanger sequencing (Fig. 3a, d), and their half-lives were experimentally demonstrated to be longer than those of linear host genes (Fig. 3b, e). RNase R resistance assays also confirmed that they both have circular structure and higher stability (Fig. 3c, f). More importantly, we also detected their presence in the plasma of GC patients and healthy people (Additional file 2: Figure S2). Taken together, the above results indicated that hsa_circ_0005092 and hsa_circ_0002647 have the potential to be biomarkers for postoperative recurrence in GC patients.
Table 3 Univariate and multivariate Cox regression analysis of prognostic factors for RFS in the validation set Table 4 Correlation between circRNAs and clinicopathologic features of GC patients Construction of prognostic model based on hsa_circ_0005092 and hsa_circ_0002647
Next, the training set and validation set were combined to construct a postoperative recurrence model based on the regression coefficients of hsa_circ_0005092 and hsa_circ_0002647 for statistical significance (Additional file 4: Table S3):
$${\text{circPanel}} = - 0.369 \times Exp_{hsa\_circ\_0005092} - 0.223 \times Exp_{hsa\_circ\_0002647}$$
The expression level of hsa_circ_0005092 and hsa_circ_0002647 was brought into the circPanel for calculation to obtain the recurrence risk index of each patient. A total of 303 gastric cancer patients were included in the statistics, and were divided into high- and low-risk groups according to the median of the recurrence risk index obtained above. Patients with a high recurrence risk index were defined as circPanelhigh patients, and patients with a low recurrence risk index were defined as circPanellow patients. Survival analysis showed that circPanellow patients had a shorter RFS than circPanelhigh patients (hazard ratio [HR]: 2.229, 95% confidence interval [CI]: 1.662–2.989, p < 0.0001, Fig. 4a). The ROC curve and the area under the ROC curve (AUC) were further used to analyze the prognostic value of circPanel, as shown in Fig. 4c. Compared with the hsa_circ_0005092 and hsa_circ_0002647, circPanel had a larger AUC (0.709, 95% CI 0.607–0.742), with sensitivity of 69.9% and specificity of 58.1%.
Moreover, we observed a similar effect of circPanel on OS in GC patients. Patients with circPanellow had a shorter OS (HR: 2.025, 95% CI 1.362–3.010, p = 0.0004, Fig. 4b). As shown in Fig. 4d, the AUCs of hsa_circ_0002647, hsa_circ_0005092 and circPanel were 0.612 (95% CI 0.533–0.691), 0.631 (95% CI 0.553–0.709) and 0.638 (95% CI 0.56–0.716), respectively. The sensitivity and specificity of circPanel for predicting OS were 51.0% and 75.5%, respectively. In short, circPanel could be used as a biomarker for the prognostic evaluation of GC patients, with better predictive performance than single circRNA.
Comparative analysis of circRNA and other clinical indicators
We also analyzed the correlation between circPanel and the clinicopathological features of GC patients. The results suggested that circPanel was related to the tumor differentiation and T stage in GC patients (Table 4). Patients with circPanelhigh might have higher tumor differentiation and lower T stages. No significant association was observed between circPanel and other characteristics (including sex, age, tumor size, N stage and TNM stage).
In addition, we compared the prognostic value of circPanel with some traditional tumor markers (including CEA, CA19-9 and CA724) by ROC curve analysis. As a result, the AUCs of circPanel, CEA, CA19-9 and CA724 were 0.67, 0.508, 0.56, and 0.493 for RFS, respectively, and 0.638, 0.485, 0.566, and 0.524 for OS. CircPanel has greater prognostic value than CEA, CA19-9 and CA724 (Fig. 4e, f). In summary, these results strongly confirmed the clinical application value of circPanel in the prognostic assessment of GC patients.
CeRNA network of hsa_circ_0005092 and hsa_circ_0002647
Furthermore, we attempted to explore the biological function of these two selected circRNAs. At present, ceRNA is the most common circRNA regulation mechanism. Through the prediction of the CircInteractome, 6 miRNAs (miR-616, miR-599, miR-409-3p, miR-217, miR-513a-5p and miR-890) targeted by hsa_circ_0005092 and 6 miRNAs (miR-370, miR-626, miR-637, miR-648, miR-326 and miR-574-5p) targeted by hsa_circ_0002647, with more binding sites and lower context scores, were screened. Then, we predicted the target mRNAs of these miRNAs by multimiR and obtained 193 and 372 mRNAs for hsa_circ_0005092 and hsa_circ_0002647 (Additional file 4: Table S4), respectively. Cytoscape was further used to describe the regulatory network of circRNA-miRNA-mRNA (Fig. 5a, b).
GO and KEGG analysis of hsa_circ_0005092 and hsa_circ_0002647
To further predict the potential function of circRNAs, GO and KEGG enrichment analyses were performed on the ceRNA regulatory networks of these two circRNAs. The significantly enriched biological processes (BPs), cellular components (CCs) and molecular functions (MFs) are shown in Fig. 6a, b. The main biological function of the hsa_circ_0005092 and hsa_circ_0002647 both included the involvement of metabolic process, response to stimulus and developmental process. In addition, hsa_circ_0005092 may also be involved in cell proliferation, while hsa_circ_0002647 involved in molecular localization and cell adhesion. The pathways enriched for the two circRNAs based on KEGG analysis are shown in Fig. 6c, d. The cellular senescence, FoxO signaling pathway, and microRNAs in cancer involved in hsa_circ_0005092 and the Hippo signaling pathway, cellular senescence and transcriptional misregulation in cancers involved in hsa_circ_0002647 are all related to cancer.