Background

As we know, tumors are often exhibited their characteristics by both homogeneity and heterogeneity. For example, tumors from different organs could share the same molecular signatures [1], while tumors from the same organs may have different molecular characteristics [2]. Therefore, a comprehensive pan-cancer analysis for certain molecules across multiple tumor types is of great significance. Analysis for pan-cancer can identify the differences and similarities of the key biological processes among various cancer types from different anatomical site [3] and provide clues for developing new combination and targeted therapies for cancer.

Insulin-like growth factor binding proteins (IGFBPs) are flexible endocrine and autocrine/paracrine regulators of the bioactivity of Insulin-like growth factor (IGF) [4]. The members of the IGFBP family include IGFBP1, IGFBP 2, IGFBP3, IGFBP4, IGFBP5, IGFBP6, IGFBP7 and IGFBPL1. They have crucial functions such as transcriptional regulation and induction of apoptosis [5], and play important roles in human cancers. For example, IGFBP1 and IGFBP2 are closely related to human insulin sensitivity [6]; IGFBP2 may inhibit the effective ways of tumor growth and metastasis [7]. IGFBP3 blocks the hyaluronic acid CD44 signaling pathway through a p53-dependent mechanism [8]. IGFBP4 can increase the bioavailability of IGF and stimulate IGF-mediated growth [9]. IGFBP5 has the function of reducing neovascularization [10]. IGFBP6 is a relatively specific inhibitor of IGF-II, playing crucial roles in inhibiting cancer cell migration [11]. IGFBP7 has the function of stimulating the production of prostacyclin (PGI2) and stimulating cell adhesion [12, 13]. The inactivation of IGFBPL1 is associated with the pathogenesis of breast cancer [14]; moreover, IGFBPL1 inhibits the growth of cancer cell by inhibiting the signaling pathway PI3K-AKT in esophageal cancer [15]. However, previous research on the relationship between IGFBPs and human tumors usually focused on a single gene in the most common cancers, such as breast cancer, lung cancer, and gastric cancer, etc. The family member IGFBP4, IGFBP5, IGFBPL1 has been less studied in relation to cancer compared with other members.

This study utilized a set of multi-source and multi-dimensional data to characterize the expression profiles of IGFBP gene family, the interaction between IGFBPs, and its relationship with cancer-related pathways, prognosis, Tumor Mutational Burden (TMB), Microsatellite Instability (MSI), tumor stemness, tumor immune microenvironment, immune cell infiltration, comorbidity factors, and gene alteration characteristics of pan-cancer. The results of the database analysis results using immunohistochemistry and qRT-PCR. We intend to discover the potential of IGFBPs as new biomarkers for cancer diagnosis or prognosis as well as therapeutic target, and shed new light on relevant mechanism. Further, bibliometrics and literature review were used for summarizing the research progress of IGFBP family.

Materials and methods

Data collection from multi-source databases

We extracted the expression and gene alteration data of IGFBPs in 33 types of cancer (Additional file 1: Table S1) from TCGA (http://cancergenome.nih.gov/). Besides, the survival data and tumor stemness characteristics (based on DNA methylation and RNA expression) were downloaded from UCSC Xena (https://xenabrowser.net/). The Oncomine database (https://www.oncomine.org/) was used to verify the expression level of IGFBPs between different cancer and normal tissues. The expression and gene alteration data for IGFBPs in different cancer cell lines was downloaded from CCLE database (https://portals.broadinstitute.org/ccle), 325 cell lines in 8 tumors were included in the analysis. The HPA database (https://www.proteinatlas.org/) was used to collect the immunohistochemical staining data of IGFBP family in 20 types of tumors.

Expression profiling analysis of IGFBPs at multidimensional levels

Expression profiles of IGFBPs at mRNA level

The gene expression data from TCGA was analyzed using the R package “Deseq2” [16] to determine the differentially expressed IGFBPs, P < 0.05 indicated a statistically significant difference. The Oncomine database was used to validate the differential genes applying the following thresholds: |fold change|> 2, P < 0.05 and gene rank = top 10%.

Expression profiles of IGFBPs at cell level

The expression profile of IGFBPs in various cancer cell lines were calculated using the Kruskal–Wallis rank test, differences were considered significant when P < 0.05.

Expression profiles of IGFBPs at protein level

We obtained data on immunohistochemistry of IGFBP family from the HPA database, which has information on 20 different cancer types. According to the intensity and proportion of staining, the protein expression levels of IGFBPs were grouped into high, medium, low and not detected.

Comorbidity factors associated with IGFBPs expression in pan-cancer

Using the Kruskal–Wallis rank sum test, we further investigated the comorbidity factors (age and gender) associated with IGFBPs expression in pan-cancer.

Correlation between IGFBPs expression and prognosis in pan-cancer

According to the median expression levels of IGFBPs, all the cancer samples were classified into two groups, and the overall survival was compared using the log-rank test.

Correlation between IGFBPs expression and hallmark pathways and interaction between IGFBPs

Gene set variation analysis (GSVA) provides an algorithm to estimate the variation of pathway activity [17]. In the present study, the Pearson correlation coefficients were calculated to find out the hallmark pathways related to the expression of IGFBPs, significantly correlating pathways were selected using | r |> 0.3 and P < 0.05.

To investigate the correlation and interaction of IGFBPs expression, we calculated the Pearson’s correlation coefficient of IGFBPs expression using TCGA data. R package “ggplot2” [18] was used for visualization. Furthermore, the GENEMANIA (http://genemania.org/) and STRING (https://string-db.org/) databases were applied to predict the network to explore the potential interaction relationships of IGFBPs. Cytoscape 3.9.1 was used for visualization.

Correlation between IGFBPs expression and multiple molecular features

Correlation between IGFBPs expression and TMB as well as MSI

TMB and MSI are regarded as indicators for immune checkpoint inhibitor activity [19, 20]. Thus, we calculated the Spearman coefficients between IGFBPs expression levels and TMB as well as MSI scores obtained from TCGA. Correlations were considered as significant when | r |> 0.2 and P < 0.05.

Correlation between IGFBPs expression and tumor stemness score

Tumor stemness score RNAss and DNAss obtained through RNA sequencing data and DNA methylation data respectively were used to measure the degree of oncogenic dedifferentiation [21]. Spearman coefficients between IGFBPs expression and stemness scores were calculated. P < 0.05 was statistically significant.

Correlation between IGFBPs expression and tumor immune microenvironment

The ESTIMATE scores for each patient from TCGA were calculated by the “estimate” R package [22] to reflect the amount of immune and stromal components at tumor immune microenvironment in pan-cancer tissues, while the estimate score was used to forecast tumor purity. We performed Spearman’s correlation analysis between IGFBPs expression and the ESTIMATE immune/ stromal/ estimate scores. A difference of P < 0.05 was considered significant.

We utilized data from the CIBERSORT [23] platform to perform Spearman correlation analysis between the expression of IGFBPs and infiltration levels for 22 types of immune cells in pan-cancer. Correlations were considered as significant when | r |> 0.3 and P < 0.05.

Correlation between IGFBPs expression and its genetic variation

The gene mutation data of IGFBPs in various cancer tissues and cells was downloaded from TCGA and CCLE databases respectively, while information for IGFBPs copy number variation (CNV) frequencies was extracted from TCGA database. The mutation and CNV proportion were indicated by the mutation frequencies and deletion or amplification CNV rates. Besides, to determine the impact of mutations and CNVs on IGFBPs expression, the Mann–Whitney U test was performed.

Verification of IGFBPs expression at mRNA and protein levels

Sample source

This study was approved by the Ethics Approval Committee of the First Hospital of China Medical University, and informed consent was signed by the participants. In total, 67 pairs of gastric cancer and adjacent tissues, as well as 44 pairs of colorectal cancer and adjacent tissues were used to extract total RNA for qRT-PCR experiments. We further retrospectively collected gastric cancer tissue specimens for immunohistochemical staining. A total of 18 gastric cancer and 20 adjacent normal specimens were detected. Tissue specimens were embedded in paraffin following fixation in 10% formalin.

Validation of the mRNA expression levels of IGFBPs by qRT-PCR

RNA was extracted and reversed to cDNA; real-time quantitative PCR (qRT-PCR) was performed to detect the relative mRNA levels of IGFBPs in a 20μL reaction system. The fold change was determined via the method of 2 − ΔΔCt [ΔΔCt = (ΔCt of IGFBPs) − (ΔCt of β-actin)]. The test results were analyzed using the Wilkson test of the non-parametric test in SPSS software, and P < 0.05 was considered statistically significant. The primer sequences of IGFBPs and β-actin were showed in Additional file 1: Table S2.

Validation of the protein expression levels of IGFBPs by immunohistochemistry

Immunohistochemistry (IHC) staining was used to detect the protein expression of IGFBP2 in gastric cancer tissues. The paraffin-embedded tissue specimens were cut into 4 μm-thick sections, which were then deparaffinized, rehydrated with gradient ethanol, and incubated in EDTA. Endogenous peroxidase was blocked using 3% hydrogen peroxide. 10% normal goat serum was used to destroy tissue collagen for reducing non-specific binding. The rabbit monoclonal antibody to IGFBP2 (ab188200, 1:1000, Abcam, UK) was used as the primary antibody, and the samples were incubated for 1 h at room temperature. After washing with PBS, biotin-labeled secondary antibodies and streptavidin–horseradish peroxidase were added to the samples, respectively, and incubated at room temperature for 10 min each time. The samples were then dyed with DAB, dehydrated and fixed with resin.

Finally, the stained tissue sections were observed by experienced pathologists under the microscope. uses the widely accepted HSCORE (histological score) system to semi-quantitatively evaluate the protein expression level in the tissues [24, 25]. The formula of HSCORE is: HSCORE = ∑Pi (i + 1), where i is the staining intensity of the cell (0: negative, 1: weak, 2: moderate, and 3: strong), and Pi is the percentage of cells at each intensity level (0–100%). The results were analyzed by SPSS software, Mann–Whitney U and Wilcoxon matched pairs signed-rank test were performed for non-parametric tests. P < 0.05 was considered statistically significant.

Bibliometrics and literature review

Publications on research related to IGFBPs in cancer were searched in the Web of Science Core Collection for bibliometric analysis and literature review with the following search strategy: Topic search (TS) = (Insulin-like growth factor binding protein) and TS = (cancer). The dates of publication were restricted from Jan 1st, 2000 to Dec 31st, 2021, type of publication was limited to articles, and the language was restricted to English. Keywords co-occurrence analysis was performed using VOSviewer (version 1.6.18) [26].

Results

The expression profile of IGFBPs in pan-cancer

IGFBPs expression profile at mRNA level

We used the count data of 33 types of tumors covered by TCGA to analyze the expression levels of the IGFBP family in general. The results showed that the expression of IGFBPs were different in multiple cancers (Table 1, Fig. 1A). We selected IGFBP1, which had differential expression in most cancers and larger fold changes to further visualize its expression in 10 types of cancers and normal tissues, and presented in the form of mean ± standard deviation (Fig. 1B).

Table 1 Differential expression of IGFBPs at mRNA level in pan-cancer
Fig. 1
figure 1

Expression profile of IGFBPs across different cancer types. A Expression of IGFBPs in different cancer and normal samples. The color in heatmap represents the log2 fold change value between cancer and normal. The blue color represents the low expression in cancer, whereas the red color represents the high expression in cancer. The * sign represents degree of statistical significance, * P < 0.05, ** P < 0.01, *** P < 0.001. B Expression of IGFBP1 in 10 types of cancers between cancer and normal tissues. C Expression of IGFBPs in various cancers from Oncomine database. The blue color represents the low expression, whereas the red color represents the high expression

The Oncomine database was used to verify the findings from TCGA (Fig. 1C). The results indicated that, consistent with our TCGA findings, the IGFBP genes were differentially expressed in various cancer types.

IGFBPs expression profile at cell level

The results of CCLE analysis showed different IGFBPs exhibited distinct expression levels in different cancer cell lines (Fig. 2A). Among them, IGFBP2, IGFBP3, IGFBP4, IGFBP6 and IGFBP7 showed higher expression levels, especially in gastric, liver and lung cancer cell lines; conversely, IGFBP1, IGFBP5 and IGFBPL1 showed relatively lower expression, particularly in colorectal, gastric and kidney cancer cell lines (Fig. 2B).

Fig. 2
figure 2

The expression profile of IGFBPs at cell and protein level. A Expression heatmap of IGFBPs in 6 cancer cell lines (breast, colorectal, gastric, kidney, liver and lung) from CCLE database. The blue color represents the low expression, whereas the red color represents the high expression. B Boxplot of IGFBPs expression in 6 cancer cell lines. C IGFBPs protein expression across various cancer types. The expression of each gene in each cancer was divided into four groups of high expression (red color), medium expression (orange color), low expression (yellow color) and not detected (grey color). D IGFBP6 protein expression in 16 cancer types based on immunohischemistry staining results from 'The Human Protein Atlas' database

IGFBPs expression profile at the protein level

We collected the protein expression information of IGFBP1, IGFBP4, IGFBP6, IGFBP7, and IGFBPL1 from the HPA database. The results indicated that the protein expression of IGFBP6 and IGFBP7 could be detectable in all 20 types of tumor tissues (Additional file 1: Table S3). By contrast, IGFBP1 and IGFBP4 protein showed medium or low expression in several tumors such as pancreatic cancer and colorectal cancer (Fig. 2C). In ENCA and OV, the protein expression of IGFBPs was the most. In addition, Fig. 2D displayed representative immunohistochemical pictures depicting the IGFBP6 protein expression in different cancer tissues.

Comorbidity factors associated with IGFBPs expression in pan-cancer

The correlation between IGFBPs expression and age or gender of patients were investigated to reflect the comorbidity factors in pan-cancer. The results showed that the expression trend of different IGFBPs in different cancers varies with age and gender. For example, IGFBP1 was highly expressed among elder patients in LGG, LUSC, while lowly expressed in CESC, COAD and UCEC (Fig. 3). And as is shown in Fig. 4, IGFBP1 was highly expressed in GBM and HNSC among male patients.

Fig. 3
figure 3

Correlation between IGFBPs expression and age of patients in pan-cancer. Blue bars represent younger age (< = 60), while red bars represent older age (> 60). * P < 0.05, ** P < 0.01, *** P < 0.001

Fig. 4
figure 4

Correlation between IGFBPs expression and gender of patients in pan-cancer. Blue bars represent male, while red bars represent female. * P < 0.05, ** P < 0.01, *** P < 0.001

Correlation between IGFBPs expression and prognosis in pan-cancer

The prognostic values of IGFBPs in pan-cancer were analyzed using the log-rank test. The results indicated that IGFBPs had different prognostic effects in 19 cancer types (Table 2, Fig. 5A and Additional file 2: Fig. S1). For example, in LGG, the expression of IGFBP2, IGFBP3 and IGFBP5 were related to poor prognosis, but the expression of IGFBP4 and IGFBP7 were related to good prognosis; and in UVM, the expression of IGFBP2, IGFBP4 and IGFBP7 were related to poor prognosis, but the expression of IGFBPL1 was related to good prognosis. In addition, IGFBP2 and IGFBP7 showed different prognostic roles in multiple cancer types, we therefore created forest plots to show specific predictive effects in various types of cancer (Fig. 5B and C).

Table 2 The association between IGFBPs expression and overall survival in pan-cancer
Fig. 5
figure 5

Prognostic significance of IGFBP genes. A Correlation between IGFBPs expression and survival of different cancers. Red color represents high risk of death whereas blue color represents low risk of death. B, C Forest plot for the prognostic analysis of IGFBP2 and IGFBP7 across 33 cancer types

Correlation between IGFBPs expression and cancer-related pathways and interaction between IGFBPs in pan-cancer

We examined and illustrated the relationship between IGFBPs and hallmark pathways in order to clarify the molecular implications of IGFBPs in carcinogenesis. The results indicated that the expression of IGFBPs, especially IGFBP4 and IGFBP7, was significantly correlated with the suppression or activation of a variety of cancer pathways (Additional file 1: Table S4, Fig. 6A).

Fig. 6
figure 6

Association of IGFBPs expression with cancer-related pathways. A Network displaying the correlation between IGFBPs expression and cancer-related pathways. The white nodes represent IGFBPs and grey nodes represent pathways. The red edges represent activated pathways and blue edges represent inhibited pathways. B The number of significantly correlated pathways in each individual IGFBPs. Red color represents activated pathways and blue color represents inhibited pathways. C Correlation between the expression of different IGFBPs. Red color represents positive correlation and blue color represents negative correlation

Among them, IGFBP1 mainly contributed to bile acid metabolism and xenobiotic metabolism pathways; IGFBP3 mainly participated in the angiogenesis, epithelial-mesenchymal transition, glycolysis, and hypoxia pathways; IGFBP4 mainly participated in angiogenesis, apoptosis, epithelial-mesenchymal transition, hypoxia, K-RAS signaling up, TGF-β signaling and ultraviolet response down pathways; IGFBP5 was mainly involved in angiogenesis, epithelial-mesenchymal transition, Hedgehog signaling, TGF-β signaling and ultraviolet response down pathways; IGFBP6 was mainly involved in the epithelial-mesenchymal transition pathway; IGFBP7 mainly participated in angiogenesis, E2F targets, epithelial-mesenchymal transition, Hedgehog signaling, K-RAS signaling up, MYC-targets-V2, TGF-β signaling and ultraviolet response down pathways. However, the expression of IGFBP2 and IGFBPL1 was not found to be significantly correlated with these signaling pathways. After summarizing the number of pathways that IGFBPs participated in, we found that IGFBPs activated much more pathways than they inhibited (Fig. 6B). In addition, IGFBPs may play synergistic roles in various cancer-related pathways because a significant correlation was detected (Fig. 6C), such as IGFBP3-IGFBP4(r = 0.54), IGFBP4-IGFBP7(r = 0.50) and IGFBP5-IGFBP7(r = 0.49).

We used the data from TCGA to analyze the correlation of IGFBPs expression in all cancer patients and 33 different cancers respectively. Figure 7A and B demonstrated the overall expression level and correlation of IGFBPs in pan-cancer. The results showed that, on the whole, the correlation between IGFBP3-6 was strong and mostly positive, such as IGFBP4-IGFBP7 (r = 0.50), IGFBP5-IGFBP7 (r = 0.50), while the correlation between IGFBP1, IGFBP2 or IGFBPL1 and other IGFBPs was poor (Fig. 7B). In addition, IGFBPs often showed positive correlation in different cancers (Fig. 8 and Additional file 2: Fig. S2). The interaction diagram established by GENEMANIA and STRING showed that there were extensive interactions between IGFBPs, especially IGFBP3, IGFBP5 and IGFBP7, while the interaction between IGFBPL1 and other IGFBPs was relatively poor (Fig. 7C and D).

Fig. 7
figure 7

The correlation and interaction analysis among IGFBPs. A IGFBPs expression in all cancer patients in TCGA. B The correlation among IGFBPs expression in all 33 types of cancer from TCGA. Red color represents positive correlation and blue color represents negative color (P < 0.05). C Interaction network of IGFBPs from GENEMANIA. D Protein–protein interaction network of IGFBPs from STRING. Red color represents high degree score and blue color represents low degree score

Fig. 8
figure 8

The correlation among IGFBPs expression in different 9 types of cancer from TCGA

Correlation between IGFBPs expression and multiple molecular features in pan-cancer

Correlation between IGFBPs expression and TMB in pan-cancer

The TMB phenomenon, which is used to describe the mutation number involved in cancer cells, has emerged as a potential cancer prognosis biomarker in multiple tumor types. In the present study, the relationship between IGFBPs expression and TMB was investigated in 33 cancer types. Overall, except for IGFBP1, the expression of other IGFBPs showed significantly association with TMB in multiple cancers, especially IGFBP5 and IGFBP7, which were significantly correlated with TMB in 10 types of cancers, respectively, including LIHC, STAD, THYM, CESC, etc. Furthermore, IGFBPs were more negatively correlated with TMB in different cancers (Fig. 9A and Additional file 2: Fig. S3A).

Fig. 9
figure 9

Correlation between IGFBPs expression and (A) tumor mutational burden (TMB) as well as (B) microsatellite instability (MSI) in pan-cancer. * P < 0.05, ** P < 0.01, *** P < 0.001

Correlation between IGFBPs expression and MSI in pan-cancer

We also analyzed the association between IGFBPs expression and MSI in 33 cancer types. The results showed that, except for IGFBP1 and IGFBP2, the expression of other IGFBP genes was significantly correlated with MSI in several cancer types, and the vast majority also showed negative correlation. For example, IGFBP5 was negatively associated with MSI in TGCT, STAD as well as DLBC, and IGFBP7 was also negatively correlated with MSI in LUSC, STAD as well as DLBC (Fig. 9B and Additional file 2: Fig. S3B).

Correlation between IGFBPs expression and tumor stemness score in pan-cancer

To evaluate the impact of IGFBPs on tumor stemness, we explored the relationship between IGFBPs expression and tumor stemness scores in 33 cancer types. The results showed that the expression of most IGFBPs was negatively associated with the DNAss (Fig. 10A) and RNAss (Fig. 10B). Especially for the RNAss, nearly all the correlations were negative in the majority of cancer types.

Fig. 10
figure 10

Correlation between IGFBPs expression and stemness score in pan-cancer. A Correlation between IGFBPs expression and DNAss. B Correlation between IGFBPs expression and RNAss. The red color represents positive correlations, whereas the green color represents negative correlations. The depth of the color represents the strength of the relationship. * P < 0.05, ** P < 0.01, *** P < 0.001. DNAss, DNA stemness score; RNAss, RNA stemness score

Correlation between IGFBPs expression and tumor immune microenvironment in pan-cancer

We analyzed the correlation between the expression of IGFBPs and tumor immune microenvironment in 33 cancer types through the ESTIMATE algorithm. The results indicated that IGFBP family showed significantly negative or positive correlations with stromal (Fig. 11A), immune (Fig. 11B) and ESTIMATE (Fig. 11C) scores in pan-cancer. And for the majority of the cancers, IGFBP3, IGFBP4, IGFBP5, IGFBP6 and IGFBP7 were significantly positively associated with stromal, immune as well as ESTIMATE scores including COAD, PCPG, PRAD, READ and STAD. While for IGFBP1, IGFBP2 and IGFBPL1, there were more negative correlations than negative.

Fig. 11
figure 11

Correlation between IGFBPs expression and tumor immune microenvironment in pan-cancer. A Correlation between IGFBPs expression and stromal scores. B Correlation between IGFBPs expression and immune scores. C Correlation between IGFBPs expression and ESTIMATE scores. The red color represents positive correlations, whereas the blue color represents negative correlations. The depth of the color represents the strength of the relationship. * P < 0.05, ** P < 0.01, *** P < 0.001

Correlation between IGFBPs expression and immune cell infiltration in pan-cancer

We also analyzed the association between IGFBPs expression and 22 types of immune cell infiltration levels in 33 cancer types. The results indicated that in 26 kinds of cancers (including ACC, renal chromophobe cell carcinoma, thymic carcinoma, uterine SARC, and CHOL, etc.), the expression of IGFBPs was significantly associated with different immune cell infiltration, such as resting mast cells and macrophages M2 cells and naïve B cells (Table 3 and Additional file 1: Table S5). Figure 12 shows the correlation between IGFBPs expression and infiltration levels of specific immune cells in various cancer types.

Table 3 Correlation between IGFBPs expression and immune cell infiltration in Pan-cancer
Fig. 12
figure 12

Correlation between IGFBPs expression and immune cell infiltration. A Correlation between IGFBP1 expression and Dendritic cells activated infiltration level in KICH. B Correlation between IGFBP2 expression and Mast cells resting cell infiltration level in THYM. C Correlation between IGFBP3 expression and Macrophages M2 cell infiltration level in TGCT. D Correlation between IGFBP4 expression and Dendritic cells activated cell infiltration level in ACC. E Correlation between IGFBP5 expression and Macrophages M2 cell infiltration level in TGCT. F Correlation between IGFBP6 expression and T cells CD8 cell infiltration level in KICH. G Correlation between IGFBP7 expression and Macrophages M1 cell infiltration level in ACC. H Correlation between IGFBPL1 expression and Mast cells resting cell infiltration level in KIRP

Genetic variation characteristics of IGFBPs in pan-cancer

We explored the mutation frequency of the IGFBP family in 33 cancer types (Fig. 13A). The overall average mutation rate of IGFBPs was 0–5.7%, the mutation frequency of IGFBPs was less than 2% in most tumors except for UCEC. IGFBP1 and IGFBP5 showed relatively higher mutation frequency, while IGFBPL1 showed the lowest mutation frequency. UCEC exhibited frequent mutations of all IGFBPs genes, the mutational details were visualized via a waterfall plot (Fig. 13B). In contrast, no IGFBPs mutations were found in CHOL, THYM and PCPG. And the mutation types involved in case of the genetic variation of IGFBPs in all cancers were displayed in Additional file 2: Fig. S4. Furthermore, we also summarized the mutational status of IGFBPs in eight cancer cell lines from CCLE (Fig. 13C). The results revealed that IGFBPs mutations were more likely to occur in gastric, lung, and ovarian cancer cell lines.

Fig. 13
figure 13

Genetic alternations of IGFBPs in pan-cancer. A Mutation frequency of IGFBPs in different cancers. Red color represents high mutation frequency whereas blue color represents low mutation frequency. B Oncoplot for IGFBPs in UCEC. IGFBP5 shows the most frequent mutation in UCEC. C Mutation frequency of IGFBPs in different cancer cell lines (colorectal, gastric, liver, kidney, breast, lung, esophagus and ovary) from CCLE database. Red color represents high mutation frequency whereas blue color represents low mutation frequency. D The copy number variations frequency of IGFBPs in different cancers. The upper part of each grid shows the deletion frequency, and the bottom part shows the amplification frequency

Furthermore, the copy number variations (CNVs) of IGFBPs in pan-cancer was also analyzed, the results showed that the IGFBP family had a low degree of CNVs in various cancers (Fig. 13D). Relatively speaking, IGFBP4, IGFBP7 and IGFBPL1 showed more amplifications of copy number in multiple cancer types. While IGFBP2 and IGFBP5 had higher frequencies of copy number deletions.

Correlation between IGFBPs expression and IGFBPs variation in pan-cancer

The correlation between IGFBPs expression and IGFBPs mutation, CNVs was analyzed to investigate whether the IGFBPs variation modulated their expression levels. The results showed that the expression of IGFBP3 was modulated by its mutation in UCEC, and the expression of IGFBP6 was regulated by its mutation in SKCM. Additionally, we found that CNVs of IGFBPs could affect their expression in multiple cancer types (Table 4). For instance, in BRCA and OV, the increased expression of IGFBP1 and IGFBP2 was correlated with its increased copy number. The copy number variations of other IGFBPs also have different impacts on their expression in corresponding cancers, the details were displayed in Additional file 1: Table S6.

Table 4 Correlation between IGFBPs CNV and IGFBPs expression

Verification of IGFBPs expression at mRNA and protein levels

Validation of the mRNA expression levels of IGFBPs by qRT-PCR

In order to verify the analysis results of TCGA at the mRNA level, we selected the differentially expressed IGFBPs in some tumor tissues and then performed qRT-PCR experiments to verify the expression of IGFBP1, IGFBP2, and IGFBP6 in 67 pairs of gastric cancer and adjacent tissues, as well as 44 pairs of colorectal cancer and adjacent tissues. The results indicated that, consistent with the results of TCGA analysis, the expression of IGFBP2 (P < 0.001) and IGFBP6 (P = 0.005) in gastric cancer tissues was lower than that in normal gastric tissues; the expression of IGFBP6 (P < 0.001) in colorectal cancer tissues was also lower than that in normal colorectal tissues (Fig. 14). Besides, the verification results of IGFBP1 in gastric cancer and colorectal cancer did not reach statistical difference, but the expression trend in colorectal cancer was consistent with the analysis results of TCGA.

Fig. 14
figure 14

Validation of the mRNA expression levels of IGFBPs using the qRT-PCR experiments. P < 0.05 was considered statistically significant. A The relative expression of IGFBP1 in gastric cancer. B The relative expression of IGFBP2 in gastric cancer. C The relative expression of IGFBP6 in gastric cancer. D The relative expression of IGFBP1 in colorectal cancer. E The relative expression of IGFBP6 in colorectal cancer

Validation of the protein expression levels of IGFBPs by immunohistochemistry

Based on the results of the qPCR verification and considering the lack of IGFBP2 protein expression data in the Human Protein Atlas (HPA), we performed IHC staining to verify the protein expression of IGFBP2 in gastric cancer tissues. The results showed that IGFBP2 had no obvious staining in the pits of the normal stomach (Fig. 15A-1) but was labeled dark brown in the normal gastric antrum glands (Fig. 15A-2). Besides, there was no obvious positive staining on the moderately differentiated adenocarcinoma (Fig. 15A-3) and signet ring cell carcinoma (Fig. 15A-4) of the stomach. Figure 15A-5 shows the staining of continuous gastric cancer tissues and adjacent normal tissues. From a quantitative point of view, IGFBP2 protein expression was lower in gastric cancer tissues compared with in normal tissues (P < 0.001) (Fig. 15B-1). The expression of IGFBP2 in matched normal tissues is higher than that in gastric cancer tissues (Fig. 15B-2).

Fig. 15
figure 15

The expression levels of IGFBP2 protein in gastric cancer tissues and adjacent normal tissues. A The expression levels of IGFBP2 protein in gastric tissues by IHC staining: (A-1) Gastric pit tissue (200x), the arrow indicates the negative staining of IGFBP2 protein in gastric pit cell. (A-2) Normal gastric antrum tissue (200x), the arrow indicates the positive staining of IGFBP2 protein in gastric antrum cell. (A-3) Moderately differentiated GC tissue (200x), the arrow indicates the negative staining of IGFBP2 protein in GC cell. (A-4) Gastric signet ring cells carcinoma (200x), the arrow indicates the negative staining of IGFBP2 protein in signet ring cell. (A-5) GC tissue and adjacent normal gastric tissue (100x), the solid arrows indicate the negative staining of IGFBP2 protein in GC cells, and the dotted arrow indicates the positive staining of IGFBP2 protein in normal gastric cell. B Statistical quantification of the HSCORE of IGFBP2 staining in the indicated tissues: (B-1) The protein expression level of IGFBP2 in adjacent normal gastric tissues was significantly higher than that in GC tissues. (B-2) In the paired GC tissues and adjacent normal tissues, the protein expression of IGFBP2 in the normal tissues was significantly higher. GC, gastric cancer; HSCORE, histological score

Frontier trend analysis of IGFBP related research from 2000 to 2021

From 2000 to 2021, 3543 studies fulfilled the search criteria in total. Overall, the publication of related researches peaked at 224 articles in the year 2007 and then leveled off, while the annual citation showed an increasing trend, reaching a peak of 9514 times in year 2021. Co-occurrence analysis of keywords showed that a total of 104 keywords appeared more than 40 times (Table 5). All the keywords were divided into 3 clusters, with the following occurring more frequently: expression, risk, apoptosis, proliferation, survival, growth, breast cancer, prostate cancer, colorectal cancer, lung cancer, etc. The above keywords appeared in the center of each cluster, and had higher weights in terms of size and strength of relationship compared with other keywords (Fig. 16A). The most recent keywords of IGFBP cancer-related studies were as follows: invasion, poor-prognosis, migration, prognosis, biomarker, target, survival, and therapy, etc. (Fig. 16B).

Table 5 Clusters of the top 104 keywords
Fig. 16
figure 16

Keywords co-occurrence analysis. A The co-occurrence network visualization map of the top 104 keywords. The visualization map of publications for 104 keywords forming 3 collaborating clusters (nodes with the same color); a node represents a keyword, the size of the node represents the number of publications, a link shows collaboration, and the distance and the thickness of the link between nodes show the relative strength of the relation. B The co-occurrence overlay visualization map of the top 104 keywords. The color of node represents the average publication year of the keywords, and the blue to yellow represents the average publication year from early to late

Discussion

IGF is one of the important somatomedins in humans, which plays an important role in regulating the proliferation, differentiation and apoptosis of tissue cells, the growth and development of the body and the development of tumors [27]. The IGF system is commonly found in human circulation and various target tissues, and its main components include IGF ligands (IGF-I, IGF-II and insulin), receptors (IGF-IR, IGF-IIR and insulin receptor) and IGFBPs [28]. The IGFBPs mainly include IGFBP1-6 with high affinity for IGF and IGFBP7 and IGFBPL1 with low binding to IGF. In addition to the function of binding IGF, each IGFBP has its own unique role. In the human circulation, IGFBPs function primarily by binding to IGF-I/II to form complexes, thereby prolonging the half-life of IGFs and regulating their clearance, as well as controlling the transport of IGFs in plasma and the diffusion or efflux of IGFs in the vasculature [29]. In tissues, IGFBPs can enhance the effect of IGFs by directly interacting with cell surface or extracellular matrix to localize IGFs near their receptors and regulate their binding to receptors, or they can inhibit the action of IGFs by limiting their receptor access [30]. In recent years, it has been found that IGFBPs can act outside tumor cells, in the cytoplasm or in the nucleus, and play different functions on the proliferation and apoptosis of tumor cells under different conditions [5]. Several studies have found that circulating IGFBPs concentrations or tissue mRNA expression levels are associated with the risk or prognosis of certain cancers. The special properties of IGFBPs make them closely related to the complex signaling pathways and various molecular features of tumors. The role of IGFBPs in the carcinogenesis and development of tumors is increasingly attracting attention.

In the present study, we utilized multi-source data to characterize the expression profiles of IGFBP gene family in pan-cancer, and its relationship with prognosis, cancer-related pathways, TMB, MSI, tumor stemness, tumor immune microenvironment, comorbidity factors, etc. The results of the database analysis were validated using immunohistochemistry and qRT-PCR. Our study indicates that IGFBPs are differentially expressed in multiple cancers and affect diverse tumor prognosis. The expression of IGFBPs is correlated with multiple cancer hallmark pathways and molecular features of tumors. On this basis, we conducted a comprehensive bibliometric analysis and literature review on the role of IGFBP family in the occurrence and development of pan-cancer. As far as we know, our study is the first “quantitative analysis” based systematic review supported by bioinformatics and bibliometrics to provide a pan-cancer wide characterization about IGFBP gene family in tumorigenesis and progression. These findings provide new clues for identifying novel biomarkers and therapeutic targets for cancer diagnosis and treatment.

IGFBP gene family was differentially expressed in various cancers, which makes IGFBPs available as new cancer biomarkers and therapeutic targets

The current study extracted multi-level data from TCGA, Oncomine, HPA, and CCLE to analyze the expression profiles of IGFBPs at the mRNA, protein, and cell levels in 33 tumors. Our examination of the TCGA data and verification using the Oncomine database revealed that the expression of IGFBPs at the mRNA level varies in multiple cancer types. In stomach adenocarcinoma, IGFBP1 is upregulated, but IGFBP2 and IGFBP6 are downregulated. In COAD, IGFBP1 is significantly highly expressed, while the expression of IGFBP6 is low. In LIHC, the expression of IGFBP1, IGFBP2, IGFBP3, IGFBP4 are all low. Besides, the qRT-PCR experiment confirmed the low expression of IGFBP2 and IGFBP6 in gastric cancer and the low expression of IGFBP6 in colorectal cancer. According to HPA immunohistochemistry findings, IGFBP staining intensity in 20 types of tumor tissues was primarily low or medium at the protein expression level. Besides, the IHC staining further confirmed the low expression of IGFBP2 protein in gastric cancer tissues. Furthermore, according to the results from CCLE analysis, IGFBP2, IGFBP3, IGFBP4 and IGFBP6 are highly expressed in most cancer cell lines at the cell level. Previous studies have been conducted to explore the relationships between IGFBP family and multiple cancers. The expression of IGFBP1 has close relationship with ovarian cancer, breast cancer, liver cancer, gastric cancer, pancreatic cancer, prostate cancer and esophageal squamous cell carcinoma [31,32,33,34,35,36]. IGFBP2 is one of the most common and abundantly expressed IGFBPs in human cancers [7], which was found to be highly expressed in glioma, cervical cancer, prostate cancer, breast cancer, lung cancer, and other cancers [7, 37,38,39,40]. The expression of IGFBP3 in ovarian cancer, lung cancer, breast cancer, prostate cancer, and gastric cancer has also been reported [41,42,43,44,45,46,47]. IGFBP4 has been mainly studied in colon cancer, lung cancer, gastric cancer, ovarian cancer and breast cancer [47,48,49,50,51]. IGFBP6 expression was found to be lower in gastric cancer, nasopharyngeal cancer, colon cancer and breast cancer [52,53,54,55]. IGFBP7 is closely associated with bladder cancer, gastric cancer, breast cancer, lung cancer and acute myeloid leukemia [56,57,58,59,60,61]. However, there are relatively few studies on the expression of IGFBP5 and IGFBL1 in cancer. In addition, we found that previous studies concentrated more on the expression of IGFBP genes in gastric cancer, ovarian cancer, breast cancer, and prostate cancer, there are still some gaps to be filled for IGFBPs in other cancer types, for example, the present research is the first to reveal the expression levels of IGFBP family in renal clear cell carcinoma, head and neck squamous cell carcinoma, and endometrial carcinoma. Our study comprehensively and systematically describes the expression profile of IGFBPs at different levels in a variety of cancers, which suggesting that IGFBPs may act as cancer biomarkers and therapeutic targets.

Besides, we found that IGFBPs are related to age and gender in various cancers. Age has a great influence on the expression of IGFBPs in cancer patients. The expression of each IGFBPs is different in the age of at least five types of cancer, and the expression trend is different. For example, in CESC, the expression of IGFBP1, IGFBP2, IGFBP6, IGFBP7, and IGFBPL1 is lower in elder patients (> 60); while in LGG, IGFBP1, IGFBP2, IGFBP3, and IGFBP5 are all highly expressed in elder patients. Relatively speaking, gender has little effect on the expression of IGFBPs in cancer patients. IGFBP1 is highly expressed in male patients in GBM and HNSC, while IGFBP4 is highly expressed in female patients in BRCA, DLBC, KIRP, LIHC and SARC. This further illustrates the specificity of IGFBPs.

IGFBPs expression was significantly associated with a variety of tumor prognosis through positive or negative two distinct styles, which makes IGFBPs available as biomarkers for prognosis

The results of this study's analysis of the connection between IGFBPs and pan-cancer prognosis showed that the expression of IGFBPs affects the prognosis of various cancers in distinct ways. IGFBP2 and IGFBP7 expression may impact the prognosis of more cancer types. Besides, most members of the IGFBP family could affect the prognosis of UVM and LGG.

It has been shown that ovarian cancer and breast cancer patients with high expression of IGFBP1 have a better prognosis [54, 62]. High mRNA expression of IGFBP2 is associated with better survival in patients with breast cancer [54] and poor survival in glioblastoma patients [63,64,65]. IGFBP3 expression is associated with the survival and proliferation of glioblastoma [66]. High IGF1/IGFBP3 ratio at diagnosis shows better prognosis in renal cell carcinoma [67]. Elevated IGFBP3 expression correlates with poor prognosis of pancreatic ductal adenocarcinoma, breast cancer, oral squamous cell carcinoma, papillary thyroid carcinoma and colorectal cancer [45, 54, 68,69,70,71,72]. Patients with high IGFBP4 mRNA expression levels have better overall survival and disease-free survival rate in breast cancer [51]. IGFBP5 is considered to be a positive indicator for the prognosis of breast cancer. For ER-positive breast cancer patients, low expression of IGFBP5 is associated with a better prognosis [51, 54, 73]. IGFBP5 is negatively related to patient survival in kidney renal papillary cell carcinoma [74]. Overexpression of IGFBP6 is associated with a better prognosis in breast cancer [54, 75]. IGFBP6 may serve as an independent prognostic factor for the prognosis of glioma patients, and overexpression of IGFBP6 can induce apoptosis of glioma cells [76, 77]. In colorectal cancer, low IGFBP6 expression is associated with poor survival [78]. Increased IGFBP7 expression is related to a better survival in breast cancer and CHOL [59, 79].

As secreted proteins, a number of studies have analyzed the correlation between circulating IGFBP concentrations and the risk or prognosis of various cancers. However, due to the complexity of the endocrine IGF–IGFBP system, many studies have produced inconclusive or even contradictory results. For example, circulating IGFBP1 and IGFBP2 appear to be involved in regulating acute bioavailability of IGFs, and when their expression is inhibited, they may increase the mitogenic activity, thereby increasing the risk of cancer [80]. However, studies of serum IGFBP1 and IGFBP2 levels associated with cancer risk or outcome have generally failed to show a consistent and definitive association [5]. Therefore, the expression of IGFBPs at mRNA or protein levels in tissues may be relatively more suitable as tumor risk or prognostic markers.

IGFBPs expression was associated with multiple signal transduction pathways, which revealed relevant mechanism behind IGFBPs function

In order to elucidate the molecular significance of IGFBPs in the carcinogenic process, we explored the correlation between the expression of IGFBPs and signal transduction pathways. The results showed that the expression of IGFBPs is associated with a variety of cancer pathways, such as epithelial-mesenchymal transition, apoptosis, angiogenesis, TGF-β signaling, and IL2-STAT5 signaling pathway. In addition, by summarizing the number of cancer pathways related to the expression of IGFBPs, it is found that the pathways activated by IGFBPs are far more than the pathways inhibited. Therefore, IGFBPs are more likely to have carcinogenic effects. There have been previous researches indicating the associations between IGFBPs and multiple cancer-related pathways. For example, IGFBP2 can serve as a potential marker for PI3K/Akt pathway activation and PTEN status in prostate cancer as well as glioblastoma [81]. IGFBP3 promotes apoptosis and inhibits Hepg2 cell proliferation by inhibiting the signaling pathways including IGF-1R-mediated PI3K/AKT and Wnt/β-catenin, respectively [82]. In triple-negative breast cancer, IGFBP3 is involved in the PARP-dependent DNA damage repair pathway through interacting with SFPQ and NONO [83]. IGFBP4 activates the signaling pathway Wnt/β-catenin and induces M-CAM expression in human renal cell carcinoma [84]. In the present study, we found that the expression of IGFBP3, IGFBP4, IGFBP5, IGFBP6, IGFBP7 has a significant correlation with the activation of angiogenesis, epithelial-mesenchymal transition, apical junction, TGF-β signaling, KRAS signaling, and other signaling pathways. Previous studies have revealed that the above pathways are closely correlated with tumor progression [85,86,87,88,89,90].

As mentioned previously, IGFBPs regulate the activity of IGFs in a variety of ways, thereby affecting their binding to IGF receptors, and thus playing a complex regulatory role in the occurrence and development of tumors. Other studies have found that certain IGFBPs can affect tumor progression through pathways independent of IGFs, and the mechanisms are often complex and specific. For example, there are p53 response elements on the IGFBP gene which can be induced when DNA damage results in the activation of p53, and thus the IGFBP may act directly on DNA repair mechanisms [28, 91]. Currently, at least IGFBP2 and IGFBP3 have been shown to promote DNA repair [92, 93]. By regulating enzymes involved in sphingolipid metabolism, IGFBPs can affect the balance between growth-stimulating lipids (such as sphingosine-1-phosphate) and growth-inhibiting lipids (such as ceramides). This may be a key mechanism by which IGFBP3, IGFBP5, and possibly other IGFBPs may regulate the balance between cell death and survival in response to some cancer therapies [5]. Therefore, based on the role of IGFBPs in cancer and the complexity of its mechanism, our study demonstrated the relationship between IGFBP gene and cancer-related signaling pathways from the perspective of pan-cancer, providing new clues for exploring cancer-related pathways and helping to understand the specific effects on tumor cells.

The correlation and interaction between IGFBPs were also analyzed in pan-cancer. The results showed that, on the whole, the correlation between IGFBP3-6 was strong and mostly positive, while the correlation between IGFBP1, IGFBP2 or IGFBPL1 and other IGFBPs was poor. In addition, IGFBPs often showed positive correlation in different cancers. The interaction diagram established by GENEMANIA and STRING showed that there were extensive interactions between IGFBPs, especially IGFBP3 and IGFBP5, while the interaction between IGFBPL1 and other IGFBPs was relatively poor.

IGFBPs expression was significantly correlated with multiple molecular features of tumor, which shed new light on the new strategies for tumor targeted therapy

TMB and MSI are currently clinically important tumor biomarkers for the sensitivity of immunotherapy [94,95,96]. Hence, we investigated the relationship between IGFBPs expression and TMB as well as MSI in 33 cancer types. The results demonstrated that the expression of IGFBP genes was significantly correlated with TMB and MSI in various cancers, and most of them were inversely correlated, suggesting that IGFBPs may indirectly affect the immunotherapeutic response in these cancers. Currently, the research literature on the relationship between IGFBPs expression and TMB or MSI is relatively less. This study reveals novel horizons for exploring the role of IGFBPs in cancer from the perspective of immunotherapy response.

As is well known, tumor stemness contributes to tumorigenesis, progression and drug resistance. The stemness index could be used to predict clinical features and outcomes for cancer patients. Stemness index was found to be increased in metastatic tumors, application of stemness scores shows potential target for cancer therapy [97, 98]. In the current study, we analyzed the correlation between IGFBPs expression and tumor stemness in 33 cancer types. DNAss and RNAss are novel stemness indices for evaluating the degree of oncogenic dedifferentiation proposed by Malta, T. M. et al. [21]. Tumor stemness score is related to multiple oncological processes. For example, higher stemness score is associated with increasing tumor dedifferentiation and biological activity of tumor stem cells. Stemness indices can be used for stratification of undifferentiated carcinoma. Stemness score is higher in metastatic cancers and correlated with intertumoral heterogeneity as well as tumor microenvironment [21, 97, 99]. The results of our study showed that the expression of most IGFBP genes was negatively correlated with the DNAss and RNAss in pan-caner, indicating that the higher expression of these IGFBPs, the less stemness features of tumor cells and the higher degree of tumor differentiation. Currently, there are few reports on the relationship between IGFBP genes and tumor stemness characteristics in human cancer. The present study provides new horizons for the IGFBP family in the development of new cancer therapies based on stem cells.

Multiple researches have revealed that tumor immune microenvironment is associated with the progression, recurrence and metastasis of tumor [100, 101], understanding the immune microenvironment of tumor patients will hopefully lead to improve the ability to predict and guide immunotherapy responses and to reveal new therapeutic targets [102, 103]. The current study analyzed the correlation between IGFBPs expression and tumor immune microenvironment. First, we used the ESTIMATE algorithm for immune and stromal score calculation. The results indicated that the expression of IGFBP genes has significantly negative or positive correlations with stromal, immune and ESTIMATE scores in various cancers. Indicating that IGFBPs expression is linked to the content of stromal and immune components in tumor tissue and purity of tumor. Previous studies have partly confirmed our findings. IGFBP2 has been reported to participate the macrophage based immunosuppressive microenvironment in pancreatic ductal adenocarcinoma [104, 105]. IGFBP3 can promote breast tumor growth by regulating tumor immune microenvironment [106]. IGFBP6 is involved in migration, immune escape and inflammation in glioblastoma [107]. Deletion of IGFBP7 presents a pro-inflammatory and immunosuppressive microenvironment and promotes the development of hepatocellular carcinoma [108].

Immune cell infiltration in tumor tissues plays important roles in tumor immune microenvironment. The present study indicated that the association between the expression of IGFBPs with immune cell infiltration was significant in 26 cancer types. Previous studies have found that IGFBP2 modulates the expression of PD-L1 through activating the EGFR-STAT3 signaling pathway, thereby exerting an anti-melanoma effect [109]. The effect of IGFBP3 in promoting the growth of breast cancer cell is related to accumulation of damaged T cells [106]. IGFBP6 protein has the function of inducing chemotaxis and enhancing the ability of neutrophils to burst and degranulate [110]. IGFBP7 can regulate the mechanism of antigen presentation, thereby generating a response of anti-tumor immune. Absence of IGFBP7 will reduce antigen presentation and reduce infiltration levels of CD4 + , CD8 + T cells, and NK cells [108]. Our research first found significant correlations between the expression of IGFBPs and various immune cell infiltration such as mast resting, macrophages M2, and naïve B cells in pan-cancer. All these findings indicate that IGFBPs can affect cancer progression by regulating immune function. Our research provides comprehensive foundation for future research on the molecular mechanisms of IGFBPs in tumor immune microenvironment.

The mutation frequency and copy number variation of IGFBPs were generally low but had a greater impact on their expression, which revealed the potential regulation mechanism of IGFBPs expression based on their own genetic variation

The results of the mutation and CNV frequency analysis showed that the general average mutation rate of IGFBPs was 0–5.7%, of which IGFBP1 and IGFBP5 showed relatively high mutation frequency, and IGFBPL1 showed the lowest mutation frequency. The mutation frequency of IGFBPs was less than 2% in most tumors except for endometrial cancer, a type of cancer with high global mutation rates [111]. In addition, no IGFBP mutation was found in CHOL, THYM and PCPG. Studies have shown that each of IGFBP1, 3, 4, 5, and 6 is inserted or deleted in variable domains corresponding to the codon hotspot to increase mutation rates [112]. CNV frequency analysis showed that IGFBPs had a lower CNV degree in various cancers. Besides, relatively more IGFBPs mutations were found in gastric, lung, and ovarian cancer cell lines. All the results as mentioned demonstrated the variation features of IGFBPs among different cancers.

Furthermore, this study also analyzed the effects of IGFBPs genetic variation on IGFBPs expression. We found that only the mutation of IGFBP3 will affect its expression in endometrial cancer, and the mutation of IGFBP6 will affect its expression in skin melanoma. Compared with mutations, the CNV of IGFBPs has more influence on its expression in cancer, and the expression is mostly up-regulated with increased copy number. Existing studies have also proved the increase of copy number usually matches the up-regulation of expression [113]. To sum up, our findings indicate that genetic change can play crucial roles in regulating the expression of IGFBPs, and the regulation mechanism is worthy of our further research and exploration.

IGFBP-related research has been paid more and more attention, and research hotspots mainly focus on the potential of IGFBPs as tumor diagnostic and prognostic markers, and develop related therapies accordingly

In the current study, a bibliometric and visual analysis was conducted to present the current research status and hot topics of IGFBP related studies in cancer. The results showed that the annual publication of relevant studies peaked in 2007 and then leveled off, while the number of citations maintained an increasing trend, indicating that the related studies have increasing academic value and are gaining attention. The co-occurrence network divided the keywords appearing in the top 104 into 3 clusters, and by analyzing the structure of the keywords and their intrinsic connections, we found the hot topics of related studies mainly focused on that the expression of IGFBPs was correlated with apoptosis, proliferation, differentiation, metastasis and invasion of tumor cells; IGFBPs were closely related to breast, prostate, colorectal, lung and ovarian cancers; and that IGFBPs were associated with cancer risk and prognosis and may serve as corresponding markers and therapeutic targets. The overlay visualization map of keywords showed the trends of relevant research hotspots over time. The results indicated that, with the passage of time, the keywords: invasion, poor-prognosis, migration, prognosis, biomarker, target, survival, and therapy have become increasingly popular and suggested that relative researches were more focused on the potential of IGFBPs as tumor diagnostic and prognostic markers, and develop related therapies accordingly.

In addition, through the review and quantitative analysis of the previous literature, we found that IGFBPs play an important role in the effects of cancer, including bioenergetics and metabolism, metastasis and apoptosis. Therefore, we have conducted a brief literature review on the influence of IGFBP family on the biological behavior of cancer, and compiled the relevant contents into Additional file 1: Table S7.

Limitations

As a pan-cancer analysis of gene family, this study has not explored a single gene or a single cancer in depth and detail. However, it has comprehensively introduced the expression, prognosis, various molecular characteristics and mutations of IGFBP family in pan-cancer for the first time, which can help macroscopically understand the role of this family in pan-cancer. Furthermore, a PCA analysis may be useful to better explain the results indicate both positive and negative effects of different IGFBPs in the case of different cancer types. However, considering that the purpose of this study is to comprehensively elaborate the role of IGFBP gene family in pan-cancer, we did not conduct further PCA analysis from a specific point of view. In addition, most of the data used in this study were obtained from public databases, and there is lack of direct experimental evidence although we have carried out qRT-PCR and immunohistochemistry experiments to verify part of our results, which increases the reliability of the bioinformatics analysis.

Conclusions and future perspectives

In conclusion, we conducted a comprehensive and systematic analysis on the expression and mutation profiles of IGFBP gene family, and its relationship with cancer-related pathways, prognosis, TMB, MSI, tumor stemness, tumor immune microenvironment in pan-cancer. At the same time, we conducted a bibliometric analysis and literature review of the latest research progress of IGFBP, and discussed the research results in detail. Our study expands the research ideas for exploring the potential of IGFBP family to be new biomarkers for cancer diagnosis or prognosis, and shed new light on relevant mechanism research.

Currently, although studies have found the multifunctional roles of IGFBPs in various cancers, there is still a long way to go before IGFBPs can be used as biomarkers for tumorigenesis or novel targets for tumor therapy, and the underlying mechanisms behind the complex functions of IGFBPs still need to be explored. Recently, numerous researches have confirmed that certain IGFBPs, particularly IGFBP2 and IGFBP3, are key tumorigenic factors and viable therapeutic targets in multiple cancers, but even so, no tumor therapies targeting IGFBP family genes are currently available clinically. IGFBP genes have been found to be involved in the oncogenic process in various cancer types, and there is evidence that IGFBPs play essential roles in tumorigenesis and suppression by regulating signal transduction pathways in tumor cells. On this basis, our study macroscopically demonstrated the relationship between IGFBP genes and cancer-related signaling pathways from a pan-cancer perspective, providing a foundation for subsequent in-depth exploration of the regulation mechanism in tumor progression affected by IGFBP genes. Recently, tumor microenvironment and immunotherapy has become research hotspot. As secreted proteins, the role of IGFBPs in immune regulation should be noted. Several studies have been conducted to develop the potential of IGFBPs as immune biomarkers and to explore its relationship with immune cells in cancer [114,115,116,117,118]. Our study further links the IGFBP family with tumor immune microenvironment and the levels of immune cell infiltration in 33 types of cancer, providing more clues for further studies on the mechanisms of immune regulation and the development of tumor immunotherapies. In addition, we analyzed the mutational profile of IGFBPs in pan-cancer and the effect of their mutations on their own expression. Based on previous studies, no cancers have been identified that can be attributed to IGFBP gene mutations, but alterations in IGFBP expression can be observed in many cancers. This may be explained by that the expression and biological activity of the IGFBP gene is regulated by post-translational modifications such as methylation, glycosylation and phosphorylation, as well as the differential localization of IGFBP in the pericellular and extracellular spaces [5, 30]. For example, hypermethylation of IGFBP3 has been observed in non-small cell lung cancer, prostate cancer, ovarian cancer, and hepatocellular carcinoma [119,120,121,122]. The regulation of IGFBP expression by epigenetic alterations and its mechanisms, as well as the possible changes in tumor biological behavior it may cause, still need to be investigated in depth.

Through our comprehensive analysis of the role of IGFBP gene family in pan-cancer, we found that the expression of different IGFBPs in different cancers has specificity, and its influence on prognosis, TMB, MSI and other molecular characteristics, as well as cancer-related pathways is also specific. We speculate that this specificity is mainly due to the structural specificity of different IGFBPs and the temporal-spatial specificity during development. For example, in terms of sequence structure, IGFBP1-6 and IGFBP7/IGFBPL1 are obviously in two different branches, which may be the reason why the expression patterns of IGFBP7 and IGFBPL1 are often different from other IGFBPs. In addition, in the process of ontogenesis, different IGFBPs appear in different tissues and organs at different times, which also leads to specificity. For example, STAD and COAD are both digestive system cancers, so they usually show the same expression trend. According to our study, IGFBP1 is low expressed in both two cancers, while IGFBP6 is high expressed. The same trend is also shown in KICH, KIRC and KIRP. A deep understanding of the commonality and specificity of IGFBPs genes is of great significance for a comprehensive understanding of the gene family.