Introduction

Liver fibrosis is a common event that triggers wound-healing in the context of chronic injury. Fibrosis may progress to cirrhosis and primary liver cancer [1]. Studies reported that liver fibrosis is associated with the deposition of extracellular matrix (ECM) proteins [2]. The activation of hepatic stellate cells (HSCs) is thought to be the primary factor causing liver fibrosis [3,4,5,6]. Liver fibrosis can be detected early and reversed by treatments [7]. In the past few decades, investigators have focused substantial attention on the molecular mechanisms of liver fibrosis [8, 9]. Nevertheless, precise pathogenesis requires further exploration [10]. Definitive treatment of liver fibrosis requires identifying novel therapeutic strategies that include novel biomarkers and targets.

Long non-coding RNAs (lncRNAs) are regulated by several mechanisms, including chromatin regulation, epigenetic modification promoter activity regulation, and post-transcriptional mechanisms [11,12,13,14]. Studies indicated that lncRNA participates in the formation or inhibition of liver fibrosis [15,16,17]. LncRNAs regulate gene expression in acis or trans manner [18, 19]. Exploring the mechanisms of interactions between lncRNA and miRNA can generate novel strategies for liver fibrosis treatment [20].

The number of lncRNA families is vast, and it remains unknown whether there are other lncRNAs involved in liver fibrosis. Therefore, in this study, the expression of lncRNAs and mRNAs in liver fibrosis were identified using the Gene Expression Omnibus (GEO) database, and a co-expression network was constructed to elucidate the regulation and targets of lncRNAs. Then, we used Gene Ontology (GO) terminology and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis to identify the functions of the lncRNAs associated with liver fibrosis. Finally, we used quantitative real-time polymerase chain reaction (qRT-PCR) to confirm the results.

Material and methods

Data collection

This study intended to preliminarily explore the regulatory mechanisms of ceRNA during liver fibrosis progression by analyzing the gene expression differences and co-expression analysis between the healthy group and a liver fibrosis group. GEO, a public international and functional genomics data repository, is used for high-throughput microarray and next-generation sequences [21]. Therefore, GSE12392 and GSE84044 were mined for bioinformatics analysis by searching the keywords "liver fibrosis" and "Homo sapiens" in the GEO database. In the dataset, lncRNA and mRNA expression in six healthy individuals and six patients were measured using the Agilent MicroArray V4 platform.

DEGs analysis

The DEGs were located using edgeR [22] in the R Bioconductor package. The expression data in GSE12392 were analyzed using the LIMMA package of the R language, and the probe ID was converted into GeneSymbol according to the CORRESPONDING GPL file of the probe. The mRNA or lncRNA were selected from the probe for differential expression and gene co-expression network analysis. The criteria for selecting DEGs were P < 0.05 and log2 Fold-Change less than -1 or greater than 1.

WGCNA analysis

Gene co-expression network analysis was used to analyze gene expression in the two groups at the mRNA and lncRNA levels in combination with basic clinical information of the samples (i.e., age, gender, and whether liver fibrosis occurred) aiming to explore gene co-expression modules related to disease occurrence.

LncRNAs target gene prediction

The results of co-expression analysis and differential expression analysis were combined to obtain common lncRNAs, that is, to identify DEGs and co-expressed genes associated with clinicopathological features. StarBase and bioinformatics were used to predict the target genes of lncRNAs.

MiRNA target gene prediction

StarBase and bioinformatics methods were used to predict the target genes of human miRNAs, and the default parameters of the official database were adopted.

Establishment of the ceRNA regulatory network

An lncRNA-miRNA-mRNA network was constructed according to ceRNA theory [23]. Based on the prediction results of these target genes, a ceRNA regulatory network of lncRNA-miRNA-mRNA was constructed by combining mRNA co-expression network analysis (i.e., genes co-expressed between mRNA level and liver fibrosis) and mRNA differential expression analysis (i.e., genes with differential expression). We used Cytoscape 3.7.2 [24] to visualize the lncRNA-miRNA-mRNA network, and the CytoHubba plugin in Cytoscape 3.7.2 [25] was used to calculate all node degrees.

GO and pathway enrichment analysis

GO analysis [26] included biological process, cellular component, and molecular function, generated using the bioinformatics tool DAVID [27] (P < 0.05). This analysis was used to examine the unique biological significance of the high-throughput transcriptome. KEGG enrichment analysis was used to predict the biological pathways [28]. The results were visualized using the R language.

Analysis of the PPI network

The PPI information for the common DEG network was evaluated using the search tool STRING [29]. Any potential correlations between these DEGs were analyzed using Cytoscape [30]. The PPI network modules were recognized using the Cytoscape plugin MCODE, and only those were presented based on a node degree ≥ 3.

Hub gene selection

We analyzed the PPI network using the Cyto-Hubba plugin of Cytoscape 3.7.2 and selected candidate hub genes with top node degrees. Subsequently, hub genes associated with liver fibrosis were identified using GSE84044.

Human samples

From the Xiangya Hospital of Central South University, ten healthy subjects and 16 patients with liver fibrosis were recruited. Informed consent was obtained, and ethical approval was obtained from the Ethics Committee of Xiangya Hospital. Human liver tissues were immediately frozen in liquid nitrogen and stored at − 80 °C.

Cell culture experiments

LX-2 cells were obtained from our laboratory and cultured in DMEM containing 1% fetal bovine serum and 1% penicillin/streptomycin. LX-2 cells were passaged using trypsin. LX-2 cells were stimulated by TGF-β1 (0, 5, 10, 20, 40 ng/ml; Sigma, Cat No. SAB4502958) for 48 h. All cells were stored in an incubator at 37 °C in 5% CO2.

QRT-PCR

Total RNA was isolated from liver fibrosis human samples and LX-2 cell lines using TRIzol reagent (Invitrogen, CA, USA). The cDNAs were synthesized using a commercial kit (Bio-Rad, Hercules, CA). Gene expression was measured using the ABI 7900HT Fast Real-Time PCR System. Relative mRNA expression levels were calculated using the 2−ΔΔCT method. The following primer sequences were used: lncRNA PCBP1-AS1, forward, 5'-ACTACTCAGTCAATTGCTCCA-3', reverse, 5'-ATTTCCTTACTGACCTGCAT-3'; lncRNA AC100861, forward, 5'-GCAC ATGACACGGGATGAGA-3', reverse, 5'-GGCTTTCGGGAGGCTGATT A-3'; lncRNA TEX41, forward, 5'-TGGCCAAGAGACAACACCAA-3', reverse, 5'-GGCAGAGTGAGTCCAAAGG-3'; GAPDH, forward, 5'-TGGAAATCCCATCACCATCT-3', reverse, 5'-TGGACTCCACGACGTACTCA-3'.

Statistical analysis

Data obtained from three independent experiments were expressed as mean ± standard deviation. The t-test was performed to compare differences between two groups, and one-way analysis of variance was used for multi-group comparisons. Differences with p < 0.05 were considered statistically significant in multiple testing.

Results

Identification of differentially expressed genes (DEGs)

The differential expressions of mRNA and lncRNA in the GSE123932 data set were analyzed using the R language's LIMMA package. The standard of differential gene screening was P < 0.05, log2 fold change > 1 or < -1. There were 210 cases with significantly differentially expressed lncRNAs (Figs. 1A and 2A); there were 333 cases of significantly differentially expressed mRNAs (Figs. 1B and 2B).

Fig. 1
figure 1

Volcano map demonstrating the differential expression analysis of lncRNAs (A) and mRNAs (B). Red and blue dots were significantly high and low expression in the disease group, respectively

Fig. 2
figure 2

Heat map illustrating differential expression analysis of lncRNAs (A) and mRNAs (B)

Weighted gene co-expression network analysis(WGNA)

mRNA. None of the 12 samples showed abnormal gene expression (Fig. 3A). Cluster analysis of gene expression profiles was performed by combining clinical information of samples (Fig. 3B). The experimental results converged when the Soft threshold (SFT) value was 8 (Fig. 3C, D). Based on the WGCNA documentation, the SFT value was set to 8 to construct the co-expression network, and 11 modules were obtained (Fig. 3E, F). Genes in brown, yellow, and turquoise modules are related to disease occurrence (P < 0.05, Fig. 3G).

Fig. 3
figure 3

Sample cluster analysis (A and B); Calculation and selection of soft threshold (C and D); Cluster analysis of modules (E and F); Correlation analysis between modules and traits (G).

A total of 246 DEGs were obtained from disease-related modules by exporting the genes from these three modules and combining them with differential expression analysis (Data collection Section) (Fig. 4).

Fig. 4
figure 4

Venn diagram of joint analysis

LncRNA

None of the 12 samples showed abnormal gene expression (Fig. 5A). Cluster analysis of gene expression profiles was performed by combining clinical information of samples (Fig. 5B). The experimental results converged when the SFT value was 8 (Figs. 5C, D). According to the WGCNA documentation, the SFT value was set to 7 to construct the co-expression network, and eight modules were obtained (Figs. 5E, F). Genes under the green module were significantly correlated with disease occurrence (P < 0.05, Fig. 5G).

Fig. 5
figure 5

Sample cluster analysis (A and B); Calculation and selection of soft threshold (C and D); Cluster analysis of modules (E and F); Correlation analysis between modules and traits (G)

The co-expressed genes in the green module were exported and combined with the differential expression analysis results of lncRNA (Data collection Section), which obtained 57 DEGs (Fig. 6).

Fig. 6
figure 6

Venn diagram of joint analysis

Competing endogenous RNA (ceRNA) network construction

LncRNA target gene prediction

StarBase was used to predict 57 lncRNAs target genes to obtain the predicted results of nine lncRNA (Table 1).

Table 1 Prediction results of some lncRNA target genes

Prediction of miRNA target genes

StarBase was used to predict lncRNA target genes-miRNAs (Table 2).

Table 2 Prediction results of miRNAs target genes

Establishment of the ceRNA regulatory network

Combined with the prediction results and difference analysis results, lncRNA-miRNA and miRNA-mRNA were identified, and Cytoscape software was used for ceRNA-network visualization (Fig. 7).

Fig. 7
figure 7

CeRNA network (hexagon: lncRNAs; inverted triangle: miRNAs; circle: mRNA)

GO and KEGG pathway analyses

GO and KEGG pathway analyses were performed on mRNA in the ceRNA network using DAVID, and the results were visualized using the R language (Fig. 8).

Fig. 8
figure 8

GO and KEGG enrichment analysis: (A) Cell component; (B) Molecular function; (C) biological process; and (D) KEGG pathway

PPI analysis

PPI analysis was performed for genes in the lncRNA-miRNA-mRNA regulatory network using STRING. The analysis results were visualized using Cytoscape (Fig. 9).

Fig. 9
figure 9

PPI (blue dots: low expression; red dots: high expression)

Hub gene validation

Using PPI network analysis of genes in the lncRNA-miRNA-mRNA regulatory network, the following hub genes were obtained (Table 3). The gene expression data in dataset GSE84044 were used to verify the expression of hub genes (genes with degree > 20 in the PPI network).

Table 3 Hub genes in the PPI network

The expressions of hub genes in the eight cases were verified using GSE840044. The expression differences of genes (CCNB1, TOP2A, KIF20A, KIF4A, KIF14, MYCN, CDCA7) were found (except MET) (p < 0.05) (Fig. 10).

Fig. 10
figure 10

The expression of hub genes: A. CCNB1; B. TOP2A; C. KIF20A; D. KIF4A; E. KIF14; F. MYCN; G. CDCA7; H. MET

The downregulation of lncRNA AC100861

To verify the critical lncRNAs involved in liver fibrosis, qRT-PCR was performed in fibrotic and healthy livers. LncRNA AC100861 mRNA expression was much lower in samples from liver fibrosis patients than healthy liver tissues (Fig. 11).

Fig. 11
figure 11

Downregulation of lncRNA AC100861 in liver fibrosis. All data are expressed as mean ± standard deviation (SD).The experiment was repeated three times. * p < 0.05, ** p < 0.01

The downregulation of lncRNA AC100861 in activated HSCs

Because activated HSCs (LX-2) are thought to be the significant cellular participants in accelerating the deposition of ECM proteins, we determined whether lncRNA AC100861 participated in HSC activation. QRT-PCR analysis showed that compared with quiescent HSCs, lncRNA AC100861 expression was lower in activated HSCs treated with various concentrations of TGF-β1 (Fig. 12).

Fig. 12
figure 12

Downregulation of lncRNA AC100861 in activated HSCs. All data are expressed as mean ± standard deviation (SD).The experiment was repeated three times. * p < 0.05, ** p < 0.01

Discussion

Liver fibrosis is considered a severe health problem worldwide. Liver fibrosis is an intense and reversible wound-healing process resulting from tissue necrosis [31, 32]. The resting HSCs transdifferentiate into myofibroblasts responsible for the deposition of ECM proteins (or collagen fibers), causing tissue scarring [7, 33]. Therefore, it is of great significance to study biomarkers for the diagnosis of liver fibrosis.

LncRNA, a small RNA molecule with 200 nucleotides, is closely involved in critical processes of cell development, proliferation, differentiation, and pluripotency [34]. With the improvement of deep transcriptome sequencing technology, the study of RNA molecules has significantly increased [35]. Although lncRNAs without protein-coding ability and bio-function, previous studies have provided ample evidence that these RNA molecules play a crucial function in controlling the expression of a gene by a series of mechanisms (such as targeted transcription) [36], affecting splicing function [37], targeting cis-acting promoter RNAs [38]. In recent decades, lncRNAs are involved in developing liver fibrosis [39,40,41,42].

LncRNAs act by regulating the binding of micro RNAs to targeted mRNA. The mechanism of co-expression and interaction between mRNA miRNAs and lncRNAs in disease regulation requires clarification. In this study, using the gene expression data from the GEO database and R language LIMMA package, we conducted bioinformatics mining on GSE12392 and GSE84044 datasets and analyzed the significantly expressed lncRNA and mRNA in tissues of patients with liver fibrosis. Then, 57 lncRNAs and 246 mRNAs were changed in liver fibrosis compared to the control group. Finally, we constructed the lncRNA-miRNA-mRNA network that included three lncRNAs, five miRNAs, and 93 mRNAs.

To further understand the effect of lncRNA activity on function, the possible biological mechanisms of lncRNA in liver fibrosis tissues were clarified through GO and KEGG. The analysis of GO functional enrichment were mainly manifested in cytosol, cytoplasm and other cellular components, protein binding, protein dimer activity, protein kinase binding and other molecular functions, as well as positive transcriptional regulation of DNA templatization and significant enhancement of extracellular matrix tissues. These results suggested that liver fibrosis-related enrichment of GO is mainly related to cell growth and tissue hyperplasia. KEGG pathway analysis revealed that this pathway is generally enriched in tumor transcriptional dysregulation, Rap1 signaling, tumor proteoglycan, mineral absorption, HTLVL infection and central carbon metabolism.

We found eight candidate hub genes using PPI analysis of genes in the lncRNA-miRNA-mRNA regulatory network using STRING. The expression of the eight hub genes was verified using GSE84044, and seven hub genes (CCNB1, TOP2A, KIF20A, KIF4A, KIF14, MYCN, CDCA7) (p < 0.05) showed significant differences between the groups (except MET). A study showed that CCNB1 and KIF20A are related to the activation of HSCs [43].

LncRNAs with more edges are hubs that participate in more ceRNA interactions, suggesting that lncRNAs participate in network organization. According to node degree and relationship, we selected the critical lncRNAs (TEX41, PCBP1-AS1, and AC100861). These lncRNAs can be used as biomarkers for the diagnosis of liver fibrosis. To speculate about the potential functions of lncRNAs, the function of annotated miRNAs or mRNAs can be studied based on the ceRNA theory. LncRNA-miRNA-mRNA can offer a holistic view of the ceRNA interactions, which permit the study of the regulatory properties of lncRNAs; however, the subnetwork of the key lncRNA-miRNA-mRNA detailed how key lncRNAs work in conjunction with competing mRNAs [44]. A growing body of evidence suggests that lncRNAs and miRNAs ‘chat’ with one another in ceRNA processes [45]. In the present study case, lncRNAs may act as sponges to isolate miRNAs from their targeted mRNAs, causing changes in the expression of their target genes [46]. The CeRNA network showed that DE lncRNAs in liver fibrosis bound to miR-103a-3p, miR-525-5p, miR-103a-3p, miR-107, and miR-873-5p, can effectively regulate a large of target genes. Of the three lncRNAs, some may competitively bind to these miRNAs through the ceRNA affect and regulate the expression of hub genes (ACCNB1, TOP2A, kif20a, dkif4a, kif14, MYCN, and CDH2).

QRT-PCR results indicated no significant differences in expression of lncRNA TEX41 or PCBP1-AS1 between liver fibrosis tissue control liver tissue; however, the mRNA level of lncRNA AC100861 (lncRNA TNFRSF10A-DT) was significantly decreased. At present, there are no reports on the correlation between the lncRNA TNFRSF10A-DT and liver fibrosis. The mRNA expression of lncRNA TNFRSF10A-DT in activated HSCs was significantly lower than in resting HSCs. Our study demonstrated that the lncRNA TNFRSF10A-DT expression is correlated with liver fibrosis and may be helpful for diagnosis and outcome prediction.

We used StarBase to predict the interactions between lncRNA and miRNA and found that the predicted target miRNA of lncRNA TNFRSF10A-DT was miR-873–5. Next, we used StarBase to obtain miRNA-targeted mRNAs and found that the predicted target genes of miR-873–5 were RCC2, UBR4 and AKR7A2. Based on the ceRNA network, lncRNA TNFRSF10A-DT/has-mir-873–5/RCC2, UBR4, and the AKR7A2 axis may construct a ceRNA network to participate in the progression of liver fibrosis.

In summary, we constructed a liver fibrosis-related lncRNA-miRNA-mRNA network to study the biological functions of lncRNA in liver fibrosis development according to ceRNA theory. LncRNA TNFRSF10A-DT was the critical lncRNA in the ceRNA network. QRT-PCR indicated that the mRNA level of lncRNA TNFRSF10A-DT in liver fibrosis patients and the expression level of activated HSCs were much lower than in healthy people and resting HSCs. These findings suggest that lncRNA TNFRSF10A-DT may serve as a biomarker and therapeutic target for liver fibrosis. We speculate that lncRNA TNFRSF10A-DT may compete with miR-873–5 to regulate the target mRNA genes (RCC2, UBR2, and AKR7A2) in the ceRNA network. The ceRNA network is helpful to improve understanding of the pathogenesis of liver fibrosis. However, the specific ceRNA mechanism of lncRNA TNFRSF10A-DT requires further research.