Background

Keloids are benign lesions with invasive growth beyond the original wound borders, in which the hyperproliferation of fibroblasts and excessive accumulation of extracellular matrix can be observed [1]. The claw-like extension of keloids not only causes cosmetic disfigurement, but also brings about uncomfortable feelings such as itch and pain, as well as local dysfunction, resulting in impaired emotional wellbeing and reduced quality of life. However, the exact mechanism of keloid scarring remains unknown. Any disorder during wound healing, including an enhanced inflammatory response, overexpression of growth-related factors, and excessive activation of fibroblasts, is conducive to abnormal collagen deposition in keloids [2]. Transforming growth factor-β (TGF-β), connective tissue growth factor (CTGF), vascular endothelial growth factor (VEGF) and the renin–angiotensin system were reported to be potential targets in keloid management considering their participation in wound healing and keloid formation [3, 4]. In addition, miRNA modulation may be a novel therapeutic solution [5]. However, neither first-line therapies nor developing treatments could guarantee satisfying curative effects, which calls for more research on effective targets. Currently, the contribution of epithelial-mesenchymal transition (EMT) to keloid overgrowth is appealing as a potential treatment target. High expression of mesenchymal markers, such as vimentin and fibronectin, and decreased expression of the epithelial marker E‐cadherin confirmed EMT phenomenon in keloids [6]. EMT refers to the biological process associated with the loss of polarity of epithelial cells and their transformation into mesenchymal cells with migratory and invasive characteristics [7]. It has been reported that inflammatory stimuli and a hypoxic microenvironment may induce the transition from keratinocytes to fibroblasts, facilitating the invasive growth of keloids [8].

Long noncoding RNAs (lncRNAs) are widely conserved sequences longer than 200 nucleotides without protein-coding capacity [9]. As an emerging class of regulatory RNAs, lncRNAs may participate in fundamental processes including cell cycle regulation, proliferation, apoptosis, self-renewal, metastasis and the DNA damage response through transcriptional or posttranscriptional regulation and thus give rise to diseases. Abnormally expressed lncRNAs play a key role in fibrosis [10] and different types of cancer, including breast cancer [11,12,13], lung cancer [14, 15], colorectal cancer [16, 17], esophageal squamous cell carcinoma [18], and cutaneous squamous cell carcinoma [19, 20].

The role of lncRNAs in normal skin and keloids is an emerging research topic. LncRNAs such as antidifferentiation noncoding RNA (ANCR) and terminal differentiation-induced noncoding RNA (TINCR) are critical in maintaining epidermal stability [21]. In addition to regulating normal skin formation, it has been reported that lncRNAs may influence the occurrence and development of keloids. LncRNAs are aberrantly expressed in keloid tissues compared with normal skin tissues, indicating that differentially expressed lncRNAs may participate in keloid formation [22]. Although the lncRNA expression profiles of keloids have been reported, few studies have concentrated on the physiological functions of lncRNAs. Zhu et al. [23] reported increased lncRNA-ATB expression in keloids, and revealed that lncRNA-ATB regulates the autocrine secretion of TGF-β2 in keloid fibroblasts by downregulating ZNF217 via miR-200c. Yuan et al. [24] identified the upregulation of lncRNA LINC01116 in keloids and demonstrated that LINC01116 accelerates the development of keloids by regulating the miR-203/SMAD5 axis. In addition, differential expression profiles of lncRNAs associated with the Hedgehog and Wnt pathways suggest a possible regulatory mechanism of lncRNAs in the occurrence and development of keloids [25, 26].

Many studies have confirmed the connection between lncRNAs and EMT [27,28,29,30] in multiple diseases, especially in tumors [31,32,33,34,35]. Chen et al. [36] reported that lncRNA HOXA11-AS upregulation promotes EMT by inhibiting miR-200b in non-small cell lung cancer (NSCLC) and predicts a poor prognosis. LncRNA NORAD was also reported to enhance invasion and EMT in NSCLC cells [37], while lncRNA H19 was found to facilitate the EMT and metastasis of esophageal cancer cells through the let-7c/STAT3/EZH2/β-catenin axis [38]. In addition, increased expression of lncRNA UCA1 plays a role in EMT occurrence by targeting miR-155 and can be used as survival prediction in cervical cancer [39]. Nevertheless, to date, no research has focused on EMT-related lncRNAs in keloids. Considering the role of EMT in the pathogenesis of keloids, this study made the first effort into detecting and analyzing EMT-related lncRNA expression profiles in keloids.

Results

RNA quality testing

The baseline data of the 3 patients included in the study are shown in Table 1. We extracted total RNA from keloids and adjacent normal skin and carried out quality testing by a NanoDrop ND-1000 spectrophotometer (Table 2). The A260/A280 ratios of RNAs extracted from the keloid and normal groups were between 1.8 and 2.1, and the A260/A230 ratios were < 1.8, indicating good-quality RNA samples. The imaging of the agarose gel after electrophoresis demonstrated fairly sharp and intense 18 s and 28 s bands, indicating the acceptable purity and integrity of RNA samples (Fig. 1).

Table 1 Baseline data of the included patients
Table 2 RNA quantification and quality assurance by NanoDrop ND-1000
Fig. 1
figure 1

RNA integrity and gDNA contamination were assessed by denaturing agarose gel electrophoresis. Lanes 1–3 correspond to RNA samples of A1–C1, and lanes 4–6 represent RNA samples of A2–C2

A1–C1 refer to RNAs extracted from keloids, and A2–C2 refer to RNAs extracted from adjacent normal skin from the corresponding patients.

Differentially expressed lncRNAs and mRNAs

A total of 773 EMT-related lncRNAs and 219 mRNAs were evaluated in the study, 11 lncRNAs and 16 mRNAs were identified to have differential expression between keloid tissue and normal skin tissue (fold change > 1.5, P < 0.05) (Tables 3 and 4). Among the detected lncRNAs, 7 were upregulated in keloid specimens, while 4 were downregulated. Regarding differentially expressed mRNAs, 12 were significantly upregulated, and 4 were significantly downregulated in keloids.

Table 3 Differentially expressed EMT-related lncRNAs in keloids compared with normal skin
Table 4 Differentially expressed EMT-related mRNAs in keloids compared with normal skin

To compare the intensity distribution of 6 samples, we used box-whisker plots, which suggested that the data distribution from 6 chips was similar (Fig. 2). Variations in EMT-related lncRNAs and mRNA expression among specimens are shown in volcano plots and scatter plots (Fig. 3). Based on the microarray results, hierarchical clustering was performed to infer the relationship among specimens and demonstrated distinguishable lncRNA and mRNA expression profiles between keloid tissue and normal skin tissue (Fig. 4).

Fig. 2
figure 2

Box-whisker plots of lncRNAs (A) and mRNAs (B) in keloid specimens and normal skin specimens show similar distributions of intensities for all samples

Fig. 3
figure 3

Expression profiles of EMT-related lncRNAs and mRNAs in keloid specimens and normal skin specimens. Volcano plots of lncRNA (A) and mRNA (B) expression profiles showing differential expression between keloid tissue and normal skin tissue. The horizontal green line represents a P-value of 0.05 and the vertical green lines correspond to 1.5-fold upregulation and downregulation. The red points represent differentially expressed RNAs with statistical significance. Scatter plots of lncRNAs (C) and mRNAs (D) are presented for the assessment of expression variation between keloid tissue and normal skin tissue. The values of the x and y-axes are the averaged normalized signal values (log2 scaled), and the default fold change given is 1.5

Fig. 4
figure 4

Hierarchical clustering of lncRNAs (A) and mRNAs (B) classifies samples into groups according to expression level. Each column corresponds to a sample, and each row stands for a gene. Red and green colors represent high and low expression of genes, respectively

By quantifying lncRNAs at the transcript level and their potential target genes at both the gene and transcript levels in parallel, microarray analysis helped to establish connections between the regulatory mechanisms and biological functions of the lncRNAs. Five pairs of predicted lncRNA–mRNA were listed as examples (Table 5). More information could be found in the additional file (see Additional file 1).

Table 5 Predicted regulation of genes by lncRNAs

Gene Ontology (GO) analysis

Utilizing GO, the de facto standard in describing gene functions, we carried out functional annotation and enrichment analysis of differentially expressed transcripts. We found that in upregulated transcripts from keloids, the most enriched GO terms were single-organism cellular processes (biological process), the intracellular part (cellular component), and protein binding (molecular function) (Fig. 5A–C). For downregulated transcripts, the most enriched GO terms were the positive regulation of cellular processes (biological process), the Set1C/COMPASS complex (cellular component), and histone acetyltransferase activity (molecular function) (Fig. 5D–F).

Fig. 5
figure 5

GO analysis of differentially expressed coding gene transcripts. The most highly enriched GO terms for the upregulated transcripts in keloids in terms of biological processes (A), cellular components (B), and molecular functions (C). The most highly enriched GO terms for the downregulated transcripts in terms of biological processes (D), cellular components (E), and molecular functions (F)

Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis

KEGG analysis helped us to map lncRNAs to pathways, and this analysis demonstrated a total of 10 pathways related to upregulated transcripts in keloids (Fig. 6A). The top 3 most enriched networks were hypertrophic cardiomyopathy (HCM), dilated cardiomyopathy (DCM) and pathways in cancer. Moreover, 2 pathways, corresponding to HTLV-1 infection and pathways in cancer, were found to be related to downregulated transcripts in keloids (Fig. 6B).

Fig. 6
figure 6

Pathway analysis. Pathways related to upregulated (A) and downregulated (B) transcripts in keloids

Validation of the microarray finding by qRT-PCR

To validate the microarray results, we performed qRT-PCR of 5 upregulated lncRNAs (ENST00000539135, NR_033321, NR_024065, TCONS_00002241 and ENST00000563754) and 3 downregulated lncRNAs (uc003djn.3, NR_002605 and TCONS_00014246) in the original specimens (Fig. 7A). NR_033321, TCONS_00002241, ENST00000563754, and NR_002605 were proven to have significant changes between the original keloid specimens and normal skin specimens. Among these verified lncRNAs, TCONS_00002241 demonstrated the most apparent variation with fold change up to 112.53.

Fig. 7
figure 7

Relative expression of candidate lncRNAs detected by qRT-PCR. A Comparison of the fold changes of 8 lncRNAs between microarray and qRT-PCR results in original specimens. B Validation of 3 chosen lncRNAs by qRT-PCR in an enlarged sample set

A total of 12 pairs of keloid specimens and adjacent normal skin specimens were then tested by qRT-PCR to increase the sample size, and the expression of NR_033321, TCONS_00002241, and NR_002605 was detected (Fig. 7B). All these lncRNAs were significantly differentially expressed. Consistent with the validation results in the original specimens, TCONS_00002241 presented the most evident fold change (6.88).

Discussion

This study utilized an EMT-specific microarray and bioinformatics analysis to evaluate EMT-related lncRNAs, mRNAs, and involved signaling pathways in keloids. The results of the microarray are summarized in Tables 3 and 4 and were verified by qRT-PCR in a larger set of samples to guarantee reliability. Compared to the use of qRT-PCR alone to assess certain lncRNAs commonly investigated in cancer, microarray analysis provides a more macroscopic differential expression profile, helping to identify novel lncRNAs involved in the pathogenesis of keloid diseases. Because the functions of many lncRNAs are still elusive while corresponding mRNAs are better understood, we conducted GO and KEGG analyses of target gene mRNAs to analyze the regulatory mechanism and biological function of lncRNAs in keloids. GO analysis showed that the functions of these mRNAs were related to the extracellular matrix, protein binding, the positive regulation of cellular process, the Set1C/COMPASS complex, and histone acetyltransferase activity, suggesting that the differentially expressed lncRNAs and mRNAs play a role in the development of tumors. Following GO analysis, the subsequent KEGG analysis demonstrated that the mRNAs are involved in pathways in cancer. Considering that keloids are benign lesions characterized by invasive growth, it is reasonable that pathways in cancer are recognized to be differentially enriched between keloids and normal skin tissue.

Notably, XLOC_000587, the corresponding gene of TCONS_00002241, is located within 300 kb upstream of ENAH and may be an enhancer of ENAH. ENAH is an actin regulatory protein involved in cell motility and adhesion. Extensive studies have demonstrated upregulation of ENAH in many tumors, including breast carcinoma [40], cervical carcinoma [41], colorectal carcinoma [42], and hepatocellular carcinoma [43]. Notably, the expression of ENAH is correlated with vascular invasion, tumor grade, and unfavorable prognosis, which suggests that ENAH might participate in carcinogenesis and tumor progression [44,45,46]. Chen et al. [45] reported that EMT may be involved in the role of ENAH in gastric cancer cell proliferation and metastasis. EMT is regulated by a complex network including the TGF-β, Wnt, and Notch signaling pathways [47], while ENAH has been demonstrated to function as a transcriptional target of the Wnt/β-catenin pathway and a novel connection between the Wnt/β-catenin and Notch signaling cascades [48]. Therefore, we speculate that XLOC_000587 in keloids may further regulate related signaling pathways by increasing the expression of ENAH to promote EMT and the invasive development of keloids. Elucidating the mechanism by which XLOC_000587 participates in EMT is expected to provide new insights into keloid treatment.

In addition, AF268386 was predicted to be an enhancer of discoidin domain receptor 2 (DDR2), a tyrosine kinase receptor activated by collagen [49]. As a recognized critical regulator of EMT, DDR2 upregulation has been confirmed in many tumor cell lines and takes part in the interaction with the extracellular matrix, the regulation of cell motility and adhesion, and metastatic spreading [49,50,51,52]. Toy et al. [53] reported that overexpressed DDR2 in triple-negative breast cancer may indicate carcinoma with worse prognosis. Moreover, Ren et al. [54] reported that DDR2 expression was closely correlated with the expression of the hypoxic marker HIF-1α in breast cancer specimens, and hypoxia treatment could induce the expression of DDR2 in human breast cancer cell lines. Regarding wound healing, DDR2 is capable of promoting proliferation, collagen I secretion and MMP2 production in skin fibroblasts, and a delay in wound healing was observed in DDR2 null mice [55]. According to the role of DDR2 in wound healing and metastatic spreading and the hypoxic microenvironment in keloids, we hypothesize that targeting AF268386 or the corresponding lncRNA AF268386 could be a potential therapeutic approach for keloids treatment. In view of the limited research findings, more studies concerning these candidate lncRNAs remain to be carried out to uncover underlying pathways, establish lncRNA–miRNA–mRNA molecular network and clarify the exact functions in the occurrence and development of keloids.

Conclusion

This study identified and verified the differential expression of EMT-related lncRNAs in keloids, indicating that lncRNAs are likely to be involved in EMT and the overgrowth of keloids. In particular, lncRNA XLOC_000587 may participate in cell proliferation and migration by enhancing the expression of ENAH, and AF268386 may accelerate the invasive development of keloids by upregulating DDR2. The differentially expressed lncRNAs will hopefully provide novel insights into the pathogenesis of keloid diseases by multiple mechanisms such as signal transduction and competing endogenous RNA (ceRNA) networks.

Methods

Patients and specimens

The study complied with the Declaration of Helsinki of the World Medical Association and was approved by the Ethics Committee of The First Affiliated Hospital, Zhejiang University School of Medicine. All 12 patients were fully informed of the aim and protocol of the study, gave written informed consent, and provided excised keloids and adjacent normal skin for scientific research and publication. None of the patients had previously undergone radiotherapy or drug therapy. The keloids had clinical characteristics of overgrowth beyond the original wound boundaries and were confirmed by pathology. Tissues were frozen in liquid nitrogen and stored at − 80 °C immediately after excision.

RNA extraction and quality control

Total RNA was extracted from frozen specimens using TRIzol reagent (Invitrogen, Carlsbad, CA, USA) following the manufacturer's instructions (Invitrogen). The RNA quality was assessed by NanoDrop ND-1000 (Thermo Fisher Scientific, Waltham, MA, USA), and RNA integrity was assessed by denaturing agarose gel electrophoresis.

RNA labeling and array hybridization

Keloid and normal skin specimens from 3 randomly selected patients were tested by microarray profiling. The chip used was the lncPath Human EMT Array (8 × 15 K) (ArrayStar, Rockville, MD, USA). Microarray analysis was performed by Kangchen Biology Engineering Co., Ltd., (Shanghai, China) according to the manufacturer’s protocol (Arraystar). LncRNAs/mRNAs were purified from total RNA after removal of rRNA (Arraystar rRNA removal kit). Then, each sample was amplified and transcribed into fluorescent cRNA along the entire length of the transcripts without 3′ bias utilizing a random priming method (Arraystar Flash RNA Labeling Kit, Arraystar). The labeled cRNAs were purified by RNeasy Mini Kit (Qiagen, Germany). The concentration and specific activity of the labeled cRNAs (pmol Cy3/μg cRNA) were measured by NanoDrop ND-1000. One microgram of each labeled cRNA was fragmented by adding 5 μl 10 × Blocking Agent and 1 μl 25 × Fragmentation Buffer and then heating the mixture at 60 °C for 30 min. Finally, 25 μl 2 × GE Hybridization buffer was added to dilute the labeled cRNA. Fifty microliters of hybridization solution was dispensed into the gasket slide and assembled into the lncRNA expression microarray slide. The slides were incubated for 17 h at 65 °C in an Agilent Hybridization Oven. The hybridized arrays were washed, fixed and scanned using the Agilent Scanner G2505C.

Data analysis

Differentially expressed lncRNAs/mRNAs were defined as those with a fold change ≥ 1.5 (upregulated or downregulated) combined with P-value < 0.05. Agilent Feature Extraction software (version 11.0.1.1) was used to analyze acquired array images. Quantile normalization and subsequent data processing were performed using the R software limma package. Differentially expressed lncRNAs/mRNAs with statistical significance between two groups were identified through volcano plot filtering. Differentially expressed lncRNAs/mRNAs between two samples were identified through fold change filtering. Hierarchical clustering was performed to show the distinguishable lncRNA/mRNA expression pattern among samples.

Based on the Gene Ontology (GO) database (http://www.geneontology.org), the differential transcripts underwent enrichment analysis of gene function via GO analysis, which indicates enriched terms in the categories of biological process, molecular function and cellular component. Pathway analysis of identified differentially expressed genes was performed using the Kyoto Encyclopedia of Genes and Genomes (KEGG) database (http://www.genome.jp/kegg/).

Real-time quantitative polymerase chain reaction (qRT-PCR)

Reverse transcription of extracted RNA was performed utilizing SuperScript™ III reverse transcriptase (Invitrogen) following the manufacturer’s instructions. qRT-PCR was conducted using the ethidium bromide staining method with ViiA 7 real-time PCR system (Applied Biosystems) following the manufacturer’s protocols.

Five significantly upregulated (ENST00000539135, NR_033321, NR_024065, TCONS_00002241 and ENST00000563754) and 3 downregulated lncRNAs (uc003djn.3, NR_002605 and TCONS_00014246) were selected for qRT-PCR validation in the original specimens. NR_033321, NR_002605 and TCONS_00002241 were further detected in a larger set of samples including 12 pairs of keloid tissues and adjacent normal skin tissues. β-Actin was used as endogenous control. The primer sequences were as follows:

β-actin:

Forward: 5′ GTGGCCGAGGACTTTGATTG3′

Reverse: 5′ CCTGTAACAACGCATCTCATATT3′

ENST00000539135:

Forward: 5′ AGCCAAGAAACGGTCCAGA 3′

Reverse: 5′ AGCACAGGTAGAGATCAGGAGG 3′

NR_033321:

Forward: 5′ GACCCGAGTTGGAGGCATCT 3′

Reverse: 5′ TGTGGTTGTGAACGTCCTGAAG3′

NR_024065:

Forward: 5′ GCTATTAGAGGGTGTTGCTGAC3′

Reverse: 5′ TGTATTGTGGCTATGTGGGAG 3′

TCONS_00002241:

Forward: 5′ CACATTTACAGACTACAGAGCC 3′

Reverse: 5′ CTGACCACACCATCAGCAAC 3′

ENST00000563754:

Forward: 5′ CATTCCTGAGGCTGTTCGT 3′

Reverse: 5′ GAAAATGTCTCTGAGAACCCAT 3′

uc003djn.3:

Forward: 5′ AGCACAGCCTGAAACCCAA 3′

Reverse: 5′ AGGAAGCATCTCACCCTCCT 3′

NR_002605:

Forward: 5′ CCTTGTATATGGATACACGTGC 3′

Reverse: 5′ TCCTTTCTGGTAGAATCACTGG 3′

TCONS_00014246:

Forward: 5′ CCGAAAAGCGAACAGTCCA 3′

Reverse: 5′ CGTGTTTGGTGAGTTCGGTT 3′.

The reaction conditions for PCR were as follows: 95 °C for 10 min, then 40 PCR cycles at 95 °C for 10 s and 60 °C for 60 s. Products were slowly heated from 60 to 99 °C after amplification, and melting curve was established to test the specificity of PCR products. The corrected relative expression level was acquired by the following calculation, the concentration of the target gene divided by the concentration of the housekeeping gene.

Statistical analysis

All data were analyzed utilizing SPSS (version 18.0; SPSS, Inc., Chicago, IL, USA). The Student’s t-test was performed to evaluate the differences between distinct groups, and one-way ANOVA and SNK methods were used for comparisons among groups. A P-value < 0.05 was considered significant.