LINC01420 RNA structure and influence on cell physiology
It was shown that the major part of human genome is transcribed and produces a large number of long noncoding RNAs (lncRNAs). Today there are many evidences that lncRNAs play important role in the regulation of gene expression during different cellular processes. Moreover, lncRNAs are involved in the development of various human diseases. However, the function of the major part of annotated transcripts is currently unknown, whereas different lncRNAs annotations tend to have low overlap. Recent studies revealed that some lncRNAs have small open reading frames (smORFs), that produce the functional microproteins. However, the question whether the function of such genes is determined by microprotein or RNA itself or both remains open. Thus, the study of new lncRNA genes is important to understanding the functional role of such a heterogeneous class of genes.
In the present study, we used reverse transcription PCR and rapid amplification of cDNA ends (RACE) analysis to determine the structure of the LINC01420 transcript. We revealed that LINC01420 has two isoforms that differ in length of the last exon and are localized predominantly in the cytoplasm. We showed that expression of the short isoform is much higher than the long. Besides, MTT and wound-healing assays revealed that LINC01420 inhibited cell migration in human melanoma cell line A375, but does not influence on cell viability.
During our work, D’Lima et al. found smORF in the first exon of the LINC01420 gene. This smORF produces functional microprotein named non-annotated P-body dissociating polypeptide (NoBody). However, our results provide new facts about LINC01420 transcript and its function.
KeywordsLINC01420 NBDY Long noncoding RNA lncRNA Cell migration smORF RACE MTT Wound healing
Long noncoding RNA
Open reading frame
Polymerase chain reaction
Rapid amplification of cDNA ends
Reverse transcription PCR
Small interfering RNA
Small open reading frame
After the appearance of the GENCODE project , the mRNA-centric paradigm for transcript annotation has dramatically changed. It was shown that the major part of the human genome (~ 62–75%) is transcribed and produces a large number of long noncoding RNAs (lncRNAs) [2, 3]. LncRNAs are transcripts with the length more than 200 nucleotides that do not possess long open reading frames (ORFs) . They are actively studied relatively recently, and today there is not even a good understanding of their number. According to the current GENCODE release (version 29), 16′066 lncRNA genes were annotated in the human genome. At the same time, the FANTOM CAT project revealed 27′919 human lncRNA genes and predicted the potential functionality of ~ 69% of them .
Despite this, only a small number of lncRNAs has experimentally defined function . It is already known, that they are involved in various gene-expression regulation processes on transcriptional [7, 8, 9, 10, 11, 12, 13] and post-transcriptional [14, 15, 16] levels. Besides, lncRNAs play a role in the development of various human diseases . The lncRNADisease database contains entries about 914 lncRNAs associated with 329 diseases , including different cancer types, genomic imprinting disorders, neurodegenerative, cardiovascular diseases and other pathologies. In some cases, non-coding transcripts play a key role in the molecular pathogenesis of the diseases. Thus, lncRNAs have the potential to be used as biomarkers and therapeutic targets .
There are several lncRNA annotations, which tend to have low overlap. For example, FANTOM CAT and GENCODE have 25,7% and 57,8% common genes respectively. Moreover, the most annotations of transcripts are 5′-end and 3′-end incomplete . The challenge of lncRNA gene annotation is related to their differences from mRNAs, such as weak conservation, low and tissue-specific expression, lack of information about functional elements [20, 21]. Despite this trend, there are lncRNAs with high and widespread expression (MALAT1, NEAT1, TINCR) and cross-species conservation (MALAT1, TINCR, PVT1). It suggests that some lncRNAs may play a role in essential cellular processes. Thus, the study of new lncRNA genes is intriguing and requires in the experimental validation of transcript structure.
Recent studies revealed that some lncRNAs have small open reading frames (smORFs) (< 100 amino acids), that are translated. The resulting short peptides are functional and play a role in cell physiology . At the same time, some studies have shown the dual function of RNAs: coding and intrinsic RNA .
In the present study, we investigated widely expressed lncRNA LINC01420, the function of which was not described at the time of the beginning of the study. We determined the structure of LINC01420 transcript, its localization, and influence on cell physiology. During our work, D’Lima et al. found smORF in LINC01420. This smORF produces microprotein named non-annotated P-body dissociating polypeptide (NoBody) . Authors demonstrated the potential functionality of this microprotein as a component of the mRNA decapping complex. However, our results provide new facts about LINC01420 transcript and its function.
LINC01420 has conservative sequences and high expression level in human tissues and cell lines
Using nucleotide BLAST search, we revealed that LINC01420 transcript has homologs across Mammals, but not in other Vertebrates. Moreover, multiple alignment of 100 Vertebrates genomes presented in UCSC browser confirms that this gene is present only in Mammals. HMMER analysis of NoBody homologous proteins showed the same result.
LINC01420 has two isoforms
Different lnсRNA annotations revealed various possible structures of the LINC01420 transcript. To determine a real structure of the LINC01420 isoforms, we performed reverse transcription PCR and rapid amplification of cDNA ends (RACE) analysis on HEK293T, HeLa cell lines, and human primary skin fibroblasts (Fig. 1a). We revealed that the LINC01420 RNA consists of three exons and has two polyadenylated isoforms that differ in length of the last exon. The total length of the short and long LINC01420 isoforms is 701 bp and 1510 bp respectively. Nucleotide sequences of short and long isoforms were deposited into GenBank under accession numbers MH892397 and MH892398, respectively. To determine the expression level of two different isoforms, we performed qPCR with primers common for both isoforms (pairs 1 and 3) and with primers specific for the long isoform (pair 5) (Fig. 1b). We found that expression of the long isoform is ~ 150 times lower than the short one.
Cytoplasmic localization of LINC01420
Since a large group of lncRNA functions is associated with the regulation of transcription and binding to chromatin, we investigated the subcellular localization of LINC01420 using soft lysis method. RNA was isolated from cytoplasmic, nuclear and chromatin-bound fractions of HEK293T cells. RT-qPCRs were performed to determine the level of the transcript of interest in each fraction. As control transcripts, we used U1  and BIRC5 , which are localized predominantly in the nucleus and the cytoplasm, respectively. We revealed the LINC01420 transcript has mostly cytoplasmic localization (Fig. 1c). It suggests that the function of this transcript is not related to the transcription regulation and chromatin binding.
The influence of LINC01420 on cell physiology
To determine the potential effect of LINC01420 on cell physiology we performed knockdown experiments using RNA-interference. We designed LINC01420 siRNA and optimized transfection conditions of human melanoma cell line A375. The expression level of a gene of interest was measured by qPCR. The LINC01420 knockdown efficiency was ~ 60% (data not shown).
In the present study, we experimentally determined the structure of LINC01420 transcript and revealed that it has two polyadenylated isoforms that differ in length of the last exon. We showed that the short isoform expressed at a much higher level than the long form and is localized predominantly in the cytoplasm. Besides, we found that LINC01420 inhibited cell migration in human melanoma cell line A375, but does not influence cell viability. Recently Yang L et al. have shown that expression of the LINC01420 gene in nasopharyngeal carcinoma (NPC) is higher than in normal nasopharyngeal epithelial tissues and correlates with NPC distant metastasis and poor prognosis . The authors observed that knockdown of LINC01420 inhibited NPC cell migration and invasion in 5-8F cell line. These contradictory results allow us to assume that this transcript affects cell migration in cell context-specific manner.
In silico analysis of CLIP-seq data about miRNA-mRNA interactions [31, 32, 33, 34] from starBase v2.0 database  revealed that the LINC01420 transcript interacts with several miRNAs. Top miRNA interactors are involved in the cell migration regulation pathways (miR-876-5p , miR-197-3p , miR-410-3p [38, 39], miR-340-5p , miR-873-5p [41, 42]). Moreover, expression of this miRNAs could either activate or inhibit cell migration depending on cell context. These observations confirm the contradictory results obtained in the present work and Yang L et al. However, this question requires further investigation.
Cytoplasmic localization of the LINC01420 transcript is consistent with the work of D’Lima et al. in which the authors revealed smORF that is translated from the first exon of LINC01420 . This smORF produces microprotein NoBody (non-annotated P-body dissociating polypeptide). D’Lima et al. demonstrated that NoBody interacts with mRNA decapping protein EDC4 and loss of NoBody causes a decrease in the cellular levels of an NMD substrate. The influence of this microprotein on cell migration was not studied. Thus, it remains unclear whether the LINC01420 effect on cell migration is associated with the microprotein or with the RNA itself.
Using SPLASH data  available in starBase v2.0  we found that the LINC01420 transcript interacts with ACAD11, NPHP3 and PPP1R3F RNAs. However, the possible functional role of these interactions requires further investigation. At the same time, analysis of LINC01420 RNA-binding proteins revealed that many of them are involved in regulation of mRNA nuclear export, translation and stability (e.g. EIF4A3 , IGF2BP1 , IGF2BP2 , IGF2BP3 , UPF1 ). It indicates the protein coding function of this transcript.
LncRNA genes became a breakthrough in our understanding of gene regulation and RNA metabolism. smORFs are a new class of genetic elements that expand the coding potential of the genome. High-throughput RNA sequencing (RNA-seq) and ribosome footprinting followed by sequencing (Ribo-seq) increase evidence of the existence of lncRNAs with smORFs . Some of these microproteins are evolutionary conserved and play a functional role not related to the host lncRNA. Taking into account the common features of lncRNAs (weak conservation, low and tissue-specific expression) we speculate that most conservative and widely-expressed lncRNA genes contain smORF, as in the case of the NBDY/LINC01420 gene. However, the question of whether microprotein or RNA itself or both determine the function of such genes remains open.
We provide an analysis of widely expressed lncRNA LINC01420, the function of which was not described at the time of the beginning of the study. Bioinformatics analysis showed that the LINC01420 gene has conservation across mammalian and “housekeeping” broad and uniform expression. Our experimental work revealed that LINC01420 has two isoforms, that are localized predominantly in the cytoplasm. Besides, cell physiology experiments showed that LINC01420 does not affect cell viability, but inhibits melanoma cell line migration.
Recently published work of D’Lima et al. revealed functional NoBody microprotein, that is translated from the first exon of LINC01420. However, the influence of this microprotein on cell migration was not studied. Thus, it remains unclear whether LINC01420 effect on cell migration is associated with the microprotein or with the RNA itself.
Nucleotide sequences of the studied gene were found in following databases: RefSeq release 90 , Ensemble release 93 , GENCODE release 28. Conservation and expression level in various human cell lines and tissues were analyzed using data from the UCSC genomic browser . Nucleotide sequences were analyzed using the BLAST (Basic Local Alignment Search Tool) search of the NCBI NR nucleotide database with standard parameters .
RNA extraction and reverse transcription-quantitative PCR (RT-qPCR)
Primer sequences used for amplification of different LINC01420 loci and reference genes and siRNA sequences used for knockdown
5′ - GCCCACCGGAGAAAACTGAC – 3
5′ – CCTTCCGGATAATCCCAACG - 3‘
5′ – GAGGCTTGGCTTCCCGTG - 3‘
5′ – TTCCAGGTGGGAGAGTGGA – 3
5′ – AGTGATTGCAGTATGACTCCA - 3‘
5′ – TTCCAGGTTCAGGACACCAGA - 3‘
5′ – TCAGCGCGATTTCACTTCCTG – 3’
5′ – CTGAATTTCGATGAATTCTAAGAC – 3’
5′ – ACCTCTGAGATTTAAGGCCATG – 3’
5′ – GACCATCTCACAGGCATTGTT – 3’
5′ – GTACACTTTCTTTAATTTGCTGTC – 3’
5′ – GAAAATGTCAGATAAACTTGGCT – 3’
5′ – CTTTAATTCTGATGCTAGGGACT – 3’
5′ – TGTAATGACCAGTCAACAGGG - 3’
5′ – TGCGACCTTGACCATCTTTG - 3’
5′ – TCTTTCAGCAAGGACTGGTC - 3’
5′ – GGCATCTTCAAACCTCCATG - 3’
5′ – CGGAGAGTTCTGGGATTGTAC - 3’
5′ – GTGGTTCGTGGCTCTCTTATC - 3’
5′ – TCCTTGCATATTCTGGAATCCC - 3’
5′ – ATCACGAACTGACCAGCG - 3’
We isolated total RNA from HEK293T, HeLa cell lines and human primary skin fibroblasts using ExtractRNA reagent (Evrogen, Russia) according to the manufacturer’s instruction. cDNAs synthesis and rapid amplification of cDNA ends were performed using Mint RACE cDNA amplification set (Evrogen, Russia) according to the manufacturer’s instruction. All primers used for RACE are presented in Table 1. Amplicons were analyzed by 1% agarose gel electrophoresis. 5′- and 3′-RACE fragments were cloned into pGEM-T Easy vector. Ten random insert clones were obtained and sequenced.
Subcellular localization (fractionation of RNA)
For RNA fractionation was used soft lysis procedure . The HEK293T cells were detached by treating with 1× Trypsin, transferred into 1.5 ml tube and centrifuged at RT 168 g for 5′. The pellet was lysed with 175 μl/106 cells of cold RLN1 solution (50 mM Tris HCl pH 8, 140 mM NaCl, 1.5 mM MgCl2, 0.5% NP-40, RNasin Plus RNase Inhibitor, Promega) and incubated 5′ in ice. Next, the suspension was centrifuged at 4 °C 300 g for 2′ and the supernatant, corresponding to the cytoplasmic fraction, was transferred into a new tube and stored in ice. The pellet containing nuclei was extracted with 175 μl/106 cells of cold RLN2 solution (50 mM Tris HCl pH 8, 500 mM NaCl, 1.5 mM MgCl2, 0.5% NP-40, RNasin Plus RNase Inhibitor, Promega) and 5′ incubated in ice. The suspension was centrifuged at 4 °C 16360 g for 2′ and the supernatant, corresponding to the nuclear-soluble fraction, was transferred into a new tube and stored in ice. The remaining pellet corresponds to the chromatin-associated fraction. The ratio of fractions to total RNA was estimated using RT-qPCR. All experiments were performed in triplicate.
For LINC01420 knockdown we designed two siRNAs using in-house software.
The A375 cells were maintained in DMEM (PanEco, Russia) with 10% fetal bovine serum (Biosera, France) in a with 5% CO2 at 37 °C. Knockdown experiments were conducted as described in Vyakhireva et al. . Briefly, 5 × 103 cells were seed in 96-well plates overnight and transfected with siRNA using METAFECTENE® (Biontex, Germany) according to the manufacturer’s instructions.
For LINC01420 knockdown we used siLINC01420; for EIF3D knockdown – mix of two siRNAs siEIF3D#1 and siEIF3D#2; for control we used nonspecific siControl; for transfection efficiency control we used FAM-labeled nonspecific siRNA (Table 1).
At 0, 24, 48, 72 h, 96 h and 120 h after transfection of A375 cells in 96-well plate, the 20 μl of MTT (Sigma-Aldrich, USA) solution (5 mg/ml) was added to each well with 200 μl culture media and incubated 3 h at 37 °C. After this media was removed. Formazan pellets in each well were dissolved in 200 μl DMSO, and the absorbance of formazan solutions was measured at 570 nm and 670 nm (for background signals).
All the experiments were carried out at three biological and five technical replicates. Data analyzed by a nonparametric paired Mann-Whitney U Test.
Wound healing assay
At 24 h after transfection of A375 cells in 96-well plate. Cells in monolayer culture were scraped using the pipette tips. At 0, 3, 5.5, 8, 10.5, 17.5 and 20.5 h after wounding, 1 field/well was visualized by microscopy. Then, images were analyzed using Image J program (National Institutes of Health). Changes of the remaining wound area were measured relative to total wound square at 0 h.
All the experiments were carried out at three biological and five technical replicates. Data analyzed by a nonparametric paired Mann-Whitney U Test.
The publication cost was covered by The Ministry of Education and Science of the Russian Federation.
Availability of data and materials
About this supplement
This article has been published as part of BMC Genomics Volume 20 Supplement 3, 2019: Selected articles from BGRS\SB-2018: genomics. The full contents of the supplement are available online at https://bmcgenomics.biomedcentral.com/articles/supplements/volume-20-supplement-3.
MS and AF designed the experiments, DK and AF performed the experiments and wrote the manuscript. MS revised the manuscript. All authors reviewed, considered and approved the manuscript.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
- 19.Yu Filatova A, et al. Long noncoding RNAs are a promising therapeutic target in various diseases. Bulletin of RSMU. 2017;6:5–16.Google Scholar
- 22.Choi SW, Kim HW, Nam JW. The small peptide world in long noncoding RNAs. Brief Bioinform. 2018.Google Scholar
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.