Abstract
Long non-coding RNAs (LncRNAs) are a large, diverse class of RNA molecules that has garnered attention for their potential to regulate gene expression and been identified in various organisms. Here we report the first prediction of lncRNAs in Brassica rapa (B. rapa) genome using computational method. We subjected the publicly available full length cDNA sequences and identified 2237 candidate lncRNAs, characterized their functions based on insilico methods. Housekeeping and small regulatory non-coding RNAs (ncRNAs) were removed from the pool. Although 14–15 % of the total sequences were predicted to be non-coding initially, on filtering only 4.6 % of the total sequences were predicted as lncRNAs carrying a large number of simple repeats. The lncRNAs had an average length of 497 bp and were mapped on each chromosome of B. rapa. They were classified to 4 groups based on their origin. Thirty six motifs involving transcription related activities, signaling mechanism and stress response were identified in the lncRNAs. Repeat elements and neighboring genes of the lncRNAs were analyzed since they were associated in function and regulating the expression of these long non-coding RNAs. We believe that this study would be an initial and reference for any further studies regarding long non-coding RNAs in B. rapa and other Brassica crops.
Similar content being viewed by others
References
Altschul SF, Gish W, Miller W, Myers EW, Lipman GJ (1990) Basic local alignment search tool. J Mol Biol 215:403–410
Bailey TL (2011) DREME: motif discovery in transcription factor Chip-seq data. Bioinformatics 27:1653–1659
Bertani S, Sauer S, Bolotin E, Sauer F (2011) The noncoding RNA mistral activates Hoxa7 expression and stem cell differentiation by recruiting MLL1 to chromatin. Mol Cell 43:1040–1046
Burge SW, Daub J, Eberhardt R, Tate J, Barquist L, Nawrocki EP, Eddy SR, Gardner PP, Bateman A (2012) Rfam 11.0:10 years of RNA families. Nucleic Acids Res 41:D226–D232
Buske FA, Boden M, Bauer DC, Bailey TL (2010) Assigning roles to DNA regulatory motifs using comparative genomics. Bioinformatics 26:860–866
Cai G, Yang Q, Yang Q, Zhao Z, Chen H, Wu J, Fan C, Zhou Y (2012) Identification of candidate genes of QTLs for seed weight in Brassica napus through comparative mapping among Arabidopsis and Brassica species. BMC Genet 13:105
Chalk AM, Warfnge RE, Georgii-Hemming P, Sonnhammer ELL (2005) siRNAdb: a database of siRNA sequences. Nucleic Acids Res 33:D113–D134
Cheng F, Liu S, Wu J, Fang L, Sun S, Liu B, Li P, Hua P, Wang X (2011) BRAD, the genetics and genomics database for Brassica plant. BMC Plant Biol 11:136
Conesa A, Gotz S (2008) Blast2Go: a comprehensive suite for functional analysis in plant genomics. Int J Plant Gen 619–832
Crespi MD, Jurkevitch E, Poiret M, d’ Aubenton CF, Petrovics G, Konsorosi E, Kondorosi A (1994) Enod40, a gene expressed during nodule organogenesis, codes for a non-translatable RNA involved in plant growth. Embo J 13:5099–5112
Dai X, Zhao PX (2011) psRNAtarget: a plant small RNA target analysis server. Nucleic Acids Res 39:W155–W159
Dijkstra JM, Alexander DB (2015) The NF-kB interacting long noncoding RNA(NKILA) transcript is antisense to cancer-associated gene PMEPA1. F1000Research 4
Furuno M, Pang KC, Ninomiya N, Fukuda S, Frith MC, Bilt C, Kai C, Kawai J, Carninci P, Hayashizaki Y, Mattick JS, Suzuki H (2006) Clusters of internally primed transcripts reveal novel long noncoding RNAs. PLoS Genet 2:e37
Guttman M, Donaghey J, Carey BW, Garber M, Grenier JK, Munson G, Young G, Lucas AB, Ach R, Bruhn L, Yang X, Amit I, Meissner A, Regev A, Rinn JL, Root DE, Lander ES (2011) LincRNAs act in the circuitry controlling pluripotency and differentiation. Nature 477:295–300
Hainer SJ, Prineski JA, Mitchell RD, Monteverde RM, Martens JA (2011) Intergenic transcription causes repression by directing nucleosome assembly. Genes Dev 25:29–40
Handong M, Yun H, Xinran D, Qingtian G, Jingqi C, Jifeng Z, Weidong T (2012) Molecular mechanisms and functional prediction of long noncoding RNA. Sci World J 541–786
Heo JB, Sung S (2011) Vernalization-mediated epigenetic silencing by a long intronic noncoding RNA. Science 331:76–79
Huang CY, Shirley N, Genc Y, Shi B, Langridge P (2011) Phosphate utilization efficiency correlated with expression of low-affinity phosphate transporters and noncoding RNA, IPSI, in barley. Plant Physiol 56:1217–1229
Huang W, Long L, Khatib H (2012) Genome-wide identification and initial characterization of bovine long non-coding RNAs from EST data. Anim Genet 43:674–682
Jalali S, Bhartiya D, Lalwani MK, Sivasubbu S, Scaria V (2013) Systematic transcriptome wide analysis of lncRNA–miRNA interactions. PLoS ONE 8:e53823
Johnston JS, Pepper AE, Hall AE, Chen ZJ, Hodnett G, Drabek Lopez R, Price HJ (2005) Evolution of genome size in Brassicaceae. Ann Bot 95:229–235
Kapranov P, Cheng J, Dike S, Nix DA, Duttagupta R et al (2007) RNA maps reveal new RNA classes and a possible function for pervasive transcription. Science 316:1484–1488
Kapusta A, Kronenberg Z, Lynch VJ, Zhuo X, Ramsay LA, Bourque G, Yandell M, Feschotte C (2013) Transposable elements are the major contributors to the origin of diversification, and regulation of vertebrate long noncoding RNAs. PLoS Genet 9:e1003470
Kent WJ (2002) BLAT—the BLAST-like alignment tool. Genome Res 12:656–664
Kim B, Yu HJ, Park SG, Shin JY, Oh M, Kim N, Hwan J (2012) Identification and profiling of novel microRNAs in the Brassica rapa genome based on small RNA deep sequencing. BMC Plant Biol 12:218
Kouchi H, Takane K, So RB, Ladha JK, Reddy PM (1999) Rice ENOD40: isolation and expression analysis in rice and transgenic soybean root nodules. Plant J 18:121–129
Kozomara A, Griffihs-Jones S (2014) miRBase: annotating high confidence microRNAs using deep sequencing data. Nucleic Acids Res 42:D68–D73
Krzywinski M, Jacqueline S, Inanc B, Joseph C, Randy G, Doug H, Steven JJ, Marco AM (2009) Circos: an information aesthetic for comparative genomics. Genome Res 19:1639–1645
Kwok CK, Ding Y, Tang Y, Assmann SM, Bevilacqua PC (2013) Determination of in vivo RNA structure in low-abundance transcripts. Nat Commun 4:2971
Lee JT (2009) Lessons from X-chromosome inactivation: long ncRNA as guides and tethers to the epigenome. Genes Dev 23:1831–1842
Li X, Ramchiary N, Dhandapani V, Choi SR, Hur Y, Nou IS, Yoon MK, Lim YP (2013) Quantitative trait loci mapping in Brassica rapa revealed the structural and functional conservation of genetic loci governing morphological and yield component traits in the A, B, and C subgenomes of Brassica species. DNA Res 20:1–6
Li J, Wu B, Xu J, Liu C (2014a) Genome-wide identification and characterization of long intergenic non-coding RNAs in Ganoderma lucidum. PLoS ONE 9:e99442
Li L, Eichten SR, Shimizu R, Petsch K, Yeh CT, Wu W, Chetoor AM, Givan SA, Cole RA, Fowler JE (2014b) Genome-wide discovery and characterization of maize long non-coding RNAs. Genome Biol 1:R40
Liu C, Muchhal US, Raghothama KG (1997) Differential expression of TPS11, a phosphate starvation-induced gene in tomato. Plant Mol Biol 33:867–874
Liu J, Jung C, Xu J, Wang H, Deng S, Bernaud L, Arenas-Huertero C, Chua NH (2012) Genome-wide analysis uncovers regulation of long intergenic noncoding RNAs in Arabidopsis. Plant Cell 24:4333–4345
Lysak MA, Koch MA, Beaulieu JM, Meister A, Leitch IJ (2009) The dynamic ups and downs of genome size evolution in Brassicaceae. Mol Biol Evol 26:85–96
Masoudi NA, Tonomra K, Kawashima S, Moriya Y, Suzuki M, Itoh M, Kanehisa M, Endo T, Goto S (2006) EGassembler: online bioinformatic server for large scale processing, clustering and assembling ESTs and genomic DNA fragments. Nucleic Acids Res 34:W459–W462
Min XJ, Butler G, Storms R, Tsang A (2005) OrfPredictor: predicting protein-coding regions in EST-derived sequences. Nucleic Acids Res 33:W677–W680
Paraskevopoulou MD, Georgakilas G, Kostoulas N et al (2013) DIANA-LncBase: experimentally verified and computationally predicted microRNa targets on long non-coding RNAs. Nucleic Acids Res 41:D239–D245
Perkel JM (2013) Visiting “oncorarnia”. Biotechniques 54:301–304
Piriyapongsa J, Jordan IK (2007) A family of human microRNA genes from miniature inverted-repeat transposable elements. PLoS ONE 2:e203
Piriyapongsa J, Marino-Ramirez L, Jorden IK (2007) Origin and evolution of human microRNAs from transposable elements. Genetics 176:1323–1337
Ponting CP, Pl Oliver, Reik W (2009) Evolution and functions of long noncoding RNAs. Cell 136:629–641
Prasanth KV, Spector DL (2007) Eukaryotic regulatory RNAs: an answer to the ‘genome complexity’ conundrum. Genes Dev 21:11–42
Ranbani M, Kertesz M, Segal E (2008) Computational prediction of RNA structural motifs involved in posttranscriptional regulatory processes. Proc Natl Acad Sci USA 105:14885–14890
Ray D, Kazan H, Cook KB, Weirauch MT, Najafabadi HS, Li X, Gueroussv S, Albu M, Yang A, Na H et al (2013) A compendium of RNA-binding motifs for decosing gene regulation. Nature 499:172–177
Rouskin S, Zubradt M, Washietl S, Kellis M, Weissman JS (2014) Genome-wide probing of RNA structure reveals active unfolding of mRNA structures in vivo. Nature 505:701–705
Smit AFA, Hubley R, Green P (2010). RepeatMAsker Open-3.0. http://www.repeatmasker.org
Susan B, McGinnis KM (2012) Computational identification and functional prediction of long noncoding RNA in Zea mays. PLoS ONE 7(8):e43047
Swiezewski S, Liu F, Magsin A, Dean C (2009) Cold-induced silencing by long antisense transcripts of an Arabidopsis polycomb target. Nature 462:799–802
Tsai MC, Manor O, Wan Y, Mosammaparast N, Wang J, Lan F, Shi Y, Segal E, Chang H (2010) Long noncoding RNA as modular scaffold of histone modification complexes. Science 329:689–693
Villegas VE, Zaphiropoulos PG (2015) Neighboring gene regulation by antisense long non-coding RNAs. Int J Mol Sci 16:3251–3266
Wang WC, Yang YW, Liu B, Sanyal A, Zimmerman RC, Chen Y, Lajoie BR, Protacia A, Flynn RA, Gupta RA, Wysocka J, Lei M, Dekker J, Helms JA, Chang HY (2011) A long noncoding RNA maintains active chromatin to coordinate homeotic gene expression. Nature 472:120–126
Wen J, Parker BJ, Weiller GF (2007) In silico identification and characterization of mRNA-like noncoding transcripts in Medicago truncatula. In Silico Biol 7:485–505
Xiao D, Wang H, Basnet RK, Zhao J, Lin K, Hou X, Bonnema G (2014) Genetic dissection of leaf development in Brassica rapa using a genetical genomics approach. Plant Physiol 163:1309–1325
Yanfei M, Wu F, Yu X, Bai J, Zhong W, He Y (2014) MicroRNA319a-targeted Brassica rapa ssp. pekinensis TCP genes modulate head shape in Chinese cabbage by differential cell division arrest in leaf regions. Plant Physiol 164:710–720
Yang WC, Katinakis P, Hendriks P, Snolders A, Voris DF, Spee J, Kammen VA, Bisseling T, Franssen H (1993) Characterization of GmENOD40, a gene showing novel patterns of cell specific expression during soybean nodule development. Plant J 3:573–585
Yi X, Zhang Z, Ling Y, Xu W, Su Z (2015) PNRD: a plant non-coding RNA database. Nucleic Acids Res 43:D982–D989
Zaratiegui M, Irvin DV, Martienssen RA (2007) Noncoding RNAs and gene silencing. Cell 128:763–776
Zhang YC, Liao JY, Li ZY, Yu Y, Zhang JP, Li QF, Qu LH, Shu WS, Chen YQ (2014) Genome-wide screening and functional analysis identify a large number of long noncoding RNAs involved in the sexual reproduction of rice. Genome Biol 15:512
Zhao J, Sun BK, Erwin JA, Song JJ, Lee JT (2008) Polycomb proteins targeted by a short repeat RNA to the mouse X chromosome. Science 322:750–756
Acknowledgments
This work was supported by Golden Seed Project (No.213002-04-4-SB110), Ministry of Agriculture, Food and Rural Affairs (MAFRA), Ministry of Oceans and Fisheries (MOF), Rural Development Administration (RDA) and Korea Forest Service (KFS). We would also like to thank Dr. Sathishkumar Natarajan, from Sunchon National University for his valuable support.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that there is no conflict of interest in this study.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Paul, P., Dhandapani, V., Choi, S.R. et al. Genome wide identification and functional prediction of long non-coding RNAs in Brassica rapa . Genes Genom 38, 547–555 (2016). https://doi.org/10.1007/s13258-016-0405-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13258-016-0405-7