Abstract
MicroRNAs (miRNAs) are a family of non-coding RNAs that play a central role in fine-tuning gene expression regulation. Over the past decade, identification and annotation of miRNAs have become a major focus in epigenomics research. However, detection and characterization of miRNA are challenging due to its small size (~22 nucleotide-long) and susceptibility to degradation. The difficulties involved in experimental prediction and characterization of miRNA coding genes have led to the development of in silico-based approaches. Although several algorithms have been developed in recent years, a comprehensive assessment of the principles, methodological insights, and estimate of the strengths and weaknesses of computational methods are limited. The present review is dealt with the detailed methodological insights of different tools used for identifying miRNA coding genes falling under four computational approaches. The parameters considered in these tools along with their specificity are also delineated. Furthermore, the strengths and weaknesses of these four computational approaches, and the bioinformatics resources pertaining to target identification, expression analysis, regulatory network analysis, and SNP identification are stated in this review. The methodological details of miRNA prediction methods and bioinformatics resources related to miRNA research in one platform would facilitate the miRNA research community to develop efficient tools for uncovering novel miRNAs and understanding their role in regulatory networks.
Similar content being viewed by others
References
Mattick JS, Makunin IV (2006) Non-coding RNA. Hum Mol Genet 15 Spec No 1:R17–R29
Macfarlane LA, Murphy PR (2010) MicroRNA: biogenesis, function and role in cancer. Curr Genom 11(7):537–561
Davis BN, Hata A (2009) Regulation of MicroRNA Biogenesis: A miRiad of mechanisms. Cell Commun Signal 7:18
Dalmay T (2013) Mechanism of miRNA-mediated repression of mRNA translation. Essays Biochem 54:29–38
Kozomara A, Griffiths-Jones S (2014) miRBase: annotating high confidence microRNAs using deep sequencing data. Nucleic Acids Res 42(Database issue):D68–D73
Jones-Rhoades MW, Bartel DP, Bartel B (2006) MicroRNAS and their regulatory roles in plants. Annu Rev Plant Biol 57:19–53
Stefani G, Slack FJ (2008) Small non-coding RNAs in animal development. Nat Rev Mol Cell Biol 9(3):219–230
Xu P et al (2003) The Drosophila microRNA Mir-14 suppresses cell death and is required for normal fat metabolism. Curr Biol 13(9):790–795
Tsuchiya S, Okuno Y, Tsujimoto G (2006) MicroRNA: biogenetic and functional mechanisms and involvements in cell differentiation and cancer. J Pharmacol Sci 101(4):267–270
Wang Y et al (2009) MicroRNAs in embryonic stem cells. J Cell Physiol 218(2):251–255
Pegoraro M, Tauber E (2008) The role of microRNAs (miRNA) in circadian rhythmicity. J Genet 87(5):505–511
Gantier MP, Sadler AJ, Williams BR (2007) Fine-tuning of the innate immune response by microRNAs. Immunol Cell Biol 85(6):458–462
Poy MN et al (2004) A pancreatic islet-specific microRNA regulates insulin secretion. Nature 432(7014):226–230
Greco SJ, Rameshwar P (2007) MicroRNAs regulate synthesis of the neurotransmitter substance P in human mesenchymal stem cell-derived neuronal cells. Proc Natl Acad Sci USA 104(39):15484–15489
Jopling CL et al (2005) Modulation of hepatitis C virus RNA abundance by a liver-specific MicroRNA. Science 309(5740):1577–1581
Johnson SM et al (2005) RAS is regulated by the let-7 microRNA family. Cell 120(5):635–647
Johnson CD et al (2007) The let-7 microRNA represses cell proliferation pathways in human cells. Cancer Res 67(16):7713–7722
Bussing I, Slack FJ, Grosshans H (2008) let-7 microRNAs in development, stem cells and cancer. Trends Mol Med 14(9):400–409
Eis PS et al (2005) Accumulation of miR-155 and BIC RNA in human B cell lymphomas. Proc Natl Acad Sci USA 102(10):3627–3632
Mendell JT (2008) miRiad roles for the miR-17-92 cluster in development and disease. Cell 133(2):217–222
Volinia S et al (2006) A microRNA expression signature of human solid tumors defines cancer gene targets. Proc Natl Acad Sci USA 103(7):2257–2261
Voorhoeve PM et al (2006) A genetic screen implicates miRNA-372 and miRNA-373 as oncogenes in testicular germ cell tumors. Cell 124(6):1169–1181
Calin GA et al (2002) Frequent deletions and down-regulation of micro- RNA genes miR15 and miR16 at 13q14 in chronic lymphocytic leukemia. Proc Natl Acad Sci USA 99(24):15524–15529
Cheng CY et al (2014) miR-34 cooperates with p53 in suppression of prostate cancer by joint regulation of stem cell compartment. Cell Rep 6(6):1000–1007
Stern-Ginossar N et al (2007) Host immune system gene targeting by a viral miRNA. Science 317(5836):376–381
Lecellier CH et al (2005) A cellular microRNA mediates antiviral defense in human cells. Science 308(5721):557–560
Mendes ND, Freitas AT, Sagot MF (2009) Current tools for the identification of miRNA genes and their targets. Nucleic Acids Res 37(8):2419–2433
Gomes CP et al (2013) A review of computational tools in microRNA discovery. Front Genet 4:81
Hertel J, Langenberger D, Stadler PF (2014) Computational prediction of microRNA genes. Methods Mol Biol 1097:437–456
Lee Y et al (2002) MicroRNA maturation: stepwise processing and subcellular localization. EMBO J 21(17):4663–4670
Lee Y et al (2003) The nuclear RNase III Drosha initiates microRNA processing. Nature 425(6956):415–419
Zeng Y, Cullen BR (2003) Sequence requirements for micro RNA processing and function in human cells. RNA 9(1):112–123
Lund E et al (2004) Nuclear export of microRNA precursors. Science 303(5654):95–98
Zeng Y, Cullen BR (2004) Structural requirements for pre-microRNA binding and nuclear export by Exportin 5. Nucleic Acids Res 32(16):4776–4785
Ketting RF et al (2001) Dicer functions in RNA interference and in synthesis of small RNA involved in developmental timing in C. elegans. Genes Dev 15(20):2654–2659
Khvorova A, Reynolds A, Jayasena SD (2003) Functional siRNAs and miRNAs exhibit strand bias. Cell 115(2):209–216
Schwarz DS et al (2003) Asymmetry in the assembly of the RNAi enzyme complex. Cell 115(2):199–208
Chen X (2005) MicroRNA biogenesis and function in plants. FEBS Lett 579(26):5923–5931
Voinnet O, Origin, biogenesis (2009) and activity of plant microRNAs. Cell 136(4):669–687
Dong H et al (2013) MicroRNA: function, detection, and bioanalysis. Chem Rev 113(8):6207–6233
Szittya G et al (2008) High-throughput sequencing of Medicago truncatula short RNAs identifies eight new miRNA families. BMC Genomics 9:593
Burnside J et al (2008) Deep sequencing of chicken microRNAs. BMC Genom 9:185
Li SC, Pan CY, Lin WC (2006) Bioinformatic discovery of microRNA precursors from human ESTs and introns. BMC Genom 7:164
Lee IH et al (2014) A detailed analysis of next generation sequencing reads of microRNA expression in Barrett’s esophagus: absolute versus relative quantification. BMC Res Notes 7:212
Krol J, Krzyzosiak WJ (2006) Structure analysis of microRNA precursors. Methods Mol Biol 342:19–32
Ohler U et al (2004) Patterns of flanking sequence conservation and a characteristic upstream motif for microRNA gene identification. RNA 10(9):1309–1322
Unver T, Namuth-Covert DM, Budak H (2009) Review of current methodological approaches for characterizing microRNAs in plants. Int J Plant Genom 2009:262463
Legendre M, Lambert A, Gautheret D (2005) Profile-based detection of microRNA precursors in animal genomes. Bioinformatics 21(7):841–845
Thompson JD, Higgins DG, Gibson TJ (1994) CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res 22(22):4673–4680
Hofacker IL, Fekete M, Stadler PF (2002) Secondary structure prediction for aligned RNA sequences. J Mol Biol 319(5):1059–1066
Lambert A et al (2004) The ERPIN server: an interface to profile-based RNA motif identification. Nucleic Acids Res 32(Web Server issue):W160–W165
Hofacker IL (2003) Vienna RNA secondary structure server. Nucleic Acids Res 31(13):3429–3431
Wang X et al (2005) MicroRNA identification based on sequence and structure alignment. Bioinformatics 21(18):3610–3614
Altschul SF et al (1990) Basic local alignment search tool. J Mol Biol 215(3):403–410
Hochsmann M et al (2003) Local similarity in RNA secondary structures. Proc IEEE Comput Soc Bioinform Conf 2:159–168
Lim LP et al (2003) The microRNAs of Caenorhabditis elegans. Genes Dev 17(8):991–1008
Hofacker IL et al (1998) Automatic detection of conserved RNA structure elements in complete RNA virus genomes. Nucleic Acids Res 26(16):3825–3836
Hofacker IL, Stadler PF (1999) Automatic detection of conserved base pairing patterns in RNA virus genomes. Comput Chem 23(3–4):401–414
Lai EC et al (2003) Computational identification of Drosophila microRNA genes. Genome Biol 4(7):R42
Zuker M (2003) Mfold web server for nucleic acid folding and hybridization prediction. Nucleic Acids Res 31(13):3406–3415
Boffelli D (2008) Phylogenetic shadowing sequence comparisons of multiple primate species. Methods Mol Biol 453:217–231
Berezikov E et al (2005) Phylogenetic shadowing and computational identification of human microRNA genes. Cell 120(1):21–24
Olsen GJ et al (1994) fastDNAmL: a tool for construction of phylogenetic trees of DNA sequences using maximum likelihood. Comput Appl Biosci 10(1):41–48
Bonnet E et al (2004) Evidence that microRNA precursors, unlike other non-coding RNAs, have lower folding free energies than random sequences. Bioinformatics 20(17):2911–2917
Bentwich I et al (2005) Identification of hundreds of conserved and nonconserved human microRNAs. Nat Genet 37(7):766–770
Siepel A, Haussler D (2004) Combining phylogenetic and hidden Markov models in biosequence analysis. J Comput Biol 11(2–3):413–428
Schwartz S et al (2003) Human-mouse alignments with BLASTZ. Genome Res 13(1):103–107
Friedlander MR et al (2008) Discovering microRNAs from deep sequencing data using miRDeep. Nat Biotechnol 26(4):407–415
Xue C et al (2005) Classification of real and pseudo microRNA precursors using local structure-sequence features and support vector machine. BMC Bioinformatics 6:310
Ritchie W, Theodule FX, Gautheret D (2008) Mireval: a web tool for simple microRNA prediction in genome sequences. Bioinformatics 24(11):1394–1396
Gao D et al (2013) miREval 2.0: a web tool for simple microRNA prediction in genome sequences. Bioinformatics 29(24):3225–3226
Ritchie W, Legendre M, Gautheret D (2007) RNA stem-loops: to be or not to be cleaved by RNAse III. RNA 13(4):457–462
Hertel J, Stadler PF (2006) Hairpins in a Haystack: recognizing microRNA precursors in comparative genomics data. Bioinformatics 22(14):e197–e202
Gruber AR et al (2010) RNAz 2.0: improved noncoding RNA detection. Pac Symp Biocomput, p. 69–79
Pedersen JS et al (2006) Identification and classification of conserved RNA secondary structures in the human genome. PLoS Comput Biol 2(4):e33
Yousef M et al (2006) Combining multi-species genomic data for microRNA identification using a Naive Bayes classifier. Bioinformatics 22(11):1325–1334
Ding, J., S. Zhou, and J. Guan (2010) MiRenSVM: towards better prediction of microRNA precursors using an ensemble SVM classifier with multi-loop features. BMC Bioinform 11(Suppl 11):S11
Gkirtzou K et al (2010) MatureBayes: a probabilistic algorithm for identifying the mature miRNA within novel precursors. PLoS One 5(8):e11843
Nam JW et al (2005) Human microRNA prediction through a probabilistic co-learning model of sequence and structure. Nucleic Acids Res 33(11):3570–3581
Wu Y et al (2011) MiRPara: a SVM-based software tool for prediction of most probable microRNA coding regions in genome scale sequences. BMC Bioinformatics 12:107
Markham NR, Zuker M (2008) UNAFold: software for nucleic acid folding and hybridization. Methods Mol Biol 453:3–31
Xuan P et al (2011) MaturePred: efficient identification of microRNAs within novel plant pre-miRNAs. PLoS One 6(11):e27422
Shen W et al (2012) MicroRNA prediction using a fixed-order Markov model based on the secondary structure pattern. PLoS One 7(10):e48236
He C et al (2012) MiRmat: mature microRNA sequence prediction. PLoS One 7(12):e51673
Tran Vdu T et al (2015) miRBoost: boosting support vector machines for microRNA precursor classification. RNA 21(5):775–785
Tempel S, Tahi F (2012) A fast ab-initio method for predicting miRNA precursors in genomes. Nucleic Acids Res 40(11):e80
Batuwita R, Palade V (2009) microPred: effective classification of pre-miRNAs for human miRNA gene prediction. Bioinformatics 25(8):989–995
Xu Y, Zhou X, Zhang W (2008) MicroRNA prediction with a novel ranking algorithm based on random walks. Bioinformatics 24(13):i50–i58
Sewer A et al (2005) Identification of clustered microRNAs using an ab initio prediction method. BMC Bioinformatics 6:267
Jiang P et al (2007) MiPred: classification of real and pseudo microRNA precursors using random forest prediction model with combined features. Nucleic Acids Res 35(Web Server issue):W339–W344
Higashi S et al (2015) Mirinho: an efficient and general plant and animal pre-miRNA predictor for genomic and deep sequencing data. BMC Bioinformatics 16:179
Langmead B et al (2009) Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol 10(3):R25
Needleman SB, Wunsch CD (1970) A general method applicable to the search for similarities in the amino acid sequence of two proteins. J Mol Biol 48(3):443–453
Mathews DH et al (2004) Incorporating chemical modification constraints into a dynamic programming algorithm for prediction of RNA secondary structure. Proc Natl Acad Sci USA 101(19):7287–7292
Nussinov R et al (1978) Algorithms for loop matchings. SIAM J Appl Math 35(1):68–82
Mathelier A, Carbone A (2010) MIReNA: finding microRNAs with high accuracy and no learning at genome scale and from deep sequencing data. Bioinformatics 26(18):2226–2234
Agarwal S et al (2010) Prediction of novel precursor miRNAs using a context-sensitive hidden Markov model (CSHMM). BMC Bioinform 11(Suppl 1):S29
Tyagi S et al (2008) CID-miRNA: a web server for prediction of novel miRNA precursors in human genome. Biochem Biophys Res Commun 372(4):831–834
Grundhoff A, Sullivan CS, Ganem D (2006) A combined computational and microarray-based approach identifies novel microRNAs encoded by human gamma-herpesviruses. RNA 12(5):733–750
Tav C et al (2016) miRNAFold: a web server for fast miRNA precursor prediction in genomes. Nucleic Acids Res 44(W1):W181–W184
Zhang B et al (2006) Plant microRNA: a small regulatory molecule with big impact. Dev Biol 289(1):3–16
Lewis BP, Burge CB, Bartel DP (2005) Conserved seed pairing, often flanked by adenosines, indicates that thousands of human genes are microRNA targets. Cell 120(1):15–20
Cai Y et al (2009) A brief review on the mechanisms of miRNA regulation. Genom Proteom Bioinform 7(4):147–154
Agarwal V et al (2015) Predicting effective microRNA target sites in mammalian mRNAs. Elife 4
Kruger J, Rehmsmeier M (2006) RNAhybrid: microRNA target prediction easy, fast and flexible. Nucleic Acids Res 34(Web Server issue):W451–W454
Lall S et al (2006) A genome-wide map of conserved microRNA targets in C. elegans. Curr Biol 16(5):460–471
Hsu JB et al (2011) miRTar: an integrated system for identifying miRNA-target interactions in human. BMC Bioinform 12:300
Dweep H et al (2011) miRWalk–database: prediction of possible miRNA binding sites by “walking” the genes of three genomes. J Biomed Inform 44(5):839–847
Loher P, Rigoutsos I (2012) Interactive exploration of RNA22 microRNA target predictions. Bioinformatics 28(24):3322–3323
Paraskevopoulou MD et al (2013) DIANA-microT web server v5.0: service integration into miRNA functional analysis workflows. Nucleic Acids Res 41(Web Server issue):W169–W173
Fahlgren N et al (2007) High-throughput sequencing of Arabidopsis microRNAs: evidence for frequent birth and death of MIRNA genes. PLoS One 2(2):e219
Dai X, Zhao PX (2011) psRNATarget: a plant small RNA target analysis server. Nucleic Acids Res 39(Web Server issue):W155–W159
Bonnet E et al (2010) TAPIR, a web server for the prediction of plant microRNA targets, including target mimics. Bioinformatics 26(12):1566–1568
Xie F, Zhang B (2010) Target-align: a tool for plant microRNA target identification. Bioinformatics 26(23):3002–3003
Milev I et al (2011) miRTour: plant miRNA and target prediction tool. Bioinformation 6(6):248–249
Jha A, Shankar R (2011) Employing machine learning for reliable miRNA target identification in plants. BMC Genomics 12:636
Wu HJ et al (2012) PsRobot: a web-based plant small RNA meta-analysis toolbox. Nucleic Acids Res 40(Web Server issue):W22–W28
Dweep H, Sticht C, Gretz N (2013) In-silico algorithms for the screening of possible microRNA binding sites and their interactions. Curr Genom 14(2):127–136
Wong N, Wang X (2015) miRDB: an online resource for microRNA target prediction and functional annotations. Nucleic Acids Res 43(Database issue):D146–D152
Hsu SD et al (2011) miRTarBase: a database curates experimentally validated microRNA-target interactions. Nucleic Acids Res 39(Database issue):D163–D169
Griffiths-Jones S et al (2008) miRBase: tools for microRNA genomics. Nucleic Acids Res 36(Database issue):D154–D158
Xiao F et al (2009) miRecords: an integrated resource for microRNA-target interactions. Nucleic Acids Res 37(Database issue):D105–D110
Blin K et al (2015) DoRiNA 2.0–upgrading the doRiNA database of RNA interactions in post-transcriptional regulation. Nucleic Acids Res 43(Database issue):D160–D167
Le Brigand K et al (2010) MiRonTop: mining microRNAs targets across large scale gene expression studies. Bioinformatics 26(24):3131–3132
Alexiou P et al (2010) The DIANA-mirExTra web server: from gene expression data to microRNA function. PLoS One 5(2):e9171
Kaya KD et al (2011) mESAdb: microRNA expression and sequence analysis database. Nucleic Acids Res 39(Database issue):D170–D180
Cho S et al (2013) MiRGator v3.0: a microRNA portal for deep sequencing, expression profiling and mRNA targeting. Nucleic Acids Res 41(Database issue):D252–D257
Sales G et al (2010) MAGIA, a web-based tool for miRNA and genes integrated analysis. Nucleic Acids Res 38(Web Server issue): W352–W359
Huang GT, Athanassiou C, Benos PV (2011) mirConnX: condition-specific mRNA-microRNA network integrator. Nucleic Acids Res 39(Web Server issue):W416–W423
Gennarino VA et al (2012) Identification of microRNA-regulated gene networks by expression analysis of target genes. Genome Res 22(6):1163–1172
Chiromatzo AO et al (2007) miRNApath: a database of miRNAs, target genes and metabolic pathways. Genet Mol Res 6(4):859–865
Kowarsch A et al (2011) miTALOS: analyzing the tissue-specific regulation of signaling pathways by human and mouse microRNAs. RNA 17(5):809–819
Lu TP et al (2012) miRSystem: an integrated system for characterizing enriched functions and pathways of microRNA targets. PLoS One 7(8):e42390
Papadopoulos GL et al (2009) DIANA-mirPath: Integrating human and mouse microRNAs in pathways. Bioinformatics 25(15):1991–1993
Gaidatzis D et al (2007) Inference of miRNA targets using evolutionary conservation and pathway analysis. BMC Bioinform 8:69
Jiang Q et al (2009) miR2Disease: a manually curated database for microRNA deregulation in human disease. Nucleic Acids Res 37(Database issue):D98–D104
Lagana A et al (2009) miRo: a miRNA knowledge base. Database (Oxford) 2009:bap008
Ruepp A et al (2010) PhenomiR: a knowledgebase for microRNA expression in diseases and biological processes. Genome Biol 11(1):R6
Xie B et al (2013) miRCancer: a microRNA-cancer association database constructed by text mining on literature. Bioinformatics 29(5):638–644
Wang D et al (2014) OncomiRDB: a database for the experimentally verified oncogenic and tumor-suppressive microRNAs. Bioinformatics 30(15):2237–2238
Li Y et al (2014) HMDD v2.0: a database for experimentally supported human microRNA and disease associations. Nucleic Acids Res 42(Database issue):D1070–D1074
Gong J et al (2012) Genome-wide identification of SNPs in microRNA genes and the SNP effects on microRNA target binding and biogenesis. Hum Mutat 33(1):254–263
Sherry ST, Ward M, Sirotkin K (1999) dbSNP-database for single nucleotide polymorphisms and other classes of minor genetic variation. Genome Res 9(8):677–679
Hiard S et al (2010) Patrocles: a database of polymorphic miRNA-mediated gene regulation in vertebrates. Nucleic Acids Res 38(Database issue):D640–D651
Bruno AE et al (2012) miRdSNP: a database of disease-associated SNPs and microRNA target sites on 3′UTRs of human genes. BMC Genom 13:44
Liu C et al (2012) MirSNP, a database of polymorphisms altering miRNA target sites, identifies miRNA-related SNPs in GWAS SNPs and eQTLs. BMC Genom 13:661
Barenboim M et al (2010) MicroSNiPer: a web tool for prediction of SNP effects on putative microRNA targets. Hum Mutat 31(11):1223–1232
Thomas LF, Saito T, Saetrom P (2011) Inferring causative variants in microRNA target sites. Nucleic Acids Res 39(16):e109
Bhattacharya A, Ziebarth JD, Cui Y (2014) PolymiRTS Database 3.0: linking polymorphisms in microRNAs and their target sites with human diseases and biological pathways. Nucleic Acids Res 42(Database issue):D86–D91
Bhattacharya A, Ziebarth JD, Cui Y (2013) SomamiR: a database for somatic mutations impacting microRNA function in cancer. Nucleic Acids Res 41(Database issue):D977–D982
Bhattacharya A, Cui Y (2015) miR2GO: comparative functional analysis for microRNAs. Bioinformatics 31(14):2403–2405
van Rooij E, Kauppinen S (2014) Development of microRNA therapeutics is coming of age. EMBO Mol Med 6(7):851–864
Acknowledgements
AR is grateful to Pondicherry University for the pre-doctoral fellowship. Authors are thankful to P. Manivel for critical reading and his valuable suggestions. The authors also thank the Department of Biotechnology, Department of Science and Technology, and University Grants Commission, Government of India, for supporting the research work in Centre for Bioinformatics.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
Authors have no conflict of interest.
Rights and permissions
About this article
Cite this article
Rajendiran, A., Chatterjee, A. & Pan, A. Computational Approaches and Related Tools to Identify MicroRNAs in a Species: A Bird’s Eye View. Interdiscip Sci Comput Life Sci 10, 616–635 (2018). https://doi.org/10.1007/s12539-017-0223-x
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12539-017-0223-x