“Hypothesis for the Modern RNA World”: A pervasive Non-coding RNA-Based Genetic Regulation is a Prerequisite for the Emergence of Multicellular Complexity
- 510 Downloads
- 7 Citations
Abstract
The transitions to multicellularity mark the most pivotal and distinctive events in life’s history on Earth. Although several transitions to “simple” multicellularity (SM) have been recorded in both bacterial and eukaryotic clades, transitions to complex multicellularity (CM) have only happened a few times in eukaryotes. A large number of cell types (associated with large body size), increased energy consumption per gene expressed, and an increment of non-protein-coding DNA positively correlate with CM. These three factors can indeed be understood as the causes and consequences of the regulation of gene expression. Here, we discuss how a vast expansion of non-protein-coding RNA (ncRNAs) regulators rather than large numbers of novel protein regulators can easily contribute to the emergence of CM. We also propose that the evolutionary advantage of RNA-based gene regulation derives from the robustness of the RNA structure that makes it easy to combine genetic drift with functional exploration. We describe a model which aims to explain how the evolutionary dynamic of ncRNAs becomes dominated by the accessibility of advantageous mutations to innovate regulation in complex multicellular organisms. The information and models discussed here outline the hypothesis that pervasive ncRNA-based regulatory systems, only capable of being expanded and explored in higher eukaryotes, are prerequisite to complex multicellularity. Thereby, regulatory RNA molecules in Eukarya have allowed intensification of morphological complexity by stabilizing critical phenotypes and controlling developmental precision. Although the origin of RNA on early Earth is still controversial, it is becoming clear that once RNA emerged into a protocellular system, its relevance within the evolution of biological systems has been greater than we previously thought.
Keywords
Modern RNA world Multicellular complexity Eukaryote evolution Genome complexity Non-coding RNA Gene regulationNotes
Acknowledgements
IL-C is funded by the fellowship 185993 from the National Council of Science and Technology of Mexico. We thank Christian Arnold and an anonymous referee for valuable comments and suggestions. We dedicate this article to the memory of Lynn Margulis, whose work has allowed us to go forward on the understanding of the origin and evolution of complex life on Earth.
References
- Aguirre J, Rios-Momberg M, Hewitt D et al (2005) Reactive oxygen species and development in microbial eukaryotes. Trends Microbiol 13:111–118PubMedGoogle Scholar
- Amar L, Chen CL, Zhou H et al (2009) Genome-wide evolutionary analysis of the noncoding RNA genes and noncoding DNA of Paramecium tetraurelia. RNA 15:503–514PubMedGoogle Scholar
- Arendt D, Christodoulou F, Raible F et al (2010) Ancient animal microRNAs and the evolution of tissue identity. Nature 463:1084–U1105PubMedGoogle Scholar
- Babu MM, Teichmann SA, Aravind L (2006) Evolutionary dynamics of prokaryotic transcriptional regulatory networks. J Mol Biol 358:614–633Google Scholar
- Banfield W, Woke PA, MacKay CM, Cooper HL (1965) Mosquito transmission of a reticulum cell sarcoma of hamsters. Science 148:1239–1240PubMedGoogle Scholar
- Bartel DP, Nodine MD (2010) MicroRNAs prevent precocious gene expression and enable pattern formation during plant embryogenesis. Genes Dev 24:2678–2692PubMedGoogle Scholar
- Bell G, Mooers AO (1997) Size and complexity among multicellular organisms. Biol J Linn Soc 60:345–363Google Scholar
- Benton MJ, Ayala FJ (2003) Dating the tree of life. Science 300:1698–1700PubMedGoogle Scholar
- Bernstein E, Kim SY, Carmell MA et al (2003) Dicer is essential for mouse development. Nat Genet 35:215–217PubMedGoogle Scholar
- Berretta J, Morillon A (2009) Pervasive transcription constitutes a new level of eukaryotic genome regulation. Embo Reports 10:973–982PubMedGoogle Scholar
- Bistis GN, Perkins David D, Read Nick D (2003) Different cell types in Neurospora crassa. Fungal Genetics Newsletter 50:17–19Google Scholar
- Blackstone NW (2000) Redox control and the evolution of multicellularity. Bioessays 22:947–953PubMedGoogle Scholar
- Bocobza SE, Aharoni A (2008) Switching the light on plant riboswitches. Trends Plant Sci 13:526–533PubMedGoogle Scholar
- Bonner JT (1998) The origins of multicellularity. Integr Biol 1:28–36Google Scholar
- Bonner JT (2004) Perspective: the size-complexity rule. Evolution 58:1883–1890PubMedGoogle Scholar
- Bowman JL, Floyd SK (2007) The ancestral developmental tool kit of land plants. Int J Plant Sci 168:1–35Google Scholar
- Cabili MN, Trapnell C, Goff L et al (2011) Integrative annotation of human large intergenic noncoding RNAs reveals global properties and specific subclasses. Genes Dev 25:1915–1927PubMedGoogle Scholar
- Carrington JC, Allen E, Xie ZX et al (2004) Evolution of microRNA genes by inverted duplication of target gene sequences in Arabidopsis thaliana. Nat Genet 36:1282–1290PubMedGoogle Scholar
- Carrington JC, Fahlgren N, Howell MD et al (2007) High-throughput sequencing of Arabidopsis microRNAs: evidence for frequent birth and death of MIRNA genes. PLoS One 2:e219PubMedGoogle Scholar
- Carrington JC, Cuperus JT, Fahlgren N (2011) Evolution and functional diversification of MIRNA genes. Plant Cell 23:431–442PubMedGoogle Scholar
- Carroll SB (2001) Chance and necessity: the evolution of morphological complexity and diversity. Nature 409:1102–1109PubMedGoogle Scholar
- Cases I, de Lorenzo V, Ouzounis CA (2003) Transcription regulation and environmental adaptation in bacteria. Trends Microbiol 11:248–253PubMedGoogle Scholar
- Cheah MT, Wachter A, Sudarsan N et al (2007) Control of alternative RNA splicing and gene expression by eukaryotic riboswitches. Nature 447:497–500PubMedGoogle Scholar
- Chu C, Qu K, Zhong FL et al (2011) Genomic maps of long noncoding RNA occupancy reveal principles of RNA-chromatin interactions. Molecular Cell 44:667–678PubMedGoogle Scholar
- Clark MB, Amaral PP, Schlesinger FJ et al (2011) The reality of pervasive transcription. Plos Biology 9:e1000625PubMedGoogle Scholar
- Cobb BS, Nesterova TB, Thompson E et al (2005) T cell lineage choice and differentiation in the absence of the RNase III enzyme Dicer. J Exp Med 201:1367–1373PubMedGoogle Scholar
- Cock JM, Sterck L, Rouze P et al (2010) The Ectocarpus genome and the independent evolution of multicellularity in brown algae. Nature 465:617–621PubMedGoogle Scholar
- Condorelli G, Dimmeler S (2008) MicroRNAs: components of an integrated system controlling cardiac development, physiology, and disease pathogenesis. Cardiovasc Res 79:551–552PubMedGoogle Scholar
- Costa FF (2005) Non-coding RNAs: new players in eukaryotic biology. Gene 357:83–94PubMedGoogle Scholar
- de Meaux J, Hu JY, Tartler U et al (2008) Structurally different alleles of the ath-MIR824 microRNA precursor are maintained at high frequency in Arabidopsis thaliana. Proc Natl Acad Sci U S A 105:8994–8999PubMedGoogle Scholar
- DeLong JP, Okie JG, Moses ME et al (2010) Shifts in metabolic scaling, production, and efficiency across major evolutionary transitions of life. Proc Natl Acad Sci U S A 107:12941–12945PubMedGoogle Scholar
- Deng XW, Li L, Wang XF et al (2006) Genome-wide transcription analyses in rice using tiling microarrays. Nat Genet 38:124–129PubMedGoogle Scholar
- Donoghue PCJ, Heimberg AM, Sempere LF et al (2008) MicroRNAs and the advent of vertebrate morphological complexity. Proc Natl Acad Sci U S A 105:2946–2950PubMedGoogle Scholar
- Duret L, Chureau C, Samain S et al (2006) The Xist RNA gene evolved in eutherians by pseudogenization of a protein-coding gene. Science 312:1653–1655PubMedGoogle Scholar
- Erwin DH (2009) Early origin of the bilaterian developmental toolkit. Phil Trans Roy Soc B Biol Sci 364:2253–2261Google Scholar
- Fontana W, Schuster P (1998) Continuity in evolution: on the nature of transitions. Science 280:1451–1455PubMedGoogle Scholar
- Gerstein MB, Mu XJ, Lu ZJ et al (2011) Analysis of genomic variation in non-coding elements using population-scale sequencing data from the 1000 Genomes Project. Nucleic Acids Res 39:7058–7076PubMedGoogle Scholar
- Gingeras TR, Kapranov P, Cheng J et al (2007) RNA maps reveal new RNA classes and a possible function for pervasive transcription. Science 316:1484–1488PubMedGoogle Scholar
- Giraldez AJ, Cinalli RM, Glasner ME et al (2005) MicroRNAs regulate brain morphogenesis in zebrafish. Science 308:833–838PubMedGoogle Scholar
- Gregory TR (2005) Synergy between sequence and size in large-scale genomics. Nat Rev Genet 6:699–708PubMedGoogle Scholar
- Grosberg RK, Strathmann RR (2007) The evolution of multicellularity: a minor major transition? Annu Rev Ecol Evol Syst 38:621–654Google Scholar
- Gruber AR, Kilgus C, Mosig A et al (2008) Arthropod 7SK RNA. Mol Biol Evol 25:1923–1930PubMedGoogle Scholar
- Guo XY, Zhang ZL, Gerstein MB et al (2009) Small RNAs originated from pseudogenes: cis- or trans-Acting? Plos Comput Biol 5:e1000449PubMedGoogle Scholar
- Hampl V, Hug L, Leigh JW et al (2009) Phylogenomic analyses support the monophyly of Excavata and resolve relationships among eukaryotic "supergroups". Proc Natl Acad Sci U S A 106:3859–3864PubMedGoogle Scholar
- Harfe BD, McManus MT, Mansfield JH et al (2005) The RNaseIII enzyme Dicer is required for morphogenesis but not patterning of the vertebrate limb. Proc Natl Acad Sci U S A 102:10898–10903PubMedGoogle Scholar
- Hedges SB (2002) The origin and evolution of model organisms. Nat Rev Genet 3:838–849PubMedGoogle Scholar
- Hedges SB, Blair JE, Venturi ML et al (2004) A molecular timescale of eukaryote evolution and the rise of complex multicellular life. BMC Evol Biol 4:2PubMedGoogle Scholar
- Hiller M, Findeiss S, Lein S et al (2009) Conserved introns reveal novel transcripts in Drosophila melanogaster. Genome Res 19:1289–1300PubMedGoogle Scholar
- Holland HD (2006) The oxygenation of the atmosphere and oceans. Phil Trans Roy Soc B Biol Sci 361:903–915Google Scholar
- Hornstein E, Shomron N (2006) Canalization of development by microRNAs. Nat Genet 38:S20–S24PubMedGoogle Scholar
- Huynen MA (1996) Exploring phenotype space through neutral evolution. J Mol Evol 43:165–169PubMedGoogle Scholar
- Huynen MA, Stadler PF, Fontana W (1996) Smoothness within ruggedness: the role of neutrality in adaptation. Proc Natl Acad Sci U S A 93:397–401PubMedGoogle Scholar
- Jacquier A (2009) The complex eukaryotic transcriptome: unexpected pervasive transcription and novel small RNAs. Nat Rev Genet 10:833–844PubMedGoogle Scholar
- Joyce GF (2002) The antiquity of RNA-based evolution. Nature 418:214–221PubMedGoogle Scholar
- Kaiser D (2001) Building a multicellular organism. Annu Rev Genet 35:103–123PubMedGoogle Scholar
- Kapranov P, St Laurent G, Raz T et al (2010) The majority of total nuclear-encoded non-ribosomal RNA in a human cell is 'dark matter' un-annotated RNA. BMC Biol 8:149PubMedGoogle Scholar
- Kazazian HH (2004) Mobile elements: drivers of genome evolution. Science 303:1626–1632PubMedGoogle Scholar
- Kim, ED and Sung, S (2011) Long noncoding RNA: unveiling hidden layer of gene regulatory networks. Trends Plant Sci (in press)Google Scholar
- Kim VN, Han J, Siomi MC (2009) Biogenesis of small RNAs in animals. Nat Rev Mol Cell Biol 10:126–139PubMedGoogle Scholar
- King N (2004) The unicellular ancestry of animal development. Dev Cell 7:313–325PubMedGoogle Scholar
- King N, Westbrook MJ, Young SL et al (2008) The genome of the choanoflagellate Monosiga brevicollis and the origin of metazoans. Nature 451:783–788PubMedGoogle Scholar
- Kishore S, Stamm S (2006) Regulation of alternative splicing by snoRNAs. Cold Spring Harb Symp Quant Biol 71:329–334PubMedGoogle Scholar
- Knoll AH (2011) The multiple origins of complex multicellularity. Annu Rev Earth Planet Sci 39:217–239Google Scholar
- Kolter R, Branda SS, Gonzalez-Pastor JE et al (2001) Fruiting body formation by Bacillus subtilis. Proc Natl Acad Sci U S A 98:11621–11626PubMedGoogle Scholar
- Kong FX, Yang Z, Yang Z et al (2009) Benefits and costs of the grazer-induced colony formation in Microcystis aeruginosa. Ann Limnol-Int J Lim 45:203–208Google Scholar
- Konstantinidis KT, Tiedje JM (2004) Trends between gene content and genome size in prokaryotic species with larger genomes. Proc Natl Acad Sci U S A 101:3160–3165PubMedGoogle Scholar
- Koonin EV, Fedorova ND, Jackson JD et al (2004) A comprehensive evolutionary classification of proteins encoded in complete eukaryotic genomes. Genome Biol 5:R7PubMedGoogle Scholar
- Lai EC, Liu N, Okamura K et al (2008) The evolution and functional diversification of animal microRNA genes. Cell Res 18:985–996PubMedGoogle Scholar
- Lane N, Martin W (2010) The energetics of genome complexity. Nature 467:929–934PubMedGoogle Scholar
- Lesser MP (2006) Oxidative stress in marine environments: biochemistry and physiological ecology. Annu Rev Physiol 68:253–278PubMedGoogle Scholar
- Lin HF, Gangaraju VK (2009) MicroRNAs: key regulators of stem cells. Nat Rev Mol Cell Biol 10:116–125PubMedGoogle Scholar
- Liu Y, Lee HC, Li LD et al (2010) Diverse pathways generate microRNA-like RNAs and dicer-independent small interfering RNAs in fungi. Molecular Cell 38:803–814PubMedGoogle Scholar
- Lozada-Chavez I, Janga SC, Collado-Vides J (2006) Bacterial regulatory networks are extremely flexible in evolution. Nucleic Acids Res 34:3434–3445PubMedGoogle Scholar
- Lozada-Chavez I, Angarica VE, Collado-Vides J et al (2008) The role of DNA-binding specificity in the evolution of bacterial regulatory networks. J Mol Biol 379:627–643PubMedGoogle Scholar
- Lu J, Fu YG, Kumar S et al (2008) Adaptive evolution of newly emerged micro-RNA genes in Drosophila. Mol Biol Evol 25:929–938PubMedGoogle Scholar
- Lurling M, Van Donk E (1999) Grazer-induced colony formation in Scenedesmus acutus (Chlorophyceae): ecomorph expression at different temperatures. J Phycol 35:1120–1126Google Scholar
- Lynch M (2006) The origins of eukaryotic gene structure. Mol Biol Evol 23:450–468PubMedGoogle Scholar
- Lynch M, Conery JS (2003) The origins of genome complexity. Science 302:1401–1404PubMedGoogle Scholar
- Lynch M, Bobay LM, Catania F et al (2011) The repatterning of eukaryotic genomes by random genetic drift. Annu Rev Genomics Hum Genet 12:347–366PubMedGoogle Scholar
- Marques AC, Ponting CP (2009) Catalogues of mammalian long noncoding RNAs: modest conservation and incompleteness. Genome Biol 10:R124PubMedGoogle Scholar
- Martinez-Antonio A, Collado-Vides J (2003) Identifying global regulators in transcriptional regulatory networks in bacteria. Curr Opin Microbiol 6:482–489PubMedGoogle Scholar
- Mattick JS, Taft RJ, Pheasant M (2007) The relationship between non-protein-coding DNA and eukaryotic complexity. Bioessays 29:288–299PubMedGoogle Scholar
- Mattick JS, Amaral PP, Dinger ME et al (2008) The eukaryotic genome as an RNA machine. Science 319:1787–1789PubMedGoogle Scholar
- Mattick JS, Mercer TR, Dinger ME (2009) Long non-coding RNAs: insights into functions. Nat Rev Genet 10:155–159PubMedGoogle Scholar
- McCarthy MC, Enquist BJ (2005) Organismal size, metabolism and the evolution of complexity in metazoans. Evol Ecol Res 7:681–696Google Scholar
- Medina M, Collins AG, Taylor JW, Valentine JW, Lipps JH, Amaral-Zettler L, Sogin ML (2003) Phylogeny of Opisthokonta and the evolution of multicellularity and complexity in Fungi and Metazoa. Int J Astrobiol 2:203–211Google Scholar
- Millar AA, Waterhouse PM (2005) Plant and animal microRNAs: similarities and differences. Funct Integr Genomics 5:129–135PubMedGoogle Scholar
- Mosig A, Zhu L, Stadler PF (2009) Customized strategies for discovering distant ncRNA homologs. Brief Funct Genomic Proteomic 8:451–460PubMedGoogle Scholar
- Niklas KJ (2000) The evolution of plant body plans - A biomechanical perspective. Ann Bot 85:411–438Google Scholar
- Nilsen TW, Graveley BR (2010) Expansion of the eukaryotic proteome by alternative splicing. Nature 463:457–463PubMedGoogle Scholar
- Ochman H, Davalos LM (2006) The nature and dynamics of bacterial genomes. Science 311:1730–1733PubMedGoogle Scholar
- Ohta T (1973) Slightly deleterious mutant substitutions in evolution. Nature 246:96–98PubMedGoogle Scholar
- Ohta T (1992) The nearly neutral theory of molecular evolution. Annu Rev Ecol Syst 23:263–286Google Scholar
- Pain A, Mourier T, Carret C et al (2008) Genome-wide discovery and verification of novel structured RNAs in Plasmodium falciparum. Genome Res 18:281–292PubMedGoogle Scholar
- Pauli A, Rinn JL, Schier AF (2011) Non-coding RNAs as regulators of embryogenesis. Nat Rev Genet 12:136–149PubMedGoogle Scholar
- Pearse AM, Swift K (2006) Transmission of devil facial-tumour disease - An uncanny similarity in the karyotype of these malignant tumours means that they could be infective. Nature 439:549–549PubMedGoogle Scholar
- Peterlin BM, Brogie JE, Price DH (2012) 7SK snRNA: a noncoding RNA that plays a major role in regulating eukaryotic transcription. Wiley Interdiscip Rev RNA 3:92–103PubMedGoogle Scholar
- Peterson KJ, Dietrich MR, McPeek MA (2009) MicroRNAs and metazoan macroevolution: insights into canalization, complexity, and the Cambrian explosion. Bioessays 31:736–747PubMedGoogle Scholar
- Ponting CP, Ponjavic J, Lunter G (2007) Functionality or transcriptional noise? Evidence for selection within long noncoding RNAs. Genome Res 17:556–565PubMedGoogle Scholar
- Ponting CP, Oliver PL, Reik W (2009) Evolution and functions of long noncoding RNAs. Cell 136:629–641PubMedGoogle Scholar
- Rajewsky N, Chen K (2007) The evolution of gene regulation by transcription factors and microRNAs. Nat Rev Genet 8:93–103PubMedGoogle Scholar
- Ren B (2010) Enhancers make non-coding RNA. Nature 465:173–174PubMedGoogle Scholar
- Repoila F, Darfeuille F (2009) Small regulatory non-coding RNAs in bacteria: physiology and mechanistic aspects. Biology of the Cell 101:117–131PubMedGoogle Scholar
- Robertson MP, Joyce GF (2010) The origins of the RNA world. Cold Spring Harb Perspect Biol. doi: 10.1101/cshperspect.a003608
- Rokas A (2008) The origins of multicellularity and the early history of the genetic toolkit for animal development. Annu Rev Genet 42:235–251PubMedGoogle Scholar
- Rose D, Hiller M, Schutt K et al (2011) Computational discovery of human coding and non-coding transcripts with conserved splice sites. Bioinformatics 27:1894–1900PubMedGoogle Scholar
- Roush S, Slack FJ (2008) The let-7 family of microRNAs. Trends Cell Biol 18:505–516PubMedGoogle Scholar
- Schuster P, Fontana W, Stadler PF et al (1994) From sequences to shapes and back: a case study in RNA secondary structures. Proc Biol Sci 255:279–284PubMedGoogle Scholar
- Scott MS, Ono M (2011) From snoRNA to miRNA: dual function regulatory non-coding RNAs. Biochimie 93:1987–1992PubMedGoogle Scholar
- Sharma CM, Hoffmann S, Darfeuille F et al (2010) The primary transcriptome of the major human pathogen Helicobacter pylori. Nature 464:250–255PubMedGoogle Scholar
- Specht CD, Bartlett ME (2009) Flower evolution: the origin and subsequent diversification of the angiosperm flower. Annu Rev Ecol Evol Syst 40:217–243Google Scholar
- Spector DL, Prasanth KV (2007) Eukaryotic regulatory RNAs: an answer to the 'genome complexity' conundrum. Genes Dev 21:11–42PubMedGoogle Scholar
- Spector DL, Wilusz JE, Sunwoo H (2009) Long noncoding RNAs: functional surprises from the RNA world. Genes Dev 23:1494–1504PubMedGoogle Scholar
- Srivastava M, Simakov O, Chapman J et al (2010) The Amphimedon queenslandica genome and the evolution of animal complexity. Nature 466:720–U723PubMedGoogle Scholar
- Storz G, Waters LS (2009) Regulatory RNAs in Bacteria. Cell 136:615–628PubMedGoogle Scholar
- Strathmann R (1991) From metazoan to protist via competition among cell lineages. Evol Theor 10:67–70Google Scholar
- Sudarsan N, Barrick JE, Breaker RR (2003) Metabolite-binding RNA domains are present in the genes of eukaryotes. RNA 9:644–647PubMedGoogle Scholar
- Tisseur M, Kwapisz M, Morillon A (2011) Pervasive transcription - Lessons from yeast. Biochimie 93:1889–1896PubMedGoogle Scholar
- Tomitani A, Knoll AH, Cavanaugh CM et al (2006) The evolutionary diversification of cyanobacteria: molecular-phylogenetic and paleontological perspectives. Proc Natl Acad Sci U S A 103:5442–5447PubMedGoogle Scholar
- Valentine JW, Collins AG, Meyer CP (1994) Morphological complexity increase in metazoans. Paleobiology 20:131–142Google Scholar
- van Bakel H, Nislow C, Blencowe BJ et al (2010) Most "Dark Matter'' transcripts are associated with known genes. Plos Biology 8:e1000371PubMedGoogle Scholar
- van Bakel H, Nislow C, Blencowe BJ et al (2011) Response to "The reality of pervasive transcription". Plos Biology 9:e1001102Google Scholar
- Velicer GJ, Kroos L, Lenski RE (1998) Loss of social behaviors by Myxococcus xanthus during evolution in an unstructured habitat. Proc Natl Acad Sci U S A 95:12376–12380PubMedGoogle Scholar
- Voinnet O (2009) Origin, biogenesis, and activity of plant MicroRNAs. Cell 136:669–687PubMedGoogle Scholar
- Wagner A (2005) Energy constraints on the evolution of gene expression. Mol Biol Evol 22:1365–1374PubMedGoogle Scholar
- Wang SM, Lu J, Shen Y et al (2008) The birth and death of microRNA genes in Drosophila. Nat Genet 40:351–355PubMedGoogle Scholar
- Weiss RA, Murgia C, Pritchard JK et al (2006) Clonal origin and evolution of a transmissible cancer. Cell 126:477–487PubMedGoogle Scholar
- Wolpert L, Szathmary E (2002) Multicellularity: evolution and the egg. Nature 420:745–745PubMedGoogle Scholar
- Yano Y, Saito R, Yoshida N et al (2004) A new role for expressed pseudogenes as ncRNA: regulation of mRNA stability of its homologous coding gene. J Mol Med-Jmm 82:414–422Google Scholar
- Zhao Y, Ransom JF, Li A et al (2007) Dysregulation of cardiogenesis, cardiac conduction, and cell cycle in mice lacking miRNA-1-2. Cell 129:303–317PubMedGoogle Scholar
- Zheng DY, Frankish A, Baertsch R et al (2007) Pseudogenes in the ENCODE regions: consensus annotation, analysis of transcription, and evolution. Genome Res 17:839–851PubMedGoogle Scholar