Advertisement

Enhancer RNAs pp 121-138 | Cite as

Computational Approaches for Mining GRO-Seq Data to Identify and Characterize Active Enhancers

Protocol
Part of the Methods in Molecular Biology book series (MIMB, volume 1468)

Abstract

Transcriptional enhancers are DNA regulatory elements that are bound by transcription factors and act to positively regulate the expression of nearby or distally located target genes. Enhancers have many features that have been discovered using genomic analyses. Recent studies have shown that active enhancers recruit RNA polymerase II (Pol II) and are transcribed, producing enhancer RNAs (eRNAs). GRO-seq, a method for identifying the location and orientation of all actively transcribing RNA polymerases across the genome, is a powerful approach for monitoring nascent enhancer transcription. Furthermore, the unique pattern of enhancer transcription can be used to identify enhancers in the absence of any information about the underlying transcription factors. Here, we describe the computational approaches required to identify and analyze active enhancers using GRO-seq data, including data pre-processing, alignment, and transcript calling. In addition, we describe protocols and computational pipelines for mining GRO-seq data to identify active enhancers, as well as known transcription factor binding sites that are transcribed. Furthermore, we discuss approaches for integrating GRO-seq-based enhancer data with other genomic data, including target gene expression and function. Finally, we describe molecular biology assays that can be used to confirm and explore further the function of enhancers that have been identified using genomic assays. Together, these approaches should allow the user to identify and explore the features and biological functions of new cell type-specific enhancers.

Key words

GRO-seq groHMM Enhancer Enhancer RNAs (eRNAs) Enhancer prediction Gene regulation Looping Motif Motif search Promoter Response element Transcription Transcription factor Transcription unit 

Notes

Acknowledgments

The authors thank Minho Chae and Hector L. Franco for helpful comments and suggestions about enhancer identification using GRO-seq, as well as this manuscript. The enhancer-related work in the Kraus lab is supported by grants from the NIH/NIDDK and the Cancer Prevention and Research Institute of Texas (CPRIT).

References

  1. 1.
    Shlyueva D, Stampfel G, Stark A (2014) Transcriptional enhancers: from properties to genome-wide predictions. Nat Rev Genet 15:272–286CrossRefPubMedGoogle Scholar
  2. 2.
    Wamstad JA, Wang X, Demuren OO et al (2014) Distal enhancers: new insights into heart development and disease. Trends Cell Biol 24:294–302CrossRefPubMedGoogle Scholar
  3. 3.
    Ong CT, Corces VG (2011) Enhancer function: new insights into the regulation of tissue-specific gene expression. Nat Rev Genet 12:283–293CrossRefPubMedPubMedCentralGoogle Scholar
  4. 4.
    Pennacchio LA, Bickmore W, Dean A et al (2013) Enhancers: five essential questions. Nat Rev Genet 14:288–295CrossRefPubMedPubMedCentralGoogle Scholar
  5. 5.
    Spitz F, Furlong EE (2012) Transcription factors: from enhancer binding to developmental control. Nat Rev Genet 13:613–626CrossRefPubMedGoogle Scholar
  6. 6.
    Heinz S, Romanoski CE, Benner C et al (2015) The selection and function of cell type-specific enhancers. Nat Rev Mol Cell Biol 16:144–154CrossRefPubMedPubMedCentralGoogle Scholar
  7. 7.
    Hah N, Murakami S, Nagari A et al (2013) Enhancer transcripts mark active estrogen receptor binding sites. Genome Res 23:1210–1223CrossRefPubMedPubMedCentralGoogle Scholar
  8. 8.
    Luo X, Chae M, Krishnakumar R et al (2014) Dynamic reorganization of the AC16 cardiomyocyte transcriptome in response to TNFalpha signaling revealed by integrated genomic analyses. BMC Genomics 15:155CrossRefPubMedPubMedCentralGoogle Scholar
  9. 9.
    Savic D, Roberts BS, Carleton JB et al (2015) Promoter-distal RNA polymerase II binding discriminates active from inactive CCAAT/enhancer-binding protein beta binding sites. Genome Res 25(12):1791–1800CrossRefPubMedPubMedCentralGoogle Scholar
  10. 10.
    Heintzman ND, Hon GC, Hawkins RD et al (2009) Histone modifications at human enhancers reflect global cell-type-specific gene expression. Nature 459:108–112CrossRefPubMedPubMedCentralGoogle Scholar
  11. 11.
    Buenrostro JD, Giresi PG, Zaba LC et al (2013) Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position. Nat Methods 10:1213–1218CrossRefPubMedPubMedCentralGoogle Scholar
  12. 12.
    Flores O, Deniz O, Soler-Lopez M et al (2014) Fuzziness and noise in nucleosomal architecture. Nucleic Acids Res 42:4934–4946CrossRefPubMedPubMedCentralGoogle Scholar
  13. 13.
    Song L, Zhang Z, Grasfeder LL et al (2011) Open chromatin defined by DNaseI and FAIRE identifies regulatory elements that shape cell-type identity. Genome Res 21:1757–1767CrossRefPubMedPubMedCentralGoogle Scholar
  14. 14.
    Carter D, Chakalova L, Osborne CS et al (2002) Long-range chromatin regulatory interactions in vivo. Nat Genet 32:623–626CrossRefPubMedGoogle Scholar
  15. 15.
    Dekker J, Rippe K, Dekker M et al (2002) Capturing chromosome conformation. Science 295:1306–1311CrossRefPubMedGoogle Scholar
  16. 16.
    Fullwood MJ, Liu MH, Pan YF et al (2009) An oestrogen-receptor-alpha-bound human chromatin interactome. Nature 462:58–64CrossRefPubMedPubMedCentralGoogle Scholar
  17. 17.
    Core LJ, Waterfall JJ, Lis JT (2008) Nascent RNA sequencing reveals widespread pausing and divergent initiation at human promoters. Science 322:1845–1848CrossRefPubMedPubMedCentralGoogle Scholar
  18. 18.
    Hah N, Danko CG, Core L et al (2011) A rapid, extensive, and transient transcriptional response to estrogen signaling in breast cancer cells. Cell 145:622–634CrossRefPubMedPubMedCentralGoogle Scholar
  19. 19.
    De Santa F, Barozzi I, Mietton F et al (2010) A large fraction of extragenic RNA pol II transcription sites overlap enhancers. PLoS Biol 8:e1000384CrossRefPubMedPubMedCentralGoogle Scholar
  20. 20.
    Kim TK, Hemberg M, Gray JM et al (2010) Widespread transcription at neuronal activity-regulated enhancers. Nature 465:182–187CrossRefPubMedPubMedCentralGoogle Scholar
  21. 21.
    Wang Q, Carroll JS, Brown M (2005) Spatial and temporal recruitment of androgen receptor and its coactivators involves chromosomal looping and polymerase tracking. Mol Cell 19:631–642CrossRefPubMedGoogle Scholar
  22. 22.
    Heintzman ND, Stuart RK, Hon G et al (2007) Distinct and predictive chromatin signatures of transcriptional promoters and enhancers in the human genome. Nat Genet 39:311–318CrossRefPubMedGoogle Scholar
  23. 23.
    Ling J, Baibakov B, Pi W et al (2005) The HS2 enhancer of the beta-globin locus control region initiates synthesis of non-coding, polyadenylated RNAs independent of a cis-linked globin promoter. J Mol Biol 350:883–896CrossRefPubMedGoogle Scholar
  24. 24.
    Spicuglia S, Kumar S, Yeh JH et al (2002) Promoter activation by enhancer-dependent and -independent loading of activator and coactivator complexes. Mol Cell 10:1479–1487CrossRefPubMedGoogle Scholar
  25. 25.
    Vieira KF, Levings PP, Hill MA et al (2004) Recruitment of transcription complexes to the beta-globin gene locus in vivo and in vitro. J Biol Chem 279:50350–50357CrossRefPubMedPubMedCentralGoogle Scholar
  26. 26.
    Yamashita R, Sathira NP, Kanai A et al (2011) Genome-wide characterization of transcriptional start sites in humans by integrative transcriptome analysis. Genome Res 21:775–789CrossRefPubMedPubMedCentralGoogle Scholar
  27. 27.
    Core LJ, Martins AL, Danko CG et al (2014) Analysis of nascent RNA identifies a unified architecture of initiation regions at mammalian promoters and enhancers. Nat Genet 46:1311–1320CrossRefPubMedPubMedCentralGoogle Scholar
  28. 28.
    Lam MT, Cho H, Lesch HP et al (2013) Rev-Erbs repress macrophage gene expression by inhibiting enhancer-directed transcription. Nature 498:511–515CrossRefPubMedPubMedCentralGoogle Scholar
  29. 29.
    Wang D, Garcia-Bassets I, Benner C et al (2011) Reprogramming transcription by distinct classes of enhancers functionally defined by eRNA. Nature 474:390–394CrossRefPubMedPubMedCentralGoogle Scholar
  30. 30.
    Kwak H, Fuda NJ, Core LJ et al (2013) Precise maps of RNA polymerase reveal how promoters direct initiation and pausing. Science 339:950–953CrossRefPubMedPubMedCentralGoogle Scholar
  31. 31.
    Franco HL, Nagari A, Kraus WL (2015) TNFalpha signaling exposes latent estrogen receptor binding sites to alter the breast cancer cell transcriptome. Mol Cell 58:21–34CrossRefPubMedPubMedCentralGoogle Scholar
  32. 32.
    Chae M, Danko CG, Kraus WL (2015) groHMM: a computational tool for identifying unannotated and cell type-specific transcription units from global run-on sequencing data. BMC Bioinformatics 16:222CrossRefPubMedPubMedCentralGoogle Scholar
  33. 33.
    Danko CG, Hyland SL, Core LJ et al (2015) Identification of active transcriptional regulatory elements from GRO-seq data. Nat Methods 12:433–438CrossRefPubMedPubMedCentralGoogle Scholar
  34. 34.
    Martin M (2012) Cutadapt removes adapter sequences from high-throughput sequencing reads. Bioinformatics Action 17(1):10–12, Key: citeulike:11851772 17:10-12Google Scholar
  35. 35.
    Li H, Durbin R (2009) Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25:1754–1760CrossRefPubMedPubMedCentralGoogle Scholar
  36. 36.
    Quinlan AR, Hall IM (2010) BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26:841–842CrossRefPubMedPubMedCentralGoogle Scholar
  37. 37.
    Li H, Handsaker B, Wysoker A et al (2009) The sequence alignment/map format and SAMtools. Bioinformatics 25:2078–2079CrossRefPubMedPubMedCentralGoogle Scholar
  38. 38.
    Andrews S. (2010) Fastqc. A quality control tool for high throughput sequence data. Available online at: http://www.bioinformatics.babraham.ac.uk/projects/fastqc
  39. 39.
    Bolger AM, Lohse M, Usadel B (2014) Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30:2114–2120CrossRefPubMedPubMedCentralGoogle Scholar
  40. 40.
    Li R, Li Y, Kristiansen K et al (2008) SOAP: short oligonucleotide alignment program. Bioinformatics 24:713–714CrossRefPubMedGoogle Scholar
  41. 41.
    Heinz S, Benner C, Spann N et al (2010) Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Mol Cell 38:576–589CrossRefPubMedPubMedCentralGoogle Scholar
  42. 42.
    Zang C, Schones DE, Zeng C, Cui K, Zhao K, Peng W (2009) A clustering approach for identification of enriched domains from histone modification ChIP-Seq data. Bioinformatics. 25:1952–1958. doi:  10.1093/bioinformatics/btp340
  43. 43.
    Fang B, Everett LJ, Jager J et al (2014) Circadian enhancers coordinate multiple phases of rhythmic gene transcription in vivo. Cell 159:1140–1152CrossRefPubMedPubMedCentralGoogle Scholar
  44. 44.
    Sun M, Gadad SS, Kim DS et al (2015) Discovery, annotation, and functional analysis of long noncoding RNAs controlling cell-cycle gene expression and proliferation in breast cancer cells. Mol Cell 59:698–711CrossRefPubMedPubMedCentralGoogle Scholar
  45. 45.
    Grant CE, Bailey TL, Noble WS (2011) FIMO: scanning for occurrences of a given motif. Bioinformatics 27:1017–1018CrossRefPubMedPubMedCentralGoogle Scholar
  46. 46.
    Aerts S, Thijs G, Coessens B et al (2003) Toucan: deciphering the cis-regulatory logic of coregulated genes. Nucleic Acids Res 31:1753–1764CrossRefPubMedPubMedCentralGoogle Scholar
  47. 47.
    Bailey TL, Boden M, Buske FA et al (2009) MEME SUITE: tools for motif discovery and searching. Nucleic Acids Res 37:W202–W208CrossRefPubMedPubMedCentralGoogle Scholar
  48. 48.
    Mahony S, Benos PV (2007) STAMP: a web tool for exploring DNA-binding motif similarities. Nucleic Acids Res. 35:W253–W258Google Scholar
  49. 49.
    Gupta S, Stamatoyannopoulos JA, Bailey TL, Noble WS (2007) Quantifying similarity between motifs. Genome Biol 8(2):R24CrossRefPubMedPubMedCentralGoogle Scholar
  50. 50.
    Handoko L, Xu H, Li G et al (2011) CTCF-mediated functional chromatin interactome in pluripotent cells. Nat Genet 43:630–638CrossRefPubMedPubMedCentralGoogle Scholar
  51. 51.
    Li G, Ruan X, Auerbach RK et al (2012) Extensive promoter-centered chromatin interactions provide a topological basis for transcription regulation. Cell 148:84–98CrossRefPubMedPubMedCentralGoogle Scholar
  52. 52.
    Zhang B, Kirov S, Snoddy J (2005) WebGestalt: an integrated system for exploring gene sets in various biological contexts. Nucleic Acids Res 33:W741–W748CrossRefPubMedPubMedCentralGoogle Scholar
  53. 53.
    Huang DW, Sherman BT, Tan Q et al (2007) The DAVID Gene Functional Classification Tool: a novel biological module-centric algorithm to functionally analyze large gene lists. Genome Biol 8:R183CrossRefPubMedPubMedCentralGoogle Scholar
  54. 54.
    Mclean CY, Bristor D, Hiller M et al (2010) GREAT improves functional interpretation of cis-regulatory regions. Nat Biotechnol 28:495–501CrossRefPubMedPubMedCentralGoogle Scholar
  55. 55.
    Heldring N, Isaacs GD, Diehl AG et al (2011) Multiple sequence-specific DNA-binding proteins mediate estrogen receptor signaling through a tethering pathway. Mol Endocrinol 25:564–574CrossRefPubMedPubMedCentralGoogle Scholar
  56. 56.
    Meyer MB, Benkusky NA, Onal M et al (2015) Selective regulation of Mmp13 by 1,25(OH)D, PTH, and Osterix through distal enhancers. J Steroid Biochem Mol BiolGoogle Scholar

Copyright information

© Springer Science+Business Media New York 2017

Authors and Affiliations

  1. 1.The Laboratory of Signaling and Gene Expression, Cecil H. and Ida Green Center for Reproductive Biology SciencesUniversity of Texas Southwestern Medical CenterDallasUSA
  2. 2.The Division of Basic Research, Department of Obstetrics and GynecologyUniversity of Texas Southwestern Medical CenterDallasUSA
  3. 3.Program in Genetics, Development and Disease, Graduate School of Biomedical SciencesUniversity of Texas Southwestern Medical CenterDallasUSA

Personalised recommendations