Integration of Genome-Wide TF Binding and Gene Expression Data to Characterize Gene Regulatory Networks in Plant Development

Protocol
Part of the Methods in Molecular Biology book series (MIMB, volume 1629)

Abstract

Key transcription factors (TFs) controlling the morphogenesis of flowers and leaves have been identified in the model plant Arabidopsis thaliana. Recent genome-wide approaches based on chromatin immunoprecipitation (ChIP) followed by high-throughput DNA sequencing (ChIP-seq) enable systematic identification of genome-wide TF binding sites (TFBSs) of these regulators. Here, we describe a computational pipeline for analyzing ChIP-seq data to identify TFBSs and to characterize gene regulatory networks (GRNs) with applications to the regulatory studies of flower development. In particular, we provide step-by-step instructions on how to download, analyze, visualize, and integrate genome-wide data in order to construct GRNs for beginners of bioinformatics. The practical guide presented here is ready to apply to other similar ChIP-seq datasets to characterize GRNs of interest.

Key words

Flower development Gene regulatory networks (GRNs) Transcription factors (TFs) DNA binding sites ChIP-seq Bioinformatics 

Notes

Acknowledgments

K.K. wishes to thank the Alexander-von-Humboldt foundation and the BMBF for ongoing support.

References

  1. 1.
    Kaufmann K, Muino JM, Jauregui R, Airoldi CA, Smaczniak C, Krajewski P, Angenent GC (2009) Target genes of the MADS transcription factor SEPALLATA3: integration of developmental and hormonal pathways in the Arabidopsis flower. PLoS Biol 7:e1000090CrossRefPubMedPubMedCentralGoogle Scholar
  2. 2.
    Deng W, Ying H, Helliwell CA, Taylor JM, Peacock WJ, Dennis ES (2011) FLOWERING LOCUS C (FLC) regulates development pathways throughout the life cycle of Arabidopsis. Proc Natl Acad Sci U S A 108:6680–6685CrossRefPubMedPubMedCentralGoogle Scholar
  3. 3.
    Gregis V, Andres F, Sessa A, Guerra RF, Simonini S, Mateos JL, Torti S, Zambelli F, Prazzoli GM, Bjerkan KN et al (2013) Identification of pathways directly regulated by SHORT VEGETATIVE PHASE during vegetative and reproductive development in Arabidopsis. Genome Biol 14:R56CrossRefPubMedPubMedCentralGoogle Scholar
  4. 4.
    Kaufmann K, Wellmer F, Muino JM, Ferrier T, Wuest SE, Kumar V, Serrano-Mislata A, Madueno F, Krajewski P, Meyerowitz EM et al (2010) Orchestration of floral initiation by APETALA1. Science 328:85–89CrossRefPubMedGoogle Scholar
  5. 5.
    O’ Maoileidigh D, Wuest SE, Rae L, Raganelli A, Ryan PT, Kwasniewska K, Das P, Lohan AJ, Loftus B, Graciet E, Wellmer F (2013) Control of reproductive floral organ identity specification in Arabidopsis by the C function regulator AGAMOUS. Plant Cell 25:2482–2503CrossRefGoogle Scholar
  6. 6.
    Pajoro A, Madrigal P, Muino JM, Matus JT, Jin J, Mecchia MA, Debernardi JM, Palatnik JF, Balazadeh S, Arif M et al (2014) Dynamics of chromatin accessibility and gene regulation by MADS-domain transcription factors in flower development. Genome Biol 15:R41CrossRefPubMedPubMedCentralGoogle Scholar
  7. 7.
    Yant L, Mathieu J, Dinh TT, Ott F, Lanz C, Wollmann H, Chen X, Schmid M (2010) Orchestration of the floral transition and floral development in Arabidopsis by the bifunctional transcription factor APETALA2. Plant Cell 22:2156–2170CrossRefPubMedPubMedCentralGoogle Scholar
  8. 8.
    Wuest SE, O'Maoileidigh DS, Rae L, Kwasniewska K, Raganelli A, Hanczaryk K, Lohan AJ, Loftus B, Graciet E, Wellmer F (2012) Molecular basis for the specification of floral organs by APETALA3 and PISTILLATA. Proc Natl Acad Sci U S A 109:13452–13457CrossRefPubMedPubMedCentralGoogle Scholar
  9. 9.
    Mateos JL, Madrigal P, Tsuda K, Rawat V, Richter R, Romera-Branchat M, Fornara F, Schneeberger K, Krajewski P, Coupland G (2015) Combinatorial activities of SHORT VEGETATIVE PHASE and FLOWERING LOCUS C define distinct modes of flowering regulation in Arabidopsis. Genome Biol 16:31CrossRefPubMedPubMedCentralGoogle Scholar
  10. 10.
    Immink RG, Pose D, Ferrario S, Ott F, Kaufmann K, Valentim FL, de Folter S, van der Wal F, van Dijk AD, Schmid M, Angenent GC (2012) Characterization of SOC1's central role in flowering by the identification of its upstream and downstream regulators. Plant Physiol 160:433–449CrossRefPubMedPubMedCentralGoogle Scholar
  11. 11.
    Moyroud E, Minguet EG, Ott F, Yant L, Pose D, Monniaux M, Blanchet S, Bastien O, Thevenon E, Weigel D et al (2011) Prediction of regulatory interactions from genome sequences using a biophysical model for the Arabidopsis LEAFY transcription factor. Plant Cell 23:1293–1306CrossRefPubMedPubMedCentralGoogle Scholar
  12. 12.
    Pose D, Verhage L, Ott F, Yant L, Mathieu J, Angenent GC, Immink RG, Schmid M (2013) Temperature-dependent regulation of flowering by antagonistic FLM variants. Nature 503:414–417CrossRefPubMedGoogle Scholar
  13. 13.
    O'Maoileidigh DS, Graciet E, Wellmer F (2014) Gene networks controlling Arabidopsis thaliana flower development. New Phytol 201:16–30CrossRefPubMedGoogle Scholar
  14. 14.
    Landt SG, Marinov GK, Kundaje A, Kheradpour P, Pauli F, Batzoglou S, Bernstein BE, Bickel P, Brown JB, Cayting P et al (2012) ChIP-seq guidelines and practices of the ENCODE and modENCODE consortia. Genome Res 22:1813–1831CrossRefPubMedPubMedCentralGoogle Scholar
  15. 15.
    Langmead B, Salzberg SL (2012) Fast gapped-read alignment with Bowtie 2. Nat Methods 9:357–359CrossRefPubMedPubMedCentralGoogle Scholar
  16. 16.
    Li H, Durbin R (2009) Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25:1754–1760CrossRefPubMedPubMedCentralGoogle Scholar
  17. 17.
    Langmead B, Trapnell C, Pop M, Salzberg SL (2009) Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol 10:R25CrossRefPubMedPubMedCentralGoogle Scholar
  18. 18.
    Li R, Yu C, Li Y, Lam TW, Yiu SM, Kristiansen K, Wang J (2009) SOAP2: an improved ultrafast tool for short read alignment. Bioinformatics 25:1966–1967CrossRefPubMedGoogle Scholar
  19. 19.
    Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R (2009) Genome Project Data Processing S: the sequence alignment/map format and SAMtools. Bioinformatics 25:2078–2079CrossRefPubMedPubMedCentralGoogle Scholar
  20. 20.
    Zhang Y, Liu T, Meyer CA, Eeckhoute J, Johnson DS, Bernstein BE, Nusbaum C, Myers RM, Brown M, Li W, Liu XS (2008) Model-based analysis of ChIP-Seq (MACS). Genome Biol 9:R137CrossRefPubMedPubMedCentralGoogle Scholar
  21. 21.
    Muino JM, Kaufmann K, van Ham RC, Angenent GC, Krajewski P (2011) ChIP-seq analysis in R (CSAR): an R package for the statistical detection of protein-bound genomic regions. Plant Methods 7:11CrossRefPubMedPubMedCentralGoogle Scholar
  22. 22.
    Kharchenko PV, Tolstorukov MY, Park PJ (2008) Design and analysis of ChIP-seq experiments for DNA-binding proteins. Nat Biotechnol 26:1351–1359CrossRefPubMedPubMedCentralGoogle Scholar
  23. 23.
    Rozowsky J, Euskirchen G, Auerbach RK, Zhang ZD, Gibson T, Bjornson R, Carriero N, Snyder M, Gerstein MB (2009) PeakSeq enables systematic scoring of ChIP-seq experiments relative to controls. Nat Biotechnol 27:66–75CrossRefPubMedPubMedCentralGoogle Scholar
  24. 24.
    Quinlan AR, Hall IM (2010) BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26:841–842CrossRefPubMedPubMedCentralGoogle Scholar
  25. 25.
    Kent WJ, Zweig AS, Barber G, Hinrichs AS, Karolchik D (2010) BigWig and BigBed: enabling browsing of large distributed datasets. Bioinformatics 26:2204–2207CrossRefPubMedPubMedCentralGoogle Scholar
  26. 26.
    John S, Sabo PJ, Thurman RE, Sung MH, Biddie SC, Johnson TA, Hager GL, Stamatoyannopoulos JA (2011) Chromatin accessibility pre-determines glucocorticoid receptor binding patterns. Nat Genet 43:264–268CrossRefPubMedGoogle Scholar
  27. 27.
    Marinov GK, Kundaje A, Park PJ, Wold BJ (2014) Large-scale quality analysis of published ChIP-seq data. G3 (Bethesda) 4:209–223CrossRefGoogle Scholar
  28. 28.
    Robinson JT, Thorvaldsdottir H, Winckler W, Guttman M, Lander ES, Getz G, Mesirov JP (2011) Integrative genomics viewer. Nat Biotechnol 29:24–26CrossRefPubMedPubMedCentralGoogle Scholar
  29. 29.
    Stein LD, Mungall C, Shu S, Caudy M, Mangone M, Day A, Nickerson E, Stajich JE, Harris TW, Arva A, Lewis S (2002) The generic genome browser: a building block for a model organism system database. Genome Res 12:1599–1610CrossRefPubMedPubMedCentralGoogle Scholar
  30. 30.
    Lamesch P, Berardini TZ, Li D, Swarbreck D, Wilks C, Sasidharan R, Muller R, Dreher K, Alexander DL, Garcia-Hernandez M et al (2012) The Arabidopsis Information Resource (TAIR): improved gene annotation and new tools. Nucleic Acids Res 40:D1202–D1210CrossRefPubMedGoogle Scholar
  31. 31.
    Skinner ME, Uzilov AV, Stein LD, Mungall CJ, Holmes IH (2009) JBrowse: a next-generation genome browser. Genome Res 19:1630–1638CrossRefPubMedPubMedCentralGoogle Scholar
  32. 32.
    Buels R, Yao E, Diesh CM, Hayes RD, Munoz-Torres M, Helt G, Goodstein DM, Elsik CG, Lewis SE, Stein L, Holmes IH (2016) JBrowse: a dynamic web platform for genome visualization and analysis. Genome Biol 17:66CrossRefPubMedPubMedCentralGoogle Scholar
  33. 33.
    Krishnakumar V, Hanlon MR, Contrino S, Ferlanti ES, Karamycheva S, Kim M, Rosen BD, Cheng CY, Moreira W, Mock SA et al (2015) Araport: the Arabidopsis information portal. Nucleic Acids Res 43:D1003–D1009CrossRefPubMedGoogle Scholar
  34. 34.
    Zhou X, Maricque B, Xie M, Li D, Sundaram V, Martin EA, Koebbe BC, Nielsen C, Hirst M, Farnham P et al (2011) The human epigenome browser at Washington University. Nat Methods 8:989–990CrossRefPubMedPubMedCentralGoogle Scholar
  35. 35.
    Jin J, Zhang H, Kong L, Gao G, Luo J (2014) PlantTFDB 3.0: a portal for the functional and evolutionary study of plant transcription factors. Nucleic Acids Res 42:D1182–D1187CrossRefPubMedGoogle Scholar
  36. 36.
    Bourque G, Leong B, Vega VB, Chen X, Lee YL, Srinivasan KG, Chew JL, Ruan Y, Wei CL, Ng HH, Liu ET (2008) Evolution of the mammalian transcription factor binding repertoire via transposable elements. Genome Res 18:1752–1762CrossRefPubMedPubMedCentralGoogle Scholar
  37. 37.
    Kunarso G, Chia NY, Jeyakani J, Hwang C, Lu X, Chan YS, Ng HH, Bourque G (2010) Transposable elements have rewired the core regulatory network of human embryonic stem cells. Nat Genet 42:631–634CrossRefPubMedGoogle Scholar
  38. 38.
    Lee TH, Maheshri N (2012) A regulatory role for repeated decoy transcription factor binding sites in target gene expression. Mol Syst Biol 8:576CrossRefPubMedPubMedCentralGoogle Scholar
  39. 39.
    Chung D, Kuan PF, Li B, Sanalkumar R, Liang K, Bresnick EH, Dewey C, Keles S (2011) Discovering transcription factor binding sites in highly repetitive regions of genomes with multi-read analysis of ChIP-Seq data. PLoS Comput Biol 7:e1002111CrossRefPubMedPubMedCentralGoogle Scholar
  40. 40.
    Kidder BL, Hu G, Zhao K (2011) ChIP-Seq: technical considerations for obtaining high-quality data. Nat Immunol 12:918–922CrossRefPubMedPubMedCentralGoogle Scholar
  41. 41.
    Consortium EP (2012) An integrated encyclopedia of DNA elements in the human genome. Nature 489:57–74CrossRefGoogle Scholar
  42. 42.
    Consortium RE, Kundaje A, Meuleman W, Ernst J, Bilenky M, Yen A, Heravi-Moussavi A, Kheradpour P, Zhang Z, Wang J et al (2015) Integrative analysis of 111 reference human epigenomes. Nature 518:317–330CrossRefGoogle Scholar
  43. 43.
    Araya CL, Kawli T, Kundaje A, Jiang L, Wu B, Vafeados D, Terrell R, Weissdepp P, Gevirtzman L, Mace D et al (2014) Regulatory analysis of the C. elegans genome with spatiotemporal resolution. Nature 512:400–405CrossRefPubMedPubMedCentralGoogle Scholar
  44. 44.
    Yue F, Cheng Y, Breschi A, Vierstra J, Wu W, Ryba T, Sandstrom R, Ma Z, Davis C, Pope BD et al (2014) A comparative encyclopedia of DNA elements in the mouse genome. Nature 515:355–364CrossRefPubMedPubMedCentralGoogle Scholar
  45. 45.
    Nicol JW, Helt GA, Blanchard SG Jr, Raja A, Loraine AE (2009) The Integrated Genome Browser: free software for distribution and exploration of genome-scale datasets. Bioinformatics 25:2730–2731CrossRefPubMedPubMedCentralGoogle Scholar
  46. 46.
    Liu T (2014) Use model-based analysis of ChIP-Seq (MACS) to analyze short reads generated by sequencing protein-DNA interactions in embryonic stem cells. Methods Mol Biol 1150:81–95CrossRefPubMedGoogle Scholar
  47. 47.
    Wilbanks EG, Facciotti MT (2010) Evaluation of algorithm performance in ChIP-seq peak detection. PLoS One 5:e11471CrossRefPubMedPubMedCentralGoogle Scholar
  48. 48.
    Li QH, Brown JB, Huang HY, Bickel PJ (2011) Measuring reproducibility of high-throughput experiments. Ann Appl Stat 5:1752–1779CrossRefGoogle Scholar
  49. 49.
    Yu G, Wang LG, He QY (2015) ChIPseeker: an R/Bioconductor package for ChIP peak annotation, comparison and visualization. Bioinformatics 31:2382–2383CrossRefPubMedGoogle Scholar
  50. 50.
    Smoot ME, Ono K, Ruscheinski J, Wang PL, Ideker T (2011) Cytoscape 2.8: new features for data integration and network visualization. Bioinformatics 27:431–432CrossRefPubMedGoogle Scholar

Copyright information

© Springer Science+Business Media LLC 2017

Authors and Affiliations

  1. 1.Institute for Biochemistry and BiologyUniversity of PotsdamPotsdamGermany
  2. 2.Department for Plant Cell and Molecular Biology, Institute for BiologyHumboldt-Universität zu BerlinBerlinGermany

Personalised recommendations