Practical Analysis of Genome Contact Interaction Experiments

  • Mark A. Carty
  • Olivier Elemento
Part of the Methods in Molecular Biology book series (MIMB, volume 1418)


The three dimensional (3D) architecture of chromosomes is not random but instead tightly organized due to chromatin folding and chromatin interactions between genomically distant loci. By bringing genomically distant functional elements such as enhancers and promoters into close proximity, these interactions play a key role in regulating gene expression. Some of these interactions are dynamic, that is, they differ between cell types, conditions and can be induced by specific stimuli or differentiation events. Other interactions are more structural and stable, that is they are constitutionally present across several cell types. Genome contact interactions can occur via recruitment and physical interaction between chromatin-binding proteins and correlate with epigenetic marks such as histone modifications. Absence of a contact can occur due to presence of insulators, that is, chromatin-bound complexes that physically separate genomic loci. Understanding which contacts occur or do not occur in a given cell type is important since it can help explain how genes are regulated and which functional elements are involved in such regulation. The analysis of genome contact interactions has been greatly facilitated by the relatively recent development of chromosome conformation capture (3C). In an even more recent development, 3C was combined with next generation sequencing and led to Hi-C, a technique that in theory queries all possible pairwise interactions both within the same chromosome (intra) and between chromosomes (inter). Hi-C has now been used to study genome contact interactions in several human and mouse cell types as well as in animal models such as Drosophila and yeast. While it is fair to say that Hi-C has revolutionized the study of chromatin interactions, the computational analysis of Hi-C data is extremely challenging due to the presence of biases, artifacts, random polymer ligation and the huge number of potential pairwise interactions. In this chapter, we outline a strategy for analysis of genome contact experiments based on Hi-C using R and Bioconductor.

Key words

Chromatin interactions DNA looping Hi-C Enhancers Promoters Gene regulation 


  1. 1.
    Williamson I et al (2014) Spatial genome organization: contrasting views from chromosome conformation capture and fluorescence in situ hybridization. Genes Dev 28(24):2778–2791CrossRefPubMedPubMedCentralGoogle Scholar
  2. 2.
    Gibcus JH, Dekker J (2013) The hierarchy of the 3D genome. Mol Cell 49(5):773–782CrossRefPubMedPubMedCentralGoogle Scholar
  3. 3.
    van de Werken HJ et al (2012) Robust 4C-seq data analysis to screen for regulatory DNA interactions. Nat Methods 9(10):969–972CrossRefPubMedGoogle Scholar
  4. 4.
    Gavrilov A et al (2009) Chromosome conformation capture (from 3C to 5C) and its ChIP-based modification. Methods Mol Biol 567:171–188CrossRefPubMedGoogle Scholar
  5. 5.
    Belton JM et al (2012) Hi-C: a comprehensive technique to capture the conformation of genomes. Methods 58(3):268–276CrossRefPubMedGoogle Scholar
  6. 6.
    Kalhor R et al (2012) Genome architectures revealed by tethered chromosome conformation capture and population-based modeling. Nat Biotechnol 30(1):90–98CrossRefGoogle Scholar
  7. 7.
    Rao SS et al (2014) A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell 159(7):1665–1680CrossRefPubMedGoogle Scholar
  8. 8.
    Langmead B, Salzberg SL (2012) Fast gapped-read alignment with Bowtie 2. Nat Methods 9(4):357–359CrossRefPubMedPubMedCentralGoogle Scholar
  9. 9.
    Li H, Durbin R (2009) Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25(14):1754–1760CrossRefPubMedPubMedCentralGoogle Scholar
  10. 10.
    Lieberman-Aiden E et al (2009) Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science 326(5950):289–293CrossRefPubMedPubMedCentralGoogle Scholar
  11. 11.
    Robinson MD, Oshlack A (2010) A scaling normalization method for differential expression analysis of RNA-seq data. Genome Biol 11(3):R25CrossRefPubMedPubMedCentralGoogle Scholar
  12. 12.
    Robinson MD, McCarthy DJ, Smyth GK (2010) edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26(1):139–140CrossRefPubMedPubMedCentralGoogle Scholar
  13. 13.
    Nagano T et al (2013) Single-cell Hi-C reveals cell-to-cell variability in chromosome structure. Nature 502(7469):59–64CrossRefPubMedGoogle Scholar

Copyright information

© Springer Science+Business Media New York 2016

Authors and Affiliations

  1. 1.Institute for Computational BiomedicineWeill Cornell Medical CollegeNew YorkUSA
  2. 2.Memorial Sloan Kettering Cancer CenterNew YorkUSA

Personalised recommendations