Abstract
In mammals, extensive chromatin reorganization is essential for reprogramming terminally committed gametes to a totipotent state during preimplantation development. However, the global chromatin landscape and its dynamics in this period remain unexplored. Here we report a genome-wide map of accessible chromatin in mouse preimplantation embryos using an improved assay for transposase-accessible chromatin with high throughput sequencing (ATAC-seq) approach with CRISPR/Cas9-assisted mitochondrial DNA depletion. We show that despite extensive parental asymmetry in DNA methylomes, the chromatin accessibility between the parental genomes is globally comparable after major zygotic genome activation (ZGA). Accessible chromatin in early embryos is widely shaped by transposable elements and overlaps extensively with putative cis-regulatory sequences. Unexpectedly, accessible chromatin is also found near the transcription end sites of active genes. By integrating the maps of cis-regulatory elements and single-cell transcriptomes, we construct the regulatory network of early development, which helps to identify the key modulators for lineage specification. Finally, we find that the activities of cis-regulatory elements and their associated open chromatin diminished before major ZGA. Surprisingly, we observed many loci showing non-canonical, large open chromatin domains over the entire transcribed units in minor ZGA, supporting the presence of an unusually permissive chromatin state. Together, these data reveal a unique spatiotemporal chromatin configuration that accompanies early mammalian development.
Similar content being viewed by others
References
Kouzarides, T. Chromatin modifications and their function. Cell 128, 693–705 (2007)
Gross, D. S. & Garrard, W. T. Nuclease hypersensitive sites in chromatin. Annu. Rev. Biochem. 57, 159–197 (1988)
Rivera, C. M. & Ren, B. Mapping human epigenomes. Cell 155, 39–55 (2013)
Burton, A. & Torres-Padilla, M. E. Chromatin dynamics in the regulation of cell fate allocation during early embryogenesis. Nat. Rev. Mol. Cell Biol. 15, 723–735 (2014)
Zernicka-Goetz, M., Morris, S. A. & Bruce, A. W. Making a firm decision: multifaceted regulation of cell fate in the early mouse embryo. Nat. Rev. Genet. 10, 467–477 (2009)
Buenrostro, J. D., Giresi, P. G., Zaba, L. C., Chang, H. Y. & Greenleaf, W. J. Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position. Nat. Methods 10, 1213–1218 (2013)
Buenrostro, J. D. et al. Single-cell chromatin accessibility reveals principles of regulatory variation. Nature 523, 486–490 (2015)
Cusanovich, D. A. et al. Epigenetics. Multiplex single-cell profiling of chromatin accessibility by combinatorial cellular indexing. Science 348, 910–914 (2015)
Lavin, Y. et al. Tissue-resident macrophage enhancer landscapes are shaped by the local microenvironment. Cell 159, 1312–1326 (2014)
Lara-Astiaso, D. et al. Immunogenetics. Chromatin state dynamics during blood formation. Science 345, 943–949 (2014)
Gu, W. et al. Depletion of Abundant Sequences by Hybridization (DASH): using Cas9 to remove unwanted high-abundance species in sequencing libraries and molecular counting applications. Genome Biol. 17, 41 (2016)
Deng, Q., Ramsköld, D., Reinius, B. & Sandberg, R. Single-cell RNA-seq reveals dynamic, random monoallelic gene expression in mammalian cells. Science 343, 193–196 (2014)
Fenouil, R. et al. CpG islands and GC content dictate nucleosome depletion in a transcription-independent manner at mammalian promoters. Genome Res. 22, 2399–2408 (2012)
De Braekeleer, E. et al. ETV6 fusion genes in hematological malignancies: a review. Leuk. Res. 36, 945–961 (2012)
Reik, W. & Walter, J. Genomic imprinting: parental influence on the genome. Nat. Rev. Genet. 2, 21–32 (2001)
Wang, L. et al. Programming and inheritance of parental DNA methylomes in mammals. Cell 157, 979–991 (2014)
Maza, I. et al. Transient acquisition of pluripotency during somatic cell transdifferentiation with iPSC reprogramming factors. Nat. Biotechnol. 33, 769–774 (2015)
Glover-Cutter, K., Kim, S., Espinosa, J. & Bentley, D. L. RNA polymerase II pauses and associates with pre-mRNA processing factors at both ends of genes. Nat. Struct. Mol. Biol. 15, 71–78 (2008)
Randise-Hinchliff, C. E. & Brickner, J. H. A new direction for gene looping. Dev. Cell 23, 919–921 (2012)
Shen, Y. et al. A map of the cis-regulatory sequences in the mouse genome. Nature 488, 116–120 (2012)
Peaston, A. E. et al. Retrotransposons regulate host genes in mouse oocytes and preimplantation embryos. Dev. Cell 7, 597–606 (2004)
Robbez-Masson, L. & Rowe, H. M. Retrotransposons shape species-specific embryonic stem cell gene expression. Retrovirology 12, 45 (2015)
Xie, M. et al. DNA hypomethylation within specific transposable element families associates with tissue-specific enhancer landscape. Nat. Genet. 45, 836–841 (2013)
Buecker, C. & Wysocka, J. Enhancers as information integration hubs in development: lessons from genomics. Trends Genet. 28, 276–284 (2012)
McLean, C. Y. et al. GREAT improves functional interpretation of cis-regulatory regions. Nat. Biotechnol. 28, 495–501 (2010)
Heinz, S. et al. Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Mol. Cell 38, 576–589 (2010)
Boroviak, T., Loos, R., Bertone, P., Smith, A. & Nichols, J. The ability of inner-cell-mass cells to self-renew as embryonic stem cells is acquired following epiblast specification. Nat. Cell Biol. 16, 516–528 (2014)
Soudais, C. et al. Targeted mutagenesis of the transcription factor GATA-4 gene in mouse embryonic stem cells disrupts visceral endoderm differentiation in vitro. Development 121, 3877–3888 (1995)
Morrisey, E. E. et al. GATA6 regulates HNF4 and is required for differentiation of visceral endoderm in the mouse embryo. Genes Dev. 12, 3579–3590 (1998)
Wicklow, E. et al. HIPPO pathway members restrict SOX2 to the inner cell mass where it promotes ICM fates in the mouse blastocyst. PLoS Genet. 10, e1004618 (2014)
Jedrusik, A., Cox, A., Wicher, K. B., Glover, D. M. & Zernicka-Goetz, M. Maternal-zygotic knockout reveals a critical role of Cdx2 in the morula to blastocyst transition. Dev. Biol. 398, 147–152 (2015)
Lefebvre, C. et al. A human B-cell interactome identifies MYB and FOXM1 as master regulators of proliferation in germinal centers. Mol. Syst. Biol. 6, 377 (2010)
Xie, W. et al. Epigenomic analysis of multilineage differentiation of human embryonic stem cells. Cell 153, 1134–1148 (2013)
Labelle-Dumais, C., Jacob-Wagner, M., Paré, J. F., Bélanger, L. & Dufort, D. Nuclear receptor NR5A2 is required for proper primitive streak morphogenesis. Dev. Dyn. 235, 3359–3369 (2006)
Gu, P. et al. Orphan nuclear receptor LRH-1 is required to maintain Oct4 expression at the epiblast stage of embryonic development. Mol. Cell. Biol. 25, 3492–3505 (2005)
Schultz, R. M. The molecular foundations of the maternal to zygotic transition in the preimplantation embryo. Hum. Reprod. Update 8, 323–331 (2002)
Aoki, F., Worrad, D. M. & Schultz, R. M. Regulation of transcriptional activity during the first and second cell cycles in the preimplantation mouse embryo. Dev. Biol. 181, 296–307 (1997)
Hamatani, T., Carter, M. G., Sharov, A. A. & Ko, M. S. Dynamics of global gene expression changes during mouse preimplantation development. Dev. Cell 6, 117–131 (2004)
Macfarlan, T. S. et al. Embryonic stem cell potency fluctuates with endogenous retrovirus activity. Nature 487, 57–63 (2012)
Ishiuchi, T. et al. Early embryonic-like cells are induced by downregulating replication-dependent chromatin assembly. Nat. Struct. Mol. Biol. 22, 662–671 (2015)
Abe, K. et al. The first murine zygotic transcription is promiscuous and uncoupled from splicing and 3′ processing. EMBO J. 34, 1523–1537 (2015)
Picelli, S. et al. Full-length RNA-seq from single cells using Smart-seq2. Nat. Protocols 9, 171–181 (2014)
Peng, X. et al. TELP, a sensitive and versatile library construction method for next-generation sequencing. Nucleic Acids Res. 43, e35 (2015)
Keane, T. M. et al. Mouse genomic variation and its effect on phenotypes and gene regulation. Nature 477, 289–294 (2011)
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013)
Tang, F. et al. Deterministic and stochastic allele specific gene expression in single mouse blastomeres. PLoS One 6, e21208 (2011)
Xie, W. et al. Base-resolution analyses of sequence and parent-of-origin dependent DNA methylation in the mouse genome. Cell 148, 816–831 (2012)
Anders, S., Pyl, P. T. & Huber, W. HTSeq—a Python framework to work with high-throughput sequencing data. Bioinformatics 31, 166–169 (2015)
Dennis, G., Jr et al. DAVID: Database for Annotation, Visualization, and Integrated Discovery. Genome Biol. 4, 3 (2003)
Schug, J. et al. Promoter features related to tissue specificity as measured by Shannon entropy. Genome Biol. 6, R33 (2005)
Acknowledgements
We appreciate B. Ren, D. Leung, H. Yang and the members of the Xie laboratory for comments during preparation of the manuscript. This work is supported by the funding provided by the National Basic Research Program of China (973 program) 2015CB856201 (W.Xie), the National Natural Science Foundation of China 31422031 (W.Xie), 31171381 (J.N.), 81472855 (X.Y.), the National Basic Research Program of China 2012CB966701 (J.N.), the Beijing Natural Science Foundation grant 5152014 (J.N.), Tsinghua University Initiative Scientific Research Program (20131089278, 2014z21046) (X.Y.), the funding from the THU-PKU Center for Life Sciences (W.Xie, X.Y.), and the Youth Thousand Scholar Program of China (W.Xie, X.Y.).
Author information
Authors and Affiliations
Contributions
J.W., B.H. and W.Xie conceived and designed the experiments. J.W. conducted the ATAC-seq experiments. J.W., B.L., W.Xia and Q.W. developed CARM. B.H. performed the mouse embryo experiments with help from Y.X., J.M., W.L. and J.Z. H.C. prepared the RNA-seq libraries. Q.Y. and W.Z. helped with various experiments. J.W., H.C., H.Z., Y.L., X.Y. and W.Xie performed the bioinformatics analysis of the data. B.Z. conducted the ChIP-seq experiment. X.P., F.X., G.T. advised the development or application of ATAC-seq and CARM. Y.L. and Q.W. performed NGS sequencing. Z.C. and J.N. supervised the mouse work. J.W., H.C., B.H. and W.Xie wrote the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Additional information
Reviewer Information Nature thanks T. Magnuson and the other anonymous reviewer(s) for their contribution to the peer review of this work.
Extended data figures and tables
Extended Data Figure 1 Development of CARM, a method to deplete mitochondrial DNA from ATAC-seq library.
a, A snapshot of the UCSC browser view shows enrichment of ATAC-seq (with replicates) in early embryos and mESCs, as well as the enrichment of DNase-seq in mESCs. b, Scatter plots comparing the ATAC-seq enrichment (RPKM, 5-kb-window for the entire genome) between samples using various numbers of mESCs. The Pearson correlation coefficients are shown. c, Bar charts showing the average percentages of monoclonal nuclear DNA reads in ATAC-seq sequencing libraries before and after mitochondrial DNA (mtDNA) depletion for early mouse embryos at each developmental stage. d, Schematic of mtDNA depletion for the ATAC-seq library. mtDNA in ATAC-seq library was digested by Cas9 with sgRNAs targeting mtDNA. These digested mtDNA cannot form sequencing clusters during sequencing due to the lack of adaptors on both ends. e, Bar chart shows the mtDNA depletion efficiency for samples of different stages. The depletion efficiency refers to the percentages of total mtDNA reads that were removed by CARM. f, The DNA gel showing the Cas9 digestion efficiency for mtDNA. A total of sixteen 1 kb mtDNA amplicons covering the entire mitochondrial genome were pooled and subjected to Cas9 digestion together with mtDNA sgRNAs. g, Scatter plots showing the correlation of ATAC-seq enrichment for the 2-cell embryos between pre-depletion and post-depletion. A similar analysis was performed between sequencing and re-sequencing results for the same library as a control.
Extended Data Figure 2 Validation of ATAC-seq and RNA-seq data in early mouse embryos.
a, The Spearman correlation between the replicates of RNA-seq samples and between RNA-seq in this study and Deng et al.12 at stages when available. b, Scatter plots comparing the enrichment (5-kb window for the entire genome) between ATAC-seq replicates for mouse preimplantation embryos. The Pearson correlation of the RPKM values is shown. c, Box plots comparing the levels of gene expression and promoter ATAC-seq enrichment in the 2-cell embryos and ICM for genes with different promoter CG densities. Genes specifically expressed at the 2-cell stage were analysed. d, A snapshot of the UCSC genome browser shows the global view of ATAC-seq, H3K27ac and H3K27me3 ChIP-seq enrichment in the 2-cell embryos. e, The overlaps between ATAC-seq peaks of early embryos at each stage and H3K27ac ChIP-seq peaks in the 2-cell embryos for promoter peaks (left) or distal peaks (right). The overlaps of random promoters or peaks and H3K27ac ChIP-seq peaks in the 2-cell embryos are included as controls.
Extended Data Figure 3 Allelic accessible chromatin in early mouse embryos.
a, The percentages of regions (covered by at least 8 reads with assigned parental origins) defined as allele specific regions (blue) at various P-value cutoffs are plotted. As a control, these reads were randomly assigned to either allele and a similar percentage was calculated (red, average of 5 random permutations). A P value of 0.001 was selected as the cutoff to give the FDR (random to observed; green) less than 2% for each stage. b, Scatter plots showing the numbers of ATAC-seq reads (top) assigned to each allele in SNP-containing regions and RNA-seq reads assigned to each allele for SNP-containing genes (middle and bottom). For RNA-seq analysis, either all genes (middle) or ZGA-only genes (bottom) were used. The purple denotes allele-specific ATAC-seq regions or allele-specific genes. The red, yellow and green denote biallelic ATAC-seq regions or biallelic genes, with green to red showing high to low point densities. The numbers show the percentages of maternal- and paternal-specific ATAC-seq regions or genes. c, Snapshots show the allelic ATAC-seq signals and RNA-seq signals near the imprinted locus Snrpn and near the gene Etv6. Note, not all regions are covered by SNPs. d, Box plots showing the ratio of paternal versus maternal read numbers for all regions in the genome (1-kb window) covered by at least 8 reads with assigned parental origins. The analysis was performed for either autosomes (blue) or the X chromosome (red). e, Bar charts showing the percentages of allele specific genes (combining all stages) that contain allele specific ATAC-seq peaks at the same stages in nearby regions (within 40 kb). A P value based on the hypergeometric distribution is shown.
Extended Data Figure 4 Allelic ATAC-seq enrichment and DNA methylation levels at ATAC-seq peaks.
a, The allelic average ATAC-seq enrichment and DNA methylation levels around biallelic, maternal- and paternal-specific ATAC-seq peaks are shown for both promoter peaks (left) and distal peaks (right). M-specific, maternal specific; P-specific, paternal-specific. b, The allelic average fCAB-seq enrichment, which measures the sum of 5fC and 5mC, and TAB-seq enrichment, which measures 5hmC, are shown for both promoter peaks (left) and distal peaks (right) in the 2-cell embryos. The fCAB-seq and TAB-seq data were obtained from a previous study16. Notably, MethylC-seq measures the sum of 5mC and 5hmC. As the levels of 5hmC on both alleles are much lower compared to 5mC, the presence of 5hmC should not significantly alter the total 5mC levels. c, Box plots showing allelic DNA methylation differences (maternal − paternal) at the known germline imprinted regions47. M, maternal; P, paternal.
Extended Data Figure 5 Analyses of ATAC-seq peaks at transcription end sites.
a, The average levels of ATAC-seq enrichment for genes of different expression levels (ZGA-only) in the 2-cell embryos. b, The Gene Ontology analysis results for the top 1,000 genes ranked by TES ATAC-seq enrichment (average enrichment from TES to 2 kb downstream) in the 2-cell embryos.
Extended Data Figure 6 Widespread presence of repetitive elements in accessible chromatin in early mouse embryos.
a, Bar charts showing the percentages of distal ATAC-seq peaks in early embryos that overlap with enhancers in somatic tissues identified previously20. b, Bar charts showing the percentages of the RNA-seq reads mapped to RepBase (repeat reads) compared to all mappable reads (repeat reads + single-copy gene reads). The error bars denote the standard deviations of two biological replicates of RNA-seq for each stage. c, Bar charts showing the number of peaks with no overlap with repeats, with any regions in the peak overlapping with repeats, or with at least 20% of the peak regions overlapping with repeats. d, The snapshots of promoter (left) or distal (middle and right) ATAC-seq peaks near various types of repeats. e, Percentages of stage-specific genes (ZGA-only) of which the TSSs directly overlap with repeats are shown as bar graphs. A similar analysis result for random TSSs overlapping with repeats is included as control. f, The percentages of different classes of repeats that overlap with TSSs of stage-specific genes are shown. g, Heat maps show the enrichment of repeats in promoter and distal H3K27ac ChIP-seq peaks of early embryos and somatic tissues. h, The densities of various families of repeats near genes are shown.
Extended Data Figure 7 Analysis of stage-specific distal ATAC-seq peaks.
a, Histogram shows the distribution of the Shannon entropy of enrichment levels for ATAC-seq promoter peaks and distal peaks across early embryos and mESCs. b, A heat map showing the ATAC-seq enrichment at stage specific distal ATAC-seq peaks (left). The functional enrichment of genes nearby (by the GREAT analysis25) is also shown (right). c, Box plots showing the expression levels of all chromatin regulator genes (as identified by the GREAT tool) near stage-specific distal ATAC-seq peaks in early development. d, Scatter plots comparing the gene expression changes between ICMs and mESCs (log ratio of FPKM) for each gene in this study and in Tang et al.46. Genes that are lowly expressed in both ICMs and mESCs (FPKM <1) were removed. e, A heat map showing the expression levels of the ICM- and mESC-specific genes (identified in both this study and Tang et al.46) (left). The Gene Ontology terms are also shown (right).
Extended Data Figure 8 Analysis of single-cell RNA-seq and identification of putative promoter–enhancer pairs in early embryos.
a, A heat map shows the differential expression levels between ICM and TE for all genes ranked by their ICM:TE expression ratio (left). Genes that with low expression in both ICM and TE (FPKM <1) were removed. Selected ICM and TE regulators/markers are shown with their differential expression rank indicated in parentheses (middle). Positive and negative ranks indicate the ranking in ICM-specific and TE-specific genes, respectively. Examples of ICM-specific genes are shown on the right, including those exclusively expressed in ICM (right top) or known ICM regulators (right bottom). b, Schematic shows the method to identify target genes of a TF. The correlation of ATAC-seq enrichment of distal peaks and their nearby (within 100 kb) promoter peaks was calculated across all developmental stages. Distal-promoter peak pairs showing high correlation are considered as putative enhancer-promoter pairs (see Supplementary Information). Candidate regulatory TFs were identified by searching motifs in distal peaks, and distal-peak linked promoters were defined as the potential targets of TFs. c, Heat maps showing the ATAC-seq enrichment at stage-specific distal ATAC-seq peaks, their predicted target promoters, and the expression of corresponding genes. Only genes that are not expressed in oocytes (ZGA-only) were included in this analysis. d, Box plots showing the correlation between enrichment levels of distal ATAC-seq peaks and expression of their predicted target genes, compared to random distal peak-promoter peak pairs. A P value based on the t-test is shown.
Extended Data Figure 9 NR5A2 regulates expression of key TFs for ICM and TE at the 8-cell stage.
a, Bar charts showing the expression levels of Nr5a2 in preimplantation embryos using data from this study or Deng et al.12. The error bars denote the standard deviation of FPKM values across two RNA-seq replicates. b, Heat maps show the correlation between the expression of Nr5a2 and all genes across individual single cells from the 8-cell embryos (left) or blastocysts (right) using data from Deng et al.12. Scatter plots comparing expression of Nr5a2 with Pou5f1 or Cdx2 across individual cells from the 8-cell embryos or blastocysts. c, Schematic of the Nr5a2 knockdown experiments. A Venn diagram shows the overlap between Nr5a2-knockdown downregulated genes and 8-cell-specific genes. A P value based on the hypergeometric distribution is shown.
Extended Data Figure 10 Transcription and chromatin states in minor ZGA.
a, A snapshot of the UCSC genome browser shows an example of early 2-cell gene family (Zscan4) that reside in clusters in the genome. b, Box plots show the expression levels of ZGA-only genes that are activated by the 2-cell stage (activated either at zygote, early 2-cell or 2-cell stages). c, Pie chart shows the percentages of early 2-cell genes that reside in clusters or are solitary. d, Bar chart shows the numbers of ATAC-seq peaks identified at each stage. Two programs were used (MACS and HOMER26) to verify the ATAC-seq peak analyses in early 2-cell stages. e, Bar chart shows the genome coverages by ATAC-seq peaks at different stages called either by MACS or HOMER. f, The enrichment (log ratio of observed to random) of repeat subfamily in ATAC-seq promoter and distal peaks in the early 2-cell embryos. g, Bar chart shows the expression levels of MERVL in oocyte and preimplantation embryos. h, The average ATAC-seq enrichment in the early 2-cell embryos is shown for regions near MERVLs with high, median and low levels of expression. The MERVL region is not shown due to the low mappability. i, Bar chart shows the numbers of early 2-cell genes that fall into the broad domains of open chromatin. A similar analysis for a set of random domains (of equal length as each corresponding broad domain) was performed as a control. The P value was calculated based on the hypergeometric distribution. j, Bar chart shows the expression levels of Zfp352 in oocyte and preimplantation embryos.
Supplementary information
Supplementary Data
This file contains Supplementary Table 1 (XLSX 15 kb)
Supplementary Data
This file contains Supplementary Table 2 (XLSX 103 kb)
Supplementary Information
This file contains the uncropped gel in Extended Data figure 1f. (PDF 100 kb)
Rights and permissions
About this article
Cite this article
Wu, J., Huang, B., Chen, H. et al. The landscape of accessible chromatin in mammalian preimplantation embryos. Nature 534, 652–657 (2016). https://doi.org/10.1038/nature18606
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/nature18606
- Springer Nature Limited
This article is cited by
-
Cell-type differential targeting of SETDB1 prevents aberrant CTCF binding, chromatin looping, and cis-regulatory interactions
Nature Communications (2024)
-
Low-input lipidomics reveals lipid metabolism remodelling during early mammalian embryo development
Nature Cell Biology (2024)
-
Emergence of replication timing during early mammalian development
Nature (2024)
-
Lineage regulators TFAP2C and NR5A2 function as bipotency activators in totipotent embryos
Nature Structural & Molecular Biology (2024)
-
Single-cell multi-omics profiling of human preimplantation embryos identifies cytoskeletal defects during embryonic arrest
Nature Cell Biology (2024)