Assessing Chromatin Accessibility During WBR in Acoels

Gehrke, Andrew R.; Srivastava, Mansi

doi:10.1007/978-1-0716-2172-1_29

Part of the book series: Methods in Molecular Biology (MIMB, volume 2450)

The original version of this chapter was revised. The correction to this chapter is available at https://doi.org/10.1007/978-1-0716-2172-1_37

Abstract

Dynamic gene expression seen during whole-body regeneration is likely controlled by genomic regulatory elements that dictate the spatiotemporal activity of the regeneration transcriptome. Identifying and characterizing these non-coding regulatory sequences are key to understanding how genes are connected into networks to deploy the process of whole-body regeneration. Here, we describe the application of the Assay for Transposase Accessible Chromatin (ATAC-seq) in the acoel Hofstenia miamia to identify regions of open chromatin that represent putative regulatory elements. Notably, when paired with gene knockdown techniques such as RNAi, ATAC-seq can be implemented in a functional genomics approach to validate putative regulatory elements. ATAC-seq requires no species-specific reagents, is amenable to small input cell numbers, and can be completed in a single day, making it an ideal assay to identify dynamic chromatin at high resolution during whole-body regeneration in virtually any species with a quality genome assembly.

1 Introduction

The crucial role of regulatory elements that comprise the non-coding genome has been demonstrated in development, disease, and evolution [1, 2]. Advances in genomics (e.g., the ability to sequence and assemble a myriad of animal genomes) and techniques in molecular biology have now made it possible to explore the role of the regulatory genome in the process of whole-body regeneration. Previous techniques to characterize regulatory elements have relied either on species-specific reagents or on a large number of input cells, hindering the genome-wide identification of putative enhancers in emerging model systems. The Assay for Transposase Accessible Chromatin (ATAC-seq) [3], which is relatively simple at the bench and requires only a small amount of input material, has the potential to revolutionize the fields of functional genomics and evolutionary-developmental biology by providing a method to identify putative enhancers at high resolution in emerging systems of study (Fig. 1).

ATAC-seq works by treating a small number of permeabilized cells or exposed nuclei with a transposase enzyme that preferentially accesses regions of open chromatin, simultaneously cutting DNA and inserting primers for sequencing (“tagmentation”) (Fig. 1). Following sequencing, reads mapped to the genome provide information on open chromatin, nucleosome position, and transcription factor binding. The main benefits of the assay are (1) no species-specific reagents, (2) low input, from 50,000 cells down to a few thousand, (3) reproducibility, in that replicates are highly concordant, and (4) speed, in that one can go from intact tissue to a sequencing-ready library in a single day.

Due to the experimental ease and high resolution of ATAC-seq, a number of methods papers have been published that describe the assay in detail. These include step-by-step instructions for cell lines [4], zebrafish [5, 6], echinoderms [7], Xenopus [8], and plants [9]. Recent advances to the protocol (“Omni-ATAC”) have improved the sensitivity of the assay and made it possible to perform in frozen tissues [10]. In addition to the wet-lab protocols for ATAC-seq, there are a number of methods papers that describe the bioinformatic data analysis portion of ATAC-seq [11, 12, 13]. The majority of the wet and dry lab portions of ATAC-seq are quite similar across organisms and do not deviate much from the original methods paper describing the assay [4]. The critical factor when performing ATAC-seq in a “new” species is attaining the correct number of cells for proper transposition. Keeping this as a focus, here we describe step-by-step instructions for ATAC-seq in the acoel worm Hofstenia miamia. A defining step of this protocol is direct disruption of tissue in lysis buffer (as opposed to traditional dissociation and cell counting), followed immediately by transposition. This rapid processing of samples likely reduces background noise and better captures transcription factor binding as inferred by footprinting. We envision that this protocol will work robustly for all invertebrate animals that are generally easy to lyse or dissociate into single cells.

2 Materials

1. Octylphenoxypolyethoxyethanol (IGEPAL CA-630) (Sigma cat # I8896).
2. Tagment DNA Enzyme 1 (TDE1).
3. 2× Tagment DNA (TD) buffer (Illumina cat # 20034197) (see Note 1).
4. Mini kit for gel extraction and PCR clean up (e.g., Nucleospin, Macherey-Nagel cat # 740609).
5. High-fidelity 2× PCR master mix (New England Biolabs cat # M0541).
6. PCR primers (Table 1).
7. DNA concentration measurement equipment (e.g., Qubit, Thermo Fisher Scientific, cat # Q32851).
8. Automated electrophoresis tool (e.g., Tapestation, Agilent).
9. 40-μm cell strainer (Falcon cat # 352340).
10. Lysis buffer: 10 mM Tris–HCl, pH 7.4, 10 mM NaCl, 3 mM MgCl₂, 0.1% (v/v) IGEPAL CA-630. Prepare fresh, keep on ice.
11. Transposition reaction mix: 25 μL TD buffer, 2.5 μL TDE1 enzyme, 22.5 μL ddH₂O. Prepare fresh, keep on ice.
12. PCR reaction mix: 25 μL high-fidelity 2× PCR mix, 2.5 μL 25 μM universal PCR primer 1, 2.5 μL 25 μM barcoded PCR primer 2, 10 μL H₂O. Make fresh when performing the PCR amplification.
13. Samtools software version 1.10 [14].
14. Bowtie2 software version 2.3.2 [15].
15. Picard software version 2.24.0.
16. NGmerge software version 0.3 [16].
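
Because the transposition and PCR mixes are defined per sample, preparing them for a batch is simple scaling. The following is a minimal shell sketch, assuming awk is available. The function name scale_mix and the 10% pipetting overage are illustrative assumptions, not part of the protocol, and the barcoded primer 2 is left out of the PCR master mix because it differs between samples.

```shell
#!/bin/sh
# Scale the per-sample recipes from the Materials list to n samples.
# Usage: scale_mix <n_samples> [overage_fraction]
# The 10% overage default is an assumption to cover pipetting loss.
scale_mix() {
  awk -v n="$1" -v o="${2:-0.10}" 'BEGIN {
    f = n * (1 + o)   # effective number of reactions to prepare
    print "Transposition mix:"
    printf "  TD buffer (2x): %.1f uL\n", 25.0 * f
    printf "  TDE1 enzyme: %.1f uL\n",     2.5 * f
    printf "  ddH2O: %.1f uL\n",          22.5 * f
    print "PCR master mix (add 2.5 uL barcoded primer 2 to each tube separately):"
    printf "  High-fidelity 2x PCR mix: %.1f uL\n",   25.0 * f
    printf "  Universal primer 1 (25 uM): %.1f uL\n",  2.5 * f
    printf "  H2O: %.1f uL\n",                        10.0 * f
  }'
}

# Example: four samples with the default overage.
scale_mix 4
```

Each sample then receives 50 μL of transposition mix, or 37.5 μL of PCR master mix plus 2.5 μL of its own barcoded primer and 10 μL of eluted library.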

Table 1 Primer sequences. Primer sequences used for PCR, table reproduced from Supplementary Table 1 of [3]

3 Methods

Care should be taken to move as quickly as possible from tissue extraction to dissociation to retain chromatin state at the appropriate timepoint. This protocol is based on the original ATAC-seq protocol [3]. Modifications that improve the assay have been described (“Omni-ATAC”) [10], but use a detergent mixture that may be harmful to more sensitive cells. Thus, we suggest attempting the original protocol first and subsequently exploring the Omni-ATAC modifications to potentially improve the experiment.

3.1 Library Preparation

Determine the optimal tissue size of interest that contains ~50,000–100,000 cells (see Note 2).
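
When calibrating tissue size, a cell count has to be converted into the total number of cells per fragment. The sketch below assumes counting on a standard hemocytometer, where one large square corresponds to 0.1 μL; the protocol does not prescribe a counting method, so this choice, the function name cells_in_fragment, and the example numbers are assumptions for illustration only.

```shell
#!/bin/sh
# Estimate total cells in a dissociated tissue fragment from a hemocytometer count.
# Usage: cells_in_fragment <cells_counted> <large_squares_counted> <dilution_factor> <suspension_volume_uL>
cells_in_fragment() {
  awk -v c="$1" -v s="$2" -v d="$3" -v v="$4" 'BEGIN {
    per_ml = (c / s) * d * 10000   # one large square covers 0.1 uL
    total  = per_ml * v / 1000     # cells in the whole suspension
    if (total < 50000)       verdict = "below target"
    else if (total > 100000) verdict = "above target"
    else                     verdict = "within target"
    printf "%d cells (%s)\n", total, verdict
  }'
}

# Hypothetical count: 120 cells over 4 squares, 1:2 dilution, 100 uL suspension.
cells_in_fragment 120 4 2 100
```

The verdict compares the estimate with the 50,000–100,000 cell window given above, so fragments of different sizes can be ranked quickly.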

1. Extract the desired tissue at the timepoint of interest using a sterile surgical blade (see Notes 3 and 4).
2. Transfer the sample to a 1.5-mL tube filled with ~25 μL of an appropriate solution (e.g., PBS, sea water).
3. Replace the solution with 200 μL of cold lysis buffer.
4. Dissociate the tissue by gently pipetting using a p200 pipette until the fragment is completely in solution (~30 s) (see Note 5).
5. Filter the solution through a 40-μm filter into a new 1.5-mL tube.
6. Centrifuge the solution at 800 rcf for 10 min at 4 °C to pellet the cells/nuclei.
7. Gently remove the supernatant.
8. Resuspend the (invisible) pellet in 50 μL of the transposition reaction mix.
9. Incubate the cells at 37 °C for 30 min under 1000 rpm orbital shaking (e.g., thermomixer).
10. Purify the transposed DNA using the gel extraction and PCR clean up mini kit (see Note 6) according to the manufacturer’s instructions.
11. Elute in 12 μL of elution buffer.
12. Store purified DNA at −20 °C.

3.2 PCR Amplification of Library and Sequencing

1. Add 10 μL of the eluted library to the 40 μL PCR reaction mix in a 0.2-mL PCR tube.
2. Run PCR using the following conditions (see Note 7): 1 cycle of 5 min at 72 °C; 11 cycles of 10 s at 98 °C, 30 s at 63 °C, and 1 min at 72 °C; hold at 4 °C.
3. Purify the amplified DNA using the gel extraction and PCR clean up mini kit according to the manufacturer’s instructions.
4. Elute in 22 μL of elution buffer.
5. Store purified DNA at −20 °C.
6. Determine the concentration of the library using the DNA concentration measurement equipment according to the manufacturer’s instructions. We typically attain ~10–20 ng/μL, but concentration can range from ~1 to 30 ng/μL.
7. Run purified DNA on the automated electrophoresis tool according to the manufacturer’s instructions (see Note 8, Fig. 2).
8. Pool libraries according to Illumina sequencing platform and desired ratio of reads (see Note 9).
9. Sequence using 50 bp paired-end on an Illumina platform at ~15 million mapped reads per Gb of genome (see Note 10).
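
The depth target in step 9 scales with genome size, so the number of reads to request can be estimated before pooling. The following is a rough sketch, assuming awk is available. The function name reads_needed is hypothetical, the default mapping rate of 0.80 is taken from the >80% quality threshold mentioned in Subheading 3.3, and the genome sizes in the examples are illustrative. Reads later removed as duplicates or mitochondrial are not accounted for.

```shell
#!/bin/sh
# Estimate sequencing depth from genome size.
# Usage: reads_needed <genome_size_Mb> [mapping_rate] [million_mapped_reads_per_Gb]
# Defaults: mapping rate 0.80 (assumed), 15 million mapped reads per Gb (step 9).
reads_needed() {
  awk -v g="$1" -v m="${2:-0.80}" -v p="${3:-15}" 'BEGIN {
    mapped = g / 1000 * p   # million mapped reads wanted
    raw    = mapped / m     # million raw reads to request
    printf "%.1f million mapped, %.1f million raw\n", mapped, raw
  }'
}

# Illustrative genome sizes, not measurements from this chapter.
reads_needed 2000        # 2 Gb genome, 80% of reads expected to map
reads_needed 3100 1      # ~3.1 Gb genome, counting mapped reads only
```

For a genome of about 3.1 Gb, the size of the human genome, the rule of thumb gives roughly 46.5 million mapped reads, which is close to the 50 million read ENCODE standard quoted in Note 10.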

3.3 Data Analysis

Raw reads should be backed up in at least two separate locations, ideally one physical and one cloud- or server-based. The following steps are designed to guide the user from raw reads to a processed alignment file, which is the most common input file for most downstream applications. Further example code and details for read processing and other applications (including differential peak analysis, see Note 11) can be found at https://github.com/agehrke6/ATAC_processing_analysis_guide. Note that the example code given below is designed as a starting point for the beginner user, and the manuals for each bioinformatic tool should be consulted for full explanation and detail.

1. Trim raw reads of adapters using NGmerge: NGmerge -a -e 20 -n 4 -1 <sample>.R1.fastq.gz -2 <sample>.R2.fastq.gz -o <sample>_trimmed. This command will output two files: <sample>_trimmed.R1.fastq.gz and <sample>_trimmed.R2.fastq.gz.
2. Index genome of interest with Bowtie2: bowtie2-build <genome>.fasta <build_name>.
3. Map trimmed reads to reference genome with Bowtie2: bowtie2 -x <build_name> -X 2000 -1 <sample>_trimmed.R1.fastq.gz -2 <sample>_trimmed.R2.fastq.gz -p 31 | samtools view -b -S - | samtools sort -o <sample>.bam -. This command will create a sorted alignment file (<sample>.bam). Adjust -p to the number of threads available.
4. Index the .bam file: samtools index <sample>.bam. Quality libraries have a high percentage of mapped reads (>80%).
5. Remove PCR duplicates from alignment (.bam) file using Picard: java -jar picard.jar MarkDuplicates I=<sample>.bam O=<sample>_nodups.bam M=<sample>_dups.txt REMOVE_DUPLICATES=true VALIDATION_STRINGENCY=LENIENT.
6. Remove reads mapping to the mitochondrial genome from the de-duplicated alignment (.bam) file using samtools: samtools view -h <sample>_nodups.bam | grep -v chrM | samtools sort -O bam -o <sample>_nodups_noMt.bam. Note that chrM should be changed to the designation of the mitochondrial genome in the input assembly (e.g., scaffoldX).
7. Remove multi-mapped reads using samtools (see Note 12): samtools view -h -b -q 30 <sample>_nodups_noMt.bam > <sample>_nodups_noMt_nomulti.bam.
8. Retain only properly paired reads using samtools (see Note 12): samtools view -h -b -F 1804 -f 2 <sample>_nodups_noMt_nomulti.bam > <sample>_nodups_noMt_nomulti_filtered.bam.
9. Sort the .bam file: samtools sort <sample>_nodups_noMt_nomulti_filtered.bam -o <sample>_nodups_noMt_nomulti_filtered_sorted.bam.
10. Index the final file using samtools: samtools index <sample>_nodups_noMt_nomulti_filtered_sorted.bam.
11. Use “clean” .bam file for peak calling and downstream analysis (see Note 12) (Fig. 3).
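
The values given to -F and -f in step 8 are sums of bit flags defined by the SAM format, which can be opaque at first sight. The helper below is a small sketch that spells out which flags a value contains. The function name decode_flag is hypothetical and the descriptions paraphrase the SAM specification.

```shell
#!/bin/sh
# List the SAM flag bits contained in a numeric flag value.
# Usage: decode_flag <value>   (prints "<bit><TAB><meaning>" per set bit)
decode_flag() {
  flag="$1"
  while IFS=: read -r bit name; do
    # Bitwise AND is non-zero when this bit is part of the value.
    if [ $(( flag & bit )) -ne 0 ]; then
      printf '%s\t%s\n' "$bit" "$name"
    fi
  done <<'EOF'
1:read paired
2:read mapped in proper pair
4:read unmapped
8:mate unmapped
16:read on reverse strand
32:mate on reverse strand
64:first in pair
128:second in pair
256:not primary alignment
512:read fails platform/vendor quality checks
1024:read is PCR or optical duplicate
2048:supplementary alignment
EOF
}

decode_flag 1804   # bits excluded by -F 1804
decode_flag 2      # bit required by -f 2
```

Read this way, -F 1804 discards any read that is unmapped, has an unmapped mate, is a non-primary alignment, fails quality checks, or is marked as a duplicate, while -f 2 keeps only reads mapped in a proper pair.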

4 Notes

1. Alternatively, a homemade tagment buffer can be made: 20 mM Tris(hydroxymethyl)aminomethane; 10 mM MgCl₂; 20% (v/v) dimethylformamide [17]. Adjust pH to 7.6 with 100% acetic acid before adding dimethylformamide. Store at −20 °C for up to 6 months.
2. Determining the optimal number of cells is the most crucial and variable aspect of the ATAC-seq protocol. As a first step, we recommend excising different tissue sizes, dissociating cells, and then counting to determine the size that most likely contains 50,000–100,000 cells. Once the appropriate general tissue size is identified, subsequent ATAC-seq experiments can be run without having to take the time-sensitive step of preparing and counting cells. It is likely that the immediate lysis of tissue described in this protocol is key to producing high-quality data by capturing chromatin state as quickly as possible. If the “direct lysis” method continues to give substandard libraries, we recommend the standard protocol of attaining a single-cell suspension of live cells, counting and attaining ~50,000 cells, then proceeding with the remainder of the protocol using 50 μL volumes of reagents.
3. Design experiments so multiple samples can be processed at the same time, e.g., a 0 hour post amputation (hpa) sample can be processed alongside a 6 hpa sample that was cut 6 h prior (Fig. 4).
4. We aim to include three biological replicates during each ATAC experiment, eventually choosing the best two samples to sequence based on the Tapestation trace. Due to the speed of the ATAC-seq protocol and depending on the timepoints desired, multiple samples of a regeneration time-course can be completed in a single day. We typically cut the appropriate number of animals at the beginning of the day, and process samples at different timepoints of regeneration (e.g., 0 hpa, 1 hpa, 3 hpa, 6 hpa). Avoid processing more than ~4 samples at a time to ensure the speed of the assay.
5. Due to the delicate nature of Hofstenia miamia tissue, we are able to attain single-cell suspensions in less than 30 s of gentle pipetting, which are simultaneously made accessible to the transposase by performing this step in lysis buffer. Depending on the organism or tissue being used, attaining a single-cell suspension or lysis may be more challenging, and thus, we suggest species-specific protocols to attain single-cell suspensions if necessary.
6. A variety of DNA purification kits are acceptable, including the Qiagen MinElute kit. After adding the buffer in the first step of the cleanup protocol, the solution can be frozen at −20 °C for purification at another time.
7. In order to determine the correct number of PCR cycles to avoid saturation and PCR-induced artifacts, it is advisable to run the first PCR for five cycles and then subsequently perform a qPCR reaction. This protocol is provided in detail in [4]. We found that our libraries nearly always converged on 11 cycles as the optimal number, and thus adopted this number as part of the protocol. When starting ATAC-seq with a new system, the optimal number of PCR cycles should be explored using the qPCR instructions found in [4].
8. The automated electrophoresis tool is used (in this case) to view the sizes of nucleic acids present in an ATAC-seq library. In a properly tagmented library, the transposase enzyme will insert into histone linker regions, creating a “nucleosomal ladder”, a rolling landscape where peaks correspond to varying numbers of nucleosomes. A proper nucleosomal ladder is the best indicator of a successful ATAC-seq experiment, and examples of the most common traces of varying quality are shown in Fig. 2.
9. Work with the sequencing facility or an Illumina representative to pool different barcoded ATAC libraries at the proper molar ratios for the desired model of sequencer. If desired, molar ratios can be adjusted to skew the number of reads toward a particular library.
10. The ENCODE standard for acceptable libraries states that human paired-end ATAC-seq libraries must have 50 million non-duplicate, non-mitochondrial aligned reads (i.e., 25 million fragments). For footprinting, recommended sequencing depths are much higher (>200 million mapped reads). We have found that emerging footprinting software [18] can reliably detect footprints at substantially lower sequencing depth, although this needs to be empirically determined for the species of interest.
11. With a clean alignment file in hand, a number of useful downstream applications can be performed. These include peak-calling with programs such as Genrich or MACS2, which will call sets of peaks for each replicate that can be narrowed down to reproducible peaks. In order to identify dynamic regions of chromatin (e.g., between different stages of regeneration), two common tools are DiffBind [19] and csaw [20]. Running motif-finding algorithms such as HOMER [21] or GimmeMotifs [22] on significant peaks, or assigning regions to the nearest gene using ChIPseeker [23], can be useful ways to define candidate regulators. In addition to scanning regions for motif matches, regions of the genome that are protected by bound TFs will leave “footprints” that can be determined bioinformatically (Fig. 1). We use TOBIAS [18] to make bound/unbound calls for all predicted sites of a particular TF in the genome, as well as create aggregate plots that show overall binding differences between timepoints. See Fig. 3 for a genome browser example of these types of data combined.
12. Depending on downstream analysis, removing multi-mapped reads and retaining only properly paired reads may be unnecessary or detrimental (too few reads retained). For instance, peak-calling with Genrich can utilize both multi-mapped and unpaired alignments in generating peak calls. We recommend checking the documentation of software for the desired downstream application to determine how best to handle these reads.
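
Pooling at molar ratios (Note 9) requires converting the measured mass concentration of each library into molarity. The sketch below uses the standard approximation of 660 g/mol per base pair of double-stranded DNA. The function names library_nM and pool_volume_uL are hypothetical, and the mean fragment length is assumed to be read from the electrophoresis trace; the sequencing facility's own pooling guidelines take precedence.

```shell
#!/bin/sh
# Convert a dsDNA library concentration to molarity.
# Usage: library_nM <conc_ng_per_uL> <mean_fragment_bp>
library_nM() {
  # nM = (ng/uL * 10^6) / (660 g/mol per bp * fragment length in bp)
  awk -v c="$1" -v bp="$2" 'BEGIN { printf "%.1f\n", c * 1000000 / (660 * bp) }'
}

# Volume of a library that contributes a given amount of DNA to the pool.
# Usage: pool_volume_uL <library_nM> <fmol_wanted>   (1 nM equals 1 fmol/uL)
pool_volume_uL() {
  awk -v m="$1" -v f="$2" 'BEGIN { printf "%.2f\n", f / m }'
}

# Illustrative values: 6.6 ng/uL library with a 500 bp mean fragment length.
library_nM 6.6 500
pool_volume_uL 20 100
```

To skew reads toward one library, request proportionally more femtomoles from it, for example 200 fmol from one library and 100 fmol from each of the others for a 2:1 ratio.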

Change history

21 July 2022
In the original version of this book, Figure 3 in Chapter 29 was inadvertently published with some errors. This has been rectified in the updated version of this book.

References

1. Maeso I, Acemel RD, Gómez-Skarmeta JL (2017) Cis-regulatory landscapes in development and evolution. Curr Opin Genet Dev 43:17–22
2. Spitz F, Duboule D (2008) Global control regions and regulatory landscapes in vertebrate development and evolution. Adv Genet 61:175–205
3. Buenrostro JD, Giresi PG, Zaba LC, Chang HY, Greenleaf WJ (2013) Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position. Nat Methods 10:1213–1218
4. Buenrostro JD, Wu B, Chang HY, Greenleaf WJ (2015) ATAC-seq: a method for assaying chromatin accessibility genome-wide. Curr Protoc Mol Biol 109:21.29.1–21.29.9. https://doi.org/10.1002/0471142727.mb2129s109
5. Fernández-Miñán A, Bessa J, Tena JJ, Gómez-Skarmeta JL (2016) Assay for transposase-accessible chromatin and circularized chromosome conformation capture, two methods to explore the regulatory landscapes of genes in zebrafish. Methods Cell Biol 135:413–430. https://doi.org/10.1016/bs.mcb.2016.02.008
6. Doganli C, Sandoval M, Thomas S, Hart D (2017) Assay for transposase-accessible chromatin with high-throughput sequencing (ATAC-Seq) protocol for zebrafish embryos. Methods Mol Biol 1507:59–66. https://doi.org/10.1007/978-1-4939-6518-2_5
7. Shashikant T, Ettensohn CA (2019) Genome-wide analysis of chromatin accessibility using ATAC-seq. Methods Cell Biol 151:219–235
8. Bright AR, Veenstra GJC (2019) Assay for transposase-accessible chromatin-sequencing using Xenopus embryos. Cold Spring Harb Protoc 2019. https://doi.org/10.1101/pdb.prot098327
9. Bajic M, Maher KA, Deal RB (2018) Identification of open chromatin regions in plant genomes using ATAC-Seq. Methods Mol Biol 1675:183–201
10. Corces MR, Trevino AE, Hamilton EG, Greenside PG, Sinnott-Armstrong NA, Vesuna S et al (2017) An improved ATAC-seq protocol reduces background and enables interrogation of frozen tissues. Nat Methods 14:959–962
11. Miskimen KLS, Chan ER, Haines JL (2017) Assay for transposase-accessible chromatin using sequencing (ATAC-seq) data analysis. Curr Protoc Hum Genet 92:1.16.1
12. Yan F, Powell DR, Curtis DJ, Wong NC (2020) From reads to insight: a hitchhiker’s guide to ATAC-seq data analysis. Genome Biol 21:22
13. Wei Z, Zhang W, Fang H, Li Y, Wang X (2018) esATAC: an easy-to-use systematic pipeline for ATAC-seq data analysis. Bioinformatics 34:2664–2665
14. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N et al (2009) The Sequence Alignment/Map format and SAMtools. Bioinformatics 25:2078–2079
15. Langmead B, Salzberg SL (2012) Fast gapped-read alignment with Bowtie 2. Nat Methods 9:357–359
16. Gaspar JM (2018) NGmerge: merging paired-end reads via novel empirically-derived models of sequencing errors. BMC Bioinformatics 19:536
17. Magri MS, Jiménez-Gancedo S, Bertrand S, Madgwick A, Escrivà H, Lemaire P et al (2019) Assaying chromatin accessibility using ATAC-Seq in invertebrate chordate embryos. Front Cell Dev Biol 7:372
18. Bentsen M, Goymann P, Schultheis H, Petrova A (2019) Beyond accessibility: ATAC-seq footprinting unravels kinetics of transcription factor binding during zygotic genome activation. BioRxiv. https://www.biorxiv.org/content/10.1101/869560v1.abstract
19. Stark R, Brown GD (2011) DiffBind: differential binding analysis of ChIP-Seq peak data. Bioconductor. https://bioconductor.org/packages/release/bioc/html/DiffBind.html
20. Lun ATL, Smyth GK (2016) csaw: a Bioconductor package for differential binding analysis of ChIP-seq data using sliding windows. Nucleic Acids Res 44:e45
21. Heinz S, Benner C, Spann N, Bertolino E, Lin YC, Laslo P et al (2010) Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Mol Cell 38:576–589
22. Bruse N, Van Heeringen SJ (2018) GimmeMotifs: an analysis framework for transcription factor motif analysis. BioRxiv. https://www.biorxiv.org/content/10.1101/474403v1.abstract
23. Yu G, Wang L-G, He Q-Y (2015) ChIPseeker: an R/Bioconductor package for ChIP peak annotation, comparison and visualization. Bioinformatics 31:2382–2383

Acknowledgments

We thank Jose Luis Gomez Skarmeta (Centro Andaluz de Biología del Desarrollo, Sevilla, Spain) and Elisa de la Calle-Mustienes (Centro Andaluz de Biología del Desarrollo, Sevilla, Spain) for assistance with initial Hofstenia ATAC-seq experiments, performed at the Marine Biological Laboratories. We thank Kyle McCulloch for critical reading of the manuscript. A.R.G. is supported by the Simeon J. Fortin Charitable Foundation, Bank of America, N.A. Trustee, and the Helen Hay Whitney Foundation. M.S. is supported by the Milton Foundation of Harvard University and the National Science Foundation (award no. 1652104).

Author information

Authors and Affiliations

Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA, USA
Andrew R. Gehrke & Mansi Srivastava

Corresponding authors

Correspondence to Andrew R. Gehrke or Mansi Srivastava.

Editor information

Editors and Affiliations

Department of Biology, University of Fribourg, Fribourg, Fribourg, Switzerland
Simon Blanchoud
Dept Génétique et Evolution, Université de Genève, Genève, Geneva, Switzerland
Brigitte Galliot

Rights and permissions

Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

The images or other third party material in this chapter are included in the chapter's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

About this protocol

Cite this protocol

Gehrke, A.R., Srivastava, M. (2022). Assessing Chromatin Accessibility During WBR in Acoels. In: Blanchoud, S., Galliot, B. (eds) Whole-Body Regeneration. Methods in Molecular Biology, vol 2450. Humana, New York, NY. https://doi.org/10.1007/978-1-0716-2172-1_29

DOI: https://doi.org/10.1007/978-1-0716-2172-1_29
Published: 01 April 2022
Publisher Name: Humana, New York, NY
Print ISBN: 978-1-0716-2171-4
Online ISBN: 978-1-0716-2172-1
eBook Packages: Springer Protocols
