Key words

1 Introduction

5-methylcytosine (5mC) is a chemical modification of vertebrate genomes frequently associated with transcriptional repression [1]. 5mC is generated by the transfer of a methyl group from S-adenosylmethionine (SAM) to the fifth carbon of the cytosine pyrimidine ring, and this reaction is catalyzed by DNA methyltransferases (DNMTs). Vertebrate embryogenesis is characterized by extensive 5mC dynamics [2]. 5mC is deposited de novo by DNMT3a and DNMT3b and is maintained through DNA replication by DNMT1 [3, 4]. 5mC can be passively lost by exclusion of DNMT1 activity from the nucleus during DNA replication, or actively by a family of enzymes known as Ten-eleven translocation (TET) methylcytosine dioxygenases [5, 6]. Many vertebrates, including mammals and zebrafish, encode three TET protein copies (TET1, TET2, TET3). TET enzymes catalyze the iterative oxidation of 5mC to 5-hydroxymethylcytosine (5hmC), 5-formylcytosine (5fC), and 5-carboxylcytosine (5caC) [5,6,7]. In vitro studies have demonstrated that 5fC and 5caC can be removed from the DNA and replaced with unmethylated cytosines by thymine DNA glycosylase (TDG) and base-excision repair machinery (BER) [8, 9]. TET knockout animals have revealed both redundant and specialized roles for each TET protein, with animals deficient for all three TET proteins exhibiting most severe developmental phenotypes [10]. The majority of triple TET knockout (TKO) mice failed to survive past gastrulation, due to hyperactive Nodal signaling caused by reduced expression of Lefty1 and Lefty2 genes [11]. Similarly, the majority of triple tet zebrafish morphants (tet1/tet2 /tet3) displayed embryonic lethality; however, a minor percentage of embryos that survived gastrulation exhibited impaired head structures, shortened axes, microphthalmia, and reduced pigmentation [12]. TALEN-induced zebrafish tet TKOs were characterized by brain abnormalities, smaller eyes, altered pigmentation, trunk curvature and did not survive beyond the larval period [13]. These animal studies indicate that TET proteins are dispensable for pluripotency but are required for organ development and body plan formation.

Whole genome bisulfite sequencing (WGBS ) of mouse, zebrafish, and frog embryos revealed TET-mediated demethylation of developmental enhancers during the vertebrate phylotypic period [12]. Phylotypic period is the most conserved phase of vertebrate development that is associated with hox cluster expression and organogenesis [14]. WGBS analysis of tet1/2/3 zebrafish morphants also unraveled that the loss of Tet proteins reduces extent of demethylation, resulting in hypermethylation of these regions [12]. This suggests that the activation of key developmental pathways could be driven by active 5mC demethylation. These differentially methylated regions (DMRs), which normally become hypomethylated during the phylotypic period, will from here on be referred to as phylo-DMRs.

Here we present a system that can facilitate the studies of TET-mediated 5mC demethylation during zebrafish embryogenesis. We employ CRISPR/Cas9 technology [15] to produce tet1/2/3 KO F0 zebrafish, and reduced representation bisulfite sequencing (RRBS) to measure changes in 5mC levels [16]. Purified Cas9 nuclease and guide RNAs (gRNA) against all three tet genes are co-injected into single-cell zebrafish zygotes, and DNA and RNA are harvested at 24 h post fertilization (hpf). The RNA is used for qPCR analyses, whereas the DNA is used for both amplicon sequencing, to validate knockout efficiency, and for RRBS library construction. This system allows for rapid and cost-efficient generation of data, overcoming specificity issues associated with morpholino-based approaches [17, 18] as well as time, labor, and viability issues associated with generating stable tet1/2/3 KO animals. This protocol can be applied to test any gene or gene combination for their involvement in the regulation of DNA methylation.

RRBS is a DNA methylation analysis technique that can be used for assessment of 5mC levels in CG-dense regions, such as promoters and certain enhancers (i.e., phylo-DMRs), thereby reducing the total sequencing depth required. RRBS is based on restriction enzyme digestion, most commonly MspI, which allows for the tailoring of libraries to specific genomic contexts. Like WGBS , the generation of RRBS libraries entails end repair, adapter ligation, sodium bisulfite conversion, and PCR amplification. Sodium bisulfite promotes the deamination of unmethylated cytosines (C), 5fC, and 5caC to uracil while 5mC and 5hmC do not undergo this transition [19, 20]. After PCR amplification, this results in C, 5fC, and 5caC being replaced with thymines, and 5mC and 5hmC remaining as cytosines. The data generated by RRBS are processed and analyzed using bioinformatics pipelines comparable to those used for WGBS data.

2 Materials

2.1 Zebrafish Husbandry

Zebrafish maintenance and breeding were conducted under full animal use and care guidelines with approval by the Garvan Institute of Medical Research animal care and use committee. The zebrafish used were wild type (Tübingen/AB) and were raised under standard conditions at 28 ° C.

2.2 Equipment

  1. 1.

    PCR thermocycler.

  2. 2.

    Real-time PCR thermocycler such as the LightCycler 480 (Roche).

  3. 3.

    DynaMag 96-side Magnetic rack or equivalent.

  4. 4.

    Qubit fluorometer.

  5. 5.

    Microcentrifuge.

  6. 6.

    NanoDrop spectrophotometer.

  7. 7.

    Picospritzer III—intracellular microinjection dispense system.

  8. 8.

    P-87 stutter instrument or equivalent.

  9. 9.

    Stage microruler.

  10. 10.

    Agilent 4200 TapeStation or equivalent.

  11. 11.

    Illumina HiSeq X sequencing platform and corresponding reagents or equivalent.

2.3 Consumables

  1. 1.

    0.2 ml PCR tubes.

  2. 2.

    1.5 ml nuclease-free tubes.

  3. 3.

    Qubit Assay Tubes.

  4. 4.

    Petri Dishes 90 mm × 15 mm.

  5. 5.

    Petri Dishes 150 mm × 15 mm.

  6. 6.

    Z-MOLDS ZEBRAFISH MI/TRANSPLANT KIT (Coherent Scientific).

  7. 7.

    Scalpels.

  8. 8.

    Thin wall glass capillaries (0.75 mm, od: 1 mm) (World Precision Instruments).

  9. 9.

    Pasteur pipettes.

  10. 10.

    Tapestation D1000 ScreenTape and reagents (Agilent Technologies).

  11. 11.

    Microloader pipette tips (Eppendorf).

2.4 Reagents

  1. 1.

    Nuclease-free water.

  2. 2.

    50 mM MgCl2.

  3. 3.

    10 mM DNTPs.

  4. 4.

    1 mM DNTPs.

  5. 5.

    Taq DNA Polymerase, recombinant (Invitrogen).

  6. 6.

    PCR buffer (10×): 200 mM Tris–HCl, pH 8.4, 500 mM KCl.

  7. 7.

    AMPure XP (Agencourt).

  8. 8.

    Ethanol (molecular biology grade).

  9. 9.

    AMPLISCRIBE™ T7-FLASH™ TRANSCRIPTION KIT (Lucigen).

  10. 10.

    Qubit™ RNA BR Assay Kit.

  11. 11.

    Qubit™ DNA HS Assay Kit.

  12. 12.

    Agarose powder.

  13. 13.

    Phenol Red.

  14. 14.

    EnGene CAS9 NLS, S Pyogenes (NEB).

  15. 15.

    E3 solution: 5 mM NaCl, 0.17 mM KCl, 0.33 mM CaCl2, 0.33 mM MgSO4.

  16. 16.

    DNeasy Blood and Tissue Kit (Qiagen) or equivalent.

  17. 17.

    TRIsure (Bioline).

  18. 18.

    Phenol:Chloroform:Isoamyl Alcohol 25:24:1.

  19. 19.

    Chloroform.

  20. 20.

    SensiFAST™ cDNA Synthesis Kit (Bioline).

  21. 21.

    KAPA SYBR FAST qPCR Master Mix (2×) Universal (KAPA Biosystems).

  22. 22.

    MspI restriction endonuclease [20 U/μl] (NEB).

  23. 23.

    Unmethylated λ phage DNA (Promega).

  24. 24.

    10 mM dATP.

  25. 25.

    Klenow Fragment (3′–5′ exo−) (NEB).

  26. 26.

    NEXTflex bisulfite-seq barcodes and NEXTflex universal primer (Bioo Scientific).

  27. 27.

    EZ DNA Methylation-Gold Kit (Zymo Research).

  28. 28.

    KAPA HiFi HotStart Uracil + ReadyMix (Kapa Biosystems).

  29. 29.

    Library Quantification Kit—Illumina/Universal (KAPA Biosystems).

3 Methods

3.1 CRISPR/Cas9 Guide Oligo Design

Three DNA guide oligos, which will serve as the template for gRNAs, were designed for each tet gene using CRISPRscan (https://www.crisprscan.org/) (Table 1) (see Note 1) [21, 22] (Fig. 1a). The DNA guide oligos were selected based on their protospacer sequence score, appearance in the first half of the gene, appearance in all annotated isoforms, and lack of published single nucleotide polymorphisms (SNPs) (Table 2). The protospacer sequences for each gene were also kept within a 300 bp window, to allow for easier and more accurate knockout calculations, as they can be covered by a single PCR amplicon. Each DNA guide oligo (Table 1) starts with a T7 RNA polymerase transcription start site (5′-taatacgactcactata-3′), followed by the protospacer sequence with the first two nucleotides changed to guanine for efficient transcription (Table 2), and ends with a sequence complementary to the universal DNA oligo (5′-gttttagagctagaa-3′). The universal DNA oligo contains the structural template for each gRNA and will be annealed to DNA guide oligos in Subheading 3.2.

Table 1 DNA Oligos
Table 2 gRNA oligo design for tet1/2/3 KO zebrafish

3.2 Annealing and Amplification of the DNA Guide Oligo and Universal Oligo

In order to create templates for gRNA in vitro transcription in Subheading 3.3, the DNA guide oligos and the universal oligo need to be annealed and amplified.

  1. 1.

    Resuspend the DNA guide oligos (Table 1) and the universal DNA oligo at a concentration of 100μM in nuclease-free water. Mix well by vortexing. From 100μM stock solutions prepare 10μM working solutions using nuclease-free water. Mix well by vortexing.

  2. 2.

    In separate 0.2 ml PCR tubes, add 2μl of each guide oligo (10μM). There should be nine tubes in total.

  3. 3.

    Create a master mix by combining 2μl of universal DNA oligo (10μM), 5μl 10× PCR buffer, 2μl MgCl2 (50 mM), 1μl dNTP (10 mM), 0.2μl Taq DNA Polymerase (5 U/μl), and 37.8μl nuclease-free water per reaction. Mix gently by pipetting.

  4. 4.

    Add 48μl of the master mix to each 0.2 ml PCR tube. Mix gently by pipetting.

  5. 5.

    Anneal and amplify the samples in a thermocycler under the following conditions:

    1. (a)

      3 min at 95 °C

    2. (b)

      30 cycles of 30 s at 95 °C, 30 s at 45 °C, 20 s at 72 °C

    3. (c)

      5 min at 72 °C

    4. (d)

      Hold at 4 °C

  6. 6.

    In the last 30 min of the PCR program, remove an aliquot of AMPure XP beads from the fridge and let it equilibrate at room temperature (see Note 2). Also prepare fresh 75% ethanol.

  7. 7.

    After the program has finished, add 90μl of room temperature AMPure XP beads (1.8:1 ratio of beads to sample) to each of the 0.2 ml PCR tubes. Mix well by pipetting until the solution looks homogenous.

  8. 8.

    Incubate the PCR tubes at room temperature for 10 min.

  9. 9.

    Place the PCR tubes on a magnetic stand and leave at room temperature for 5 min or until the liquid appears clear.

  10. 10.

    Remove and discard the supernatant from the PCR tubes (see Note 3).

  11. 11.

    Carefully add 200μl of 75% ethanol to each PCR tube and then transfer the tubes back and forth between neighboring positions in the magnetic stand, allowing the beads to travel through the ethanol.

  12. 12.

    Remove and discard all of the supernatant from the PCR tubes.

  13. 13.

    Repeat steps 11 and 12.

  14. 14.

    Allow beads to air-dry until no ethanol is visible.

  15. 15.

    Elute the DNA by adding 21μl nuclease-free water to each sample. Mix well by pipetting until the beads are completely resuspended (see Note 4).

  16. 16.

    Place the PCR tubes on a magnetic rack for 5 min at room temperature.

  17. 17.

    Transfer 20μl of the supernatant from each sample to new 0.2 ml PCR tubes being careful not to carry over any beads.

3.3 gRNA In Vitro Transcription

  1. 1.

    In separate 0.2 ml PCR tubes, add 6.3μl of each annealed template oligo from Subheading 3.2.

  2. 2.

    Using the AMPLISCRIBE™ T7-FLASH™ Transcription Kit, create a master mix by combining 2μl 10× reaction buffer, 1.8μl ATP (100 mM), 1.8μl GTP (100 mM), 1.8μl CTP (100 mM), 1.8μl UTP (100 mM), 2μl DTT (100 mM), 0.5μl RNase inhibitor, and 2μl T7 RNA polymerase per reaction. It is important that the reagents are added in that exact order to avoid precipitation of the components. Mix well by pipetting.

  3. 3.

    Add 13.7μl of the master mix to each PCR tube. Mix well by pipetting.

  4. 4.

    Incubate the PCR tubes at 42 °C overnight in a thermocycler with lid heating ON.

  5. 5.

    After the overnight incubation, add 1.5μl DNaseI to each PCR tube and incubate at 37 °C for 25 min.

  6. 6.

    Add 80μl nuclease-free water to each PCR tube. Mix well by pipetting.

  7. 7.

    Transfer the total 100μl of each sample to separate 1.5 ml nuclease-free tubes (see Note 5).

  8. 8.

    Precipitate the gRNA by adding 10μl of Sodium Acetate (3 M, pH 5.5) and 300μl of 100% ethanol to each 1.5 ml tube. Incubate overnight at −20 °C or 1–2 h at −80 °C.

  9. 9.

    After the incubation, pellet the gRNAs by centrifugation at max speed (>12,000 × g) for 30 min at 4 °C.

  10. 10.

    Wash the gRNA pellets by adding 500μl of 75% ethanol (see Note 6).

  11. 11.

    Repeat step 10 for a total of two washes.

  12. 12.

    Air-dry the gRNA pellets until no ethanol is visible.

  13. 13.

    Resuspend the gRNA pellets in 50μl nuclease-free water, pipetting up and down at least 20 times. Incubate the tubes on ice for 15 min, gently flicking the tube every 5 min to further promote resuspension.

  14. 14.

    Measure the RNA concentration using Qubit™ RNA BR Assay Kit. Expected concentrations range from 500 ng/μl to 3μg/μl.

  15. 15.

    Make 5μl aliquots out of each tet gRNA and store them at −80 °C. The gRNAs are now ready for microinjections.

3.4 Microinjection Chambers and Needles

In order to perform microinjections, both microinjection needles and microinjection chambers need to be made. It is recommended to perform this step on the day before microinjections.

  1. 1.

    Prepare 100 ml of hot 2% agarose by dissolving 2 g of agarose in 100 ml of water. Microwave the mixture appropriately until the agarose is dissolved.

  2. 2.

    Pour approximately 20 ml of the hot 2% agarose into two petri dishes (90 mm × 15 mm) and let them cool on the bench at room temperature.

  3. 3.

    Once the agar has nearly solidified, add 20 ml of the 2% agarose to each dish.

  4. 4.

    Immediately place the larval injection mold (zebrafish microinjection and transplantation mold kit) teeth into the liquid (see Note 7).

  5. 5.

    After the agarose hardens, remove the plastic mold and wrap the Petri dish in parafilm. This can be stored for up to 7 days at 4 °C.

  6. 6.

    Prepare needles for microinjection by loading a glass capillary (0.78 mm, od: 1 mm) into the Sutter Instrument. Pull the glass capillary into two equal sized needles using the following machine setting: Heat: 480, Pull: 200, Vel: 100, Time: 225.

  7. 7.

    Repeat step 6 two times until there are six pulled needles.

  8. 8.

    Under a microscope, cut approximately 1 mm off the sharp end of each microinjection needle using a scalpel blade. Store the needles in a large petri dish (150 mm × 15 mm) on top of a bed of Blu Tack adhesive or tape.

3.5 Zebrafish Egg Collection and Preparation of the tet1/2/3 KO Microinjection Mixture

In this step, zebrafish are set up for mating and the tet1/2/3 KO microinjection mixture is prepared. The single microinjection mixture will contain all 9 gRNAs created in Subheading 3.3.

  1. 1.

    The evening before the microinjections, set up two zebrafish mating tanks. Each mating tank is to contain 4 males and 2 females separated by a divider. More tanks can be set up if desired (see Note 8).

  2. 2.

    On the morning of microinjections, prepare tet1/2/3 KO microinjection mixture by combining 100 ng of each tet gRNA with 1μl Cas9 nuclease (NEB) and 1.5μl KCl (1 M) in a single nuclease-free tube. Top the up mixture to 5μl with nuclease-free water and mix by pipetting.

  3. 3.

    Incubate the microinjection mixture at 37 °C for 5 min to allow the complex between Cas9 nuclease and the gRNAs to form.

  4. 4.

    During incubation, take the microinjection chambers out of the fridge to warm up.

  5. 5.

    After incubation, add 1μl of 0.5% phenol red to the injection mixture. Mix by pipetting.

  6. 6.

    After the microinjection mixture is prepared, transfer the zebrafish from one of the mating tanks from step 1 to a new mating tank and remove the dividers.

  7. 7.

    Observe the zebrafish until the females begin to release eggs. After the first eggs have been released, wait 10 min to allow for fertilization to occur.

  8. 8.

    Collect all eggs using a mesh strainer. The zebrafish can then be moved to a new tank and separated again by a divider.

  9. 9.

    Clean the eggs with E3 solution before transferring them to a petri dish (90 mm × 15 mm). If there appears to be at least ~400 produced eggs, proceed to microinjections. If not, repeat steps 59 with the other mating tank (see Note 9).

3.6 Microinjections

  1. 1.

    Turn on gas supply for the Picospritzer III.

  2. 2.

    Turn on the Picospritzer III and set the pressure to 20 PSI and the pulse duration to 20 ms.

  3. 3.

    Using a microloader pipette tip, load one of the cut microinjection needles (Subheading 3.4) with 2μl of the microinjection mixture (Subheading 3.5).

  4. 4.

    Attach the loaded microinjection needle to the needle holder on the Picospritzer III.

  5. 5.

    Add a droplet of mineral oil to the center of a stage microruler and place it under a microscope at approximately 6× magnification.

  6. 6.

    Place the tip of a loaded microinjection needle into the mineral oil and dispense the microinjection mixture by pressing down on the foot pedal. Under the microscope, measure the size of the red bead which forms in the mineral oil on the stage microruler.

  7. 7.

    Adjust the pulse duration on the picospritzer III until a bead with a diameter of 0.1 mm can be produced. A bead with a diameter of 0.1 mm equates to 500 pL of microinjection mixture. Further cutting of the needle will be required if a maximum of 60 ms pulse duration is not enough.

  8. 8.

    Using a plastic pipette, transfer ~200 zebrafish embryos into the wells of the microinjection chamber.

  9. 9.

    Observe the eggs under a microscope and confirm that the eggs are not past the first cell stage.

  10. 10.

    Inject a single egg by smoothly piercing the chorion with the microinjection needle until the tip enters into the cell or yolk (see Note 10). Press down on the foot pedal while the needle is inside the egg. The red injection mixture should be visible inside the egg if the injection was successful (see Note 11).

  11. 11.

    Repeat the injections for every egg in the chamber.

  12. 12.

    Using E3 solution, rinse the eggs off into a new petri dish and incubate them in an incubator at 29 °C.

  13. 13.

    Transfer the remaining ~200 uninjected eggs to the same incubator to use as uninjected controls.

  14. 14.

    In the evening, clean the embryos by removing all of the dead and unfertilized eggs, and by replacing the E3 solution with a plastic pipette.

3.7 Embryo Collection

  1. 1.

    The next morning, 24 h after the eggs were laid, observe both injected and uninjected embryos under a microscope. tet1/2/3 KO embryos should display phenotypes such as microphthalmia and should lack properly formed midbrain and hindbrain boundaries (Fig. 1b) (see Note 12).

  2. 2.

    Collect four batches of 20 embryos that display developmental phenotypes from the injected eggs and four batches of 20 embryos from the uninjected control in eight separate 1.5 ml nuclease-free tubes.

  3. 3.

    Remove all the liquid from the 1.5 ml tubes and either snap-freeze the embryos in liquid nitrogen or immediately proceed to DNA and RNA extraction (see Note 13).

Fig. 1
figure 1

Phenotypes of tet1/2/3 KO F0 zebrafish following CRISPR/Cas9 microinjections. (a) Zebrafish Tet protein structure and guide RNA design. (b) Top left: uninjected 24 hpf zebrafish, bottom left: tet1/2/3 KO injected 24 hpf zebrafish displaying improperly formed midbrain hindbrain boundary (i) and microphthalmia (ii). Top right: uninjected 48 hpf zebrafish; bottom right: tet1/2/3 KO injected 48 hpf zebrafish displaying microphthalmia (iii), altered pigmentation and curved trunk (iv). (c) qPCR quantification of two biological replicates of uninjected wild type (wt) and injected (tet1/2/3 KO) F0 zebrafish. Each gene is tested in triplicate and represented as the mean relative expression to wt. Error bars represent standard deviation

3.8 gDNA and RNA Extraction

In this step, gDNA is extracted from two batches of 20 embryos from both the injected and uninjected samples. These samples will be used as biological replicates in amplicon sequencing and RRBS and are referred to now on as tet1/2/3 KO replicate 1, tet1/2/3 KO replicate 2, uninjected replicate 1, and uninjected replicate 2. RNA is also extracted from the remaining embryo batches for use in qPCR estimation of knockout efficiencies.

  1. 1.

    Extract gDNA from two batches of 20 embryos for both injected and uninjected samples using the Qiagen blood and Tissue DNA extraction kit.

  2. 2.

    Elute the gDNA in 50μl nuclease-free water and measure the concentration using Qubit™ DNA HS Assay Kit. Store the samples at −20 °C for later use in RRBS and amplicon sequencing in Subheadings 3.9 and 3.10.

  3. 3.

    Under a fume hood, add 250μl of TRIsure reagent to the two remaining 1.5 ml tubes for both the uninjected and injected groups.

  4. 4.

    Homogenize the embryos using a pestle or needle.

  5. 5.

    Once sufficiently homogenized, add a further 750μl TRIsure reagent to each tube and incubate for 5 min at room temperature.

  6. 6.

    After the incubation, add 0.2 ml chloroform to each tube and mix by inverting the tube several times. Incubate the sample for 5 min at room temperature.

  7. 7.

    Centrifuge the samples at 12,000 × g for 15 min at 4 °C.

  8. 8.

    The mixture will separate into a lower organic phase, an interphase, and a colorless upper aqueous phase. Transfer the top aqueous layers into new 1.5 ml nuclease-free tubes. It is better to be conservative and take approximately 75% of the aqueous layer then risk contamination with other layers.

  9. 9.

    Add 0.5 ml isopropanol to each 1.5 ml tube in order to precipitate the RNA. Incubate at room temperature for 10 min.

  10. 10.

    Pellet the RNA by centrifugation at 12,000 × g for 10 min at 4 °C.

  11. 11.

    Remove the supernatant with a pipette and add 1 ml 75% ethanol.

  12. 12.

    Mix the sample by inversion and re-pellet the RNA by centrifugation at 12,000 × g for 5 min at 4 °C.

  13. 13.

    Remove as much ethanol as possible and allow the samples to air-dry at room temperature until no liquid is visible.

  14. 14.

    Resuspend the pellet by adding 20μl of nuclease-free water, pipetting up and down at least 20 times gently. Incubate the RNA on ice for 15 min, gently flicking the tube every 5 min to further facilitate resuspension.

  15. 15.

    Measure the concentration of RNA using the Qubit™ RNA BR Assay Kit as well as a Nanodrop to estimate the quality of the sample. A 260/280 ratio of ~2 and a 260/230 ratio > 2 is desired for efficient cDNA synthesis and qPCR.

3.9 Amplicon Sequencing

  1. 1.

    Using 1μl of the tet1/2/3 KO gDNA from Subheading 3.8, set up six 0.2 ml PCR tubes as outlined in Table 3.

  2. 2.

    Make a PCR master mix by combining 5μl 10× PCR Buffer, 2μl MgCl2 (10 mM), 1μl dNTP (10 mM), 0.2μl Taq DNA polymerase (5 U/μl), and 39.8μl nuclease-free water in 1.5 ml microfuge tube per reaction. Mix by pipetting.

  3. 3.

    Add 48μl of master mix to each PCR tube and mix gently by pipetting.

  4. 4.

    Amplify each amplicon under the following conditions in a thermocycler:

    1. (a)

      3 min at 95 °C.

    2. (b)

      30 cycles of 30 s at 95 °C, 30 s at 45 °C, 20 s at 72 °C.

    3. (c)

      5 min at 72 °C.

    4. (d)

      Hold at 4 °C.

  5. 5.

    In the last 30 min of the PCR program, remove an aliquot of AMPure XP beads from the fridge and let it equilibrate at room temperature. Also prepare fresh 75% ethanol.

  6. 6.

    Add 80μl of room temperature AMPure XP beads to the 50μl annealed products and gently mix by pipetting.

  7. 7.

    Incubate the PCR tubes at room temperature for 10 min.

  8. 8.

    Place the PCR tubes on a magnetic stand and keep at room temperature for 5 min or until the liquid appears clear.

  9. 9.

    Remove and discard the supernatant from each PCR tube.

  10. 10.

    Carefully add 200μl of 75% ethanol to each PCR tube and then transfer the tubes back and forth between neighboring positions in the magnetic stand, allowing the beads to travel through the ethanol.

  11. 11.

    Remove and discard all of the supernatant from the PCR tubes.

  12. 12.

    Repeat steps 10 and 11.

  13. 13.

    Allow beads to air-dry until no ethanol is visible.

  14. 14.

    Elute the DNA by adding 17μl nuclease-free water to each tube. Mix well by pipetting until the beads are completely resuspended.

  15. 15.

    Place the PCR tubes on magnetic rack for 5 min at room temperature.

  16. 16.

    Transfer 16μl of the supernatant from each sample to new 0.2 ml PCR tubes.

  17. 17.

    Add 1μl NEXTflex barcode adapter 26 (12.5μM) to the first three tubes containing replicate 1 amplicons, and 1μl NEXTflex barcode adapter 44 (12.5μM) to the remaining three tubes containing replicate 2 amplicons (see Note 14).

  18. 18.

    Add 2μl T4 DNA Ligase Buffer ×10 and 1μl T4 DNA Ligase 400 U/μl to each tube. Mix by pipetting.

  19. 19.

    Incubate the ligation mixtures at 16 °C overnight in a thermocycler.

  20. 20.

    Heat-inactivate the samples at 65 °C for 10 min.

  21. 21.

    Transfer 1μl from each ligation mixture to new 0.2 ml PCR tubes.

  22. 22.

    Create a PCR master mix by combining 1μl of NEXTflex universal primer (10μM), 5μl 10× PCRBuffer, 2μl MgCl2 (10 mM), 1μl DNTP (10 mM), 0.2μl Taq DNA polymerase (5 U/μl), and 39.8μl nuclease-free water in a 1.5 ml nuclease-free tube per reaction. Mix by pipetting.

  23. 23.

    Add 49μl of the PCR master mix to the 0.2 ml PCR tubes. Mix by pipetting.

  24. 24.

    Amplify amplicons with successfully ligated adapters by using the following conditions in a thermocycler:

    1. (a)

      3 min at 95 °C.

    2. (b)

      30 cycles of 30 s at 95 °C, 30 s at 45 °C, 20 s at 72 °C.

    3. (c)

      5 min at 72 °C.

    4. (d)

      Hold at 4 °C

  25. 25.

    Purify the DNA by repeating steps 615.

  26. 26.

    Measure the DNA concentration using the Qubit™ DNA HS Assay Kit. Store the samples at −20 °C.

Table 3 Amplicon sequencing setup

3.10 RRBS : MspI Digestion

RRBS libraries will be made from all four gDNA samples from Subheading 3.8: tet1/2/3 KO replicate 1, tet1/2/3 KO replicate 2, uninjected replicate 1, and uninjected replicate 2.

  1. 1.

    Create a working solution of 5 ng/μl unmethylated lambda DNA.

  2. 2.

    Digest 500 ng of all four gDNA samples from Subheading 3.9 by combining them with 5μl CutSmart buffer, 1μl Msp1 (20 U/μl), and 0.5μl 5 ng/μl unmethylated lambda DNA in separate 0.2 ml PCR tubes. Top each reaction up to 50μl with nuclease-free water (see Note 15).

  3. 3.

    Digest the reactions at 37 °C overnight in a thermocycler with lid heating set to 50 °C.

  4. 4.

    Heat-inactive the reactions at 80 °C for 20 min then hold at 4 °C.

3.11 RRBS : End Repair and A-Tailing

MspI digestion creates fragments with identical 2 bp 5′ overhangs. As a result, only a Klenow fragment is needed for both end repairing and A-tailing in RRBS libraries.

  1. 1.

    To each 50μl of digested gDNA from Subheading 3.10, add 1.5μl dNTP (1 mM), 1.5μl dATP (10 mM), 1.5μl Klenow Fragment (3′→5′ exo, 5 U/μl), 0.5μl CutSmart buffer. Mix by pipetting.

  2. 2.

    Fill in the overhangs and A-tail the fragments by incubating the reactions at 37 °C for 50 min in a thermocycler.

  3. 3.

    Heat-inactivate the reactions at 80 °C for 30 min then hold at 4 °C.

  4. 3.

    At the beginning of the heat-inactivation step, remove an aliquot of AMPure XP beads from the fridge and let it equilibrate at room temperature. Also prepare fresh 75% ethanol.

  5. 4.

    Add 71.5μl of room temperature AMPure XP beads to the 55μl annealed products (1.3:1 ratio of beads to sample) and gently mix by pipetting until the solution is homogenous.

  6. 5.

    Incubate the PCR tubes at room temperature for 10 min.

  7. 6.

    Place the PCR tubes on a magnetic stand and keep at room temperature for 5 min or until the liquid appears clear.

  8. 7.

    Remove and discard the supernatant from each PCR tube.

  9. 8.

    Carefully add 200μl of 75% ethanol to each tube and then transfer the tubes back and forth between neighboring positions in the magnetic stand, allowing the beads to travel through the ethanol.

  10. 9.

    Remove and discard all of the supernatant from the PCR tubes.

  11. 10.

    Repeat steps 32 and 33.

  12. 11.

    Allow beads to air-dry until no ethanol is visible.

  13. 12.

    Elute the DNA by adding 17μl nuclease-free water to each sample. Mix well by pipetting until the beads are completely resuspended.

  14. 13.

    Place the PCR tubes on a magnetic rack and let stand for 5 min at room temperature.

  15. 14.

    Transfer 16μl of the supernatant from each sample to new 0.2 ml PCR tubes.

3.12 RRBS : Adaptor Ligation

  1. 1.

    To the eluted DNA from Subheading 3.11, add 2μl 10× T4 DNA Ligase Buffer, 1μl T4 DNA Ligase 400 U/μl, and 1μl NEXTflex barcode adapter (12.5μM) as outlined in Table 4 (see Note 16).

  2. 2.

    Ligate the adapters to the DNA fragments by incubating the reactions at 16 °C overnight followed by heat inactivation at 65 °C for 10 min, in a thermocycler.

  3. 3.

    Add 30μl of nuclease-free water to each sample bringing them up to a total volume of 50μl each.

  4. 4.

    Add 65μl of room temperature AMPure XP beads to the 50μl annealed products (1.3:1 ratio of beads to sample) and gently mix by pipetting until the solution is homogenous.

  5. 5.

    Incubate the PCR tubes at room temperature for 10 min.

  6. 6.

    Place the PCR tubes on a magnetic stand and keep at room temperature for 5 min or until the liquid appears clear.

  7. 7.

    Remove and discard the supernatant from each PCR tube.

  8. 8.

    Carefully add 200μl of 75% ethanol to each tube and then transfer the tubes back and forth between neighboring positions in the magnetic stand, allowing the beads to travel through the ethanol.

  9. 9.

    Remove and discard all of the supernatant from the PCR tubes.

  10. 10.

    Repeat steps 8 and 9.

  11. 11.

    Allow beads to air-dry until no ethanol is visible.

  12. 12.

    Elute the DNA by adding 21μl nuclease-free water to each sample. Mix well by pipetting until the beads are completely resuspended.

  13. 13.

    Place the PCR tubes on magnetic rack and let stand for 5 min at room temperature.

  14. 14.

    Transfer 20μl of the supernatant from each sample to new 0.2 ml PCR tubes.

Table 4 RRBS Barcode Adapters

3.13 RRBS : Bisulfite Conversion

  1. 1.

    Make the CT conversion reagent according to manufacturer’s instructions in the EZ DNA Methylation-Gold kit.

  2. 2.

    Add 130μl of CT conversion reagent to each sample from Subheading 3.12. Mix gently by pipetting.

  3. 3.

    Split the 150μl of each sample from step 2 into 3 × 50μl in 0.2 ml PCR tubes. This assures the volume will be completely covered by the wells of the thermocycler.

  4. 4.

    Perform the bisulfite conversion, column purification, and desulfonation according to the manufacturer’s instructions in the EZ DNA Methylation-Gold kit.

  5. 5.

    Elute each sample in 24μl nuclease-free water.

3.14 RRBS : PCR Amplification

  1. 1.

    Transfer 24μl of the bisulfite-converted samples from Subheading 3.13 to separate 0.2 ml PCR tubes.

  2. 2.

    Add 1μl of 12.5μM NEXTflex universal primer and 25μl 2× KAPA HiFi HotStart Uracil + ReadyMix to each PCR tube. Mix gently by pipetting.

  3. 3.

    Perform a low amplification PCR under the following conditions in a thermocycler:

    1. (a)

      2 min at 95 °C.

    2. (b)

      13 cycles of 20 s at 98 °C, 15 s at 60 °C, 1 min at 72 °C.

    3. (c)

      5 min at 72 °C.

    4. (d)

      Hold reaction at 4 °C.

  4. 4.

    Purify the samples with AMPure XP beads as in Subheading 3.12.

  5. 5.

    Measure the DNA concentration of each RRBS library using the Qubit™ DNA HS Assay Kit. The expected concentration will be approximately between 10 and 30 ng/μl.

  6. 6.

    Run 1μl of each library on a Tapestation D1000 ScreenTape following manufacturer’s instructions to assess the fragment size distribution of each library (see Note 17).

3.15 RRBS : Library Quantification and Multiplexing

In the next step, the amplifiable DNA in each library will be quantified by qPCR against DNA standards. This will allow the pooling of the samples in appropriate ratios to ensure even sequencing depth for each library.

  1. 1.

    Take 1μl of each RRBS library from Subheading 3.15 and dilute it by adding 99μl of nuclease-free water. Mix well by vortexing.

  2. 2.

    Take 1μl of 100× diluted library from step 1 and dilute it further by adding another 99μl of nuclease-free water. Mix well by vortexing.

  3. 3.

    Quantify the molar DNA concentration of the 10,000× diluted samples using the Library Quantification Kit (KAPA Biosystems) and a qPCR thermocycler according to the manufacturer’s protocol.

  4. 4.

    Using the concentration determined for the 10,000× diluted samples, calculate the concentration of the original undiluted libraries from Subheading 3.15.

  5. 5.

    Using the concentrations calculated in step 4, create 20μl diluted RRBS libraries (10 nM) for each original RRBS library. This can be achieved by transferring 0.2 pmol of each library (Subheading 3.14) to separate nuclease-free tubes and topping them up to 20μl with nuclease-free water.

  6. 6.

    Pool 7.5μl of each 10 nM RRBS library into a single 1.5 ml nuclease-free tube to a final volume of 30μl.

  7. 7.

    Measure the DNA concentration using Qubit™ DNA HS Assay Kit.

  8. 8.

    Spike in an amount of the amplicon sequencing libraries (Subheading 3.9) to the multiplexed libraries (step 6), such that each amplicon sequencing library makes up ~0.03% of the total concentration in ng (see Note 18).

  9. 9.

    Sequence the multiplexed libraries (10 nM) on an Illumina sequencer (HiSeq X or equivalent (see Note 18).

3.16 qPCR Validation of Knockout Efficiencies

To assess the efficiency of tet1/tet2 /tet3 knockout qPCR can be used. In this step, cDNA is created using the SensiFAST™ cDNA Synthesis kit from both biological replicates of injected and uninjected RNA samples from Subheading 3.8. The cDNA is then used in a standard qPCR experiment to further estimate knockout levels for each tet gene (Fig. 1c).

  1. 1.

    Using the SensiFAST™ cDNA Synthesis Kit, create cDNA from each RNA sample from Subheading 3.8 by combining 10μl nuclease-free water, 5μl RNA, 4μl 5× TransAmp buffer, and 1μl reverse transcriptase in separate 0.2 ml PCR tubes. Mix gently by pipetting.

  2. 2.

    Perform the reverse transcription reaction in a thermocycler.

    1. (a)

      10 min at 25 °C (annealing).

    2. (b)

      15 min at 42 °C (reverse transcription).

    3. (c)

      5 min at 85 °C (inactivation).

    4. (d)

      Hold at 4 °C.

  3. 3.

    Dilute each cDNA sample by adding 80μl of nuclease-free water. Mix by pipetting.

  4. 4.

    Create five qPCR master mixes, one for each tet gene and the two control genes, as shown in Table 5 (see Note 19). Mix well by pipetting. Each mix is enough for 14 samples.

  5. 5.

    Add 1μl of each diluted cDNA to 15 wells in a 384 PCR plate, using a different row for each sample, as follows:

    1. (a)

      tet1/2/3 KO replicate 1 to row A (A1-A15).

    2. (b)

      tet1/2/3 KO replicate 2 to row B (B1-B15).

    3. (c)

      Uninjected control replicate 1 to row C (C1–C15).

    4. (d)

      Uninjected control replicate 2 to row D (D1–D15).

  6. 6.

    Add 9μl of each master mix to three columns from A–D as follows:

    1. (a)

      Master mix 1 to columns 1–3 (A1–A3, B1–B3, C1–C3, D1–D3).

    2. (b)

      Master mix 2 to columns 4–6 (A4–A6, B4–B6, C4–C6, D4–D6).

    3. (c)

      Master mix 3 to columns 7–9 (A7–A9, B7–B9, C7–C9, D7–D9).

    4. (d)

      Master mix 4 to columns 10–12 (A10–A12, B10–B12, C10–C12, D10–D12).

    5. (e)

      Master mix 5 to columns 13–15 (A13–A15, B13–B15, C13–C15, D13–D15).

  7. 7.

    Seal the plate with optically clear sealing tape.

  8. 8.

    Run the plate in a qPCR thermocycler compatible with 384 PCR plates, such as the LightCycler 480 (Roche), under the following conditions:

    1. (a)

      5 min at 95 °C.

    2. (b)

      35 cycles of 30 s at 95 °C, 45 s at 60 °C, plate read.

  9. 9.

    Export the CT values from the run into excel sheet in order to calculate the tet1/2/3 KO efficiencies. An example of some successful results can be seen in Fig. 1c. The values for each tet gene triplicate are calculated individually against the average control CTs, before the average of all these values are plotted with standard deviation. An example of these calculation for tet 1 replicate 1 in tetTKO replicate 1 and WT replicate 1 are as follows (pay close attention to when replicate CT values are used compared to average values, as well as when WT values are used):

    1. (a)

      WT: (2(WT replicate 1, average tet1 CT − WT replicate 1, average elfa CT))/(2(WT replicate 1, tet1 replicate 1 CT − WT replicate 1, elfa CT)) and (2(WT replicate 1, average tet1 CT − WT replicate 1, average bactin CT)/(2(WT replicate 1, tet1 replicate 1 CT − WT replicate 1, average bactin CT)).

    2. (b)

      tetTKO: (2(WT replicate 1, average tet1 CT − WT replicate 1, average elfa CT))/(2(tetTKO replicate 1, tet1 replicate 1 CT − tetTKO replicate 1, average elfa CT)) and (2(WT replicate 1, average tet1 CT − WT replicate 1, bactin CT))/(2(tetTKO replicate 1, tet1 replicate 1 CT− tetTKO replicate 1, average bactin CT)).

Table 5 qPCR master mixes

3.17 Software

Download the following software:

  1. 1.

    FastQC http://www.bioinformatics.babraham.ac.uk/projects/fastqc/

  2. 2.

    Trim Galore https://www.bioinformatics.babraham.ac.uk/projects/trim_galore/

  3. 3.

    SAMtools http://samtools.sourceforge.net/ [23].

  4. 4.

    Sambamba https://lomereiter.github.io/sambamba/ [24].

  5. 5.

    WALT https://github.com/smithlabcode/walt [25].

  6. 6.

    MethylDackel https://github.com/dpryan79/MethylDackel

  7. 7.

    BEDtools https://bedtools.readthedocs.io/en/latest/ [26].

  8. 8.

    bedGraphToBigWig, fetchChromSizes, and liftOver http://hgdownload.cse.ucsc.edu/admin/exe

  9. 9.

    IGV http://www.broadinstitute.org/igv/ [27].

  10. 10.

    CRISPResso https://github.com/lucapinello/CRISPResso [28].

3.18 File Names

  1. 1.

    In the following pipeline, paired FASTQ files generated from RRBS and amplicon sequencing should be named as follows (see Note 20):

    1. (a)

      tet1/2/3 KO replicate 1 Read 1: tetTKO_1_R1.fastq.gz.

    2. (b)

      tet1/2/3 KO replicate 1 Read 2: tetTKO_1_R2.fastq.gz.

    3. (c)

      tet1/2/3 KO replicate 2 Read 1: tetTKO_2_R1.fastq.gz.

    4. (d)

      tet1/2/3 KO replicate 2 Read 2: tetTKO_2_R2.fastq.gz.

    5. (e)

      Uninjected replicate 1 Read 1: WT_1_R1.fastq.gz.

    6. (f)

      Uninjected replicate 1 Read 2: WT_1_R2.fastq.gz.

    7. (g)

      Uninjected replicate 2 Read 1: WT_2_R1.fastq.gz.

    8. (h)

      Uninjected replicate 2 Read 2: WT_2_R2.fastq.gz.

    9. (i)

      Amplicon sequencing library for tet 1/2/3 KO replicate 1 Read 1: Amp_1_R1.fastq.gz.

    10. (j)

      Amplicon sequencing library for tet 1/2/3 KO replicate 1 Read 2: Amp_1_R2.fastq.gz.

    11. (k)

      Amplicon sequencing library for tet 1/2/3 KO replicate 2 Read 1: Amp_2_R1.fastq.gz

    12. (l)

      Amplicon sequencing library for tet 1/2/3 KO replicate 2 Read 2: Amp_2_R2.fastq.gz.

  2. 2.

    Files can be renamed using the (mv) command as follows:

    $ mv OriginalName.ext NewName.ext

3.19 Quality Control

  1. 1.

    If the FASTQ files were delivered without accompanying FastQC files (see Note 21), a quality report can be created for all FASTQ files as follows:

    $ fastqc *.fastq.gz

  2. 2.

    There are a number of hallmark features of a successful RRBS experiment to assess before performing quality trimming, such as: (i) a high level of duplication and (ii) read 1 sequences beginning with CGG (see Note 20). An example of these features can be seen in Fig. 2.

Fig. 2
figure 2

Quality assessment of RRBS sequence data. (a) Sequence content across all bases from the FASTQ file for read 1 showing enrichment for CGG or TGG trinucleotides at the beginning of read 1. This bias is due to MspI digestion, which creates fragments beginning with CGG nucleotides (methylated state) or TGG (unmethylated state). (b) Sequence duplication level of the FASTQ file for Read 1. This is another hallmark of successful MspI digestion as each DNA molecule should produce identical fragments

3.20 Adapter Trimming

  1. 1.

    Quality-trim and remove any adapter sequences from the paired tetTKO_1 FASTQ files using Trim Galore in RRBS mode (see Note 22).

    $ trim_galore --dont_gzip --illumina --rrbs --paired tetTKO_1_R1.fastq.gz tetTKO_1_R2.fastq.gz

    Repeat for the rest of the paired FASTQ files. The trimmed FASTQ files will be produced with the suffixes “_val_1.fq” and “_val_2.fq” for Read 1 and Read 2, respectively. For example: tetTKO_1_R1_val_1.fq and tetTKO_1_R2_val_2.fq will be produced from the above command.

3.21 Amplicon Sequencing Analysis

  1. 1.

    Use CRISPResso to determine the frequencies of non-homologous end joining (NHEJ) events for each tet gene in the amplicon sequencing library for tet1/2/3 KO replicate 1 (Amp_1_R1_val_1.fq; Amp_1_R2_val_2.fq). CRISPResso will generate a number of informative graphs and reports (see Note 23).

    $ CRISPResso -r1 Amp_1_R1_val_1.fq -r2 Amp_1_R2_val_2.fq -a CATTCCCACACTCTGTCCGTATGTTACTTGTGACTCACTCCTCATTGAGGGCACAGCGTCGGCTAGTGGGCGAGCCATATTTGCAGAGCGTCTGCGTGAGCTCCTGGTACAGCCTGTCAGCCACGCTGCGTGGGATCCCCTCCCAGGCCAAGATGAGGATGACTATTACTGCGTTCTGACAGCAGTGACCAGCACGCTGCCGCACGAGACACAGCAACTTCTCCTGCTCACTGCCTCGACGTATGA -g GGGCACAGCGTCGGCTAGT,GGAGGGGATCCCACGCAGCG,AGAAGTTGCTGTGTCTCGTG --ignore_substitutions -w 10 -o tet1_rep1 $ CRISPResso -r1 Amp_1_R1_val_1.fq -r2 Amp_1_R2_val_2.fq -a AGACTTGTTGTCTTGATCATTGTTTCGATTTTGAGAAGGGGAAGTACATGCCTGGGGAATCAACTTGCTCTCTACTGCTTGTGGCTCATGAGTCTGTTGCTGAAGCAAGGAAAAGTCTCTATTGCCATGGAGGTTTTCATCAGTTGTTTTTTTCTGTGGCACTTTAATCTGTGGGTTATACTGACCTGTCTGAGAGGTTTGTGAGGGGGAACCTGCTGTCACCGGGGGCAATGTATAAGATCCATGACTTAAATTCTGATTGGATTTGGAGAACATGT -g AGAAGGGGAAGTACATGCCT,AGGAAAAGTCTCTATTGCCATGG,GAGGGGGAACCTGCTGTCAC --ignore_substitutions -w 10 -o tet2_rep1 $ CRISPResso -r1 Amp_1_R1_val_1.fq -r2 Amp_1_R2_val_2.fq -a TTTTTGGCCTCCTCACCTCCATTGGACATTTCCTGATTGAATCTTTCATACAGCTGAGCTGTAGGGGTCGGTGAAGGTTGGTGGTTGGTGTAAGGTGGCAGTGAGCCAACAGAAAAGGCCACCAGAGTTTGTGCATCCTCCATATCCACATCATTTACTGTTGTGCTGGCATTTTGACCACTCCAAACACCTTGTTGAGATTCAGGATTTGACAACTTCCCAGTATGCTCAAGGACCCAGCCTGGTGTCTGGGAAATACTTCCAGCCCCCTCATGAAAGTT -g TGAGCTGTAGGGGTCGGTGA,AGTAAATGATGTGGATATGG,GTATGCTCAAGGACCCAGCC --ignore_substitutions -w 10 -o tet3_rep1

  2. 2.

    Repeat for the amplicon sequencing library for tet1/2/3 KO replicate 2 (Amp_2_R1_val_1.fq; Amp_2_R2_val_1.fq). An example of CRISPResso outputs from a successful tet1/2/3 KO experiment can be seen in Fig. 3.

  3. 3.

    Amplicon sequencing results corresponding to RRBS data are outlined in Table 6.

Fig. 3
figure 3

Analysis of genome editing efficiency of tet1 locus from the tet1/2/3 KO amplicon sequencing. (a) Locations of insertions and deletions surrounding the targeted tet1 locus. (b) Quantification of unmodified and non-homologous end joining (NHEJ) events in the tet1 amplicon library. (c) Alleles surrounding the cut site for one of the tet1 gRNAs

Table 6 Amplicon sequencing results corresponding to RRBS libraries

3.22 Read Mapping

  1. 1.

    Download the zebrafish reference genome, danRer10, from UCSC.

    $ wget http://hgdownload.soe.ucsc.edu/goldenPath/danRer10/bigZips/danRer10.fa.gz

    $ gunzip danRer10.fa.gz

  2. 2.

    Download the lambda reference genome and annotate it as chrL (see Note 24).

    $ wget -O lambda.txt.gz ′ https://www.encodeproject.org/files/lambda.fa/@@download/lambda.fa.fasta.gz

    $ gunzip lambda.txt.gz

    $ fold -w 50 lambda.txt | sed ′$d′ > lambda.fa

  3. 3.

    Concatenate the lambda genome to the zebrafish genome.

    $ cat danRer10.fa lambda.fa > danRer10_lambda.fa

  4. 4.

    Index the combined genome using WALT.

    $ makedb -c danRer10_lambda.fa -o danRer10_lambda.dbindex

  5. 5.

    Map the trimmed paired FASTQ files for the tet1/2/3 KO replicate 1 library (tetTKO_1_R1_val_1.fq; tetTKO_1_R2_val_2.fq) to the reference genome using WALT. To speed up the process of mapping, and many other steps in this pipeline, multiple threads can be used for many such commands (see Note 25). For example, in the below command and the commands in step 7, 12 threads (-t) are used.

    $ walt -i danRer10_lambda.dbindex -1 tetTKO_1_R1_val_1.fq -2 tetTKO_1_R2_val_2.fq -o tetTKO_1.sam -t 12

  6. 6.

    Repeat with the trimmed paired FASTQ files for tetTKO_2, WT_1 and WT_2, remembering to also change the output name (-o) accordingly.

  7. 7.

    Convert the tetTKO_1.sam SAM file to a sorted BAM file (see Note 26).

    $ sambamba view -t 12 -S -f bam -h tetTKO_1.sam | sambamba sort -t 12 -o tetTKO_1.bam /dev/stdin

  8. 8.

    Repeat with the tetTKO_2.bam, WT_1.bam, and WT_2.bam files, remembering to also change the output name (-o) accordingly.

3.23 Methylation Calling

  1. 1.

    Check the methylation bias for the tetTKO_1.bam file (see Note 27).

    $ MethylDackel mbias danRer10_lambda.fa tetTKO_1.bam tetTKO_1

  2. 2.

    Repeat with the tetTKO_2.bam, WT_1.bam, and WT_2.bam files, remembering to also change the output name prefix accordingly.

  3. 3.

    Using MethylDackel, call methylation levels for CG dinucleotides from the tetTKO_1.bam file, correcting for genome variation and methylation bias (see Notes 27 and 28). This command will create a bedGraph file (see Note 29).

    $ MethylDackel extract --mergeContext --maxVariantFrac 0.25 --minOppositeDepth 2 --OB 10,140,10,140 --OT 10,140,10,140 danRer10_lambda.fa tetTKO_1.bam

  4. 4.

    Repeat for the tetTKO_2.bam, WT_1.bam, and WT_2.bam files.

  5. 5.

    Methylation levels can also be called for all C nucleotide contexts if desired (see Note 30).

  6. 6.

    Remove the header from each CG bedGraph file (see Note 31).

    $ sed -i ′1d′ *bedGraph

  7. 7.

    Estimate the CG methylation non-conversion percentage by calculating the methylation values assigned to the unmethylated lambda control from the tetTKO_1_CpG.bedGraph (see Note 24).

    $ grep chrL tetTKO_1_CpG.bedGraph | awk ′{s+=$5}{x+=$6}END{print s/(x+s)*100}′

  8. 8.

    Repeat for tetTKO_2_CpG.bedGraph, WT_1_CpG.bedGraph, and WT_2_CpG.bedGraph files. The output value from the above command is given as a percentage.

  9. 9.

    Non-conversion rates can also be calculated for CHG and CHH nucleotides from their corresponding bedGraphs if desired.

3.24 Data Visualization

  1. 1.

    Download supplementary Tables 6–20 from https://www.nature.com/articles/ng.3522#Sec29 [12]. Copy and merge supplementary Tables 8 and 10, without the headers, into a new excel document on the same sheet. Save the new document as a tab-delimited file with the name phyloDMRs_danRer7.bed (see Note 32). This BED file now contains the coordinates of all the regions that become developmentally hypomethylated after gastrulation in a Tet-dependent manner in zebrafish.

  2. 2.

    Transfer this file into the directory of the CpG bedGraphs.

  3. 3.

    Lift over the coordinates of the phyloDMRs from danRer7 to danRer10 using liftOver (see Note 33).

    $ wget https://hgdownload.soe.ucsc.edu/gbdb/danRer7/liftOver/danRer7ToDanRer10.over.chain.gz

    $ liftOver phyloDMRs_danRer7.bed danRer7To DanRer10.over.chain.gz phyloDMRs.bed unmapped.bed

  4. 4.

    Sort the phyloDMRs.bed BED file.

    $ sort -k1,1 -k2,2n -o phyloDMRs.bed phyloDMRs.bed

  5. 5.

    Sort the tetTKO_1_CpG.bedGraph file.

    $ sort -k1,1 -k2,2n -o tetTKO_1_CpG.bedGraph tetTKO_1_CpG.bedGraph

  6. 6.

    Repeat for the tetTKO_2_CpG.bedGraph, WT_1_CpG.bedGraph, and WT_2_CpG.bedGraph files.

  7. 7.

    Create a 4 column bedGraph file from the sorted tetTKO_1_CpG.bedGraph file.

    $ awk ′{OFS=“\t”}{print $1,$2,$3,$4/100}′ tetTKO_1_CpG. bedGraph | sed ′/^chrL/d′ > tetTKO_1_CpG_4col.bedGraph

  8. 8.

    Repeat for the tetTKO_2_CpG.bedGraph, WT_1_CpG.bedGraph, and WT_2_CpG.bedGraph files.

  9. 9.

    Download a chromosome size file for danRer10.

    $ fetchChromSizes danRer10 > chrom.sizes

  10. 10.

    Convert the tetTKO_1_CpG_4col.bedGraph file to a BigWig file (see Note 34).

    $ bedGraphToBigWig tetTKO_1_CpG_4col.bedGraph chrom.sizes tetTKO_1_CpG.bw

  11. 11.

    Repeat for the tetTKO_2_CpG_4col.bedGraph, WT_1_CpG_4col.bedGraph, and WT_2_CpG_4col.bedGraph files.

  12. 12.

    All BAM files, Bigwig files, and the phyloDMRs.bed file can now be loaded into the IGV browser for visualization (see Note 35). An example of both the BAM file and Bigwig file outputs can be seen in Fig. 4a.

Fig. 4
figure 4

Visualization and analysis of tet1/2/3 KO RRBS data. (a) Left panel: IGV visualization of BAM files in bisulfite mode (CG) over a phylo-DMR (phylotypic stage enhancer) region. Red = methylated cytosine, blue = unmethylated cytosine. Grey boxes represent single reads. Right panel: IGV visualization of BigWig files. Blue bars represent methylation level (mCG/CG). (b) Global mCG/CG levels of two biological replicates of uninjected (wt) and injected (tet1/2/3 KO) 24hpf F0 zebrafish. Methylation values correspond to average mCG/CG level in genomic bins (150 bp). (c) PhyloDMR mCG/CG levels of two biological replicates of uninjected (wt) and injected (tet1/2/3 KO) 24 hpf F0 zebrafish. (d) Heatmap showing locus-specific mCG/CG levels in phyloDMRs

3.25 Methylation Phenotype Analysis

To interrogate the methylation phenotypes of the tet1/2/3 KO samples, average methylation levels for phyloDMRs, and MspI fragments, can be determined.

  1. 1.

    Using bedtools, create a BED file with the phyloDMRs that are covered in all the bedGraphs (see Note 36).

    $ bedtools intersect -sorted -c -a tetTKO_1_CpG.bedGraph -b tetTKO_2_CpG.bedGraph WT_1_CpG.bedGraph WT_2_CpG.bedGraph | awk ′{OFS=“\t”}{if ($7==3) print $1,$2,$3}′ > common_CG $ bedtools intersect -sorted -u -a phyloDMRs.bed -b common_CG > common_phyloDMRs.bed

  2. 2.

    Calculate the average methylation levels across the commonly covered phyloDMRs for the tetTKO_1_CpG.bedGraph.

    $ bedtools map -a common_phyloDMRs.bed -b tetTKO_1_CpG. bedGraph -c 5,6 -o sum,sum | awk ′ {print $4/($4+$5)}′ > tetTKO_1_phyloDMRs_mCG-CG

  3. 3.

    Repeat for the tetTKO_2_CpG.bedGraph, WT_1_CpG.bedGraph, and WT_2_CpG.bedGraph files.

  4. 4.

    Using bedtools, merge all commonly covered CG dinucleotides into windows that loosely represent MspI fragments (see Note 37).

    $ bedtools merge -d 150 -i common_CG > common_CG_windows

  5. 5.

    Calculate the average methylation for these CG windows for the tetTKO_1_CpG.bedGraph.

    $ bedtools map -a common_CG_windows -b tetTKO_1_CpG.bedGraph -c 5,6 -o sum,sum | awk ′{print $4/($4+$5)}′ > tetTKO_1_ common_mCG-CG

  6. 6.

    Repeat for the tetTKO_2_CpG.bedGraph, WT_1_CpG.bedGraph, and WT_2_CpG.bedGraph files.

  7. 7.

    Create tables, which can be easily analyzed in R, for both the average methylation of phyloDMRs and the average methylation of these common MspI CG windows.

    $ paste WT_1_common_mCG-CG WT_2_common_mCG-CG tetTKO_1_common_mCG-CG tetTKO_2_common_mCG-CG | awk ′{printf "%.2f %.2f %.2f %.2f\n", $1, $2, $3, $4}′ | sed ′s/ /\t/g; 1iWT WT_2 tetTKO tetTKO_2′ | sed ′s/ /\t/g′ > common_mCG-CG $ paste WT_1_phyloDMRs_mCG-CG WT_2_phyloDMRs_mCG-CG tetTKO_1_phyloDMRs_mCG-CG tetTKO_2_phyloDMRs_mCG-CG | awk ′{printf "%.2f %.2f %.2f %.2f\n", $1, $2, $3, $4}′ | sed ′s/ /\t/g; 1iWT WT_2 tetTKO tetTKO_2′ | sed ′s/ /\t/g′ > phyloDMRs_mCG-CG $ sed ′s/\t/\t\t/g; s/$/\t/g; s/0\.0./none/g; s/0\.1./low/g; s/0\.2./medium/g; s/0\.3./medium/g; s/0\.4./medium/g; s/0\.5./medium/g; s/0\.6./medium/g; s/0\.7./medium/g; s/0\.8./high/g; s/0\.9./high/g; s/1\.00/high/g; s/\t\t/\t/g; s/\t$//g′ phyloDMRs_mCG-CG > phyloDMRs_ranges $ sed ′s/\t/\t\t/g; s/$/\t/g; s/0\.0./none/g; s/0\.1./low/g; s/0\.2./medium/g; s/0\.3./medium/g; s/0\.4./medium/g; s/0\.5./medium/g; s/0\.6./medium/g; s/0\.7./medium/g; s/0\.8./high/g; s/0\.9./high/g; s/1\.00/high/g; s/\t\t/\t/g; s/\t$//g′ common_mCG-CG > common_ranges

  8. 8.

    In R, create stacked plots showing the average methylation values for each sample at phyloDMRs and for the common MspI CG windows (Fig. 4b,c).

    R> install.packages(“ggplot2”) R> install.packages(“reshape2”) R> install.packages(“plyr”) R> library(ggplot2) R> library(reshape2) R> library(plyr) R> phyloDMRs_ranges = read.delim(′phyloDMRs_ranges′, header = TRUE) R> common_ranges = read.delim(′common_ranges′, header = TRUE) R> dat1 <- melt(as.matrix(phyloDMRs_ranges)) R> dat2 <- melt(as.matrix(common_ranges)) R> PhyloCounts <- count(dat1[, -1]) R> CommonCounts <- count(dat2[, -1]) R> ggplot(PhyloCounts, aes(fill=factor(value, levels=c(“none”,“low”,“medium”,“high”)), y=freq, x=Var2)) + geom_bar(position=“stack”, stat=“identity”) + scale_fill_manual (values=c(“#4575b4”,“#74add1”, “#f46d43”, “#d73027”)) R> ggplot(CommonCounts, aes(fill=factor(value, levels=c(“none”,“low”,“medium”,“high”)), y=freq, x=Var2)) + geom_bar(position=“stack”, stat=“identity”) + scale_fill_manual (values=c(“#4575b4”,“#74add1”, “#f46d43”, “#d73027”))

  9. 9.

    Methylation levels at each locus can be visualized using pheatmaps in R (Fig. 4d) (see Note 38).

    R> install.packages(“pheatmap”) R> library(pheatmap) R> phyloDMRs_mCG = read.delim(′phyloDMRs_mCG-CG′) R> mymat = as.matrix(phyloDMRs_mCG) R> my_palette <- c(“#4575b4”,“#74add1”, “#abd9e9”,“#e0f3f8”,“#ffffff”, “#fee090”, “#fdae61”, “#f46d43”, “#d73027”) R> pheatmap(mymat, color=my_palette, clustering_distance_rows = “euclidean”, cluster_cols = FALSE, cluster_rows = TRUE)

  10. 10.

    An example of plots from a successful tet1/2/3 KO RRBS experiment can be seen in Fig. 4.

4 Notes

  1. 1.

    The protospacer sequence is the genomic sequence that is targeted by the gRNA and is thus complementary to a portion of the gRNA. When using the Cas9 nuclease, the protospacer sequence must be followed by the protospacer adjacent motif (PAM): NGG. Cas9 will cut the protospacer sequence 3 base pairs away from the PAM site.

  2. 2.

    AMPure XP beads should be regularly tested by performing purifications on DNA ladders using different ratios of beads to ladder. The products can be analyzed by gel electrophoresis to determine the relationships between bead ratios and fragment size selection.

  3. 3.

    The supernatant from the AMPure XP beads can be most effectively removed with multiple rounds of pipetting with short tabletop centrifugation in between each attempt.

  4. 4.

    When eluting, it is recommended to add an extra 1–2μl of solution to the volume that will ultimately be taken. This will avoid carrying over any AMPure XP beads into subsequent steps.

  5. 5.

    The gRNA produced by the in vitro transcription will often be visible as a white precipitate. The precipitate can settle to the bottom of tubes so take care to not to leave the gRNA behind when transferring between tubes. This can be achieved by mixing well before any transfers.

  6. 6.

    If the gRNA pellet is not disturbed, simply remove the ethanol with a pipette to complete the wash. If the pellet becomes dislodged, centrifuge for another 5 min at 4 °C before removing the ethanol.

  7. 7.

    Air bubbles will often become trapped under the injection mold. Gently tap the mold to try and displace the formed bubbles. The bubbles themselves will not cause any issues if they are smaller than a single egg.

  8. 8.

    While more zebrafish can be added per breeding tank to maximize the amount of eggs, this will result in greater genetic variability in the experiment. To overcome this, use the same fish to lay multiple rounds of eggs in a single day.

  9. 9.

    From each injection 80 embryos will be collected. However, it is best to inject as many eggs as possible as many will be unfertilized, dead or will not display any phenotype.

  10. 10.

    Microinjection of multiple gRNA and Cas9 into the single cell or the yolk have been shown to have comparable results [29].

  11. 11.

    The microinjection needle can become clogged during the injection process. To unclog the needle, the pressure can be temporarily increased or the needle can be cut again or replaced.

  12. 12.

    At 24 hpf the tet1/2/3 KO phenotypes tend to be very subtle. On average, for a successful injection experiment, approximately 50% of embryos will display developmental phenotypes as described above. We do not observe any noticeable lethality before 48 hpf compared to wild type controls. However, by ~7 dpf, nearly all larvae which had phenotypes will die. It is therefore recommended to keep some of the fish until after 48 hpf to get a better estimation of how successful the injection experiment was.

  13. 13.

    RRBS libraries are dependent on high quality gDNA because of their reliance on size selection and restriction enzymes. Any degraded DNA products can be carried over into the final libraries if they are of the right size. The more degraded the gDNA the less enriched the libraries will be.

  14. 14.

    While methylated adapters are not required for amplicon sequencing, we use methylated adapters in this case as excess is available from the kit required for RRBS experiments.

  15. 15.

    The RRBS protocol is optimized for gDNA input of 500 ng, however, as little as 100 ng per sample can be used. Libraries that will be compared against each other should start with the same amount of input gDNA.

  16. 16.

    For low level multiplexing, certain barcode indexes need to be used in combination. Follow the manufacturer’s instructions from the NEXTflex bisulfite-seq barcodes kit to determine which adapter barcodes to use when multiplexing a different amount of samples.

  17. 17.

    Expected fragment sizes for this protocol range from 250 bp to 500 bp, with ~120 bp of each fragment being contributed to by the adapter sequence. The lower limit is determined by the AMPure XP bead size selections while the upper limit is driven by the sodium bisulfite treatment and the preferential amplification of smaller fragments in PCR reactions. A sharp band can sometimes appear at ~120 bp and this is caused by adapter dimer contamination. If this band is present, repurify the libraries as in Subheading 3.12.

  18. 18.

    ROX is a reference dye which is used to normalize the fluorescent reporter signal in some qPCR instruments. 0.2 μl low ROX or 0.2 μl high ROX may be added to each sample depending on the instrument used.

  19. 19.

    A single lane of the HiSeq X yields approximately 2 × 400 million, 150 bp paired end reads. 2 × 20 million paired end reads are sufficient for an RRBS library using zebrafish gDNA. Zebrafish bisulfite-based libraries most commonly map to the genome with ~60% mapping efficiency which will yield approximately 2 × 12 million usable paired end reads. In this protocol, as only 4 RRBS libraries are pooled together, only 10% of a single lane is needed. Up to 20 RRBS libraries can be pooled into a single HiSeq X lane. Each amplicon from an amplicon sequencing library requires 5–10k reads, which is the equivalent of ~0.01% of what is required for four RRBS libraries (or 0.03% for a pool of 3 amplicons).

  20. 20.

    The data from Illumina sequencer is usually converted into FASTQ format [30]. The index sequence from each adapter barcode allows the sequencing data to be de-multiplexed into multiple FASTQ files, which contain both the sequence and quality information for each library. In paired end sequencing, separate FASTQ files for Read 1 and Read 2 will be generated for each library.

  21. 21.

    The quality of FASTQ files for each library can be assessed with FastQC. FastQC produces a number of informative graphs and tables relating to both sequence quality and content. There are some hallmark features of a FastQC report from a successful RRBS library. The “Per base sequence content” should show that nearly 100% of reads begin with a C or T followed by GG. This is due to the MspI digestion creating fragments beginning with CGG nucleotides. These will remain as CGG if methylated or become TGG if they were unmethylated. RRBS libraries should also have a high level of duplication, on average greater than 70% duplication rate, as every copy of DNA should produce the same fragments (Fig. 2). Therefore, unlike in WGBS , deduplication of the data is not recommended.

  22. 22.

    RRBS libraries created with MspI digestion need to have 2 bp from the 3′ end of Read 1 removed and the first 2 bp of Read 2 removed. This is because MspI digestion causes overhangs which get filled in during the end repair process. This means an unmethylated cytosine will always be introduced in the fragment regardless of the original methylation state.

  23. 23.

    CRISPResso is used to analyze amplicon sequencing libraries and determine the efficiency of tet1/2/3 KO. CRISPResso generates graphs and tables relating to the percentage of reads with detectable NHEJ events, as well as the position and sequence of these events. As zebrafish display considerable genetic diversity, CRISPResso is best used with substitutions ignored (--ignore_substitutions) to avoid calling NHEJ events which are merely polymorphisms. A window of 10 bp around the cut site (-w 10) is also used to avoid including errors from the start and end of reads.

  24. 24.

    The lambda genome is used as a control for testing the non-conversion rate of the libraries. As unmethylated lambda DNA was spiked into each library, the non-conversion rate can be determined by the percentage of cytosines which remained unconverted after the bisulfite conversion process.

  25. 25.

    Multiple threads, and in some instances cores, can be allocated to running commands to greatly speed up the time taken for completion. To find out if commands have the options for allocating multiple threads/cores, it is best to read the manuals which come when installing the commands or to visit their associated webpages.

  26. 26.

    SAM files are text-based files which contain the alignment information and read information generated by mapping the FASTQ files [23]. SAM files can be converted into a compressed binary format called a BAM file.

  27. 27.

    RRBS libraries are prone to methylation biases across the length of the read. It is highly recommended to use MethylDackel to analyze this (mbias) and then correct for it in the subsequent methylation calling steps.

  28. 28.

    As zebrafish are highly polymorphic, it is important to perform a genome correction to avoid erroneously calling genomic sequence changes as methylation changes.

  29. 29.

    BedGraph files are similar to BED files (see Note 32); however, they also must have a fourth column with a value. In this case, this value is the methylation percentage. BedGraphs produced by MethylDackel also have additional fifth and sixth columns, and these are the number of reads reporting a methylated base and unmethylated base, respectively.

  30. 30.

    Zebrafish display both mCG an mCH methylation dynamics during embryogenesis [31]. One can thus extend the analyses to all C nucleotides by using the (–CHH) and (–CHG) options in the MethylDackel extract command.

    $ MethylDackel extract --CHH --CHG --maxVariantFrac 0.25 --minDepth 5 --OB 10,140,10,140 --OT 10,140,10,140 danRer10_lambda.fa tetTKO_1.bam

  31. 31.

    A header left in the bedGraph file can cause some commands to fail, such as BEDtools and liftOver commands.

  32. 32.

    BED files are tab-delimited files that contain 3 columns describing genomic coordinates. BED files contain a chromosome column, start column and end column.

  33. 33.

    The coordinates of phylo-DMRs were originally determined using the danRer7 genome assembly. In order to use them in this pipeline, the coordinates must be converted to the danRer10 assembly using the liftOver command.

  34. 34.

    BigWig files are compressed, indexed, and in binary format. They contain signal data, in this case methylation values. BigWig files can be used to visualize data, such as in genome browsers or to create heatmaps.

  35. 35.

    IGV browser can display BED, BedGraph, BAM, and BigWig files. Visualizing BAM files is useful as this type of visualization allows inspection of single reads, thus providing information about the heterogeneity of DNA methylation at specific loci. BigWig files are useful as they show the overall average methylation values at specific loci (Fig. 4).

  36. 36.

    Using only common phylo-DMRs makes comparisons between libraries more valid. It also makes R commands simpler and heatmaps easier to interpret.

  37. 37.

    RRBS libraries cannot accurately determine global methylation levels due to libraries only covering a fraction of the genome with specific sequence biases. However, using all the regions covered can still be useful in showing that the methylation changes in tet1/2/3 KO are more pronounced at phylo-DMR regions. Random control regions can also be created by using BEDtools shuffle.

    $ bedtools shuffle -i phyloDMRs.bed -g chrom.sizes > shuffled_control.bed

  38. 38.

    Heatmaps can also be created by using a variety of other methods such as deepTools (https://github.com/deeptools/) and the gplots package in R (http://cran.r-project.org/package=gplots).