Defiant: (DMRs: easy, fast, identification and ANnoTation) identifies differentially Methylated regions from iron-deficient rat hippocampus

Condon, David E.; Tran, Phu V.; Lien, Yu-Chin; Schug, Jonathan; Georgieff, Michael K.; Simmons, Rebecca A.; Won, Kyoung-Jae

doi:10.1186/s12859-018-2037-1

Defiant: (DMRs: easy, fast, identification and ANnoTation) identifies differentially Methylated regions from iron-deficient rat hippocampus

Software
Open access
Published: 05 February 2018

Volume 19, article number 31, (2018)
Cite this article

Download PDF

You have full access to this open access article

BMC Bioinformatics Aims and scope Submit manuscript

Defiant: (DMRs: easy, fast, identification and ANnoTation) identifies differentially Methylated regions from iron-deficient rat hippocampus

Download PDF

David E. Condon¹,
Phu V. Tran²,
Yu-Chin Lien³,
Jonathan Schug¹,
Michael K. Georgieff²,
Rebecca A. Simmons³ &
…
Kyoung-Jae Won ORCID: orcid.org/0000-0002-2924-9630^1,4

5977 Accesses
24 Citations
3 Altmetric
Explore all metrics

Abstract

Background

Identification of differentially methylated regions (DMRs) is the initial step towards the study of DNA methylation-mediated gene regulation. Previous approaches to call DMRs suffer from false prediction, use extreme resources, and/or require library installation and input conversion.

Results

We developed a new approach called Defiant to identify DMRs. Employing Weighted Welch Expansion (WWE), Defiant showed superior performance to other predictors in the series of benchmarking tests on artificial and real data. Defiant was subsequently used to investigate DNA methylation changes in iron-deficient rat hippocampus. Defiant identified DMRs close to genes associated with neuronal development and plasticity, which were not identified by its competitor. Importantly, Defiant runs between 5 to 479 times faster than currently available software packages. Also, Defiant accepts 10 different input formats widely used for DNA methylation data.

Conclusions

Defiant effectively identifies DMRs for whole-genome bisulfite sequencing (WGBS), reduced-representation bisulfite sequencing (RRBS), Tet-assisted bisulfite sequencing (TAB-seq), and HpaII tiny fragment enrichment by ligation-mediated PCR-tag (HELP) assays.

MethGo: a comprehensive tool for analyzing whole-genome bisulfite sequencing data

Article Open access 09 December 2015

HEPeak: an HMM-based exome peak-finding package for RNA epigenome sequencing data

Article Open access 21 April 2015

Differential Methylation Analysis for Bisulfite Sequencing (BS-Seq) Data

Background

DNA methylation plays a critical role in gene regulation [1]. In human somatic cells, 70–80% of all CpG dinucleotides in the genome are methylated [2]. DNA methylation represents one type of epigenetic modification which has been shown to control transcription in mammals [3], interacting sometimes with DNA binding proteins [4]. DNA methylation regulates many diverse biological functions, such as embryonic stem cell differentiation [5], aging [6], gene imprinting [7, 8], and X-chromosome inactivation [9]. DNA methylation is conserved and somatically heritable mark that is generally associated with transcriptional repression [10]. Aberrant methylation has been found in multiple diseases such as cancer [11], imprinting defects [12] and mental disorders such as schizophrenia [13, 14]. Environmental exposures such as uteroplacental insufficiency [15, 16] or cigarette smoking [17, 18] have also been observed to alter DNA methylation.

Recent developments in sequencing technology enabled genome-wide characterization of DNA methylation. Whole-genome bisulfite sequencing (WGBS) and reduced representation bisulfite sequencing (RRBS) have been widely used to measure DNA methylation at a single CpG resolution [19]. DMRs are the contiguous genomic regions whose DNA methylation status differs between two groups of samples. DMRs have been used to characterize cell-type or condition specific DNA methylation [20,21,22].

Computational approaches have been developed based on statistical frameworks to identify DMRs. BSmooth [23] identifies DMRs using a local-likelihood approach to estimate a sample-specific methylation profile. For a statistical test between samples, BSmooth uses a Welch’s t-test [24]. MethylKit identifies DMRs based on logistical regression if multiple replicates are available, or a Fisher’s exact test [25] if only one sample is available. MethylSig [26] uses a beta-binomial approach to identify DMRs based on read coverage and biological variability, which showed high sensitivity in comparison with MethylKit, BSmooth, a standard t-test and the Wilcoxon rank test. Metilene [27] uses a binary segmentation algorithm combined with a two-dimensional Kolmogorov-Smirnov test. Genomic regions are pre-segmented, and gradually reduced in size until the region contains less than a defined minimum number of CpGs or statistical significance is not improved. RADMeth [28] employs a beta-binomial regression and a Stouffer-Liptak test. And RnBeads [29] calculates p-values for each CpG in the data set using hierarchical linear models and M-values [30]. BiSeq uses a smooth-based approach while considering coverage to call DMRs [31].

With the exception of Metilene, many well-known DMR callers require knowledge of the R programming language [32] or even a specific version of R (MethylKit & BiSeq). In addition, each DMR caller requires specific input formats to run it properly. Furthermore, many of them use extreme computing resources for genome-scale analysis. Table 1 summarizes the characteristics and algorithms of the widely used DMR callers we investigated.

Table 1 Comparison of DMR calling software

Full size table

The numerous disadvantages of these programs/statistical methods prompted the development of a new DMR-identification program. Defiant is a standalone program following GNU99 standard, which reduces the issues of portability. Defiant automatically detects ten different input formats widely used for DNA methylation. It does not require installation of any libraries and runs with a single command line. We evaluate Defiant’s performance with comprehensive benchmarking tests using both artificial and real WGBS data. We applied Defiant to analyzing the DNA methylation changes induced by iron deficiency during the critical neuro-developmental period (fetus and newborn) in the rat hippocampus.

Implementation

Animals and Hippocampal dissection

G2 pregnant CD1 Sprague-Dawley rats were purchased from Charles Rivers Laboratories. The experimental conditions for induction of fetal-neonatal iron deficiency were following the previously described protocol [33]. All procedures were approved by the Institutional Animal Care and Use Committee of the University of Minnesota. Hippocampal dissection and storage from PND15 rats was performed as previously described [33]. Genomic DNA from PND15 rat hippocampi was isolated using Allprep DNA/RNA mini kit (Qiagen).

WGBS & data processing

WGBS was performed as a published protocol [34]. Briefly, 1 μg of genomic DNA was fragmented into 300 bp size using M220 Covaris Ultrasonicator. Sequencing libraries were generated using NEBNext genomic sequencing kit (New England Biolabs) and ligated with Illumina methylated paired end adaptors. Libraries were bisulfite-converted using Imprint DNA modification kit (Sigma), and the size of 300–600 bp was selected using Pippin Prep DNA size selection system (Sage Science). Libraries were then amplified using PfuTurbo Cx Hotstart DNA polymerase (Agilent Technologies). Samples were sequenced to 100 bps in either paired-end or single-read formation on an Illumina HiSeq 2000 with RTA version 1.13.48 and HiSeq control software version HiSeqCS:1.5.15.1. Adpaters were trimmed from the reads using a custom C language program. Trimmed reads were aligned against the rat genome (rn4) using bs seeker (v1) [35]. The methylation status was then tallied from the bs seeker output. When reads overlapped at a base, the methylation status from read 1 was used. Methylation data at the C and G in a CpG pair was merged to produce the estimate at that locus. The WGBS data have been deposited in the GEO repository (GSE98064).

Identification of DMRs using WWE

Defiant defines DMRs based on seven criteria. These can also be specified by the end-user.

All nucleotides in all samples are present and meet minimum coverage (default 10). The user can specify that some nucleotides can be missing from certain replicates, but we recommend against it as it can introduce false positives.
Absolute value in the difference of the sum of the methylation percentages is above a given cutoff %. The default is 10%, but this will vary in every individual experiment based on the chemistry [36]. The mean methylation percentage, $ \overline{m} $, is weighted based on coverage, i.e.
$$ \overline{m}=\frac{\sum_{r=1}^R{C}_r\frac{{}_m{C}_r}{C_r}}{\sum_{r=1}^R{C}_r}=\frac{{\sum_{r=1}^R}_m{C}_r}{\sum_{r=1}^R{C}_r} $$
(1)
where C_r is the coverage for replicate r, _mC_r is the number of 5mC for replicate r, and R is the number of replicates.
A 2-tailed p-value, default 0.05

If there is only 1 replicate in either sample, a p-value between groups A and B is calculated by Fisher’s exact test [37]

$$ p=\frac{\left({}_m{C}_A +_m{C}_B\right)!\left({C}_A+{C}_B\right)!\left({}_m{C}_A+{C}_A\right)!\left({}_m{C}_B+{C}_B\right)!}{{}_m{C}_A!{}_m{C}_B!{C}_A!{C}_B!\left(\Big({}_m{C}_A +_m{C}_B+{C}_A+{C}_B\right)!} $$

(2)

where _mC = number of 5-methyl Cystosine and C = number of Cytosine.

If there are multiple replicates in both groups, the p-value is based on Welch’s t-test [24] in sum of methylation percentages is below a given cutoff.

If both samples have multiple replicates, the t-test between groups A and B is calculated thus:

$$ t=\frac{{\overline{m}}_B-{\overline{m}}_A}{\sqrt{\frac{s_A^2}{N_B}+\frac{s_B^2}{N_B}}}, $$

(3)

where the unbiased sample variance s for any group A is also weighted based on coverage:

$$ {s}_A=\frac{\sum \limits_{r=1}^R{C}_r{\left({\overline{m}}_A-{m}_r\right)}^2}{\left(\sum \limits_{r=1}^R{C}_r\right)-1} $$

(4)

The Benjamini-Hochberg [38] approach is applied to the identified DMRs to adjust p-value for multiple testing.

a minimum number of CpN constituting differentially methylated region (default CpN = 5),
a minimum range of the differentially methylated nucleotides (default 0)
a maximum range between CpN (default 20,000 nucleotides).
a maximum similar, i.e. non-differentially methylated, CpG count (default 5). If a DMR is currently expanding, the DMR expands until one criterion of differential methylation stops. By allowing a similar CpG count, DMRs can contain similarly methylated CpG inside the DMR. Once the similar nucleotide is broken, the DMR shrinks to the point where differential methylation stopped.

All criteria are easily set by command-line options. Defiant is designed to test multiple parameters in parallel, to make the final decision on each parameter up to the end user. Defiant has a companion program in Perl, plot_results.pl, which plots the number of DMRs as a function of different parameters using GNUPlot if a user has chosen to test multiple parameters. The effects of the variation of each parameter on the number of DMRs found in the rat hippocampus data can be seen in (Additional file 1: Figures S2-S16).

The number of DMR is sensitive to CpN, differentially methylated CpN (d), and p-value, but less sensitive to minimum percent change, minimum coverage. Minimum CpN = 5 is a point where the number of DMR is large enough and less sensitive to minimum coverage (Additional file 1: Figure S7). Minimum coverage determines the number of DMRs. The default (10) is the point where the number of DMR is robust to the “Differentially methylated CpN” (Additional file 1: Figure S8). We set a large number for “Maximum range for CpN” (default = 20,000). The difference in the methylation percentage (default 10%) can further be used to identify subtle but significant changes. A minimum range between CpN is not used for the rat data (default 0) but provide a user with flexibility in defining DMRs.

Comparison with other methods

DMR Overlay is the percent of nucleotides predicted by the program inside of the artificial DMR. DMR overlay is measured in two directions, once with respect to the benchmark DMRs and once with respect to the predicted DMRs (Fig. 5). For example, in Fig. 1, the overlay of predicted DMR 1 with respect to Benchmark DMR “A” is calculated as

$$ \mathrm{DMR}\ \mathrm{overlay}=\frac{\mathrm{length}\left(\mathrm{A}\right)+\mathrm{length}\left(\mathrm{B}\right)}{\mathrm{length}\ \left(\mathrm{DMR}\ 1\right)}=\frac{160+139}{319}=93.7\% $$

while for the reverse, DMRs A and B would each have their own DMR overlay equal to 100%. This is similar, but more informative than a Jaccard index, which is symmetric while DMR overlay depends on direction of comparison.

Defiant is run with default minimum coverage of 10, p = 0.05, minimum percent change of 10%, and five CpN (CpG) in each DMR.
MethylKit [25] scans for DMRs using a tiling window of 1000 nucleotides and a step size of 1000 nucleotides. Consecutive windows that score q < 0.05 are considered as a single DMR. Tiles can report q = 0 with p ≈ 1, so tiles with q = 0 are treated as non-DMRs. MethylSig [26] also uses a tiling approach, with a window and step size of 25 nucleotides. As for MethylKit, consecutive windows with q < 0.05 are considered as a single DMR.
Metilene [27] is a command-line program, and was run according to defaults. Metilene was run with a minimum CpG count of 5 with the rat data to make results comparable to Defiant.
RADMeth [28] is run as a series of three commands to the Linux shell. RADMeth showed itself superior to ComMet [39] and DSS [40] so we do not compare these methods here.
RnBeads [29] differentially methylated regions are defined by 500 nucleotide tiles. Similarly with MethylKit and MethylSig, consecutive tiles with q < 0.05 are considered as a single DMR.

Results

DMR identification by weighted Welch expansion

Defiant calculates a p-value using a Welch’s t-test for the weighted means and variance (Method). A weight is to give more credence to the replicate with a high coverage. If there is only one replicate in either set, Welch’s t-test cannot be used, and thus Defiant uses a Fisher’s exact test. For accurate detection of the boundary of a DMR, Defiant detects the start point based on the following factors: differences in methylation levels, coverage, p-value, and a minimum number of CpGs. If all of these criteria are satisfied, the next nucleotide is checked for constituting differential methylation, i.e. the DMR expands. It terminates when there is a number of consecutive CpGs that do not pass the criteria. We call this algorithm weighted Welch expansion (WWE).

Figure 2 demonstrates how Defiant identifies DMRs using WWE. Defiant calculates the weighted mean based on the coverage. The weighted mean for the two datasets (“B” and “F”) are shown on the top two panels. The p-values were calculated using a Welch’s t-test (on the third panel). The DMR on the left side starts at 120,124 when the difference in methylation levels is − 88% and p = 7.7 × 10^− 20. The DMR expands as long as it sees CpGs that pass the criteria. At 120,052, Defiant stops its expansion when it observes at least 5 consecutive CpGs that fail the criteria. Defiant does not require setting the size of a window or “tiles” as is done in MethylKit [25], so the data determines the size of the differentially methylated regions. This approach gives much greater flexibility and power in analyzing the data, as small differentially methylated regions can be easily missed when using a tiling approach.

Datasets

We used two datasets to evaluate Defiant’s performance.

Artificial benchmarking datasets used for Metilene [27]

Metilene [27] simulated RRBS and WGBS datasets using beta binomial distribution. The dataset are composed of two different backgrounds, each with four subsets. These are named 1.1 (strongest methylation differences) through 1.4 (weakest differences) for the first background, and 2.1 through 2.4 for the second (Additional file 1: Figure S1). In total, there are 16 subsets (eight for RRBS and eight for WGBS) to test DMR calling for various data configurations. Each subset has ten replicates. We downloaded them from http://www.bioinf.uni-leipzig.de/Software/metilene/Downloads/ .

WGBS data from postnatal day 15 iron deficient and iron sufficient rat hippocampi

WGBS data were generated to analyze changes in hippocampal DNA methylation due to iron-deficiency during the fetal and neonatal periods. Pregnant/nursing rat dams were fed an iron-deficient diet (4 ppm iron) from gestational day 2 through postnatal day (P) 7, at which time they were given an iron-sufficient control diet (200 ppm iron). Iron sufficient control rats were fed an iron sufficient diet through the entire experimental duration. At P15, rat pups from both groups were euthanized and hippocampi were isolated. This diet manipulation induced a 60% iron deficiency in the P15 rat hippocampus [41, 42]. Three biological replicates of WGBS data were generated from hippocampi of both groups.

Performance evaluation

We evaluate the DMR-identification programs Defiant, Metilene, MethylKit, MethylSig, RadMeth, and RnBeads. We chose not to use BSmooth and MOABS as they have already shown inferior performance as compared to Metilene [27] using the same test set we use in this experiment. Using the DMRs in the artificial datasets as the gold standard, we defined true positive (TP) when a predicted DMR overlapped with a DMR in the benchmark, otherwise it was defined as a false positive (FP). False negative (FN) was defined when a DMR in the benchmark dataset was not predicted. FN, FP, and TP values for all DMR callers for each artificial dataset are listed in Additional file 1: Table S1.

For comprehensive evaluation, we compared precision (TP/(TP + FP)) against FN as well as against recall (TP/(TP + FN)) for 8 sets of RRBS and WGBS data (Fig. 3). In these tests, both Defiant and Metilene showed excellent precision and recall with very low FNs compared with other DMR callers. They scored perfect precision because their FP is 0 for all 16 tests. RnBeads also showed FP equal or close to zero. However, it suffered a high FN (Fig. 3). MethylKit showed the worst performance in these tests mainly due to excessive number of FPs. MethylSig showed high number of FPs for WGBS datasets (Additional file 1: Table S1). Moreover, MethylSig did not predict any DMRs for RRBS datasets (TPs and FPs were 0). RADMeth scored moderately well, but scored behind both Defiant and Metilene.

To obtain the overall performance we calculated the F1 score (=$ 2\times \frac{\mathrm{precision}\times \mathrm{recall}}{\mathrm{precision}+\mathrm{recall}}\Big) $. The F1 score is the harmonic mean of precision and recall. Both Defiant and Metilene showed outperforming F1 scores compared with other predictors (Fig. 4). The performances between Defiant and Metilene were comparable when we investigated the statistical differences between the predictions using a Welch’s t-test [24] (p = 0.81, Additional file 1: Table S2). Considering that the artificial datasets were generated using a beta binomial distribution, providing a favorable environment for Metilene, the comparable performance of Defiant is notable. Examining the results more closely, we found that Defiant outperformed Metilene especially for background 2, dataset 4 where there were subtle but significant differences in DNA methylation levels (Fig. 3 and Additional file 1: Table S1). To examine the DMR calling boundary more carefully, we calculated the overlapping ratios of DMRs, quantified as “DMR overlay” (Methods). Compared with Metilene, Defiant showed better overlay with respect to the predicted DMRs, suggesting that DMRs that Defiant calls capture the robust portion of DMRs (Fig. 5).

Comparison using real WGBS data

We applied Defiant to identify DMRs in the iron-deficient rat hippocampus using the same parameters as were used with the artificial data. Fetal-neonatal iron deficiency, which is one of the most common nutritional deficiencies in the world [43], affecting as many as 2 billion people and approximately 30% of pregnancies [44, 45], causing long-term cognitive deficits despite iron treatment [46, 47]. Because these changes persist into adulthood even in the face of normal iron levels, and the known link between fetal exposures and long-term outcomes after birth, we investigated whether DNA methylation is altered in the developing rat hippocampus induced by fetal-neonatal iron-deficiency using WGBS datasets. The methods used to induce fetal-neonatal iron-deficiency have been previously described [48]. We investigated if DNA methylation is affected by iron-deficiency during fetal-neonatal periods using WGBS datasets with three iron-deficient rats and three iron sufficient rats.

We compared performance of Defiant to that of Metilene because they showed comparable performances based on the benchmarking test. Between the iron-deficient and iron-sufficient groups, Defiant identified 229 DMRs, while Metilene identified only 80 DMRs, with ten regions showing overlap between the two approaches. Figure 6 showed the DNA methylation status of the DMRs identified both by Defiant and Metilene.

We found that the DMRs identified by Metilene but not by Defiant were mostly in low coverage areas. When methylation levels were weighted by the coverage, they showed non-significant p-values (Fig. 7a and b). Indeed, compared with the artificial datasets which have the distribution of coverage in a short range, the WGBS data in rat hippocampus were with a wider range of coverage (Fig. 7c). These results indicate a superior performance of Defiant when applied to real WGBS data. Figure 8 shows the examples of DMRs detected by Defiant but not by Metilene. The DNA methylation profiles showed clear differences in DNA methylation between the iron sufficient and the iron deficient groups for regions near genes such as Pde6c, Chd2, Mobp, and Pck1. The differences between Defiant’s and Metilene’s predictions are due to Defiant’s use of coverage-weighted means. The coverage-weighted means used in WWE allow Defiant to avoid artifacts such as Simpson’s paradox [49].

Defiant identifies DMRs close to genes potentially affected by iron-deficiency in fetal-neonatal periods

The 229 DMRs that Defiant identified mapped within 15 Kbps of 108 genes (Additional file 1: Table S3). Among the 108 genes, 45 showed hypomethylated and 63 showed hypermethylated regions (Additional file 1: Tables S4 and S5, respectively). We identified that considerable portion of them (37 out of 43 hypomethylated and 31 out of 62 hypermethylated DMRs) were associated with neuronal function or development (Additional file 1: Tables S4 and S5), corroborating a previous finding [50]. Gene ontology (GO) analysis using Enrichr [51, 52] identified “abnormal nervous system” in Mouse Genome Informatics [53] mammalian phenotype level 3 (p-value = 0.005). Genes associated with this term were Camk2b, Fkrp, Ncf1, St8sia1, Itsn1, Cacna1c, Usf2, Mib1, Fig4, Jph3, Mobp, Ush1g, Prkar1b, Tal1, and Pde6c. We also observed terms “Rho GTPase cycle” (p-value: 0.0005), and “Axon guidance” (p-value = 0.001). In total, 7 out of 43 and 12 out of 62 genes were associated with GTPase activity for gain and loss of DNA methylation levels, respectively. The rho family of GTPases [54, 55], one of the G-protein coupled receptors, regulates neuronal morphogenesis [56], dendritogenesis [57], and spinogenesis [58]. We also found terms “synapse part” (p-value = 0.017; Arf1, Tenm2, Pde2a, Cacna1c, Srgap2, and Mib1). Collectively, Defiant identified DMRs close to the genes critical for neuronal synaptogenesis and plasticity.

Defiant showed best performance based on the running time

Running time is important for genome-scale analysis. We evaluated the performance based on resource use. Using the benchmarking datasets, all pertinent DMR callers were run on the same system (Intel Xeon CPU 2.1GHz with 32GB RAM). For fair comparison, we converted the format of the benchmarking datasets into the format preferred by each DMR caller before evaluation. Therefore, mapping and input format conversion were not needed for this benchmarking. We observed dynamic range of running time and memory use. Defiant showed much faster performance compared with other methods: 5× faster than Metilene, 10× faster than MethylKit and MethylSig, and > 300× faster than RADMeth and RnBeads. When we applied Defiant to our WGBS rat datasets on the same system, it took less than 2 min to obtain the DMR results for the entire genome. Defiant’s memory usage is light, about 1GB for genome-wide WGBS data. Compared to the memory usages of the DMR callers, Defiant used slightly more memory than Metilene because Defiant is designed to run in a single step.

Conclusions

We developed Defiant a new method to identify DMRs. Defiant is designed to provide easy and fast implementation of DMR calling while guaranteeing the prediction performance. We also put more credence to the CpGs with high coverage. For a rigorous test while weighing DNA methylation based on coverage, we used a Welch’s t-test. A Welch’s t-test does not assume equal variation between the sets and simplifies the incorporation of the weight information.

One of the widely used approaches for modeling DNA methylation is a beta binomial distribution. In our benchmarking tests, Defiant showed superior performance to other beta binomial distribution based predictors such MethylSig, MOABS, and RADMeth. It is noteworthy that the artificial data we used were generated by the developers of Metilene using a beta binomial distribution. Despite the potential bias against it, Defiant showed comparable performance with Metilene. Our results indicate that using a Welch’s t-test is appropriate for DMR identification. Close examination found that Defiant performed better then Metilene when modest but significant differences were observed (Fig. 3). More importantly, Defiant identified more DMRs in the rat hippocampus datasets which showed clear DMRs (Fig. 8). The DMRs uniquely observed by Metilene were in very low coverage areas (Fig. 7). Together, these suggest that Defiant performs better than Metilene for analysis of real data.

For genome-scale analysis, reduced running time is highly desirable. In our test, Defiant ran remarkably faster than other competitors using memory less than 1GB (Fig. 9). When applied to the whole-genome data in rats, it took less than 2 min for DMR calling for the entire genome. Defiant accepts diverse formats for DNA methylation including the format for Bismark coverage, and Bismark cytosine [59], BisSNP [60], MethylKit & MethylSig input, UCSC ENCODE, and EPP [61]. Defiant also runs as a standalone software.

For convenient analysis, Defiant provides annotation about the genes located around DMRs. Additional file 1: Table S3 shows the example of output of Defiant. Defiant is applicable to bisulfite-sequencing data, RRBS [19], HpaII tiny fragment enrichment by ligation-mediated PCR-tag (HELP-Tag) data [62], and Tet-assisted bisulfite sequencing (TAB-Seq) [63]. The source code is available on http://github.com/hhg7/defiant.

Abbreviations

DMR:: Differentially methylated region
GO:: Gene ontology
RRBS:: Reduced Representation Bisulfite Sequencing
WGBS:: Whole Genome Bisulfite Sequencing
WWE:: Weighted Welch Expansion

References

Bird A. DNA methylation patterns and epigenetic memory. Genes Dev. 2002;16(1):6–21. https://doi.org/10.1101/gad.947102. http://genesdev.cshlp.org/content/16/1/6.full.pdf+html
Article CAS PubMed Google Scholar
Ehrlich M, Gama-Sosa MA, Huang L-H, Midgett RM, Kuo KC, McCune RA, Gehrke C. Amount and distribution of 5-methylcytosine in human DNA from different types of tissues or cells. Nucleic Acids Res. 1982;10(8):2709. https://doi.org/10.1093/nar/10.8.2709.
Article CAS PubMed PubMed Central Google Scholar
Choy M-K, Movassagh M, Goh H-G, Bennett MR, Down TA, Foo RS. Genome-wide conserved consensus transcription factor binding motifs are hyper-methylated. BMC Gen. 2010;11(1):1–10. https://doi.org/10.1186/1471-2164-11-519.
Article Google Scholar
Stadler MB, Murr R, Burger L, Ivanek R, Lienert F, Scholer A, Wirbelauer C, Oakeley EJ, Gaidatzis D, Tiwari VK, Schubeler D. DNA-binding factors shape the mouse methylome at distal regulatory regions. Nature. 2011;480:490–5. https://doi.org/10.1038/nature10716.
CAS PubMed Google Scholar
Altun G, Loring JF, Laurent LC. DNA methylation in embryonic stem cells. J Cell Biochem. 2010;109(1):1–6. https://doi.org/10.1002/jcb.22374.
CAS PubMed PubMed Central Google Scholar
Wilson V, Jones P. DNA methylation decreases in aging but not in immortal cells. Science. 1983;220(4601):1055–7. https://doi.org/10.1126/science.6844925.
Article CAS PubMed Google Scholar
Moore T, Haig D. Genomic imprinting in mammalian development: a parental tug-of-war. TIG. 1991;7(2):45–9. https://doi.org/10.1016/0168-9525(91)90230-N.
Article CAS PubMed Google Scholar
Li E, Beard C, Jaenisch R. Role for DNA methylation in genomic imprinting. Nature. 1993;366(6453):362–5. https://doi.org/10.1038/366362a0.
Article CAS PubMed Google Scholar
Beard C, Li E, Jaenisch R. Loss of methylation activates Xist in somatic but not in embryonic cells. Genes Dev. 1995;9(19):2325–34. https://doi.org/10.1101/gad.9.19.2325.
Article CAS PubMed Google Scholar
Ushijima T, Watanabe N, Okochi E, Kaneda A, Sugimura T, Miyamoto K. Fidelity of the Methylation pattern and its variation in the genome. Genes Res. 2003;13(5):868–74. https://doi.org/10.1101/gr.969603.
Article CAS Google Scholar
Esteller M. Cancer epigenomics: DNA methylomes and histone-modification maps. Nat. Rev. Genet. 2007;8(4):286–98. https://doi.org/10.1038/nrg2005.
Article CAS PubMed Google Scholar
Robertson KD. DNA methylation and human disease. Nat Rev Genet. 2005;6 https://doi.org/10.1038/nrg1655.
Iwamoto K, Bundo M, Yamada K, Takao H, Iwayama-Shigeno Y, Yoshikawa T, Kato T. DNA Methylation status of SOX10 correlates with its Downregulation and Oligodendrocyte dysfunction in schizophrenia. J Neurosci. 2005;25(22):5376–81. https://doi.org/10.1523/JNEUROSCI.0766-05.2005.
Article CAS PubMed Google Scholar
Huang H-S, Akbarian S. GAD1 mRNA expression and DNA Methylation in prefrontal cortex of subjects with schizophrenia. PLoS One. 2007;2(8):1–6. https://doi.org/10.1371/journal.pone.0000809.
CAS Google Scholar
Pham TD, MacLennan NK, Chiu CT, Laksana GS, Hsu JL, Lane RH. Uteroplacental insufficiency increases apoptosis and alters p53 gene methylation in the full-term IUGR rat kidney. Am J Physiol Regul Integr Comp Physiol. 2003;285(5):962–70. https://doi.org/10.1152/ajpregu.00201.2003.
Article Google Scholar
Tobi EW, Goeman JJ, Monajemi R, Gu H, Putter H, Zhang Y, Slieker RC, Stok AP, Thijssen PE, Müller F, van Zwet EW, Bock C, Meissner A, Lumey LH, Eline Slagboom P, Heijmans BT. DNA methylation signatures link prenatal famine exposure to growth and metabolism. Nat Commun. 2014;5:5592. https://doi.org/10.1038/ncomms6592.
Article CAS PubMed PubMed Central Google Scholar
Zeilinger S, Kühnel B, Klopp N, Baurecht H, Kleinschmidt A, Gieger C, Weidinger S, Lattka E, Adamski J, Peters A, Strauch K, Waldenberger M, Illig T. Tobacco smoking leads to extensive genome-wide changes in DNA Methylation. PLoS One. 2013;8(5):1–14. https://doi.org/10.1371/journal.pone.0063812.
Article Google Scholar
Joehanes R, Just AC, Marioni RE, Pilling LC, Reynolds LM, Mandaviya PR, Guan W, Xu T, Elks CE, Aslibekyan S, Moreno-Macias H, Smith JA, Brody JA, Dhingra R, Yousefi P, Pankow JS, Kunze S, Shah SH, McRae AF, Lohman K, Sha J, Absher DM, Ferrucci L, Zhao W, Demerath EW, Bressler J, Grove ML, Huan T, Liu C, Mendelson MM, Yao C, Kiel DP, Peters A, Wang-Sattler R, Visscher PM, Wray NR, Starr JM, Ding J, Rodriguez CJ, Wareham NJ, Irvin MR, Zhi D, Barrdahl M, Vineis P, Ambatipudi S, Uitterlinden AG, Hofman A, Schwartz J, Colicino E, Hou L, Vokonas PS, Hernandez DG, Singleton AB, Bandinelli S, Turner ST, Ware EB, Smith AK, Klengel T, Binder EB, Psaty BM, Taylor KD, Gharib SA, Swenson BR, Liang L, DeMeo DL, O’Connor GT, Herceg Z, Ressler KJ, Conneely KN, Sotoodehnia N, Kardia SLR, Melzer D, Baccarelli AA, van Meurs JBJ, Romieu I, Arnett DK, Ong KK, Liu Y, Waldenberger M, Deary IJ, Fornage M, Levy D, London SJ. Epigenetic signatures of cigarette SmokingCLINICAL PERSPECTIVE. Circ Cardiovasc Genet. 2016;9(5):436–47. https://doi.org/10.1161/CIRCGENETICS.116.001506.
Article CAS PubMed PubMed Central Google Scholar
Meissner A, Gnirke A, Bell GW, Ramsahoye B, Lander ES, Jaenisch R. Reduced representation bisulfite sequencing for comparative high-resolution DNA methylation analysis. Nucleic Acids Res. 2005;33(18):5868–77. https://doi.org/10.1093/nar/gki901.
Article CAS PubMed PubMed Central Google Scholar
Novak P, Stampfer MR, Munoz-Rodriguez JL, Garbe JC, Ehrich M, Futscher BW, Jensen TJ. Cell-type specific DNA Methylation patterns define human breast cellular identity. PLoS One. 2012;7(12):1–9. https://doi.org/10.1371/journal.pone.0052299.
Article Google Scholar
Gervin K, Page CM, Aass HCD, Jansen MA, Fjeldstad HE, Andreassen BK, Duijts L, van Meurs JB, van Zelm MC, Jaddoe VW, Nordeng H, Knudsen GP, Magnus P, Nystad W, Staff AC, Felix JF, Lyle R. Cell type specific DNA methylation in cord blood: a 450K-reference data set and cell count-based validation of estimated cell type composition. Epigenetics. 2016;11(9):690–8. https://doi.org/10.1080/15592294.2016.1214782.
Article PubMed PubMed Central Google Scholar
Kawakatsu T, Stuart T, Valdes M, Breakfield N, Schmitz RJ, Nery JR, Urich MA, Han X, Lister R, Benfey PN, Ecker JR. Unique cell-type-specific patterns of DNA methylation in the root meristem. Nature Plants. 2016;2 https://doi.org/10.1038/nplants.2016.58.
Hansen KD, Langmead B, Irizarry RA. BSmooth: from whole genome bisulfite sequencing reads to differentially methylated regions. Genome Biol. 2012;13(10):83. https://doi.org/10.1186/gb-2012-13-10-r83.
Article Google Scholar
Welch BL. The generalization of Student’s problem when several different population variances are involved. Biometrika. 1947;34(1–2):28–35. https://doi.org/10.1093/biomet/34.1-2.28.
CAS PubMed Google Scholar
Akalin A, Kormaksson M, Li S, Garrett-Bakelman FE, Figueroa ME, Melnick A, Mason CE. methylKit: a comprehensive R package for the analysis of genome-wide DNA methylation profiles. Genome Biol. 13:87. https://doi.org/10.1186/gb-2012-13-10-r87.
Park Y, Figueroa ME, Rozek LS, Sartor MA. MethylSig: a whole genome DNA methylation analysis pipeline. Bioinformatics. 2014;30(17):2414–22. https://doi.org/10.1093/bioinformatics/btu339.
Article CAS PubMed PubMed Central Google Scholar
Jühling F, Kretzmer H, Bernhart SH, Otto C, Stadler PF, Hoffmann S. Metilene: fast and sensitive calling of differentially methylated regions from bisulfite sequencing data. Genome Res. 2015; https://doi.org/10.1101/gr.196394.115.
Dolzhenko E, Smith AD. Using beta-binomial regression for high-precision differential methylation analysis in multifactor whole-genome bisulfite sequencing experiments. BMC Bioinformatics. 2014;15(1):215. https://doi.org/10.1186/1471-2105-15-215.
Article PubMed PubMed Central Google Scholar
Assenov Y, Mueller F, Lutsik P, Walter J, Lengauer T, Bock C. Compehensive analysis of DNA Methylation data with RnBeads. Nat Methods. 2014;11(11):1138–40. https://doi.org/10.1038/nmeth.3115.
Article CAS PubMed PubMed Central Google Scholar
Du P, Zhang X, Huang C-C, Jafari N, Kibbe WA, Hou L, Lin SM. Comparison of Beta-value and M-value methods for quantifying methylation levels by microarray analysis. BMC Bioinform. 2010;11(1):1–9. https://doi.org/10.1186/1471-2105-11-587.
Article Google Scholar
Hebestreit K, Dugas M, Klein H-U. Detection of significantly differentially methylated regions in targeted bisulfite sequencing data. Bioinformatics. 2013;29(13):1647–53. https://doi.org/10.1093/bioinformatics/btt263.
Article CAS PubMed Google Scholar
Team, R.C.: R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria (2015). R Foundation for Statistical Computing. https://www.R-project.org
Tran PV, Kennedy BC, Pisansky MT, Won K-J, Gewirtz JC, Simmons RA, Georgieff MK. Prenatal choline supplementation diminishes early-life iron deficiency induced reprogramming of molecular networks associated with behavioral abnormalities in the adult rat hippocampus. J Nutr. 2016;146(3):484–93. https://doi.org/10.3945/jn.115.227561.
Article CAS PubMed PubMed Central Google Scholar
Sheaffer KL, Kim R, Aoki R, Elliott EN, Schug J, Burger L, Schbeler D, Kaestner KH. DNA methylation is required for the control of stem cell differentiation in the small intestine. Genes Dev. 2014;28(6):652–64. https://doi.org/10.1101/gad.230318.113.
Article CAS PubMed PubMed Central Google Scholar
Chen P-Y, Cokus SJ, Pellegrini M. BS seeker: precise mapping for bisulfite sequencing. BMC Bioinformatics. 2010;11(1):203. https://doi.org/10.1186/1471-2105-11-203.
Article CAS PubMed PubMed Central Google Scholar
Marabita F, Almgren M, Lindholm ME, Ruhrmann S, Fagerström-Billai F, Jagodic M, Sundberg CJ, Ekström TJ, Teschendorff AE, Tegner J, Gomez-Cabrero D. An evaluation of analysis pipelines for DNA methylation profiling using the Illumina HumanMethylation450 BeadChip platform. Epigenetics. 2013;8(3):333–46. https://doi.org/10.4161/epi.24008.
Article CAS PubMed PubMed Central Google Scholar
Fisher RA. On the interpretation of χ2 from contingency tables, and the calculation of P. J R Stat Soc. 1922;85(1):87–94. https://doi.org/10.2307/2340521.
Article Google Scholar
Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Statist Soc B. 1995;57(1):289–300.
Google Scholar
Saito Y, Tsuji J, Mituyama T. Bisulfighter: accurate detection of methylated Cytosines and differentially methylated regions. Nucleic Acids Res. 2014;42(6):45. https://doi.org/10.1093/nar/gkt1373.
Article Google Scholar
Feng H, Conneely KN, Wu H. A Bayesian hierarchical model to detect differentially methylated loci from single nucleotide resolution sequencing data. Nucleic Acids Res. 2014;42(8):69. https://doi.org/10.1093/nar/gku154.
Article Google Scholar
Jorgenson LA, Wobken JD, Georgieff MK. Perinatal iron deficiency alters apical Dendritic growth in Hippocampal CA1 pyramidal neurons. Dev Neurosci. 2003;25 https://doi.org/10.1159/000075667.
Jorgenson LA, Sun M, O’Connor M, Georgieff MK. Fetal iron deficiency disrupts the maturation of synaptic function and efficacy in area ca1 of the developing rat hippocampus. Hippocampus. 2005;15(8):1094–102. https://doi.org/10.1002/hipo.20128.
Article CAS PubMed Google Scholar
Carlson ES, Tkac I, Magid R, O’Connor MB, Andrews NC, Schallert T, Gunshin H, Georgieff MK, Petryk A. Iron is essential for neuron development and memory function in mouse hippocampus. J Nutr. 2009;139(4):672–9. https://doi.org/10.3945/jn.108.096354.
Article CAS PubMed PubMed Central Google Scholar
Rao R, Georgieff MK. Iron in fetal and neonatal nutrition. Semin Fetal Neonat M. 2007;12(1):54–63. https://doi.org/10.1016/j.siny.2006.10.007.
Article Google Scholar
McLean E, Cogswell M, Egli I, Wojdyla D, de Benoist B. Worldwide prevalence of anaemia, WHO vitamin and mineral nutrition information system, 1993-2005. Public Health Nutr. 2009;12(4):444–54. https://doi.org/10.1017/S1368980008002401.
Article PubMed Google Scholar
Lozoff B. Iron deficiency and child development. Food Nutr Bull. 2007;28(4):suppl4), 560–71. https://doi.org/10.1177/15648265070284S409.
Google Scholar
Lukowski AF, Koss M, Burden MJ, Jonides J, Nelson CA, Kaciroti N, Jimenez E, Lozoff B. Iron deficiency in infancy and neurocognitive functioning at 19 years: evidence of long-term deficits in executive function and recognition memory. Nutr Neurosci. 2010;13(2):54–70. https://doi.org/10.1179/147683010X12611460763689.
Article CAS PubMed PubMed Central Google Scholar
Tran PV, Kennedy BC, Lien Y-C, Simmons RA, Georgieff MK. Fetal iron deficiency induces chromatin remodeling at the Bdnf locus in adult rat hippocampus. Am J Physiol Regul Integr Comp Physiol. 2015;308(4):276–82. https://doi.org/10.1152/ajpregu.00429.2014.
Article Google Scholar
Simpson EH. The interpretation of interaction in contingency tables. J R Stat Soc Series B. 1951;13:238–41.
Google Scholar
Brunette KE, Tran PV, Wobken JD, Carlson ES, Georgieff MK. Gestational and neonatal iron deficiency alters apical dendrite structure of CA1 pyramidal neurons in adult rat hippocampus. Dev Neurosci. 2010;32 https://doi.org/10.1159/000314341.
Chen EY, Tan CM, Kou Y, Duan Q, Wang Z, Meirelles GV, Clark NR, Ma’ayan A. Enrichr: interactive and collaborative HTML5 gene list enrichment analysis tool. BMC Bioinform. 2013;14(1):128. https://doi.org/10.1186/1471-2105-14-128.
Article Google Scholar
Kuleshov MV, Jones MR, Rouillard AD, Fernandez NF, Duan Q, Wang Z, Koplev S, Jenkins SL, Jagodnik KM, Lachmann A, McDermott MG, Monteiro CD, Gundersen GW, Ma’ayan A. Enrichr: a comprehensive gene set enrichment analysis web server 2016 update. Nucleic Acids Res. 2016;44(W1):90. https://doi.org/10.1093/nar/gkw377.
Article Google Scholar
Blake JA, Eppig JT, Kadin JA, Richardson JE, Smith CL, Bult CJ. Mouse genome database (MGD)-2017: community knowledge resource for the laboratory mouse. Nucleic Acids Res. 2017;45(D1):723. https://doi.org/10.1093/nar/gkw1040.
Article Google Scholar
Boettner B, Aelst LV. The role of rho GTPases in disease development. Gene. 2002;286(2):155–74. https://doi.org/10.1016/S0378-1119(02)00426-2.
Article CAS PubMed Google Scholar
Auer M, Hausott B, Klimaschewski L. Rho GTPases as regulators of morphological neuroplasticity. Annals of Anatomy - Anatomischer Anzeiger. 2011;193(4):259–66. https://doi.org/10.1016/j.aanat.2011.02.015.
Article CAS PubMed Google Scholar
Luo L. RHO GTPASES in neuronal morphogenesis. Nat Rev Neurosci. 2000;1:173–80. https://doi.org/10.1038/35044547.
Article CAS PubMed Google Scholar
Nakayama AY, Harms MB, Luo L. Small GTPases Rac and rho in the maintenance of Dendritic spines and branches in Hippocampal pyramidal neurons. J Neurosci. 2000;20(14):5329–38.
CAS PubMed Google Scholar
Tolias KF, Duman JG, Um K. Control of synapse development and plasticity by rho GTPase regulatory proteins. Prog Neurobiol. 2011;94(2):133–48. https://doi.org/10.1016/j.pneurobio.2011.04.011.
Article CAS PubMed PubMed Central Google Scholar
Krueger F, Andrews SR. Bismark: a flexible aligner and methylation caller for Bisulfite-Seq applications. Bioinformatics. 2011; https://doi.org/10.1093/bioinformatics/btr167.
Liu Y, Siegmund KD, Laird PW, Berman BP. Bis-SNP: combined DNA methylation and SNP calling for Bisulfite-seq data. Genome Biol. 2012;13(7):1–14. https://doi.org/10.1186/gb-2012-13-7-r61.
Article CAS Google Scholar
Ziller MJ, Müller F, Liao J, Zhang Y, Gu H, Bock C, Boyle P, Epstein CB, Bernstein BE, Lengauer T, Gnirke A, Meissner A. Genomic distribution and inter-sample variation of non-CpG Methylation across human cell types. PLoS Genet. 2011;7(12):1–15. https://doi.org/10.1371/journal.pgen.1002389.
Article Google Scholar
Khulan B, Thompson RF, Ye K, Fazzari MJ, Suzuki M, Stasiek E, Figueroa ME, Glass JL, Chen Q, Montagna C, Hatchwell E, Selzer RR, Richmond TA, Green RD, Melnick A, Greally JM. Comparative isoschizomer profiling of cytosine methylation: the HELP assay. Genome Res. 2006;16(8):1046–55. https://doi.org/10.1101/gr.5273806.
Article CAS PubMed PubMed Central Google Scholar
Ito S, Shen L, Dai Q, Wu SC, Collins LB, Swenberg JA, He C, Zhang Y. Tet proteins can convert 5-Methylcytosine to 5-Formylcytosine and 5-Carboxylcytosine. Science. 2011;333(6047):1300–3. https://doi.org/10.1126/science.1210597.
Article CAS PubMed PubMed Central Google Scholar
Crowder, M.J.: Beta-binomial Anova for proportions. Journal of the Royal Statistical Society. Series C (Applied Statistics) 27(1), 34–37 (1978). doi:https://doi.org/10.2307/2346223.
Sun D, Xi Y, Rodriguez B, Park HJ, Tong P, Meong M, Goodell MA, Li W. MOABS: model based analysis of bisulfite sequencing data. Genome Biol. 2014;15(2):38. https://doi.org/10.1186/gb-2014-15-2-r38.
Article Google Scholar

Download references

Acknowledgements

Not applicable.

Funding

We thank the Functional Genomics Core of the Penn Diabetes Research Center (P30- DK19525). This work is supported by NIH grant (R01 DK106027 to KJW and R01 HD029421 to MG).

Availability and requirements

Input files for all programs compared with benchmark artificial data, Defiant source code, and installation instructions are available on http://github.com/hhg7/defiant. Data is publicly available on the Gene Expression Omnibus under GSE98064.

Author information

Authors and Affiliations

Department of Genetics, The Institute for Diabetes, Obesity, and Metabolism, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
David E. Condon, Jonathan Schug & Kyoung-Jae Won
Department of Pediatrics, University of Minnesota, 2450 Riverside Avenue, Minneapolis, MN, 55454, USA
Phu V. Tran & Michael K. Georgieff
Center for Research on Reproduction and Women’s Health, University of Pennsylvania, 421 Curie Blvd, Philadelphia, PA, 19104, USA
Yu-Chin Lien & Rebecca A. Simmons
Biotech Research and Innovation Centre (BRIC), University of Copenhagen, 2200, Copenhagen, Denmark
Kyoung-Jae Won

Authors

David E. Condon
View author publications
You can also search for this author in PubMed Google Scholar
Phu V. Tran
View author publications
You can also search for this author in PubMed Google Scholar
Yu-Chin Lien
View author publications
You can also search for this author in PubMed Google Scholar
Jonathan Schug
View author publications
You can also search for this author in PubMed Google Scholar
Michael K. Georgieff
View author publications
You can also search for this author in PubMed Google Scholar
Rebecca A. Simmons
View author publications
You can also search for this author in PubMed Google Scholar
Kyoung-Jae Won
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

DC developed the Defiant method, wrote the source code, and compared performance. PT and YL performed bisulfite sequencing for the rat hippocampus. RS and KW conceived the project. YL, DC, RS and KW designed the experiment. JS prepared the preprocessing for the DNA methylation data and helped implementation of various DMR callers. DC, PT, RS, JS, YL, and KW wrote the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Kyoung-Jae Won.

Ethics declarations

Ethics approval and consent to participate

All experimental procedures involving the use of live animals were approved by the Institutional Animal Care and Use Committee review boards of the University of Minnesota, the Children’s Hospital of Philadelphia, and the University of Pennsylvania.

Consent for publication

Not applicable

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional file

Additional file 1:

Supplementary document. (PDF 475 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Condon, D.E., Tran, P.V., Lien, YC. et al. Defiant: (DMRs: easy, fast, identification and ANnoTation) identifies differentially Methylated regions from iron-deficient rat hippocampus. BMC Bioinformatics 19, 31 (2018). https://doi.org/10.1186/s12859-018-2037-1

Download citation

Received: 24 June 2017
Accepted: 23 January 2018
Published: 05 February 2018
DOI: https://doi.org/10.1186/s12859-018-2037-1

Defiant: (DMRs: easy, fast, identification and ANnoTation) identifies differentially Methylated regions from iron-deficient rat hippocampus

Abstract

Background

Results

Conclusions

Similar content being viewed by others

MethGo: a comprehensive tool for analyzing whole-genome bisulfite sequencing data

HEPeak: an HMM-based exome peak-finding package for RNA epigenome sequencing data

Differential Methylation Analysis for Bisulfite Sequencing (BS-Seq) Data

Background

Implementation

Animals and Hippocampal dissection

WGBS & data processing

Identification of DMRs using WWE

Comparison with other methods

Results

DMR identification by weighted Welch expansion

Datasets

Artificial benchmarking datasets used for Metilene [27]

WGBS data from postnatal day 15 iron deficient and iron sufficient rat hippocampi

Performance evaluation

Comparison using real WGBS data

Defiant identifies DMRs close to genes potentially affected by iron-deficiency in fetal-neonatal periods

Defiant showed best performance based on the running time

Conclusions

Abbreviations

References

Acknowledgements

Funding

Availability and requirements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Publisher’s Note

Additional file

Additional file 1:

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation