On the Use of Binary Trees for DNA Hydroxymethylation Analysis
- 1.8k Downloads
DNA methylation (mC) and hydroxymethylation (hmC) can have a significant effect on normal human development, health and disease status. Hydroxymethylation studies require specific treatment of DNA, as well as software tools for their analysis. In this paper, we propose a parallel software tool for analyzing the DNA hydroxymethylation data obtained by TAB-seq. The software is based on the use of binary trees for searching the different occurrences of methylation and hydroxymethylation in DNA samples. The binary trees allow to efficiently store and access the information about the methylation of each methylated/hydroxymethylated cytosines in the samples. Evaluation results shows that the performance of the application is only limited by the computer input/output bandwidth, even for the case of very long samples.
KeywordsHigh performance computing DNA hydroxymethylation Parallel pipeline
- 12.Xu, Z., Taylor, J.A., Leung, Y.K., Ho, S.M., Niu, L.: oxBS-MLE: an efficient method to estimate 5-methylcytosine and 5-hydroxymethylcytosine in paired bisulfite and oxidative bisulfite treated dna. Bioinformatics 32(23), 3667–3669 (2016)Google Scholar