On the Use of Binary Trees for DNA Hydroxymethylation Analysis

  • César González
  • Mariano Pérez
  • Juan M. OrduñaEmail author
  • Javier Chaves
  • Ana-Bárbara García
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10393)


DNA methylation (mC) and hydroxymethylation (hmC) can have a significant effect on normal human development, health and disease status. Hydroxymethylation studies require specific treatment of DNA, as well as software tools for their analysis. In this paper, we propose a parallel software tool for analyzing the DNA hydroxymethylation data obtained by TAB-seq. The software is based on the use of binary trees for searching the different occurrences of methylation and hydroxymethylation in DNA samples. The binary trees allow to efficiently store and access the information about the methylation of each methylated/hydroxymethylated cytosines in the samples. Evaluation results shows that the performance of the application is only limited by the computer input/output bandwidth, even for the case of very long samples.


High performance computing DNA hydroxymethylation Parallel pipeline 


  1. 1.
    Drong, A.W., Lindgren, C.M., McCarthy, M.I.: The genetic and epigenetic basis of type 2 diabetes and obesity. Clin. Pharmacol. Ther. 92(6), 707–715 (2012)CrossRefGoogle Scholar
  2. 2.
    Haumaitre, C.: Epigenetic regulation of pancreatic islets. Curr. Diabetes Rep. 13(5), 624–632 (2013)CrossRefGoogle Scholar
  3. 3.
    Krueger, F., Andrews, S.R.: Bismark: a flexible aligner and methylation caller for Bisulfite-Seq applications. Bioinformatics 27(11), 1571–1572 (2011)CrossRefGoogle Scholar
  4. 4.
    Laird, P.W.: Principles and challenges of genome-wide dna methylation analysis. Nat. Rev. Genet. 11, 191–203 (2010)CrossRefGoogle Scholar
  5. 5.
    de Mello, V., Pulkkinen, L., Lalli, M., Kolehmainen, M., Pihlajamâmki, J., Uusitupa, M.: DNA methylation in obesity and type 2 diabetes. Ann. Med. 46(3), 103–13 (2014)CrossRefGoogle Scholar
  6. 6.
    Olanda, R., Pérez, M., Orduña, J.M., Tárraga, J., Dopazo, J.: A new parallel pipeline for DNA methylation analysis of long reads datasets. BMC Bioinform. 18(1), 161 (2017)CrossRefGoogle Scholar
  7. 7.
    Raciti, A., Nigro, C., Longo, M., Parrillo, L., Miele, C., Formisano, P., Bguino, F.: Personalized medicine and type 2 diabetes: lesson from epigenetics. Epigenomics 6(2), 229–238 (2014)CrossRefGoogle Scholar
  8. 8.
    Shen, L., Zhang, Y.: 5-hydroxymethylcytosine: generation, fate, and genomic distribution. Curr. Opin. Cell Biol. 25(3), 289–296 (2013)CrossRefGoogle Scholar
  9. 9.
    Tárraga, J., Pérez, M., Orduña, J.M., Duato, J., Medina, I., Dopazo, J.: A parallel and sensitive software tool for methylation analysis on multicore platforms. Bioinformatics 31(19), 3130 (2015)CrossRefGoogle Scholar
  10. 10.
    Wen, L., Li, X., Yan, L., Tan, Y., Li, R., Zhao, Y., Wang, Y., Xie, J., He, C., Li, R., Tang, F., Qiao, J.: Whole-genome analysis of 5-hydroxymethylcytosine and 5-methylcytosine at base resolution in the human brain. Genome Biol. 15(3), R49 (2014)CrossRefGoogle Scholar
  11. 11.
    Xi, Y., Bock, C., Muller, F., Sun, D., Meissner, A., Li, W.: RRBSMAP: a fast, accurate and user-friendly alignment tool for reduced representation bisulfite sequencing. Bioinformatics 28(3), 430–432 (2012)CrossRefGoogle Scholar
  12. 12.
    Xu, Z., Taylor, J.A., Leung, Y.K., Ho, S.M., Niu, L.: oxBS-MLE: an efficient method to estimate 5-methylcytosine and 5-hydroxymethylcytosine in paired bisulfite and oxidative bisulfite treated dna. Bioinformatics 32(23), 3667–3669 (2016)Google Scholar
  13. 13.
    Yu, M., Hon, G.C., Szulwach, K.E., Song, C.X., Jin, P., Ren, B., He, C.: TET-assisted bisulfite sequencing of 5-hydroxymethylcytosine. Nat. Protoc. 7(12), 2159–2170 (2012)CrossRefGoogle Scholar
  14. 14.
    Yu, M., Hon, G.C., Szulwach, K.E., Song, C.X., Zhang, L., Kim, A., Li, X., Dai, Q., Park, B., Min, J.H., Jin, P., Ren, B., He, C.: Base-resolution analysis of 5-hydroxymethylcytosine in the mammalian genome. Cell 149(6), 1368–1380 (2012)CrossRefGoogle Scholar

Copyright information

© Springer International Publishing AG 2017

Authors and Affiliations

  • César González
    • 1
  • Mariano Pérez
    • 1
  • Juan M. Orduña
    • 1
    Email author
  • Javier Chaves
    • 2
  • Ana-Bárbara García
    • 2
  1. 1.Depto. de InformáticaUniversidad de ValenciaBurjassot, ValenciaSpain
  2. 2.INCLIVA Health Research Institute, CIBERDEM (Carlos III Health Institute)ValenciaSpain

Personalised recommendations