Chromosome Research

, Volume 24, Issue 4, pp 481–494 | Cite as

Data analysis algorithm for DamID-seq profiling of chromatin proteins in Drosophila melanogaster

  • Daniil A. Maksimov
  • Petr P. Laktionov
  • Stepan N. BelyakinEmail author
Original Article


Analysis of gene expression regulation typically requires identification of genomic sites bound by regulatory proteins. For this purpose, chromatin immunoprecipitation (ChIP) and Dam identification (DamID) methods can be applied to cell lines, whole organisms, or enriched cell populations. In this work, we present modifications to the experimental DamID protocol, as well as a custom data processing algorithm, that allow to confidently identify genomic sites enriched with the proteins of interest. This algorithm is implemented in Perl and is also available as executable files, thereby making DamID analysis relatively straightforward. Finally, we demonstrate how this pipeline performs when fed with real experimental data.


DamID-seq transcription factors chromatin profiling high-throughput sequencing peak calling 



Chromatin immunoprecipitation


Dam identification


False-positive discovery rate


Flippase recognition target


Micrococcal nuclease




Next-generation sequencing


Open reading frame


Quantitative polymerase chain reaction


Upstream activating sequence



This work was supported by the Russian Foundation for Basic Research grant 16-04-01463 in a part concerning DSX profiles analysis and grant 13-04-40087 in a part concerning COMR DamID profiling in Drosophila testis. High-throughput sequencing was performed with support from the Federal Agency for Scientific Organizations of the Russian Federation (project 0310-2014-0002). The algorithm was developed for the project supported by Russian Scientific Foundation grant 14-14-00641. Perl scripts, compiled applications for Windows, and user manual are available from

Supplementary material (4 kb)
Supplementary material 1 Perl script for “GATC_mapper” (PL 3 kb) (17 kb)
Supplementary material 2 Perl script for “dynSD” (PL 17 kb) (13 kb)
Supplementary material 3 Perl script for “Fisher_auto-FDR” (PL 13 kb)
10577_2016_9538_MOESM4_ESM.txt (9.2 mb)
Supplementary material 4 List and coordinates of all GATC-fragments in the D. melanogaster genome, Release dm6. This list is used by default. (TXT 9401 kb)
10577_2016_9538_MOESM5_ESM.txt (8.2 mb)
Supplementary material 5 List and coordinates of all GATC-fragments in the D. melanogaster genome, Release dm3. This may be used for previous genome version (see DamID-seq manual). (TXT 8424 kb)
10577_2016_9538_MOESM6_ESM.pdf (114 kb)
Supplementary material 6 DamID-seq manual with step-by-step instruction for our pipeline. (PDF 114 kb)
10577_2016_9538_MOESM7_ESM.xlsx (14 kb)
Supplementary Table 1 Normalized read counts in Dam and Dam-DSX samples in males and females around Yp1 gene. The major peak is marked with bright yellow. (XLSX 14 kb)
10577_2016_9538_Fig7_ESM.gif (245 kb)
Supplementary Figure 1

Reproducibility-based filtering algorithm for biological replicas (“dynSD” package). Each GATC-fragment is represented by a dot, whose coordinates correspond to the read numbers bound in the first and second biological replicas. Local filtering is performed in groups of 200 fragments. GATC-fragments whose reads differ by more than 2 SD from the group average, are removed from downstream analyses (red) (Materials and Methods). (GIF 245 kb)

10577_2016_9538_MOESM8_ESM.tif (2.1 mb)
High Resolution Image (TIF 2123 kb)


  1. Banerjee S, Pedersen T (2003) The design, implementation, and use of the ngram statistics package. In: Gelbukh A (ed) computational linguistics and intelligent text processing: 4th International Conference, CICLing 2003 Mexico City, Mexico, February 16–22, 2003 Proceedings. Springer Berlin Heidelberg, Berlin, Heidelberg, pp 370–381. doi: 10.1007/3-540-36456-0_38
  2. Bonn S, Zinzen RP, Perez-Gonzalez A, Riddell A, Gavin AC, Furlong EE (2012) Cell type-specific chromatin immunoprecipitation from multicellular complex samples using BiTS-ChIP. Nat Protoc 7:978–994. doi: 10.1038/nprot.2012.049 CrossRefPubMedGoogle Scholar
  3. Camara N, Whitworth C, Van Doren M (2008) The creation of sexual dimorphism in the Drosophila soma. Curr Top Dev Biol 83:65–107. doi: 10.1016/S0070-2153(08)00403-1 CrossRefPubMedGoogle Scholar
  4. Clough E et al (2014) Sex- and tissue-specific functions of Drosophila doublesex transcription factor target genes. Dev Cell 31:761–773. doi: 10.1016/j.devcel.2014.11.021 CrossRefPubMedPubMedCentralGoogle Scholar
  5. de Jesus Domingues AM, Artegiani B, Dahl A, Calegari F (2016) Identification of Tox chromatin binding properties and downstream targets by DamID-Seq. Genom Data 7:264–268. doi: 10.1016/j.gdata.2016.02.003 CrossRefPubMedPubMedCentralGoogle Scholar
  6. Filion GJ et al (2010) Systematic protein location mapping reveals five principal chromatin types in Drosophila cells. Cell 143:212–224CrossRefPubMedPubMedCentralGoogle Scholar
  7. Gomez-Saldivar G, Meister P, Askjaer P (2016) DamID analysis of nuclear organization in Caenorhabditis elegans. Methods Mol Biol 1411:341–358. doi: 10.1007/978-1-4939-3530-7_22 CrossRefPubMedGoogle Scholar
  8. Greer EL et al (2015) DNA methylation on N6-adenine in C. elegans. Cell 161:868–878. doi: 10.1016/j.cell.2015.04.005 CrossRefPubMedPubMedCentralGoogle Scholar
  9. Greil F, Moorman C, van Steensel B (2006) DamID: mapping of in vivo protein-genome interactions using tethered DNA adenine methyltransferase. Methods Enzymol 410:342–359CrossRefPubMedGoogle Scholar
  10. Heyn H, Esteller M (2015) An adenine code for DNA: a second life for N6-methyladenine. Cell 161:710–713. doi: 10.1016/j.cell.2015.04.021 CrossRefPubMedGoogle Scholar
  11. Ishii H, Kadonaga JT, Ren B (2015) MPE-seq, a new method for the genome-wide analysis of chromatin structure. Proc Natl Acad Sci U S A 112:E3457–3465. doi: 10.1073/pnas.1424804112 CrossRefPubMedPubMedCentralGoogle Scholar
  12. Jin Y et al (2015) EGFR/Ras signaling controls Drosophila intestinal stem cell proliferation via Capicua-regulated genes. PLoS Genet 11, e1005634. doi: 10.1371/journal.pgen.1005634 CrossRefPubMedPubMedCentralGoogle Scholar
  13. Kasinathan S, Orsi GA, Zentner GE, Ahmad K, Henikoff S (2014) High-resolution mapping of transcription factor binding sites on native chromatin. Nat Methods 11:203–209. doi: 10.1038/nmeth.2766 CrossRefPubMedGoogle Scholar
  14. Kaufmann K, Muino JM, Osteras M, Farinelli L, Krajewski P, Angenent GC (2010) Chromatin immunoprecipitation (ChIP) of plant transcription factors followed by sequencing (ChIP-SEQ) or hybridization to whole genome arrays (ChIP-CHIP). Nat Protoc 5:457–472. doi: 10.1038/nprot.2009.244 CrossRefPubMedGoogle Scholar
  15. Laktionov PP, Maksimov DA, Andreeva EN, Shloma VV, Beliakin SN (2013) A genetic system for somatic and germinal lineage tracing in the Drosophila melanogaster gonads. Tsitologiia 55:185–189PubMedGoogle Scholar
  16. Laktionov PP, White-Cooper H, Maksimov DA, Belyakin SN (2014) Transcription factor Comr acts as a direct activator in the genetic program controlling spermatogenesis in D. melanogaster. Mol Biol 48:130–140. doi: 10.1134/S0026893314010087 CrossRefGoogle Scholar
  17. Lee WP, Stromberg MP, Ward A, Stewart C, Garrison EP, Marth GT (2014) MOSAIK: a hash-based algorithm for accurate next-generation sequencing short-read mapping. PLoS One 9:e90581 doi: 10.1371/journal.pone.0090581
  18. Lefrancois P, Zheng W, Snyder M (2010) ChIP-Seq using high-throughput DNA sequencing for genome-wide identification of transcription factor binding sites. Methods Enzymol 470:77–104. doi: 10.1016/S0076-6879(10)70004-5 CrossRefPubMedGoogle Scholar
  19. Li R, Hempel LU, Jiang T (2015) A non-parametric peak calling algorithm for DamID-Seq. PLoS ONE 10, e0117415. doi: 10.1371/journal.pone.0117415 CrossRefPubMedPubMedCentralGoogle Scholar
  20. Lukyanov KA, Launer GA, Tarabykin VS, Zaraisky AG, Lukyanov SA (1995) Inverted terminal repeats permit the average length of amplified DNA fragments to be regulated during preparation of cDNA libraries by polymerase chain reaction. Anal Biochem 229:198–202.Google Scholar
  21. Luo SD, Shi GW, Baker BS (2011) Direct targets of the D. melanogaster DSXF protein and the evolution of sexual development. Development 138:2761–2771CrossRefPubMedPubMedCentralGoogle Scholar
  22. Maksimov DA, Koryakov DE, Belyakin SN (2014) Developmental variation of the SUUR protein binding correlates with gene regulation and specific chromatin types in D. melanogaster. Chromosoma 123:253–264. doi: 10.1007/s00412-013-0445-6 CrossRefPubMedGoogle Scholar
  23. Marshall OJ, Brand AH (2015) damidseq_pipeline: an automated pipeline for processing DamID sequencing datasets. Bioinformatics 31:3371–3373. doi: 10.1093/bioinformatics/btv386 CrossRefPubMedPubMedCentralGoogle Scholar
  24. Martin M (2011) Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J 17:10–12. doi: 10.14806/ej.17.1.200
  25. Mundade R, Ozer HG, Wei H, Prabhu L, Lu T (2014) Role of ChIP-seq in the discovery of transcription factor binding sites, differential gene regulation mechanism, epigenetic marks and beyond. Cell Cycle 13:2847–2852. doi: 10.4161/15384101.2014.949201 CrossRefPubMedPubMedCentralGoogle Scholar
  26. Murray MJ, Southall TD, Liu W, Fraval H, Lorensuhewa N, Brand AH, Saint R (2012) Snail-dependent repression of the RhoGEF pebble is required for gastrulation consistency in Drosophila melanogaster. Dev Genes Evol 222:361–368. doi: 10.1007/s00427-012-0414-8 CrossRefPubMedPubMedCentralGoogle Scholar
  27. Orsi GA, Kasinathan S, Hughes KT, Saminadin-Peter S, Henikoff S, Ahmad K (2014) High-resolution mapping defines the cooperative architecture of Polycomb response elements. Genome Res 24:809–820. doi: 10.1101/gr.163642.113 CrossRefPubMedPubMedCentralGoogle Scholar
  28. Pindyurin AV, Pagie L, Kozhevnikova EN, van Arensbergen J, van Steensel B (2016) Inducible DamID systems for genomic mapping of chromatin proteins in Drosophila. Nucleic Acids Res. doi: 10.1093/nar/gkw176 PubMedPubMedCentralGoogle Scholar
  29. Shimbo T et al (2013) MBD3 localizes at promoters, gene bodies and enhancers of active genes. PLoS Genet 9, e1004028. doi: 10.1371/journal.pgen.1004028 CrossRefPubMedPubMedCentralGoogle Scholar
  30. Southall TD, Gold KS, Egger B, Davidson CM, Caygill EE, Marshall OJ, Brand AH (2013) Cell-type-specific profiling of gene expression and chromatin binding without cell isolation: assaying RNA Pol II occupancy in neural stem cells. Dev Cell 26:101–112. doi: 10.1016/j.devcel.2013.05.020 CrossRefPubMedPubMedCentralGoogle Scholar
  31. Valouev A et al (2008) Genome-wide analysis of transcription factor binding sites based on ChIP-Seq data. Nat Methods 5:829–834. doi: 10.1038/nmeth.1246 CrossRefPubMedPubMedCentralGoogle Scholar
  32. van Steensel B, Henikoff S (2000) Identification of in vivo DNA targets of chromatin proteins using tethered dam methyltransferase. Nat Biotechnol 18:424–428CrossRefPubMedGoogle Scholar
  33. Vogel MJ, Peric-Hupkes D, van Steensel B (2007) Detection of in vivo protein-DNA interactions using DamID in mammalian cells. Nat Protocols 2:1467–1478CrossRefPubMedGoogle Scholar
  34. Wilczynski B, Furlong EE (2010) Challenges for modeling global gene regulatory networks during development: insights from Drosophila. Dev Biol 340:161–169. doi: 10.1016/j.ydbio.2009.10.032 CrossRefPubMedGoogle Scholar
  35. Wu F, Yao J (2013) Spatial compartmentalization at the nuclear periphery characterized by genome-wide mapping. BMC Genomics 14:591. doi: 10.1186/1471-2164-14-591 CrossRefPubMedPubMedCentralGoogle Scholar
  36. Yang IV, Chen E, Hasseman JP, Liang W, Frank BC, Wang S, Sharov V, Saeed AI, White J, Li J, Lee NH, Yeatman TJ, Quackenbush J (2002) Within the fold: assessing differential expression measures and reproducibility in microarray assays. Genome Biol 3:research0062Google Scholar
  37. Zhang G et al (2015) N6-methyladenine DNA modification in Drosophila. Cell 161:893–906. doi: 10.1016/j.cell.2015.04.018 CrossRefPubMedGoogle Scholar

Copyright information

© Springer Science+Business Media Dordrecht 2016

Authors and Affiliations

  • Daniil A. Maksimov
    • 1
  • Petr P. Laktionov
    • 1
  • Stepan N. Belyakin
    • 1
    • 2
    Email author
  1. 1.Institute of Molecular and Cellular Biology SB RASNovosibirskRussia
  2. 2.Novosibirsk State UniversityNovosibirskRussia

Personalised recommendations