Journal of Mathematical Biology

, Volume 52, Issue 3, pp 307–342 | Cite as

A fast algorithm for the construction of universal footprinting templates in DNA

  • James W. AndersonEmail author
  • Keith R. Fox
  • Graham A. Niblo


We introduce and give a complete description of a new graph to be used for DNA sequencing questions. This graph has the advantage over the classical de Bruijn graph that it fully accounts for the double stranded nature of DNA, rather than dealing with single strands. Technically, our graph may be thought of as the quotient of the de Bruijn graph under the natural involution of sending a DNA strand to its complementary strand. However, this involution has fixed points, and this complicates the structure of the quotient graph which we have therefore modified herein.

As an application and motivating example, we give an efficient algorithm for constructing universal footprinting templates for n-mers. This problem may be formulated as the task of finding a shortest possible segment of DNA which contains every possible sequence of base pairs of some fixed length n. Previous work by Kwan et al has attacked this problem from a numerical point of view and generated minimal length universal footprinting templates for n=2, 3, 5, 7, together with unsubstantiated candidates for the case n=4. We show that their candidates for n=4 are indeed minimal length universal footprinting templates.

Keywords or phrases

DNA sequencing Universal footprinting template de Bruijn graph Eulerian graphs 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Bollobas, B.: Graph Theory: an introductory course. Graduate Texts Math. 63, Springer-Verlag, New York, 1979Google Scholar
  2. 2.
    Fox, K.R., Waring, M.J.: High Resolution footprinting studies of drug-DNA complexes using chemical and enzymic probes. Meth. Enzymol. 340, 412–430 (2001)CrossRefGoogle Scholar
  3. 3.
    Galas, D.J., Schmitz, A.: DNAase footprinting – Simple method for detection of protein – DNA binding specificity. Nucleic Acids Res. 5, 3157–3170 (1978)Google Scholar
  4. 4.
    Guille, M.J., Kneale, G.: Methods for the analysis of DNA-protein interactions. Molecular Biotechnology 8, 35–52 (1997)Google Scholar
  5. 5.
    Kwan, A.H.Y., Czolij, R., Mackay, J.P., Crossley, M.: Pentaprobe: a comprehensive sequence for the one-step detection of DNA-binding activities. Nucleic Acids Res. 31, e124 (2003)Google Scholar
  6. 6.
    Lavesa, M., Fox, K.R.: Preferred binding sites for [N-MeCys3,N-MeCys7]TANDEM determined using a universal footprinting substrate. Analytical Biochemistry 293, 246–250 (2001)CrossRefGoogle Scholar
  7. 7.
    Pevzner, P., Tang, H., Waterman, M.S.: An Eulerian path approach to DNA fragment assembly. Proc. Nat. Acad. Sci. U.S.A. 98, 9748–9753 (2001)CrossRefzbMATHMathSciNetGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2005

Authors and Affiliations

  • James W. Anderson
    • 1
    Email author
  • Keith R. Fox
    • 2
  • Graham A. Niblo
    • 1
  1. 1.School of MathematicsUniversity of SouthamptonSouthamptonUnited Kingdom
  2. 2.School of Biological SciencesUniversity of SouthamptonSouthamptonUnited Kingtom

Personalised recommendations