Alignment of Genomic Sequences Using DIALIGN

  • Burkhard Morgenstern
Part of the Methods in Molecular Biology™ book series (MIMB, volume 395)


DIALIGN is a software program for multiple alignment of DNA or protein sequences that combines global and local alignment features. In recent years, the program has been used extensively to compare syntenic regions in genomic sequences. An anchoring option speeds up the alignment procedure and lets users impose their own constraints to improve the quality of the program output. This chapter explains the features of DIALIGN that are useful for aligning genomic sequences. The program is available online through the Göttingen Bioinformatics Compute Server at

Key Words

Multiple sequence alignment; anchored alignment; DIALIGN; gene prediction; phylogenetic footprinting



Copyright information

© Humana Press Inc. 2007

Authors and Affiliations

  • Burkhard Morgenstern (1)
  1. Institute of Microbiology & Genetics, University of Göttingen, Germany
