Summary
The Lagan Toolkit is a software package for comparison of genomic sequences. It includes the CHAOS local alignment program, LAGAN global alignment program for two, or more sequences and Shuffle-LAGAN, a “glocal” alignment method that handles genomic rearrangements in a global alignment framework. The alignment programs included in the Lagan Toolkit have been widely used to compare genomes of many organisms, from bacteria to large mammalian genomes. This chapter provides an overview of the algorithms used by the LAGAN programs to construct genomic alignments, explains how to build alignments using either the standalone program or the web server, and discusses some of the common pitfalls users encounter when using the toolkit.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Brudno, M., Do, C. B., Cooper, G. M, et al., and NISC Comparative Sequencing Program. (2003) LAGAN and Multi-LAGAN: efficient tools for large-scale multiple alignment of genomic DNA. Genome Res. 13, 721–731.
Schwartz, S., Elnitski, L., Li, M., et al., NISC Comparative Sequencing Program. (2003) MultiPipMaker and supporting tools: alignments and analysis of multiple genomic DNA sequences. Nucleic Acids Res. 31, 3518–3524.
Darling, A. C., Mau, B., Blattner, F. R., and Perna, N. T. (2004) Mauve: multiple alignment of Res. 14, 1394–1403.
Bray, N. and Pachter, L. (2004) MAVID: constrained ancestral alignment of multiple sequences. Genome Res. 14, 693–699.
Blanchette M, Kent WJ, Riemer C, et al. (2004) Aligning multiple genomic sequences with the threaded blockset aligner. Genome Res. 14, 708–715.
Batzoglou, S., Pachter, L., Mesirov, J. P., Berger, B., and Lander, E. S. (2000) Human and mouse gene structure: comparative analysis and application to exon prediction. Genome Res. 10, 950–958.
Bray, N., Dubchak, I., and Pachter, L. (2003) AVID: A global alignment program. Genome Res. 13, 97–102.
Schwartz, S., Kent, W. J., Smit, A., et al. (2003) Human-mouse alignments with BLASTZ. Genome Res. 13, 103–107.
Thompson, J. D., Higgins, D. G., and Gibson, T. J. (1994) CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22, 4673–4680.
Morgenstern, B., Frech, K., Dress, A., and Werner, T. (1998) DIALIGN: finding local similarities by multiple sequence alignment. Bioinformatics 14, 290–294.
Altschul, S. F., Madden, T. L., Schaffer, A. A., et al. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25, 3389–3402.
Bergman, C. M. and Kreitman, M. (2001) Analysis of conserved noncoding DNA in Drosophila reveals similar constraints in intergenic and intronic sequences. Genome Res. 11, 1335–1345.
Brudno, M., Malde, S., Poliakov, A., et al. (2003) Glocal alignment: finding rearrangements during alignment. Bioinformatic 19, 54i–62i.
Brudno, M., Chapman, M., Gottgens, B., Batzoglou, S., and Morgenstern, B. (2003) Fast and sensitive multiple alignment of large genomic sequences. BMC Bioinformatics 4, 66.
Brudno, M., Poliakov, A., Salamov, A., et al. (2004) Automated whole-genome multiple alignment of rat, mouse, and human. Genome Res. 14, 685–692.
Kalafus, K. J., Jackson, A. R., Milosavljevic, A. (2004) Pash: efficient genome-scale sequence anchoring by positional hashing. Genome Res. 14, 672–678.
Hubbard T, Andrews D, Caccamo M, et al. (2005) Ensembl 2005. Nucleic Acids Res. 33, D447–D453.
Mayor, C., Brudno, M., Schwartz, J. R., et al. (2000) VISTA: visualizing global DNA sequence alignments of arbitrary length. Bioinformatics 16, 1046–1047.
Needleman, S. B. and Wunsch, C. D. (1970) An efficient method applicable to the search for similarities in the amino acid sequences of two proteins. J. Mol. Biol. 48, 444–453.
Delcher, A. L., Kasif, S., Fleischman, R., Peterson, J., White, O., and Salzberg, S. L. (1999) Alignment of whole genomes. Nucleic Acids Res. 27, 2369–2376.
Delcher, A. L., Phillippy, A., Carlton, J., and Salzberg, S. L. (2002) Fast algorithms for large-scale genome alignment and comparison. Nucleic Acids Res. 30, 2478–2483.
Kent, W, J. and Zahler, A. M. (2000) Conservation, regulation, synteny, and introns in a large-scale C. briggsae-C. elegans genomic alignment. Genome Res. 10, 1115–1125.
Shah, N., Couronne, O., Pennacchio, L. A., et al. (2004) PhyloVISTA: an interactive visualization tool for multiple DNASequence alignments.Bioinformatics 20, 636–643.
Montgomery, S. B., Astakhova, T., Bilenky, M., et al. (2004) Sockeye: a 3D environment for comparative genomics. Genome Res. 14, 956–962.
Couronne, O., Poliakov, A., Bray, N., et al. (2002) Strategies and tools for whole genome alignments. Genome Res. 13, 73–80.
Kent, J. (2002) BLAT: the BLAST-like alignment tool. Genome Res. 12, 656–664.
Acknowledgments
Many people have contributed to the development of the LAGAN Toolkit during its development. Michael F. Kim and Chuong B. Do were actively involved in the original development, including writing many of the utilities. Sanket Malde and Mukund Sundararajan developed the 1-monotonic chaining algorithm, and Serafim Batzoglou supervised the development of the software (and the degrees of the people writing it).
Our early users bore the brunt of the bugs in the package, with Kerrin Small and Gregory M. Cooper (working with Arend Sidow) deserving special recognition for being the first to use the programs on their own. Alexander Poliakov and Inna Dubchak were the first to use LAGAN in a large scale pipeline and hence also helped identify several problems with the software. This manuscript is partially based on the original Lagan and CHAOS papers as well as on a book chapter appearing in the Handbook of Computational Biology (S. Aluru, ed). The author's research was funded by the NSF Graduate Fellowship Award and the NSERC Discovery Grant during the writing.
Finally, I would like to thank the many users of LAGAN (both standalone and through the website) who have made this into a popular alignment package while also keeping the developers appraised of the problems and being patient as the problems were fixed.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2007 Humana Press Inc.
About this protocol
Cite this protocol
Brudno, M. (2007). An Introduction to the Lagan Alignment Toolkit. In: Bergman, N.H. (eds) Comparative Genomics. Methods in Molecular Biology™, vol 395. Humana Press. https://doi.org/10.1007/978-1-59745-514-5_13
Download citation
DOI: https://doi.org/10.1007/978-1-59745-514-5_13
Publisher Name: Humana Press
Print ISBN: 978-1-58829-693-1
Online ISBN: 978-1-59745-514-5
eBook Packages: Springer Protocols