An Introduction to the Lagan Alignment Toolkit

Part of the book series: Methods in Molecular Biology™ (MIMB, volume 395)

Summary

The Lagan Toolkit is a software package for comparing genomic sequences. It includes the CHAOS local alignment program, the LAGAN global alignment program for two or more sequences, and Shuffle-LAGAN, a “glocal” alignment method that handles genomic rearrangements in a global alignment framework. The alignment programs in the Lagan Toolkit have been widely used to compare genomes of many organisms, from bacteria to large mammalian genomes. This chapter provides an overview of the algorithms the LAGAN programs use to construct genomic alignments, explains how to build alignments using either the standalone program or the web server, and discusses some common pitfalls users encounter with the toolkit.

Acknowledgments

Many people have contributed to the development of the LAGAN Toolkit. Michael F. Kim and Chuong B. Do were actively involved in the original development, including writing many of the utilities. Sanket Malde and Mukund Sundararajan developed the 1-monotonic chaining algorithm, and Serafim Batzoglou supervised the development of the software (and the degrees of the people writing it).

Our early users bore the brunt of the bugs in the package, with Kerrin Small and Gregory M. Cooper (working with Arend Sidow) deserving special recognition for being the first to use the programs on their own. Alexander Poliakov and Inna Dubchak were the first to use LAGAN in a large-scale pipeline and hence also helped identify several problems with the software. This manuscript is partially based on the original LAGAN and CHAOS papers as well as on a book chapter appearing in the Handbook of Computational Biology (S. Aluru, ed.). During the writing, the author's research was funded by an NSF Graduate Fellowship Award and an NSERC Discovery Grant.

Finally, I would like to thank the many users of LAGAN (both standalone and through the website) who have made this a popular alignment package while also keeping the developers apprised of problems and being patient as those problems were fixed.

Copyright information

© 2007 Humana Press Inc.

About this protocol

Cite this protocol

Brudno, M. (2007). An Introduction to the Lagan Alignment Toolkit. In: Bergman, N.H. (ed.) Comparative Genomics. Methods in Molecular Biology™, vol 395. Humana Press. https://doi.org/10.1007/978-1-59745-514-5_13

  • DOI: https://doi.org/10.1007/978-1-59745-514-5_13

  • Publisher Name: Humana Press

  • Print ISBN: 978-1-58829-693-1

  • Online ISBN: 978-1-59745-514-5

  • eBook Packages: Springer Protocols
