Multiple Sequence Alignment Methods pp 105-116

Part of the Methods in Molecular Biology book series (MIMB, volume 1079)

Clustal Omega, Accurate Alignment of Very Large Numbers of Sequences

  • Fabian Sievers
  • Desmond G. Higgins

Abstract

Clustal Omega is a completely rewritten and revised version of the widely used Clustal series of programs for multiple sequence alignment. It can deal with very large numbers (many tens of thousands) of DNA/RNA or protein sequences due to its use of the mBED algorithm for calculating guide trees. This algorithm allows very large alignment problems to be tackled very quickly, even on personal computers. The accuracy of the program has been considerably improved over earlier Clustal programs, through the use of the HHalign method for aligning profile hidden Markov models. The program currently is used from the command line or can be run on line.

Key words

Multiple sequence alignment Progressive alignment Protein sequences Clustal 

References

  1. 1.
    Sievers F, Wilm A, Dineen D et al (2011) Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol Syst Biol 7:539. doi:10.1038/msb.2011.75 PubMedCrossRefGoogle Scholar
  2. 2.
    Higgins DG, Sharp PM (1988) CLUSTAL: a package for performing multiple sequence alignment on a microcomputer. Gene 73(1):237–244PubMedCrossRefGoogle Scholar
  3. 3.
    Larkin MA, Blackshields G, Brown NP et al (2007) Clustal W and Clustal X version 2.0. Bioinformatics 23:2947–2948PubMedCrossRefGoogle Scholar
  4. 4.
    Blackshields G, Sievers F, Shi W et al (2010) Sequence embedding for fast construction of guide trees for multiple sequence alignment. Algorithms Mol Biol 5:21PubMedCrossRefGoogle Scholar
  5. 5.
    Söding J (2005) Protein homology detection by HMM-HMM comparison. Bioinformatics 21:951–960PubMedCrossRefGoogle Scholar
  6. 6.
    Arthur D, Vassilvitskii S (2007) k-means++: the advantages of careful seeding. Proceedings of the eighteenth annual ACM-SIAM symposium on discrete algorithms, Philadelphia, PA, pp 1027–1035Google Scholar
  7. 7.
    Edgar RC (2004) MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res 32:1792–1797PubMedCrossRefGoogle Scholar
  8. 8.
    Finn RD, Clements J, Eddy SR (2011) HMMER web server: interactive sequence similarity searching. Nucleic Acids Res 39(Suppl 2):W29–W37PubMedCrossRefGoogle Scholar
  9. 9.
    Kimura M (1985) The neutral theory of molecular evolution. Cambridge University Press, CambridgeGoogle Scholar
  10. 10.
    Lassmann T, Sonnhammer ELL (2005) Kalign – an accurate and fast multiple sequence alignment algorithm. BMC Bioinformatics 6:298PubMedCrossRefGoogle Scholar

Copyright information

© Springer Science+Business Media, LLC 2014

Authors and Affiliations

  • Fabian Sievers
    • 1
  • Desmond G. Higgins
    • 1
  1. 1.Conway InstituteUniversity College DublinDublinIreland

Personalised recommendations