Algorithms for Multiple Protein Structure Alignment and Structure-Derived Multiple Sequence Alignment
Primary amino acid content and the geometry of the folded protein 3D structure are major parameters of protein function. During the course of evolution the protein 3D structure is more preserved than its primary sequence. Thus, analysis of protein structures is expected to lead to a deep insight into protein function. Recognition of a structural core common to a set of protein structures serves as a basic tool for the studies of protein evolution and classification, analysis of similar structural motifs and functional binding sites, and for homology modeling and threading.
In this chapter, we discuss several biologically related computational aspects of the multiple structure alignment and propose a method that provides solutions to these problems. Finally, we address the problem of structure-based multiple sequence alignment and propose an optimization method that unifies primary sequence and 3D structure information.
KeywordsMultiple structure alignment partial alignment structure base sequence alignment structure-sequence conservation
The research of M. Shatsky is supported by a PhD fellowship in “Complexity Science“ from the Yeshaya Horowitz association. This research was supported by the Israel Science Foundation (grant no. 281/05), the Binational US-Israel Science Foundation (BSF) and by the Hermann Minkowski-Minerva Center for Geometry at Tel Aviv University. The research of R. Nussinov has been funded in whole or in part with Federal funds from the National Cancer Institute, National Institutes of Health, under contract number NO1-CO-12400. This research was supported (in part) by the Intramural Research Program of the NIH, National Cancer Institute, Center for Cancer Research. The content of this publication does not necessarily reflect the view or policies of the Department of Health and Human Services, nor does mention of trade names, commercial products, or organization imply endorsement by the U.S. Government.
- 1.Madej, T., Gibrat, J., and Bryant, S. Threading a database of protein cores. Proteins 23:356–369, 1995. Online available on http://www.ncbi.nlm.nih.gov/Structure/VAST/vast.shtml.Google Scholar
- 3.Shindyalov, I. and Bourne, P. Protein structure alignment by incremental combinatorical extension (ce) of the optimal path. Protein Eng 11(9):739–747, 1998. Online available on http://cl.sdsc.edu/ce.html.Google Scholar
- 4.Dietmann, S., Park, J., Notredame, C., Heger, A., Lappe, M., and Holm, L. A fully automatic evolutionary classification of protein folds: dali domain dictionary version 3. Nucleic Acids Res 29(1):55–57, 2001. Online available on http://www.embl-ebi.ac.uk/dali/.Google Scholar
- 11.Akutsu, T. and Sim, K. Protein threading based on multiple protein structure alignment. In Genome Informatics (GIW’99), Asai, K. and Miyano, S. and Takagi, T (eds). Universal Academy Press, Tokyo, 23–29, 1999.Google Scholar
- 16.Dror, O., Benyamini, H., Nussinov, R., and Wolfson, H. MASS: multiple structural alignment by secondary structures. Bioinformatics 19 Suppl. 1:i95–i104, 2003.Google Scholar
- 18.Ambuhl, C., Chakraborty, S., and Gartner, B. Computing largest common point sets under approximate congruence. In Proceedings of the 8th Annual European Symposium on Algorithms, 52–63, Springer-Verlag, Springer, Berlin, 2000.Google Scholar
- 19.Akutsu, T. Protein structure alignment using dynamic programming and iterative improvement. IEICE Trans Inf Syst E79-D:1629–1636, Springer Berlin, 1996.Google Scholar
- 23.Gerstein, M. and Levitt, M. Using iterative dynamic programming to obtain accurate pairwise and multiple alignments of protein structures. In Proceedings of the Fourth International Conference on Intelligent Systems in Molecular Biology, 59–67, Menlo Park, CA, AAAI Press, Heidleberg, Germany, 1996.Google Scholar
- 33.Shatsky, M., Fligelman, Z., Nussinov, R., and Wolfson, H. Alignment of flexible protein structures. In 8th International Conference on Intelligent Systems for Molecular Biology, 329–343, AAAI press, Heidleberg, Germany, 2000.Google Scholar
- 48.Fischer, D., Elofsson, A., Rice, D., and Eisenberg, D. Assessing the performance of fold recognition methods by means of a comprehensive benchmark. In Proceedings of Pacific Symposium on Biocomputing (Hunter, L. and Klein, T., editors), World Scientific Press, Singapore, 300–318, 1996.Google Scholar