Multiple Structure Alignment and Consensus Identification for Proteins
An algorithm is presented to compute a multiple structure alignment for a set of proteins and to generate a consensus structure which captures common substructures present in the given proteins. The algorithm is a heuristic in that it computes an approximation to the optimal alignment that minimizes the sum of the pairwise distances between the consensus and the transformed proteins. A distinguishing feature of the algorithm is that it works directly with the coordinate representation in three dimensions with no loss of spatial information, unlike some other multiple structure alignment algorithms that operate on sets of backbone vectors translated to the origin; hence, the algorithm is able to generate true alignments. Experimental studies on several protein datasets show that the algorithm is quite competitive with a well-known algorithm called CE-MC. A web-based tool has also been developed to facilitate remote access to the algorithm over the Internet.
KeywordsStructure Alignment Backbone Vector True Alignment Consensus Structure Monte Carlo Optimization
Unable to display preview. Download preview PDF.
- 1.Branden, C., Tooze, J.: Introduction to Protein Structure, Garland (1999)Google Scholar
- 3.Chew, L.P., Kedem, K.: Finding the consensus shape of a protein family. In: Proc. ACM Symp. Comput. Geometry SoCG 2002, pp. 64–73 (2002)Google Scholar
- 4.Gerstein, M., Levitt, M.: Using iterative dynamic programming to obtain accurate pairwise and multiple alignments of protein structures. In: Proc. ISMB 1996, pp. 59–66 (1996)Google Scholar
- 5.Golub, G.H., Van Loan, C.F.: Matrix Computations, 3rd edn. John Hopkins University Press (1996)Google Scholar
- 6.Guda, C., Scheeff, E.D., Bourne, P.E., Shindyalov, I.N.: A new algorithm for the alignment of multiple protein structures using Monte Carlo optimization. In: Proc. PSB 2001, pp. 275–286 (2001)Google Scholar
- 10.Leibowitz, N., Fligelman, Z., Nussinov, R., Wolfson, H.: Multiple Structural Alignment and Core Detection by Geometric Hashing. In: Proc. ISMB 1999, pp. 169–177 (1999)Google Scholar
- 13.Rose, G.: No assembly required. The Sciences 36, 26–31 (1996)Google Scholar
- 16.Singh, A.P., Brutlag, D.L.: Hierarchical protein structure superposition using both secondary structure and atomic representation. In: Proc. ISMB 1997, pp. 284–293 (1997)Google Scholar
- 19.Ye, J., Ilinkin, I., Janardan, R., Isom, A.: Multiple structure alignment and consensus identification for proteins (submitted), available at: http://www.geom-comp.umn.edu