Abstract
In this paper, we investigate a simple protein sequence conservation measure which takes amino acid similarity into account. Instead of grouping 20 amino acids into disjoint sets in previous methods, we consider ten overlapping classes. The method is based on the assumption that a column in a multiple sequence alignment is evolved from an identical column in the evolutionary history. Two ten-dimensional vectors are constructed for each position to denote frequencies of ten classes in a column and the corresponding hypothetical identical column. Then the cosine function of the angle between these two vectors is considered as a measure of divergence of stereochemical properties at this position. This divergence, combining with other conservation scores, is used as conservation measure of the column. Finally, we evaluate our methods by identifying catalytic sites, using rank analysis criterion and receiver operator characteristic analysis criterion.
Similar content being viewed by others
Abbreviations
- MSA:
-
Multiple sequence alignment
- RE:
-
Relative entropy
- JSD:
-
Jensen–Shannon divergence
- SP:
-
Stereochemical properties divergence
- SPR:
-
Revision of SP with RE
- SPJ:
-
Revision of SP with JSD
- ROC:
-
Receiver operator characteristic curve
- TPR:
-
True positive rate
- FPR:
-
False positive rate
- AUC:
-
Area under ROC curve
References
Petrova N, Wu C (2006) BMC Bioinformatics 7:312
Valdar W, Thornton J (2001) J Mol Biol 313:399–416
Caffery D, Somaroo S, Hughes J, Mintseris J, Huang E (2004) Protein Sci 13:190–202
Capra J, Singh S (2007) Bioinformatics 23:1875–1882
Karlin S, Brocchieri L (1996) J Bacteriol 178:1881–1894
Mayrose I, Graur D, Ben-Tal A, Pupko T (2004) Mol Biol Evol 21:1781–1791
Pei J, Grishin N (2001) Bioinformatics 17:700–712
Palenchar P, Mount M, Cusato D, Dougherty J (2008) Protein J 5:283–291
Shenkin P, Erman BLM (1991) Proteins 11:297–313
Wang K, Samudrala R (2006) BMC Bioinformatics 7:385
Mirny L, Shakhnovich E (1999) J Mol Biol 291:177–196
Taylor W (1986) J Theor Biol 119:205–218
Valdar W (2002) Proteins 48:227–241
Zvelibil M, Barton G, Taylor W, Sternberg M (1987) J Mol Biol 195:957–961
Cover T, Thomas J (1991) Elements of information theory. Wiley, New York
Williamson R (1995) J Theor Biol 174:179–188
Gribskov M, Robinson N (1996) Comput Chem 20:25–33
Porter C, Bartleett G, Thornton J (2003) Nucleic Acids Res 32:D129–D133
Henikoff S, Henikoff J (1994) J Mol Biol 243:574–578
Panchenko A, Kondrashov F, Bryant S (2003) Protein Sci 13:884–892
Acknowledgments
This work was supported in part by Leading Academic Discipline Project of Shanghai Normal University (No. DZL803) and Shanghai Leading Academic Discipline Project (No. S30405).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Dou, Y., Zheng, X. & Wang, J. Prediction of Catalytic Residues Using the Variation of Stereochemical Properties. Protein J 28, 29–33 (2009). https://doi.org/10.1007/s10930-008-9161-0
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10930-008-9161-0