A High Performing Tool for Residue Solvent Accessibility Prediction
Considerable effort has been devoted in recent years to bridging the gap between the huge number of sequenced proteins and the relatively few solved structures. Predicting the Relative Solvent Accessibility (RSA) of residues in protein complexes is a key step towards predicting secondary structure and protein-protein interaction sites. A number of software tools for RSA prediction, based on very different approaches, have been produced over the last twenty years. Here, we present a binary classifier that implements a new method based mainly on sequence homology and realized by means of look-up tables. The tool exploits the similarity of residue solvent exposure patterns in the neighboring context of similar protein chains, using BLAST search and DSSP structure information. On a widely used dataset, it achieves a two-state classification with 89.5% accuracy and a correlation coefficient of 0.79 against the real data.
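To make the two reported figures concrete, the following is a minimal sketch of how a two-state (buried/exposed) classification can be scored. It is not the tool's implementation. The RSA cut-off of 0.25 is a hypothetical value chosen only for illustration, since the abstract does not state the threshold used. The abstract also does not specify which correlation coefficient is reported, so the Matthews correlation coefficient, a common choice for binary classifiers, is assumed here. The function names are likewise illustrative.

```python
from math import sqrt


def two_state_labels(rsa_values, threshold=0.25):
    """Map RSA values to binary labels: 1 = exposed, 0 = buried.

    The default threshold is an assumption for illustration only.
    """
    return [1 if rsa > threshold else 0 for rsa in rsa_values]


def accuracy_and_mcc(predicted, observed):
    """Return (accuracy, Matthews correlation coefficient) for binary labels."""
    if len(predicted) != len(observed) or not predicted:
        raise ValueError("label lists must be non-empty and of equal length")

    # Confusion-matrix counts, treating "exposed" (1) as the positive class.
    tp = sum(1 for p, o in zip(predicted, observed) if p == 1 and o == 1)
    tn = sum(1 for p, o in zip(predicted, observed) if p == 0 and o == 0)
    fp = sum(1 for p, o in zip(predicted, observed) if p == 1 and o == 0)
    fn = sum(1 for p, o in zip(predicted, observed) if p == 0 and o == 1)

    accuracy = (tp + tn) / (tp + tn + fp + fn)

    # MCC is undefined when any marginal count is zero; return 0.0 in that case.
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return accuracy, mcc


if __name__ == "__main__":
    # Made-up RSA values, used only to exercise the functions.
    observed = two_state_labels([0.40, 0.60, 0.10, 0.05])
    predicted = two_state_labels([0.35, 0.20, 0.15, 0.02])
    print(accuracy_and_mcc(predicted, observed))
```

Accuracy alone can be misleading when buried and exposed residues are unevenly represented, which is one reason a correlation coefficient is usually reported alongside it.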
Keywords: Support Vector Regression, Query Sequence, Solvent Accessibility, Accessible Surface Area, Similarity Depth