Sequence Based Prediction of Protein Mutant Stability and Discrimination of Thermophilic Proteins
- Cite this paper as:
- Gromiha M.M., Huang LT., Lai LF. (2008) Sequence Based Prediction of Protein Mutant Stability and Discrimination of Thermophilic Proteins. In: Chetty M., Ngom A., Ahmad S. (eds) Pattern Recognition in Bioinformatics. PRIB 2008. Lecture Notes in Computer Science, vol 5265. Springer, Berlin, Heidelberg
Prediction of protein stability upon amino acid substitution and discrimination of thermophilic proteins from mesophilic ones are important problems in designing stable proteins. We have developed a classification rule generator using the information about wild-type, mutant, three neighboring residues and experimentally observed stability data. Utilizing the rules, we have developed a method based on decision tree for discriminating the stabilizing and destabilizing mutants and predicting protein stability changes upon single point mutations, which showed an accuracy of 82% and a correlation of 0.70, respectively. In addition, we have systematically analyzed the characteristic features of amino acid residues in 3075 mesophilic and 1609 thermophilic proteins belonging to 9 and 15 families, respectively, and developed methods for discriminating them. The method based on neural network could discrimi-nate them at the 5-fold cross-validation accuracy of 89% in a dataset of 4684 proteins and 91% in a test set of 707 proteins.
KeywordsProtein stability rule generator discrimination prediction thermophilic proteins neural network machine learning techniques
Unable to display preview. Download preview PDF.