Sequence analysis and rule development of predicting protein stability change upon mutation using decision tree model

Huang, Liang-Tsung; Gromiha, M. Michael; Ho, Shinn-Ying

doi:10.1007/s00894-007-0197-4

Sequence analysis and rule development of predicting protein stability change upon mutation using decision tree model

Original Paper
Published: 30 March 2007

Volume 13, pages 879–890, (2007)
Cite this article

Journal of Molecular Modeling Aims and scope Submit manuscript

Liang-Tsung Huang^1,2,
M. Michael Gromiha³ &
Shinn-Ying Ho⁴

1290 Accesses
26 Citations
Explore all metrics

Abstract

Understanding the mechanism of the protein stability change is one of the most challenging tasks. Recently, the prediction of protein stability change affected by single point mutations has become an interesting topic in molecular biology. However, it is desirable to further acquire knowledge from large databases to provide new insights into the nature of them. This paper presents an interpretable prediction tree method (named iPTREE-2) that can accurately predict changes of protein stability upon mutations from sequence based information and analyze sequence characteristics from the viewpoint of composition and order. Therefore, iPTREE-2 based on a regression tree algorithm exhibits the ability of finding important factors and developing rules for the purpose of data mining. On a dataset of 1859 different single point mutations from thermodynamic database, ProTherm, iPTREE-2 yields a correlation coefficient of 0.70 between predicted and experimental values. In the task of data mining, detailed analysis of sequences reveals the possibility of the compositional specificity of residues in different ranges of stability change and implies the existence of certain patterns. As building rules, we found that the mutation residues in wild type and in mutant protein play an important role. The present study demonstrates that iPTREE-2 can serve the purpose of predicting protein stability change, especially when one requires more understandable knowledge.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Computational approaches for predicting mutant protein stability

Article 09 May 2016

Applications of Protein Thermodynamic Database for Understanding Protein Mutant Stability and Designing Stable Mutants

Feature-based multiple models improve classification of mutation-induced stability changes

Article Open access 20 May 2014

References

Daggett V, Fersht AR (2003) Trends Biochem Sci 28:18–25
Article CAS Google Scholar
Saven JG (2002) Curr Opin Struct Biol 12:453–458
Article CAS Google Scholar
Mendes J, Guerois R, Serrano L (2002) Curr Opin Struct Biol 12:441–446
Article CAS Google Scholar
Bolon DN, Marcus JS, Ross SA, Mayo SL (2003) J Mol Biol 329:611–622
Article CAS Google Scholar
Looger LL, Dwyer MA, Smith JJ, Hellinga HW (2003) Nature 423:185–190
Article CAS Google Scholar
Gromiha MM, Oobatake M, Kono H, Uedaira H, Sarai A (1999) Protein Eng 12:549–555
Article CAS Google Scholar
Guerois R, Nielsen JE, Serrano L (2002) J Mol Biol 320:369–387
Article CAS Google Scholar
Prevost M, Wodak SJ, Tidor B, Karplus M (1991) Proc Natl Acad Sci USA 88:10880–10884
Article CAS Google Scholar
Gilis D, Rooman M (1997) J Mol Biol 272:276–290
Article CAS Google Scholar
Parthiban V, Gromiha MM, Schomburg D (2006) Nucleic Acids Res 34:W239–W242
Article CAS Google Scholar
Funahashi J, Takano K, Yutani K (2001) Protein Eng 14:127–134
Article CAS Google Scholar
Capriotti E, Fariselli P, Casadio R (2004) Bioinformatics 20 Suppl 1:I63–I68
Article CAS Google Scholar
Capriotti E, Fariselli P, Casadio R (2005) Nucleic Acids Res 33:W306–W310
Article CAS Google Scholar
Cheng J, Randall A, Baldi P (2006) Proteins 62:1125–1132
Article CAS Google Scholar
Xiong W, Wang JTL, Shasha D, Shapiro BA, Rigoutsos I, Kaizhong Z (2002) Knowledge and Data Engineering, IEEE Transactions on 14:731–749
Article Google Scholar
Creighton C, Hanash S (2003) Bioinformatics 19:79–86
Article CAS Google Scholar
Oyama T, Kitano K, Satou K, Ito T (2002) Bioinformatics 18:705–714
Article CAS Google Scholar
Baldi P, Brunak S (2001) Bioinformatics: the machine learning approach. MIT Press, Cambridge, Mass
Google Scholar
Larose DT (2005) Discovering knowledge in data: An introduction to data mining. Wiley-Interscience, Hoboken, New York
Google Scholar
Huang LT, Gromiha MM, Hwang SF, Ho SY (2006) Computational Biology and Chemistry 30:408–415
Article CAS Google Scholar
Bordner AJ, Abagyan RA (2004) Proteins 57:400–413
Article CAS Google Scholar
Casadio R, Compiani M, Fariselli P, Vivarelli F (1995) Proc Int Conf Intell Syst Mol Biol 3:81–88
CAS Google Scholar
Frenz CM (2005) Proteins 59:147–151
Article CAS Google Scholar
Lacroix E, Viguera AR, Serrano L (1998) J Mol Biol 284:173–191
Article CAS Google Scholar
Munoz V, Serrano L (1997) Biopolymers 41:495–509
Article CAS Google Scholar
Huang LT, Saraboji K, Ho SY, Hwang SF, Ponnuswamy MN, Gromiha MM (2007) Biophysical Chemistry 125:462–470
Article CAS Google Scholar
Bava KA, Gromiha MM, Uedaira H, Kitajima K, Sarai A (2004) Nucleic Acids Res 32:D120–D121
Article CAS Google Scholar
Gromiha MM, An J, Kono H, Oobatake M, Uedaira H, Sarai A (1999) Nucleic Acids Res 27:286–288
Article CAS Google Scholar
Breiman L (1984) Classification and regression trees. Wadsworth International Group, Belmont, CA
Google Scholar
Bai JP, Utis A, Crippen G, He HD, Fischer V, et al (2004) J Chem Inf Comput Sci 44:2061–2069
Article CAS Google Scholar
Deconinck E, Zhang MH, Coomans D, Vander Heyden Y (2006) J Chem Inf Model 46:1410–1419
Article CAS Google Scholar
Witten IH, Frank E (2005) Data Mining: Practical machine learning tools and techniques. Morgan Kaufmann, San Francisco
Google Scholar
Zscherp C, Aygun H, Engels JW, Mantele W (2003) Biochim Biophys Acta 1651:139–145
CAS Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Information Engineering and Computer Science, Feng-Chia University, Taichung, 407, Taiwan
Liang-Tsung Huang
Department of Computer Science and Information Engineering, Ming-Dao University, Changhua, 523, Taiwan
Liang-Tsung Huang
Computational Biology Research Center (CBRC), National Institute of Advanced Industrial Science and Technology (AIST), AIST Tokyo Waterfront Bio-IT Research Building, 2-42 Aomi, Koto-ku, Tokyo, 135-0064, Japan
M. Michael Gromiha
Department of Biological Science and Technology, and Institute of Bioinformatics, National Chiao Tung University, Hsinchu, 300, Taiwan
Shinn-Ying Ho

Authors

Liang-Tsung Huang
View author publications
You can also search for this author in PubMed Google Scholar
M. Michael Gromiha
View author publications
You can also search for this author in PubMed Google Scholar
Shinn-Ying Ho
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Shinn-Ying Ho.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Huang, LT., Gromiha, M.M. & Ho, SY. Sequence analysis and rule development of predicting protein stability change upon mutation using decision tree model. J Mol Model 13, 879–890 (2007). https://doi.org/10.1007/s00894-007-0197-4

Download citation

Received: 25 November 2006
Accepted: 01 March 2007
Published: 30 March 2007
Issue Date: August 2007
DOI: https://doi.org/10.1007/s00894-007-0197-4

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Sequence analysis and rule development of predicting protein stability change upon mutation using decision tree model

Abstract

Access this article

Similar content being viewed by others

Computational approaches for predicting mutant protein stability

Applications of Protein Thermodynamic Database for Understanding Protein Mutant Stability and Designing Stable Mutants

Feature-based multiple models improve classification of mutation-induced stability changes

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Sequence analysis and rule development of predicting protein stability change upon mutation using decision tree model

Abstract

Access this article

Similar content being viewed by others

Computational approaches for predicting mutant protein stability

Applications of Protein Thermodynamic Database for Understanding Protein Mutant Stability and Designing Stable Mutants

Feature-based multiple models improve classification of mutation-induced stability changes

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation