Protein Subcellular Location Prediction Based on Pseudo Amino Acid Composition and Immune Genetic Algorithm

  • Tongliang Zhang
  • Yongsheng Ding
  • Shihuang Shao
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4115)


Protein subcellular location prediction with computational method is still a hot spot in bioinformatics. In this paper, we present a new method to predict protein subcellular location, which based on pseudo amino acid composition and immune genetic algorithm. Hydrophobic patterns of amino acid couples and approximate entropy are introduced to construct pseudo amino acid composition. Immune Genetic algorithm (IGA) is applied to find the fittest weight factors for pseudo amino acid composition, which are crucial in this method. As such, high success rates are obtained by both self-consistency test and jackknife test. More than 80% predictive accuracy is achieved in independent dataset test. The result demonstrates that this new method is practical. And, the method illuminates that the hydrophobic patterns of protein sequence influence its subcellular location.


Support Vector Machine Markov Chain Model Simple Genetic Algorithm Approximate Entropy Protein Subcellular Location 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Nakai, K., Kanehisa, M.: A Knowledge Base for Predicting Protein Localization Sites in Eukaryotic Cells. Genomics 14, 897–911 (1992)CrossRefGoogle Scholar
  2. 2.
    Reinhardt, A., Hubbard, T.: Using Neural Networks for Prediction of the Subcellular Location of Proteins. Nucleic Acids Res. 26, 2230–2236 (1998)CrossRefGoogle Scholar
  3. 3.
    Yuan, Z.: Prediction of Protein Subcellular Locations using Markov Chain Models. FEBS Letter 451, 23–26 (1999)CrossRefGoogle Scholar
  4. 4.
    Park, K.J., Kanehisa, M.: Prediction of Protein Subcellular Locations Support Vector Machines using Compositions Amino Acids and Amino Acid Pairs. Bioinformatics 19, 1656–1663 (2003)CrossRefGoogle Scholar
  5. 5.
    Chou, K.C., Cai, Y.D.: Using Functional Domain Composition and Support Vector Machines for Prediction of Protein Subcellular Location. Journal of biological chemistry 277, 45765–45769 (2002)CrossRefGoogle Scholar
  6. 6.
    Hua, S., Sun, Z.: Support Vector Machine Approach for Protein Subcellular Localization Prediction. Bioinformatics 17, 721–728 (2001)CrossRefGoogle Scholar
  7. 7.
    Cai, Y.D., Liu, X.J., Xu, X.B., Chou, K.C.: Support Vector Machines for Prediction of Protein Subcellular Llocation by Incorporating Quasi-sequence-order Effect. J. Cell Biochemistry 84, 343–348 (2002)CrossRefGoogle Scholar
  8. 8.
    Pan, Y.X., Zhang, Z.Z., Guo, Z.M., Feng, G.Y., Huang, Z.D., He, L.: Application of Pseudo Amino Acid Composition for Predicting Protein Subcellular Location: Stochastic Signal Processing Approach. Journal of Protein Chemistry 22, 395–402 (2003)CrossRefGoogle Scholar
  9. 9.
    Cai, Y.D., Chou, K.C.: Nearest Neighbor Algorithm for Predicting Protein Subcellular Location by Combining Function Domain Composition and Pseudo Amino Acid Composition. Biochem. Biophys. Res. Comm. 305, 407–411 (2003)CrossRefGoogle Scholar
  10. 10.
    Xiao, X., Shao, S.H., Ding, Y.S., Huang, Z.D., Huang, Y.S., Chou, K.C.: Using Complexity Measure Factor to Predict Protein Subcellular Location. Amino Acid 28, 57–61 (2005)CrossRefGoogle Scholar
  11. 11.
    Wang, M., Yang, J., Xu, Z.J., Chou, K.C.: Weight-support Vector Machines for Prediction Membrane Protein Type Based on Pseudo-amino Acid Composition. Protein Engineering Design and Selection 17, 509–516 (2004)CrossRefGoogle Scholar
  12. 12.
    Lim, V.I.: Algorithms for the Prediction of A-helical and 0-structural Regions in Globular Proteins. J. Mol. Biol. 88, 873–894 (1974)CrossRefGoogle Scholar
  13. 13.
    Dill, K.A.: Dominant Forces in Protein Folding. Biochemistry 29, 7133–7155 (1990)CrossRefGoogle Scholar
  14. 14.
    Sadovsky, M.G.: The Method to Compare Nucleotide Sequence based on Minimum Entropy Principle. Bull Math. Biol. 65, 309–322 (2003)CrossRefGoogle Scholar
  15. 15.
    Pincus, S.M.: Approximate Entropy as a Measure of System Complexity. PNAS 88, 2297–2301 (1991)MATHCrossRefMathSciNetGoogle Scholar
  16. 16.
    Chou, K.C., Elrod, D.W.: Protein Subcellular Location Prediction. Protein Eng. 12, 183–190 (1999)CrossRefGoogle Scholar
  17. 17.
    Schiffer, M., Edmundson, A.: Use of Helical Wheels to Represent the Structures of Proteins and to Identify Segments with Helical Potential. Biophys. J. 7, 121–133 (1967)CrossRefGoogle Scholar
  18. 18.
    Rose, G.D., Geselowitz, A.R., Lesser, G.J., Lee, R.H., Zehfus, M.H.: Hydrophobic of Amino Acid Residue in Globular Proteins. Science 229, 834–838 (1985)CrossRefGoogle Scholar
  19. 19.
    Hong, B., Tang, Q.Y., Yang, F.S.: Apen and Cross-ApEn: Property, Fast Algorithm and Preliminary Application to the Study of EEG and Cognition. Signal Process 15, 100–108 (1999)Google Scholar
  20. 20.
    Chou, K.C.: Prediction of Protein Cellular Attributes using Pseudo-amino-acid Composition. Protein: struct. Funct. Genet. 43, 246–255 (2001)CrossRefGoogle Scholar
  21. 21.
    Zhou, G.P.: An Intriguing Controversy over Protein Structural Class Prediction. J. Protein Chem. 17, 729–738 (1998)CrossRefGoogle Scholar
  22. 22.
    Zhou, G.P., Assa-Munt, N.: Some Insight into Protein Structural Class Prediction. Protein: Structure, Function, and Genetics 50, 44–48 (2001)CrossRefGoogle Scholar
  23. 23.
    Chou, K.C., Zhang, C.T.: Review: Prediction of Protein Structural Classes. Crit. Rev. Biochem. Mol Biol. 30, 275–349 (1995)CrossRefGoogle Scholar
  24. 24.
    Cedano, J., Aloy, P., P’erez-Pons, J.A., Querol, E.: Relation between Amino Acid Composition and Cellular Location of Protein. J. Mol. Biol. 266, 594–600 (1997)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Tongliang Zhang
    • 1
  • Yongsheng Ding
    • 1
  • Shihuang Shao
    • 1
  1. 1.College of Information Sciences and TechnologyDonghua UniversityShanghaiChina

Personalised recommendations