Advertisement

Springer Nature is making SARS-CoV-2 and COVID-19 research free. View research | View latest news | Sign up for updates

AAIndexLoc: predicting subcellular localization of proteins based on a new representation of sequences using amino acid indices

Summary.

Identifying a protein’s subcellular localization is an important step to understand its function. However, the involved experimental work is usually laborious, time consuming and costly. Computational prediction hence becomes valuable to reduce the inefficiency. Here we provide a method to predict protein subcellular localization by using amino acid composition and physicochemical properties. The method concatenates the information extracted from a protein’s N-terminal, middle and full sequence. Each part is represented by amino acid composition, weighted amino acid composition, five-level grouping composition and five-level dipeptide composition. We divided our dataset into training and testing set. The training set is used to determine the best performing amino acid index by using five-fold cross validation, whereas the testing set acts as the independent dataset to evaluate the performance of our model. With the novel representation method, we achieve an accuracy of approximately 75% on independent dataset. We conclude that this new representation indeed performs well and is able to extract the protein sequence information. We have developed a web server for predicting protein subcellular localization. The web server is available at http://aaindexloc.bii.a-star.edu.sg.

This is a preview of subscription content, log in to check access.

References

  1. M Bhasin GP Raghava (2004) ArticleTitleESLpred: SVM-based method for subcellular localization of eukaryotic proteins using dipeptide composition and PSI-BLAST Nucleic Acids Res 32 W414–W419 Occurrence Handle15215421 Occurrence Handle1:CAS:528:DC%2BD2cXlvFKnurk%3D Occurrence Handle10.1093/nar/gkh350

  2. MP Brown WN Grundy D Lin N Cristianini CW Sugnet TS Furey M Ares SuffixJr D Haussler (2000) ArticleTitleKnowledge-based analysis of microarray gene expression data by using support vector machines Proc Natl Acad Sci USA 97 262–267 Occurrence Handle10618406 Occurrence Handle1:CAS:528:DC%2BD3cXjvVGjtw%3D%3D Occurrence Handle10.1073/pnas.97.1.262

  3. YD Cai GP Zhou KC Chou (2003) ArticleTitleSupport vector machines for predicting membrane protein types by using functional domain composition Biophys J 84 3257–3263 Occurrence Handle12719255 Occurrence Handle1:CAS:528:DC%2BD3sXjvFGju7o%3D

  4. J Cedano P Aloy JA Perez-Pons E Querol (1997) ArticleTitleRelation between amino acid composition and cellular location of proteins J Mol Biol 266 594–600 Occurrence Handle9067612 Occurrence Handle1:CAS:528:DyaK2sXhslKksL4%3D Occurrence Handle10.1006/jmbi.1996.0804

  5. C Chen YX Tian XY Zou PX Cai JY Mo (2006a) ArticleTitleUsing pseudo-amino acid composition and support vector machine to predict protein structural class J Theor Biol 243 444–448 Occurrence Handle1:CAS:528:DC%2BD28XhtFKlsL3N Occurrence Handle10.1016/j.jtbi.2006.06.025

  6. C Chen X Zhou Y Tian X Zou P Cai (2006b) ArticleTitlePredicting protein structural class with pseudo-amino acid composition and support vector machine fusion network Anal Biochem 357 116–121 Occurrence Handle1:CAS:528:DC%2BD28XpsVOgs78%3D Occurrence Handle10.1016/j.ab.2006.07.022

  7. J Chen H Liu J Yang KC Chou (2007) ArticleTitlePrediction of linear B-cell epitopes using amino acid pair antigenicity scale Amino Acids 33 423–428 Occurrence Handle17252308 Occurrence Handle1:CAS:528:DC%2BD2sXpvVagsrc%3D Occurrence Handle10.1007/s00726-006-0485-9

  8. YL Chen QZ Li (2007) ArticleTitlePrediction of the subcellular location of apoptosis proteins J Theor Biol 245 775–783 Occurrence Handle17189644 Occurrence Handle1:CAS:528:DC%2BD2sXjsVCjsLw%3D Occurrence Handle10.1016/j.jtbi.2006.11.010

  9. KC Chou (2000a) ArticleTitlePrediction of protein structural classes and subcellular locations Curr Protein Pept Sci 1 171–208 Occurrence Handle1:CAS:528:DC%2BD3cXnsVeisL0%3D Occurrence Handle10.2174/1389203003381379

  10. KC Chou (2000b) ArticleTitlePrediction of protein subcellular locations by incorporating quasi-sequence-order effect Biochem Biophys Res Commun 278 477–483 Occurrence Handle1:CAS:528:DC%2BD3cXotlKksbs%3D Occurrence Handle10.1006/bbrc.2000.3815

  11. KC Chou (2000c) ArticleTitleReview: prediction of protein structural classes and subcellular locations Curr Protein Peptide Sci 1 171–208 Occurrence Handle1:CAS:528:DC%2BD3cXnsVeisL0%3D Occurrence Handle10.2174/1389203003381379

  12. KC Chou (2001) ArticleTitlePrediction of protein cellular attributes using pseudo-amino acid composition Proteins 43 246–255 Occurrence Handle11288174 Occurrence Handle1:CAS:528:DC%2BD3MXjtFOls74%3D Occurrence Handle10.1002/prot.1035

  13. KC Chou (2002) A new branch of proteomics: prediction of protein cellular attributes PW Weinrer Q Lu (Eds) Gene cloning and expression technologies Eaton Publishing Westborough MA 57–70

  14. KC Chou (2005) ArticleTitleUsing amphiphilic pseudo amino acid composition to predict enzyme subfamily classes Bioinformatics 21 10–19 Occurrence Handle15308540 Occurrence Handle1:CAS:528:DC%2BD2MXisVWitw%3D%3D Occurrence Handle10.1093/bioinformatics/bth466

  15. KC Chou YD Cai (2002) ArticleTitleUsing functional domain composition and support vector machines for prediction of protein subcellular location J Biol Chem 277 45765–45769 Occurrence Handle12186861 Occurrence Handle1:CAS:528:DC%2BD38XovFKjurg%3D Occurrence Handle10.1074/jbc.M204161200

  16. KC Chou YD Cai (2003) ArticleTitleA new hybrid approach to predict subcellular localization of proteins by incorporating gene ontology Biochem Biophys Res Commun 311 743–747 Occurrence Handle14623335 Occurrence Handle1:CAS:528:DC%2BD3sXos12lurs%3D Occurrence Handle10.1016/j.bbrc.2003.10.062

  17. KC Chou YD Cai (2005) ArticleTitlePrediction of membrane protein types by incorporating amphipathic effects J Chem Inf Model 45 407–413 Occurrence Handle15807506 Occurrence Handle1:CAS:528:DC%2BD2MXht1aqtLs%3D Occurrence Handle10.1021/ci049686v

  18. KC Chou DW Elrod (1998) ArticleTitleUsing discriminant function for prediction of subcellular location of prokaryotic proteins Biochem Biophys Res Commun 252 63–68 Occurrence Handle9813147 Occurrence Handle1:CAS:528:DyaK1cXnsVKnur8%3D Occurrence Handle10.1006/bbrc.1998.9498

  19. KC Chou DW Elrod (1999a) ArticleTitlePrediction of membrane protein types and subcellular locations Proteins 34 137–153 Occurrence Handle1:CAS:528:DyaK1MXjtFGisg%3D%3D Occurrence Handle10.1002/(SICI)1097-0134(19990101)34:1<137::AID-PROT11>3.0.CO;2-O

  20. KC Chou DW Elrod (1999b) ArticleTitleProtein subcellular location prediction Protein Eng 12 107–118 Occurrence Handle1:CAS:528:DyaK1MXhvFehs7g%3D Occurrence Handle10.1093/protein/12.2.107

  21. KC Chou HB Shen (2006a) ArticleTitleHum-PLoc: a novel ensemble classifier for predicting human protein subcellular localization Biochem Biophys Res Commun 347 150–157 Occurrence Handle1:CAS:528:DC%2BD28Xmslyrsbc%3D Occurrence Handle10.1016/j.bbrc.2006.06.059

  22. KC Chou HB Shen (2006b) ArticleTitlePredicting eukaryotic protein subcellular location by fusing optimized evidence-theoretic K-Nearest Neighbor classifiers J Proteome Res 5 1888–1897 Occurrence Handle1:CAS:528:DC%2BD28XmvVeitr0%3D Occurrence Handle10.1021/pr060167c

  23. KC Chou HB Shen (2006c) ArticleTitlePredicting protein subcellular location by fusing multiple classifiers J Cell Biochem 99 517–527 Occurrence Handle1:CAS:528:DC%2BD28XhtVSktL3J Occurrence Handle10.1002/jcb.20879

  24. KC Chou HB Shen (2007a) ArticleTitleEuk-mPLoc: a fusion classifier for large-scale eukaryotic protein subcellular location prediction by incorporating multiple sites J Proteome Res 6 1728–1734 Occurrence Handle1:CAS:528:DC%2BD2sXjs1SrsbY%3D

  25. KC Chou HB Shen (2007b) ArticleTitleLarge-scale plant protein subcellular location prediction J Cell Biochem 100 665–678 Occurrence Handle1:CAS:528:DC%2BD2sXht1Slu7c%3D Occurrence Handle10.1002/jcb.21096

  26. KC Chou HB Shen (2007c) ArticleTitleMemType-2L: a web server for predicting membrane proteins and their types by incorporating evolution information through Pse-PSSM Biochem Biophys Res Commun 360 339–345 Occurrence Handle1:CAS:528:DC%2BD2sXnslSqtLw%3D Occurrence Handle10.1016/j.bbrc.2007.06.027

  27. KC Chou HB Shen (2007d) ArticleTitleRecent progress in protein subcellular location prediction Anal Biochem 370 1–16 Occurrence Handle1:CAS:528:DC%2BD2sXhtVOmur%2FF Occurrence Handle10.1016/j.ab.2007.07.006

  28. KC Chou HB Shen (2007e) ArticleTitleSignal-CF: a subsite-coupled and window-fusing approach for predicting signal peptides Biochem Biophys Res Commun 357 633–640 Occurrence Handle1:CAS:528:DC%2BD2sXkslCju78%3D Occurrence Handle10.1016/j.bbrc.2007.03.162

  29. KC Chou CT Zhang (1994) ArticleTitlePredicting protein folding types by distance functions that make allowances for amino acid interactions J Biol Chem 269 22014–22020 Occurrence Handle8071322 Occurrence Handle1:CAS:528:DyaK2cXlslCls7o%3D

  30. KC Chou CT Zhang (1995) ArticleTitlePrediction of protein structural classes Crit Rev Biochem Mol Biol 30 275–349 Occurrence Handle7587280 Occurrence Handle1:CAS:528:DyaK2MXosFentb8%3D Occurrence Handle10.3109/10409239509083488

  31. S Clausmeyer RB Klosgen RG Herrmann (1993) ArticleTitleProtein import into chloroplasts. The hydrophilic lumenal proteins exhibit unexpected import and sorting specificities in spite of structurally conserved transit peptides J Biol Chem 268 13869–13876 Occurrence Handle8314754 Occurrence Handle1:CAS:528:DyaK3sXltFentbw%3D

  32. YS Ding TL Zhang KC Chou (2007) ArticleTitlePrediction of protein structure classes with pseudo amino acid composition and fuzzy support vector machine network Protein Pept Lett 14 811–815 Occurrence Handle17979824 Occurrence Handle1:CAS:528:DC%2BD2sXhtlWiur7J Occurrence Handle10.2174/092986607781483778

  33. P Du Y Li (2006) ArticleTitlePrediction of protein submitochondria locations by hybridizing pseudo-amino acid composition with various physicochemical features of segmented sequence BMC Bioinformatics 7 518 Occurrence Handle17134515 Occurrence Handle10.1186/1471-2105-7-518 Occurrence Handle1:CAS:528:DC%2BD28XhtlWlsb7E

  34. O Emanuelsson H Nielsen S Brunak G von Heijne (2000) ArticleTitlePredicting subcellular localization of proteins based on their N-terminal amino acid sequence J Mol Biol 300 1005–1016 Occurrence Handle10891285 Occurrence Handle1:CAS:528:DC%2BD3cXks1OntrY%3D Occurrence Handle10.1006/jmbi.2000.3903

  35. T Endo I Shimada D Roise F Inagaki (1989) ArticleTitleN-terminal half of a mitochondrial presequence peptide takes a helical conformation when bound to dodecylphosphocholine micelles: a proton nuclear magnetic resonance study J Biochem (Tokyo) 106 396–400 Occurrence Handle1:CAS:528:DyaL1MXlvV2ktbk%3D

  36. ZP Feng (2001) ArticleTitlePrediction of the subcellular location of prokaryotic proteins based on a new representation of the amino acid composition Biopolymers 58 491–499 Occurrence Handle11241220 Occurrence Handle1:CAS:528:DC%2BD3MXisVSntb8%3D Occurrence Handle10.1002/1097-0282(20010415)58:5<491::AID-BIP1024>3.0.CO;2-I

  37. ZP Feng (2002) ArticleTitleAn overview on predicting the subcellular location of a protein In Silico Biol 2 291–303 Occurrence Handle12542414 Occurrence Handle1:CAS:528:DC%2BD38Xpsl2lu7k%3D

  38. ZP Feng CT Zhang (2001) ArticleTitlePrediction of the subcellular location of prokaryotic proteins based on the hydrophobicity index of amino acids Int J Biol Macromol 28 255–261 Occurrence Handle11251233 Occurrence Handle10.1016/S0141-8130(01)00121-0

  39. QB Gao ZZ Wang C Yan YH Du (2005a) ArticleTitlePrediction of protein subcellular location using a combined feature of sequence FEBS Lett 579 3444–3448 Occurrence Handle1:CAS:528:DC%2BD2MXlt1KjsL0%3D Occurrence Handle10.1016/j.febslet.2005.05.021

  40. Y Gao S Shao X Xiao Y Ding Y Huang Z Huang KC Chou (2005b) ArticleTitleUsing pseudo amino acid composition to predict protein subcellular location: approached with Lyapunov index, Bessel function, and Chebyshev filter Amino Acids 28 373–376 Occurrence Handle1:CAS:528:DC%2BD2MXlt1Kmurw%3D Occurrence Handle10.1007/s00726-005-0206-9

  41. JL Gardy C Spencer K Wang M Ester GE Tusnady I Simon S Hua K deFays C Lambert K Nakai FS Brinkman (2003) ArticleTitlePSORT-B: Improving protein subcellular localization prediction for Gram-negative bacteria Nucleic Acids Res 31 3613–3617 Occurrence Handle12824378 Occurrence Handle1:CAS:528:DC%2BD3sXltVWisrY%3D Occurrence Handle10.1093/nar/gkg602

  42. A Garg M Bhasin GP Raghava (2005) ArticleTitleSupport vector machine-based method for subcellular localization of human proteins using amino acid compositions, their order, and similarity search J Biol Chem 280 14427–14432 Occurrence Handle15647269 Occurrence Handle1:CAS:528:DC%2BD2MXjtFSmt7g%3D Occurrence Handle10.1074/jbc.M411789200

  43. J Guo Y Lin X Liu (2006a) ArticleTitleGNBSL: a new integrative system to predict the subcellular location for Gram-negative bacteria proteins Proteomics 6 5099–5105 Occurrence Handle1:CAS:528:DC%2BD28XhtFarsbzO Occurrence Handle10.1002/pmic.200600064

  44. YZ Guo M Li M Lu Z Wen K Wang G Li J Wu (2006b) ArticleTitleClassifying G protein-coupled receptors and nuclear receptors on the basis of protein power spectrum from fast Fourier transform Amino Acids 30 397–402 Occurrence Handle1:CAS:528:DC%2BD28Xls1egs7o%3D Occurrence Handle10.1007/s00726-006-0332-z

  45. PK Hammen DG Gorenstein H Weiner (1994) ArticleTitleStructure of the signal sequences for two mitochondrial matrix proteins that are not proteolytically processed upon import Biochemistry 33 8610–8617 Occurrence Handle7913339 Occurrence Handle1:CAS:528:DyaK2cXksFalt7k%3D Occurrence Handle10.1021/bi00194a028

  46. A Hoglund P Donnes T Blum HW Adolph O Kohlbacher (2006) ArticleTitleMultiLoc: prediction of protein subcellular localization using N-terminal targeting sequences, sequence motifs, and amino acid composition Bioinformatics 22 1158–1165 Occurrence Handle16428265 Occurrence Handle10.1093/bioinformatics/btl002 Occurrence Handle1:CAS:528:DC%2BD28Xktlaku78%3D

  47. S Hua Z Sun (2001) ArticleTitleSupport vector machine approach for protein subcellular localization prediction Bioinformatics 17 721–728 Occurrence Handle11524373 Occurrence Handle1:CAS:528:DC%2BD3MXntFKjsb0%3D Occurrence Handle10.1093/bioinformatics/17.8.721

  48. Y Huang Y Li (2004) ArticleTitlePrediction of protein subcellular locations using fuzzy k-NN method Bioinformatics 20 21–28 Occurrence Handle14693804 Occurrence Handle1:CAS:528:DC%2BD2cXns1Ol Occurrence Handle10.1093/bioinformatics/btg366

  49. L Jin H Tang W Fang (2005) ArticleTitlePrediction of protein subcellular locations using a new measure of information discrepancy J Bioinform Comput Biol 3 915–927 Occurrence Handle16078367 Occurrence Handle1:CAS:528:DC%2BD2MXhtVKltrjF Occurrence Handle10.1142/S0219720005001399

  50. KD Kedarisetti L Kurgan S Dick (2006) ArticleTitleClassifier ensembles for protein structural class prediction with varying homology Biochem Biophys Res Commun 348 981–988 Occurrence Handle16904630 Occurrence Handle1:CAS:528:DC%2BD28XosVOitL4%3D Occurrence Handle10.1016/j.bbrc.2006.07.141

  51. K Keegstra K Cline (1999) ArticleTitleProtein import and routing systems of chloroplasts Plant Cell 11 557–570 Occurrence Handle10213778 Occurrence Handle1:CAS:528:DyaK1MXjtFSrtbY%3D Occurrence Handle10.1105/tpc.11.4.557

  52. EW Klee JA Finlay C McDonald JR Attewell D Hebrink R Dyer B Love G Vasmatzis TM Li JM Beechem GG Klee (2006) ArticleTitleBioinformatics methods prioritizing serum biomarker candidates J Clin Chem 52 2162–2164 Occurrence Handle1:CAS:528:DC%2BD28XhtFOktbzP Occurrence Handle10.1373/clinchem.2006.072868

  53. LA Kurgan W Stach J Ruan (2007) ArticleTitleNovel scales based on hydrophobicity indices for secondary protein structure J Theor Biol 248 354–366 Occurrence Handle17572446 Occurrence Handle1:CAS:528:DC%2BD2sXpsFWrsb0%3D Occurrence Handle10.1016/j.jtbi.2007.05.017

  54. K Lee DW Kim D Na KH Lee D Lee (2006) ArticleTitlePLPD: reliable protein localization prediction from imbalanced and overlapped datasets Nucleic Acids Res 34 4655–4666 Occurrence Handle16966337 Occurrence Handle1:CAS:528:DC%2BD28XhtFeisb7J Occurrence Handle10.1093/nar/gkl638

  55. Y Lee CK Lee (2003) ArticleTitleClassification of multiple cancer types by multicategory support vector machines using gene expression data Bioinformatics 19 1132–1139 Occurrence Handle12801874 Occurrence Handle1:CAS:528:DC%2BD3sXks1Wrtbo%3D Occurrence Handle10.1093/bioinformatics/btg102

  56. Z Lei Y Dai (2005) ArticleTitleAn SVM-based system for predicting protein subnuclear localizations BMC Bioinformatics 6 291 Occurrence Handle16336650 Occurrence Handle10.1186/1471-2105-6-291 Occurrence Handle1:CAS:528:DC%2BD28XjslCm

  57. H Lin QZ Li (2007a) ArticleTitlePredicting conotoxin superfamily and family by using pseudo amino acid composition and modified Mahalanobis discriminant Biochem Biophys Res Commun 354 548–551 Occurrence Handle1:CAS:528:DC%2BD2sXhtlOgtLo%3D Occurrence Handle10.1016/j.bbrc.2007.01.011

  58. H Lin QZ Li (2007b) ArticleTitleUsing pseudo amino acid composition to predict protein structural class: approached by incorporating 400 dipeptide components J Comput Chem 28 1463–1466 Occurrence Handle1:CAS:528:DC%2BD2sXlslSgs7w%3D Occurrence Handle10.1002/jcc.20554

  59. DQ Liu H Liu HB Shen J Yang KC Chou (2007) ArticleTitlePredicting secretory protein signal sequence cleavage sites by fusing the marks of global alignments Amino Acids 32 493–496 Occurrence Handle17103116 Occurrence Handle1:CAS:528:DC%2BD2sXlsVGnsL8%3D Occurrence Handle10.1007/s00726-006-0466-z

  60. H Liu M Wang KC Chou (2005a) ArticleTitleLow-frequency Fourier spectrum for predicting membrane protein types Biochem Biophys Res Commun 336 737–739 Occurrence Handle1:CAS:528:DC%2BD2MXhtVegtLfP Occurrence Handle10.1016/j.bbrc.2005.08.160

  61. H Liu J Yang M Wang L Xue KC Chou (2005b) ArticleTitleUsing fourier spectrum analysis and pseudo amino acid composition for prediction of membrane protein types Protein J 24 385–389 Occurrence Handle1:CAS:528:DC%2BD2MXht1OqsLjL Occurrence Handle10.1007/s10930-005-7592-4

  62. M Mahdavi Y-H Lin (2007) ArticleTitleFalse positive reduction in protein–protein interaction predictions using gene ontology annotations BMC Bioinformatics 8 262 Occurrence Handle17645798 Occurrence Handle10.1186/1471-2105-8-262 Occurrence Handle1:CAS:528:DC%2BD2sXhtVSitbnP

  63. S Matsuda JP Vert H Saigo N Ueda H Toh T Akutsu (2005) ArticleTitleA novel representation of protein sequences for prediction of subcellular location using support vector machines Protein Sci 14 2804–2813 Occurrence Handle16251364 Occurrence Handle1:CAS:528:DC%2BD2MXhtF2it77K Occurrence Handle10.1110/ps.051597405

  64. BW Matthews (1975) ArticleTitleComparison of the predicted and observed secondary structure of T4 phage lysozyme Biochim Biophys Acta 405 442–451 Occurrence Handle1180967 Occurrence Handle1:CAS:528:DyaE2MXlslCksbk%3D

  65. S Mondal R Bhavna R Mohan Babu S Ramakumar (2006) ArticleTitlePseudo amino acid composition and multi-class support vector machines approach for conotoxin superfamily classification J Theor Biol 243 252–260 Occurrence Handle16890961 Occurrence Handle1:CAS:528:DC%2BD28XhtVygtbzM Occurrence Handle10.1016/j.jtbi.2006.06.014

  66. P Mundra M Kumar KK Kumar VK Jayaraman BD Kulkarni (2007) ArticleTitleUsing pseudo amino acid composition to predict protein subnuclear localization: Approached with PSSM Pattern Recogn Lett 28 1610–1615 Occurrence Handle10.1016/j.patrec.2007.04.001

  67. RF Murphy MV Boland M Velliste (2000) ArticleTitleTowards a systematics for protein subcelluar location: quantitative description of protein localization patterns and automated analysis of fluorescence microscope images Proc Int Conf Intell Syst Mol Biol 8 251–259 Occurrence Handle10977086 Occurrence Handle1:STN:280:DC%2BD3M7gt1aksQ%3D%3D

  68. K Nakai (2000) ArticleTitleProtein sorting signals and prediction of subcellular localization Adv Protein Chem 54 277–344 Occurrence Handle10829231 Occurrence Handle1:CAS:528:DC%2BD3cXltFSqs70%3D Occurrence Handle10.1016/S0065-3233(00)54009-1

  69. K Nakai P Horton (1999) ArticleTitlePSORT: a program for detecting sorting signals in proteins and predicting their subcellular localization Trends Biochem Sci 24 34–36 Occurrence Handle10087920 Occurrence Handle1:CAS:528:DyaK1MXks12qtLk%3D Occurrence Handle10.1016/S0968-0004(98)01336-X

  70. K Nakai M Kanehisa (1992) ArticleTitleA knowledge base for predicting protein localization sites in eukaryotic cells Genomics 14 897–911 Occurrence Handle1478671 Occurrence Handle1:CAS:528:DyaK3sXhs1Clsbw%3D Occurrence Handle10.1016/S0888-7543(05)80111-9

  71. H Nakashima K Nishikawa (1994) ArticleTitleDiscrimination of intracellular and extracellular proteins using amino acid composition and residue-pair frequencies J Mol Biol 238 54–61 Occurrence Handle8145256 Occurrence Handle1:CAS:528:DyaK2cXivFemtrw%3D Occurrence Handle10.1006/jmbi.1994.1267

  72. B Niu YD Cai WC Lu GZ Li KC Chou (2006) ArticleTitlePredicting protein structural class with AdaBoost Learner Protein Pept Lett 13 489–492 Occurrence Handle16800803 Occurrence Handle1:CAS:528:DC%2BD28XlsVGqs7o%3D Occurrence Handle10.2174/092986606776819619

  73. YX Pan ZZ Zhang ZM Guo GY Feng ZD Huang L He (2003) ArticleTitleApplication of pseudo amino acid composition for predicting protein subcellular location: stochastic signal processing approach J Protein Chem 22 395–402 Occurrence Handle13678304 Occurrence Handle1:CAS:528:DC%2BD3sXmsFejs7s%3D Occurrence Handle10.1023/A:1025350409648

  74. KJ Park M Kanehisa (2003) ArticleTitlePrediction of protein subcellular locations by support vector machines using compositions of amino acids and amino acid pairs Bioinformatics 19 1656–1663 Occurrence Handle12967962 Occurrence Handle1:CAS:528:DC%2BD3sXnt1Gqu78%3D Occurrence Handle10.1093/bioinformatics/btg222

  75. X Pu J Guo H Leung Y Lin (2007) ArticleTitlePrediction of membrane protein types from sequences and position-specific scoring matrices J Theor Biol 247 259–265 Occurrence Handle17433369 Occurrence Handle1:CAS:528:DC%2BD2sXmtlWgu7w%3D Occurrence Handle10.1016/j.jtbi.2007.01.016

  76. A Reinhardt T Hubbard (1998) ArticleTitleUsing neural networks for prediction of the subcellular location of proteins Nucleic Acids Res 26 2230–2236 Occurrence Handle9547285 Occurrence Handle1:CAS:528:DyaK1cXjtFylsLw%3D Occurrence Handle10.1093/nar/26.9.2230

  77. D Sarda GH Chua KB Li A Krishnan (2005) ArticleTitlepSLIP: SVM based protein subcellular localization prediction using multiple physicochemical properties BMC Bioinformatics 6 152 Occurrence Handle15963230 Occurrence Handle10.1186/1471-2105-6-152 Occurrence Handle1:CAS:528:DC%2BD2MXpsFGrsLw%3D

  78. H Shen KC Chou (2005a) ArticleTitleUsing optimized evidence-theoretic K-nearest neighbor classifier and pseudo-amino acid composition to predict membrane protein types Biochem Biophys Res Commun 334 288–292 Occurrence Handle1:CAS:528:DC%2BD2MXmt1aqsLw%3D Occurrence Handle10.1016/j.bbrc.2005.06.087

  79. HB Shen KC Chou (2005b) ArticleTitlePredicting protein subnuclear location with optimized evidence-theoretic K-nearest classifier and pseudo amino acid composition Biochem Biophys Res Commun 337 752–756 Occurrence Handle1:CAS:528:DC%2BD2MXhtFCjs7%2FI Occurrence Handle10.1016/j.bbrc.2005.09.117

  80. HB Shen KC Chou (2006) ArticleTitleEnsemble classifier for protein fold pattern recognition Bioinformatics 22 1717–1722 Occurrence Handle16672258 Occurrence Handle1:CAS:528:DC%2BD28Xotl2rsLY%3D Occurrence Handle10.1093/bioinformatics/btl170

  81. HB Shen KC Chou (2007a) ArticleTitleGpos-PLoc: an ensemble classifier for predicting subcellular localization of Gram-positive bacterial proteins Protein Eng Des Sel 20 39–46 Occurrence Handle1:CAS:528:DC%2BD2sXhvFWmtr8%3D Occurrence Handle10.1093/protein/gzl053

  82. HB Shen KC Chou (2007b) ArticleTitleHum-mPLoc: an ensemble classifier for large-scale human protein subcellular location prediction by incorporating samples with multiple sites Biochem Biophys Res Commun 355 1006–1011 Occurrence Handle1:CAS:528:DC%2BD2sXivVahur0%3D Occurrence Handle10.1016/j.bbrc.2007.02.071

  83. HB Shen KC Chou (2007c) ArticleTitleUsing ensemble classifier to identify membrane protein types Amino Acids 32 483–488 Occurrence Handle1:CAS:528:DC%2BD2sXlsVGnsLY%3D Occurrence Handle10.1007/s00726-006-0439-2

  84. HB Shen KC Chou (2007d) ArticleTitleVirus-PLoc: a fusion classifier for predicting the subcellular localization of viral proteins within host and virus-infected cells Biopolymers 85 233–240 Occurrence Handle1:CAS:528:DC%2BD2sXhvFWhs70%3D Occurrence Handle10.1002/bip.20640

  85. HB Shen J Yang KC Chou (2006) ArticleTitleFuzzy KNN for predicting membrane protein types from pseudo-amino acid composition J Theor Biol 240 9–13 Occurrence Handle16197963 Occurrence Handle1:CAS:528:DC%2BD28Xjs1Knt70%3D Occurrence Handle10.1016/j.jtbi.2005.08.016

  86. HB Shen J Yang KC Chou (2007) ArticleTitleEuk-PLoc: an ensemble classifier for large-scale eukaryotic protein subcellular location prediction Amino Acids 33 57–67 Occurrence Handle17235453 Occurrence Handle1:CAS:528:DC%2BD2sXotVWru7Y%3D Occurrence Handle10.1007/s00726-006-0478-8

  87. JY Shi SW Zhang Q Pan YM Cheng J Xie (2007) ArticleTitlePrediction of protein subcellular localization by support vector machines using multi-scale energy and pseudo amino acid composition Amino Acids 33 69–74 Occurrence Handle17235454 Occurrence Handle1:CAS:528:DC%2BD2sXotVWru7g%3D Occurrence Handle10.1007/s00726-006-0475-y

  88. XD Sun RB Huang (2006) ArticleTitlePrediction of protein structural classes using support vector machines Amino Acids 30 469–475 Occurrence Handle16622605 Occurrence Handle1:CAS:528:DC%2BD28Xls1ehu7c%3D Occurrence Handle10.1007/s00726-005-0239-0

  89. V Vapnik (1995) The nature of statistical learning theory Springer-Verlag New York

  90. M Wang J Yang KC Chou (2005) ArticleTitleUsing string kernel to predict signal peptide cleavage site based on subsite coupling model Amino Acids 28 395–402 Occurrence Handle15838592 Occurrence Handle1:CAS:528:DC%2BD2MXlt1KmtbY%3D Occurrence Handle10.1007/s00726-005-0189-6

  91. M Wang J Yang GP Liu ZJ Xu KC Chou (2004) ArticleTitleWeighted-support vector machines for predicting membrane protein types based on pseudo-amino acid composition Protein Eng Des Sel 17 509–516 Occurrence Handle15314209 Occurrence Handle1:CAS:528:DC%2BD2cXos1GisLY%3D Occurrence Handle10.1093/protein/gzh061

  92. SQ Wang J Yang KC Chou (2006) ArticleTitleUsing stacked generalization to predict membrane protein types based on pseudo-amino acid composition J Theor Biol 242 941–946 Occurrence Handle16806277 Occurrence Handle1:CAS:528:DC%2BD28Xps1Oku70%3D Occurrence Handle10.1016/j.jtbi.2006.05.006

  93. JJ Ward LJ McGuffin BF Buxton DT Jones (2003) ArticleTitleSecondary structure prediction with support vector machines Bioinformatics 19 1650–1655 Occurrence Handle12967961 Occurrence Handle1:CAS:528:DC%2BD3sXnt1Gqu7w%3D Occurrence Handle10.1093/bioinformatics/btg223

  94. Z Wen M Li Y Li Y Guo K Wang (2007) ArticleTitleDelaunay triangulation with partial least squares projection to latent structures: a model for G-protein coupled receptors classification and fast structure recognition Amino Acids 32 277–283 Occurrence Handle16729188 Occurrence Handle1:CAS:528:DC%2BD2sXhtFyhtLY%3D Occurrence Handle10.1007/s00726-006-0341-y

  95. X Xiao S Shao Y Ding Z Huang KC Chou (2006a) ArticleTitleUsing cellular automata images and pseudo amino acid composition to predict protein subcellular location Amino Acids 30 49–54 Occurrence Handle1:CAS:528:DC%2BD28XhsFCksrk%3D Occurrence Handle10.1007/s00726-005-0225-6

  96. X Xiao SH Shao ZD Huang KC Chou (2006b) ArticleTitleUsing pseudo amino acid composition to predict protein structural classes: approached with complexity measure factor J Comput Chem 27 478–482 Occurrence Handle10.1002/jcc.20354 Occurrence Handle1:CAS:528:DC%2BD28XitFyqsr4%3D

  97. X Xiao S Shao Y Ding Z Huang Y Huang KC Chou (2005) ArticleTitleUsing complexity measure factor to predict protein subcellular location Amino Acids 28 57–61 Occurrence Handle15611847 Occurrence Handle1:CAS:528:DC%2BD2MXhsVKqsro%3D Occurrence Handle10.1007/s00726-004-0148-7

  98. D Xie A Li M Wang Z Fan H Feng (2005) ArticleTitleLOCSVMPSI: a web server for subcellular localization of eukaryotic proteins using SVM and profile of PSI-BLAST Nucleic Acids Res 33 W105–W110 Occurrence Handle15980436 Occurrence Handle1:CAS:528:DC%2BD2MXlslyrt7w%3D Occurrence Handle10.1093/nar/gki359

  99. CS Yu CJ Lin JK Hwang (2004) ArticleTitlePredicting subcellular localization of proteins for Gram-negative bacteria by support vector machines based on n-peptide compositions Protein Sci 13 1402–1406 Occurrence Handle15096640 Occurrence Handle1:CAS:528:DC%2BD2cXjsFKks74%3D Occurrence Handle10.1110/ps.03479604

  100. Z Yuan (1999) ArticleTitlePrediction of protein subcellular locations using Markov chain models FEBS Lett 451 23–26 Occurrence Handle10356977 Occurrence Handle1:CAS:528:DyaK1MXjs1Sis7w%3D Occurrence Handle10.1016/S0014-5793(99)00506-2

  101. SW Zhang Q Pan HC Zhang ZC Shao JY Shi (2006a) ArticleTitlePrediction of protein homo-oligomer types by pseudo amino acid composition: Approached with an improved feature extraction and Naive Bayes Feature Fusion Amino Acids 30 461–468 Occurrence Handle1:CAS:528:DC%2BD28Xls1egsr0%3D Occurrence Handle10.1007/s00726-006-0263-8

  102. T Zhang Y Ding KC Chou (2006b) ArticleTitlePrediction of protein subcellular location using hydrophobic patterns of amino acid sequence Comput Biol Chem 30 367–371 Occurrence Handle1:CAS:528:DC%2BD28XhtVSisrzN Occurrence Handle10.1016/j.compbiolchem.2006.08.003

  103. TL Zhang YS Ding (2007) ArticleTitleUsing pseudo amino acid composition and binary-tree support vector machines to predict protein structural classes Amino Acids 33 623–629 Occurrence Handle17308864 Occurrence Handle1:CAS:528:DC%2BD2sXhtlSlsrjI Occurrence Handle10.1007/s00726-007-0496-1

  104. GP Zhou (1998) ArticleTitleAn intriguing controversy over protein structural class prediction J Protein Chem 17 729–738 Occurrence Handle9988519 Occurrence Handle1:CAS:528:DyaK1MXnslaltw%3D%3D Occurrence Handle10.1023/A:1020713915365

  105. GP Zhou N Assa-Munt (2001) ArticleTitleSome insights into protein structural class prediction Proteins 44 57–59 Occurrence Handle11354006 Occurrence Handle1:CAS:528:DC%2BD3MXktlSnsbk%3D Occurrence Handle10.1002/prot.1071

  106. GP Zhou K Doctor (2003) ArticleTitleSubcellular location prediction of apoptosis proteins Proteins 50 44–48 Occurrence Handle12471598 Occurrence Handle1:CAS:528:DC%2BD3sXlsVKmug%3D%3D Occurrence Handle10.1002/prot.10251

  107. XB Zhou C Chen ZC Li XY Zou (2007) ArticleTitleUsing Chou’s amphiphilic pseudo-amino acid composition and support vector machine for prediction of enzyme subfamily classes J Theor Biol 248 546–551 Occurrence Handle17628605 Occurrence Handle1:CAS:528:DC%2BD2sXhtVWhsrnI Occurrence Handle10.1016/j.jtbi.2007.06.001

Download references

Author information

Additional information

Authors’ address: Kuo-Bin Li, Center for Systems and Synthetic Biology, National Yang-Ming University, Taipei 112, Taiwan

Rights and permissions

Reprints and Permissions

About this article

Cite this article

Tantoso, E., Li, K. AAIndexLoc: predicting subcellular localization of proteins based on a new representation of sequences using amino acid indices. Amino Acids 35, 345–353 (2008). https://doi.org/10.1007/s00726-007-0616-y

Download citation

  • Keywords: Subcellular localization – Support vector machine – Amino acid indices