Skip to main content
Log in

Euk-PLoc: an ensemble classifier for large-scale eukaryotic protein subcellular location prediction

  • Published:
Amino Acids Aims and scope Submit manuscript

Summary

With the avalanche of newly-found protein sequences emerging in the post genomic era, it is highly desirable to develop an automated method for fast and reliably identifying their subcellular locations because knowledge thus obtained can provide key clues for revealing their functions and understanding how they interact with each other in cellular networking. However, predicting subcellular location of eukaryotic proteins is a challenging problem, particularly when unknown query proteins do not have significant homology to proteins of known subcellular locations and when more locations need to be covered. To cope with the challenge, protein samples are formulated by hybridizing the information derived from the gene ontology database and amphiphilic pseudo amino acid composition. Based on such a representation, a novel ensemble hybridization classifier was developed by fusing many basic individual classifiers through a voting system. Each of these basic classifiers was engineered by the KNN (K-Nearest Neighbor) principle. As a demonstration, a new benchmark dataset was constructed that covers the following 18 localizations: (1) cell wall, (2) centriole, (3) chloroplast, (4) cyanelle, (5) cytoplasm, (6) cytoskeleton, (7) endoplasmic reticulum, (8) extracell, (9) Golgi apparatus, (10) hydrogenosome, (11) lysosome, (12) mitochondria, (13) nucleus, (14) peroxisome, (15) plasma membrane, (16) plastid, (17) spindle pole body, and (18) vacuole. To avoid the homology bias, none of the proteins included has ≥25% sequence identity to any other in a same subcellular location. The overall success rates thus obtained via the 5-fold and jackknife cross-validation tests were 81.6 and 80.3%, respectively, which were 40–50% higher than those performed by the other existing methods on the same strict dataset. The powerful predictor, named “Euk-PLoc”, is available as a web-server at http://202.120.37.186/bioinf/euk. Furthermore, to support the need of people working in the relevant areas, a downloadable file will be provided at the same website to list the results predicted by Euk-PLoc for all eukaryotic protein entries (excluding fragments) in Swiss-Prot database that do not have subcellular location annotations or are annotated as being uncertain. The large-scale results will be updated twice a year to include the new entries of eukaryotic proteins and reflect the continuous development of Euk-PLoc.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • R Apweiler A Bairoch CH Wu WC Barker B Boeckmann S Ferro E Gasteiger H Huang R Lopez M Magrane MJ Martin DA Natale C O’Donovan N Redaschi LS Yeh (2004) ArticleTitleUniProt: the Universal Protein knowledgebase Nucleic Acids Res 32 D115–D119 Occurrence Handle14681372 Occurrence Handle10.1093/nar/gkh131 Occurrence Handle1:CAS:528:DC%2BD3sXhtVSru7vK

    Article  PubMed  CAS  Google Scholar 

  • M Ashburner CA Ball JA Blake D Botstein H Butler JM Cherry AP Davis K Dolinski SS Dwight JT Eppig MA Harris DP Hill L Issel-Tarver A Kasarskis S Lewis JC Matese JE Richardson M Ringwald GM Rubin G Sherlock (2000) ArticleTitleGene ontology: tool for the unification of biology Nature Genet 25 25–29 Occurrence Handle10802651 Occurrence Handle10.1038/75556 Occurrence Handle1:CAS:528:DC%2BD3cXjtFSlsbc%3D

    Article  PubMed  CAS  Google Scholar 

  • A Bairoch R Apweiler (2000) ArticleTitleThe SWISS-PROT protein sequence data bank and its supplement TrEMBL Nucleic Acids Res 25 31–36 Occurrence Handle10.1093/nar/25.1.31

    Article  Google Scholar 

  • E Camon M Magrane D Barrell V Lee E Dimmer J Maslen D Binns N Harte R Lopez R Apweiler (2004) ArticleTitleThe Gene Ontology Annotation (GOA) Database: sharing knowledge in Uniprot with Gene Ontology Nucleic Acids Res 32 D262–D266 Occurrence Handle14681408 Occurrence Handle10.1093/nar/gkh021 Occurrence Handle1:CAS:528:DC%2BD3sXhtVSrur%2FM

    Article  PubMed  CAS  Google Scholar 

  • Y Cao S Liu L Zhang J Qin J Wang K Tang (2006) ArticleTitlePrediction of protein structural class with Rough Sets BMC Bioinformatics 7 20 Occurrence Handle16412240 Occurrence Handle10.1186/1471-2105-7-20 Occurrence Handle1:CAS:528:DC%2BD28Xht1Sks7s%3D

    Article  PubMed  CAS  Google Scholar 

  • J Cedano P Aloy JA P’erez-Pons E Querol (1997) ArticleTitleRelation between amino acid composition and cellular location of proteins J Mol Biol 266 594–600 Occurrence Handle9067612 Occurrence Handle10.1006/jmbi.1996.0804 Occurrence Handle1:CAS:528:DyaK2sXhslKksL4%3D

    Article  PubMed  CAS  Google Scholar 

  • C Chen X Zhou Y Tian X Zou P Cai (2006) ArticleTitlePredicting protein structural class with pseudo-amino acid composition and support vector machine fusion network Anal Biochem 357 116–121 Occurrence Handle16920060 Occurrence Handle10.1016/j.ab.2006.07.022 Occurrence Handle1:CAS:528:DC%2BD28XpsVOgs78%3D

    Article  PubMed  CAS  Google Scholar 

  • KC Chou (1995) ArticleTitleA novel approach to predicting protein structural classes in a (20-1)-D amino acid composition space Proteins Struct Funct Genet 21 319–344 Occurrence Handle7567954 Occurrence Handle10.1002/prot.340210406 Occurrence Handle1:CAS:528:DyaK2MXls12rsb0%3D

    Article  PubMed  CAS  Google Scholar 

  • KC Chou (2000a) ArticleTitleReview: prediction of protein structural classes and subcellular locations Curr Protein Peptide Sci 1 171–208 Occurrence Handle10.2174/1389203003381379 Occurrence Handle1:CAS:528:DC%2BD3cXnsVeisL0%3D

    Article  CAS  Google Scholar 

  • KC Chou (2000b) ArticleTitleReview: prediction of tight turns and their types in proteins Anal Biochem 286 1–16 Occurrence Handle10.1006/abio.2000.4757 Occurrence Handle1:CAS:528:DC%2BD3cXntlKrsL0%3D

    Article  CAS  Google Scholar 

  • KC Chou (2001) ArticleTitlePrediction of protein cellular attributes using pseudo amino acid composition Proteins Struct Funct Genet 43 246–255 Occurrence Handle11288174 Occurrence Handle10.1002/prot.1035 Occurrence Handle1:CAS:528:DC%2BD3MXjtFOls74%3D

    Article  PubMed  CAS  Google Scholar 

  • KC Chou (2004) ArticleTitleReview: structural bioinformatics and its impact to biomedical science Curr Med Chem 11 2105–2134 Occurrence Handle15279552 Occurrence Handle1:CAS:528:DC%2BD2cXlslWltbw%3D

    PubMed  CAS  Google Scholar 

  • KC Chou (2005) ArticleTitleUsing amphiphilic pseudo amino acid composition to predict enzyme subfamily classes Bioinformatics 21 10–19 Occurrence Handle15308540 Occurrence Handle10.1093/bioinformatics/bth466 Occurrence Handle1:CAS:528:DC%2BD2MXisVWitw%3D%3D

    Article  PubMed  CAS  Google Scholar 

  • KC Chou YD Cai (2002) ArticleTitleUsing functional domain composition and support vector machines for prediction of protein subcellular location J Biol Chem 277 45765–45769 Occurrence Handle12186861 Occurrence Handle10.1074/jbc.M204161200 Occurrence Handle1:CAS:528:DC%2BD38XovFKjurg%3D

    Article  PubMed  CAS  Google Scholar 

  • KC Chou YD Cai (2003) ArticleTitleA new hybrid approach to predict subcellular localization of proteins by incorporating gene ontology Biochem Biophys Res Commun 311 743–747 Occurrence Handle14623335 Occurrence Handle10.1016/j.bbrc.2003.10.062 Occurrence Handle1:CAS:528:DC%2BD3sXos12lurs%3D

    Article  PubMed  CAS  Google Scholar 

  • KC Chou YD Cai (2004) ArticleTitlePrediction of protein subcellular locations by GO-FunD-PseAA predictor Biochem Biophys Res Commun 320 1236–1239 Occurrence Handle15249222 Occurrence Handle10.1016/j.bbrc.2004.06.073 Occurrence Handle1:CAS:528:DC%2BD2cXls1eisL0%3D

    Article  PubMed  CAS  Google Scholar 

  • KC Chou YD Cai (2005) ArticleTitlePrediction of membrane protein types by incorporating amphipathic effects J Chem Inform Model 45 407–413 Occurrence Handle10.1021/ci049686v Occurrence Handle1:CAS:528:DC%2BD2MXht1aqtLs%3D

    Article  CAS  Google Scholar 

  • KC Chou DW Elrod (1999) ArticleTitleProtein subcellular location prediction Protein Eng 12 107–118 Occurrence Handle10195282 Occurrence Handle10.1093/protein/12.2.107 Occurrence Handle1:CAS:528:DyaK1MXhvFehs7g%3D

    Article  PubMed  CAS  Google Scholar 

  • KC Chou HB Shen (2006) ArticleTitlePredicting protein subcellular location by fusing multiple classifiers J Cell Biochem 99 517–527 Occurrence Handle16639720 Occurrence Handle10.1002/jcb.20879 Occurrence Handle1:CAS:528:DC%2BD28XhtVSktL3J

    Article  PubMed  CAS  Google Scholar 

  • KC Chou CT Zhang (1994) ArticleTitlePredicting protein folding types by distance functions that make allowances for amino acid interactions J Biol Chem 269 22014–22020 Occurrence Handle8071322 Occurrence Handle1:CAS:528:DyaK2cXlslCls7o%3D

    PubMed  CAS  Google Scholar 

  • KC Chou CT Zhang (1995) ArticleTitleReview: prediction of protein structural classes Crit Rev Biochem Mol Biol 30 275–349 Occurrence Handle7587280 Occurrence Handle1:CAS:528:DyaK2MXosFentb8%3D

    PubMed  CAS  Google Scholar 

  • KC Chou CT Zhang GM Maggiora (1997) ArticleTitleDisposition of amphiphilic helices in heteropolar environments Proteins Struct Funct Genet 28 99–108 Occurrence Handle9144795 Occurrence Handle10.1002/(SICI)1097-0134(199705)28:1<99::AID-PROT10>3.0.CO;2-C Occurrence Handle1:CAS:528:DyaK2sXjtVKltrY%3D

    Article  PubMed  CAS  Google Scholar 

  • TM Cover PE Hart (1967) ArticleTitleNearest neighbour pattern classification IEEE Trans Inform Theory IT-13 21–27 Occurrence Handle10.1109/TIT.1967.1053964

    Article  Google Scholar 

  • T Denoeux (1995) ArticleTitleA k-nearest neighbor classification rule based on Dempster-Shafer theory IEEE Trans Systems Man Cybern 25 804–813 Occurrence Handle10.1109/21.376493

    Article  Google Scholar 

  • QS Du ZQ Jiang WZ He DP Li KC Chou (2006) ArticleTitleAmino acid principal component analysis (AAPCA) and its applications in protein structural class prediction J Biomol Struct Dyn 23 635–640 Occurrence Handle16615809 Occurrence Handle1:CAS:528:DC%2BD28XkvVCntLw%3D

    PubMed  CAS  Google Scholar 

  • ZP Feng (2001) ArticleTitlePrediction of the subcellular location of prokaryotic proteins based on a new representation of the amino acid composition Biopolymers 58 491–499 Occurrence Handle11241220 Occurrence Handle10.1002/1097-0282(20010415)58:5<491::AID-BIP1024>3.0.CO;2-I Occurrence Handle1:CAS:528:DC%2BD3MXisVSntb8%3D

    Article  PubMed  CAS  Google Scholar 

  • ZP Feng (2002) ArticleTitleAn overview on predicting the subcellular location of a protein In Silico Biol 2 291–303 Occurrence Handle12542414 Occurrence Handle1:CAS:528:DC%2BD38Xpsl2lu7k%3D

    PubMed  CAS  Google Scholar 

  • QB Gao ZZ Wang C Yan YH Du (2005a) ArticleTitlePrediction of protein subcellular location using a combined feature of sequence FEBS Lett 579 3444–3448 Occurrence Handle10.1016/j.febslet.2005.05.021 Occurrence Handle1:CAS:528:DC%2BD2MXlt1KjsL0%3D

    Article  CAS  Google Scholar 

  • Y Gao SH Shao X Xiao YS Ding YS Huang ZD Huang KC Chou (2005b) ArticleTitleUsing pseudo amino acid composition to predict protein subcellular location: approached with Lyapunov index, Bessel function, and Chebyshev filter Amino Acids 28 373–376 Occurrence Handle10.1007/s00726-005-0206-9 Occurrence Handle1:CAS:528:DC%2BD2MXlt1Kmurw%3D

    Article  CAS  Google Scholar 

  • A Garg M Bhasin GP Raghava (2005) ArticleTitleSupport vector machine-based method for subcellular localization of human proteins using amino acid compositions, their order, and similarity search J Biol Chem 280 14427–14432 Occurrence Handle15647269 Occurrence Handle10.1074/jbc.M411789200 Occurrence Handle1:CAS:528:DC%2BD2MXjtFSmt7g%3D

    Article  PubMed  CAS  Google Scholar 

  • J Guo Y Lin X Liu (2006a) ArticleTitleGNBSL: a new integrative system to predict the subcellular location for Gram-negative bacteria proteins Proteomics 6 5099–5105 Occurrence Handle10.1002/pmic.200600064 Occurrence Handle1:CAS:528:DC%2BD28XhtFarsbzO

    Article  CAS  Google Scholar 

  • YZ Guo M Li M Lu Z Wen K Wang G Li J Wu (2006b) ArticleTitleClassifying G protein-coupled receptors and nuclear receptors based on protein power spectrum from fast Fourier transform Amino Acids 30 397–402 Occurrence Handle10.1007/s00726-006-0332-z Occurrence Handle1:CAS:528:DC%2BD28Xls1egs7o%3D

    Article  CAS  Google Scholar 

  • A Hoglund P Donnes T Blum HW Adolph O Kohlbacher (2006) ArticleTitleMultiLoc: prediction of protein subcellular localization using N-terminal targeting sequences, sequence motifs and amino acid composition Bioinformatics 22 1158–1165 Occurrence Handle16428265 Occurrence Handle10.1093/bioinformatics/btl002 Occurrence Handle1:CAS:528:DC%2BD28Xktlaku78%3D

    Article  PubMed  CAS  Google Scholar 

  • JM Keller MR Gray JA Givens (1985) ArticleTitleA fuzzy k-nearest neighbours algorithm IEEE Trans Syst Man Cybern 15 580–585

    Google Scholar 

  • V Lee E Camon E Dimmer D Barrell R Apweiler (2005) ArticleTitleWho tangos with GOA?-Use of Gene Ontology Annotation (GOA) for biological interpretation of ‘-omics’ data and for validation of automatic annotation tools In Silico Biol 5 5–8 Occurrence Handle15972001 Occurrence Handle1:CAS:528:DC%2BD2MXksVejtrk%3D

    PubMed  CAS  Google Scholar 

  • H Liu M Wang KC Chou (2005a) ArticleTitleLow-frequency Fourier spectrum for predicting membrane protein types Biochem Biophys Res Commun 336 737–739 Occurrence Handle10.1016/j.bbrc.2005.08.160 Occurrence Handle1:CAS:528:DC%2BD2MXhtVegtLfP

    Article  CAS  Google Scholar 

  • H Liu J Yang JG Ling KC Chou (2005b) ArticleTitlePrediction of protein signal sequences and their cleavage sites by statistical rulers Biochem Biophys Res Commun 338 1005–1011 Occurrence Handle10.1016/j.bbrc.2005.10.046 Occurrence Handle1:CAS:528:DC%2BD2MXht1Wjur3F

    Article  CAS  Google Scholar 

  • G Lubec L Afjehi-Sadat JW Yang JP John (2005) ArticleTitleSearching for hypothetical proteins: theory and practice based upon original data and literature Prog Neurobiol 77 90–127 Occurrence Handle16271823 Occurrence Handle10.1016/j.pneurobio.2005.10.001 Occurrence Handle1:CAS:528:DC%2BD2MXht1GhtbvK

    Article  PubMed  CAS  Google Scholar 

  • RY Luo ZP Feng JK Liu (2002) ArticleTitlePrediction of protein strctural class by amino acid and polypeptide composition Eur J Biochem 269 4219–4225 Occurrence Handle12199700 Occurrence Handle10.1046/j.1432-1033.2002.03115.x Occurrence Handle1:CAS:528:DC%2BD38Xnt1eiur8%3D

    Article  PubMed  CAS  Google Scholar 

  • PC Mahalanobis (1936) ArticleTitleOn the generalized distance in statistics Proc Natl Inst Sci India 2 49–55

    Google Scholar 

  • Mardia KV, Kent JT, Bibby JM (1979) Multivariate analysis chapter 11: Discriminant analysis; chapter 12: Multivariate analysis of variance; chapter 13: Cluster analysis. Academic Press, London pp 322–381

  • S Matsuda JP Vert H Saigo N Ueda H Toh T Akutsu (2005) ArticleTitleA novel representation of protein sequences for prediction of subcellular location using support vector machines Protein Sci 14 2804–2813 Occurrence Handle16251364 Occurrence Handle10.1110/ps.051597405 Occurrence Handle1:CAS:528:DC%2BD2MXhtF2it77K

    Article  PubMed  CAS  Google Scholar 

  • BW Matthews (1975) ArticleTitleComparison of the predicted and observed secondary structure of T4 phage lysozyme Biochim Biophys Acta 405 442–451 Occurrence Handle1180967 Occurrence Handle1:CAS:528:DyaE2MXlslCksbk%3D

    PubMed  CAS  Google Scholar 

  • K Nakai (2000) ArticleTitleProtein sorting signals and prediction of subcellular localization Adv Protein Chem 54 277–344 Occurrence Handle10829231 Occurrence Handle1:CAS:528:DC%2BD3cXltFSqs70%3D Occurrence Handle10.1016/S0065-3233(00)54009-1

    Article  PubMed  CAS  Google Scholar 

  • K Nakai P Horton (1999) ArticleTitlePSORT: a program for detecting sorting signals in proteins and predicting their subcellular localization Trends Biochem Sci 24 34–36 Occurrence Handle10087920 Occurrence Handle10.1016/S0968-0004(98)01336-X Occurrence Handle1:CAS:528:DyaK1MXks12qtLk%3D

    Article  PubMed  CAS  Google Scholar 

  • H Nakashima K Nishikawa (1994) ArticleTitleDiscrimination of intracellular and extracellular proteins using amino acid composition and residue-pair frequencies J Mol Biol 238 54–61 Occurrence Handle8145256 Occurrence Handle10.1006/jmbi.1994.1267 Occurrence Handle1:CAS:528:DyaK2cXivFemtrw%3D

    Article  PubMed  CAS  Google Scholar 

  • H Nakashima K Nishikawa T Ooi (1986) ArticleTitleThe folding type of a protein is relevant to the amino acid composition J Biochem 99 152–162

    Google Scholar 

  • KJ Park M Kanehisa (2003) ArticleTitlePrediction of protein subcellular locations by support vector machines using compositions of amino acid and amino acid pairs Bioinformatics 19 1656–1663 Occurrence Handle12967962 Occurrence Handle10.1093/bioinformatics/btg222 Occurrence Handle1:CAS:528:DC%2BD3sXnt1Gqu78%3D

    Article  PubMed  CAS  Google Scholar 

  • KCS Pillai (1985) Mahalanobis D2 S Kotz NL Johnson (Eds) Encyclopedia of statistical sciences NumberInSeries5 Wiley New York 176–181

    Google Scholar 

  • T Radford (2003) ArticleTitleMetaphors and dreams The Scientist 17 24–26

    Google Scholar 

  • A Reinhardt T Hubbard (1998) ArticleTitleUsing neural networks for prediction of the subcellular location of proteins Nucleic Acids Res 26 2230–2236 Occurrence Handle9547285 Occurrence Handle10.1093/nar/26.9.2230 Occurrence Handle1:CAS:528:DyaK1cXjtFylsLw%3D

    Article  PubMed  CAS  Google Scholar 

  • HB Shen KC Chou (2005) ArticleTitleUsing optimized evidence-theoretic K-nearest neighbor classifier and pseudo amino acid composition to predict membrane protein types Biochem Biophys Res Commun 334 288–292 Occurrence Handle16002049 Occurrence Handle10.1016/j.bbrc.2005.06.087 Occurrence Handle1:CAS:528:DC%2BD2MXmt1aqsLw%3D

    Article  PubMed  CAS  Google Scholar 

  • HB Shen J Yang KC Chou (2006) ArticleTitleFuzzy KNN for predicting membrane protein types from pseudo amino acid composition J Theor Biol 240 9–13 Occurrence Handle16197963 Occurrence Handle10.1016/j.jtbi.2005.08.016 Occurrence Handle1:CAS:528:DC%2BD28Xjs1Knt70%3D

    Article  PubMed  CAS  Google Scholar 

  • HB Shen J Yang XJ Liu KC Chou (2005) ArticleTitleUsing supervised fuzzy clustering to predict protein structural classes Biochem Biophys Res Commun 334 577–581 Occurrence Handle16023077 Occurrence Handle10.1016/j.bbrc.2005.06.128 Occurrence Handle1:CAS:528:DC%2BD2MXmsVOgurg%3D

    Article  PubMed  CAS  Google Scholar 

  • XD Sun RB Huang (2006) ArticleTitlePrediction of protein structural classes using support vector machines Amino Acids 30 469–475 Occurrence Handle16622605 Occurrence Handle10.1007/s00726-005-0239-0 Occurrence Handle1:CAS:528:DC%2BD28Xls1ehu7c%3D

    Article  PubMed  CAS  Google Scholar 

  • GL Wang RL Dunbrack SuffixJr (2003) ArticleTitlePISCES: a protein sequence culling server Bioinformatics 19 1589–1591 Occurrence Handle12912846 Occurrence Handle10.1093/bioinformatics/btg224 Occurrence Handle1:CAS:528:DC%2BD3sXntlKmsLo%3D

    Article  PubMed  CAS  Google Scholar 

  • M Wang J Yang KC Chou (2005a) ArticleTitleUsing string kernel to predict signal peptide cleavage site based on subsite coupling model Amino Acids 28 395–402 Occurrence Handle10.1007/s00726-005-0189-6 Occurrence Handle1:CAS:528:DC%2BD2MXlt1KmtbY%3D

    Article  CAS  Google Scholar 

  • M Wang J Yang GP Liu ZJ Xu KC Chou (2004) ArticleTitleWeighted-support vector machines for predicting membrane protein types based on pseudo amino acid composition Protein Eng Des Select 17 509–516 Occurrence Handle10.1093/protein/gzh061 Occurrence Handle1:CAS:528:DC%2BD2cXos1GisLY%3D

    Article  CAS  Google Scholar 

  • M Wang J Yang ZJ Xu KC Chou (2005b) ArticleTitleSLLE for predicting membrane protein types J Theor Biol 232 7–15 Occurrence Handle10.1016/j.jtbi.2004.07.023 Occurrence Handle1:CAS:528:DC%2BD2cXovVKkur4%3D

    Article  CAS  Google Scholar 

  • SQ Wang J Yang KC Chou (2006) ArticleTitleUsing stacked generalization to predict membrane protein types based on pseudo amino acid composition J Theor Biol 242 941–946 Occurrence Handle16806277 Occurrence Handle10.1016/j.jtbi.2006.05.006 Occurrence Handle1:CAS:528:DC%2BD28Xps1Oku70%3D

    Article  PubMed  CAS  Google Scholar 

  • Wen Z, Li M, Li Y, Guo Y, Wang K (2007) Delaunay triangulation with partial least squares projection to latent structures: a model for G-protein coupled receptors classification and fast structure recognition. Amino Acids (in press) (DOI: 10.1007/s00726-006-0341-y)

  • X Xiao S Shao Y Ding Z Huang Y Huang KC Chou (2005) ArticleTitleUsing complexity measure factor to predict protein subcellular location Amino Acids 28 57–61 Occurrence Handle15611847 Occurrence Handle10.1007/s00726-004-0148-7 Occurrence Handle1:CAS:528:DC%2BD2MXhsVKqsro%3D

    Article  PubMed  CAS  Google Scholar 

  • X Xiao SH Shao YS Ding ZD Huang KC Chou (2006a) ArticleTitleUsing cellular automata images and pseudo amino acid composition to predict protein sub-cellular location Amino Acids 30 49–54 Occurrence Handle10.1007/s00726-005-0225-6 Occurrence Handle1:CAS:528:DC%2BD28XhsFCksrk%3D

    Article  CAS  Google Scholar 

  • X Xiao SH Shao ZD Huang KC Chou (2006b) ArticleTitleUsing pseudo amino acid composition to predict protein structural classes: approached with complexity measure factor J Comput Chem 27 478–482 Occurrence Handle10.1002/jcc.20354 Occurrence Handle1:CAS:528:DC%2BD28XitFyqsr4%3D

    Article  CAS  Google Scholar 

  • SW Zhang Q Pan HC Zhang ZC Shao JY Shi (2006) ArticleTitlePrediction protein homo-oligomer types by pseudo amino acid composition: approached with an improved feature extraction and naive Bayes feature fusion Amino Acids 30 461–468 Occurrence Handle16773245 Occurrence Handle10.1007/s00726-006-0263-8 Occurrence Handle1:CAS:528:DC%2BD28Xls1egsr0%3D

    Article  PubMed  CAS  Google Scholar 

  • GP Zhou (1998) ArticleTitleAn intriguing controversy over protein structural class prediction J Prot Chem 17 729–738 Occurrence Handle10.1023/A:1020713915365 Occurrence Handle1:CAS:528:DyaK1MXnslaltw%3D%3D

    Article  CAS  Google Scholar 

  • GP Zhou N Assa-Munt (2001) ArticleTitleSome insights into protein structural class prediction Proteins Struct Funct Genet 44 57–59 Occurrence Handle11354006 Occurrence Handle10.1002/prot.1071 Occurrence Handle1:CAS:528:DC%2BD3MXktlSnsbk%3D

    Article  PubMed  CAS  Google Scholar 

  • GP Zhou K Doctor (2003) ArticleTitleSubcellular location prediction of apoptosis proteins Proteins Struct Funct Genet 50 44–48 Occurrence Handle12471598 Occurrence Handle10.1002/prot.10251 Occurrence Handle1:CAS:528:DC%2BD3sXlsVKmug%3D%3D

    Article  PubMed  CAS  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Electronic Supplementary Material

Rights and permissions

Reprints and permissions

About this article

Cite this article

Shen, HB., Yang, J. & Chou, KC. Euk-PLoc: an ensemble classifier for large-scale eukaryotic protein subcellular location prediction. Amino Acids 33, 57–67 (2007). https://doi.org/10.1007/s00726-006-0478-8

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00726-006-0478-8

Navigation