Predicting secretory protein signal sequence cleavage sites by fusing the marks of global alignments

Amino Acids

Summary.

A newly synthesized secretory protein bears a special sequence, called a signal peptide or signal sequence, which acts as an “address tag” that guides the protein to wherever it is needed. This unique function of signal sequences has stimulated novel strategies for drug design or for reprogramming cells in gene therapy. To realize these ideas, however, it is important to develop an automated method that identifies signal sequences or their cleavage sites quickly and accurately. In this paper, a new method is developed for predicting the signal sequence of a query secretory protein by fusing the results from a series of global alignments through a voting system. The very high success rates thus obtained suggest that the approach is very promising, and that the new method may become a useful vehicle for identifying signal sequences, or at least serve as a complementary tool to the existing algorithms in this field.
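The idea of fusing global alignments through a vote can be illustrated with a minimal sketch. The summary does not state the scoring scheme, the number of alignments, or the voting rule, so everything below is an assumption made for illustration: a plain Needleman-Wunsch alignment with unit match/mismatch scores and a linear gap penalty, a hypothetical `top_k` cutoff, and a simple majority vote in which ties go to the higher-scoring alignment. The function names and the toy sequences are likewise hypothetical.

```python
from collections import Counter


def global_align(query, template, match=1, mismatch=-1, gap=-2):
    """Needleman-Wunsch global alignment (assumed scoring, not the paper's).

    Returns (score, columns); each column is (query_idx, template_idx),
    with None marking a gap on that side.
    """
    n, m = len(query), len(template)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if query[i - 1] == template[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)
    # Trace back from the bottom-right corner to recover one optimal alignment.
    columns, i, j = [], n, m
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and query[i - 1] == template[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            columns.append((i - 1, j - 1))
            i, j = i - 1, j - 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            columns.append((i - 1, None))
            i -= 1
        else:
            columns.append((None, j - 1))
            j -= 1
    columns.reverse()
    return score[n][m], columns


def transfer_site(columns, template_signal_len):
    """Map a template's signal length onto the query through the alignment."""
    consumed = 0  # query residues seen so far
    for q_idx, t_idx in columns:
        if q_idx is not None:
            consumed += 1
        if t_idx == template_signal_len - 1:  # last residue of template signal
            break
    return consumed


def predict_signal_length(query, templates, top_k=3):
    """Let the top_k best-scoring templates vote on the query's signal length."""
    scored = []
    for seq, signal_len in templates:
        s, columns = global_align(query, seq)
        scored.append((s, transfer_site(columns, signal_len)))
    scored.sort(key=lambda item: item[0], reverse=True)
    votes = Counter(site for _, site in scored[:top_k])
    return votes.most_common(1)[0][0]


# Toy data: (sequence, known signal length). Not real proteins.
templates = [("MKALLVAAGSTQ", 8), ("MKALLVAAGSTQ", 8), ("MKTLLVAAGSTQ", 8)]
print(predict_signal_length("MKALLVAAGSTQ", templates))
```

In this sketch each template contributes one vote for the cleavage position it implies on the query. A weighted vote based on the alignment scores would be an equally plausible reading of "fusing the marks"; the summary alone does not settle which is used.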




Cite this article

Liu, DQ., Liu, H., Shen, HB. et al. Predicting secretory protein signal sequence cleavage sites by fusing the marks of global alignments. Amino Acids 32, 493–496 (2007). https://doi.org/10.1007/s00726-006-0466-z


  • DOI: https://doi.org/10.1007/s00726-006-0466-z
