Mammalian Genome

, Volume 14, Issue 12, pp 859–865

Secreted protein prediction system combining CJ-SPHMM, TMHMM, and PSORT

Article

DOI: 10.1007/s00335-003-2296-6

Cite this article as:
Chen, Y., Yu, P., Luo, J. et al. Mamm Genome (2003) 14: 859. doi:10.1007/s00335-003-2296-6

Abstract

To increase the coverage of secreted protein prediction, we describe a combination strategy. Instead of using a single method, we combine Hidden Markov Model (HMM)-based methods CJ-SPHMM and TMHMM with PSORT in secreted protein prediction. CJ-SPHMM is an HMM-based signal peptide prediction method, while TMHMM is an HMM-based transmembrane (TM) protein prediction algorithm. With CJ-SPHMM and TMHMM, proteins with predicted signal peptide and without predicted TM regions are taken as putative secreted proteins. This HMM-based approach predicts secreted protein with Ac (Accuracy) at 0.82 and Cc (Correlation coefficient) at 0.75, which are similar to PSORT with Ac at 0.82 and Cc at 0.76. When we further complement the HMM-based method, i.e., CJ-SPHMM + TMHMM with PSORT in secreted protein prediction, the Ac value is increased to 0.86 and the Cc value is increased to 0.81. Taking this combination strategy to search putative secreted proteins from the International Protein Index (IPI) maintained at the European Bioinformatics Institute (EBI), we constructed a putative human secretome with 5235 proteins. The prediction system described here can also be applied to predicting secreted proteins from other vertebrate proteomes.

Copyright information

© Springer-Verlag New York Inc. 2003

Authors and Affiliations

  1. 1.College of Life Sciences, National Laboratory of Protein Engineering and Plant Genetic Engineering, and Centre of BioinformaticsPeking University, Beijing 100871China