PUEPro: A Computational Pipeline for Prediction of Urine Excretory Proteins

Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10086)


A computational pipeline is developed to accurately predict urine excretory proteins and the possible origins of the proteins. The novel contributions of this study include: (i) a new method for predicting if a cellular protein is urine excretory based on unique features of proteins known to be urine excretory; and (ii) a novel method for identifying urinary proteins originating from the urinary system. By integrating these tools, our computational pipeline is capable of predicting the origin of a detected urinary protein, hence offering a novel tool for predicting potential biomarkers of a specific disease, which may have some of their proteins urine excreted. One application is presented for this prediction pipeline to demonstrate the effectiveness of its prediction. The pipeline and supplementary materials can be accessed at the following URL: http://csbl.bmb.uga.edu/PUEPro/.


Urine excretory proteins Support vector machine recursive feature elimination Biomarkers of disease 



This work is supported by the National Natural Science Foundation of China (Grant Nos. 81320108025, 61402194, 61572227), Development Project of Jilin Province of China (20140101180JC) and China Postdoctoral Science Foundation (2014T70291).


  1. 1.
    Lee, S., Huang, H., Zelen, M.: Early detection of disease and scheduling of screening examinations. Stat. Methods Med. Res. 13, 443–456 (2004)MathSciNetCrossRefMATHGoogle Scholar
  2. 2.
    Thongboonkerd, V.: Practical points in urinary proteomics. J. Proteome Res. 6, 3881–3890 (2007)CrossRefGoogle Scholar
  3. 3.
    Shao, C., Li, M., Li, X., Wei, L., Zhu, L., Yang, F., Jia, L., Mu, Y., Wang, J., Guo, Z., Zhang, D., Yin, J., Wang, Z., Sun, W., Zhang, Z., Gao, Y.: A tool for biomarker discovery in the urinary proteome: a manually curated human and animal urine protein biomarker database. Mol. Cell. Proteomics 10(M111), 010975 (2011)Google Scholar
  4. 4.
    Abogunrin, F., O’Kane, H.F., Ruddock, M.W., Stevenson, M., Reid, C.N., O’Sullivan, J.M., Anderson, N.H., O’Rourke, D., Duggan, B., Lamont, J.V., Boyd, R.E., Hamilton, P., Nambirajan, T., Williamson, K.E.: The impact of biomarkers in multivariate algorithms for bladder cancer diagnosis in patients with hematuria. Cancer 118, 2641–2650 (2012)CrossRefGoogle Scholar
  5. 5.
    Raimondo, F., Morosi, L., Corbetta, S., Chinello, C., Brambilla, P., Della Mina, P., Villa, A., Albo, G., Battaglia, C., Bosari, S., Magni, F., Pitto, M.: Differential protein profiling of renal cell carcinoma urinary exosomes. Mol. BioSyst. 9, 1220–1233 (2013)CrossRefGoogle Scholar
  6. 6.
    Malavaud, B., Salama, G., Miedouge, M., Vincent, C., Rischmann, P., Sarramon, J.P., Serre, G.: Influence of digital rectal massage on urinary prostate-specific antigen: interest for the detection of local recurrence after radical prostatectomy. Prostate 34, 23–28 (1998)CrossRefGoogle Scholar
  7. 7.
    Ghoniem, G., Faruqui, N., Elmissiry, M., Mahdy, A., Abdelwahab, H., Oommen, M., Abdel-Mageed, A.B.: Differential profile analysis of urinary cytokines in patients with overactive bladder. Int. Urogynecol. J. 22, 953–961 (2011)CrossRefGoogle Scholar
  8. 8.
    Abdullah-Soheimi, S.S., Lim, B.K., Hashim, O.H., Shuib, A.S.: Patients with ovarian carcinoma excrete different altered levels of urine CD59, kininogen-1 and fragments of inter-alpha-trypsin inhibitor heavy chain H4 and albumin. Proteome Sci. 8, 58 (2010)CrossRefGoogle Scholar
  9. 9.
    Li, Y., Zhang, Y., Qiu, F., Qiu, Z.: Proteomic identification of exosomal LRG1: a potential urinary biomarker for detecting NSCLC. Electrophoresis 32, 1976–1983 (2011)CrossRefGoogle Scholar
  10. 10.
    Abdalla, M.A., Haj-Ahmad, Y.: Promising urinary protein biomarkers for the early detection of hepatocellular carcinoma among high-risk hepatitis C virus Egyptian patients. J Cancer. 3, 390–403 (2012)CrossRefGoogle Scholar
  11. 11.
    Hong, C.S., Cui, J., Ni, Z., Su, Y., Puett, D., Li, F., Xu, Y.: A computational method for prediction of excretory proteins and application to identification of gastric cancer markers in urine. PLoS ONE 6, e16875 (2011)CrossRefGoogle Scholar
  12. 12.
    Li, S.J., Peng, M., Li, H., Liu, B.S., Wang, C., Wu, J.R., Li, Y.X., Zeng, R.: Sys-BodyFluid: a systematical database for human body fluid proteome research. Nucleic Acids Res. 37, D907–D912 (2009)CrossRefGoogle Scholar
  13. 13.
    Legrain, P., Aebersold, R., Archakov, A., Bairoch, A., Bala, K., Beretta, L., Bergeron, J., Borchers, C.H., Corthals, G.L., Costello, C.E., Deutsch, E.W., Domon, B., Hancock, W., He, F., Hochstrasser, D., Marko-Varga, G., Salekdeh, G.H., Sechi, S., Snyder, M., Srivastava, S., Uhlen, M., Wu, C.H., Yamamoto, T., Paik, Y.K., Omenn, G.S.: The human proteome project: current state and future direction. Mol. Cell. Proteomics 10(M111), 009993 (2011)Google Scholar
  14. 14.
    Adachi, J., Kumar, C., Zhang, Y., Olsen, J.V., Mann, M.: The human urinary proteome contains more than 1500 proteins, including a large proportion of membrane proteins. Genome Biol. 7, R80 (2006)CrossRefGoogle Scholar
  15. 15.
    Li, Q.R., Fan, K.X., Li, R.X., Dai, J., Wu, C.C., Zhao, S.L., Wu, J.R., Shieh, C.H., Zeng, R.: A comprehensive and non-prefractionation on the protein level approach for the human urinary proteome: touching phosphorylation in urine. Rapid Commun. Mass Spectrom. 24, 823–832 (2010)CrossRefGoogle Scholar
  16. 16.
    Marimuthu, A., O’Meally, R.N., Chaerkady, R., Subbannayya, Y., Nanjappa, V., Kumar, P., Kelkar, D.S., Pinto, S.M., Sharma, R., Renuse, S., Goel, R., Christopher, R., Delanghe, B., Cole, R.N., Harsha, H.C., Pandey, A.: A comprehensive map of the human urinary proteome. J. Proteome Res. 10, 2734–2743 (2011)CrossRefGoogle Scholar
  17. 17.
    Decramer, S., Gonzalez de Peredo, A., Breuil, B., Mischak, H., Monsarrat, B., Bascands, J.L., Schanstra, J.P.: Urine in clinical proteomics. Mol. Cell. Proteomics 7, 1850–1862 (2008)CrossRefGoogle Scholar
  18. 18.
    Hoorn, E.J., Pisitkun, T., Zietse, R., Gross, P., Frokiaer, J., Wang, N.S., Gonzales, P.A., Star, R.A., Knepper, M.A.: Prospects for urinary proteomics: exosomes as a source of urinary biomarkers. Nephrology (Carlton) 10, 283–290 (2005)CrossRefGoogle Scholar
  19. 19.
    Jia, L., Li, X., Shao, C., Wei, L., Li, M., Guo, Z., Liu, Z., Gao, Y.: Using an isolated rat kidney model to identify kidney origin proteins in urine. PLoS ONE 8, e66911 (2013)CrossRefGoogle Scholar
  20. 20.
    Wood, S.L., Knowles, M.A., Thompson, D., Selby, P.J., Banks, R.E.: Proteomic studies of urinary biomarkers for prostate, bladder and kidney cancers. Nat. Rev. Urol. 10, 206–218 (2013)CrossRefGoogle Scholar
  21. 21.
    Cui, J., Liu, Q., Puett, D., Xu, Y.: Computational prediction of human proteins that can be secreted into the bloodstream. Bioinformatics 24, 2370–2375 (2008)CrossRefGoogle Scholar
  22. 22.
    Finn, R.D., Bateman, A., Clements, J., Coggill, P., Eberhardt, R.Y., Eddy, S.R., Heger, A., Hetherington, K., Holm, L., Mistry, J., Sonnhammer, E.L., Tate, J., Punta, M.: Pfam: the protein families database. Nucleic Acids Res. 42, D222–D230 (2014)CrossRefGoogle Scholar
  23. 23.
    Storey, J.D., Tibshirani, R.: Statistical significance for genomewide studies. Proc. Natl. Acad. Sci. U.S.A. 100, 9440–9445 (2003)MathSciNetCrossRefMATHGoogle Scholar
  24. 24.
    Guyon, I., Weston, J., Barnhill, S., Vapnik, V.: Gene selection for cancer classification using support vector machines. Mach. Learn. 46, 389–422 (2002)CrossRefMATHGoogle Scholar
  25. 25.
    Scholkopf, B., Platt, J.C., Shawe-Taylor, J., Smola, A.J., Williamson, R.C.: Estimating the support of a high-dimensional distribution. Neural Comput. 13, 1443–1471 (2001)CrossRefMATHGoogle Scholar
  26. 26.
    Hampton, T.: Cancer genome atlas. JAMA 296, 1958 (2006)Google Scholar
  27. 27.
    Harris, N.S., Winter, W.E.: Multiple myeloma and related serum protein disorders: an electrophoretic guide, Demos Medical Publishing (2012)Google Scholar
  28. 28.
    Consortium, U: Reorganizing the protein space at the Universal Protein Resource (UniProt). Nucleic Acids Res. 40, D71–D75 (2012)CrossRefGoogle Scholar
  29. 29.
    Huang, D.W., Sherman, B.T., Lempicki, R.A.: Systematic and integrative analysis of large gene lists using DAVID Bioinformatics Resources. Nature Protoc. 4, 44–57 (2009)CrossRefGoogle Scholar
  30. 30.
    Nolen, B.M., Lomakin, A., Marrangoni, A., Velikokhatnaya, L., Prosser, D., Lokshin, A.E.: Urinary protein biomarkers in the early detection of lung cancer. Cancer Prev. Res. (Phila) (2014) Google Scholar

Copyright information

© Springer International Publishing AG 2016

Authors and Affiliations

  1. 1.College of Computer Science and TechnologyChangchunChina
  2. 2.College of Public HealthJilin UniversityChangchunChina
  3. 3.School of Natural and Computing SciencesUniversity of AberdeenAberdeenUK
  4. 4.Computational Systems Biology Lab, Department of Biochemistry and Molecular Biology and Institute of BioinformaticsUniversity of GeorgiaAthensUSA

Personalised recommendations