Retention Time Prediction and Protein Identification

Mass Spectrometry Data Analysis in Proteomics

Part of the book series: Methods in Molecular Biology (MIMB, volume 1007)

Abstract

In bottom-up proteomics, proteins are typically identified by enzymatic digestion into peptides, tandem mass spectrometry, and comparison of the tandem mass spectra with those predicted from a sequence database for peptides whose masses lie within measurement uncertainty of the experimentally obtained mass. Although decreasingly common, isolated proteins or simple protein mixtures can also be identified by measuring only the masses of the peptides resulting from the enzymatic digest, without any further fragmentation. Separation methods such as liquid chromatography and electrophoresis are often used to fractionate complex protein or peptide mixtures prior to analysis by mass spectrometry. Although the primary reason for this is to avoid ion suppression and improve data quality, these separations are based on physical and chemical properties of the peptides or proteins and therefore also provide information about them. Depending on the separation method, this could be protein molecular weight (SDS-PAGE), isoelectric point (IEF), charge at a known pH (ion exchange chromatography), or hydrophobicity (reversed-phase chromatography). These separations produce approximate measurements of properties that to some extent can be predicted from amino acid sequences. In the case of the molecular weight of proteins without posttranslational modifications, this is straightforward: simply add the molecular weights of the amino acid residues in the protein. For isoelectric point, charge, and hydrophobicity, the order of the amino acids and the folding state of the peptide or protein also matter, but it is nevertheless possible to predict the behavior of peptides and proteins in these separation methods to a degree that renders such predictions useful.
This chapter reviews the topic of using data from separation methods for identification and validation in proteomics, with special emphasis on predicting retention times of tryptic peptides in reversed-phase chromatography under acidic conditions, as this is one of the most commonly used separation methods in proteomics.
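
The mass-based identification outlined in the abstract can be illustrated with a minimal sketch. It is not the chapter's own code. It assumes monoisotopic residue masses (standard values, rounded), a simplified trypsin rule (cleave after K or R unless the next residue is P, no missed cleavages), and a relative tolerance in ppm standing in for the "measurement uncertainty". The function names and the 10 ppm default are hypothetical choices made for illustration. Note that one water is added to the sum of residue masses to account for the free N- and C-termini.

```python
# Monoisotopic residue masses in Da (standard values, rounded to 5 decimals).
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056  # H2O, accounts for the terminal H and OH


def peptide_mass(sequence):
    """Neutral monoisotopic mass of an unmodified peptide or protein."""
    return sum(RESIDUE_MASS[aa] for aa in sequence) + WATER


def tryptic_digest(protein):
    """Simplified in-silico trypsin digest (assumption: no missed cleavages)."""
    peptides = []
    start = 0
    for i, aa in enumerate(protein):
        next_aa = protein[i + 1] if i + 1 < len(protein) else ""
        # Cleave C-terminal to K or R, except when followed by P.
        if aa in "KR" and next_aa != "P":
            peptides.append(protein[start:i + 1])
            start = i + 1
    if start < len(protein):
        peptides.append(protein[start:])
    return peptides


def within_tolerance(observed, theoretical, ppm=10.0):
    """True if the observed mass lies within +/- ppm of the theoretical mass."""
    return abs(observed - theoretical) <= theoretical * ppm * 1e-6


def match_masses(observed_masses, protein, ppm=10.0):
    """Pair each predicted tryptic peptide with the observed masses it explains."""
    hits = []
    for peptide in tryptic_digest(protein):
        theoretical = peptide_mass(peptide)
        for observed in observed_masses:
            if within_tolerance(observed, theoretical, ppm):
                hits.append((peptide, observed))
    return hits
```

Calling `match_masses(observed, sequence)` for every sequence in a database and ranking the sequences by number of hits gives a crude form of identification from peptide masses alone. A predicted retention time could be added as a second filter alongside the mass tolerance, which is the kind of use of separation data the chapter reviews.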




Copyright information

© 2013 Springer Science+Business Media, LLC

About this protocol

Cite this protocol

Henneman, A.A., Palmblad, M. (2013). Retention Time Prediction and Protein Identification. In: Matthiesen, R. (eds) Mass Spectrometry Data Analysis in Proteomics. Methods in Molecular Biology, vol 1007. Humana Press, Totowa, NJ. https://doi.org/10.1007/978-1-62703-392-3_4

  • DOI: https://doi.org/10.1007/978-1-62703-392-3_4

  • Publisher Name: Humana Press, Totowa, NJ

  • Print ISBN: 978-1-62703-391-6

  • Online ISBN: 978-1-62703-392-3

  • eBook Packages: Springer Protocols
