Abstract
Mass spectrometry (MS)-based shotgun proteomics allows protein identifications even in complex biological samples. Protein abundances can then be estimated from the counts of MS/MS spectra attributable to each protein, provided that one corrects for differential MS-detectability of the contributing peptides. We describe the use of a method, APEX, which calculates Absolute Protein EXpression levels based on learned correction factors, MS/MS spectral counts, and each protein’s probability of correct identification.
The APEX-based calculations consist of three parts: (1) Using training data, peptide sequences and their sequence properties, a model is built that can be used to estimate MS-detectability (O i) for any given protein. (2) Absolute abundances of proteins measured in an MS/MS experiment are calculated with information from spectral counts, identification probabilities and the learned O i-values. (3) Simple statistics allow for significance analysis of differential expression in two distinct biological samples, i.e., measuring relative protein abundances. APEX-based protein abundances span more than four orders of magnitude and are applicable to mixtures of hundreds to thousands of proteins from any type of organism.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Abbreviations
- APEX:
-
Absolute Protein EXpression
- MS:
-
Mass spectrometry
- MS/MS:
-
Tandem mass spectrometry
References
Nesvizhskii AI, Keller A, Kolker E, Aebersold R (2003) A statistical model for identifying proteins by tandem mass spectrometry. Anal Chem 75:4646–4658
Oda Y, Huang K, Cross FR et al (1999) Accurate quantitation of protein expression and site-specific phosphorylation. Proc Natl Acad Sci USA 96:6591–6596
Ong SE, Mann M (2005) Mass spectrometry-based proteomics turns quantitative. Nat Chem Biol 1:252–262
Ong SE, Blagoev B, Kratchmarova I et al (2002) Stable isotope labeling by amino acids in cell culture, SILAC, as a simple and accurate approach to expression proteomics. Mol Cell Proteomics 1:376–386
Gygi SP, Rochon Y, Franza BR, Aebersold R (1999) Correlation between protein and mRNA abundance in yeast. Mol Cell Biol 19:1720–1730
Gerber SA, Rush J, Stemman O et al (2003) Absolute quantification of proteins and phosphoproteins from cell lysates by tandem MS. Proc Natl Acad Sci USA 100:6940–6945
Ishihama Y, Oda Y, Tabata T et al (2005) Exponentially modified protein abundance index (emPAI) for estimation of absolute protein amount in proteomics by the number of sequenced peptides per protein. Mol Cell Proteomics 4:1265–1272
Silva JC, Gorenstein MV, Li GZ et al (2006) Absolute quantification of proteins by LCMSE: a virtue of parallel MS acquisition. Mol Cell Proteomics 5:144–156
Malmstrom J, Beck M, Schmidt A et al (2009) Proteome-wide cellular protein concentrations of the human pathogen Leptospira interrogans. Nature 460:762–765
Kislinger T, Gramolini AO, Pan Y et al (2005) Proteome dynamics during C2C12 myoblast differentiation. Mol Cell Proteomics 4:887–901
Kislinger T, Cox B, Kannan A et al (2006) Global survey of organ and organelle protein expression in mouse: combined proteomic and transcriptomic profiling. Cell 125:173–186
Blondeau F, Ritter B, Allaire PD et al (2004) Tandem MS analysis of brain clathrin-coated vesicles reveals their critical involvement in synaptic vesicle recycling. Proc Natl Acad Sci USA 101:3833–3838
States DJ, Omenn GS, Blackwell TW et al (2006) Challenges in deriving high-confidence protein identifications from data gathered by a HUPO plasma proteome collaborative study. Nat Biotechnol 24:333–338
Florens L, Washburn MP, Raine JD et al (2002) A proteomic view of the Plasmodium falciparum life cycle. Nature 419:520–526
Gao J, Friedrichs MS, Dongre AR, Opiteck GJ (2005) Guidelines for the routine application of the peptide hits technique. J Am Soc Mass Spectrom 16:1231–1238
Gao J, Opiteck GJ, Friedrichs MS et al (2003) Guidelines for the routine application of the peptide hits technique. J Proteome Res 2:643–649
Liu H, Sadygov RG, Yates JR 3rd (2004) A model for random sampling and estimation of relative protein abundance in shotgun proteomics. Anal Chem 76:4193–4201
Steen H, Pandey A (2002) Proteomics goes quantitative: measuring protein abundance. Trends Biotechnol 20:361–364
Elias JE, Gibbons FD, King OD et al (2004) Intensity-based protein identification by machine learning from a library of tandem mass spectra. Nat Biotechnol 22:214–219
Gay S, Binz PA, Hochstrasser DF, Appel RD (2002) Peptide mass fingerprinting peak intensity prediction: extracting knowledge from spectra. Proteomics 2:1374–1391
Craig R, Cortens JP, Beavis RC (2005) The use of proteotypic peptide libraries for protein identification. Rapid Commun Mass Spectrom 19:1844–1850
Kuster B, Schirle M, Mallick P, Aebersold R (2005) Scoring proteomes with proteotypic peptide probes. Nat Rev Mol Cell Biol 6:577–583
Le Bihan T, Robinson MD, Stewart II, Figeys D (2004) Definition and characterization of a “trypsinosome” from specific peptide characteristics by nano-HPLC-MS/MS and in silico analysis of complex protein mixtures. J Proteome Res 3:1138–1148
Mallick P, Schirle M, Chen SS et al (2007) Computational prediction of proteotypic peptides for quantitative proteomics. Nat Biotechnol 25:125–131
Tang H, Arnold RJ, Alves P et al (2006) A computational approach toward label-free protein quantification using predicted peptide detectability. Bioinformatics 22:e481–e488
Lu P, Vogel C, Wang R, Yao et al (2007) Absolute protein expression profiling estimates the relative contributions of transcriptional and translational regulation. Nat Biotechnol 25:117–124
Ghaemmaghami S, Huh WK, Bower K et al (2003) Global analysis of protein expression in yeast. Nature 425:737–741
Newman JR, Ghaemmaghami S, Ihmels J et al (2006) Single-cell proteomic analysis of S. cerevisiae reveals the architecture of biological noise. Nature 441:840–846
Futcher B, Latter GI, Monardo P et al (1999) A sampling of the yeast proteome. Mol Cell Biol 19:7357–7368
Lopez-Campistrous A, Semchuk P, Burke L et al (2005) Localization, annotation, and comparison of the Escherichia coli K-12 proteome under two states of growth. Mol Cell Proteomics 4:1205–1209
Laurent J, Vogel C, Kwon T et al (2010) Protein abundances are more conserved than mRNA abundances across diverse taxa. Proteomics 23(10):4209–4212
Wang R, Marcotte EM (2008) The proteomic response of Mycobacterium smegmatis to anti-tuberculosis drugs suggests targeted pathways. J Proteome Res 7:855–865
Baerenfaller K, Grossmann J, Grobei MA et al (2008) Genome-scale proteomics reveals Arabidopsis thaliana gene models and proteome dynamics. Science 320:938–941
Vogel C, de Sousa AR, Ko D et al (2010) Sequence signatures and mRNA concentration can explain two-thirds of protein abundance variation in a human cell line. Mol Syst Biol 6:400
Schmidt MW, Houseman A, Ivanov AR, Wolf DA (2007) Comparative proteomic and transcriptomic profiling of the fission yeast Schizosaccharomyces pombe. Mol Syst Biol 3:79
Schrimpf SP, Weiss M, Reiter L et al (2009) Comparative functional analysis of the Caenorhabditis elegans and Drosophila melanogaster proteomes. PLoS Biol 7:e48
Keller A, Nesvizhskii AI, Kolker E, Aebersold R (2002) Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search. Anal Chem 74:5383–5392
Braisted JC, Kuntumalla S, Vogel C et al (2008) Quantitative proteomics tool: generating protein quantitation estimates from LC-MS/MS proteomics results. BMC Bioinformatics 9:529
Vogel C, Marcotte EM (2008) Calculating absolute and relative protein abundance from mass spectrometry-based protein expression data. Nat Protoc 3:1444–1451
Cagney G, Amiri S, Premawaradena T et al (2003) In silico proteome analysis to facilitate proteomics experiments using mass spectrometry. Proteome Sci 1:5
Neidhardt FC, Umbarger HE (eds) (1996) Escherichia coli and Salmonella typhimurium: cellular and molecular biology, part 4. ASM Press, Washington, DC
Sundararaj S, Guo A, Habibi-Nazhad B et al (2004) The CyberCell Database (CCDB): a comprehensive, self-updating, relational database to coordinate and facilitate in silico modeling of Escherichia coli. Nucleic Acids Res 32:D293–D295
Fasman GD ed. (1976) “Handbook of Biochemistry and Molecular Biology”, 3rd ed., Proteins – Volume 1, CRC Press, Cleveland
Chou PY, Fasman GD (1978) Prediction of the secondary structure of proteins from their amino acid sequence. Adv. Enzymol. 47: 45–148
Wertz DH, Scheraga HA (1978) Influence of water on protein structure. An analysis of the preferences of amino acid residues for the inside or outside and for specific conformations in a protein molecule. MacroÂmolecules 11:9–15
Zimmerman JM, Eliezer N, Simha R (1968) The characterization of amino acid sequences in proteins by statistical methods. J Theor Biol 21:170–201
Klein P, Kanehisa M, DeLisi C (1984) Prediction of protein function from sequence properties: Discriminant analysis of a data base. Biochim Biophys Acta 787:221–226
Eisenberg D, McLachlan AD (1986) SolvaÂtion energy in protein folding and binding. Nature 319:199–203
Fauchere JL, Charton M, Kier LB, Verloop A, Pliska V (1988) Amino acid side chain parameters for correlation studies in biology and pharmacology. Int J Peptide Protein Res 32:269–278
Vihinen M, Torkkila E, Riikonen P (1994) Accuracy of protein flexibility predictions. Proteins 19:141–149
Guy HR (1985) Amino acid side-chain partition energies and distribution of residues in soluble proteins. Biophys J 47:61–70
Nozaki Y, Tanford C (1971) The solubility of amino acids and two glycine peptides in aqueous ethanol and dioxane solutions. J Biol Chem 246:2211–2217
Acknowledgments
C.V. acknowledges support by the International Human Frontier Science Program. We thank John Braisted and Srilatha Kuntumalla from JCVI for many useful discussions regarding the APEX calculations. This work was supported by grants from the Welch (F-1515) and Packard Foundations, the National Science Foundation, and National Institutes of Health (to E.M.M.).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer Science+Business Media, LLC
About this protocol
Cite this protocol
Vogel, C., Marcotte, E.M. (2012). Label-Free Protein Quantitation Using Weighted Spectral Counting. In: Marcus, K. (eds) Quantitative Methods in Proteomics. Methods in Molecular Biology, vol 893. Humana Press, Totowa, NJ. https://doi.org/10.1007/978-1-61779-885-6_20
Download citation
DOI: https://doi.org/10.1007/978-1-61779-885-6_20
Published:
Publisher Name: Humana Press, Totowa, NJ
Print ISBN: 978-1-61779-884-9
Online ISBN: 978-1-61779-885-6
eBook Packages: Springer Protocols