The effects of mass accuracy, data acquisition speed, and search algorithm choice on peptide identification rates in phosphoproteomics
- 317 Downloads
Proteomic analyses via tandem mass spectrometry have been greatly enhanced by the recent development of fast, highly accurate instrumentation. However, successful application of these developments to high-throughput experiments requires careful optimization of many variables which adversely affect each other, such as mass accuracy and data collection speed. We examined the performance of three shotgun-style acquisition methods ranging in their data collection speed and use of mass accuracy in identifying proteins from yeast-derived complex peptide and phosphopeptide-enriched mixtures. We find that the combination of highly accurate precursor masses generated from one survey scan in the FT-ICR cell, coupled with ten data-dependent tandem MS scans in a lower-resolution linear ion trap, provides more identifications in both mixtures than the other examined methods. For phosphopeptide identifications in particular, this method identified over twice as many unique phosphopeptides as the second-ranked, lower-resolution method from triplicate 90-min analyses (744 ± 50 vs. 308 ± 50, respectively). We also examined the performance of four popular peptide assignment algorithms (Mascot, Sequest, OMSSA, and Tandem) in analyzing the results from both high-and low-resolution data. When compared in the context of a false positive rate of approximately 1%, the performance differences between algorithms were much larger for phosphopeptide analyses than for an unenriched, complex mixture. Based upon these findings, acquisition speed, mass accuracy, and the choice of assignment algorithm all largely affect the number of peptides and proteins identified in high-throughput studies.
KeywordsMass accuracy Phosphorylation analysis Database searching Shotgun sequencing High-throughput proteomics
This work was supported in part by National Institutes of Health Grants GM67945 and HG3456 (to S. P. G.). We would like to thank J. Elias, J. Mintseris, J. Villen, and S. Beausoleil for helpful advice and discussions.
- 12.Mayya V, Rezaul K, Cong YS, Han D (2005) Mol Cell Proteomics 4:214–223Google Scholar
- 19.Mortimer RK, Johnston JR (1986) Genetics 113:35–43Google Scholar
- 21.Pedrioli PG, Eng JK, Hubley R, Vogelzang M, Deutsch EW, Raught B, Pratt B, Nilsson E, Angeletti RH, Apweiler R, Cheung K, Costello CE, Hermjakob H, Huang S, Julian RK, Kapp E, McComb ME, Oliver SG, Omenn G, Paton NW, Simpson R, Smith R, Taylor CF, Zhu W, Aebersold R (2004) Nat Biotechnol 22:1459–1466CrossRefGoogle Scholar