De Novo Correction of Mass Measurement Error in Low Resolution Tandem MS Spectra for Shotgun Proteomics

Egertson, Jarrett D.; Eng, Jimmy K.; Bereman, Michael S.; Hsieh, Edward J.; Merrihew, Gennifer E.; MacCoss, Michael J.

doi:10.1007/s13361-012-0482-z

De Novo Correction of Mass Measurement Error in Low Resolution Tandem MS Spectra for Shotgun Proteomics

Research Article
Published: 25 September 2012

Volume 23, pages 2075–2082, (2012)
Cite this article

Download PDF

Journal of The American Society for Mass Spectrometry

De Novo Correction of Mass Measurement Error in Low Resolution Tandem MS Spectra for Shotgun Proteomics

Download PDF

Jarrett D. Egertson¹,
Jimmy K. Eng¹,
Michael S. Bereman¹,
Edward J. Hsieh¹,
Gennifer E. Merrihew¹ &
…
Michael J. MacCoss¹

1679 Accesses
8 Citations
1 Altmetric
Explore all metrics

Abstract

We report an algorithm designed for the calibration of low resolution peptide mass spectra. Our algorithm is implemented in a program called FineTune, which corrects systematic mass measurement error in 1 min, with no input required besides the mass spectra themselves. The mass measurement accuracy for a set of spectra collected on an LTQ-Velos improved 20-fold from –0.1776 ± 0.0010 m/z to 0.0078 ± 0.0006 m/z after calibration (avg ± 95 % confidence interval). The precision in mass measurement was improved due to the correction of non-linear variation in mass measurement accuracy across the m/z range.

Statistical control of peptide and protein error rates in large-scale targeted data-independent acquisition analyses

Article 21 August 2017

The Analysis of Peptide-Centric Mass-Spectrometry Data Utilizing Information About the Expected Isotope Distribution

DIALib-QC an assessment tool for spectral libraries in data-independent acquisition proteomics

Article Open access 16 October 2020

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

The field of proteomics has experienced significant growth in the past decade due to advancements in mass spectrometer instrumentation and computational tools for data interpretation. Instrument scan speed, dynamic range, sensitivity, resolution, and mass measurement accuracy (MMA) continue to improve, allowing for more comprehensive analysis of complex protein digests.

Systematic mass measurement error (SMME) is typically corrected by routine instrument calibration. External calibration is performed by analyzing a standard with molecules of known elemental composition and calibrating the instrument to match the measurements of the calibrant ions to their known mass-to-charge (m/z). Ion species covering a wide range of m/z are often used for calibration because systematic mass measurement error can vary with m/z. Over time, the mass calibration drifts, requiring periodic recalibration. Even the most sophisticated instruments will have some systematic and/or random mass measurement error specifically if they are not properly calibrated.

Mass measurement accuracy can be further improved by analyzing calibrants spiked into each sample (i.e., internal calibration). Internal calibration is usually coupled with high-resolution mass analyzers (e.g., TOF [1, 2], FTICR [3, 4], Orbitrap [5]) where instrument factors such as space charge effects, electric fields, peak intensity, and mass analyzer temperature vary during the course of an analysis. These factors ultimately cause mass measurements to deviate. On trapping instruments, the implementation of automatic gain control helps alleviate the mass deviations due to space charge by controlling for total ion population [6]. As a result, internal calibration is not necessarily needed in these instruments to achieve high MMA and can be detrimental to performance due to the loss in sensitivity and dynamic range associated with introducing a calibrant ion.

Computational calibration techniques aim to match the robustness of internal calibration without requiring the addition of specific calibrants. To calibrate precursor data, ion species known to be present in a sample can be used as internal calibrants in lieu of spiked in calibrants. Peptides expected to be present in a data set a priori [2, 7, 8], peptides confidently identified by database searching [9–13], or commonly observed contaminants can be used in this manner [14]. These techniques improve mass measurement accuracy but will likely struggle with low resolution data due to the difficulty of unambiguously mapping theoretical ion species to low resolution precursor features. Charge state pairs can be used to detect frequency shifts in precursor spectra without a priori knowledge of sample content [15–17]. Unfortunately, the reliance of this technique on resolving charge-state precludes its application to low-resolution data where resolving power is too low to resolve isotopic peaks for charge state determination. Monoisotopic, singly-charged peak masses in peptide mass fingerprinting data collected on MALDI-TOF instruments can be calibrated without a priori knowledge of the sample content [18–20].

Fewer computational techniques exist for the calibration of MS/MS spectra. Fragmentation spectra can be calibrated de novo by detection of type-1 peak edges [21] or by analysis of trypsin auto-lysis products [22]. Confident peptide-spectrum matches from a database search can be used to calibrate MS/MS data as well [23], although such methods can be computationally expensive and require protein annotations from a genome sequence.

Herein, we demonstrate that low resolution ion trap instruments can be subject to systematic mass measurement error even after external calibration. We implement a novel method for de novo calibration of peptide MS/MS data collected on low resolution instruments capable of improving mass measurement accuracy and mass measurement precision quickly (<1 minute) without the need for protein annotations or even knowledge of the organism(s) being analyzed. The algorithm is implemented in a freely-available, open-source software package named “FineTune.” We demonstrate improved mass measurement accuracy after calibration with FineTune for MS/MS spectra acquired on both an LTQ and LTQ-Velos mass spectrometer by analyzing mass measurement error in confident peptide-spectrum matches pre and post-calibration. The robustness of FineTune is demonstrated by successfully calibrating data sets with only a small percentage of the total spectra used as input for the calibration. The impact of calibrating MS/MS spectra on database search results is tested using the Mascot, SEQUEST, and X!Tandem search algorithms.

2 Materials and Methods

2.1 Collection of Data for Testing the De Novo Calibration Algorithm

A S. cerevisiae sample digest was analyzed by LC-MS/MS on an LTQ-Orbitrap-Velos (Thermo Fisher Scientific, San Jose, CA, USA) hybrid mass spectrometer. Peptides were separated by reversed-phase high performance liquid chromatography (RP-HPLC) across a 100-min linear acetonitrile gradient on a 40 cm column with 75 μM inner-diameter. Mass spectra were collected using a top-13 data-dependent acquisition scheme with precursor scans acquired in the Orbitrap (60,000 resolving power @ 400 m/z) in profile mode. Peptide fragmentation by resonance collision induced dissociation (CID) and subsequent mass analysis was executed in the dual pressure linear-ion trap in centroid mode. Dynamic exclusion was enabled with a 50 entry exclusion list and 180 second exclusion time.

A S. cerevisiae sample digest was analyzed in a similar manner on an LTQ-FTICR (Thermo Fisher Scientific, San Jose, CA, USA) hybrid mass spectrometer. A 125-min linear acetonitrile gradient was used. Mass spectra were collected using a top-5 data dependent acquisition scheme with precursor scans acquired on the FT-ICR (50,000 resolution @ 400 m/z) in profile mode. Fragmentation spectra were acquired by resonance CID in the linear ion trap in centroid mode. Dynamic exclusion was enabled with a 50 entry list and 30 second exclusion time.

2.2 Generation of a Theoretical Fragment Ion Map for MS/MS Spectra

Previous studies demonstrate that polypeptide masses occupy “allowable regions” of the mass range of width ~0.2 m/z spaced apart by about 1.00045475 m/z [24]. We made a similar observation by analyzing fragment ion masses in spectra from our C. elegans Bibliospec library [25].

Every MS/MS spectrum and matching peptide sequence was extracted from the C. elegans Bibliospec library (ver. 5.1) of confident peptide spectrum matches. For each of these spectra, a theoretical MS/MS spectrum was generated using monoisotopic amino acid masses for singly-charged b and y ions. Intensities of theoretical fragment ions were matched to the intensity of the nearest peak in the experimental spectrum within ±0.5 m/z. The spectra were then binned with a bin width of 0.009995454567 m/z and summed to generate a theoretical fragment ion map (Figure 1). This bin width was selected to be amenable to applying the discrete fast Fourier transform (FFT) to model the location of the peaks in the theoretical ion map using a sinusoid function (see Section 2.3). Using a bin width of 0.009995454567 m/z ensures that one of the bins of the discrete FFT will correspond to the component of the data with a period of 1.00045475 m/z, which is the average distance between peaks in the theoretical fragment ion map [24, 26–30].

2.3 De Novo Calibration of MS/MS Data Using the Theoretical Fragment Ion Map

The technique used to calibrate MS/MS spectra is summarized in Figure 2. MS/MS spectra are binned with a bin width of 0.009995454567 m/z and summed to generate the observed fragment ion map. Misalignment between the observed and theoretical fragment ion maps is due to systematic mass measurement error (SMME). Therefore, the systematic mass measurement error can be determined by calculating the m/z shift required to align the observed map to the theoretical one. Because SMME can vary with respect to m/z, the SMME is calculated at an interval of every 20 m/z.

To calculate the SMME at m/z α, a subset of both fragment ion maps are analyzed between α − ε and α + ε. The value ε is the minimum value for which the total ion current in the experimental fragment ion map between α − ε and α + ε exceeds 2.5 × 10⁹ ions/s and ε ≥ 20 m/z. If the values α − ε or α + ε are outside of the range of observed m/z values, the center of the window is shifted. Adjusting the window width in this manner compensates for variability in signal intensity with respect to m/z.

To reduce the impact of noise and multiply-charged peaks on the algorithm, a 2¹⁸ point discrete fast Fourier transform is used to determine the phase of the signal component with period 1.00045475 m/z for both the theoretical (ϴt) and experimental (ϴe) ion map subsets. This is essentially a computationally efficient method of fitting a sinusoid to the observed signal. The SMME is the difference in the phase of these two frequency components (ϴe – ϴt). After SMME has been calculated for every 20 m/z interval, the points are interpolated linearly to allow for the reporting of SMME at any m/z.

FineTune was coded in C++ and compiled with GNU gcc-4.3.3 on a 64-bit system running Linux kernel v. 2.6.29.6. FineTune uses the Boost Build system to allow for cross-platform compilation (tested on Linux and Windows). FineTune can read and write mzML, mzXML, MGF, MS2, CMS2, and BMS2 formats (http://proteowizard.sourceforge.net/formats.shtml). Additionally, the Windows version can read Agilent, Bruker, Thermo, Waters, and AB-SCIEX vendor formats. Source code and binaries are available at the MacCoss lab website (http://proteome.gs.washington.edu/software/finetune).

2.4 Calculation of Mass Measurement Error by Database Searching

The systematic mass measurement error detected by FineTune is compared with that detected by analyzing confident peptide spectrum matches (PSMs) identified by SEQUEST [31] using a target-decoy strategy, and post-processed by Percolator (ver. 1.14) [32]. If precursor scans are collected on a high-resolution instrument, Bullseye [33] is used to determine more accurate precursor masses prior to database searching. A threshold for peptide-level FDR of ≤0.01 is applied to the set of PSMs. For each PSM, the mass measurement error for each theoretical singly charged b- and y-ion with a matching peak within ±0.5 m/z is determined. Mass measurement accuracy is reported as the mean of the mass measurement error and its 95 % confidence interval. When comparing mass measurement error before and after de novo calibration, the same spectra analyzed prior to calibration are analyzed post-calibration.

3 Results

3.1 A Theoretical Fragment Ion Map for Peptide Fragmentation Spectra

Mass excess is the nominal mass (i.e., mass number) of an atom subtracted from the exact mass. The 20 amino acids coded by the standard genetic code have a very similar mass excess, which causes the masses of polypeptides to group together into regions of the mass range spaced roughly 1.0005 m/z apart [28] (Figure 1). The regions of the mass range between these mass clusters have been termed “forbidden zones” [29] because it is theoretically impossible for the mass of a polypeptide built from these 20 amino acids to fall in these zones. To visualize the distribution of “allowable” fragment ion masses, MS/MS spectra in the C. elegans BiblioSpec spectrum library were summed (Figure 1). Only annotated MS/MS peaks were included in the summation, and each peak’s m/z was corrected to match its theoretical m/z. Supplementary Figure 1A shows a “zoomed out” view of the summed spectrum. Fragment ions between 200 and 800 m/z contribute roughly the same intensity to the summed spectrum throughout this range despite the greater number of fragment ions at the lower end of this range (Supplementary Figure 1B). Above 800 m/z, the summed intensity of fragment ions gradually decreases with increasing m/z.

3.2 De Novo Calibration Corrects Systematic Mass Measurement Error

FineTune was tested on 44,944 low resolution MS/MS spectra acquired by a shotgun LC-MS/MS analysis of S. cerevisiae lysate on an LTQ-Orbitrap-Velos (Thermo Fisher Scientific, San Jose, CA, USA) mass spectrometer. The recalibration determined de novo matches very closely to that determined by analyzing 6,628 confident (q ≤ 0.01) peptide spectrum matches from a database search (Figure 3a, b). The mean mass measurement error prior to de novo calibration is –0.1776 ± 0.0010 m/z (95 % confidence interval) due to calibration drift. After calibration with FineTune, the mean mass measurement error is 0.0078 ± 0.0006 m/z, thus the precision is improved by about half in addition to the improvement in mass accuracy (Figure 3a, b). The mass precision is uncharacteristically poor in this dataset to illustrate the capabilities of FineTune. This poor mass calibration is caused by detector and thus automatic gain control miscalibration resulting in significant space charge effects (Figure 3a). In a more common scenario, the mass error does not vary so drastically with m/z (Supplementary Figure 2A) and thus the improvement in the mass error spread will be more subtle. Additionally, the mass error between the caffeine and MRFA ions as well as between the MRFA and the first Ultramark ion on the LTQ-Velos (Figure 3a, Supplementary Figure 2A) is a result of the absence of m/z calibrants in that region and a non-linear response between the rf ejection frequency and m/z. The mass calibration has since been improved by the instrument manufacturer using a new calibration procedure and will be available in future releases of the Velos and Velos Pro Tune software (personal communication, Jae Schwartz).

FineTune was also tested on 21,433 MS/MS spectra acquired on the linear ion trap of an LTQ-FTICR (Thermo Fisher Scientific, San Jose, CA, USA) mass spectrometer, improving mean mass measurement error from 0.0444 ± 0.0011 m/z to 0.0055 ± 0.0011 m/z (Figure 3c, d). In contrast to the LTQ-Orbitrap-Velos data shown above, these data have very little systematic mass measurement error (SMME); the mass error distribution is centered close to zero and there is almost no variation in SMME with m/z (Figure 3c). FineTune corrects for the slight m/z-independent shift in SMME and importantly does not detect any false trends in SMME from noise in the data indicating that FineTune is applicable to data with extreme or subtle SMME.

3.3 De Novo Calibration is Robust to a Reduction in Signal

To test the response of FineTune to a reduction in signal (i.e. few peptide MS/MS spectra), FineTune was applied to the LTQ-Orbitrap-Velos data with varying numbers of MS/MS spectra removed. For each MS/MS spectrum in the data, a random number between 1 and 100 was generated, and if the number was greater than or equal to a threshold, the spectrum was removed from the data set. Therefore, if a threshold of 60 is applied, the resulting data should contain about 60 % of the original spectra.

Five “thinned” data sets were generated containing ~75 %, 50 %, 25 %, 10 %, and 1 % of the 44,944 original MS/MS spectra from the LTQ-Orbitrap-Velos data (Figure 4). Even after removing 90 % of the spectra, FineTune is extremely robust in high-signal regions of the data. In low-signal regions (e.g. m/z >1200) SMME is still improved albeit not completely corrected (Figure 4c, d). Once 99 % of the spectra are removed, FineTune detects the average mass measurement error, but not the m/z-dependent variation in systematic mass measurement error (Figure 4e, f). The mass accuracy is still improved, but the mass precision remains the same. The mean mass measurement error is –0.1776 ± 0.0010 for the uncalibrated data. The mean mass measurement error is improved in all of the aforementioned cases to 0.0162 ± 0.0009, –0.0097 ± 0.0006, 0.0078 ± 0.0006, 0.0045 ± 0.0006, 0.0064 ± 0.0006, and 0.0078 ± 0.0006 m/z when removing 99 %, 90 %, 75 %, 50 %, 25 %, and 0 % of the spectra respectively (Figure 4g). Importantly, we have yet to find a case where FineTune negatively impacts the data.

3.4 De Novo Calibration and Database Search Results

The calibrated and uncalibrated (control) LTQ-Orbitrap-Velos data was searched using X!Tandem (Cyclone 2010.12.01.1). All data was first processed by Bullseye and then searched against S. cerevisiae sequence (target) and reversed sequence (decoy) databases with a precursor mass tolerance of 10 ppm. The fragment ion tolerance for the searches was varied between ±0.01 and ±0.5 Da. The expectation values from target and decoy search results were used to determine the number of unique peptides at a false discovery rate (FDR) threshold of 0.01.

At fragment ion tolerances less than ±0.2 Da, the calibrated data returns many more peptide identifications than the data that has not been calibrated (Figure 5). For example, at a fragment ion tolerance of ±0.1 Da, the calibrated data returns 2.5 times more peptide identifications. However, at fragment ion tolerances greater than ±0.2 Da, calibrating the data does not increase the number of peptide identifications. Calibrated data had a similar but far more subtle impact on Mascot search results (Supplementary Figure 3). The impact of calibration on SEQUEST results is less straightforward due to how spectra are binned and is discussed in Supplementary Results. Using calibrated data and the optimal fragment ion tolerance/binning parameters SEQUEST, Mascot and X!Tandem identify 3,755; 2,622; and 1,710 unique peptides respectively at a q-value ≤0.01.

4 Discussion

FineTune enables the de novo, unsupervised calibration of MS/MS spectra acquired on low resolution instrumentation. The algorithm corrects nonlinear systematic mass measurement error and does not negatively-impact a data set even when it is trained on only 1 % of the spectra in the data set. This robustness, speed, low memory (~10 MB) and low processor requirements (the bottleneck is typically file I/O) make FineTune suitable for integration into an unsupervised data processing pipeline run on every data set acquired. Calibration can then be tracked over time which aids in the scheduling of instrument calibrations, especially for newer instruments which tend to drift out of calibration quickly. Additionally, the algorithm can be applied retroactively to old data that a researcher may suspect was collected on an improperly calibrated instrument.

Calibration of fragment ion masses had little impact on peptide identifications from three popular database searching algorithms. FineTune improved search results for X!Tandem at small fragment ion mass tolerance values. One might expect the search with fragment ion tolerance ±0.2 Da to perform better than larger fragment ion tolerances because ±0.2 Da is roughly the width of the fragment ion mass error distribution for the calibrated data (Figure 3b). Following this line of reasoning, when the fragment ion tolerance is made greater than ±0.2 Da the only effect would be to increase the number of incorrect peaks under consideration when calculating the match score for any particular peptide spectrum match. However, it appears that the scoring function used by X!Tandem (Figure 5) and Mascot (Supplementary Figure 3) are both unaffected by this phenomenon. Calibration does not greatly improve Mascot results at any fragment ion tolerance.

These algorithms are classified as database search algorithms due to their reliance on a database of candidate protein sequences to test each spectrum against. De novo search algorithms are designed to interpret spectra without a database of candidate sequences. Because FineTune does not rely on sequence information, it is a natural complement to de novo search algorithms often used when protein sequence information is unreliable, incomplete, or nonexistent. De novo search algorithms can benefit from improved fragment ion mass accuracy [34, 35]. Thus, in future work we will assess the impact of de novo calibration on de novo spectrum interpretation.

FineTune corrects systematic mass measurement error reliably by only making corrections when enough signal is present in the data to justify these corrections. FineTune improved the mass measurement accuracy of every data set tested to <0.01 m/z. This reliability makes FineTune suitable for use as an unsupervised preprocessor applied to any collection of MS/MS spectra prior to database searching. While improved mass measurement accuracy sometimes improves database search results (up to 40 % increase in peptide identifications), the data presented herein indicates that database search algorithms could be optimized to take better advantage of more accurate fragment ion mass measurements. In future experiments, we look to expand the algorithm to be able to correct mass measurement error trends that vary with retention time as well as test the impact of calibration on de novo spectrum interpretation algorithms.

References

Beavis, R.C., Chait, B.T.: High-accuracy molecular mass determination of proteins using matrix-assisted laser desorption mass spectrometry. Anal. Chem. 62(17), 1836–1840 (1990)
Article CAS Google Scholar
Strittmatter, E.F., Ferguson, P.L., Tang, K., Smith, R.D.: Proteome analyses using accurate mass and elution time peptide tags with capillary LC time-of-flight mass spectrometry. J. Am. Soc. Mass Spectrom. 14(9), 980–991 (2003)
Article CAS Google Scholar
Henry, K.D., Quinn, J.P., McLafferty, F.W.: High-resolution electrospray mass spectra of large molecules. J. Am. Chem. Soc. 113(14), 5447–5449 (1991)
Article CAS Google Scholar
Hannis, J.C., Muddiman, D.C.: A dual electrospray ionization source combined with hexapole accumulation to achieve high mass accuracy of biopolymers in Fourier transform ion cyclotron resonance mass spectrometry. J. Am. Soc. Mass Spectrom. 11(10), 876–883 (2000)
Article CAS Google Scholar
Olsen, J.V., de Godoy, L.M., Li, G., Macek, B., Mortensen, P., Pesch, R., Makarov, A., Lange, O., Horning, S., Mann, M.: Parts per million mass accuracy on an Orbitrap mass spectrometer via lock mass injection into a C-trap. Mol. Cell. Proteom. 4(12), 2010–2021 (2005)
Article CAS Google Scholar
Syka, J.E., Marto, J.A., Bai, D.L., Horning, S., Senko, M.W., Schwartz, J.C., Ueberheide, B., Garcia, B., Busby, S., Muratore, T., Shabanowitz, J., Hunt, D.F.: Novel linear quadrupole ion trap/FT mass spectrometer: Performance characterization and use in the comparative analysis of histone H3 post-translational modifications. J. Proteome Res. 3(3), 621–626 (2004)
Article CAS Google Scholar
Strittmatter, E.F., Rodriguez, N., Smith, R.D.: High mass measurement accuracy determination for proteomics using multivariate regression fitting: Application to electrospray ionization time-of-flight mass spectrometry. Anal. Chem. 75(3), 460–468 (2003)
Article CAS Google Scholar
Tolmachev, A.V., Monroe, M.E., Jaitly, N., Petyuk, V.A., Adkins, J.N., Smith, R.D.: Mass measurement accuracy in analyses of highly complex mixtures based upon multidimensional recalibration. Anal. Chem. 78(24), 8374–8385 (2006)
Article CAS Google Scholar
Lasonder, E., Ishihama, Y., Andersen, J.S., Vermunt, A.M., Pain, A., Sauerwein, R.W., Eling, W.M., Hall, N., Waters, A.P., Stunnenberg, H.G., Mann, M.: Analysis of the Plasmodium falciparum proteome by high-accuracy mass spectrometry. Nature 419(6906), 537–542 (2002)
Article CAS Google Scholar
Mortensen, P., Gouw, J.W., Olsen, J.V., Ong, S.E., Rigbolt, K.T., Bunkenborg, J., Cox, J., Foster, L.J., Heck, A.J., Blagoev, B., Andersen, J.S., Mann, M.: MSQuant, an open source platform for mass spectrometry-based quantitative proteomics. J. Proteome Res. 9(1), 393–403 (2010)
Article CAS Google Scholar
Petyuk, V.A., Mayampurath, A.M., Monroe, M.E., Polpitiya, A.D., Purvine, S.O., Anderson, G.A., Camp, D.G.I.I., Smith, R.D.: DtaRefinery, a software tool for elimination of systematic errors from parent ion mass measurements in tandem mass spectra data sets. Mol. Cell. Proteom. 9(3), 486–496 (2010)
Article CAS Google Scholar
Petyuk, V.A., Jaitly, N., Moore, R.J., Ding, J., Metz, T.O., Tang, K., Monroe, M.E., Tolmachev, A.V., Adkins, J.N., Belov, M.E., Dabney, A.R., Qian, W.J., Camp, D.G.I.I., Smith, R.D.: Elimination of systematic mass measurement errors in liquid chromatography-mass spectrometry based proteomics using regression models and a priori partial knowledge of the sample content. Anal. Chem. 80(3), 693–706 (2008)
Article CAS Google Scholar
Kil, Y.J., Becker, C., Sandoval, W., Goldberg, D., Bern, M.: Preview: A program for surveying shotgun proteomics tandem mass spectrometry data. Anal. Chem. 83(13), 5259–5267 (2011)
Article CAS Google Scholar
Haas, W., Faherty, B.K., Gerber, S.A., Elias, J.E., Beausoleil, S.A., Bakalarski, C.E., Li, X., Villen, J., Gygi, S.P.: Optimization and use of peptide mass measurement accuracy in shotgun proteomics. Mol. Cell. Proteom. 5(7), 1326–1337 (2006)
Article CAS Google Scholar
Bruce, J.E., Anderson, G.A., Brands, M.D., Pasa-Tolic, L., Smith, R.D.: Obtaining more accurate Fourier transform ion cyclotron resonance mass measurements without internal standards using multiply charged ions. J. Am. Soc. Mass Spectrom. 11(5), 416–421 (2000)
Article CAS Google Scholar
Kaiser, N.K., Anderson, G.A., Bruce, J.E.: Improved mass accuracy for tandem mass spectrometry. J. Am. Soc. Mass Spectrom. 16(4), 463–470 (2005)
Article CAS Google Scholar
Cox, J., Mann, M.: MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. Nat. Biotechnol. 26(12), 1367–1372 (2008)
Article CAS Google Scholar
Gras, R., Muller, M., Gasteiger, E., Gay, S., Binz, P.A., Bienvenut, W., Hoogland, C., Sanchez, J.C., Bairoch, A., Hochstrasser, D.F., Appel, R.D.: Improving protein identification from peptide mass fingerprinting through a parameterized multi-level scoring algorithm and an optimized peak detection. Electrophoresis 20(18), 3535–3550 (1999)
Article CAS Google Scholar
Wool, A., Smilansky, Z.: Precalibration of matrix-assisted laser desorption/ionization-time of flight spectra for peptide mass fingerprinting. Proteomics 2(10), 1365–1373 (2002)
Article CAS Google Scholar
Wolski, W.E., Farrow, M., Emde, A.K., Lehrach, H., Lalowski, M., Reinert, K.: Analytical model of peptide mass cluster centres with applications. Proteome Sci. 4, 18 (2006)
Article Google Scholar
Yan, B., Pan, C., Olman, V.N., Hettich, R.L., Xu, Y.: A graph-theoretic approach for the separation of b and y ions in tandem mass spectra. Bioinformatics 21(5), 563–574 (2005)
Article CAS Google Scholar
Gentzel, M., Kocher, T., Ponnusamy, S., Wilm, M.: Preprocessing of tandem mass spectrometric data to support automatic protein identification. Proteomics 3(8), 1597–1610 (2003)
Article CAS Google Scholar
Matthiesen, R., Bunkenborg, J., Stensballe, A., Jensen, O.N., Welinder, K.G., Bauw, G.: Database-independent, database-dependent, and extended interpretation of peptide mass spectra in VEMS V2.0. Proteomics 4(9), 2583–2593 (2004)
Article CAS Google Scholar
Nefedov, A.V., Mitra, I., Brasier, A.R., Sadygov, R.G.: Examining troughs in the mass distribution of all theoretically possible tryptic peptides. J. Proteome Res. 10(9), 4150–4157 (2011)
Article CAS Google Scholar
Frewen, B.E., Merrihew, G.E., Wu, C.C., Noble, W.S., MacCoss, M.J.: Analysis of peptide MS/MS spectra from large-scale proteomics experiments using spectrum libraries. Anal. Chem. 78(16), 5678–5684 (2006)
Article CAS Google Scholar
Gay, S., Binz, P.A., Hochstrasser, D.F., Appel, R.D.: Modeling peptide mass fingerprinting data using the atomic composition of peptides. Electrophoresis 20(18), 3527–3534 (1999)
Article CAS Google Scholar
Yates III, J.R., Eng, J.K., Clauser, K.R., Burlingame, A.L.: Search of sequence databases with uninterpreted high-energy collision-induced dissociation spectra of peptides. J. Am. Soc. Mass Spectrom. 7(11), 1089–1098 (1996)
Article Google Scholar
Mann, M. Useful Tables of Possible and Probable Peptide Masses. Proceedings of the American Society of Mass Spectrometry Annual Conference on Mass Spectrometry and Allied Topics, Atlanta, GA, (1995)
Frahm, J.L., Howard, B.E., Heber, S., Muddiman, D.C.: Accessible proteomics space and its implications for peak capacity for zero-, one-, and two-dimensional separations coupled with FT-ICR and TOF mass spectrometry. J. Mass Spectrom. 41(3), 281–288 (2006)
Article CAS Google Scholar
Demirev, P.A., Zubarev, R.A.: Probing combinatorial library diversity by mass spectrometry. Anal. Chem. 69(15), 2893–2900 (1997)
Article CAS Google Scholar
Eng, J.K., McCormack, A.L., Yates III, J.R.: An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database. J. Am. Soc. Mass Spectrom. 5(11), 976–989 (1994)
Article CAS Google Scholar
Kall, L., Canterbury, J.D., Weston, J., Noble, W.S., MacCoss, M.J.: Semi-supervised learning for peptide identification from shotgun proteomics datasets. Nat. Methods 4(11), 923–925 (2007)
Article Google Scholar
Hsieh, E.J., Hoopmann, M.R., MacLean, B., MacCoss, M.J.: Comparison of database search strategies for high precursor mass accuracy MS/MS data. J. Proteome Res. 9(2), 1138–1143 (2010)
Article CAS Google Scholar
Grossmann, J., Roos, F.F., Cieliebak, M., Liptak, Z., Mathis, L.K., Muller, M., Gruissem, W., Baginsky, S.: AUDENS: A tool for automated peptide de novo sequencing. J. Proteome Res. 4(5), 1768–1774 (2005)
Article CAS Google Scholar
Horn, D.M., Zubarev, R.A., McLafferty, F.W.: Automated de novo sequencing of proteins by tandem high-resolution mass spectrometry. Proc. Nat. Acad. Sci. U.S.A. 97(19), 10313–10317 (2000)
Article CAS Google Scholar

Download references

Acknowledgments

The authors acknowledge support for this work in part by National Institutes of Health Grants P41 GM103533, F31 AG037265, R01 DK069386, and the University of Washington’s Proteomics Resource (UWPR95794). They acknowledge Jesse D. Canterbury for helpful discussion regarding the calibration algorithm and LTQ-Velos Pro calibration routine, Jae Schwartz and Philip M. Remes for helpful discussion regarding the m/z error distribution observed on the LTQ-Velos, and Vagisha Sharma and Barbara Frewen for help with extracting MS/MS spectra from Bibliospec libraries.

Author information

Authors and Affiliations

Department of Genome Sciences, University of Washington, Seattle, WA, 98195, USA
Jarrett D. Egertson, Jimmy K. Eng, Michael S. Bereman, Edward J. Hsieh, Gennifer E. Merrihew & Michael J. MacCoss

Authors

Jarrett D. Egertson
View author publications
You can also search for this author in PubMed Google Scholar
Jimmy K. Eng
View author publications
You can also search for this author in PubMed Google Scholar
Michael S. Bereman
View author publications
You can also search for this author in PubMed Google Scholar
Edward J. Hsieh
View author publications
You can also search for this author in PubMed Google Scholar
Gennifer E. Merrihew
View author publications
You can also search for this author in PubMed Google Scholar
Michael J. MacCoss
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Michael J. MacCoss.

Electronic Supplementary Material

Below is the link to the electronic supplementary material.

ESM 1

(PDF 609 KB)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Egertson, J.D., Eng, J.K., Bereman, M.S. et al. De Novo Correction of Mass Measurement Error in Low Resolution Tandem MS Spectra for Shotgun Proteomics. J. Am. Soc. Mass Spectrom. 23, 2075–2082 (2012). https://doi.org/10.1007/s13361-012-0482-z

Download citation

Received: 07 June 2012
Revised: 17 August 2012
Accepted: 18 August 2012
Published: 25 September 2012
Issue Date: December 2012
DOI: https://doi.org/10.1007/s13361-012-0482-z

Key words

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

De Novo Correction of Mass Measurement Error in Low Resolution Tandem MS Spectra for Shotgun Proteomics

Abstract

Similar content being viewed by others

Statistical control of peptide and protein error rates in large-scale targeted data-independent acquisition analyses

The Analysis of Peptide-Centric Mass-Spectrometry Data Utilizing Information About the Expected Isotope Distribution

DIALib-QC an assessment tool for spectral libraries in data-independent acquisition proteomics

1 Introduction

2 Materials and Methods

2.1 Collection of Data for Testing the De Novo Calibration Algorithm

2.2 Generation of a Theoretical Fragment Ion Map for MS/MS Spectra

2.3 De Novo Calibration of MS/MS Data Using the Theoretical Fragment Ion Map

2.4 Calculation of Mass Measurement Error by Database Searching

3 Results

3.1 A Theoretical Fragment Ion Map for Peptide Fragmentation Spectra

3.2 De Novo Calibration Corrects Systematic Mass Measurement Error

3.3 De Novo Calibration is Robust to a Reduction in Signal

3.4 De Novo Calibration and Database Search Results

4 Discussion

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Electronic Supplementary Material

ESM 1

Rights and permissions

About this article

Cite this article

Key words

Navigation

De Novo Correction of Mass Measurement Error in Low Resolution Tandem MS Spectra for Shotgun Proteomics

Abstract

Similar content being viewed by others

Statistical control of peptide and protein error rates in large-scale targeted data-independent acquisition analyses

The Analysis of Peptide-Centric Mass-Spectrometry Data Utilizing Information About the Expected Isotope Distribution

DIALib-QC an assessment tool for spectral libraries in data-independent acquisition proteomics

1 Introduction

2 Materials and Methods

2.1 Collection of Data for Testing the De Novo Calibration Algorithm

2.2 Generation of a Theoretical Fragment Ion Map for MS/MS Spectra

2.3 De Novo Calibration of MS/MS Data Using the Theoretical Fragment Ion Map

2.4 Calculation of Mass Measurement Error by Database Searching

3 Results

3.1 A Theoretical Fragment Ion Map for Peptide Fragmentation Spectra

3.2 De Novo Calibration Corrects Systematic Mass Measurement Error

3.3 De Novo Calibration is Robust to a Reduction in Signal

3.4 De Novo Calibration and Database Search Results

4 Discussion

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Electronic Supplementary Material

ESM 1

Rights and permissions

About this article

Cite this article

Share this article

Key words

Search

Navigation