1 Introduction

The field of proteomics has experienced significant growth in the past decade due to advancements in mass spectrometer instrumentation and computational tools for data interpretation. Instrument scan speed, dynamic range, sensitivity, resolution, and mass measurement accuracy (MMA) continue to improve, allowing for more comprehensive analysis of complex protein digests.

Systematic mass measurement error (SMME) is typically corrected by routine instrument calibration. External calibration is performed by analyzing a standard with molecules of known elemental composition and calibrating the instrument to match the measurements of the calibrant ions to their known mass-to-charge (m/z). Ion species covering a wide range of m/z are often used for calibration because systematic mass measurement error can vary with m/z. Over time, the mass calibration drifts, requiring periodic recalibration. Even the most sophisticated instruments will have some systematic and/or random mass measurement error specifically if they are not properly calibrated.

Mass measurement accuracy can be further improved by analyzing calibrants spiked into each sample (i.e., internal calibration). Internal calibration is usually coupled with high-resolution mass analyzers (e.g., TOF [1, 2], FTICR [3, 4], Orbitrap [5]) where instrument factors such as space charge effects, electric fields, peak intensity, and mass analyzer temperature vary during the course of an analysis. These factors ultimately cause mass measurements to deviate. On trapping instruments, the implementation of automatic gain control helps alleviate the mass deviations due to space charge by controlling for total ion population [6]. As a result, internal calibration is not necessarily needed in these instruments to achieve high MMA and can be detrimental to performance due to the loss in sensitivity and dynamic range associated with introducing a calibrant ion.

Computational calibration techniques aim to match the robustness of internal calibration without requiring the addition of specific calibrants. To calibrate precursor data, ion species known to be present in a sample can be used as internal calibrants in lieu of spiked in calibrants. Peptides expected to be present in a data set a priori [2, 7, 8], peptides confidently identified by database searching [913], or commonly observed contaminants can be used in this manner [14]. These techniques improve mass measurement accuracy but will likely struggle with low resolution data due to the difficulty of unambiguously mapping theoretical ion species to low resolution precursor features. Charge state pairs can be used to detect frequency shifts in precursor spectra without a priori knowledge of sample content [1517]. Unfortunately, the reliance of this technique on resolving charge-state precludes its application to low-resolution data where resolving power is too low to resolve isotopic peaks for charge state determination. Monoisotopic, singly-charged peak masses in peptide mass fingerprinting data collected on MALDI-TOF instruments can be calibrated without a priori knowledge of the sample content [1820].

Fewer computational techniques exist for the calibration of MS/MS spectra. Fragmentation spectra can be calibrated de novo by detection of type-1 peak edges [21] or by analysis of trypsin auto-lysis products [22]. Confident peptide-spectrum matches from a database search can be used to calibrate MS/MS data as well [23], although such methods can be computationally expensive and require protein annotations from a genome sequence.

Herein, we demonstrate that low resolution ion trap instruments can be subject to systematic mass measurement error even after external calibration. We implement a novel method for de novo calibration of peptide MS/MS data collected on low resolution instruments capable of improving mass measurement accuracy and mass measurement precision quickly (<1 minute) without the need for protein annotations or even knowledge of the organism(s) being analyzed. The algorithm is implemented in a freely-available, open-source software package named “FineTune.” We demonstrate improved mass measurement accuracy after calibration with FineTune for MS/MS spectra acquired on both an LTQ and LTQ-Velos mass spectrometer by analyzing mass measurement error in confident peptide-spectrum matches pre and post-calibration. The robustness of FineTune is demonstrated by successfully calibrating data sets with only a small percentage of the total spectra used as input for the calibration. The impact of calibrating MS/MS spectra on database search results is tested using the Mascot, SEQUEST, and X!Tandem search algorithms.

2 Materials and Methods

2.1 Collection of Data for Testing the De Novo Calibration Algorithm

A S. cerevisiae sample digest was analyzed by LC-MS/MS on an LTQ-Orbitrap-Velos (Thermo Fisher Scientific, San Jose, CA, USA) hybrid mass spectrometer. Peptides were separated by reversed-phase high performance liquid chromatography (RP-HPLC) across a 100-min linear acetonitrile gradient on a 40 cm column with 75 μM inner-diameter. Mass spectra were collected using a top-13 data-dependent acquisition scheme with precursor scans acquired in the Orbitrap (60,000 resolving power @ 400 m/z) in profile mode. Peptide fragmentation by resonance collision induced dissociation (CID) and subsequent mass analysis was executed in the dual pressure linear-ion trap in centroid mode. Dynamic exclusion was enabled with a 50 entry exclusion list and 180 second exclusion time.

A S. cerevisiae sample digest was analyzed in a similar manner on an LTQ-FTICR (Thermo Fisher Scientific, San Jose, CA, USA) hybrid mass spectrometer. A 125-min linear acetonitrile gradient was used. Mass spectra were collected using a top-5 data dependent acquisition scheme with precursor scans acquired on the FT-ICR (50,000 resolution @ 400 m/z) in profile mode. Fragmentation spectra were acquired by resonance CID in the linear ion trap in centroid mode. Dynamic exclusion was enabled with a 50 entry list and 30 second exclusion time.

2.2 Generation of a Theoretical Fragment Ion Map for MS/MS Spectra

Previous studies demonstrate that polypeptide masses occupy “allowable regions” of the mass range of width ~0.2 m/z spaced apart by about 1.00045475 m/z [24]. We made a similar observation by analyzing fragment ion masses in spectra from our C. elegans Bibliospec library [25].

Every MS/MS spectrum and matching peptide sequence was extracted from the C. elegans Bibliospec library (ver. 5.1) of confident peptide spectrum matches. For each of these spectra, a theoretical MS/MS spectrum was generated using monoisotopic amino acid masses for singly-charged b and y ions. Intensities of theoretical fragment ions were matched to the intensity of the nearest peak in the experimental spectrum within ±0.5 m/z. The spectra were then binned with a bin width of 0.009995454567 m/z and summed to generate a theoretical fragment ion map (Figure 1). This bin width was selected to be amenable to applying the discrete fast Fourier transform (FFT) to model the location of the peaks in the theoretical ion map using a sinusoid function (see Section 2.3). Using a bin width of 0.009995454567 m/z ensures that one of the bins of the discrete FFT will correspond to the component of the data with a period of 1.00045475 m/z, which is the average distance between peaks in the theoretical fragment ion map [24, 2630].

Figure 1
figure 1

Theoretical fragment ion map. The theoretical fragment ion map is plotted for m/z 635–695. The map was generated by summing over 100,000 fragmentation spectra. The peaks repeating every ~1.0005 m/z represent regions where a singly charged b- or y-ion is theoretically likely to occur in a peptide fragmentation spectrum. The inset is a zoomed view of a subset of the fragment ion map. Notably, there are repeating regions in m/z-space where these peaks do not occur attributable to most peptides having a similar mass excess

2.3 De Novo Calibration of MS/MS Data Using the Theoretical Fragment Ion Map

The technique used to calibrate MS/MS spectra is summarized in Figure 2. MS/MS spectra are binned with a bin width of 0.009995454567 m/z and summed to generate the observed fragment ion map. Misalignment between the observed and theoretical fragment ion maps is due to systematic mass measurement error (SMME). Therefore, the systematic mass measurement error can be determined by calculating the m/z shift required to align the observed map to the theoretical one. Because SMME can vary with respect to m/z, the SMME is calculated at an interval of every 20 m/z.

Figure 2
figure 2

Steps for de novo calibration of MS/MS spectra. (a) All MS/MS spectra in the input file are binned and added to each other to generate an empirical fragment ion map for the file. This step only happens once for each file being calibrated. (b) Systematic mass measurement error (SMME) is detected in 20 m/z intervals along the m/z range of the empirical fragment ion map. To detect the SMME at m/z α: (i) Analyze a slice of the empirical ion map with boundaries α ± ε such that there is adequate signal for detection of SMME. The full empirical ion map (top) and a zoomed in slice of the empirical ion map (bottom) are shown. (ii) Apply a discrete fast Fourier transform (DFFT) to this slice to isolate the frequency component of the signal with period 1.00045475 m/z (red). This isolated sinusoid component fits the observed peak clusters (blue). (iii) Use the same technique as in steps i and ii to isolate the same frequency component from the theoretical ion map. The difference in phases of the empirical and theoretical frequency components is the systematic mass measurement error at m/z α. Once SMME has been detected at each 20 m/z interval, use linear interpolation to determine the SMME at any m/z between these intervals

To calculate the SMME at m/z α, a subset of both fragment ion maps are analyzed between α − ε and α + ε. The value ε is the minimum value for which the total ion current in the experimental fragment ion map between α − ε and α + ε exceeds 2.5 × 109 ions/s and ε ≥ 20 m/z. If the values α − ε or α + ε are outside of the range of observed m/z values, the center of the window is shifted. Adjusting the window width in this manner compensates for variability in signal intensity with respect to m/z.

To reduce the impact of noise and multiply-charged peaks on the algorithm, a 218 point discrete fast Fourier transform is used to determine the phase of the signal component with period 1.00045475 m/z for both the theoretical (ϴt) and experimental (ϴe) ion map subsets. This is essentially a computationally efficient method of fitting a sinusoid to the observed signal. The SMME is the difference in the phase of these two frequency components (ϴe – ϴt). After SMME has been calculated for every 20 m/z interval, the points are interpolated linearly to allow for the reporting of SMME at any m/z.

FineTune was coded in C++ and compiled with GNU gcc-4.3.3 on a 64-bit system running Linux kernel v. 2.6.29.6. FineTune uses the Boost Build system to allow for cross-platform compilation (tested on Linux and Windows). FineTune can read and write mzML, mzXML, MGF, MS2, CMS2, and BMS2 formats (http://proteowizard.sourceforge.net/formats.shtml). Additionally, the Windows version can read Agilent, Bruker, Thermo, Waters, and AB-SCIEX vendor formats. Source code and binaries are available at the MacCoss lab website (http://proteome.gs.washington.edu/software/finetune).

2.4 Calculation of Mass Measurement Error by Database Searching

The systematic mass measurement error detected by FineTune is compared with that detected by analyzing confident peptide spectrum matches (PSMs) identified by SEQUEST [31] using a target-decoy strategy, and post-processed by Percolator (ver. 1.14) [32]. If precursor scans are collected on a high-resolution instrument, Bullseye [33] is used to determine more accurate precursor masses prior to database searching. A threshold for peptide-level FDR of ≤0.01 is applied to the set of PSMs. For each PSM, the mass measurement error for each theoretical singly charged b- and y-ion with a matching peak within ±0.5 m/z is determined. Mass measurement accuracy is reported as the mean of the mass measurement error and its 95 % confidence interval. When comparing mass measurement error before and after de novo calibration, the same spectra analyzed prior to calibration are analyzed post-calibration.

3 Results

3.1 A Theoretical Fragment Ion Map for Peptide Fragmentation Spectra

Mass excess is the nominal mass (i.e., mass number) of an atom subtracted from the exact mass. The 20 amino acids coded by the standard genetic code have a very similar mass excess, which causes the masses of polypeptides to group together into regions of the mass range spaced roughly 1.0005 m/z apart [28] (Figure 1). The regions of the mass range between these mass clusters have been termed “forbidden zones” [29] because it is theoretically impossible for the mass of a polypeptide built from these 20 amino acids to fall in these zones. To visualize the distribution of “allowable” fragment ion masses, MS/MS spectra in the C. elegans BiblioSpec spectrum library were summed (Figure 1). Only annotated MS/MS peaks were included in the summation, and each peak’s m/z was corrected to match its theoretical m/z. Supplementary Figure 1A shows a “zoomed out” view of the summed spectrum. Fragment ions between 200 and 800 m/z contribute roughly the same intensity to the summed spectrum throughout this range despite the greater number of fragment ions at the lower end of this range (Supplementary Figure 1B). Above 800 m/z, the summed intensity of fragment ions gradually decreases with increasing m/z.

3.2 De Novo Calibration Corrects Systematic Mass Measurement Error

FineTune was tested on 44,944 low resolution MS/MS spectra acquired by a shotgun LC-MS/MS analysis of S. cerevisiae lysate on an LTQ-Orbitrap-Velos (Thermo Fisher Scientific, San Jose, CA, USA) mass spectrometer. The recalibration determined de novo matches very closely to that determined by analyzing 6,628 confident (q ≤ 0.01) peptide spectrum matches from a database search (Figure 3a, b). The mean mass measurement error prior to de novo calibration is –0.1776 ± 0.0010 m/z (95 % confidence interval) due to calibration drift. After calibration with FineTune, the mean mass measurement error is 0.0078 ± 0.0006 m/z, thus the precision is improved by about half in addition to the improvement in mass accuracy (Figure 3a, b). The mass precision is uncharacteristically poor in this dataset to illustrate the capabilities of FineTune. This poor mass calibration is caused by detector and thus automatic gain control miscalibration resulting in significant space charge effects (Figure 3a). In a more common scenario, the mass error does not vary so drastically with m/z (Supplementary Figure 2A) and thus the improvement in the mass error spread will be more subtle. Additionally, the mass error between the caffeine and MRFA ions as well as between the MRFA and the first Ultramark ion on the LTQ-Velos (Figure 3a, Supplementary Figure 2A) is a result of the absence of m/z calibrants in that region and a non-linear response between the rf ejection frequency and m/z. The mass calibration has since been improved by the instrument manufacturer using a new calibration procedure and will be available in future releases of the Velos and Velos Pro Tune software (personal communication, Jae Schwartz).

Figure 3
figure 3

Robust de novo detection of systematic mass measurement error. In (a) and (c) the mass measurement error detected by FineTune is compared with that detected by analyzing peptide-spectrum matches from a database search on the LTQ-Orbitrap-Velos (6628 PSMs) and LTQ-FTICR (4,961 PSMs) data, respectively. The heatmap in the background is a two-dimensional histogram of the mass error determined by comparing experimental fragment ion masses to theoretical masses from confident peptide-spectrum matches. The log10(total intensity) for peaks falling in any bin is mapped to color as indicated in the colorbar. The black curve on top of the heatmap is the systematic mass measurement error detected by FineTune. The cross hairs indicate the theoretical m/z of the calibrant ions in the manufacturer-supplied calibration mix. Next to each two-dimensional histogram is a one-dimensional histogram of mass measurement error detected from the confident peptide spectrum matches. (b) and (d) show the mass measurement error after using FineTune to calibrate the LTQ-Orbitrap-Velos and LTQ-FTICR data respectively. The data for these plots was generated using the same set of peptide-spectrum-matches as for (a) and (c) but with calibrated fragment ion masses

FineTune was also tested on 21,433 MS/MS spectra acquired on the linear ion trap of an LTQ-FTICR (Thermo Fisher Scientific, San Jose, CA, USA) mass spectrometer, improving mean mass measurement error from 0.0444 ± 0.0011 m/z to 0.0055 ± 0.0011 m/z (Figure 3c, d). In contrast to the LTQ-Orbitrap-Velos data shown above, these data have very little systematic mass measurement error (SMME); the mass error distribution is centered close to zero and there is almost no variation in SMME with m/z (Figure 3c). FineTune corrects for the slight m/z-independent shift in SMME and importantly does not detect any false trends in SMME from noise in the data indicating that FineTune is applicable to data with extreme or subtle SMME.

3.3 De Novo Calibration is Robust to a Reduction in Signal

To test the response of FineTune to a reduction in signal (i.e. few peptide MS/MS spectra), FineTune was applied to the LTQ-Orbitrap-Velos data with varying numbers of MS/MS spectra removed. For each MS/MS spectrum in the data, a random number between 1 and 100 was generated, and if the number was greater than or equal to a threshold, the spectrum was removed from the data set. Therefore, if a threshold of 60 is applied, the resulting data should contain about 60 % of the original spectra.

Five “thinned” data sets were generated containing ~75 %, 50 %, 25 %, 10 %, and 1 % of the 44,944 original MS/MS spectra from the LTQ-Orbitrap-Velos data (Figure 4). Even after removing 90 % of the spectra, FineTune is extremely robust in high-signal regions of the data. In low-signal regions (e.g. m/z >1200) SMME is still improved albeit not completely corrected (Figure 4c, d). Once 99 % of the spectra are removed, FineTune detects the average mass measurement error, but not the m/z-dependent variation in systematic mass measurement error (Figure 4e, f). The mass accuracy is still improved, but the mass precision remains the same. The mean mass measurement error is –0.1776 ± 0.0010 for the uncalibrated data. The mean mass measurement error is improved in all of the aforementioned cases to 0.0162 ± 0.0009, –0.0097 ± 0.0006, 0.0078 ± 0.0006, 0.0045 ± 0.0006, 0.0064 ± 0.0006, and 0.0078 ± 0.0006 m/z when removing 99 %, 90 %, 75 %, 50 %, 25 %, and 0 % of the spectra respectively (Figure 4g). Importantly, we have yet to find a case where FineTune negatively impacts the data.

Figure 4
figure 4

De novo calibration is robust to a reduction in signal. Mass measurement error heatmaps were generated using all 6628 significant peptide spectrum matches in the LTQ-Orbitrap-Velos data as described in Figure 3 and in Section 2. (a), (c), and (e) depict the systematic mass measurement error detected by FineTune using 25 %, 10 %, and 1 % of the (44,944) MS/MS spectra from the original file, respectively; (b), (d), and (f) depict the mass measurement error after correcting for the systematic mass measurement error detected in (a), (c), and (e), respectively; (g) shows the median mass measurement error from the 6628 PSMs with no calibration and after calibration by FineTune with varying percentages of the original spectra as input. The whiskers are 1.5 times the inner quartile

3.4 De Novo Calibration and Database Search Results

The calibrated and uncalibrated (control) LTQ-Orbitrap-Velos data was searched using X!Tandem (Cyclone 2010.12.01.1). All data was first processed by Bullseye and then searched against S. cerevisiae sequence (target) and reversed sequence (decoy) databases with a precursor mass tolerance of 10 ppm. The fragment ion tolerance for the searches was varied between ±0.01 and ±0.5 Da. The expectation values from target and decoy search results were used to determine the number of unique peptides at a false discovery rate (FDR) threshold of 0.01.

At fragment ion tolerances less than ±0.2 Da, the calibrated data returns many more peptide identifications than the data that has not been calibrated (Figure 5). For example, at a fragment ion tolerance of ±0.1 Da, the calibrated data returns 2.5 times more peptide identifications. However, at fragment ion tolerances greater than ±0.2 Da, calibrating the data does not increase the number of peptide identifications. Calibrated data had a similar but far more subtle impact on Mascot search results (Supplementary Figure 3). The impact of calibration on SEQUEST results is less straightforward due to how spectra are binned and is discussed in Supplementary Results. Using calibrated data and the optimal fragment ion tolerance/binning parameters SEQUEST, Mascot and X!Tandem identify 3,755; 2,622; and 1,710 unique peptides respectively at a q-value ≤0.01.

Figure 5
figure 5

De novo calibration improves X!Tandem search results. The LTQ-Orbitrap-Velos data was searched with X!Tandem using a target-decoy strategy to determine q values for each unique peptide from reported expect scores. The number of unique peptides identified by X!Tandem for the control and calibrated data with q ≤ 0.01 was compared using various fragment ion tolerances in X!Tandem

4 Discussion

FineTune enables the de novo, unsupervised calibration of MS/MS spectra acquired on low resolution instrumentation. The algorithm corrects nonlinear systematic mass measurement error and does not negatively-impact a data set even when it is trained on only 1 % of the spectra in the data set. This robustness, speed, low memory (~10 MB) and low processor requirements (the bottleneck is typically file I/O) make FineTune suitable for integration into an unsupervised data processing pipeline run on every data set acquired. Calibration can then be tracked over time which aids in the scheduling of instrument calibrations, especially for newer instruments which tend to drift out of calibration quickly. Additionally, the algorithm can be applied retroactively to old data that a researcher may suspect was collected on an improperly calibrated instrument.

Calibration of fragment ion masses had little impact on peptide identifications from three popular database searching algorithms. FineTune improved search results for X!Tandem at small fragment ion mass tolerance values. One might expect the search with fragment ion tolerance ±0.2 Da to perform better than larger fragment ion tolerances because ±0.2 Da is roughly the width of the fragment ion mass error distribution for the calibrated data (Figure 3b). Following this line of reasoning, when the fragment ion tolerance is made greater than ±0.2 Da the only effect would be to increase the number of incorrect peaks under consideration when calculating the match score for any particular peptide spectrum match. However, it appears that the scoring function used by X!Tandem (Figure 5) and Mascot (Supplementary Figure 3) are both unaffected by this phenomenon. Calibration does not greatly improve Mascot results at any fragment ion tolerance.

These algorithms are classified as database search algorithms due to their reliance on a database of candidate protein sequences to test each spectrum against. De novo search algorithms are designed to interpret spectra without a database of candidate sequences. Because FineTune does not rely on sequence information, it is a natural complement to de novo search algorithms often used when protein sequence information is unreliable, incomplete, or nonexistent. De novo search algorithms can benefit from improved fragment ion mass accuracy [34, 35]. Thus, in future work we will assess the impact of de novo calibration on de novo spectrum interpretation.

FineTune corrects systematic mass measurement error reliably by only making corrections when enough signal is present in the data to justify these corrections. FineTune improved the mass measurement accuracy of every data set tested to <0.01 m/z. This reliability makes FineTune suitable for use as an unsupervised preprocessor applied to any collection of MS/MS spectra prior to database searching. While improved mass measurement accuracy sometimes improves database search results (up to 40 % increase in peptide identifications), the data presented herein indicates that database search algorithms could be optimized to take better advantage of more accurate fragment ion mass measurements. In future experiments, we look to expand the algorithm to be able to correct mass measurement error trends that vary with retention time as well as test the impact of calibration on de novo spectrum interpretation algorithms.