Introduction

The application of tandem mass spectrometry of protonated peptides as a powerful analytical technique for determining the amino acid identities and the sequence of peptides and proteins has been essentially developed in last 30 y. Numerous studies regarding the interpretation of protonated peptide spectra and the ion fragmentation mechanisms have been published [1,2,3,4,5]. In comparison with the well documented mass spectrometry of protonated peptides, mass spectrometry of deprotonated peptides has received less attention in proteomic research, although spectra of negative ions of peptides provide an important complement to positive ion spectra in sequencing peptides in many cases. Bowie and coworkers studied the mass spectrometry of deprotonated peptides [6,7,8,9,10,11,12,13,14,15,16,17,18,19,20] and described the fundamental backbone cleavages and the characteristic side-chain fragmentations of amino acids residues. These studies also demonstrated that often there are fragmentation pathways that depend on the specific side-chain structure of the amino acid. Harrison and coworkers [21,22,23,24,25] reported the sequence-specific fragmentation of deprotonated peptides containing alkyl side chains and indicated that there were more extensive sequence-specific fragmentations using low energy collision-induced dissociation (CID) mode instead of high energy CID. Subsequent studies were carried out on deprotonated peptides containing phenylalanine, glutamic acid, and proline. Cassady and coworkers [26,27,28,29,30,31] and other groups [32, 33] also have studied the fragmentations of a variety of deprotonated peptides.

The studies mentioned above show that the side chains of some amino acids affect the fragmentation reactions of deprotonated peptides, both in peptide backbone cleavage and in side-chain fragmentation. Peptide bond dissociation of the negative ions can provide sequence information similar to that obtained from positive peptide ions dissociation, although the negative ions exhibit additional dissociation pathways (such as c-ions). Fragmentation of the amino acid side chain is especially important in the identification of certain amino acid residues by their characteristic fragments. Bowie and coworkers [15, 17, 20] published a table that includes side-chain fragmentations for 15 amino acid residues in small peptides; many of them cannot be observed in the spectra of peptides that contain more than four amino acid residues. In this work, we studied the MS/MS spectra of deprotonated peptides using 390 dipeptides and many longer peptides, mainly commercial bioactive peptides. The spectra were acquired with a Thermo Orbitrap Elite mass spectrometer to collect both ion-trap and HCD (Higher Energy Collision Dissociation) spectra for inclusion in the NIST MS/MS library [34]. We extend the previous results on the side chain neutral losses and also present a comprehensive fragment ion list from 17 amino acid residues, confirmed with high mass accuracy. These fragment ions include those produced from the side chains as well as the fragments produced from whole amino acid residues. Statistical methods were applied to estimate the relative abundance of the characteristic side chain fragmentations.

Experimental1

The dipeptides were purchased from GenScript (Atlanta, GA, USA), LifeTein (Somerset, NJ, USA, or Sigma-Aldrich (St. Louis, MO, USA) and the other peptides were purchased mainly from American Peptide Company (Sunnyvale, CA, USA). The peptides were dissolved in acetonitrile/water/formic acid (50:50:0.1) (v:v:v) at a concentration of about 0.1 mg/mL and infused into an Orbitrap Elite mass spectrometer (Thermo Fisher Scientific, Waltham, MA, USA) via a nano-electrospray source. Ion trap (IT and FT-CID) and HCD spectra were acquired in the positive and negative mode. The gases N2 (99.999 %) and He (99.999 %) were utilized as collision gases for the HCD and IT spectra, respectively. Ion trap spectra were acquired at a relative collision energy of 35%. The HCD spectra were collected by using up to 18 different collision voltages ranging from 2 V to 180 V, which is beyond the voltage where no precursor ions remained. The resolution for MS2 was set at 30,000. Spectra were acquired in ‘profile’ mode for both FT-CID and HCD.Footnote 1

Results and Discussion

Tandem mass spectra of 1800 peptides, as protonated and deprotonated ions, were acquired at different collision energies, with high mass accuracy, and are included in the 2017 version of the NIST tandem mass spectral library. The results for the 527 deprotonated peptides containing two to six amino acid residues were analyzed and are discussed below.

Neutral Losses from the Side Chains of Amino Acid Residues

Spectra of negative ions of peptides are as informative as those of positive ions since sequence information is provided from backbone cleavage. Both positive and negative peptide ions produce the corresponding y-, b-, and a-ions, but the negative peptides also produce abundant c-ions and rarely observed x- and z-ions. Additionally, there are certain side-chain cleavage reactions of negative peptide ions that readily identify particular amino acid residues. Bowie and coworkers [6,7,8,9,10,11,12,13,14,15,16,17,18,19,20] reported some characteristic negative-ion fragmentations of side chains of amino acid residues, which are useful to identify these residues and to interpret mass spectra. However, a limited number of peptides were studied to address the neutral losses of side chains. It appears that many of the neutral losses that were proposed are not present in the spectra of peptides that contain more than four amino acid residues. The results presented here from measurement of 527 peptides, comprised of 390 dipeptides and 137 peptides with three to six amino acids, using high mass accuracy, provide a more detailed analysis and permit us to estimate the relative abundances of the neutral losses from the side chains of amino acid residues. The spectra of several hundred peptides containing seven or more amino acids were deposited in the NIST library, but not used in the statistical analysis because correlations with specific residues became less certain.

The side chain neutral losses from deprotonated peptides are summarized in Table 1. For each deprotonated peptide, we searched for all the ions that were produced by side-chain neutral loss from each amino acid residue. Using the HCD and FT-IT spectra, we identified the most abundant peak for neutral loss from each residue, including loss from the precursor ion as well as losses from y-, b-, a-, or c-ions. Then for each amino acid we averaged the abundances of all the peaks observed with all the peptides containing that residue, including the results for peptides which contain that residue but lack the corresponding peak in their spectra. The most frequently observed ion with the highest peak intensity is that for the loss of C2H4O (acetaldehyde) from threonine. This loss was observed in all 52 spectra of peptides containing threonine, with an average intensity of 78.4% of base peak. The abundance of this neutral loss peak was set as 100 and the losses from the other amino acids were estimated relative to threonine. It should be noted that a similar calculation for the spectra of 475 peptides which do not contain threonine gave a value of 0.2%, probably due to incidental small peaks. This small percentage was not deducted from the positive result except as discussed below for the loss of water and CO2.

Table 1 Neutral Losses from Side Chains of Amino Acid Residues in the Tandem Mass Spectra of Deprotonated Peptides and Their Estimated Relative Abundances

The second most abundant neutral loss is that of CH2O (formaldehyde) from serine. It was observed in 61 out of 65 peptides containing serine, with an average intensity of 66%. Its relative abundance compared with threonine is estimated as 80 (Table 1). Other neutral losses listed in Table 1 occur with specific amino acids and their abundances were estimated from peptides containing the specific residue. Tryptophan loses the whole indolylmethyl side chain but arginine loses only the tail end of the side chain, CH2N2, with the same relative abundance, 60. Tyrosine, histidine, and phenylalanine lose the whole side chain with decreasing relative abundances, 20, 6, and 3, respectively. It is noted, however, that phenylalanine loses C6H5CH3 whereas tyrosine loses an oxidized form, CH2=C6H4=O. Histidine also loses an oxidized form of its side chain.

Loss of H2O and CO2, which take place from the side chains of aspartic and glutamic acids, also occur from the terminal carboxyl group of many peptides not containing D or E. Therefore, to estimate the abundance of these losses from D and E, we compared their abundances in peptides containing D or E with those in peptides containing neither D nor E. Loss of water was found to be five to seven times more abundant if the peptide contained D or E, respectively. The relative abundances given for the loss of water in Table 1 were estimated by subtracting the abundance in peptides containing neither D nor E from the values found in peptides containing D or E. A similar comparison for the loss of CO2 did not show significant differences between peptides containing D or E, or neither of these residues. Therefore, we examined the loss of a second CO2 from the various peptides and found significant abundances only in peptides containing D or E. Table 1 shows that loss of CO2 from the side chain of E is eight times more abundant than loss from D. Cysteine also undergoes two neutral losses, an abundant loss of H2S and a loss of CH2S, which is eight times less abundant. Three losses were detected from methionine: an abundant loss of CH4S and losses of C3H6S and C2H6S, which are 70 and 270 times less abundant, respectively.

Losses of alkenes were detected from P, V, I, and L residues but their relative abundances are very low (<0.5). They were observed only at high collision energies after the more favorable dissociations had taken place. Proline appears to lose C2H4 from its ring much more abundantly than C3H6. However, V, I, and L appear to lose the whole side chain more abundantly than a smaller alkene.

Oxidized methionine undergoes abundant losses of CH3, CH3SOH, or the whole side chain C3H7OS, and a very minor loss of C2H6OS. These losses are listed separately in the bottom of Table 1 because they were estimated from the spectra of only 10 dipeptides containing oxidized methionine.

The relative abundances in Table 1 were estimated from the average intensity of the corresponding peaks in all the peptides containing the amino acid residue irrespective of its position within the peptide chain. It was noted, however, that while the neutral losses from the side chains of threonine and serine were independent of the location of the amino acid within the peptide, other losses may be dependent on location. The most pronounced difference was found in dipeptides containing arginine, where the loss of CH2N2 is about five times more abundant from C-terminal than from N-terminal arginine, and with cysteine, where the loss of H2S is at least 10 times more abundant from the N-terminus than from the C-terminus.

By comparison with the neutral losses from the side chains of peptide-negative ions, neutral losses from positive ion side chains are abundant mainly from methionine and arginine. From the spectra of 525 singly protonated peptides containing two to six amino acid residues we found the following neutral losses (relative abundance): from methionine CH3SH (53), from arginine CH2N2 (19), from threonine C2H4O (8), from serine CH2O (8), from cysteine H2S (7), from tryptophan C9H7N (2), and from tyrosine C7H6O (1). There is also significant loss of H2O from glutamic acid (40) but much less from aspartic acid (6); the latter abundance values were estimated by deducting the abundance of water loss from peptides containing neither E nor D. The losses from serine and threonine are site-specific, as discussed before [35].

Fragment Ions from Amino Acid Residues

Although earlier studies [7,8,9,10,11, 15,16,17,18, 20] on side-chain fragmentation discussed mostly neutral losses, the authors also mentioned two negative ions formed from the side chains: C6H5CH2 - from F and HOC6H4CH2 - from Y. In the present study we confirm these findings and extend the results to other amino acid residues, such as the formation of C9H8N- from W or C4H5N2 - from H. Such characteristic ions can be used to identify amino acids within the peptide sequence. Side chain-negative ions may be produced while the amino acid residue is retained within the peptide backbone or after that residue is released following backbone cleavage. After the amino acids are released from the peptide chain as negative ions, they may dissociate further to produce smaller negative ions. In the current measurements, HCD spectra were acquired at various collision energies, including high energies in which the fragmentation of the single residues is observed. It should be pointed out, however, that since deprotonated peptides often form c-ions, and sometimes x-ions, what we consider here to be a single residue often includes an additional NH group from the adjacent residue.

In Table 2 we list the significant negative ions observed from single residues, including those obtained from the side chain alone and those from the whole residue. Fragment ions that are specific to one amino acid and are observed with significant abundance in the spectra of at least 20% of the peptides containing that amino acid are denoted with a bold mass. When the mass value is not bold, that fragment ion is produced either with low abundance or from more than one amino acid and thus is not useful for amino acid identification. No fragment ions are listed for glycine and alanine because all the ions that can be formed from these residues can be formed also from larger amino acids following side-chain losses. Also, no fragment ions are listed for threonine because this residue loses acetaldehyde from the side chain very rapidly, even before peptide bond cleavage, leaving behind a glycine residue. Serine loses its side chain (formaldehyde) slightly more slowly than threonine and thus the c-ion product from serine can be observed, but with very low abundance. It should be noted, however, that an earlier paper [36] reported the observation of abundant c-ions from deprotonated peptides containing serine and threonine. The difference between those results and our current work is likely due to the difference in collision energy; they used low collision energy CID (SORI-CID) whereas we used higher energy collision dissociation (HCD).

Table 2. Amino Acid Fragment Ions from Deprotonated Peptides

The last column in Table 2 points out the relation between the observed fragment ion and the precursor amino acid (aa). In most cases this is indicated by showing the aa losing a proton and then losing identifiable neutral species such as H2O, NH3, or CO2. When the fragment ion is produced from a c-ion product, the relation is indicated by “aa – OH + NH2 – H”. An example of an ion derived from a c-ion, i.e., from a residue with an added NH2 group, is the fragment ion at m/z 119.0285 from cysteine, which corresponds to the formula NH2CH(CH2S-)CONH2. This ion is not observed when cysteine is in the C-terminal position in the peptide, except when the peptide is amidated, but it can be formed from cysteine at any other position.

Table 2 shows that several amino acids have multiple characteristic negative ions. For example, the following ions are derived from histidine (Scheme 1).

Scheme 1
scheme 1

Proposed structures for negative fragment ions from histidine

These nine ions are observed in significant abundance and are characteristic of histidine only. Their abundances vary with peptide composition and with collision energy. For example, the peak of the fragment ion C4H5N2 at m/z 81.0458 is observed in the spectra of all dipeptides containing histidine with intensity >5%, 86% of the spectra exhibit the peak with intensity >25%, and 68% of the spectra show intensity >50%. The abundance of this peak in the spectra of peptides containing histidine decreases as the peptide length increases, clearly due to competing pathways. For example, the peak is exhibited with intensity >5% by 100% of dipeptides, but only by 70% of peptides with three to six residues, by 54% of peptides with seven to egjt residues, and by <1% of the peptides with eight to 15 residues. Thus, the fragment ion peak at m/z 81.0458 can be used to identify histidine in peptides with less than eight amino acids. All nine ions shown above are good candidates for identifying histidine in peptides. Identification with multiple characteristic negative ions formed from a single amino acid provides greater confidence.

In parallel with histidine, phenylalanine produces five characteristic fragment ions with significant abundance, which are not observed with peptides not containing F. Similarly, tyrosine produces 10 characteristic fragment ions, not observed in the absence of Y, and eight of them appear with significant abundance. Tryptophan produces six characteristic fragment ions, four of which with significant abundance. Arginine may produce eight fragment ions, but three of them appear with low abundance or are formed from other residues. Proline produces two fragment ions but only one of them is characteristic and can be used to identify proline. The characteristic ion from proline is C5H6NO- with m/z 96.0455. However, this peak must be measured with high mass accuracy to distinguish it from a peak at m/z 96.0091 of the ion C4H2NO2 - formed from D or N. Other amino acids have only one characteristic fragment ion but some amino acids have none, such as glutamine. Seven fragment ions are listed in Table 2 for glutamine, but none of them are useful for identification because they can be produced also from glutamic acid, although sometimes with lower abundance (and thus not listed under glutamic acid). On the other hand, glutamic acid produces one characteristic fragment ion that is not observed from glutamine.

Comparison to Fragments of Positive Ions

The negatively charged fragment ions produced from deprotonated peptides (Table 2) are mostly different from the positively charged fragment ions produced from protonated peptides. Several authors have discussed “immonium ions” and other single amino acid fragments from protonated peptides [37, 38] and have incorporated them into software for interpretation of tandem mass spectra of peptides, such as Mascot [39]. During our analysis of peptide mass spectra for inclusion in the NIST peptide library, we began by using the list of immonium and other single residue fragment ions from Mascot for annotating the spectra, and then, by manual analysis of specific peptide spectra, we found additional fragment ion candidates for specific amino acid residues. The candidate ions with high mass accuracy were then confirmed by statistical analysis of all the peptides containing the specific residue. Our modified list is presented in Table 3. Comparison of Tables 2 and 3 shows that 18 fragments appear in positive and negative forms that differ by two protons. All these ions are formally derived from the protonated or deprotonated single residue by loss of the same neutrals (NH3, H2O, 2H2O). In addition, 17 of the fragment ions have the same formula with the negative and positive charge, differing in mass by two electrons only. All the other fragments have unrelated formulas due to differences in fragmentation pathways between positive and negative peptide ions. Even for those cases with the same formulas, the structures of the positive and negative ions may be different because the positive and negative charges are localized at different sites. For example, the immonium ion derived from glutamic acid has the formula C4H8NO2 and the structure +NH2=CHCH2CH2CO2H, but the observed negative ion fragment with the same formula is more likely to have the structure NH2CH2CH2CH2CO2 -. Similarly, the fragment from tyrosine with the formula C8H10NO has the structure +NH2=CHCH2C6H4OH in positive mode but NH2CH2CH2C6H4O- in negative mode.

Table 3. Amino Acid Fragment Ions from Protonated Peptidesa

The fragment ion with the formula C6H7O+, i.e., protonated phenol, is formed from tyrosine by dissociating the phenol group from the side chain along with a proton. The C6H7O+ ion, however, is also found to be formed from protonated peptides containing phenylalanine or tryptophan but no tyrosine. This ion is clearly formed from a phenyl cation, produced from these amino acids, by reaction with water in the collision cell, as reported before for many other aromatic compounds [40]. In addition, the phenyl cation also reacts with N2, present in the collision cell of the Orbitrap, to produce the stable phenyldiazonium cation, C6H5N2 +. This ion is also observed to be formed from phenylalanine, tyrosine, and tryptophan, and was reported before for many other compounds [41]. This reaction with N2 is specific to positive ions and is not observed with negative ions. The reaction with water is observed mainly with positive ions and rarely with negative ions.

Conclusions

Highly specific and abundant neutral losses and negative ion products from many amino acid residues were determined with high mass accuracy from the MS/MS spectra of 527 deprotonated peptides, and are useful in the identification of specific residues within peptides. Side chain neutral losses are less abundant with protonated peptides. Single residue fragment ions are also different in positive and negative mode, with less than a third of the ions having related formulas. An updated list of immonium and other fragments from single residues in protonated peptides is proposed for future annotation of fragments in peptide mass spectra.