1 Introduction

The accuracy of MS/MS-based peptide sequencing significantly depends on the refined knowledge of gas-phase peptide chemistry to implement reliable fragmentation rules in bioinformatics tools used by the proteomics community for protein identification [1]. Many researchers have tackled worldwide such an important issue during the last decade providing a buoyant literature on peptide fragmentation chemistry upon low energy collision induced dissociation (CID) conditions and on the gas-phase structures of the major fragment ions, i.e. b, a, and y signals. The mobile proton model [25] covers the mobility/sequestering of ionizing protons, which triggers the peptide chain dissociations. These b, a, and y fragment ions that are directly related to the primary polyamide skeleton are used to deduce the amino acid sequence. However, many MS/MS spectra remains unassigned by sequencing algorithms due to the formation of abundant peaks that do not belong to the direct sequence ion series and related structures (internal and immonium ions, loss of H2O, NH3). Many complex rearrangements have been evidenced to explain such nondirect sequence ion production [612], the scrambling effect being recently one of the most studied phenomena [1322]. Indeed, these atypical peptide fragmentations deserve great attention in the search for more comprehensive dissociation pathways suitable for efficient automated MS/MS spectra interpretation. Thus, our attention was focussed on low molecular weight singly charged peptides that often failed to be identified by sequencing algorithms looking for specific rearrangement behaviors other than sequence scrambling. Following a statistical approach, we have in hand a database of MS/MS spectra recorded upon collision-induced dissociation (CID) and laser-induced dissociation (LID) conditions in ESI and MALDI mass spectrometry from 392 synthetic peptides of various amino acid composition and length [2325]. Surprisingly, the loss of the C-terminal amino acid residue that has been previously reported for some arginine-containing peptide singly charged ions [610] was very often observed for any of the 20 proteinogenic amino acids, whatever the MS/MS activation method. Besides, the fair to very abundant intensities of many of these fragment ions prompted us to scrutinize our available MS/MS data to define all conditions required for such a C-terminal specific rearrangement.

The peptide database can be divided in two subsets according to the nature of the C-terminal backbone function: a carboxylic acid (peptide free acids) or an amide (peptide amides), the latter case corresponding to digestion of amidated proteins issued from a specific C-terminal post-translational modification that was of interest in the laboratory as part of ongoing research projects [26]. As depicted in Figure 1, among the peptide free acids, some were designed to mimic protein digest sequences such as tryptic chains having either arginine (R) or lysine (K) at their C-terminus [27], as well as Lys-N proteolytic peptides that all possess N-terminal lysine [28, 29]. Peptides illustrating trypsin miscleavages with either R or K within the sequence were also exemplified. Two situations could be encountered. First, the tryptic miscleaved peptide originated from the protein internal sequence and had either R or K at the C-terminus. The second came from the protein C-terminal sequence and it possessed any of the 20 proteinogenic amino acids at the C-terminal end. These two types of peptides were respectively named in Figure 1 trypsin miscleavages and C-ter trypsin miscleavages. Furthermore, the peptide collection contained some sequences that were not related to any protein enzymatic digestion. These peptides could be assigned to the C-terminal part of any protein subjected to proteolysis. Finally, we ensured that the synthetic peptide free acids were presenting an overall amino acid occurrence similar to the residue natural relative abundance in proteins as shown in Figure 1. Moreover, the 20 proteinogenic amino acids being located randomly at any position within the available sequences, we checked that the five different classes (aliphatic, aromatic, basic, acid, and not charged side chains) were inserted at least once at the C-terminal position. All sequences are reported in Table S1 of the Supplemental Information section.

Figure 1
figure 1

Database of synthetic peptides and the corresponding amino acid occurrence

Surprisingly, we have observed that some of these 392 peptides exhibited a peculiar MS/MS behavior leading to the abundant loss of the residue located at the C-terminus, as shown in Table 1. This neutral loss that has been reported from metastable decomposition [6] and from CID experiments [710] of singly charged arginine-containing peptide ions was observed whatever the instrument used to record the MS/MS spectra (ESI-QqTof and MALDI-TOF/TOF), and whatever the nature of the amino acid present in that position. However, it appeared that such residue exclusion only occurred when three conditions were simultaneously satisfied: (1) the precursor ion must be singly charged; (2) a basic residue must be present in the sequence; (3) the C-terminal backbone function must be a carboxylic acid. Within the frame of the mobile proton model [25], provided that at least one of the ionizing protons is sufficiently mobile to be intramolecularly transferred onto any amide bond along the peptide backbone, multiple peptide bond ruptures will be triggered in MS/MS experiments upon precursor ion activation. These competitive fragmentation pathways led to series of ions related to the N-terminal part of the peptide (b/a ions) and C-terminal end (y ions) allowing efficient sequencing. On the contrary, it is well known that peptides containing strong basic residues (such as arginine and histidine) sequester the ionizing proton [30, 31] and, thus, may be very difficult to dissociate by low energy CID activation of the monoprotonated precursor ion [3235]. Since this situation was encountered for all peptides exhibiting the C-terminal residue exclusion (conditions 1 and 2), it confirmed that the fragmentation mechanism by rearrangement [710], in competition with the regular bn-ym pathway [4], was necessarily requiring proton sequestering on a basic side chain and mobilization of the C-terminal carboxylic acid proton. Even if this C-terminal rearrangement mechanism has been debated in the literature for some times [710], recent investigations by Bythell et al. on the RGD tripeptide by means of experimental and theoretical work have proven that salt-bridge stabilized and anhydride transition structures were involved [10]. Thus, the structural constraints that have been established from many experimental data provide further compelling evidence of the C-terminal rearrangement mechanism established for the RGD tripeptide [10]. These results were also in agreement with literature data describing the production of truncated peptides from sodiated ions [3638]. Lebrilla et al. have demonstrated that lithiated glycylglycine was rearranged to form an anhydride intermediate that further fragment to give the product with a carboxyl group at the new C-terminus [39]. Although neutral fragments released from both N- and C-termini of singly protonated peptides were studied more than a decade ago [40], the loss of a residue was identified as an aziridinone (three-membered cyclic amide) and solely related to the N-terminal part of the peptide, which accompanies ym ion formation. Working with synthetic peptides having well-defined sequences allowed to ascertain that the residue loss was produced in our case from the C-terminal end. This point was crucial for relevant peptide sequencing since a mistake in the identification of the amino acid present at one chain extremity might bring confusion between the sequence and retrosequence attribution. Indeed, a fragment obtained by C-terminal residue elimination still contains the N-terminal part of the original peptide but will be assigned as a y ion and thus be attributed to the C-terminal sequence as depicted in Scheme 1. Such confusion could be made since the a1-ym–1 fragmentation channel allowing the potent formation of N-terminal a1 ion and C-terminal ym–1 ion has been proposed to occur in competition with the common major bn-ym dissociation pathway [4]. Thereafter, the designation of the fragment ion produced by C-terminal residue loss could be ambiguous. Since the remaining peptide skeleton present in this fragment ion corresponds to the N-terminal portion, it has been described as a so-called (bn–1 + H2O) ion [710] and named as such in this paper.

Table 1 Synthetic Peptides Exhibiting C-terminal Residue Exclusion
Scheme 1
scheme 1

Ambiguous loss of neutral residue from N- and C-terminal peptide extremities

To confirm the occurrence of such competitive ambiguous rearrangement in peptide sequencing by singly charged precursor ion activation, we report the behavior in MS/MS experiments of several peptides of various lengths containing at least one basic residue. Since some amino acids are well-known to exhibit specific reactivity in MS/MS experiments such as proline [4144] and few residues that present functionalized side chains that can be involved in acid/base and nucleophile/electrophile reactions [32, 34, 35, 4547], we ensured a nonbiased peptide collection with sequences possessing such tricky amino acids (see Table S1). To further assess the generality of such an exclusion fragmentation behavior, a series of peptides displaying the same sequence (FPARVGX) but differing by the nature of the C-terminal amino acid (X classified according to their aliphatic, aromatic, not charged, acidic, and basic side chains in Figure 1) was analyzed as shown in Table 1 (entries R-1 to R-9). As a proof of concept, to ascertain the published fragmentation mechanism from protonated and cationized species [10, 38], comparative dissociation of singly charged molecular ions (MH+) of C-terminal peptide free acids and amides in ESI-QqTOF and MALDI-TOF/TOF experiments were undertaken. Besides, some sodiated ions were also dissociated and the recorded MS/MS spectra were compared with the protonated ones. Finally, the fragmentation profile of the gas-phase recovered truncated peptide (through an MS3 experiment) and the corresponding synthetic sequence of such considered shortened peptide (MS2 spectrum) were compared. From all these data, we found that the relative abundance of the fragment ion resulting from the C-terminal residue exclusion depended from both the amino acid nature and the MS/MS activation method (low energy collisionally activation in ESI-QqTof versus high energy collisionally activation/metastable decomposition in MALDI-TOF/TOF).

2 Experimental

2.1 Chemicals and Reagents

All solvents were of analytical grade. Acetonitrile was purchased from Carlo Erba. The deionized water used in all the experiments was obtained using a Milli-Q system (Millipore, Milford, MA, USA). MALDI matrix [2,5-dihydroxybenzoic acid (DHB) and α-cyano-4-hydroxycinnamic acid (HCCA)] were purchased from Fluka. A peptide calibration standard kit was purchased from Bruker Daltonics.

2.2 Syntheses

Peptides were synthesized on the IBMM peptide synthesis platform using the conventional methodology of solid-phase peptide synthesis using Fmoc chemistry [48] and purified by preparative LC/MS (Autopurification System; Waters, Milford, CA, USA). According to the chosen linker, the peptides were prepared as C-terminal carboxylic acids or amides. More details are available in the Supplemental Information section.

2.3 Mass Spectrometry

Mass spectrometry analyses were conducted on instruments located in the Laboratoire de Mesures Physiques of University Montpellier 2, IBMM platform of instrumentation.

ESI mass spectra were recorded on a Q-TOF I mass spectrometer (Waters, Milford, CA, USA) fitted with an electrospray ion source. Data were acquired and processed with the MassLynx software. The mass spectrometer was calibrated in the positive ion mode using 1% phosphoric acid in water/acetonitrile solution (H2O/CH3CN, 50/50, vol/vol). Data were acquired by the TOF analyzer at 1 acquisition/s from m/z 50 to m/z 2000 with a resolution of 5000. One hundred acquisitions were summed to produce the final spectrum. Samples were dissolved in a mixture H2O/CH3CN (50/50, vol/vol) and infused into the ESI source at a flow rate of 5 μL/min. Voltages were set at +2.5 kV for the capillary and adjusted for the sampling cone. The source was heated at 80 °C. Nitrogen constituted both nebulizing and desolvation gas. The latter was heated at 150 °C. In MS/MS experiments, the second quadrupole analyzer was set in the rf mode using argon as collision gas and the collision energy varied from 25 to 90 eV depending on the samples to get optimal fragmentation.

MALDI mass spectra were recorded on an Ultraflex III TOF/TOF instrument (Bruker Daltonics, Wissembourg, France) equipped with LIFT capability. A pulsed Nd:YAG laser at a wave length of 355 nm was operated at a frequency of 100 Hz (MS data) or 200 Hz (MS/MS data) with a delayed extraction time of 30 ns. The source was operated in the positive mode. Data were acquired with the Flex Control software and processed with the Flex Analysis software. A solution of either the α -cyano-4-hydroxycinnamic acid (HCCA) matrix in water/acetonitrile (70/30, vol/vol) at a concentration of 10 mg/mL or the 2,5-dihydroxybenzoic acid (DHB) matrix in water/acetonitrile (50/50, vol/vol) at a concentration of 10 mg/mL was mixed with the peptide sample in equal amount and 0.5 μL of this solution was deposited onto the MALDI target according to the dried droplet procedure. After evaporation of the solvent, the MALDI target was introduced into the mass spectrometer ion source. External calibration was performed with the commercial peptide mixture (calibration peptide standard 2, Bruker Daltonics, Wissembourg, France). MS data were acquired under the following MS conditions. An acceleration voltage of 25.0 kV (IS1) was applied for a final acceleration of 21.95 kV (IS2). The reflectron mode was used for the TOF analyzer (voltages of 26.3 and 13.8 kV). Mass spectra were acquired from 250 laser shots, the laser fluence being adjusted for each studied sample (laser fluence 1). Ions were detected over a mass range from m/z 200 to 2000. MS/MS data were acquired under the following conditions. An acceleration voltage of 8.0 kV (IS1) was applied for a final acceleration of 7.25 kV (IS2). The reflectron mode was used for the TOF analyzer (voltages of 29.5 and 13.9 kV). Mass spectra were acquired from 600 to 800 laser shots, the laser fluence being adjusted for each studied peptide above the threshold for generation of molecular ions (laser fluence 2 > laser fluence 1). MS/MS experiments were performed under laser induced dissociation (LID) conditions with the LIFT cell voltage parameters set at 19.0 kV (LIFT 1) and 3.2 kV (LIFT 2) for a final acceleration of 29.5 kV (reflector voltage) and a pressure in the LIFT cell around 4 × 10–7 mbar. The precursor ion selector was set manually to the first monoisotopic peak of the molecular ion pattern for all analyses. For LID experiments, no collision gas was added (gas off spectra). For LID/CID experiments (gas on spectra), the same tuning was used but argon was allowed to fill the collision cell (pressure around 2 × 10–7 mbar).

3 Results and Discussion

Among all available peptide free acids dissociated upon CID on a QqTOF mass analyzer configuration (Table S1 of the Supplemental Information section), for roughly 25% of the available MS/MS synthetic peptide data we observed an abundant fragment ion that corresponded seemingly to a truncated sequence where the residue in the C-terminal position has been lost. As expected from published data on small arginine-containing peptides [710], this dissociation only occurred from the protonated precursor ion leading to a single atypical fragment ion in the upper mass range of the MS/MS spectrum. The loss of a neutral residue was only observed for peptides in the MS/MS data set corresponding to the free acids. Moreover, in that category, only peptides bearing a basic amino acid in their sequence such as arginine, histidine and lysine revealed C-terminal exclusion provided the singly charged precursor ion was selected for activation/dissociation. These two constraints must be simultaneously satisfied to trigger the C-terminal sequence truncation.

The 42 peptide free acids (over 152 entries) exhibiting such amino acid exclusion are reported in Table 1. They are classified according to the basic residue present within the sequence, i.e., arginine (entries R-1 to R-19), histidine (entries H-1 to H-6), or lysine (entries K-1 to K-4), which are required to trigger the C-terminal exclusion with fair to high intensities. Peptides possessing multiple basic residues were counted separately (entries B-1 to B-9). Obviously, the C-terminal position has not been considered for such a categorization, this amino acid being lost as a neutral molecule. It should be noted that few tryptic peptides that were all terminated by lysine behaved unexpectedly upon collision-induced activation of the singly charged molecular ion (roughly 10% of all tryptic peptides). Although these peptides did not contain any basic amino acid except the C-terminal one (K), they produced the residue exclusion. This has not been observed previously, the formation of (bn-1 + H2O) ion being only known up to now to be promoted by the presence of basic amino acids in a non-C-terminal position [7, 10]. In particular, such behavior in CID has been reported for deprotonated peptides containing C-terminal glutamic acid, aspartic acid, or serine residues [9]. Nevertheless, the corresponding fragment ion issued from nonbasic residue-containing tryptic peptides was always recorded with a low abundance as depicted in Table 1. These C-terminal lysine peptides were thus considered separately (entries X-1 to X-4).

To illustrate this specific dissociation pathway, we choose the two arginine-containing peptide free acids ALGSRSAGYP and WFAPPRVGYL (Table 1, entries R-15 and R-18, respectively), giving rise to an abundant loss of 97 Da (proline residue, P) and 113 Da (leucine residue, L) from the MH+ precursor ions (R-15: m/z 978.5 → m/z 881.5 and R-18: m/z 1205.6 → m/z 1092.4, respectively) as displayed in Figure 2. More examples of histidine- and lysine-containing peptides are provided in the Supplemental Information section (Figures S1 to S3, entries H-5, K-4, and X-1 in Table 1).

Figure 2
figure 2

ESI MS/MS spectra upon CID of the [M + H]+ precursor ion of (a) peptide R-15 (ALGSRSAGYP), (b) peptide R-18 (WFAPPRVGYL)

In the search for previous description of such elimination of the C-terminal residue during peptide sequencing in proteomics studies, fragment ions assigned as (bn + H2O)+ or (bn–1 + H2O)+ ions has been occasionally noted on few published MS/MS spectra of monoprotonated peptides [49, 50], but they were not considered significant for bioinformatics tools due to their very low relative abundance. Since peptide sequencing is most of the time undertaken in the positive mode by dissociation of singly and multiply charged molecular ions, we decided to further investigate this rather abundant ion formation to identify the experimental conditions prerequisite for such amino acid neutral loss in the positive mode.

At this stage of our studies, we needed to answer several questions to refine the precise context of this C-terminal rearrangement:

  • Is it dependent on the ion activation method?

  • Is the free carboxylic acid function really compulsory for exclusion?

  • Is the charge state of the selected precursor ion actually playing a role?

  • Is it dependent on the nature of the amino acid positioned at the C-terminus?

  • Is the basic residue necessary for the exclusion behavior?

  • What is the significance of the exclusion pathway compared with the production of all other non-scrambled sequence fragment ions, which are not directly issued from the bn-ym backbone amide pathway?

3.1 Influence of Precursor Ion Activation Method on C-Terminal Residue Exclusion

To answer the first question, all 42 peptide free acids previously analyzed in ESI-QqTOF were subjected to MALDI-TOF/TOF experiments making available two sets of MS/MS spectra recorded with complementary dissociation techniques based on low- and high-energy CID fragmentations. In contrast to the former ESI-MS/MS setup, MALDI-TOF/TOF is not only limited to CID but allows also dissociation induced by laser irradiation known as laser-induced dissociation (LID) [51]. In such a case, no gas was added to the collision cell; only variation of the laser fluence in the ion source was employed to trigger the dissociation processes according to metastable fragmentation in a field free region [52]. Given the availability of both LID and LID/CID high energy dissociation regimes, MALDI-TOF/TOF constitutes a major versatile analytical tool in proteomics [53, 54] and more generally in any field requiring high throughput analyses [55]. The same results were obtained under all dissociation conditions as depicted in Table 1. All C-terminal amino acid exclusions that were detected in low energy CID were also seen in LID and LID/CID MS/MS spectra. The corresponding fragment ion, i.e., the protonated shortened peptide, was as abundant in MALDI as in ESI MS/MS, as seen in Figures 2 and 3 for instance. In a high number of cases, this ion signal was even more intense in MALDI than in ESI dissociations (Table 1). To evaluate the relative abundance of the fragment ion under interest, the precursor ion was not considered since ESI and MALDI data were not acquired under the same MS/MS process. Hence, the signal related to the C-terminal residue exclusion was measured according to the most intense fragment signal (base peak of the MS/MS spectrum) as described in Table 1. Changing the collision energy or the laser fluence in CID and LID MS/MS experiments did not really alter the abundance of the observed shortened peptide ion remaining weak (+), fair (++), abundant (+++), or very abundant (++++). Moreover, to enlarge the scope of such observation, other experimental parameters were also modified such as the nature of the matrix and the combination of precursor ion activation (LID and LID combined with CID). For all studied 42 peptide dissociations, the MS/MS data recorded with 2,5-dihydroxybenzoic acid matrix were very similar to the ones with α -cyano-4-hydroxycinnamic acid. No major difference was noticed between high energy LID and LID/CID regimes in terms of peptide fragment ion production, as already observed in previous MALDI-TOF/TOF studies [24, 25, 56]. Finally, some peptides that did not exhibit the exclusion in ESI-QqTOF were also tested in MALDI-TOF/TOF and as expected no truncated protonated peptide was detected. The C-terminal residue exclusion was thus not dependent on the precursor ion vibrational activation method.

Figure 3
figure 3

LID MALDI MS/MS spectra of the [M + H]+ precursor ion of (a) peptide R-15 (ALGSRSAGYP), (b) peptide R-18 (WFAPPRVGYL)

3.2 Influence of the C-Terminal Backbone Function on C-Terminal Residue Exclusion

None of the peptides terminated by an amide function (240 sequences, Table S1) exhibited the C-terminal neutral loss, indicating that the presence of a carboxylic acid at the C-terminus was compulsory to trigger the residue exclusion, while keeping the terminal hydroxyl group onto the produced truncated peptide ion. To reinforce this statement, some of the peptides exhibiting the C-terminal exclusion were prepared as amides, the C-terminal carboxylic acid function being replaced by the amide function as the only modification. As expected, the latter amidated sequences did not lose their final residue. Besides, regarding these two peptide free acid and amide MS/MS data subsets, sodiated peptide free acids showed a charge-mediated rearrangement leading to the loss of the last residue whereas the sodiated peptide amides were found to behave differently, showing no such C-terminal amino acid exclusion. Such results were in good agreement with collision activated dissociation experiments that were observed in previous studies with sodiated C-terminal free acid or amide peptides [36, 39].

To ascertain these results, two analytical strategies were undertaken by performing multiple dissociation stages and by fragmenting sodiated molecular ions.

First, MS3 experiments were conducted to dissociate the shortened peptide fragment ion. For instance, the MS3 mass spectra recorded in ESI-QqTOF from the truncated C-terminal fragment of the two selected singly protonated peptide free acids (ALGSRSAGY from ALGSRSAGYP and WFAPPRVGY from WFAPPRVGYL) are reproduced in Figure 4. These mass spectra can be compared with the MS/MS data recorded in both CID ESI-QqTOF and LID MALDI-TOF/TOF shown in Figures 2 and 3, respectively. As expected, the truncated ions that were dissociated showed again the exclusion of the C-terminal residue while keeping the other N-terminal related sequences ions unchanged, proving that the peptide structural modification only affected the C-terminal part and confirming the structure of such a sequence fragment ion. Moreover, to gain confidence in the attribution of a (bn–1 + H2O) ion for the truncated peptide fragment ion and gain better insights into the rupture pathway, the MS3 data of two peptides (R-1: FPARVGS and R-4: FPARVGK) were matched up to the MS/MS spectrum of the corresponding synthetic truncated peptide (FPARVG) as displayed in Figure 5. The two MS3 were similar to the MS2 data of the short synthetic peptide. The fragment ion produced by C-terminal residue exclusion was certainly structurally very close to a linear protonated sequence.

Figure 4
figure 4

ESI MS3 spectra upon CID of the [(M – X) + H]+ precursor ion of (a) peptide R-15 (ALGSRSAGYP), (b) peptide R-18 (WFAPPRVGYL)

Figure 5
figure 5figure 5

ESI MS3 spectra upon CID of the [(M – X) + H]+ precursor ion of (a) peptide R-1 (FPARVGS), (b) peptide R-3 (FPARVGI), and ESI MS/MS spectra upon CID of the [M + H]+ precursor ion of (c) peptide R-19 (FPARVG)

Second, sodiated molecular ions of peptide free acids (MNa+) were fragmented and the MS/MS data were compared with the ones recorded from the corresponding protonated species (MH+). Regarding the former, the exclusion of the C-terminal residue was observed as expected from literature data [36, 39] (Figure S4, entry K-4). Besides, most of the fragment ions that were detected from the MNa+ precursor ion corresponded to the sodiated or protonated species of the sequence fragment ions recorded from the MH+ species. This experiment allowed gathering additional information leading to the C-terminal sequence restriction from charge-sequestered molecular ion as proposed by Bythell et al. [10] and Lebrilla et al. [39].

3.3 Influence of the Charge State of the Precursor Ion on C-Terminal Residue Exclusion

Although it has been depicted in the literature that some peptides were prompt to lose a part of their chain as a neutral molecule both from the N- and C-termini of their sequence, it was only described from doubly charged precursor ions resulting in doubly charged sequence ions (b 2+n and y 2+m series) [57]. The exclusion of one residue from doubly charged molecular ion dissociation was never spotted in our peptide ESI-QqTOF MS/MS database. The lack of mobile proton was thus necessary for the rearrangement mechanism. Trapping all ionization protons was obviously more difficult to achieve from multiply charged ions of proteolytic peptides compared with singly charged species. Taking into account that most of the proteomic studies are currently carried out according to shotgun strategies involving the collision-induced dissociations in ESI-MS/MS of doubly- and triply charged precursor ions [58], the described exclusion phenomenon mainly concerns sequencing methodologies based on MALDI-TOF/TOF analyses. This atypical fragmentation might be deleterious for efficient correct sequence assignment, especially when proteolysis is carried out with others enzymes than trypsin such as Lys-N [28], Asp-N [59], and Glu-C [60] proteases, which produce peptides bearing arginine and histidine anywhere within the proteolytic sequences.

3.4 Influence of the Amino Acid Positioned at the C-Terminus on C-Terminal Residue Exclusion

A series of peptide free acids differing from the nature of the amino acid at the C-terminus (FPARVGX) were prepared to ascertain the influence of this residue on the exclusion. At least one amino acid from each class (aliphatic, aromatic, not charged, acidic, and basic side chains) was selected for the determination of the significance of the C-terminal position (X = I, Y, S, C, Q, E, R, K, H). All peptides with different molecular ion m/z values exhibited upon activation the same fragment ion at m/z 646 issued from the amino acid exclusion and corresponding to the common protonated FPARVG sequence ion (Figure 6, entries R-1, R-4, and R-8 in Table 1). Besides, as discussed previously, the MS/MS spectrum of the synthetic FPARVG peptide was identical to any of the MS3 data recorded from the m/z 646 (exclusion ion present in all dissociation spectra). Thus, production of such a non-scrambled fragment ion was not dependent on the nature of the amino acid located at the extreme sequence end undergoing the rearrangement. However, the abundance of this ion was affected. Very surprisingly, in contrast to literature data [7, 9, 10], the presence of a basic residue at the C-terminal position provided the most intense signal for the neutral loss, as demonstrated in Table 1, by comparing the relative abundance of the m/z 646 ion both in ESI- and MALDI-MS/MS (Figure 6, Figures S5 and S6, entries R1 to R-9). Altogether, the exclusion occurred from all classes of amino acids (Table 1). All aliphatic, aromatic, basic, and acid residues were excluded at least once. Only the loss of methionine, threonine, and asparagine belonging to the non-charged residues was not exemplified in our peptide free acid data set, whereas serine, proline, cysteine, and glutamine were successfully tested.

Figure 6
figure 6figure 6

ESI MS/MS spectra upon CID of the [M + H]+ precursor ion of (a) peptide R-1 (FPARVGS), (b) peptide R-4 (FPARVGK), and (c) peptide R-8 (FPARVGR)

3.5 Necessity of the presence of at least one basic amino acid on C-terminal residue exclusion

Provided that singly charged precursor ions of peptide free acid were activated, only sequences bearing a basic residue (R, H, or K) showed the C-terminal neutral exclusion. Above all, it should be emphasized that arginine caused the most outstanding exclusion as previously observed by others [610]. Indeed, compared with histidine and lysine, the presence of arginine within the chain led to the most abundant truncated peptide, which is encountered in the case of trypsin miscleavage of the C-terminal protein sequence or with other digestion enzymes (Lys-N, Asp-N, Glu-C). This exclusion signal was even more intense, provided that another basic residue (R, H, or K) was located at the C-terminal position. The latter situation with C-terminal R and K corresponds to trypsin miscleavage peptides (Table 1, entries R-4, R-8, B-7). When histidine and lysine were present within the sequence, the exclusion was less prominent even for tryptic peptides terminated by arginine (Table 1, entries H-2, H-5, H-6) or by lysine (Table 1, entries H-3, H-4). The relative position of the basic residue sequestering the ionizing proton in the sequence did not significantly influence the C-terminal exclusion (Table 1, entries R-11, R-12). On the other hand, we have unexpectedly observed lysine exclusion for some lysine-terminated tryptic peptides that did not contain any other additional basic residue (Table 1, entries X-1 to X-4). The fragment ion was always weak and this behavior was never encountered with C-terminal arginine-terminated tryptic peptides. Considering all these results, in the context of trypsin digestion, the detection of an abundant truncated ion from the singly charged precursor ion strongly indicated a miscleavage peptide bearing arginine within the chain. Besides, the neutral loss designated the C-terminal residue (loss of 128 for K and loss of 156 for R). For tryptic peptides that were assigned as miscleavage sequence in conventional proteomic shotgun workflow, it could be judicious to perform the dissociation of the singly charged molecular ion, either in ESI- or MALDI-MS/MS, to ascertain the sequencing output by observing, or not, the expected neutral loss of 128 or 156 Da.

3.6 Importance of the Exclusion Pathway among the Production of Non-Scrambled Sequence Fragment Ions Other than bn/an/ym Ions

Efficient peptide sequencing relies on the extensive formation of complementary bn and ym fragment ion series giving information on both N-terminal and C-terminal chains, respectively. However, in addition to such common protonated amide backbone dissociation producing direct sequence ions, i.e., protonated five-membered ring oxazolones (b ions) and protonated truncated peptides (y ions), other fragmentation pathways have been reported in the literature [46]. Indeed, after the ionizing proton migration along the peptide chain, several dissociation mechanisms are in competition, leading to the production of ions that may not be directly related to the initial primary sequence [4]. Such ions formed by rearrangement and multiple bond cleavages disturb the MS/MS spectra and may even lead to incorrect sequence assignment. One illustration of such detrimental behavior is known as peptide scrambling by means of cyclization and reopening of bn ions [12, 13]. The extent of such nondirect sequence fragment ion formation and the impact of these sequence permutations on peptide identification relevance in shotgun proteomics studies are under debate today [14, 15]. Despite the great interest raised by the scrambling phenomenon [1222], less attention has been devoted in the proteomics community to the presence of fragment ions that are not produced by the conventional bn-ym backbone amide pathway but that retain the primary sequence information. As such, one can mention, first, nonspecific fragmentation by the loss of neutral molecules (mainly H2O, NH3, CO, …) [61, 62], and second, the production of internal fragment ions [63, 64], especially when residues such as proline, histidine, and arginine were present in the sequence [23, 24]. These ‘parasite’ additional ions complicated the MS/MS spectra, which were overall exhibiting lots of ion series (bn, an, ym, and so-called ‘bn-type’ internal ions), but they are less deleterious for correct sequencing than scrambled sequences. Although the detection of these non-scrambled internal fragments did not hamper automatic peptide MS/MS assignment in proteomic workflow [15], we believe that they can provide relevant information for de novo peptide sequencing, especially for the C-terminal amino acid exclusion. Thus, we have explored our peptide data set to probe the formation extent of fragment ions that keep the primary sequence unaffected but differs from the fully characterized bn/an and ym ions. As expected, the loss of water and ammonia, as well as the formation of non scrambled internal b-type ions were observed for some peptides, leading to duplicated sequence ion series on the MS/MS spectra. The C-terminal exclusion behavior contrasts with all aforementioned fragmentations initiating multiple formations of ions from the activated precursor ion (such as series of bn, an, ym, bn-H2O, bn-NH3, internal ions, …). Only very few ions in the MS/MS spectra were related to a C-terminal neutral loss, since in most cases, only the extreme residue was concerned and scarcely the C-terminal dipeptide or tripeptide. The fragment ions corresponding to these dissociations were not as abundant as the single amino acid exclusion. For instance, the MS/MS data of the two peptide free acids ALGSRSAGYP and WFAPPRVGYL (Table 1, entries R-15 and R-18, respectively) reproduced in Figure 2 showed the exclusion of a single amino acid (P or L) and formally of a dipeptide (YP or YL), the latter loss being far less intense. Even if MS3 experiments shown in Figure 4 showed the loss of tyrosine from both protonated truncated y-type fragment ions as discussed previously, it is difficult to exclude that this ion could also be directly produced from the protonated molecular ion by the loss of a neutral cyclic dipeptide (diketopiperazine). Moreover, when the fragment ion produced by a C-terminal neutral exclusion was very abundant (>50% relative abundance at least in the MS/MS spectrum), some other second generation fragmentations occurring from the protonated truncated (bn–1 + H2O) fragment ion were observed in few cases (Figures 2b and 3b). Regarding other non-scrambled sequence ions, loss of small neutral molecules such as water and ammonia usually occurs from several fragment ions duplicating the sequence bn and ym fragment ion series. In the same manner, the production of b-type internal ions are typically triggered by a specific amino acid (P, R, or H) but the length of the fragment is varying a lot leading to populated internal ion series as seen in Figures 2 and 3. In contrast, C-terminal exclusion of a single residue or even of a dipeptide produced few non-scrambled fragment ions indicating a very specific dissociation pathway that can be exploited for detailed sequence assignment (confirmation of trypsin miscleavage, determination of the amino acid at the C-terminus).

4 Conclusion

A rather frequent peptide backbone fragmentation occurring from basic amino acid-containing free acid sequences was pointed out by screening a data set of 392 MS/MS data gathered from the collision- and laser-induced dissociation of singly charged molecular ions. Far from being an epiphenomenon, the loss of a neutral residue can be envisaged on both N- and C-terminal sides of the peptide chain. The a1-ym–1 pathway led to a C-terminal y ion by the loss of the N-terminal residue, whereas the observed C-terminal neutral residue loss gave raise to a shortened peptide possessing the structure of a (bn–1 + H2O) ion. Great caution must be taken when detecting the loss of a residue from the precursor ion that represents a potent pitfall especially in de novo peptide sequencing (N- versus C-terminal amino acid determination).