Introduction

Carbohydrates are key constituents of all living organisms. Nearly 1% of each genome is devoted to highly conserved sugar processing enzymes, namely glycosyltransferases and glycosidases, which in a concerted work are able to create thousands of structures not only in eukaryotic cells but also in the microbial world [1]. Conjugation of sugars to proteins considerably increases variation among gene products with a total of 13 monosaccharides and eight amino acids forming at least 41 types of different glycosidic linkages [2]. Glycoproteins play a wide range of roles in biological processes. Glycans covering cell surfaces can be considered as a molecular code for cells to communicate with each other, a “glycocode”, which controls several biological functions [3]. There is strong evidence that aberrant post-translational modifications may play a central role in the onset and progression of autoimmune diseases [4] where glycoproteins/glycolipids can trigger an immunogenic response as a consequence of a modification of the glycosylation pattern. However, the non-template driven synthesis of glycans and their heterogeneity complicate the analyses of glycoconjugates.

Autoimmune diseases are a large group of disorders characterized by humoral and/or cell-mediated immune response to one or more self-antigens. The serum of autoimmune disease patients contains a wide range of autoantibodies, fluctuating with disease activity. Enzymatic post-translational modifications, like glycosylation and phosphorylation, are not the unique chemical modifications responsible for the onset of autoimmune diseases. Even non-enzymatic chemical modifications can lead to the formation of autoantigens. Protein glycation is an example of this type of modification and, though it is generally a physiological process leading to biological aging, it is responsible for the accumulation of immunogenic products that are recognized as non-self in certain conditions, particularly in the case of type I diabetes [5].

Multiple sclerosis is one of the most known inflammatory, demyelinating diseases of the central nervous system (CNS) possibly triggered by an autoimmune reaction against different CNS myelin components, like myelin basic protein (MBP) [6], proteolipid protein, and myelin oligodendrocyte glycoprotein [7].

In order to probe the nature of the modifications involved in the triggering onset of multiple sclerosis, a library of glycopeptide mimetics was screened following a “chemical reverse approach” [8] leading to the selection of post-translationally modified peptides, as optimized antigenic probes [9]. In particular, a type I′ β-turn peptide structure characterized by 21 amino acids named CSF114(Glc) was able to identify high titers of autoantibodies in patients’ sera [10]. This N-glucosylated peptide was characterized by the presence of a β-D-glucopyranosyl moiety linking an asparagine residue on the tip of the β-turn structure by an amide bond [11]. There is no evidence for the existence of such a modification in eukaryotic cells, but recently a bacterial glycosyltransferase Haemophilus influenzae was described to be able to transfer hexose units (mainly glucose but possibly also galactose as mono- or dihexose molecules) to N-glycosylation sites [1214]. Recently, the Actinobacillus pleuropneumoniae N-glycosyltransferase ApNGT was expressed in E. coli together with its substrate proteins and the N-glucosylation was detected by using specific anti-N-glucosylation human antibodies present in multiple sclerosis patients’ sera. The approach allowed the detection of the target proteins, and their N-hexosylated residues were identified by LC-MS/MS, using ETD as fragmentation mode. Some endogenous E. coli proteins were also demonstrated to be modified by the enzyme [15]. Glucosylation by bacterial enzymes was already evidenced in other pathological states. For example Clostridium difficile, one of the major causes of intestinal infections in hospitals, was described to produce virulence factors that are indeed O-glucosylating toxins, acting on serine or threonine sites [16]. This observation led some of the authors to hypothesize that an aberrant N-glucosylation of myelin proteins can be one of the etiological causes for neoantigens formation triggering an early autoimmune response in multiple sclerosis.

Further studies based on the screening of differently glycosylated peptides and unmodified ones were able to demonstrate that the glucosylated asparagine is specifically recognized by the sera of multiple sclerosis patients’ subpopulation, whereas peptides carrying different sugars or different modification sites had a lower or no affinity for the autoantibodies present in patients’ sera [17]. A bioinformatic approach was explored to select the human myelin proteins mimicked by the synthetic antigenic probe CSF114(Glc), and thus sharing sequences characterized by a high degree of structural homology and the presence of a sequon (i.e., an N-glycosylation site). The study led to the identification of five proteins containing at least one peptide aligning CSF114(Glc) [18]. Therefore, the characterization of these glycopeptide mimetics is important from the analytical point of view addressing the issue of unveiling anti-sugar peptide structure antibodies as biomarkers that can be useful as diagnostic and/or prognostic tools. A recent study based on solid-phase ELISA using CSF114(Glc) as a probe has shown the presence of specific IgM autoantibodies in patients affected by Rett syndrome [19]. Even though shorter synthetic linear and cyclic sequences (eight residues) showed a nanomolar affinity, the 21-mer CSF114(Glc) still remains the best antigenic probe to detect IgMs as biomarkers of antibody-mediated forms of multiple sclerosis and Rett syndrome [20].

The aim of this work is to provide a mechanistic study on the fragmentation by multi-stage mass spectrometry of different sugar-conjugated forms at position 7 of the type I′ beta-turn structural probe CSF114 in order to find signature cleavages able to unequivocally identify either the type or the site of the modification. During CID fragmentations of glycopeptides, the sugar moiety tends to fragment more easily than the peptide backbone, giving poor information on the peptide sequence and on the modification site. The spectrum is strongly dominated by neutral losses occurring on the glycan moiety [21, 22]. The clarification of the fragmentation pathway is a common issue in the study of biomarkers containing sugar-peptide structures. Attempts to achieve more structural information were carried out using a multi-stage mass spectrometry approach on a MALDI-LTQ Orbitrap mass spectrometer (ThermoScientific,  Bremen, Germany). The design of this instrument is characterized by the coupling of a MALDI source with an accurate mass, high-resolution hybrid mass spectrometer [23]. This instrument is suitable for obtaining structural information because of its ability to trap ions in the LTQ analyzer and, subsequently, to perform MSn experiments.

Experimental

Reagents

Rink amide (p-methylbenzhydrylamine)-resin (MBHA-resin, 100–200 mesh, 0.52 mmol/g), O-(7-azobenzotriazol-1-yl)-1,1,3,3-tetramethyluronium hexafluorophosphate (HATU), Fmoc-Arg(Pbf)-OH, Fmoc-Gly-OH, and Fmoc-Ser-tert-butyl (t-Bu)-OH were purchased from Iris Biotech GmbH, Marktredwitz, Germany. O-(benzotriazol-1-yl)-1,1,3,3-tetramethyluronium hexafluorophosphate (HBTU) and Fmoc-His(Trt)-OH were purchased from Novabiochem (La Jolla, CA, USA). (1-2H)-D-glucose (98% D) was purchased from Eurisotop (Saint-Aubin, France). Fmoc-L-Lys(Boc)(2,3:4,5-di-O-isopropylidene-1-deoxyfructopyranosyl)-OH was purchased from Polypeptide Laboratories (Strasbourg, France). 2,5-Dihydroxybenzoic acid (DHB) matrix was purchased from LaserBio Labs (Sophia-Antipolis, France).

Synthesis of Fmoc-Asn[Glc(OAc)4]-OH

The building block was prepared according to a previously reported procedure [24]. The corresponding deuterated building block 5 was synthesized following the same protocol and starting from (1- 2H)-D-glucose (Scheme 1).

Scheme 1
scheme 1

Reagents and conditions for synthesis of building block 5: (a) Ac2O, pyridine, 16 h; (b) 33% HBr in AcOH, CH2Cl2, 0 °C to rt, 3 h; (c) NaN3, CH3CN, reflux, 16 h, 61% in 3 steps; (d) H2, Pd/C 10%, Et3N, THF, 1h, 91%; (e) 4, DMSO, 1 h, 86%

(1- 2H)-2,3,4,6-Tetra-O-Acetyl-α-D-Glucopyranosyl Bromide (1) [25]

To a stirred suspension of (1-2H)-D-glucose (992 mg, 5.5 mmol) in pyridine (6 mL) was added dropwise acetic anhydride (3 mL). The reaction mixture was stirred at room temperature (rt) for 16 h and concentrated. The residue was taken up in EtOAc, successively washed with 1 M aqueous HCl solution and saturated aqueous NaHCO3 solution, dried (MgSO4), and concentrated.

The residue was then dissolved in CH2Cl2 (3 mL) and 33% hydrogen bromide solution in acetic acid (6 mL) was added dropwise at 0 °C. The reaction mixture was slowly warmed to rt and stirred for 3 h, then washed with ice-water. Organic phase was washed with cold saturated aqueous NaHCO3 solution, dried (MgSO4), and concentrated to give glucosyl bromide 1 used in the next step: 1H NMR (300 MHz, CDCl3, δ) 5.55 (dd, 1H, J H3-H2 = 9.9 Hz, J H3-H4 = 9.3 Hz, H3), 5.16 (dd, 1H, J H4-H5 = 9.9 Hz, J H4-H3 = 9.3 Hz, H4), 4.83 (d, 1H, J H2-H3 = 9.9 Hz, H2), 4.36-4.25 (m, 2H, H5, H6a), 4.12 (d, J H6b-H6a = 10.8 Hz, H6b), 2.09, 2.05, 2.03 (3s, 12H, Ac); 13C NMR (75 MHz, CDCl3, δ ) 170.6, 170.0, 169.9, 169.6 (CO), 86.4 (t, J C1-D = 28 Hz, C1), 72.3 (C5), 70.7 (C2), 70.3 (C3), 67.3 (C4), 61.1 (C6), 20.8, 20.8, 20.7, 20.7 (Ac).

(1-2H)-2,3,4,6-Tetra-O-Acetyl-β-D-Glucopyranosyl Azide (2)

A solution of 1 and sodium azide (1.2 g, 19 mmol, 3.5 eq.) in acetonitrile (15 mL) was refluxed for 16 h, then filtered and concentrated. Crystallization in absolute ethanol afforded glucosyl azide 2 as a white solid (1.27 g, 61% in three steps): 1H NMR (300 MHz, CDCl3, δ ) 5.22 (dd, 1H, J H3-H4 = 9.6 Hz, J H3-H2 = 9.3 Hz, H3), 5.10 (dd, 1H, J H4-H5 = 9.9 Hz, J H4-H3 = 9.6 Hz, H4), 4.96 (d, 1H, J H2-H3 = 9.3 Hz, H2), 4.28 (dd, 1H, J H6a-H6b = 12.6 Hz, J H6a-H5 = 4.8 Hz, H6a), 4.17 (dd, 1H, J H6b-H6a = 12.6 Hz, J H6b-H5 = 2.1 Hz, H6b), 3.79 (ddd, 1H, J H5-H4 = 9.9 Hz, J H5-H6a = 4.8 Hz, J H5-H6b = 2.1 Hz, H5), 2.11, 2.08, 2.03, 2.01 (4s, 12H, Ac); 13C NMR (75 MHz, CDCl3, δ ) 170.8, 170.3, 169.5, 169.4 (CO), 87.7 (t, J C1-D = 23 Hz, C1), 74.1 (C5), 72.7 (C3), 70.7 (C2), 68.0 (C4), 61.8 (C6), 20.9, 20.7, 20.7 (Ac).

(1-2H)-2,3,4,6-Tetra-O-Acetyl-β-D-Glucopyranosyl Amine (3)

To a stirred solution of 2 (1.25 g, 3.4 mmol) in THF (10 mL) were added triethylamine (480 μL, 3.4 mmol, 1 eq.) and Pd/C 10% (35 mg, 34 μmol, 1% mol). The reaction vessel was purged from air and filled with H2 (3 bar). The reaction was stirred at rt for 1 h, then diluted with ethyl acetate, filtered through Celite, and concentrated, to give crude glucosyl amine 3 (1.06 g, 91%): 1H NMR (300 MHz, CDCl3, δ ) 5.22 (dd, 1H, J H3-H2 = 9.7 Hz, J H3-H4 = 9.5 Hz, H3), 5.02 (dd, 1H, J H4-H5 = 10.0 Hz, J H4-H3 = 9.5 Hz, H4), 4.81 (d, 1H, J H2-H3 = 9.7 Hz, H2), 4.21 (dd, 1H, J H6a-H6b = 12.3 Hz, J H6a-H5 = 4.8 Hz, H6a), 4.08 (dd, 1H, J H6b-H6a = 12.3 Hz, J H6b-H5 = 2.2 Hz, H6b), 3.67 (ddd, 1H, J H5-H4 = 10.0 Hz, J H5-H6a = 4.8 Hz, J H5-H6b = 2.2 Hz, H5), 2.07, 2.05, 2.00, 1.99 (4s, 12H, Ac); 13C NMR (75 MHz, CDCl3, δ ) 170.8, 170.3, 169.7 (CO), 84.7 (t, J C1-D = 23 Hz, C1), 73.2 (C3), 72.8 (C5), 72.1 (C2), 68.9 (C4), 62.4 (C6), 20.9, 20.9, 20.7, 20.7 (Ac).

N-Fmoc- L-Aspartic Anhydride (4)

Anhydride 4 was prepared as previously described [24] in two steps starting from L-aspartic acid.

N-α-Fmoc-N-γ-(1-2H)-2,3,4,6-Tetra-O-Acetyl-β-D-Glucopyranosyl]- L-Asparagine (5)

To a stirred solution of 3 (1.06 g, 3.05 mmol, 1 eq.) in DMSO (2 mL) was added anhydride 4 (1.02 g, 3.05 mmol, 1 eq.). The reaction mixture was stirred at rt for 1 h, diluted with water, and extracted with EtOAc. Organic phase was washed twice with saturated aqueous NaCl solution, dried (MgSO4), and concentrated, to give 5 as a white solid (1.80 g, 86%), which was used without further purification step: 1H NMR (300 MHz, CDCl3, δ) 7.73 (d, 2H, J = 7.3 Hz, Har), 7.57 (d, 2H, J = 7.3 Hz, Har), 7.36 (dd, 2H, J = 7.3 Hz, J = 7.2 Hz, Har), 7.31-7.23 (m, 2H, Har), 7.04 (broad s, 1H, Hδ2), 6.26 (d, 1H, J H N -H α = 7.9 Hz, HN), 5.33 (dd, 1H, J H3-H4 = 9.6 Hz, J H3-H2 = 9.5 Hz, H3), 5.07 (dd, 1H, J H4-H5 = 9.8 Hz, J H4-H3 = 9.6 Hz, H4), 4.95 (d, 1H, J H2-H3 = 9.5 Hz, H2), 4.65-4.57 (m, 1H, Hα), 4.41-4.30 (m, 2H, CH 2 Fmoc), 4.27 (dd, 1H, J H6a-H6b = 12.0 Hz, J H6a-H5 = 3.9 Hz, H6a), 4.24-4.15 (m, 1H, CH Fmoc), 4.04 (d, 1H, J H6b-H6a = 12.0 Hz, H6b), 3.88-3.80 (m, 1H, H5), 2.91 (dd, 1H, J H β -H β’ = 16.5 Hz, J H β -H α = 3.9 Hz, Hβ), 2.77 (dd, 1H, J H β’ -H β = 16.5 Hz, J H β’ -H α = 4.2 Hz, Hβ’), 2.01, 2.00 (2s, 12H, Ac); 13C NMR (75 MHz, CDCl3, δ) 173.7 (C’), 171.4, 170.8, 170.1, 169.7 (CH3 CO), 156.6 (CO Fmoc), 143.8, 141.4 (Cq Fmoc), 127.9, 127.2, 125.3, 120.1 (CHar), 73.8 (C5), 72.9 (C5), 70.8 (C2), 68.2 (C4), 67.6 (CH2 Fmoc), 61.8 (C6), 50.6 (Cα), 47.1 (CH Fmoc), 37.7 (Cβ), 20.8, 20.7 (Ac).

Synthesis of the Sugar-Modified Peptide Analogs of CSF114 (I-V)

Both the pentapeptide I corresponding to the central beta-turn motif included in CSF114(Glc) and its deuterated analog II (Table 1) were synthesized manually by solid-phase Fmoc/t-Bu strategy on a Rink amide resin in a fritted syringe (0.05 mmol scale).

Table 1 Sequences of the Synthetic Sugar-Modified Peptide Analogs of the Type I′ β-turn structure CSF114

Ac-RN(Glc)GHS-NH2 (I)

Fmoc protection was removed with a solution of piperidine in DMF (20%, 5 min then 15 min). Stepwise coupling reactions were performed in DMF with Fmoc-protected natural amino acids, HBTU and NMM (5/5/7 eq.) for 20 min at rt. Coupling reaction with Fmoc-protected synthesized amino acid was performed with HATU and NMM (3/3/5 eq.) for 1 h at rt. After removal of the last Fmoc protecting group, the peptide was N-acetylated with a solution of acetic anhydride/NMM (1:1) in CH2Cl2 (20%, 2 × 45 min). The resin was washed with CH2Cl2, CH3OH and dried under vacuum. It was then treated with TFA/H2O/TIS (95:2.5:2.5; 25 mL/g resin, 3 h, rt), and the cleaved peptide was precipitated in diethyl ether and lyophilized.

Crude peptide was dissolved in MeOH/H2O (5:1) and K2CO3 was added (pH ≈ 8–9) to remove the O-acetyl protecting groups. After completion of the reaction (2–3 h), the mixture was neutralized, concentrated, and lyophilized. Purification by RP-HPLC on a C18 semi-preparative column using a 30 min 0%–5% linear acetonitrile (0.1% TFA) gradient in an aqueous solution (0.1% TFA) afforded I as a white powder (14.3 mg, 28%) after lyophilization. Retention time of the product was 8.3 min. The peptide was characterized by MALDI-TOF MS (m/z): [M + H]+ calcd, 773.35; found, 773.40.

Ac-RN[(1-2H)Glc]GHS-NH2 (II)

The same procedure gave II as a white powder (10.8 mg, 21%). Retention time of the product was 8.6 min. The peptide was characterized by MALDI-TOF MS (m/z): [M + H]+ calcd, 774.36; found, 774.49.

TPRVERN(Glc)GHSVFLAPYGWMVK (III) and TPRVERN(GlcNAc)GHSVFLAPYGWMVK (IV)

Peptides CSF114(Glc) (III) and CSF114(GlcNAc) (IV) were synthesized as previously described by Lolli et al. [9], including, respectively, at Asn-7 a β-D-glucopyranosyl and an N-acetyl-β-D-glucosylamine moiety.

TPRVERK(1-Deoxyfructopyranosyl)GHSVFLAPYGWMVK (V)

The glycated peptide [Lys7(1-deoxyfructopyranosyl)]CSF114 (V) was synthesized introducing the building-block Fmoc-L-Lys(Boc)(2,3:4,5-di-O-isopropylidene-1-deoxyfructopyranosyl)-OH reported for the first time by Carganico et al. [26]

Briefly, the peptide [Lys7(1-deoxyfructopyranosyl)]CSF114 was synthesized by SPPS, using a microwave-assisted peptide synthesizer (CEM LibertyBlue; CEM Corporation, Matthews, NC, USA), following the fluorenylmethoxycarbonyl (Fmoc)/(t-Bu) strategy. The peptide was prepared starting from a Fmoc-Lys(Boc)-Wang resin (0.31 mmol/g). All the reactions were performed in a Teflon vessel, where mixing and filtration are provided by a N2 flow. Predissolved reagents are 0.2 M orthogonally protected Fmoc-amino acids, 0.5 M DIC, and 1 M Oxyma solutions in DMF. Fmoc-deprotection cycles were, respectively, of 15 s (75 °C, 155 W) and 30 s (90 °C, 30 W). Couplings of Arg residues were performed in 1500 s (25 °C, 0 W) + 300 s (75 °C, 30 W). Coupling of His residue was performed in 120 s (25 °C, 0 W) + 240 s (50 °C, 35 W). Couplings of other residues were performed in 15 s (90 °C, 170 W) + 110 s (90 °C, 30 W).

The protected building-block Fmoc-L-Lys(Boc)(2,3:4,5-di-O-isopropylidene-1-deoxyfructopyranosyl)-OH was introduced by manual batch SPPS with PLS 4×4 (Advanced ChemTech, Louisville, KY, USA) with an excess of 2.5 eq of HATU as activator, and 3.5 eq of NMM, for 1 h at rt.

The protected building-block Fmoc-L-Lys(Boc)(2,3:4,5-di-O-isopropylidene-1-deoxyfructopyranosyl)-OH was introduced by manual batch SPPS with PLS 4×4 (Advanced ChemTech) with an excess of 2.5 eq, 2.5 eq of HATU as activator, and 3.5 eq of NMM, for 1 h at rt.

Cleavage from the resin and side-chain deprotection was performed at rt for 6 h using a mixture of TFA/TIS/H2O (95:2.5:2.5 v/v/v) (1 mL/100 mg resin). The resin was filtered off and the solution was concentrated by flushing N2. The peptide was precipitated from cold Et2O, centrifuged, and lyophilized. The peptide was purified by semi-preparative RP-HPLC performed on a Waters (Waters Corporation, Milford, MA, USA) apparatus using a Phenomenex (Phenomenex Inc., Torrance, CA, USA) Jupiter column C18 (10 μm, 250 × 10 mm), with a gradient 25%–40% B in 30 min at 4 mL/min with solvent system A (0.1% TFA in H2O), and B (0.1% TFA in CH3CN). Peptide was obtained with a purity of at least 95%, as a white powder (yield of 4%).

Peptide characterization was performed by analytical HPLC-MS using an Alliance Chromatography (Waters) with a Phenomenex Kinetex C18 column (2.6 μm , 3.0 × 100 mm), with a gradient 20%–70% B in 5 min at 0.6 mL/min, coupled to a single quadrupole ESI-MS (Micromass ZQ). Rt: 3.99 min. m/z calcd. 2620.4 [M + H]+, found 874.8 [M + 3H]3+.

Mass Spectrometry

The mass spectrometry experiments were performed on a ThermoScientific MALDI LTQ Orbitrap XL mass spectrometer. DHB was used as matrix at a concentration of 30 mg/mL in EtOH/H2O/TFA, (7:3:0.1, v/v). Samples were dissolved in aq. 0.1% formic acid (v/v) to a final concentration of 0.5 μM. All full scan analyses were performed in positive ion mode, using the Orbitrap as detector and a resolving power of 60,000. Automatic gain control (AGC) allowed the control of the filling status of the linear ion trap. Five microscans were recorded for each cycle. Crystal positioning system (CPS) was used to optimize the choice of shot position, and one microscan was registered for each step. Laser intensity was set between 15 and 25 kV in order to avoid in-source fragmentation. Fragmentation experiments were conducted in the linear ion trap with ion trap detection manually choosing the best normalized collision energy and using Q = 0.250 and T = 30 ms.

Results and Discussion

The mechanistic study has been focused on a series of sugar-modified peptides (Table 1).

First we analyzed two N-glucosylated pentapeptides: the N-glucosylated peptide I corresponding to the central beta-turn motif included in CSF114(Glc) and its deuterated form on C-1 of the glucose moiety, the analog II. This short sequence with only 5 amino acids was chosen to aid spectral interpretation focusing on the cleavages at the level of the sugar moiety and of the modification site. C-terminus amidation, mimicking an amide bond within a larger sequence, was useful to reduce water losses in the peptide backbone, thus facilitating the attribution of neutral losses occurring in the sugar moiety.

Fragmentation experiments on the two parent ions of I and II at m/z 773.39 and 774.39, respectively, were conducted using different values of normalized collision energy (CE), using the linear ion trap as collision cell and detector (data not shown). CID fragments were detected in the LTQ trap because of its higher sensitivity. The best collision energy was chosen to obtain sequence information as well as the modification site.

The MS/MS spectra obtained at lower levels of CE (between 5% and 20%) displayed the presence of peaks probably corresponding to the cleavage at the level of the most labile bonds giving rise to fragments at m/z 653.42 and 594.42 (data not shown). Neutral losses frequently occur in the sugar moiety of glycoconjugates where analytes undergo water and formaldehyde losses forming detectable intermediary oxonium ions. The mechanism has been extensively investigated in the case of protein glycation [27], where the elimination of water molecules was described with a proton-driven mechanism leading to the formation of oxonium ions.

The same neutral losses are the most intense peaks even in the MS2 spectrum obtained at CE 27% (Supplementary Figure 1). Table 2 summarizes the products arising from MS2 and MS3 experiments carried out on peptides I and II. MS2 spectra are populated by successive water losses, with the loss of three water units already described in glycopeptide esters [28]. We can interpret the peak at m/z 653.42 as the loss of a unit of C4H8O4 (MH+ – 120 Da) on the base of the chemical formula for glucose, whereas the peak at 594.42 can be explained with the loss of C6H10O5 + NH3 (MH+ – 179 Da). The neutral loss of 120 Da in the deuterated N-glucosylated pentapeptide (II) still exhibits a mass shift of 1 Da if compared with the corresponding undeuterated I, whereas the fragment corresponding to the neutral loss of 179 Da has no mass shifts, clearly showing that the C-1 of the sugar moiety is not implicated in the loss of 120 Da but it is lost in the second case when the entire sugar moiety is lost together with an ammonia molecule. The inspection of the spectrum allows also the visualization of additional neutral losses. The signal at m/z 743.33 can be easily interpreted as the elimination of a formaldehyde unit (30 Da), whereas peaks at m/z 713.25 and 683.33 can be due to the elimination of two CH2O and three CH2O units, with a nominal mass of 60 and 90 Da, respectively. Another possible interpretation can be found in the loss of aldose units of Cx(H2O)x. The fragment at m/z 683.33 might have undergone the elimination of a C2H4O2 (glycolaldehyde) unit followed by the elimination of a formaldehyde unit, or of the elimination of a C3H6O3 (glyceraldehyde) unit. All these species still contain the deuterium on C-1. The elimination of the whole glucose moiety, corresponding to a loss of 162 Da at m/z 611.33, shows no mass difference among the deuterated peptide II and the undeuterated I. It is worth noting that this loss is of lower intensity if compared to the losses of 120 and 179 Da. This observation has induced us to explore the potential of multi-stage tandem mass spectrometry to validate our hypotheses. MS3 experiments were carried out on the fragments corresponding to the neutral losses of 120, 162, and 179 Da.

Table 2 List of the Monoisotopic Mass of the Most Significant Ions Observed in Product Ion Spectra (MS2 and MS3) of Peptide I and Peptide II Recorded on MALDI-LTQ Orbitrap, Using CID Fragmentation in the LTQ Orbitrap. Observed and Calculated Values are Listed in the Table

The fragment at m/z 611.33 is not very intense in the MS/MS spectrum compared with the other neutral losses, but its MS3 fragmentation allows the reconstruction of the peptide sequence. The spectra of the undeuterated and deuterated species do not display any differences, underlying that the deuterium is lost during the previous fragmentation event. Both the b- and y-series are represented in the figure, giving complete sequence coverage of this small peptide.

MS3 experiments on the species losing the entire glucose moiety and an ammonia molecule (Supplementary Figure 3) are obtained by a more intense precursor compared with the neutral loss of 162 Da. Most probably the additional neutral loss of 17 Da occurs on the amide of the asparagine residues. In the MS3 spectrum, further eliminations of ammonia units are possible, for example the peak at m/z 473.17 can be explained by the loss of two ammonia units and a glucose unit from the b4 ion. The fragment at m/z 594.33 even undergoes a neutral loss of 30 Da forming m/z 564.42. We can easily interpret this peak as a formaldehyde loss due to the fragmentation at the level of the carbonyl of the asparagine residue. In the MS2 spectrum, the signal corresponding to the loss of 179 Da is higher than the signal corresponding to the loss of only the glucose moiety and, for this reason, we consider that this loss is indeed a signature of this modification.

The most intense diagnostic fragmentation is the neutral loss of 120 Da (Supplementary Figure 4). This species still contains the glucose C-1, as is clearly demonstrated by the increase of 1 Da on many peaks in the spectrum corresponding to the deuterated species peptide II. This cleavage can be explained by the loss of four consecutive formaldehyde molecules, two C2H4O2 (glycolaldehyde) units, or a C4H8O4 (erythrose) from the open ring structure as shown in Scheme 2A, summarizing the most significant sugar ring cleavages described above. Scheme 2A also displays the possibility to consider several resonance structures for the parent ion, thus explaining the relative stability of the amide bond between asparagine and glucose, as well as the coexistence of the open-ring conformation together with the pyranosidic ring.

Scheme 2
scheme 2

Hypothetical fragmentation mechanism of the asparagine-modified peptides: resonance structures for the precursor ions are displayed. (a) Fragmentation of the N-glucosylated peptide: the precursor N-glucosylated-asparagine is evidenced by the dotted rectangle. (b) Fragmentation of the N-acetylglucosamine-modified peptide: the precursor is evidenced by the dotted rectangle

Exploring the MS3 spectrum, it was surprising not to detect the loss of the residual modification leading to the theoretical mass of 611.3 Da (unglucosylated form), whereas a loss of 59 Da can be interpreted as the loss of a unit containing the residual modification as well as an ammonia molecule (NH2CH2COH). A fragment attributable to the loss of a unit containing five carbon atoms is observed at m/z 623.42 and 624.42 in peptides I and II, respectively. The difference of 1 Da between the two species indicates that this fragment still contains the deuterium bound to the C-1. This peculiar fragmentation leading to the complete degradation of the glucose moiety from its open ring structure was already evidenced by Jerić et al. in the case of glycopeptide esters [28].

Sugar-Peptide Analogs of CSF114

Starting from this characterization, we have exploited the potential of multi-stage mass spectrometry by using the MALDI-LTQ Orbitrap to study the fragmentation behavior of the type I′ β-turn structure CSF114(Glc) (III). This peptide was responsible for specific detection of serum autoantibodies in a multiple sclerosis patient subpopulation [10]. In the attempt to identify signature fragmentations for the N-glucosylation motif, multistage mass spectrometry experiments were carried out. The pattern of fragmentation obtained for this peptide was first compared with the fragmentation of standard peptides, and then with the series of CSF114 peptide structures bearing different sugar modifications, namely the peptide CSF114(GlcNAc) (IV), modified with a residue of N-acetylglucosamine on an Asn residue at position 7, and the peptide [Lys7(1-deoxyfructopyranosyl)]CSF114 (V) obtained by introducing during the solid-phase synthesis the protected building block Fmoc-L-Lys(Boc)(2,3:4,5-di-O-isopropylidene-1-deoxyfructopyranosyl)-OH (26) at position 7 of the peptide structure. This last modification, known as protein glycation, is due in vivo to the non-enzymatic reaction of glucose with free amino groups, giving a Schiff base that is easily rearranged forming a ketoamine named Amadori product [29].

Figure 1 shows the multi-stage mass spectrometry characterization of the peptide CSF114(Glc) (III). Figure 1a displays the Orbitrap detection of the monoisotopic mass of this peptide at m/z 2606.33 Da. The model of fragmentation proposed for the standard pentapeptides I and II is clearly confirmed in Figure 1b, representing its MS2 spectrum. The diagnostic neutral losses of 162, 120, and 179 Da are all present at m/z 2444.18, at m/z 2486.18, and at m/z 2427.09, respectively, as well as the loss of single formaldehyde units and aldose moieties. The spectrum is registered in the mass range m/z 715–2800 because of the low-mass cutoff of the LTQ trap. In addition to the signals already present in the previous analyses, here we find a further signal in the high mass region of the spectrum at m/z 2562.18 corresponding to a neutral loss of 44 Da. One of the positions where the loss can take place is the C-terminal end of the peptide where CO2 units can rarely be lost. This loss was not detected in the peptides I and II with a C-terminal amide. But, there is a second possibility where the loss can be located to the N-terminal threonine residue. It has already been reported [30] that threonine residues can undergo the loss of CH3CHO units in peptides containing the sequence TT- or TS- at the N-terminal side. The condition for this is a previous water loss in the side chain of the amino acid in second position generating a cyclic intermediate, which may eliminate an acetaldehyde fragment. In this case the cyclic intermediate was mimicked by proline, whose presence may promote the loss of 44 Da without any previous water loss in the peptide backbone. To confirm this hypothesis, the MS3 spectra of the daughter ion at m/z 2562.18 (Supplementary Figure 5) was carried out. The spectrum clearly confirms the second hypothesis: in fact all the b-fragments are detected with the loss of 44 Da. For example, the b7 fragment has mass to charge ratio of 971.39 instead of 1015.55 in the MS2 spectrum. Conversely, the y-series remain unmodified (as shown for fragment y16 at m/z 2023.82) until the fragment bears the neutral loss, which is the N-terminal amino acid. Another confirmation of this hypothesis is given in the MS3 spectrum of the fragment at m/z 2444.30, corresponding to the loss of the entire sugar moiety (Figure 1c). In this spectrum, we should be able to detect a fragment corresponding to the loss of 44 Da on the threonine residue, and indeed this fragment is present at m/z 2400.09, clearly showing that the loss was located on the N-terminal end of the peptide backbone. Thus, the MS3 spectrum of the neutral loss of 162 allowed not only the elucidation of the peptide sequence but even the loss of 44 Da was explained. The MS2 spectrum exhibits the peak at m/z 2427.09 corresponding to the loss of the entire sugar moiety and of the ammonia molecule (−179 Da) already described for the standard peptides. This peak is more abundant than the one corresponding to the loss of only the sugar moiety even in this case.

Figure 1
figure 1

Multi-stage mass spectrometry analysis of peptide III [CSF114(Glc)]. (a) MALDI-Orbitrap detection of precursor ion at m/z 2606.33. (b) MS/MS fragmentation of precursor ion obtained at CE 40% in the LTQ trap. The high mass region of the spectrum was magnified in order to highlight neutral losses in the sugar moiety. (c) MS3 spectrum of the peak at m/z 2444.16 corresponding to [MH+ – C6H10O5]

As for the cross-ring cleavages in the glucose moiety, the loss of a formaldehyde unit can be detected in the MS2 spectrum at m/z 2576.18, as well as the loss of a unit of 60 Da, corresponding to the loss of two formaldehyde units or of a glycolaldehyde unit (m/z 2546.18). In addition to this, at m/z 2516.18 the loss of 90 Da can be easily explained by the loss of three formaldehyde units, or of the glycolaldehyde fragment and a formaldehyde molecule, or of a glyceraldehyde moiety. The peak at m/z 2486.09 corresponding to the loss of 120 Da is more intense compared with the above-mentioned neutral losses. As it was previously explained, our interpretation for this loss leading to the cross-ring 0,2X-type fragment is the loss of two glycolaldehyde units or, most probably, of an erythrose molecule. In analogy with the previous analyses on the N-glucosylated pentapeptides I and II, the fragmentation was investigated for the species at m/z 2516.18 and 2486.18 (Supplementary Figures 6 and 7). This last one is higher than the peak corresponding to the loss of the sugar moiety, as already explained in the analyses of I and II.

Finally, the CSF114(Glc) (III) presents the typical fragmentation pattern for the glucose moiety that can be considered the signature of this modification in the research for biomarkers in autoimmune neurodegenerative diseases.

The following step was used to assess if the type of sugar-peptide linkage and the kind of monosaccharide moiety can influence the fragmentation behavior and then the characterization of the sugar-modification. In this respect, we report the fragmentation spectrum of the peptide CSF114(GlcNAc) (IV) bearing a unit of N-acetylglucosamine on asparagine 7. The aim was to establish if the presence of an N-acetyl-group on the C-2 was responsible for a different signature fragmentation. This modification induces a mass increment of 203.08 Da on the asparagine residue in position 7. The monoisotopic mass of this peptide was accurately detected by Orbitrap analysis and displayed at m/z 2647.35 in Figure 2a. Exploring the MS2 spectrum of this precursor in Figure 2b, the loss of the CH3CHO unit on the N-terminal threonine can be easily recognized at m/z 2603.55, as previously noticed. This peak is related to the peak at m/z 2585.45 with the loss of an extra water molecule. The neutral loss of two water molecules is detected at m/z 2611.45, and it is the maximum number of water molecules lost during the experiment. The analysis of the spectrum shows that the elimination of the N-acetyl-hexose generates the peak at m/z 2444.36, as previously reported, for the peptide CSF114(Glc). However, the abundance of this fragment is very low compared with the losses of 120 and 220 Da. The latter, corresponding to the elimination of the sugar moiety accompanied by the loss of an ammonia molecule from the amide of the asparagine residue, is represented by a very abundant peak at m/z 2427.36.

Figure 2
figure 2

Multi-stage mass spectrometry analysis of peptide IV [CSF114(GlcNAc)]. (a) MALDI-Orbitrap detection of precursor ion at m/z 2647.35. (b) MS/MS fragmentation of precursor ion obtained at CE 40%in the LTQ trap. The high mass region of the spectrum was magnified in order to highlight neutral losses in the sugar moiety. (c) MS3 spectrum of the peak at m/z 2444.36 corresponding to [MH+ – C6H10O4NHCOCH3 ]

Figure 2c represents the MS3 spectrum of the species at m/z 2444.36 that displays a profile very similar to Figure 1c, with the complete fragmentation of the peptide backbone.

At m/z 2527.45 the neutral loss of 120 Da can be detected. Considering the intensity of this peak, we can infer that the cleavage is highly favored during the fragmentation process, probably because of the presence of the N-acetyl group on the C-2. However, our interpretation for this fragmentation differs from the same neutral loss mechanism proposed for the peptide CSF114(Glc) (III). The first difference lies in the fact that the losses of formaldehyde and aldose units were not detected in the MS2 spectrum of the peptide CSF114(GlcNAc) (IV). The delocalization of the lone pairs of electrons of nitrogen and oxygen of the N-acetyl-group allows the representation of the structure as a resonance hybrid with more resonance structures compared with the CSF114(Glc) (III). The N-acetyl-group extends electron delocalization promoting the cleavage of the sugar ring to generate an X0,2-type cross-ring cleavage during CID fragmentation, as summarized in Scheme 2B, where resonance structures are also displayed for the parent ion. Conversely, the peptide CSF114(Glc) tends to fragment by consecutively eliminating formaldehyde and aldose molecules.

Asparagine residues modified in animal proteins by an N-acetylglucosamine in the consensus sequence for N-glycosylation were recently found in plants. A. thaliana proteins bearing this modification were explained as the product of the reaction of the enzyme endo-β-N-acetylglucosaminidase on larger high mannose N-glycans [31]. In animal cells, endoglycosidases are encoded in the genome even if it is not clear if their activity is directed towards glycoproteins, glycopeptides, or free oligosaccharide [32, 33]. In fact, little evidence exists confirming the actual in vivo activity of these endoglycosydases. Chalkley et al. [34] proposed that the identification of GlcNAc peptides in post-synaptic density preparations from murine brains might be due to an artifact during sample preparation.

In contrast to the enzymatic attachment of sugar units to proteins catalyzed by oligosaccharyl transferases (OST), there is another pathway for addition of sugars to proteins: the glycation reaction. Glycation of proteins occurs through spontaneous and reversible condensation of a reducing sugar and the free amino group of a protein at the N-terminus and/or on lysine side chains, to form a Schiff base. This then undergoes a rearrangement into the more stable ketoamine known also as the Amadori product [29]. In the case of glucose, the initially formed Schiff base rearranges into the more stable 1-deoxyfructopyranosyl moiety. Subsequent dehydration, condensation, fragmentation, oxidation, and cyclization reactions lead to the irreversible formation of advanced glycation end products (AGEs). Glycation of CSF114 was achieved introducing at position 7 the completely protected building block Fmoc-L-Lys(Boc)(2,3:4,5-di-O-isopropylidene-1-deoxyfructopyranosyl)-OH, previously described [26]. Glycation induces a mass shift of 162 Da on the modified residue. The monoisotopic mass of the glycated CSF114 is displayed in Figure 3a at m/z 2620.36. In the fragmentation spectrum in Figure 3b, we can notice a peak at m/z 2458.45 corresponding to the precursor after the loss of the entire carbohydrate. This peak is more intense than the peak at m/z 2500.45 corresponding to the loss of 120 Da; both of them were already reported [35], as well as the presence of water losses. A typical signature fragmentation for the glycated structure is represented by the peak at m/z 2536.45, describing the loss of a partial dehydrated glucose (C4H4O2) as previously reported [36]. The same neutral loss was interpreted as a loss of three water molecules and a formaldehyde unit in previous works, with the formation of furylium ions [27, 37]. It is worth noting that here the loss of the entire sugar moiety at m/z 2458.45 is represented by a more intense peak compared with the neutral loss of 120 Da at m/z 2500.45, and this behavior is completely different if compared with the N-modified glucopeptides. Furthermore, the loss of the sugar moiety together with the amine group, which was predominant in the above-mentioned forms of CSF114, is completely absent here, probably because it could generate an unstable primary carbocation on the residual group of the lysine. The spectrum is dominated by the cleavages inside the sugar moiety and b-ions are very weak or even absent, as previously reported; [38] consequently, it is very difficult to obtain sequence information from this spectrum. For this reason, MS3 fragmentation was used to have a better sequence coverage for the peptide. In Figure 3b, the spectrum corresponding to the neutral loss of 162 Da, the entire sugar moiety, displays the improvements achieved for sequence information. However, very limited information is obtained on the modification site, probably because of the difficult ionization conditions of the fragments bearing the sugar moiety. To summarize, in Table 3 all the most significant neutral losses observed in product ion spectra of peptides III, IV, and V are listed in order to better compare the fragmentation pattern of these glycopeptides.

Figure 3
figure 3

Multi-stage mass spectrometry analysis of peptide V [Lys7(1-deoxyfructopyranosyl)]CSF114. (a) MALDI-Orbitrap detection of precursor ion at m/z 2620.37. (b) MS/MS fragmentation of precursor ion obtained at CE 40%in the LTQ trap. The high mass region of the spectrum was magnified in order to highlight neutral losses in the sugar moiety. (c) MS3 spectrum of the peak at S 2458.45 corresponding to [MH+ – C6H10O5]

Table 3 List of the Monoisotopic Mass of the Most Significant Neutral Losses Observed in Product Ion Spectra (MS2) of Peptide III, IV, and V recorded on MALDI-LTQ Orbitrap, Using CID Fragmentation in the LTQ Orbitrap. Observed and Calculated Values are Listed in the Table. The Neutral Losses that are Not Detected in a Particular Case are Indicated with ND (Not Detected)

Conclusions

Sugar-conjugated peptides present a very characteristic pattern of neutral losses during CID fragmentation experiments. Herein, we propose the analysis of synthetic modified peptides with different sugar moieties by the use of multi-stage mass spectrometry on a MALDI-LTQ Orbitrap. The study allowed the identification of signature fragmentations to trace the nature of the monosaccharide moiety and the linking amino acid introduced at position 7 on the tip of the type I′ β-turn model peptide CSF114 in three different variants (III–V). To the best of our knowledge, CID fragmentation of a peptide with a β-D-glucopyranosyl moiety linked to an asparagine residue is deeply investigated for the first time. The inspection of spectra revealed a different behavior when comparing the fragmentation of this glucose-modified peptide to the N-acetylglucosamine-modified peptide the same position (position 7). The main difference was the more complex fragmentation pattern of the glucose-conjugated peptide comprising the neutral losses of formaldehyde and aldose units, even though some similarities are also evident, as the preferential cross-ring cleavage leading to the neutral loss of 120 Da.

The more common non-enzymatic glucose modification represented by protein glycation was also investigated by fragmenting a peptide carrying the isobaric modification of lysine residue at the same position 7 by the 1-deoxyfructopyranosyl moiety. Even in this case, the comparison revealed peculiar signature neutral losses, concerning the type of fragments and the relative abundance among them. It is worth noting that in this case the peak corresponding to the complete loss of the sugar moiety (162 Da) was preferred to the cross-ring cleavage (loss of 120 Da).

All these consideration might constitute some preliminary insights into the development of simple analytical methods to analyze aberrantly glycosylated antigens in autoimmune diseases. MALDI-MS has emerged as a rapid and reliable tool, already used in other biomarkers and diagnostics analytical methods. Here, the coupling of a MALDI source to the highly accurate FT analyzer allowed not only the univocal identification of autoantigens but also their fragmentation exploiting the efficiency of the linear ion trap fragmentation and the possibility of performing multi-stage fragmentation events.