Introduction

Wheat grain proteins are divided into two major groups: non-gluten proteins (20%) and gluten proteins (80%). Non-gluten proteins are soluble proteins (albumin and globulin), playing mainly structural and metabolic functions and a minor role in wheat quality [1]. These proteins might be involved in IgE-mediated food allergies such as Baker’s asthma and intestinal inflammation [2, 3]. Gluten proteins are a mixture of storage proteins (gliadins and glutenins), conferring the rheological properties to wheat dough. Gluten proteins are rich in glutamine (30 to 35%) and proline (10 to 15%) residues, which make them resistant to complete proteolytic digestion, a property that contributes to the immunogenic nature of gluten for patients with celiac disease (CD) [4,5,6]. Besides CD, gluten-related disorders also include wheat-dependent exercise-induced anaphylaxis, atopic dermatitis, urticaria, and gluten sensitivity [7, 8].

Accurate, reliable, and sensitive detection methods for gluten are, indeed, mandatory to support current EU regulations [9]. Enzyme-linked immunosorbent assays (ELISA) remain the method of choice [10]. However, the accuracy of commercially available ELISA kits is controversial due to the lack of certified reference material; cross-reactivity of antibodies; and large variety of kit with different antibody specificity, extraction conditions, and matrix effects [11, 12]. Over the last decades, the field of proteomics has seen a huge expansion and has been driven by the development of new, mostly MS-based, technologies for protein identification, separation, and quantification [13, 14]. Mass spectrometry–based approaches have then emerged as a core tool for large-scale gluten analysis owing to their specificity, sensitivity, and ability to identify hydrolyzed gluten [15]. Anyway, for a complete proteomic analysis, a full description of the compounds generated by gluten digestion is still lacking.

Considering that gluten is entrapped in a complex matrix of starch, protein, and lipids, the surface area accessible for digestive enzymes is very much dependent by the food matrices [16, 17]. This makes particularly problematic a reproducible identification and quantification of peptides deriving from digestion, which is particularly important in the case of gluten epitopes associated with CD. Ideally, digestion should be studied in vivo but this is not always possible for ethical and economic reasons. In vitro digestion models mimicking the gastrointestinal tract have been widely used to study the digestive fate of gluten, from single static systems to multi-compartmental and dynamic systems. Nevertheless, the diversity of the systems used makes the comparison between the different studies almost impossible [12, 18,19,20,21]. For quantitative proteomics applications, the reproducible generation of peptides is closely related to several parameters including the type and amount of enzymes, salts, and pH values used in these methods, which resulted in substantial variability [12]. The type of cleavage enzyme was extensively studied because it greatly influenced both the sequence and the amounts of wheat peptides identified after digestion. For instance, 341 peptides were identified with chymotrypsin, 407 with thermolysin, and 105 with trypsin [18], while a multi-enzymatic digestion (LysC, trypsin, and chymotrypsin) enabled the identification of 434 peptide sequences from gluten [20].

Moving toward a standardized model, COST INFOGEST network has developed an internationally harmonized static model that simulates digestive processes by defining key parameters and conditions [22]. In order to test this model for assessing the digestive fate of gluten (pure extracts, or within wheat flour), several samples were analyzed in this work, mostly to test two hypotheses: (i) if gluten digestion within wheat matrix or as an extract modifies the resulting peptides/protein profile; and (ii) if changes in protein accessibility lead to changes in the amounts of peptides associated with CD. To this end, the sequential extraction of wheat proteins was carried out to obtain soluble fraction (SP), gliadins (GLIA), and glutenins (GLUT) as well as the pellet was recovered. These extracts were dried and then subjected to digestion. A mixture of the extracts and the pellets (MIX) was remixed, dried, and digested. These samples (SP, GLIA, GLUT, PELLET, and MIX) were compared to the digesta of whole wheat flour (WWF). To track to the fate of gluten, an extensive characterization (qualitative and quantitative) of the peptides generated after digestion using both targeted and untargeted proteomics methods was used. Untargeted proteomics was used to obtain the global peptide profile of the digested wheat protein samples, while targeted proteomics was used to specifically quantify peptides which contain sequences able to trigger celiac disease.

Material and Methods

Plant Material

Grains of durum wheat (Triticum durum Desf.) of different pure varieties (Saragolla, Svevo, Maestrale, Creso, Simeto, and Cappelli) were milled using a laboratory mill (Ika Werke, Staufen, Germany). Whole wheat fine flour was sieved (160 μm sieve), and then stored in plastic bags at 4 °C until analysis. Table 1 summarizes some characteristics of the samples, in which they were ordered according to the release date.

Table 1 Pedigree and Year of Release of the Studied Wheat Genotypes

Sequential Extraction of Wheat Proteins

The sequential extraction of wheat proteins was performed as previously described by [23] with minor modifications. Flours (100 mg) were extracted with a buffered salt solution (2 times × 1 mL 0.067 mol/L K2HPO4/KH2PO4-buffer, 0.4 mol/L NaCl, pH = 7.6) at 22 °C (room temperature) to obtain soluble fraction (albumin and globulin, SP). The residues were extracted with ethanol (60%, v/v; 3 times × 0.5 mL) at 22 °C (gliadins, GLIA) followed by the glutenin extraction solvent (2 times × 1 mL; 50% (v/v) 1-propanol, 0.1 mol/L TRIS-HCl, pH 7.5, 0.06 mol/L (w/v) dithiothreitol (glutenins, GLUT). After addition of the respective solvent, each flour suspension was vortexed for 2 min, stirred for 30 min, and centrifuged for 20 min at 3550g and 22 °C. The pellet was also recovered. Protein fractions (SP, GLIA, and GLUT) and the pellets were dried under nitrogen and stored until analysis.

For each variety, mixtures (MIX) were prepared by combining the total volume of the protein extracts (SP, GLIA, and GLU) and their corresponding pellets, and then dried under nitrogen flux and stored until analysis.

In Vitro Gastrointestinal Digestion

Dried samples (protein fractions (SP, GLIA, and GLUT), pellets, mixtures) and whole wheat flours (WWF) were subjected to simulated gastrointestinal digestion [22]. In brief, 1 g of WWF was incubated 2 min with 1 mL simulated saliva containing porcine amylase (Sigma-Aldrich, St. Louis, MI, USA; 75 U/mL of digesta); then, 2 mL of simulated gastric juice containing porcine pepsin (Sigma-Aldrich, St. Louis, MI, USA; 2000 U/mL of digesta) was added and the sample was incubated for 2 h after adjusting the pH to 3. Subsequently, 4 mL of duodenal juice containing porcine pancreatin (Sigma-Aldrich, St. Louis, MI, USA; 100 U trypsin activity/mL of digesta) and porcine bile (Sigma-Aldrich, St. Louis, MI, USA; 10 mmol/L in the total volume) was added and the sample was incubated for 2 h after adjusting the pH to 7. All the digestion steps were carried out at 37 °C under constant gentle mixing. Then, to inactivate the enzymes, the sample was boiled for 10 min at 95 °C. After centrifugation (3220g, 4 °C, 45 min), 295 μL of each sample supernatant was added to 5 μL of internal standard solution (TQQPQQPF(d5)PQQPQQPF(d5)PQ; 1.6 mmol L−1). The standard is the labeled form (on the two phenylalanine residues) of the most abundant immunogenic peptide, which was synthesized following the method of Prandi and others [24].

Protein Profile

Protein contents of samples were quantified using a Qubit fluorometer (Invitrogen, Grand Island, NY, USA). The volume of gel-ready protein was computed considering that the quantity required in each well was 20 μg. If the required volume was > 10.5 μL, it was dried under nitrogen flux and then resolved in 15 μL of reducing buffer solution (protein concentration in the sample buffer was 1.3 μg/μL). Protein fractions (SP, GLIA, and GLUT) before and after digestion as well as the digests of pellets, mixtures, and whole wheat flours were subjected to SDS-PAGE [25].

Peptidomic Profile: UHPLC/ESI-MS/MS Analysis

The digests were chromatographically separated by reverse-phase (RP) column (Aeris Peptide 1.7 μm XB-C18, 150 × 2.10 mm, Phenomenex, Torrance, CA, USA) in an UHPLC/ESI-MS/MS (Dionex Ultimate 3000, Thermo Scientific, Waltham, MA, USA, equipped with a triple quadrupole TSQ Vantage (Thermo Scientific, Waltham, MA, USA). Eluent A was water with 0.1% formic acid and 0.2% acetonitrile, eluent B was acetonitrile with 0.1% formic acid and 0.2% water; gradient 0–7 min, 100% A; 7–50 min, from 100% A to 50% A; 50–52.6 min, 50% A; 52.6–53 min, from 50% A to 0% A; 53–58.2 min, 0% A; 58.2–59 min, from 0% A to 100% A; 59–72 min, 100% A. Product ion scan modality was carried using a variable collision energy based on the mass and charge of the ion to be fragmented. The setting of the UHPLC/ESI-MS/MS analysis was flow 0.2 mL/min; analysis time 72 min; column temperature 35 °C; sample temperature 18 °C; injection volume 3 μL; acquisition time 7–58.2 min; ionization type positive ions; scan range 100–1500 m/z, micro scans 1, scan time 0.50, Q1 PW 0.70, spray voltage 3200 V, capillary temperature 250 °C, vaporizer temperature 250 °C, sheath gas flow 22 units. The peptide sequences were assigned based on the mass spectra obtained. Briefly, the software FindPept (http://web.expasy.org/findpept/) was used to find the peptide sequences whose molecular weight matched with the experimental data. Then, the software Proteomics Toolkit (http://db.systemsbiology.net/proteomicsToolkit/FragIonServlet.html) was used to verify the correspondence between the theoretical MS/MS fragmentation and the obtained spectra.

Untarget Peptidomic: LTQ-OrbiTrap Analyses

Digested samples were analyzed by μHPLC-LTQ-ORBITRAP using a C18 column (Phenomenex Jupiter 4 μm Proteo 90 Å 150 mm × 0.3 mm) equipped with an enrichment cartridge (μ-Precolumn Cartridge, Acclaim PepMap100 C18 5 μm, 100 Å, 300 μm × 5 mm; loading flow 30 μL/min, 50% eluent A and 50% eluent B). Eluent A was water with 0.2% formic acid and eluent B was acetonitrile with 0.2% formic acid (gradient 0–4 min 10% B, 4–60 min linear from 10% B to 50% B, 60–62 min from 50 to 95% B, 62–72 min 95% B, 72–73 min from 95% B to 10% B, 73–82 min 10% B). The analysis parameters were flow 5 μL/min; analysis time 82 min; column temperature 35 °C; sample temperature 10 °C; injection volume 5 μL; acquisition time 0–72 min; ionization type positive ions. High-resolution mass spectrometry (HRMS) acquisition was performed through 5 subsequent events: event 1: full scan acquisition from 250 to 2000 m/z in high-resolution mode (resolution at 400 m/z = 30,000); events from 2 to 5: data-dependent scan, at each cycle, the four most intense ions (with charge z > 1 and with a minimum signal of 500 counts) identified in event 1 are fragmented. The same ion (tolerance 10 ppm and isolation window 2 m/z) can be observed for a maximum of 2 cycles, and then it is automatically inserted in the exclusion list for a maximum time of 20 s. Fragmentation is performed in the linear trap of the instrument in CID mode with collision energy of 35. Proteins were identified either with Peaks Studio (Bioinformatics Solutions, Waterloo, ON, Canada). Parameters were precursor ion tolerance 5 ppm, fragment ion tolerance 0.8 Da, decoy database search: strict 0.01, relaxed 0.05, fixed modifications: cysteine carbamidomethylation, variable modifications: methionine oxidation, hydroxyproline, and hydroxylysine. The spectral datasets were searched against the Poaceae. Results filtration parameters were peptide − 10lgP ≥ 15, protein − 10lgP ≥ 20, protein-unique peptides ≥ 0, de novo ALC score ≥ 50. False discovery rates were as follows: soluble proteins (FDR peptide-spectrum matches 23.0%, FDR peptide sequences 38.9%, FDR protein 5.5%), gliadins (FDR peptide-spectrum matches 6.7%, FDR peptide sequences 12.0%, FDR protein 3.0%), glutenins (FDR peptide-spectrum matches 12.2%, FDR peptide sequences 13.3%, FDR protein 0.0%), pellet (FDR peptide-spectrum matches 0.0%, FDR peptide sequences 0.0%, FDR protein 0.0%), mix (FDR peptide-spectrum matches 2.1%, FDR peptide sequences 3.3%, FDR protein 0.0%), and whole wheat flour (FDR peptide-spectrum matches 2.7%, FDR peptide sequences 6.5%, FDR protein 0.0%).

Target Peptidomic: UPLC/ESI-MS Analysis

Digested samples were separated using UPLC/ESI-MS system (ACQUITY Ultra-Performance UPLC with a single quadrupole mass spectrometer SQD, Waters Milford, MA, USA) a RP column (ACQUITY UPLC BEH 300, C18, 1.7 mm, 2.1 × 150 mm; Waters, Milford, MA, USA) equipped with a VanGuard Pre-column (ACQUITY UPLC BEH C18, 130 Å, 1.7 μm, 2.1 × 5 mm, Waters, Milford, MA, USA). The gradient elution is made with eluent A (bi-distilled water solution with 0.1% formic acid and 0.2% acetonitrile 0.2%) and eluent B (acetonitrile solution with 0.1% formic acid). Gradient elution was carried out as follows: 0–7 min 100% eluent A; 7–50 min from 100 to 50% eluent A; 50–52.6 min 50% eluent A; 52.6–53 min from 50 to 0% eluent A; 53–58.2 min 0% eluent A; 58.2–59 min from 0 to 100% eluent A; 59–72 min 100% eluent A. The digests were analyzed with UPLC/ ESI-MS in the Full Scan mode. Flow is 0.2 mL/min; analysis time 72 min; column temperature 35 °C; sample temperature 18 °C; injection volume 5 μL; acquisition time 7–58.2 min; ionization type is positive ions; scan range 100–2000 m/z; capillary voltage 3.2 kV; cone voltage 30 V; source temperature 150 °C; desolvation temperature 300 °C; cone gas flow 100 L/h; desolvation gas flow 650 L/h. Two determinations were performed for each sample. For CD-related peptide quantification, the areas of the identified peptides and internal standard (TQQPQQPF(d5)PQQPQQPF(d5)PQ; 1.6 mmol L−1) were integrated with the MassLynx software. Due to the high number of peptides (n = 10) to be quantified, the main immunogenic peptide found (IP4) was synthesized and used as labeled standard.

Statistical Analysis

To study significant differences among the variables, analysis of variance (ANOVA) was performed, with a confidence interval of 95% (p ≤ 0.05). Principal components analysis was performed based on the correlation matrix. All the statistical analyses were determined using the program SPSS for Windows (Version 24.0, SPSS Inc., Chicago, IL, USA).

Results and Discussion

Efficiency of Gluten In Vitro Digestion

Wheat proteins were extracted using a modified Osborne protocol for the recovery of soluble fraction, gliadin, and glutenin. To assess the efficiency of gluten digestion, protein fractions were subjected to SDS-PAGE before and after in vitro digestion (Figure 1).

Figure 1
figure 1

Changes in protein profile before and after digestion (Cappelli is shown as example) (Ext, extract; Dig, digest)

The characteristic bands for undigested protein fractions were observed at the corresponding MW ranges of 7,000–97,000 for soluble proteins, 31,000–66,000 for gliadins, and 14,000–120,000 for glutenin. After in vitro digestion, only minor traces were observed in the digested pure protein fractions due to their severe hydrolysis by digestive enzymes.

Also, the recovered pellet from the sequential extraction of proteins was digested and analyzed using SDS-PAGE (Figure 1), and in this case, the results showed that after digestion, no bands were anymore detectable. On the other side, in the case of the mixture obtained by blending all protein extracts and pellet, some bands were observed at MW ranges of 25,000–66,000 Da, indicating an incomplete digestion. The same was also observed for the whole flour digest, with the observed bands ranging from 14,000–97,000 Da. Therefore, it can be concluded that protein profile after digestion is closely related to the matrix surrounding the proteins.

The Fate of Gluten

A combined peptidomic approach was used to track the fate of gluten after digestion. To this end, the digests were chromatographically separated by UPLC/ESI-MS to select all the peptide ions that yielded detectable peaks. For each of these peaks, the molecular weight was determined.

By comparing the peak profile among the different digests, it was immediately apparent that, qualitatively speaking, the detected profiles did not vary among the different wheat varieties used, whereas differences were observed among the different extracts and total wheat flour within each variety. As an example, total ion chromatograms obtained from UPLC/ESI-MS analysis of the digests relative to the variety Cappelli are shown in Figure 2. For the acquisition time, from 0 to 7 min of the chromatographic run was diverted to waste due to the high salt and sugar content; from 12 to 30 min, peptides ranging from MW 200 to 3,600 were eluted; and from 35 to 72 min, bile salts deriving from the digestive solutions were present. To compare the peak profile of the six digests (SP, GLI, GLU, PEL, MIX, and WWF), exclusively, the segment ranging from 12 in to 30 min was analyzed. It is very easy to see that the relative response of peptides was different between the different digests in this range, which can be associated with the compositional differences among the digested samples (different samples with varying amount of each component to start with) (Figure 2).

Figure 2
figure 2

Total ion chromatograms obtained from UPLC/ESI-MS of the different digests (Cappelli is shown as example). Peptides elute from 12 to 30 min

In this range, peaks were significantly higher in WWF and in the pellet. The digest of the mix was different, showing less intensity in the peptide area. As for protein fractions, the chromatograms of gliadin and soluble proteins were apparently very similar, whereas that of glutenin showed less peaks from 7 to 15 min and no peaks at all beyond 15 min.

Then, the sequences of the most relevant peptides were determined through a further UHPLC/ESI-MS/MS and LTQ-OrbiTrap analysis, by determining the single specific sequences by analysis of the independent fragment peaks matching the theoretical peptide fragments (Supp. Material 1). This combined MS-based peptidomic approach enabled the identification of 227 sequences (Supp. Material 2) belonging to several wheat proteins, thereby giving a good representation of the wheat gluten peptidome after digestion. The number of peptides and the type of proteins differed among the different digests (Table 2). In soluble protein digests, the 27 identified peptides were mainly fragments from alpha-amylase/trypsin inhibitor CM3 and beta-amylase (85%), γ-gliadin (11%), and LMW-glutenin (4%). CM3 and beta-amylase, in fact, belong to the soluble fraction of wheat proteins. Small amount of γ-gliadin and LMW-glutenin were also probably co-extracted during the fractionation procedure. The peptides identified in the digest of gliadin were almost equally resulting from α- and γ-gliadin (49% and 42%, respectively) followed by β-gliadin (5%) and LMW-glutenin (5%), consistently with their solubility features. The sequences found in the glutenin digests were mainly from LMW-glutenin (77%), followed by α-gliadin, (14%) and γ-gliadin (14%). Most peptides identified in the digested pellet were from α-gliadin (80%), followed by γ-gliadin (13%) and HMW-glutenin (7%). As for the digest of the mixture, peptides were mainly from γ-gliadin (45%), α-gliadin (30%), and HMW-glutenin (24%). Digestion of the whole wheat flour generated 80 identifiable peptides, comparable to previous findings ([12] (84 peptides); [26](77 peptides)). These peptides were mostly deriving from α-gliadin (78%), γ-gliadin (13%), and HMW-glutenin (8%). Few peptides deriving from non-gluten proteins were also identified, arising from α-amylase/trypsin inhibitor CM3. No peptides from the ω-gliadin were identified in any of the digested samples.

Table 2 Identified Peptides (%) in the Digests of Protein Fractions, Pellet, Mix and Whole Wheat Flour

The fractionation of wheat proteins allowed to identify more peptides (27 (SP) + 43 (GLI) + 14(GLU) + 30 (PELLET) = 114 peptides), when compared to that of digested whole wheat flour (80 peptides), maintaining the same acquiring time for the data.

Results showed different types and relative amounts of peptides in the fractions, pellet, and mix. This finding is suggestive for the fact that gluten protein digestion is hampered by the matrix effect in whole flour. This can be due to a limitation in enzyme accessibility to proteins, or to an increased competition among them for accessing the enzyme-active sites.

Peptides Eliciting Adverse Reactions in CD Patients

Identification of Peptides Associated with CD

In the context of CD, some gluten peptides were identified in the digests already known in literature to be involved in the adaptive or the innate immune response. Table 3 summarizes the list of gluten peptides associated with CD, which were identified in the digested samples, which was consistent with previous works [24, 26, 27]. These peptides may be subdivided into two groups: sequences triggering the adaptive or the immune response. Seven immunogenic peptides deriving from γ-gliadin were identified as involved in the adaptive immune response (“immunogenic peptide,” TI), containing within their sequences the epitope DQ2.5-glia-γ4c (QQPQQPFPQ) [28]. Three peptides deriving from α-gliadin were identified as involved in the innate immune response (“toxic peptides,” TT), containing within their sequences toxic epitopes (PSQQ, QQQP, QQPY, or QPYP) [29].

Table 3 Identified Peptides Associated with CD

The 33-mer peptide (LQLQPFPQPQLPYPQPQLPYPQPQLPYPQPQPF) from α2-gliadin is generally referred as the most immunodominant gluten peptide (Arentz-Hansen et al., 2000; Shan et al., 2002). In the case of CD patients, the ingestion of gluten can result in the initiation of a strong immune response because it contains three T cell epitopes (PFPQPQLPY (DQ2.5-glia-α1a), PYPQPQLPY (DQ2.5-glia-α1b), and PQPQLPYPQ (DQ2.5-glia-α2)) [30, 31]. However, the complete fragment of 33-mer was not identified in all the studied digests deriving from durum wheat. Indeed, several studies [23, 32] found the entire sequence of this peptide exclusively in common wheat (Triticum aestivum) flour because it is encoded by D-genome.

Quantification of Peptides Associated with CD

Through target proteomic (UPLC/ESI-MS), performing a quantification based on a labeled internal standard, the changes in the contents of peptides associated with CD were monitored among the different digests (Table 4 (peptides triggering adaptive immune response); Table 5 (peptides triggering innate immune response)). Regarding peptides triggering the adaptive immune response (Table 4), digests of whole wheat flours recorded the highest amounts than those deriving from the extracts. Such a result might be attributed to the fact that the more complete digestion in the extracts rather than WWF allows digestive enzymes to reduce immunogenic peptides into smaller sequences. In the case of WWF, immunogenic peptides are encased within a compact network with starch particles thereby likely reducing their accessibility toward digestive enzymes. As well, the mixture (MIX = SP + GLU + GLU + PELLET) contained less amounts of peptides compared to the whole wheat flour (WWF), which can be attributed to the fact that extraction possibly increased the availability of proteins to be cleaved by the digestive enzyme. The same trend was observed in all the six studied varieties. The amount of peptides in soluble fraction was found to be fluctuating with no clear trend, consistently with the gluten peptides being present in this fraction only as contaminants, or/and the fact that globulins and gliadins shared some common sequences [4, 33]. Such results can suggest that the extraction method greatly influences not only the identified peptides but also their amounts. Peptides triggering the innate immune response which were identified, listed in Table 3, derive from α-gliadin, which are the smallest ones among the gluten proteins. Thus, the salt solution probably allows their extraction, while for higher MW proteins (γ-gliadin and glutenins), the use of aqueous alcohol or denaturing agents is needed.

Table 4 Monitoring Peptides Triggering Adaptive Immune Response (ppm) Associated with CD in the Digests
Table 5 Monitoring Peptides Triggering Innate Immune Response (ppm) Associated with CD in the Digests

A better picture was obtained by putting together all the results through principal component analysis (PCA). The first principal components (PC1 and PC2) explained 86% of the total variability. PC1 (44%) was explained by three peptides triggering adaptive immune response (PQTQQPQQPFPQFQQPQQPFPQPQQP, FPQQPQLPFPQQPQQPFPQPQQPQ, and QQPQQPFPQPQQTFPQQPQLPFPQQPQQPFP) and two peptides triggering innate immune response (RPQQPYPQPQPQ and LQPQNPSQQQPQEQVPL), while PC2 (42%) was a function of four peptides triggering the adaptive immune response (TQQPQQPFPQ, SQQPQQPFPQPQ, QAFPQQPQQPFPQ, and TQQPQQPFPQQPQQPFPQ) and the remaining peptide triggering innate immune response (LQPQNPSQQQPQ). The biplots of the first two components (Figure 3a) showed that all the peptides were located in the same quadrant. The overlapping of the digests’ scores on the consensus space generated by PC1 and PC2 is presented in Figure 3b. A clear clustering was obtained, where four groups were separated: Group 1 gathered digested WWF, which are characterized by the highest content of peptides associated with CD; group 2 was made by the digested mixtures; group 3 included the digested fractions (SP, GLI, and GLU); and group 4 was made with digested pellet, which was located in the opposite side of WWF. It is interesting to note that the digestion of the extracts (SP, GLI, and GLU) leads at having of all the six varieties tested much closed together, overwhelming possible inter-genotype variations. The digestion of WWF, indeed, allows to better separate the different genotypes on the score plot, enhancing the peculiarities of each variety, as already demonstrated in previous papers [24, 26, 32, 34].

Figure 3
figure 3

Principal component analysis. (a) Load plot of principal component analysis explaining 86% of total variability. (b) Score plot with the wheat samples projected onto the first two principal components

Conclusion

Combining multiple enzymatic digestion and MS is a powerful tool that enabled the full peptidomic description of the digest mixtures, and the detection and the quantification of gluten peptides associated with CD. Particularly, 227 peptides deriving from gluten were identified, in which 9 were known to be involved in offsetting CD. The quantification of these peptides in the digested extracts and whole wheat flour showed significant difference. For both groups, the peptides eliciting the innate and adaptive immune responses, the superiority of the amounts of these peptides was maintained in the digested whole wheat flour compared to digested mixture, independently of wheat genotype. Such findings indicate a significant effect of the matrix on protein availability and thereby their digestibility. Therefore, gluten deriving material (extracts or within matrix) greatly influenced the efficacy of in vitro digestion, with profound effects on the composition of the mixture and the outcome for CD patients and other subjects sensitized to gluten. Overall, by demonstrating that the full wheat matrix leads to generation of more peptides triggering CD, it also outlines how, properly working on the matrix, it might be possible to enhance the digestion of toxic and immunogenic peptides, thereby reducing the adverse potential of wheat to sensitized subjects.