Abstract
Tandem mass spectrometry of peptides is of utmost importance in proteomics. Collision-induced dissociation usually generates y type fragment ion series from tryptic peptides, carrying information on their primary structure. Amino acid side chains or differences in their basicity could alter fragmentation processes considerably. The well-known proline effect is a cleavage preference at the N-terminus of proline residues in peptides, usually yielding a very abundant y ion while suppressing others. Previously, we reported a similar phenomenon occurring at the C-terminus of citrulline residues and coined the term Cit effect. To confirm the presence of Cit effect in large proteomic datasets, we analyzed 293 peptides containing Cit residues based on the human proteome database mining work of Lee et al. (2018). The occurrence of Cit effect was found to be 44%. Comparing bond scissions at the amide linkage between Cit-Zzz (citrulline followed by a specified residue) to Aaa1-Aaa2 (Aaa can be any residue except Cit), 5 Cit-Zzz cleavages were significantly (CL = 95.0%) more frequent in > 85% of the cases in terms of relative sequential base beak occurrence. We used Pro effect to compare with Cit effect and obtained very similar results. On the other hand, our study showed that Cit effect is slightly inferior in the overall incidence to Pro effect (50% vs. 33%, CL = 95%) among deiminated peptides when Pro residues were also present in the sequence. Our results suggest that Cit effect is a characteristic feature and a possible biasing factor of deiminated peptides which can confirm the position of citrullination sites.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
Protein citrullination or deimination is a post-translational modification (PTM) converting certain arginines to citrullines [1]. While citrullination was proved to be crucial in myelinization, immune and gene regulation, as well as in skin homeostasis, both hypercitrullination and hypocitrullination have been linked to pathological conditions in humans and animals too [2]. The strongest evidence is for the involvement in rheumatoid arthritis in which autoimmune response is triggered against citrullinated proteins [2–3]. Further associations have been found in multiple sclerosis [4–5], other neurodegenerative diseases [6–7], cancer [8–9], and heart failure [10]. The modification results in a loss of a positive charge and a small increase in molecular mass (0.9840 Da) per occurrence, which could lead to alteration in protein function and structure. Characterization of this low-abundance PTM by LC-MS experiments is still problematic [11–12]. A selective loss of isocyanic acid (HNCO) was previously observed for deiminated peptides upon collision-induced dissociation (CID) by Hao et al. [13] by which discrimination between citrullination and the isobaric transition, deamidation of Asn and Gln residues might have become possible via MS/MS. An alternative of CID, electron-transfer dissociation (ETD) technique usually results in better sequence coverage; however, a selective neutral loss for deiminated peptides in ETD is lacking [14]. Creese et al. used the neutral loss of isocyanic acid as a marker to select parent ions for a further ETD fragmentation process by supplemental activation (saETD) [15]. Although false positive hits were eliminated this way, false negative rate was still substantial; therefore, optimization of MS/MS identification of deiminated residues is still a key research field.
Followed by enzymatic digestion of proteins, sample preparation, and chromatographic separation, peptides are usually identified via generation of b and y ion series as a result of amide cleavages [16–17] by CID. These ion series are characteristic to the sequence, and identification scores largely rely on them. Certain amino acids induce cleavages at their N- or C-termini with a higher preference, yielding the corresponding b or y ion with high intensity. These processes may hinder proper identification since they may suppress other valuable fragmentation routes. Proline effect which results in intensive peaks corresponding to the cleavage at the N-terminus of Pro residues (Aaa-Pro) due to the higher basicity of Pro has been long time known for tryptic peptides [18]. We previously reported a similar phenomenon for Cit residues upon fragmenting Cit-containing model peptides, but in these cases, the cleavage occurred at the C-terminus of Cit residues (Cit-Aaa) [19] (Figure 1) as in the case of the aspartic acid effect, where the preferred cleavage is at the Asp-Aaa linkage [18]. However, our previous sample size was not adequate (only 11 citrulline-containing synthetic model peptides were used), data were lacking on the overall occurrence, and the hypothesis was not tested on real proteomic samples before. Also, we could not determine whether the amino acid following Cit had an impact on Cit effect or not.
In 2018, a deep proteome mining was performed by Lee et al. exploring the human citrullinome for 30 tissues by a thorough examination of MS/MS spectral repositories with rigorous methods [20]. This group also exploited the presence of Cit-selective iminium ions in the MS/MS spectra of deiminated peptides to confirm the Cit-content of the hits. This was proved to be advantageous to filter out false positive (deamidated) peptides. Therefore, we used their published data of validated Cit-peptide sequences for our statistical evaluation omitting repetitive sequences to improve our knowledge concerning the nature of Cit effect.
Experimental Procedures
Experimental Design
The list of valid citrullinated peptides from proteomic data published by Lee et al. was used for our statistical evaluation (Table S5 in reference 20). Identical sequences originated from different human tissues were counted only once. Peptide sequences with matched and annotated MS/MS spectra were included only from PRIDE archive (data set identifier: PXD008970; filename: F095332_all_tissues_cit_extracted.dat) [21] via the use of PRIDE converter [22] and PRIDE inspector [23]. Peptides involved in our calculations were of tryptic origin with only few exceptions. No other selection criteria were used to avoid further possibility of sampling error. The 293 peptides included in our experiment are listed in Table S1.
For clarity and ease of interpretation, we defined the term sequential base peak (BP*). A sequential base peak is the most intensive sequentially informative ion (except for a few cases, these were y ions). Thus, sequences could be sorted either producing a BP* cleavage of a given amino acid linkage or not (dichotomous set). The proportion of the number of peptides where BP* peaks correspond to a given Zzz1-Zzz2 cleavage and the overall number of peptides containing that given amino acid connection were calculated (relative BP* occurrence or population proportion). Post-translationally modified amino acids were only taken into consideration if they were interconversions; e.g., deamidated Asn and Gln were considered Asp and Glu residues respectively. Please note that Aaa stands for any alpha amino acid while Zzz denotes a specific one.
Theory
The statistical evaluation for determining the overall occurrence of Pro and Cit effect was done by a Wilson score method without continuity correction (CL = 95%) [24]. The approximate binomial calculation concerning Cit-Zzz and Aaa1-Aaa2 relative BP* occurrences for confidence intervals (CI) was also performed by Wilson score method without continuity correction (CL = 95%). This method is also applicable in case of low sample sizes’ proportions. The comparison between Cit-Zzz vs. Aaa1-Aaa2 proportions was carried out via an interval estimation for difference between two independent proportions using the Newcombe-Wilson method without continuity correction [25]. Aaa1-Aaa2 connections with an occurrence lower than N = 10 were excluded. This resulted in 172 Aaa1-Aaa2 links altogether. To eliminate the additive effect of the amino acid following citrulline on enhanced cleavage probability, we also compared the Cit-Zzz vs. Aaa-Zzz connections. This means, e.g., that Cit-Ala connections were investigated against all Aaa-Ala linkages to rule out that the effect is influenced solely by the presence of Ala residues. A positive value of percentage for both the lower and upper bound CI was accepted as statistically and practically significant and means the superiority of cleavage preference for the given Cit-Zzz vs. Aaa1-Aaa2. A negative percentage for the lower bound and a positive for the upper one were considered ambiguous (overlying CIs, or comparable cases). If both bounds had negative values, the Cit-Zzz cleavage was considered significantly less preferred than the comparator Aaa1-Aaa2. In order to compare our results with the earlier described, often observed Pro effect, we made our calculations for Zzz-Pro vs. Aaa1-Aaa2 as well.
Results and Discussion
In this work, sequential base peak (BP*) occurrences were determined for a number of natural deiminated peptides to obtain data on the amino acid effect of Cit. The relative BP* occurrence for all 293 peptides indicating Cit effect was found to be 44%. A typical spectrum showing a marked Cit effect is depicted in Figure 2. Data evaluation was carried out by the software “mMass” [26]. Six connections, Cit-Ala, Cit-Asp, Cit-Phe, Cit-Gly, Cit-Asn, and Cit-Ser amide bonds, were cleaved significantly more often to yield a BP* in > 85% of the cases compared with any cleavages between two given types of amino acids (Aaa1-Aaa2) included in our investigations (Table 1). For instance, Cit-Asp bond was found to be superior to 151/171 (87%) possible Aaa1-Aaa2 linkages. The most pronounced occurrence was observed for Cit-Gly and Cit-Ser. None of these six linkages was found to be inferior to that of any Aaa1-Aaa2. Five amide bonds including Cit-Ala, Cit-Asp, Cit-Gly, Cit-Asn, and Cit-Ser satisfied the criteria to be superior to their corresponding Aaa-Zzz category (e.g., Cit-Ala vs. Aaa-Ala, Cit-Asp vs. Aaa-Asp etc.) (Table S2), although Aaa does not often cover the whole set of amino acids possibly due to the low incidence of citrullinated peptides [12] and high sequence similarity of them. The amide bond at Cit-Glu was also found to be favorable for enhanced cleavage in 72% of possible linkages. It was found to be significantly superior than any other Aaa-Glu connection with the exception of Met-Glu. The increase of relative BP* vs. control varies greatly for the individual cases (Table S2).
The median strength of occurrence for cases when Cit effect was superior to other cleavages is also shown in Table 1, represented by the median of the lower and upper value of Newcombe-Wilson confidence interval. These values demonstrate an average ability to produce BP* for the connections described above vs. any amide bonds. For example, the Cit-Gly amide bond is cleaved to yield the corresponding y ion as a BP* at least with an average of ~ (36–74)% more often than any other amide bonds. Interestingly, no BP* was observed for any Aaa-Cit connection further confirming that Cit-Zzz cleavage is preferred instead in case of tryptic peptides. Aspartic acid effect [19] was also not prevalent among deiminated peptides used in this study (Table S2).
Manual inspection also revealed that Cit effect was the most prominent fragmentation pathway when Cit was followed by a His or Pro residue due to an additive His and Pro effect resulting in suppressed fragment ion intensity for other ions, but these linkages seem to be underrepresented in Cit-containing peptides and were excluded from the statistical investigation.
By comparison of Cit and Pro effect, it was found that the latter was more frequent than the former regarding overall relative BP* occurrences, when both Cit and Pro residues were present in the sequences (50% vs. 33%; CIPro = 42–58%, CICit = 26–41%) (Figure 3). The data on Aaa-Pro were also collected and summarized (Table 2). The results are very similar to that of obtained for Cit effect. Remarkably, cleavage of Gly-Pro amide bond was highly unfavorable to yield a BP* (population proportion was only found to be 1/28). Cleavage of Arg-Pro bonds was also suppressed (0/12) which may be explained by the superior basicity of Arg to Pro, resulting in the proton sequestration by the former to yield b ions instead of the corresponding y one.
Conclusions
A simplified cleavage preference matrix for deiminated peptides was created to confirm our hypothesis on Cit-Zzz connection as a site of favorable amide bond scission in deiminated peptides using CID fragmentation. We used the largest validated dataset [20] available for this purpose collected by Lee et al. (2018). To avoid statistical distortion, we applied Newcombe-Wilson method—which is less sensitive for low sample sizes—for evaluating the difference between the confidence intervals based on sequential base peak generation. Almost all analyzed peptides were of tryptic origin. Our statistically and practically significant results indicate that our previous hypothesis could be accepted for at least 5 connections that are Zzz = Ala, Asp, Gly, Asn, Ser. These cleavage preferences are superior in occurrence to most of the other ones and correlate with the natural occurrence of Cit-Aaa links [20] which may implicate that citrulline effect is a universal phenomenon or there is a significant bias toward the identification of the peptides carrying these types of connections. Another explanation could be the strict substrate preference of deimination, yielding a low number of citrullinated peptides in a physiological system despite the large number of tandem mass spectra used by Lee et al. On the other hand, Cit effect was found to be slightly inferior to Pro effect in terms of occurrence when both residues were present in the peptides. After careful optimization, Cit effect may be a promising candidate to improve the identification of citrullination sites by, e.g., collision energy-stepping or charge state manipulation along with the previously described loss of isocyanic acid and the use of Cit-selective iminium ion in proteomic analyses.
Data Availability
Deiminated peptides that were used in our data analyses are listed in Table S1. Statistical calculations for all cases could be found in Table S2. By changing values x (number of favorable events, BP*) and N (number of events) in the cells highlighted in green, the Newcombe-Wilson lower and upper intervals are recalculated.
References
Nicholas, A., Bhattacharya, S.: Protein Deimination in Human Health and Disease, 1st edn. Springer-Verlag, New York, NY (2014)
Baka, Z., György, B., Géher, P., Buzás, E.I., Falus, A., Nagy, G.: Citrullination under physiological and pathological conditions. Joint Bone Spine. 79, 431–436 (2012)
Wegner, N., Lundberg, K., Kinloch, A., Fisher, B., Malmström, V., Feldmann, M., Venables, P.J.: Autoimmunity to specific citrullinated proteins gives the first clues to the etiology of rheumatoid arthritis. Immunol. Rev. 233, 34–54 (2010)
Moscarello, M.A., Mastronardi, F.G., Wood, D.D.: The role of citrullinated proteins suggests a novel mechanism in the pathogenesis of multiple sclerosis. Neurochem. Res. 32, 251–256 (2007)
Stys, P.K., Zamponi, G.W., van Minnen, J., Geurts, J.J.G.: Will the real multiple sclerosis please stand up? Nat. Rev. Neurosci. 13, 507–514 (2012)
Nicholas, A.P.: Dual immunofluorescence study of citrullinated proteins in Parkinson diseased substantia nigra. Neurosci. Lett. 495, 26–29 (2011)
Ishigami, A., Masutomi, H., Handa, S., Nakamura, M., Nakaya, S., Uchida, Y., Saito, Y., Murayama, S., Jang, B., Jeon, Y.-C., Choi, E.-K., Kim, Y.-S., Kasahara, Y., Maruyama, N., Toda, T.: Mass spectrometric identification of citrullination sites and immunohistochemical detection of citrullinated glial fibrillary acidic protein in Alzheimer’s disease brains. J. Neurosci. Res. 93, 1664–1674 (2015)
Thålin, C., Lundström, S., Seignez, C., Daleskog, M., Lundström A., Henriksson, P., Helleday, T., Phillipson, M., Wallén, H., Demers, M.: Citrullinated histone H3 as a novel prognostic blood marker in patients with advanced cancer. PLoS One, doi: https://doi.org/10.1371/journal.pone.0191231 (2018).
Qu, Y., Olsen, J.R., Yuan, X., Cheng, P.F., Levesque, M.P., Brokstad, K.A., Hoffman, P.S., Oyan, A.M., Zhang, W., Kalland, K.-H., Ke, X.: Small molecule promotes β-catenin citrullination and inhibits Wnt signaling in cancer. Nat. Chem. Biol. 14, 94–101 (2018)
Fert-Bober, J., Giles, J.T., Holewinski, R.J., Kirk, J.A., Uhrigshardt, H., Crowgey, E.L., Andrade, F., Bingham 3rd, C.O., Park, J.K., Halushka, M.K., Kass, D.A., Bathon, J.M., van Eyk, J.E.: Citrullination of myofilament proteins in heart failure. Cardiovasc. Res. 108, 232–242 (2015)
Clancy, K.W., Weerapana, E., Thompson, P.R.: Detection and identification of protein citrullination in complex biological systems. Curr. Opin. Chem. Biol. 30, 1–6 (2016)
Hermansson, M., Artemenko, K., Ossipova, E., Eriksson, H., Lengqvist, J., Makrygiannakis, D., Catrina, A.I., Nicholas, A.P., Klareskog, L., Savitski, M., Zubarev, R.A., Jakobsson, P.-J.: MS analysis of rheumatoid arthritic synovial tissue identifies specific citrullination sites on fibrinogen. Proteomics Clin. Appl. 4, 511–518 (2010)
Hao, G., Wang, D., Gu, J., Shen, Q., Gross, S.S., Wang, Y.: Neutral loss of isocyanic acid in peptide CID spectra: a novel diagnostic marker for mass spectrometric identification of protein citrullination. J. Am. Soc. Mass. Spectrom. 20, 723–727 (2009)
Zolg, D.P., Wilhelm, M., Schmidt, T., Médard, G., Zerweck, J., Knaute, T., Wenschuh, H., Reimer, U., Schnatbaum, K., Kuster, B.: ProteomeTools: Systematic characterization of 21 post-translational protein modifications by liquid chromatography tandem mass spectrometry (LC-MS/MS) using synthetic peptides. Mol. Cell. Proteomics. 17, 1850–1863 (2018)
Creese, A.J., Grant, M.M., Chapple, I.L.C., Cooper, H.J.: On-line liquid chromatography neutral loss-triggered electron transfer dissociation mass spectrometry for the targeted analysis of citrullinated peptides. Anal. Methods. 3, 259–266 (2011)
Roepstorff, P., Fohlman, J.: Proposal for a common nomenclature for sequence ions in mass spectra of peptides. Biomed Mass Spectrom. 11, 601 (1984)
Biemann, K.: Contributions of mass spectrometry to peptide and protein structure. Biomed. Environ. Mass Spectrom. 16, 99–111 (1988)
Medzihradszky, K.F., Chalkley, R.J.: Lessons in de novo peptide sequencing by tandem mass spectrometry. Mass Spectrom. Rev. 34, 43–63 (2015)
Steckel, A., Uray, K., Turiák, L., Gömöry, Á., Drahos, L., Hudecz, F., Schlosser, G.: Mapping the tandem mass spectrometric characteristics of citrulline-containing peptides. Rapid Commun Mass Spectrom. 32, 844–850 (2018)
Lee, C.Y., Wang, D., Wilhelm, M., Zolg, D.P., Schmidt, T., Schnatbaum, K., Reimer, U., Pontén, F., Uhlén, M., Hahne, H., Kuster, B.: Mining the human tissue proteome for protein citrullination. Mol. Cell. Proteomics. 17, 1378–1391 (2018)
Vizcaino, J.A., Csordas, A., Del-Toro, N., Dianes, J.A., Griss, J., Lavidas, I., Mayer, G., Perez-Riverol, Y., Reisinger, F., Ternent, T., Xu, Q.W., Wang, R., Hermjakob, H.: 2016 update of the PRIDE database and its related tools. Nucleic Acids Res. 44, 11,033 (2016)
Cote, R.G., Griss, J., Dianes, J.A., Wang, R., Wright, J.C., van den Toorn, H.W.P., van Breukelen, B., Heck, A.J.R., Hulstaert, N., Martens, L., Reisinger, F., Csordas, A., Ovelleiro, D., Perez-Riverol, J., Barsnes, H., Hermjakob, H., Vizcaino, J.A.: The PRIDE Converter 2 framework: an improved suite of tools to facilitate data submission to the PRIDE database and the ProteomeXchange consortium. Mol. Cell. Proteomics. 11, 1682–1689 (2012)
Perez-Riverol, Y., Xu, Q.W., Wang, R., Uszkoreit, J., Griss, J., Sanchez, A., Reisinger, F., Csordas, A., Ternent, T.: del-Toro, N., Dianes, J. A., Eisenacher, M., Hermjakob, H., Vizcaino, J. A.: PRIDE Inspector Toolsuite: moving toward a universal visualization tool for proteomics data standard formats and quality assessment of ProteomeXchange datasets. Mol. Cell. Proteomics. 15, 305–317 (2016)
Newcombe, R.G.: Two-sided confidence intervals for the single proportion: comparison of seven methods. Stat. Med. 17, 857–872 (1998)
Newcombe, R.G.: Interval estimation for the difference between independent proportions: comparison of eleven methods. Stat. Med. 17, 873–890 (1998)
Strohalm, M., Kavan, D., Novák, P., Volný, M., Havlíček, V.: mMass 3: A cross-platform software environment for precise analysis of mass spectrometric data. Anal. Chem. 82, 4648–4651 (2010)
Acknowledgements
The research was supported by the MTA Premium Post-Doctorate Research Program and by the MTA-MedInProt Programme of the Hungarian Academy of Sciences (HAS, MTA). This work was completed in the ELTE Institutional Excellence Program (783-3/2018/FEKUTSRAT) supported by the Hungarian Ministry of Human Capacities.
Funding
Open access funding provided by Eötvös Loránd University (ELTE).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
Steckel, A., Schlosser, G. Citrulline Effect Is a Characteristic Feature of Deiminated Peptides in Tandem Mass Spectrometry. J. Am. Soc. Mass Spectrom. 30, 1586–1591 (2019). https://doi.org/10.1007/s13361-019-02271-x
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13361-019-02271-x