Glycomics using mass spectrometry
- First Online:
- Cite this article as:
- Wuhrer, M. Glycoconj J (2013) 30: 11. doi:10.1007/s10719-012-9376-3
- 2.4k Downloads
Mass spectrometry plays an increasingly important role in structural glycomics. This review provides an overview on currently used mass spectrometric approaches such as the characterization of glycans, the analysis of glycopeptides obtained by proteolytic cleavage of proteins and the analysis of glycosphingolipids. The given examples are demonstrating the application of mass spectrometry to study glycosylation changes associated with congenital disorders of glycosylation, lysosomal storage diseases, autoimmune diseases and cancer.
KeywordsCancer Congentical disorders of glycosylation Lysosomal storage diseases MALDI-TOF-MS Permethylation
Mass spectrometry (MS) based glycomics techniques are broadly used to analyze free oliogsaccharides, glycosaminoglycans as well as the glycan portions of glycoproteins, proteoglycans and glycolipids. A wide range of MS equipments are available for glycoconjugate analysis. Both matrix-assisted laser desorption-ionization (MALDI) and electrospray ionization (ESI) are commonly applied. MS may be used as a stand-alone technique, or coupled online to separation methods such as HPLC [1, 2, 3, 4] and capillary electrophoresis (CE) [5, 6, 7]. Carbohydrate and glycoconjugate analysis by MALDI-MS has been comprehensively reviewed by Harvey [8, 9]. Other useful review articles, which cover a range of analytical techniques including tandem MS (MS/MS) of glycoconjugates have appeared in recent years [8, 9, 10, 11, 12, 13, 14, 15].
This review aims at giving a concise overview of MS based glycomics technology, together with selected applications in clinical research.
Analysis of free glycans
Protein-linked N-glycans and O-glycans are typically released by enzymatic and chemical methods, respectively . Also glycosaminoglycans are generally degraded by chemical or enzymatic means for subsequent analysis [5, 17]. Analysis of released (or “free”) glycans may be achieved by a variety of techniques such as mass spectrometry, HPLC of reductively aminated glycans employing fluorescence or UV detection and capillary gel electrophoresis with laser-induced fluorescence detection (CGE-LIF) of labeled glycans [16, 18, 19, 20, 21]. MS is particularly advantageous for analyzing very complex glycan mixtures containing unusual oligosaccharide structures for which the standardized migration positions in HPLC or CGE-LIF have not yet been determined. Importantly, the mass of the analyzed glycan – when determined with sufficient accuracy or accompanied by a tandem MS experiment – will directly provide information on the glycan composition in terms of hexoses, N-acetylhexosamines, deoxyhexoses, etc. By contrast, this direct link between the observed glycan species and its molecular composition is not inherently present for HPLC and CGE-LIF experiments and additional efforts are required such as the use of glycan standards or exoglycosidase treatments for the determination of terminal monosaccharides [22, 23]. On the other hand, separation-based methods for glycan analysis will often resolve structural isomers such as the 6-arm and 3-arm isomers of monogalactosylated biantennary glycans , while their distinction is not easily achieved by MS and requires additional efforts such as tandem MS analysis .
Therefore, while very complex pools of oligosaccharides can be analyzed by MS(/MS) without separation , many researchers choose to perform glycan analysis by LC-MS [1, 2, 3] or - less frequently - CE-MS coupling [5, 6, 7]. (Normalized) retention and migration times, precursor masses and fragmentation spectra may then be used for structural elucidation as in the case of O-glycan alditol analysis by porous graphitized carbon (PGC) HPLC coupled online to MS [27, 28, 29]. PGC-HPLC appears to have a particularly high power in separating oligosaccharide structural isomers, which makes this method very useful for in-depth structural analysis of complex oligosaccharide mixtures . Another popular separation technique hyphenated with MS for oligosaccharide analysis is HILIC, which likewise features isomer separation [24, 30, 31]. High-performance anion-exchange chromatography (HPAEC) coupled with online-desalting and online-ESI-MS is another approach which is particularly useful for the analysis of underivatized oligosaccharides .
Derivatization is often useful to support mass spectrometric detection and identification of carbohydrates . For example, oligosaccharides may be reduced to alditols resulting in a 2 Da mass tag on the innermost monosaccharide which facilitates fragment assignment in tandem MS. Analysis of O-glycan alditols obtained by reductive beta-elimination may be achieved by porous graphitized carbon (PGC) HPLC coupled via online, negative-mode electrospray ionization to ion trap-tandem mass spectrometry (MS/MS) [27, 28]. An online database has been made available by the UniCarb-DB partners allowing structural assignment of O-glycan alditols on the basis of MS and MS/MS spectra in addition to retention times (http://www.unicarb-db.com/). Similarly, N-glycans may be structurally assigned on the basis of mass and retention time in PGC-ESI-MS. This approach has been introduced by Altmann and coworkers .
Within the range of mass spectrometric techniques, negative-mode MS of glycans has recently obtained increased attention, both for MALDI and ESI ionization [33, 34]. There are several attractive features of analyzing glycans in negative-ion mode. Negative-mode ionization is particularly effective for acidic glycan structures. In this respect, labeling of glycans at the reducing end with an acidic tag such as 2-aminobenzoic acid (anthranilic acid; AA) is advantageous, as it confers acidic properties to all glycans including neutral species, thereby allowing the efficient detection of both sialylated and non-sialylated AA-labeled oligosaccharides in negative-mode MALDI-time of flight (TOF)-MS . In addition, negative-mode MS/MS of oligosaccharides has attractive features, for example that the glycosidic linkages of fucose are rather stable, in contrast to their labile behavior in positive-ion mode . Harvey has described several diagnostic ions, which are observed in negative-ion mode MS/MS of N-glycans and allow the elucidation of antenna compositions as well as the differentiation between the 6-branch and 3-branch of the glycan [36, 37].
Analysis of permethylated glycans in combination with ESI-ion trap-MS is particularly attractive. When this approach is combined with multistage fragmentation of permethylated glycans, the combination of various characteristic fragmentation spectra of sub-structures of the precursor oligosaccharides allows the unambiguous structural assignment of large oligosaccharide structures as impressively demonstrated by Reinhold and coworkers [25, 26].
Internal standards for MS may be obtained by isotope labeling during the derivatization step. For example reductive amination or permethylation using deuterated or C13-labeled versions of the tag / chemicals have been shown to be advantageous for oligosaccharide quantification and the detailed comparison of glycan profiles . It has to be noted, however, that most of these isotope labeling strategies have not yet been applied to clinical glycomics research questions.
Analysis of glycopeptides
In addition to the analysis of released glycans studying protein glycosylation at the level of glycopeptides is rapidly gaining importance [40, 41, 42, 43, 44]. The peptide portion may be seen as a tag, which potentially allows the assignment of the glycan to a specific N- or O-glycosylation site on a specific protein. However, this approach is complicated by several obstacles. First, proteolytic cleavage is often hindered in highly glycosylated proteins, resulting in very large, highly and heterogeneously glycosylated peptide moieties, which are hardly accessible for MS analysis . Second, a variety of glycans are generally found attached to one specific glycosylation site (microheterogeneity of glycosylation), and different N-glycosylation sites on one protein often have different glycan patterns. Therefore, glycopeptides generally occur substoichiometrically, making them difficult to analyze by MS in the presence of a majority of non-glycosylated peptides. Various enrichment techniques including lectin affinity chromatography are available to purify glycopeptides for MS analysis [3, 46]. A very promising technique for enriching N-glycopeptides is hydrophilic interaction liquid chromatography-solid phase extraction (HILIC-SPE), which may be performed using silica-based or carbohydrate-based stationary phases [30, 31, 47, 48]. Third, depending on the size of the glycan moiety and the chosen MS/MS approach, it is often hard to obtain peptide sequence information, which is in most cases needed for unambiguous assignment of the glycan to a specific protein . Popular approaches are electron capture dissociation (ECD) and electron transfer dissociation (ETD) of glycopeptides as well as various types of (multistage) CID [4, 43, 44]. In ECD and ETD the glycan portion is generally stable, and peptide backbone cleavages tend to provide (some) peptide sequence information . Single stage low-energy CID (as occurring on an ion trap) is generally characterized by fragmentation of glycosidic bonds, and peptide backbone cleavages are usually minor, if detectable at all. Fragmentation of the peptide portion may be achieved by performing ion trap-multistage MS/MS, and has been successfully applied in various cases for the identification of glycosylated proteins and glycosylation sites [41, 43]. Alternatively, fragmentation of glycopeptides at elevated energies in MALDI-TOF/TOF-MS and MALDI- or ESI-quadrupole-TOF-MS has been reported to provide peptide sequence information next to information on glycan composition and structure .
Glycopeptide analysis is almost exclusively performed on protonated species in positive-ion mode. It has been observed that under these conditions glycan moieties may undergo rearrangements in MS/MS, of which prominent examples are the migration of fucoses between N-glycan antennae, or from the core to outer portions of the N-glycan structure [49, 50]. These rearrangements may not only be observed for N-glycopeptides, but also for O-glycopeptides. Obviously, awareness of these processes is required for avoiding misinterpretation of glycopeptide fragmentation spectra.
The major bottle-neck in glycopeptidomics-based proteomics of complex samples is data analysis. Software supporting data analysis is desperately needed, and several promising approaches have recently been reported ( and references cited therein). Yet additional, concerted efforts in developing data analysis tools are needed to boost the impact of this analytical approach.
Analyzing protein glycosylation at the glycopeptide level may be categorized as part of a bottom-up glycoproteomics approach. The analysis of the intact mass of glycoproteins, together with a bottom-up analysis, often allows the detailed structural assignment of protein species such as monoclonal antibodies [51, 52]. In addition, top-down glycoproteomics, i.e., the MS analysis of intact glycoproteins followed by their tandem MS analysis for the characterization of posttranslational modifications including glycosylation, has high potential but needs to be further developed [53, 54].
Analysis of glycolipids
Next to glycoproteins, glycolipids play an important role in cellular interaction and cellular differentiation . The majority of glycolipids observed in humans have a ceramide portion (sphingoid base carrying an amide-linked fatty acid) and are, therefore, categorized as glycosphingolipids. They occur mainly in the outer leaflet of the plasma membrane and also in the inner membranes. Glycosphingolipids show a marked tissue- and cell type-specific expression pattern [59, 60]. This is for example reflected by the fact that various human cluster of differentiation (CD) markers, which are differentially expressed between leukocytes, are glycolipids such as CD60a (GD3; Neu5Ac(α2-8)Neu5Ac(α2-3)Gal(β1-4)Glc(β1-1)ceramide), CD60b (9-O-acetyl GD3), CD60c (7-O-acetyl GD3), CD77 (Gb3; globotriaosylceramide; Gal(α1-4)Gal(β1-4)Glc(β1-1)ceramide)) (http://www.hcdm.org/).
Glycosphingolipids are generally subjected to MS in intact form . Recently, chip-based approaches for glycosphingolipid analysis have been reviewed [59, 62]. Importantly, there is also technology available to combine the most commonly applied separation technique in lipid, as well as glycolipid analysis, high-performance thin layer chromatography (HPTLC), with MS. For example, glycosphingolipids separated by HPTLC can be probed by overlay detection using carbohydrate-binding proteins such as lectins, bacterial toxins, and antibodies, followed by the MS analysis of positive bands, either directly from the HPTLC plate or after lipid extraction, as reviewed recently by Meisen et al. . Alternatively, glycolipids may be analyzed by HILIC-nano-LC-MS  using slightly adjusted solvent conditions when compared to methods used for glycan and glycopeptide separation . Using this approach, it has been demonstrated that α2-3-sialylated and α2-6-sialylated isomers of lactoneotetraosylceramides can be baseline separated from complex mixtures and characterized individually by tandem MS .
Clinical glycomics applications
The importance of the above described techniques is illustrated by their application in clinical studies. Glycosylation changes play important roles in the cellular mechanisms of health and disease , and glycans have a great potential as biomarkers for different types of cancer [66, 67]. There is a vast range of studies of human glycobiology in healthy and diseased people employing MS, and some selected examples will be presented demonstrating the potential of mass spectrometric approaches for clinical glycomics.
MS has been shown to be useful to type congenital disorders of glycosylation. Guillard et al. established an approach that relies on N-glycan release from total plasma, permethylation, and MALDI-ion trap-MS measurement , allowing in-depth analysis of glycans by tandem MS (Fig. 1). This approach was applied to determine plasma N-glycan profiles of congenital disorder of glycosylation (CDG) type II patients, as well as controls . A total of 38 peaks were assigned in terms of molecular composition, and changes in the N-glycan profiles were found to be useful to distinguish between the patient groups. The authors also successfully addressed the challenge of differentiating CDG type II diseases from other diseases with secondary causes of underglycosylation. This method is now being successfully applied in clinical research, including research on patients with defects in 1-4-galactosyltransferase I (B4GAT1), which leads to the expression of largely truncated glycans on plasma proteins .
The analysis of protein degradation products from bio fluids has repeatedly led to the identification of glycopeptides, thereby shedding new light on protein glycosylation. For example, apolipoprotein CIII-derived O-glycopeptides were found in the urine of Schistosoma mansoni infected individuals . Remarkably, these glycopeptides did not exhibit the sialylated T-antigen glycan structures found on apolipoprotein CIII from human serum, but instead carried larger O-glycan structures with a high degree of sialylation. In another study, an O-glycosylated peptide stemming from the C-terminus of the fibrinogen α-chain was found to be increased in the urine during urinary tract infection with Escherichia coli . Recently, O-glycosylated amyloid β-peptides representing a potential disease biomarker were characterized from cerebrospinal fluid of Alzheimer patients using both CID and ECD fragmentation .
Cancer glycomics biomarker discovery has recently been reviewed [66, 67], and MS is becoming an important research tool in this field. Novotny and Mechref with coworkers chose to analyze serum N-glycan profiles after permethylation using MALDI-TOF-MS. Using this approach, they demonstrated vastly different N-glycan profiles in metastatic prostate cancer as compared to healthy tissue . A variety of mainly fucosylated, complex-type N-glycans were found to be increased in cancer vs. control. In another study the relative abundances of a set of 8 complex-type serum N-glycans were found to be indicative of the progression of breast cancer . Other studies have focused on the glycosylation analysis of specific acute-phase proteins. For example, MALDI-MS of 2-aminobenzoic acid-labeled N-glycans showed that the N-glycan fucosylation of α-1-acid glycoprotein is significantly increased in ovarian cancer . Notably, most of the reported cancer glycomics studies focus on the analysis of the total plasma or serum N-glycome or certain acute-phase proteins [66, 67]. While these approaches are promising, an increase in sensitivity and specificity may be expected when tumor-derived antigens isolated from body fluids are characterized together with their specific glycosylation profiles.
In the coming years the field of mass spectrometric analysis of protein glycosylation is expected to show an increase in measurement sensitivity and precision as well as sample throughput, allowing the in-depth analysis of biological systems, with data analysis being a major challenge .
Despite the limitations mentioned in this review, glycoproteomics approaches focusing on the glycopeptide level will gain in popularity. Mass spectrometric analyses of (tryptic) glycopeptides are rewarding as they have intrinsically the potential of assigning specific glycan structures to a specific site on a specific protein. This information is often of utmost importance, as the primary role of glycans is modulating the properties (such as function, activity, stability, targeting) of their carrier proteins. Notably, approaches based on the analysis of released N-glycans often fail to provide this information on protein- and site-specificity and are, therefore, of limited value.
Another important aspect is the analysis of intact glycoproteins by MS, which is expected to gain importance in the next years. On the one hand, MS analysis of intact glycoproteins allows the integration of the information obtained at the glycopeptide and released glycan level to obtain an overall view of protein glycosylation . On the other hand intact protein analysis may be accompanied by top-down tandem mass spectrometric analysis for characterization of posttranslational modifications, including glycosylation .
It is anticipated that the concept of a specific protein having specific functions will undergo refinement, and specific proteins will be perceived as an assembly of isoforms (including glycoforms) that are caused by a variety of posttranslational modifications including glycosylation. Defining such “protein species” is of utmost importance for functional proteomics supporting systems biology  and will require bioinformatics tools and databases to facilitate posttranslational modification analysis at the glycopeptide level .
The author thanks Dr. Gerhild Zauner, Maurice H.J. Selman and Albert Bondt for critically reading the manuscript. This work has been supported by funding from the European Union’s Seventh Framework Programme (FP7-Health-F5-2011) under grant agreement n°278535 (HighGlycan).
This article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.