Abstract
Oligosaccharyltransferase (OST) catalyzes oligosaccharide transfer to the Asn residue in the N-glycosylation sequon, Asn-X-Ser/Thr, where Pro is strictly excluded at position X. Considering the unique structural properties of proline, this exclusion may not be surprising, but the structural basis for the rejection of Pro residues should be explained explicitly. Here we determined the crystal structure of an archaeal OST in a complex with a sequon-containing peptide and dolichol-phosphate at 2.7 Å resolution. The sequon part in the peptide forms two inter-chain hydrogen bonds with a conserved amino acid motif, TIXE. We confirmed the essential role of the TIXE motif and the adjacent regions by extensive alanine scanning of external loop 5. A Ramachandran plot revealed that the ring structure of the Pro side chain is incompatible with the ϕ backbone dihedral angle around −150° in the rigid sequon-TIXE structure. The present structure thus provides the structural basis for the exclusion of Pro residues from the N-glycosylation sequon.
Introduction
Asparagine-linked (N-linked) glycosylation is one of the most ubiquitous post-translational modifications of proteins conserved in all domains of life. All eukaryotic and archaeal organisms have N-glycosylation systems1,2,3. N-glycans are oligosaccharides attached to Asn residues, and they affect various physicochemical properties of glycoproteins4,5. In their biological roles, the N-glycans function as tags recognized by other proteins and are involved in protein quality control and sorting inside cells, as well as cell–cell and host–pathogen interactions on cell surfaces5,6. N-Glycosylation is essential in Eukarya and the modern phyla of Archaea7. In contrast, in the ancient archaeal phylum Euryarchaeota, the N-glycosylation is dispensable for growth in laboratory conditions but presumably requisite for survival in harsh environments8. N-Glycosylation also occurs in the eubacterial genera Campylobacter3 and Helicobacter9, in which the N-glycosylation is non-essential for growth but important for virulence by promoting the adhesion of these human enteropathogenic bacteria to host cells10. Interestingly, some deep-sea-dwelling eubacteria also have the N-glycosylation system11. This fact implies that the horizontal gene transfer occurred multiple times independently from common ancestors of Eukarya and Archaea. Consequently, the eubacterial N-glycosylation systems have hybrid features of their eukaryotic and archaeal counterparts2.
The oligosaccharide transfer occurs on the side-chain carboxamide group of the Asn residue in the N-glycosylation sequon, Asn-X-Ser/Thr, in polypeptide chains, where X denotes any amino acid residue except for proline12. Hereafter, the residue position is defined as X-2-X-1-Asn0-X+1-Ser/Thr+2-X+3-X+4-X+5. Eubacteria use an extended 5-residue sequon13, Asp/Glu-2-X-Asn-X-Ser/Thr, although the presence of an acidic residue at position -2 is not absolutely required14,15. The amino acid bias in the middle position (position +1) of the N-glycosylation sequon is an interesting phenomenon. Statistical analyses of many glycosylated sites in glycoproteins revealed little preference for a particular amino acid at position X, except for the strict Pro exclusion in eukaryotic16,17, archaeal18, and eubacterial glycoproteins19. The N-oligosaccharyl transfer is catalyzed by an integral membrane enzyme, oligosaccharyltransferase (OST)20,21. The OST enzyme determines the non-preference and the exclusion of amino acid residues at position +1 of the glycosylated sequons22,23. To clarify the structural basis of the sequon selection rules, we need the three-dimensional structures of the OST enzymes in complexes with the two substrates, an oligosaccharide donor and an oligosaccharide acceptor. In contrast to the relatively invariable properties of the amino acid sequences of the acceptor sequon, the oligosaccharide donor is highly diverse among the three domains of life. The oligosaccharide donor has the general structure of lipid-phosphate(s)-oligosaccharide, and is thus referred to as a lipid-linked oligosaccharide (LLO). The lipid part is dolichol in Eukarya and Archaea, and polyprenol in Eubacteria24,25. Polyprenol is a long chain isoprenoid alcohol with the general formula, [α-terminus] HO-(CH2-CH=C(CH3)-CH2)n-H [ω-terminus], and dolichol is a special type of polyprenol that contains a saturated isoprene unit at the α-terminus. 
A diphosphate-type LLO is commonly used as the oligosaccharide donor for the OST catalyzed transfer reactions in the three domains of life, but a subset of Archaea, Euryarchaeota, exceptionally uses a monophosphate-type LLO26,27,28. The chemical structure of the oligosaccharide part is also diverse. Most eukaryotes use a well-conserved canonical 14-residue oligosaccharide structure, Glc3Man9GlcNAc2, and lower eukaryotes use a shorter version of the 14-residue structure, lacking the terminal glucose and/or mannose residues29. In contrast, Archaea and Eubacteria use completely different sets of oligosaccharide structures from species to species, with respect to the number, composition, and branching pattern of the monosaccharides8. Considering the substantial divergence of the oligosaccharide donor structures, comparisons between distantly related OST enzymes can capture the essence of substrate recognition and enzyme catalysis.
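The sequon rule stated above (Asn at position 0, any residue except Pro at position +1, Ser or Thr at position +2) can be written as a short pattern search. The following is a minimal sketch for illustration only, not a tool used in this study; the function name `find_sequons` is hypothetical, and the first test peptide is the crystallization peptide from this work with Asn restored at position 0 as an assumption for the example.

```python
import re

# Asn at position 0, any residue except Pro at +1, Ser or Thr at +2.
# A lookahead is used so that overlapping candidates (e.g. "NNTT") are all reported.
SEQUON = re.compile(r"(?=(N[^P][ST]))")


def find_sequons(sequence):
    """Return (1-based Asn position, tripeptide) for each candidate sequon.

    Only the three-residue eukaryotic/archaeal rule is checked; the extended
    eubacterial sequon with an acidic residue at -2 is not considered here.
    """
    return [(m.start() + 1, m.group(1)) for m in SEQUON.finditer(sequence.upper())]


print(find_sequons("APYNVTASCR"))  # [(4, 'NVT')]
print(find_sequons("APYNPTASCR"))  # [] because Pro occupies position +1
```

A match only marks a candidate site; whether the site is actually glycosylated depends on the enzyme and the structural context discussed in this paper.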
The OST enzymes are hetero-oligomeric protein complexes in most eukaryotes, and single-subunit proteins in lower eukaryotes20,21. The archaeal and eubacterial OSTs are also single-subunit enzymes. The OST enzymes are located in the endoplasmic reticulum membranes of eukaryotic cells and the plasma membranes of archaeal and eubacterial cells. The crystal structures of the eubacterium Campylobacter lari OST (alias ClPglB) were reported in complexes with an acceptor peptide (PDB: 3RCE)30, an acceptor peptide plus a non-hydrolyzable LLO analog (PDB: 5OGL)31, and an inhibitory peptide plus a reactive LLO analog (PDB: 6GXC)32. The crystal structures of the euryarchaeon Archaeoglobus fulgidus OST (alias AfAglB) were determined in complexes with a sulfate ion, which mimics the phosphate group of LLO (PDB: 3WAJ)33, and with an acceptor peptide (PDB: 5GMY)23. These binary and ternary complex structures provided many valuable insights into the oligosaccharyl transfer reaction. Recently, the cryo-EM single-particle structures of yeast OST and two human OST paralogs were reported34,35,36. The catalytic subunits, Stt3, in the multi-subunit OST enzymes have essentially identical structures to those of ClPglB and AfAglB21. The two human OST structures contain an endogenous dolichol-phosphate, which was co-purified during purification, and one of them also contains a model for an acceptor peptide of unknown origin. Unfortunately, the resolutions (3.3–3.5 Å) of the cryo-EM structures are not sufficient to discuss the details of the sequon recognition and the catalytic mechanism.
Several conserved short amino acid motifs have been identified in the diverse Stt3, AglB, and PglB protein sequences (identity <20%). The C-terminal globular domain contains the WWDYG and DK/MI motifs37, where the slash delimiter indicates domain-specific conservation. The DK motif is found in Eukarya and a subset of Archaea, whereas the MI motif is present in the remaining Archaea and Eubacteria. The WWDYG and DK/MI motifs form a binding site for the Ser/Thr residue in the sequon30,33. The DGGK motif is conserved among eubacterial PglBs and euryarchaeal AglBs and presumed to be involved in LLO binding38. The equivalent in the eukaryotic Stt3 and the AglBs from the ASGARD and TACK superphyla of Archaea is a double sequon motif, DNXTZNX[T/S], where X and Z can be any residue20. The N-glycan attached to the double sequon motif is involved in the interactions with other subunits in the multi-subunit OST complexes34,35,36. The N-terminal transmembrane (TM) region of the Stt3/AglB/PglB proteins consists of 13 TM helices and contains two DXD motifs on the first and second external loops (EL1 and EL2) and a TIXE/SVSE motif on the fifth external loop (EL5). The TIXE motif is found in Archaea and Eubacteria, whereas the SVSE motif is present in Eukarya. The mutually independent conformational changes of the N-terminal and C-terminal halves of the EL5 loop are considered to be essential for the binding of the LLO and sequon, respectively39,40. In response to the conformational changes of the EL5 loop, the catalytic structure dynamically forms by integrating the Glu residue in the TIXE/SVSE motif. No appropriate functional groups were found around the side-chain carboxamide group of the acceptor Asn, and thus a hypothetical twisted amide mechanism was proposed for the activation of the inert amide nitrogen30. 
In this mechanism, the N-C bond in the carboxamide group is transiently twisted through bipartite interactions with the two carboxy groups of the conserved acidic residues in the first DXD and the TIXE/SVSE motifs. The twisting abolishes the conjugation of the lone-pair electrons on the nitrogen atom with the carbonyl group, and thus increases the nucleophilic reactivity of the amide nitrogen41.
Here, we determined the crystal structure of the ternary complex of the A. fulgidus AglB protein (AfAglB) with a sequon peptide and a dolichol-phosphate molecule. The catalytic structure around the bound metal ion is almost the same as that of the binary AfAglB-peptide complexes determined previously23. Our analysis of the sequon recognition revealed the special roles of the TIXE motif in the EL5 loop. Although the conservation of the TIXE/SVSE motif was previously reported, its precise role in the oligosaccharyl transfer reaction has not been identified. We now report the formation of the inter-chain hydrogen bonds between the sequon and the TIXE motif in the AfAglB protein. The requirement of a special ϕ dihedral angle in the rigid sequon-TIXE structure clearly explains the structural basis for the strict exclusion of Pro at the middle position of the N-glycosylation sequon.
Results
Crystallization and structure determination
We used the lipidic cubic phase (LCP) method to obtain crystals of AfAglB in a complex with a donor LLO molecule and an acceptor peptide. Native LLO was isolated from cultured A. fulgidus cells. The AfLLO preparation that produced diffraction-quality co-crystals was a crude mixture of LLOs with variable numbers of monosaccharides (6 and 7), isoprene units (C55 and C60), saturated isoprene units (3, 4, and 5), and the sulfate group (0 and 1)27. The peptide used for crystallization was custom synthesized. To compensate for the weak affinity, the sequon peptide was tethered to the AfAglB protein via a disulfide bond, to shift the association-dissociation equilibrium to the bound state23. A cysteine residue was introduced as a sole tethering point (G617C) in the AfAglB protein. To stop the transfer reaction, the Asn residue in the sequon was replaced by an L-2,4-diaminobutyrate (Dab) residue. The replacement of the amide group by an amino group is known to inhibit the oligosaccharyl transfer reaction in a competitive manner42. The peptide sequence is TAMRA-APY(Dab)VTASCR-OH, in which (Dab)VT is the non-reactive sequon and the cysteine at position +5 is the tethering residue. The N-terminal α-amino group is modified with a fluorescent carboxytetramethylrhodamine (TAMRA) dye for color detection. We chose 7.7 MAG as the host lipid because the cubic mesophase it forms has a larger water channel and reduced interfacial curvature, which is suitable for the crystallization of membrane proteins with a large soluble domain43. Microcrystals were grown in a lipidic sponge mesophase under buffer conditions of 19–22% PEG400, 0.1 M Na-citrate, pH 6.0, and 50 mM NaCl. The positions and shapes of the crystals were easily identified by the magenta color of the TAMRA dye (Supplementary Fig. 1). Diffraction data were collected from 2529 microcrystals at the microfocus beamline BL32XU, SPring-8, Japan. 
A small-wedge data set was collected from each crystal and merged to complete the data set. The structure was determined by the molecular replacement method to a resolution of 2.7 Å (Table 1).
Even though the tethered peptide contained the non-reactive Dab residue at position 0, the LLO binding site was occupied by a dolichol-phosphate, instead of an intact LLO, suggesting that the LLO was hydrolyzed during the prolonged crystallization period. An omit electron density map revealed the clear densities for the dolichol-phosphate, except for the isoprene units in the middle part (Fig. 1d, orange mesh). Another omit electron density map also revealed the clear density for the sequon part in the tethered peptide (Fig. 1c, red mesh). Consequently, the models were reliably built for the sequon segment, A-3PYDabVT+2, and the dolichol(C60)-phosphate. The construction of the models of the A+3SC+5 linker segment of the tethered peptide and the central part of the dolichol chain was guided by the chemical structures of the amino acids and isoprene unit. Three water molecules around the metal ion were visible in the difference map and modeled (Fig. 1c, blue mesh).
Overall structure
AfAglB comprises the N-terminal TM region (residues 6–498) and the C-terminal globular domain (residues 499–868). The flexible EL5 loop (residues 331–374) in the TM region is fully visible and constitutes integral parts of the binding sites for the substrate peptide and LLO (Fig. 1b). The comparison with the previous binary AfAglB-peptide complex23 provides a dynamic view of the EL5 loop. The N-terminal half (residues 335-350) of the EL5 loop is disordered when the LLO binding site is vacant and becomes ordered to form an α-helix in the presence of dolichol-phosphate. Except for the conformational change of EL5, the overall structure of the AfAglB protein in the ternary complex is almost identical to that of the binary structure (rmsd 0.60 Å for 703 Cα atoms).
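For readers unfamiliar with the rmsd figure quoted above, the following minimal sketch shows how a root-mean-square deviation over paired Cα positions is computed. It assumes the two structures have already been superimposed and that atoms are paired in the same order; the function name `rmsd` and the toy coordinates are illustrative and are not taken from the deposited models or from the software used in this study.

```python
import math


def rmsd(coords_a, coords_b):
    """Root-mean-square deviation between two equally long lists of (x, y, z) points.

    Assumes the structures are already superimposed and atoms are paired in order;
    no fitting (e.g. a least-squares superposition) is performed here.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("need two non-empty coordinate lists of equal length")
    total = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        total += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(total / len(coords_a))


# Toy example: two atoms, each displaced by 0.6 Å along z.
print(round(rmsd([(0, 0, 0), (1, 0, 0)], [(0, 0, 0.6), (1, 0, 0.6)]), 2))  # 0.6
```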
Catalytic structure around the metal ion
There are three water molecules around the metal ion, presumably Mn2+, at distances of 1.9–2.5 Å (Fig. 2). The metal ion also interacts with the protein via coordination to the carboxylate group of Asp47 in the first DXD motif (Gly-Asn-Asp), and the carboxylate group of Asp161 and the imidazole group of His163 in the second DXD motif (Asp-His-His) (Fig. 2b). These six ligands have a regular octahedral arrangement around the metal ion. The carboxylate group of Glu360 in the TIXE motif (Thr-Ile-Ala-Glu) indirectly participates in the coordination to the metal ion through two of the three water molecules. The phosphate group of the dolichol-phosphate also interacts with the metal ion indirectly, through another two of the three water molecules.
Mutagenesis study of the EL5 loop
We conducted alanine-scanning mutagenesis of the 44 continuous residues in the EL5 loop to identify the important residues for the oligosaccharyl transfer reaction (Fig. 3, Supplementary Figs. 2, 7, and 8, Supplementary Data 1). The three Ala residues in the 44 residues were replaced by Gly. In the N-terminal half of the EL5 loop, no critical residues were identified, whereas the C-terminal half showed sharp decreases of the activity in the segment Leu356-Phe365. This is a convincing result because this segment contains the conserved TIXE motif, Thr357-Ile-Ala-Glu360.
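The substitution rule used in the scan (every residue to Ala, except that existing Ala residues are changed to Gly) can be sketched as below. This is an illustrative helper only; the function name `alanine_scan` is hypothetical, and the example uses just the four-residue TIXE motif, Thr357-Ile-Ala-Glu360, given in the text rather than the full 44-residue loop.

```python
def alanine_scan(segment, first_residue_number):
    """List point mutations for an alanine scan of a one-letter-code segment.

    Each residue is replaced by Ala; an existing Ala is replaced by Gly instead.
    """
    mutations = []
    for offset, residue in enumerate(segment.upper()):
        replacement = "G" if residue == "A" else "A"
        mutations.append(f"{residue}{first_residue_number + offset}{replacement}")
    return mutations


print(alanine_scan("TIAE", 357))  # ['T357A', 'I358A', 'A359G', 'E360A']
```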
Attempts to uncouple the LLO hydrolysis activity from the oligosaccharyl transfer activity
The present crystal structure contains a dolichol-phosphate molecule instead of an intact LLO molecule. The eukaryotic and eubacterial OST enzymes are known to have hydrolytic activity that releases a free N-glycan (FNG) from LLO in the absence of a peptide substrate containing a sequon44,45. We have recently identified special point mutations of the yeast OST enzyme that uncouple the two activities46. Each mutation of Lys586 and Met590 in the DK motif (Asp-XX-Lys-XXX-Met) to Ala in the catalytic Stt3 protein subunit suppressed the LLO hydrolysis activity but retained the oligosaccharyl transfer activity. Interestingly, the DK motif is distant from the catalytic center (Fig. 2a). This indicates that the LLO hydrolytic activity can be remotely manipulated by modulating the peptide binding mode. Based on the yeast OST, we introduced point mutations in the AfAglB protein, expecting to obtain suitable mutants for crystallization with an intact LLO. The tested point mutations include His81 and Arg426 (close to the phosphate site), Asp161 and His162 (close to the metal site), and Asp552, Gln571, and Lys618 (close to the peptide site). We measured the two enzymatic activities (Supplementary Figs. 3 and 9, Supplementary Data 3). Disappointingly, no mutations with the desired suppression of the LLO hydrolysis activity relative to the oligosaccharyl transfer activity were obtained.
Discussion
We determined the crystal structure of a ternary complex of the AfAglB protein with an acceptor peptide and dolichol-phosphate. At 2.7 Å, the resolution of the present structure is among the best of the available AfAglB and ClPglB crystal structures (2.7–3.5 Å). The acceptor peptide was tethered to the AfAglB protein via an engineered disulfide bond23. The amino acid residues around the cysteine at the tethering point have high B-factors, indicating that the tethering has minimal influence on the sequon conformation. Although we used a non-reactive peptide containing an asparagine analog at position 0, a dolichol-phosphate molecule was bound in the complex, suggesting the hydrolysis of LLO. This is not surprising, considering that the eukaryotic and eubacterial OST enzymes have LLO hydrolytic activity to release free N-oligosaccharides44,45. Here, we showed that an archaeal OST/AglB also has the LLO hydrolytic activity (Supplementary Figs. 3 and 9, Supplementary Data 3).
The detailed structural comparison across the different domains of life yields a unified view of the structure and function of the OST enzyme. The functional structures are formed by conserved amino acid residues, mainly residing in short amino acid motifs such as the two DXD, TIXE/SVSE, WWDYG, DGGK/DNXTZNX[T/S], and DK/MI motifs (Fig. 1a). The spatial arrangements of the conserved residues are strikingly similar between the distantly related AfAglB and ClPglB (Supplementary Fig. 4). The almost perfect superimposition suggests that the common substrate recognition and catalytic mechanisms were preserved through evolution.
The WWDYG and DK/MI motifs are involved in the formation of the Ser/Thr-binding pocket in the C-terminal globular domain, which explains the requirement of the hydroxy amino acid residues in the N-glycosylation sequon30,33. The sequon part, Dab-Val-Thr, plus the Ala residue at position +3 in the acceptor peptide, adopts an extended conformation and lies side-by-side in an antiparallel manner with the 4-residue TIXE motif, Thr-Ile-Ala-Glu, in the C-terminal half of the EL5 loop (Fig. 2c). Two inter-chain hydrogen bonds are formed between the carbonyl oxygen of Thr357 and the amide group of Ala+3, and between the carbonyl oxygen of Val+1 and the amide group of Ala359. The hydrogen donor-acceptor distances are 2.7 Å and 3.0 Å, which are typical for moderate-strength hydrogen bonds. Although identical structures also exist in the previous AfAglB-peptide structure and ClPglB-peptide-LLO structures (Supplementary Fig. 4), they were not mentioned explicitly at the time of publication23,31,32, probably due to insufficient resolutions (PDB: 5GMY and 6GXC) or poor focus (PDB: 5OGL). The present structure has revealed that the AfAglB protein recognizes the sequon sequences through not only the side-chain groups of Asn0 and Ser/Thr+2, but also the main-chain groups of the X+1 and X+3 residues by the TIXE motif (Supplementary Fig. 4). The essential role of the TIXE motif was confirmed by the alanine-scanning study of the EL5 loop (Fig. 3, Supplementary Data 1). No similar exhaustive mutation scanning experiments of the EL5 loop have been performed for other OST enzymes. Recognizing the structure and function of the TIXE motif is the key toward understanding the sequon recognition by the OST enzyme.
The exclusion of a Pro residue at position +1 in the N-glycosylation sequon is absolutely strict. No glycosylated Asn-Pro-Thr or Asn-Pro-Ser sites exist in the N-GlycositeAtlas database, an archive of more than 35,000 reviewed N-glycosylated sequences derived from human glycoproteins47. A significantly low level of glycosylation with Pro at position +3 has also been reported16,17. The unique structural features of Pro are easily presumed to be a possible cause of the exclusion. (1) The five-membered ring of Pro leads to a restricted ϕ dihedral angle around −75°. (2) Pro lacks a backbone amide hydrogen atom, which can be a donor for a hydrogen bond. (3) The X-Pro peptide bond tends to adopt a cis configuration. Consequently, proline destabilizes secondary structures and causes kinks in polypeptide chains.
The reason for the exclusion of Pro residues at positions +1 and +3 became obvious when the Ramachandran plots were generated (Fig. 4, Supplementary Data 2). All residues in the sequon sequence and the TIXE motif have ϕ values in the range from −60° to −120°, except for Val+1 in the sequon, which has a high ϕ value around −150°. A similar high ϕ value of Ala+1 in the eubacterial sequon sequence is also evident in the ClPglB structure. The rigid ring structure of the side chain does not allow Pro to adopt such a high ϕ value and thereby excludes Pro from position +1 in the N-glycosylation sequon. The similar tendency of high ϕ values of Ala+3 and PPN+3 (para-nitrophenylalanine) in the two structures also accounts for the very low frequency of proline at position +3. Because the residue at position +3 is located at the boundary region of the rigid sequon-TIXE structure, the exclusion of Pro is probably less strict at position +3 than at the middle of the sequon.
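The ϕ argument above can be made concrete with a small calculation. The sketch below is not part of the original analysis: it computes a backbone dihedral from four atomic positions (for ϕ of residue i these are C of residue i−1, then N, Cα, and C of residue i) and tests whether a given ϕ lies near the value of about −75° quoted above for Pro. The function names and the ±25° tolerance are illustrative assumptions, not values from this study.

```python
import math


def _sub(a, b):
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])


def _dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]


def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def dihedral(p0, p1, p2, p3):
    """Signed dihedral angle in degrees (range -180 to 180) defined by four points."""
    b0 = _sub(p0, p1)
    b1 = _sub(p2, p1)
    b2 = _sub(p3, p2)
    norm = math.sqrt(_dot(b1, b1))
    b1 = tuple(c / norm for c in b1)
    # Project the outer bonds onto the plane perpendicular to the central bond.
    v = _sub(b0, tuple(_dot(b0, b1) * c for c in b1))
    w = _sub(b2, tuple(_dot(b2, b1) * c for c in b1))
    return math.degrees(math.atan2(_dot(_cross(b1, v), w), _dot(v, w)))


def pro_compatible(phi, center=-75.0, tolerance=25.0):
    """True if phi is within `tolerance` degrees of `center` (tolerance is an assumption)."""
    diff = (phi - center + 180.0) % 360.0 - 180.0
    return abs(diff) <= tolerance


print(pro_compatible(-150.0))  # False: the value observed for Val+1
print(pro_compatible(-75.0))   # True
```

With real coordinates, `dihedral` would be called with the four backbone atoms read from a structure file; parsing is omitted here to keep the example self-contained.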
Although AfAglB and ClPglB catalyze the same reaction, AfAglB uses a mono-phosphate type LLO as the oligosaccharide donor, whereas ClPglB uses a di-phosphate type LLO. The lipid chains are also different. The lipid chain of AfLLO is dolichol, in which the α-terminal isoprene unit is saturated and contains a tetrahedral carbon with an S-configuration, while that of ClLLO is polyprenol, which retains a double bond in the α-isoprene unit. Thus, comparative studies would provide insights into substrate recognition and enzymatic catalysis. We examined the correspondence relation of the phosphate groups in the AfAglB and ClPglB structures. The phosphate group directly attached to the oligosaccharide moiety in ClLLO is the counterpart of the single phosphate group in AfLLO (Fig. 5a). This implies a common catalytic mechanism for the two types of LLOs.
Next, we focus on the binding mode of the lipid chains. In the present AfAglB structure, the ω-terminus of the dolichol is in the tunnel structure formed at the interface between the two TM helices, TM helix 8 and helix 9 (Fig. 5b). This tunnel structure implies that the LLO molecule enters the binding site through the gap between TM helix 8 and helix 9 (Fig. 5c). TM helix 9 must move in concert with the conformational change of the EL5 loop, to enlarge the gap upon LLO binding. A similar mechanism, referred to as LLO entry gate, was proposed for the Stt3 subunit in the yeast OST, although no LLO molecule was bound in the determined cryo-EM structure34. The arrangements of the TM helix 8 and helix 9 are similar to each other between yeast Stt3 and AfAglB, but distinct in ClPglB21. Consistently, for ClPglB, the LLO was assumed to thread into the binding site under the disordered EL5, while the TM helix 9 stayed in place31. The mutagenesis of the Tyr293 residue in the EL5 loop of ClPglB resulted in a 7000-fold reduction of the glycosylation turnover rate40. By contrast, the V349A mutation, which is located at the corresponding position in AfAglB, exhibited a moderate reduction in the oligosaccharyl transfer activity, but the effect was not as significant as in the ClPglB case (Fig. 3). The discrepancy in the mutational effects is attributable to the different binding modes of the dolichol/polyprenol chains.
Finally, we discuss the yet-to-be-defined activation mechanism of the inert amide nitrogen in the acceptor Asn. To date, no convincing experimental evidence to support the twisted amide mechanism has been reported32. The OST enzymes might adopt a supportive mechanism to compensate for the poor nucleophilicity of the carboxamide group of the acceptor Asn. Locher and coworkers proposed that the divalent metal ion might directly activate the glycosidic oxygen to generate a reactive electrophile32. Alternatively, the rigid frame structure composed of the sequon and the TIXE/SVSE motif could function as a guiding device to bring the nitrogen atom into the vicinity of the C1 carbon atom (Supplementary Fig. 5). In the transition state, the amide nitrogen and the C1 carbon are forced to move within a closer reaction distance by the restriction of concerted motions to one direction. As a result, the unreactive amide nitrogen attacks the C1 carbon of LLO to perform a nucleophilic substitution, by converting energy from the conformational coordinate to the chemical coordinate48.
In conclusion, the present structural and mutagenesis studies revealed the dual roles of the TIXE motif in the sequon recognition and catalytic mechanism. First, the TIXE motif participates in the formation of the rigid sequon-TIXE frame structure to recognize sequon sequences at the main-chain level (Fig. 2). The sequon-TIXE frame forces the amino acid residues at positions +1 and +3 to adopt high ϕ dihedral angles (Fig. 4), which are inaccessible to Pro. This is the structural basis for the exclusion of Pro residues at the middle position and the position after the Ser/Thr residue of the N-glycosylation sequon. As the second role, the rigid sequon-TIXE frame structure effectively restricts the motion of the acceptor Asn residue, which could compensate for the poor nucleophilicity of the carboxamide nitrogen (Supplementary Fig. 5).
Methods
Reagents
n-Dodecyl β-D-maltopyranoside (DDM) was purchased from Dojindo. 1-(7Z-tetradecenoyl)-rac-glycerol (7.7 MAG) was purchased from Avanti Polar Lipids. The borane-dimethylamine complex and 2-aminopyridine (2-AP) were purchased from FujiFilm Wako Pure Chemical. The peptides were custom synthesized by Hayashi Kasei and Toray Research Center. A TAMRA fluorescent group was introduced during the peptide synthesis to either the N-terminus or the side chain of a lysine residue.
Protein design for crystallization
The binding affinity of a peptide containing a sequon is not sufficient for co-crystallization. To compensate for the weak affinity, we tethered a sequon peptide to the AfAglB protein via a disulfide bond. The cross-linked AfAglB-peptide was previously shown to be active in glycopeptide production in an intramolecular manner, which justifies the use of the tethered complex for structural studies. In the previous study, a cysteine residue at position +4 was used for tethering to a cysteine residue introduced by the G617C mutation in AfAglB23. For unknown reasons, the cross-linking level did not exceed 90%. In this study, we used position +5, since the final cross-linking level reached almost 100% (Supplementary Figs. 6 and 10, Supplementary Data 4). The Asn residue in the sequon was changed to an L-2,4-diaminobutyrate (Dab) residue. The replacement of the amide group by an amino group inhibits the oligosaccharyl transfer reaction in a competitive manner42. A Dab-containing peptide was successfully used in the crystal structure determination of ClPglB with a reactive LLO analog32. The non-reactive peptide sequence we used is TAMRA-APY(Dab)VTASCR-OH, where (Dab)VT is the sequon and the cysteine at position +5 is the tethering residue. The N-terminal α-amino group is modified with a fluorescent carboxytetramethylrhodamine (TAMRA) dye for color detection, whereas the C-terminal carboxy group is unmodified. We confirmed the absence of oligosaccharyl transfer to the Dab-containing peptide, even in the state tethered to the AfAglB protein.
Protein expression and purification
The DNA encoding A. fulgidus AglB-L (AglB-L, the longest homolog of the three AglB proteins) was amplified from the genomic DNA, and subcloned into pET-52b(+) (Novagen) between the NcoI and SacI sites. The amino acid sequence (868 residues) is available through UniProtKB as UniProt entry O29867(AGLB3_ARCFU). An inverse PCR-based site-directed mutagenesis kit (SMK-101, TOYOBO) was used to generate single-point mutations of the AfAglB sequence. The transformed Escherichia coli C43 (DE3) cells (Lucigen) were grown at 310 K to an OD600 of 0.8–1.0 in Terrific Broth, supplemented with 100 mg L−1 ampicillin. Then, isopropyl β-D-1-thiogalactopyranoside was added at a final concentration of 0.5 mM. After 18-h induction at 291 K, the cells were harvested by centrifugation, and disrupted by sonication in 50 mM Tris·HCl, pH 8.0, 100 mM NaCl. The membrane fractions were collected by ultracentrifugation at 100,000 × g for 2 h, and solubilized in the same buffer containing 1% (w/v) DDM (Dojindo). After ultracentrifugation at 100,000 × g for 1 h, the recombinant protein in the supernatant was purified by affinity chromatography on nickel Sepharose High Performance resin (GE Healthcare) in the buffer containing 0.1% DDM. The AfAglB protein was expressed with a C-terminal His10-tag after a thrombin cleavage site. For crystallization, the His-tag was cleaved by thrombin after affinity chromatography. Consequently, the protein contains an extra 7-residue spacer sequence, ELALVPR, at the C-terminus. After the protein was concentrated with an Amicon Ultra-4 device (100 kDa NMWL) (Merck Millipore), gel filtration chromatography using Superdex 200 10/300 GL (GE Healthcare) was performed in 20 mM Tris-HCl, pH 8.0, 300 mM NaCl, and 0.05% (w/v) DDM. For disulfide bond tethering, purified AfAglB(G617C) was incubated with a peptide at pH 8.0, at a molar ratio of 1:10. 
After an overnight incubation at room temperature, the AfAglB-peptide complex was separated from the unreacted peptide monomers and the byproduct peptide dimers, and concentrated to 33 mg mL−1, by membrane filtration using an Amicon Ultra-4 (100 kDa NMWL) in 20 mM Tris-HCl, pH 7.5, 200 mM NaCl, and 0.05% DDM. For the oligosaccharyl transfer and FNG generation assays, the AfAglB mutant proteins were purified by nickel affinity chromatography only, and the His-tag at the C-terminus was not removed.
Lipid-linked oligosaccharide from A. fulgidus cells
Archaeoglobus fulgidus strain DSM 4304 (NBRC 100126) was obtained from the NITE Biological Resource Center (Tokyo). The cells were grown anaerobically without shaking, at 80 °C for 3 days. The culture medium was a simplified version of the predefined medium (NBRC Medium No. 1019)49. The cells were collected by centrifugation and disrupted in hypoosmotic buffer, consisting of 20 mM Tris-HCl, pH 7.5, and 2 mM MgCl2. The cell disruption solution was supplemented with benzonase (Novagen) and complete protease inhibitor mixture, EDTA-free (Roche). The cell suspension was centrifuged at 8500 × g for 15 min, and the supernatant was discarded. The pellet was suspended in the buffer and homogenized with a probe sonicator. After centrifugation for debris removal, the supernatant was ultracentrifuged at 100,000 × g for 2 h to collect the membrane fractions. The membrane pellets were stored at −80 °C until use. The membrane pellets were resuspended in 20 mM Tris-HCl, pH 7.5, containing 0.1 M NaCl. Aliquots were transferred into glass round-bottomed centrifuge tubes. Methanol and chloroform were added to yield a methanol:chloroform:membrane fraction ratio of 2:1:0.8. The mixture was centrifuged to collect the clarified supernatants. Chloroform and water were added to the supernatants to yield a chloroform:water:supernatant ratio of 1:1:3.8. The lower clear organic phase, containing the LLO, was collected and dried under a nitrogen gas stream in a draft chamber, redissolved in a small volume of CHCl3:CH3OH:H2O 10:10:3 (v/v/v) (CMW), and stored in glass containers at −20 °C. The resultant crude AfLLO is a mixture of LLOs with variable numbers of monosaccharides (6 and 7), isoprene units (C55 and C60), saturated isoprene units (3, 4, and 5), and a sulfate group (0 and 1)27.
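The two solvent ratios in the extraction above translate into volumes once one component is fixed. The following is a minimal arithmetic sketch, assuming the ratios are parts by volume and ignoring any loss on transfer or pelleting; the helper name `scale_by_ratio` and the 0.8 mL and 3.8 mL starting volumes are hypothetical and chosen only for illustration.

```python
def scale_by_ratio(ratio, known_component, known_volume):
    """Scale a parts-by-volume ratio so that one component has a known volume.

    `ratio` maps component names to parts; the returned dict maps the same
    names to volumes in the unit of `known_volume`.
    """
    parts = ratio[known_component]
    if parts <= 0:
        raise ValueError("reference component must have a positive ratio")
    factor = known_volume / parts
    return {name: value * factor for name, value in ratio.items()}


# Step 1: methanol:chloroform:membrane fraction = 2:1:0.8 (hypothetical 0.8 mL sample).
step1 = scale_by_ratio({"methanol": 2, "chloroform": 1, "membrane": 0.8}, "membrane", 0.8)

# Step 2: chloroform:water:supernatant = 1:1:3.8 (hypothetical 3.8 mL supernatant).
step2 = scale_by_ratio({"chloroform": 1, "water": 1, "supernatant": 3.8}, "supernatant", 3.8)

for name, volume in {**step1, **{f"added {k}": v for k, v in step2.items()}}.items():
    print(f"{name}: {volume:.2f} mL")
```

Note that 2 + 1 + 0.8 = 3.8, so if the whole first mixture is carried forward as supernatant, the second ratio corresponds to adding one further part each of chloroform and water.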
The crude AfLLO was further separated by anion exchange chromatography with a HiTrap DEAE FF column (GE Healthcare), which was equilibrated with CMW containing 3 mM acetic acid. The adsorbed materials were eluted with CMW containing 0.3 M ammonium acetate. The LLO concentrated by the two-phase partitioning was further separated by normal phase HPLC on a SUPELCO column (581513-u, Sigma-Aldrich), with a linear gradient from 100% solvent A CHCl3:CH3OH:NH4OH 800:195:5 (v/v/v) to 100% solvent B CHCl3:CH3OH:H2O:NH4OH 450:450:95:5 (v/v/v/v). The coincidence between the peaks in the TIC (total ion chromatogram) and the oligosaccharyl transfer activity indicated the high purity of the LLOs in the fractions from the HPLC column. The collected AfLLO fractions were dried in a SpeedVac concentrator, redissolved in a small volume of CMW, and stored in glass containers at −20 °C. The purified AfLLO has a fixed number of monosaccharides (6 or 7) and a sulfate group (1), but has variable numbers of isoprene units (C55 and C60) and saturated isoprene units (3, 4, and 5)27.
LCP crystallization and microcrystal harvesting
The purified cross-linked AfAglB-SS-peptide was reconstituted into a lipidic mesophase formed by the 7.7 MAG lipid. In the crystallization screening, we tested native LLO preparations of different purity grades isolated from cultured A. fulgidus cells. When we used purified LLO preparations, which were eluted as a single peak from a normal phase HPLC column, many microcrystals of fine appearance were obtained, but their X-ray diffraction quality was poor. We then switched to crude LLO preparations, prepared only by two-phase partitioning with a chloroform/methanol/water solvent system, and found that the microcrystals provided good diffraction data. Consequently, the AfLLO in the crystallization drops was a mixture of LLOs with variable numbers of monosaccharides (6 and 7), isoprene units (C55 and C60), saturated isoprene units (3, 4, and 5), and a sulfate group (0 and 1)27. The AfLLO dissolved in CMW was mixed with the melted 7.7 MAG lipid. The volume ratio was calculated to adjust the molar ratio of protein to AfLLO to 1:2–3 in the final cubic phase sample. The extra CMW was evaporated from the mixture fluid with a SpeedVac concentrator. The AfAglB-SS-peptide was mixed with the lipid mixture fluid, using a coupled syringe-mixing device at a protein solution/lipid mixture fluid ratio of 1:1 (w/w) at room temperature. Fifty-nanoliter (30-nL in a smaller scale) drops of the cubic phase sample were dispensed onto a lower film plate (Diffrax, Molecular Dimensions) and overlaid with 0.75 μL (0.45 μL) of precipitant solutions with a Crystal Gryphon LCP (Art Robbins Instruments). The plates were sealed with a thin upper film and stored at 20 °C. Initial crystallization screening was performed using MemGold, MemGold2, and MemStart+MemSys kits (Molecular Dimensions).
Crystals were obtained under the following precipitation conditions: 0.1 M Li2SO4, 0.1 M glycine, pH 9.5, 24–30% PEG400; 50 mM NaCl, 0.1 M Na-citrate, pH 6.0, 18–22% PEG400; 0.2 M Na-citrate, 0.1 M Tris-HCl, pH 8.8, 26–30% PEG400. Crystals appeared after 1 or 2 days and grew to full size in a week. The microcrystals were needle-shaped with a length greater than 100 μm and a width/thickness less than 5 μm. Individual compartments that contained microcrystals grown in 50 mM NaCl, 0.1 M Na-citrate, pH 6.0, 18–22% PEG400 were excised with a homebuilt punching device, “AOMUSHI”. The positions and shapes of the crystals were easily identified by the magenta color of the TAMRA dye attached to the peptide substrate (Supplementary Fig. 1). The 6-mm-diameter pieces of the bilaminar films were mounted on homemade acrylic cryoloops with grease as an adhesive, flash cooled in liquid nitrogen, placed in Unipucks (Crystal Positioning Systems, New York), and stored in liquid nitrogen.
X-ray data collection and data processing
The X-ray diffraction data were collected at beamlines BL32XU and BL44XU in SPring-8 (Hyogo, Japan). The final X-ray diffraction data were collected at beamline BL32XU using an EIGER X 9M detector (Dectris, Switzerland). A micro-focused beam of 10 μm × 15 μm (horizontal × vertical) with a wavelength of 1.0000 Å was used for both the raster scan and data collection, under a cryo stream operating at 100 K. The datasets in 10° wedges were collected from microcrystals with a frame rate of 50 Hz in a shutterless operation mode at a dose of 10 MGy. The automated data collection system ZOO, developed at SPring-850, was used for automatic data collection from 2529 microcrystals supported on 5 cryoloops. Datasets indexed with consistent unit cell parameters were subjected to a hierarchical cluster analysis based on unit-cell similarity. Finally, 483 datasets were integrated, merged, and scaled to 2.7 Å using the automatic data processing system KAMO51. KAMO is an open-source data-processing pipeline, which utilizes existing programs, including XDS52 and BLEND53.
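The idea of grouping datasets by unit-cell similarity before merging can be illustrated with a minimal pure-Python sketch. This is not the algorithm implemented in KAMO or BLEND: it uses single-linkage clustering cut at a fixed distance, and the cell parameters, the distance definition, and the threshold are hypothetical values chosen for illustration.

```python
import math
from itertools import combinations

def cell_distance(a, b):
    """Euclidean norm of the relative differences of the six cell parameters."""
    return math.sqrt(sum(((x - y) / ((x + y) / 2.0)) ** 2 for x, y in zip(a, b)))

def single_linkage_clusters(cells, threshold):
    """Cut a single-linkage hierarchy at `threshold`.

    Two datasets end up in the same cluster if they are connected by a
    chain of pairs whose distance is <= threshold (union-find).
    """
    parent = list(range(len(cells)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path compression
            i = parent[i]
        return i

    for i, j in combinations(range(len(cells)), 2):
        if cell_distance(cells[i], cells[j]) <= threshold:
            parent[find(i)] = find(j)

    clusters = {}
    for i in range(len(cells)):
        clusters.setdefault(find(i), []).append(i)
    # Largest cluster first; this is the one that would be merged.
    return sorted(clusters.values(), key=len, reverse=True)

# Hypothetical unit cells (a, b, c in Å; alpha, beta, gamma in degrees)
cells = [
    (60.0, 75.0, 120.0, 90.0, 90.0, 90.0),
    (60.3, 75.2, 119.6, 90.0, 90.0, 90.0),
    (59.8, 74.9, 120.4, 90.0, 90.0, 90.0),
    (66.0, 80.0, 131.0, 90.0, 90.0, 90.0),  # non-isomorphous outlier
]
print(single_linkage_clusters(cells, threshold=0.02))
```

In this toy input, the first three datasets form one cluster and the outlier is left on its own, which mirrors how only a subset of the collected wedges is carried forward to merging.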
Structure determination
The program phenix.phaser in PHENIX was used for the initial phase determination54, by molecular replacement with the revised version of the AfAglB-SS-Ac-RYNVTAC-NH2 structure (PDB: 5GMY) as the search model, after removing the EL5 loop and the tethered peptide. The asymmetric unit contained one protein molecule. Crystallographic refinement was performed with the program phenix.refine in PHENIX55 and the program REFMAC5 in CCP456. Further manual model rebuilding was performed with the program COOT57. The TLS refinement was done with three TLS groups, consisting of the N-terminal TM region (residues 6-494 of chain A), the C-terminal globular domain (residues 495-872 of chain A), and the bound peptide (residues 1-9 of chain B). The N-terminal 5 residues, MQNAE, and C-terminal 3 residues, VPR, were not modeled. In the previous structure determinations of the OST proteins, a metal ion bound to the catalytic site of the AglB/PglB/Stt3 proteins was assumed to be Mg2+, Mn2+, or Zn2+ contained in the crystallization solutions as an ingredient23,30,32,33. Mn2+ and Mg2+ have been suggested as the physiological cation of the OST enzymes58. In the present structure, Mn2+ was selected as an endogenous metal ion, because no metal ions were explicitly added in the crystallization solution. Fo−Fc maps suggested that Mg2+ had too few electrons, but Mn2+ fitted well. The extra nonprotein densities were modeled as the dolichol-phosphate (1 molecule), 7.7 MAG (4 molecules), PEG (di-hydroxyethyl ether, 5 molecules), and bound water (16 molecules). The chemical structure of the dolichol moiety used for modeling had twelve isoprene units (C60), of which three units were saturated on the ω-terminal side.
The program eLBOW in PHENIX was used to generate the CIF restraints of the dolichol-phosphate, 7.7 MAG, and PEG by using the SMILES strings, CC(C)CCCC(C)CCCC(C)CCCC(\C)=C/CCC(\C)=C/CCC(\C)=C/CCC(\C)=C/CCC(/C)=C\CC\C(C)=C\CC/C(C)=C\CC/C(C)=C\CC[C@H](C)CCO[P](O)(O)=O, CCCCCC\C=C/CCCCCC(=O)OC[C@H](O)CO, and OCCOCCO, respectively. Data collection and refinement statistics are summarized in Table 1.
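As a consistency check on the ligand definitions, the carbon and C=C counts can be read directly from the SMILES strings. The sketch below is a plain string count, not a SMILES parser; it is valid only because these particular strings contain no two-letter elements starting with C (such as Cl), no aromatic atoms, and write every carbonyl or phosphoryl oxygen as "=O". The function and variable names are hypothetical.

```python
# SMILES strings copied from the text (raw strings keep the backslashes).
DOLICHOL_P = (
    r"CC(C)CCCC(C)CCCC(C)CCCC(\C)=C/CCC(\C)=C/CCC(\C)=C/CCC(\C)=C/CCC(/C)=C\CC"
    r"\C(C)=C\CC/C(C)=C\CC/C(C)=C\CC[C@H](C)CCO[P](O)(O)=O"
)
MAG_7_7 = r"CCCCCC\C=C/CCCCCC(=O)OC[C@H](O)CO"

def count_carbons(smiles):
    """Count carbon atoms; assumes every 'C' character is a carbon atom."""
    return smiles.count("C")

def count_cc_double_bonds(smiles):
    """Count C=C bonds; assumes every other '=' is written as '=O'."""
    return smiles.count("=") - smiles.count("=O")

carbons = count_carbons(DOLICHOL_P)
isoprene_units = carbons // 5
saturated_units = isoprene_units - count_cc_double_bonds(DOLICHOL_P)
print(carbons, isoprene_units, saturated_units)
print(count_carbons(MAG_7_7), count_cc_double_bonds(MAG_7_7))
```

The dolichol string gives 60 carbons (twelve C5 units) and eight C=C bonds, leaving four saturated units: the three ω-terminal units described above plus the α-unit adjacent to the phosphate, which is also written as saturated in the string.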
The AfAglB crystal structure was embedded in a model DOPC lipid bilayer generated by the Membrane/Bilayer builder in the CHARMM-GUI program59. Figures were generated with PyMOL, version 2.3 (Schrödinger). Ramachandran plots were generated with the program RAMPAGE (CCP4 supported program)60.
Revision of the previous AfAglB-peptide complex structure
We previously reported a crystal structure of the AfAglB-peptide complex (PDB: 5GMY)23. The metal ion was assumed to be Mg2+, which was derived from the crystallization solution. During the present study, we noticed that the metal ion was misplaced at one of the bound waters. This could be a consequence of the same number of electrons in H2O and Mg2+. The corrected coordinates were used in the molecular replacement in the present study as the search model, after removing the EL5 loop and the tethered peptide. Data collection and refinement statistics of the revised coordinates are provided in Supplementary Table 1. The revised version of the coordinates was deposited with the same PDB entry name, 5GMY, using the PDB entry versioning system.
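The electron-count argument can be made explicit with a few lines of arithmetic. The sketch below simply subtracts the ionic charge from the summed atomic numbers; the helper name is hypothetical, and the comparison ignores occupancy and B-factor effects that also influence the density.

```python
# Atomic numbers of the elements involved
ATOMIC_NUMBER = {"H": 1, "O": 8, "Mg": 12, "Mn": 25, "Zn": 30}

def electron_count(composition, charge=0):
    """Total electrons = sum of atomic numbers minus the net positive charge."""
    return sum(ATOMIC_NUMBER[el] * n for el, n in composition.items()) - charge

species = {
    "H2O": electron_count({"H": 2, "O": 1}),
    "Mg2+": electron_count({"Mg": 1}, charge=2),
    "Mn2+": electron_count({"Mn": 1}, charge=2),
    "Zn2+": electron_count({"Zn": 1}, charge=2),
}
for name, n in species.items():
    print(f"{name}: {n} electrons")
```

H2O and Mg2+ both carry 10 electrons, so they are hard to tell apart by density alone, whereas Mn2+ (23 electrons) is clearly heavier.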
Oligosaccharyl transfer assay
The oligosaccharyl transfer assay was performed by the PAGE method, as described previously61,62. The reaction mixture comprised the wild-type or mutated AfAglB protein (3 nM), an acceptor peptide substrate (3 μM), and crude AfLLO (0.2 μM) in a 10 µL reaction solution, containing 100 mM Tris-HCl, pH 7.5, 10 mM MnCl2, and 0.02% (v/v) Tween 20. The acceptor peptide is Ac-AAYNVTKRK(TAMRA)-OH, in which a fluorescent TAMRA dye is attached to the side-chain amino group of the C-terminal Lys residue for detection. The requisite amount of AfLLO in a chloroform/methanol/water solvent was dried and re-dissolved in the reaction solution, which contained Tween 20 to solubilize the LLO. The reaction was performed in an oven at 65 °C for 30 min or 1 h. To stop the reaction, 5 × SDS sample buffer was added. The in-gel fluorescence images of the SDS–PAGE gels were recorded with an LAS-3000 multicolor image analyzer (Fuji Photo Film), with green LED excitation.
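The reaction composition above fixes the molar amounts of each component, which can be useful when interpreting band intensities. The following sketch is a back-of-the-envelope calculation with hypothetical function names; it assumes that the stated crude AfLLO concentration represents the whole usable donor pool and that no donor is lost to hydrolysis.

```python
def amount_pmol(conc_uM, volume_uL):
    """Amount in pmol: (µmol/L) × (µL) = pmol."""
    return conc_uM * volume_uL

VOLUME_UL = 10.0
enzyme = amount_pmol(0.003, VOLUME_UL)   # 3 nM AfAglB
peptide = amount_pmol(3.0, VOLUME_UL)    # 3 µM acceptor peptide
llo = amount_pmol(0.2, VOLUME_UL)        # 0.2 µM crude AfLLO

# The donor is the limiting substrate, so it caps the product yield.
max_fraction_glycosylated = llo / peptide
max_turnovers_per_enzyme = llo / enzyme

print(f"enzyme {enzyme:.2f} pmol, peptide {peptide:.0f} pmol, LLO {llo:.0f} pmol")
print(f"max glycosylated fraction: {max_fraction_glycosylated:.1%}")
print(f"max turnovers per enzyme: {max_turnovers_per_enzyme:.0f}")
```

Under these assumptions, at most about 7% of the peptide can be glycosylated, so the product band is expected to stay much weaker than the substrate band even at complete donor consumption.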
FNG generation assay
The wild-type or mutated AfAglB protein (0.1 μM) was incubated with crude AfLLO (5 µM) at 65 °C for 1 h in a 100 µL reaction solution, containing 100 mM Tris-HCl, pH 7.5, 5 mM MnCl2, and 0.02% (v/v) Tween 20. The requisite amount of LLO in a chloroform/methanol/water solvent was dried and re-dissolved in the reaction solution, which contained Tween 20 to solubilize the LLO. The reaction was terminated by the addition of 10 µL of 0.2 M EDTA-NaOH, pH 8.0. A 330 µL aliquot of ethanol was added, and the reaction solution was incubated for 15 min at 4 °C. After centrifugation at 15,000 × g for 15 min, the supernatant was evaporated to dryness. The dried oligosaccharides were dissolved in 500 μL of water and loaded on a PD MiniTrap column G-25 (GE Healthcare) for desalting. The column was preequilibrated with 5% ethanol before use. The fraction containing oligosaccharides was collected and the eluted oligosaccharides were evaporated to dryness. The reducing end of the dried oligosaccharides was derivatized with 2-AP. The dried oligosaccharides were incubated with 20 μL of 2-AP in acetic acid at 80 °C for 1 h. After the reaction, the mixture was incubated with 20 μL of a borane-dimethylamine complex in acetic acid, at 80 °C for 30 min. The excess 2-AP was removed using a MonoFas silica gel spin column (GL Sciences). The spin column was washed with water and then preequilibrated twice with 800 μL of 100% acetonitrile before use. The sample solution was mixed with 460 μL of 100% acetonitrile and loaded onto the spin column. The column was washed twice with 800 μL of 95% (v/v) acetonitrile. Water was added to the column to elute the fluorescently labeled oligosaccharides. The pyridylaminated oligosaccharides were separated by HILIC (hydrophilic interaction liquid chromatography) with an AdvanceBio Glycan Mapping column (Agilent Technologies), using an Infinity 1290 UPLC system (Agilent Technologies) equipped with an in-line fluorescence detector.
Solvent A was 100 mM ammonium acetate buffer, pH 4.5, and solvent B was 100% acetonitrile. The column was equilibrated with 20% solvent A at a flow rate of 0.5 mL min−1. The gradient cycle was 12 min with a 1-min isocratic segment at 20% solvent A, a 1-min linear gradient to 27% A, a 6-min linear gradient to 40% A, a 1-min isocratic segment at 100% A, and a 3-min isocratic segment at 20% A.
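The gradient program can be written as a small table and evaluated at any time point. The sketch below is one possible encoding; the names are hypothetical, and it assumes that the changes between the 40% A, 100% A, and 20% A segments are instantaneous steps, which the description implies but does not state.

```python
# (duration in min, %A at segment start, %A at segment end)
SEGMENTS = [
    (1.0, 20.0, 20.0),    # isocratic hold
    (1.0, 20.0, 27.0),    # linear gradient
    (6.0, 27.0, 40.0),    # linear gradient
    (1.0, 100.0, 100.0),  # isocratic wash (step change assumed)
    (3.0, 20.0, 20.0),    # isocratic re-equilibration (step change assumed)
]
CYCLE_MIN = sum(duration for duration, _, _ in SEGMENTS)

def percent_a(t_min):
    """Return %A at time t_min (0 <= t_min <= CYCLE_MIN); %B is 100 - %A."""
    if t_min < 0 or t_min > CYCLE_MIN:
        raise ValueError("time outside the gradient cycle")
    start = 0.0
    for duration, a_start, a_end in SEGMENTS:
        end = start + duration
        if t_min < end:
            # Linear interpolation within the current segment
            return a_start + (a_end - a_start) * (t_min - start) / duration
        start = end
    return SEGMENTS[-1][2]  # t_min == CYCLE_MIN

for t in (0.0, 1.5, 5.0, 8.0, 12.0):
    print(f"t = {t:4.1f} min: {percent_a(t):5.1f}% A")
```

The segment durations sum to the stated 12-min cycle, which is a quick way to confirm that the program has been transcribed completely.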
Statistics and reproducibility
Statistical analyses were performed with the EZR statistical software63, an open-source statistical software program based on R and R Commander64. Data were analyzed by one-way analysis of variance (ANOVA) followed by Dunnett’s two-sided post hoc test for multiple comparisons.
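For readers unfamiliar with the first step of this analysis, the one-way ANOVA F statistic can be computed from first principles as sketched below. This is not the EZR workflow used in the study and does not include Dunnett's post hoc test; the function name is hypothetical and the input values are arbitrary toy numbers, not experimental data.

```python
from statistics import mean

def one_way_anova_f(groups):
    """Return (F, df_between, df_within) for a list of sample groups."""
    all_values = [x for group in groups for x in group]
    grand_mean = mean(all_values)
    # Variation of group means around the grand mean
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    # Variation of observations around their own group mean
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)
    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within

# Arbitrary toy values: one reference group and two comparison groups
toy_groups = [[1, 2, 3], [2, 3, 4], [5, 6, 7]]
f_stat, df_b, df_w = one_way_anova_f(toy_groups)
print(f"F({df_b}, {df_w}) = {f_stat:.2f}")
```

A significant F statistic only indicates that at least one group mean differs; the Dunnett test then compares each group against a single reference group while controlling the family-wise error rate.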
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Data availability
The atomic coordinates of AfAglB-SS-TAMRA-APY(Dab)VTASCR in a complex with dolichol-phosphate have been deposited in the PDB, with the accession code 7E9S. The source data underlying Figs. 3, 4, Supplementary Figs. 3a, b, and 6c are provided in Supplementary Data 1–4. The uncropped gel images related to Fig. 2a, b, Supplementary Figs. 3d, and 6d are provided as Supplementary Figures.
References
Kelleher, D. J. & Gilmore, R. An evolving view of the eukaryotic oligosaccharyltransferase. Glycobiology 16, 47R–62R (2006).
Nikolayev, S., Cohen-Rosenzweig, C. & Eichler, J. Evolutionary considerations of the oligosaccharyltransferase AglB and other aspects of N-glycosylation across Archaea. Mol. Phylogenet. Evol. 153, 106951 (2020).
Larkin, A. & Imperiali, B. The expanding horizons of asparagine-linked glycosylation. Biochemistry 50, 4411–4426 (2011).
Varki, A. Biological roles of glycans. Glycobiology 27, 3–49 (2017).
Varki, A. et al. Essentials of Glycobiology, third edition. (Cold Spring Harbor Laboratory Press, 2017).
Aebi, M. N-linked protein glycosylation in the ER. Biochim. Biophys. Acta 1833, 2430–2437 (2013).
Meyer, B. H. & Albers, S. V. AglB, catalyzing the oligosaccharyl transferase step of the archaeal N-glycosylation process, is essential in the thermoacidophilic crenarchaeon Sulfolobus acidocaldarius. Microbiologyopen 3, 531–543 (2014).
Eichler, J. N-glycosylation in Archaea—new roles for an ancient posttranslational modification. Mol. Microbiol. 114, 735–741 (2020).
Jervis, A. J. et al. Characterization of N-linked protein glycosylation in Helicobacter pullorum. J. Bacteriol. 192, 5228–5236 (2010).
Szymanski, C. M., Burr, D. H. & Guerry, P. Campylobacter protein glycosylation affects host cell interactions. Infect. Immun. 70, 2242–2244 (2002).
Mills, D. C. et al. Functional analysis of N-linking oligosaccharyl transferase enzymes encoded by deep-sea vent proteobacteria. Glycobiology 26, 398–409 (2016).
Gavel, Y. & von Heijne, G. Sequence differences between glycosylated and non-glycosylated Asn-X-Thr/Ser acceptor sites: implications for protein engineering. Protein Eng. 3, 433–442 (1990).
Kowarik, M. et al. Definition of the bacterial N-glycosylation site consensus sequence. EMBO J. 25, 1957–1966 (2006).
Scott, N. E. et al. Comparative proteomics and glycoproteomics reveal increased N-linked glycosylation and relaxed sequon specificity in Campylobacter jejuni NCTC11168 O. J. Proteome Res. 13, 5136–5150 (2014).
Ollis, A. A. et al. Substitute sweeteners: Diverse bacterial oligosaccharyltransferases with unique N-glycosylation site preferences. Sci. Rep. 5, 1–13 (2015).
Ben-Dor, S., Esterman, N., Rubin, E. & Sharon, N. Biases and complex patterns in the residues flanking protein N-glycosylation sites. Glycobiology 14, 95–101 (2004).
Petrescu, A. J., Milac, A. L., Petrescu, S. M., Dwek, R. A. & Wormald, M. R. Statistical analysis of the protein environment of N-glycosylation sites: Implications for occupancy, structure, and folding. Glycobiology 14, 103–114 (2004).
Abu-Qarn, M. & Eichler, J. An analysis of amino acid sequences surrounding archaeal glycoprotein sequons. Archaea 2, 73–81 (2007).
Scott, N. E. et al. Simultaneous glycan-peptide characterization using hydrophilic interaction chromatography and parallel fragmentation by CID, higher energy collisional dissociation, and electron transfer dissociation MS applied to the N-linked glycoproteome of campylobact. Mol. Cell. Proteom. 10, S1–S18 (2011).
Shrimal, S. & Gilmore, R. Oligosaccharyltransferase structures provide novel insight into the mechanism of asparagine-linked glycosylation in prokaryotic and eukaryotic cells. Glycobiology 29, 288–297 (2018).
Kohda, D. Structural basis of protein asn-glycosylation by oligosaccharyltransferases. Adv. Exp. Med. Biol. 1104, 171–199 (2018).
Chen, M. M., Glover, K. J. & Imperiali, B. From peptide to protein: Comparative analysis of the substrate specificity of N-linked glycosylation in C. jejuni. Biochemistry 46, 5579–5585 (2007).
Matsumoto, S., Taguchi, Y., Shimada, A., Igura, M. & Kohda, D. Tethering an N-glycosylation sequon-containing peptide creates a catalytically competent oligosaccharyltransferase complex. Biochemistry 56, 602–611 (2017).
Hartley, M. D. & Imperiali, B. At the membrane frontier: a prospectus on the remarkable evolutionary conservation of polyprenols and polyprenyl-phosphates. Arch. Biochem. Biophys. 517, 83–97 (2012).
Eichler, J. & Guan, Z. Lipid sugar carriers at the extremes: the phosphodolichols Archaea use in N-glycosylation. Biochim. Biophys. Acta—Mol. Cell Biol. Lipids 1862, 589–599 (2017).
Larkin, A., Chang, M. M., Whitworth, G. E. & Imperiali, B. Biochemical evidence for an alternate pathway in N-linked glycoprotein biosynthesis. Nat. Chem. Biol. 9, 367–373 (2013).
Taguchi, Y., Fujinami, D. & Kohda, D. Comparative analysis of archaeal lipid-linked oligosaccharides that serve as oligosaccharide donors for Asn glycosylation. J. Biol. Chem. 291, 11042–11054 (2016).
Chang, M. M., Imperiali, B., Eichler, J. & Guan, Z. N-linked glycans are assembled on highly reduced dolichol phosphate carriers in the hyperthermophilic archaea Pyrococcus furiosus. PLoS ONE 10, 1–11 (2015).
Samuelson, J. et al. The diversity of dolichol-linked precursors to Asn-linked glycans likely results from secondary loss of sets of glycosyltransferases. Proc. Natl Acad. Sci. U.S.A. 102, 1548–1553 (2005).
Lizak, C., Gerber, S., Numao, S., Aebi, M. & Locher, K. P. X-ray structure of a bacterial oligosaccharyltransferase. Nature 474, 350–355 (2011).
Napiórkowska, M. et al. Molecular basis of lipid-linked oligosaccharide recognition and processing by bacterial oligosaccharyltransferase. Nat. Struct. Mol. Biol. 24, 1100–1106 (2017).
Napiórkowska, M., Boilevin, J., Darbre, T., Reymond, J. L. & Locher, K. P. Structure of bacterial oligosaccharyltransferase PglB bound to a reactive LLO and an inhibitory peptide. Sci. Rep. 8, 1–9 (2018).
Matsumoto, S. et al. Crystal structures of an archaeal oligosaccharyltransferase provide insights into the catalytic cycle of N-linked protein glycosylation. Proc. Natl Acad. Sci. U.S.A. 110, 17868–17873 (2013).
Bai, L., Wang, T., Zhao, G., Kovach, A. & Li, H. The atomic structure of a eukaryotic oligosaccharyltransferase complex. Nature 555, 328–333 (2018).
Wild, R. et al. Structure of the yeast oligosaccharyltransferase complex gives insight into eukaryotic N-glycosylation. Science 359, 545–550 (2018).
Ramirez, A. S., Kowal, J. & Locher, K. P. Cryo-electron microscopy structures of human oligosaccharyltransferase complexes OST-A and OST-B. Science 366, 1372–1375 (2019).
Igura, M. et al. Structure-guided identification of a new catalytic motif of oligosaccharyltransferase. EMBO J. 27, 234–243 (2008).
Barre, Y. et al. A conserved DGGK motif is essential for the function of the PglB oligosaccharyltransferase from Campylobacter jejuni. Glycobiology 27, 978–989 (2017).
Shrimal, S., Cherepanova, N. A. & Gilmore, R. One flexible loop in OST lassos both substrates. Nat. Struct. Mol. Biol. 24, 1009–1010 (2017).
Lizak, C. et al. A catalytically essential motif in external loop 5 of the bacterial oligosaccharyltransferase PglB. J. Biol. Chem. 289, 735–746 (2014).
Meng, G., Shi, S., Lalancette, R., Szostak, R. & Szostak, M. Reversible twisting of primary amides via ground state N-C(O) destabilization: highly twisted rotationally inverted acyclic amides. J. Am. Chem. Soc. 140, 727–734 (2018).
Imperiali, B., Shannon, K. L., Unno, M. & Rickert, K. W. A mechanistic proposal for asparagine-linked glycosylation. J. Am. Chem. Soc. 114, 7944–7945 (1992).
Misquitta, L. V. et al. Membrane protein crystallization in lipidic mesophases with tailored bilayers. Structure 12, 2113–2124 (2004).
Harada, Y. et al. Eukaryotic oligosaccharyltransferase generates free oligosaccharides during N-glycosylation. J. Biol. Chem. 288, 32673–32684 (2013).
Dwivedi, R., Nothaft, H., Reiz, B., Whittal, R. M. & Szymanski, C. M. Generation of free oligosaccharides from bacterial protein N-linked glycosylation systems. Biopolymers 99, 772–783 (2013).
Yamasaki, T. & Kohda, D. Uncoupling the hydrolysis of lipid-linked oligosaccharide from the oligosaccharyl transfer reaction by point mutations in yeast oligosaccharyltransferase. J. Biol. Chem. 295, 16072–16085 (2020).
Sun, S. et al. N-GlycositeAtlas: A database resource for mass spectrometry-based human N-linked glycoprotein and glycosylation site mapping. Clin. Proteom. 16, 1–11 (2019).
Kohen, A. Role of dynamics in enzyme catalysis: Substantial versus semantic controversies. Acc. Chem. Res. 48, 466–473 (2015).
Fujinami, D., Nyirenda, J., Matsumoto, S. & Kohda, D. Structural elucidation of an asparagine-linked oligosaccharide from the hyperthermophilic archaeon, Archaeoglobus fulgidus. Carbohydr. Res. 413, 55–62 (2015).
Hirata, K. et al. ZOO: An automatic data-collection system for high-throughput structure analysis in protein microcrystallography. Acta Crystallogr. Sect. D. Struct. Biol. 75, 138–150 (2019).
Yamashita, K., Hirata, K. & Yamamoto, M. KAMO: towards automated data processing for microcrystals. Acta Crystallogr. Sect. D. Struct. Biol. 74, 441–449 (2018).
Kabsch, W. XDS. Acta Crystallogr. Sect. D. Biol. Crystallogr. 66, 125–132 (2010).
Foadi, J. et al. Clustering procedures for the optimal selection of data sets from multiple crystals in macromolecular crystallography. Acta Crystallogr. Sect. D. Biol. Crystallogr. 69, 1617–1632 (2013).
Liebschner, D. et al. Macromolecular structure determination using X-rays, neutrons and electrons: recent developments in Phenix. Acta Crystallogr. Sect. D. Struct. Biol. 75, 861–877 (2019).
Adams, P. D. et al. PHENIX: a comprehensive Python-based system for macromolecular structure solution. Acta Crystallogr. Sect. D. Biol. Crystallogr. 66, 213–221 (2010).
Murshudov, G. N. et al. REFMAC5 for the refinement of macromolecular crystal structures. Acta Crystallogr. Sect. D. Biol. Crystallogr. 67, 355–367 (2011).
Emsley, P., Lohkamp, B., Scott, W. G. & Cowtan, K. Features and development of Coot. Acta Crystallogr. Sect. D. Biol. Crystallogr. 66, 486–501 (2010).
Gerber, S. et al. Mechanism of bacterial oligosaccharyltransferase: In vitro quantification of sequon binding and catalysis. J. Biol. Chem. 288, 8849–8861 (2013).
Allouche, A. Software news and updates gabedit—a graphical user interface for computational chemistry softwares. J. Comput. Chem. 32, 174–182 (2012).
Lovell, S. C. et al. Structure validation by Cα geometry: φ, ψ and Cβ deviation. Proteins Struct. Funct. Genet. 50, 437–450 (2003).
Kohda, D., Yamada, M., Igura, M., Kamishikiryo, J. & Maenaka, K. New oligosaccharyltransferase assay method. Glycobiology 17, 1175–1182 (2007).
Yamasaki, T. & Kohda, D. A radioisotope-free oligosaccharyltransferase assay method. Bio-Protoc. 9, e3186 (2019).
Kanda, Y. Investigation of the freely available easy-to-use software ‘EZR’ for medical statistics. Bone Marrow Transpl. 48, 452–458 (2013).
Fox, J. The R Commander: a basic-statistics graphical user interface to R. J. Stat. Softw. 14, 1902 (2005).
Acknowledgements
We thank Drs. Tadashi Suzuki and Yoichiro Harada for advice on the FNG generation assay, and Mr. Seiichiro Hayashi and Ms. Hisano Yajima for mass spectrometry of the reaction mixture for the preparation of the AfAglB-peptide complex. The DNA sequencing service was provided by the Laboratory for Research Support at the Medical Institute of Bioregulation at Kyushu University. The experiments at beamline BL44XU, SPring-8, were performed under the Cooperative Research Program of the Institute for Protein Research of Osaka University, as Proposals 20196914 and 20206514. The experiments at beamline BL32XU, SPring-8, were supported by the Platform Project for Supporting Drug Discovery and Life Science Research (Basis for Supporting Innovative Drug Discovery and Life Science Research, BINDS) from AMED under Grant Number JP19am0101070 to K.H., and by JSPS KAKENHI Grant Number JP21H02448 to D.K.
Author information
Contributions
Y.T., T.Y., M.I., Y.K., R.Y., and M.M. performed experiments under supervision by K.H. and D.K. Y.T. performed protein and LLO preparations, LCP crystallization, X-ray data collection, and structure refinement calculations. T.Y., M.I., Y.K., and R.Y. performed alanine-scanning mutagenesis, protein purification, and oligosaccharyl transfer assays. M.M. performed mutagenesis, oligosaccharyl transfer, and FNG generation assays. K.H. developed the ZOO system and contributed to the structure refinement calculation. Y.T. and D.K. wrote the paper.
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Peer review information Communications Biology thanks Huilin Li and the anonymous reviewers for their contribution to the peer review of this work. Primary Handling Editors: George Inglis. Peer reviewer reports are available.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Taguchi, Y., Yamasaki, T., Ishikawa, M. et al. The structure of an archaeal oligosaccharyltransferase provides insight into the strict exclusion of proline from the N-glycosylation sequon. Commun Biol 4, 941 (2021). https://doi.org/10.1038/s42003-021-02473-8