1 Introduction

Integral membrane proteins (IMPs) play an essential role in numerous biological processes. In addition, IMPs are important drug targets [1, 2]. Yet, the understanding of IMP structure and function is limited compared with the vast amount of information available for water-soluble proteins [3]. IMPs possess a largely nonpolar exterior that interacts with acyl chains of the membrane bilayer. Most IMPs are α-helix bundles. They contain hydrophobic (or amphipathic) transmembrane helices that are linked by hydrophilic loops, which protrude into the aqueous environment [4]. Due to their nonpolar character, IMPs are prone to aggregation in solution. Solubilization in detergents such as dodecyl-β-D-maltoside (DM) is required for structural/functional studies on native IMPs outside their natural lipid bilayer environment [5].

The difficulties associated with X-ray or NMR structural investigations [3] have spurred the development of low-resolution approaches for predicting IMP topologies (i.e., the location of helices and loops, and the protein orientation relative to the cytoplasm [6]). Kyte and Doolittle hydropathy analyses represent a widely used strategy. In this approach transmembrane helices are tentatively identified as sequence regions of ~19 hydrophobic residues that are separated by hydrophilic stretches (loops) [7]. Prediction algorithms of this type have been refined in recent years by allowing the incorporation of homology information and various constraints [4, 8, 9]. Nonetheless, even these advanced computational tools can generate misleading topology predictions [10]. Experimental validation of the resulting models, therefore, is absolutely essential.

Substituted cysteine accessibility methods (SCAM [10]) represent a key approach for the validation of topology models. Only thiol groups located in solvent-accessible regions react with -SH specific labeling agents. The extent of labeling can be monitored using fluorescence methods. SCAM requires a cysteine-free background construct, and a large number of single site mutants [10, 11]. Protease accessibility assays coupled with gel electrophoresis represent another possible method [12, 13]. In addition, truncated IMP variants can be expressed as fusion proteins. Colorimetric assays or fluorescence techniques can then report on the location of the fused moiety (i.e., cytoplasmic versus extracellular (or periplasmic) [6, 1316]). A drawback of these traditional biochemical methods is their labor intensive nature. More importantly, any sequence alterations may disrupt the native protein fold [10].

Mass spectrometry (MS) is increasingly being used for various types of IMP studies [1, 1728]. Covalent labeling coupled with MS [29] can provide structural information on IMPs. Protein exposure to a hydrophilic reactive probe induces covalent modifications in solvent-accessible regions, whereas buried segments are sterically protected from the labeling agent [30]. The location(s) and the extent of labeling can be determined by LC-MS/MS peptide mapping [31, 32]. Such experiments have been used to explore conformational features and interactions of IMPs [3242]. For IMP topology studies, however, this approach remains underutilized.

Hydroxyl radical (⋅OH) is a widely used covalent labeling agent that can be produced in various ways [31, 35, 4347]. A simple strategy is the photolysis of dilute H2O2 by a pulsed UV laser [48, 49]. This laser-induced oxidative labeling approach is sometimes referred to as fast photochemical oxidation of proteins (FPOP) [50]. Under single-exposure conditions, this technique is free of labeling-induced structural artifacts [50, 51]. Oxidative modifications typically appear as +16 Da adducts, although other mass shifts can occur as well [31]. The extent of modification at any given side chain depends on a combination of solvent-accessibility and intrinsic reactivity [52]. The presence of sulfur in Cys and Met makes these two amino acids by far the most reactive [53]. For IMPs it is not uncommon to observe hydroxyl radical-induced modifications almost exclusively at these sulfur-containing residues [33, 54, 55]. Our laboratory has recently applied laser-induced oxidative labeling for mapping conformational transitions of bacteriorhodopsin, an IMP with a well known three-dimensional structure [42]. In the current work, we extend this approach to probe the topology of an IMP for which no high-resolution structural information is available.

Lipopolysaccharide (LPS) is a major component of the outer membrane of gram-negative bacteria [56]. LPS consists of lipid A, core oligosaccharide, and O antigen [57]. The O antigen is synthesized as a lipid-linked glycan intermediate, and it is ligated to the lipid A-core oligosaccharide prior to export of the complete LPS molecule to the bacterial surface [58]. WaaL is an inner membrane IMP that catalyzes this ligation [13, 15, 59]. Application of the prediction algorithm TMHHM [60] in combination with biochemical assays yielded a topology model of E. coli WaaL that involves 12 transmembrane helices (Figure 1 [13]). Both termini are located on the cytoplasmic side. Molecular modeling suggests that the large extramembrane domain EL5 connecting the putative helices 9 and 10 adopts a tightly folded globular structure [13].

Figure 1
figure 1

Topology of WaaL predicted by TMHMM [60] (modified from Pérez et al. [13]). Transmembrane helices 1, 2, … are denoted as rectangles. A large external domain is denoted as EL5. Residues that undergo oxidative labeling are highlighted using red circles, whereas blue squares indicate Cys/Met residues that are protected from oxidation (see later, Figure 4). One Met and one Cys were not covered by peptide mapping (not highlighted in this Figure). Three conserved residues involved in enzyme function (R215, R288, and H338) [13] are marked in black. The topology map was displayed using VHMTP [66]

This study employs laser-induced oxidative labeling and MS to validate the existing WaaL topology model [13]. The resulting data support many features of the original prediction. Nonetheless, it is found that a slightly revised WaaL topology is in better agreement with the observed oxidation behavior than the original model.

2 Experimental

2.1 Materials and Sample Preparation

Sodium phosphate, DM, ammonium bicarbonate, and formic acid were purchased from Sigma (St. Louis, MO, USA). Sequencing-grade modified trypsin was purchased from Promega (Madison, WI, USA). RapiGest SF is an acid-labile surfactant that was obtained from Waters (Milford, MA, USA). All chemicals were used as received. Wild-type WaaL consists of 419 amino acids. The experiments of this work were carried out on a WaaL construct with a C-terminal FLAG-His10x tag, containing 32 additional residues, and resulting in an overall average MW of 50794.8 Da. The protein was purified from E. coli JM105v by Ni2+-affinity chromatography, and it was subsequently concentrated using a 30 kDa cutoff membrane as described [59]. This method afforded ~1 mg of WaaL protein per liter of culture. The experiments of this study were conducted on DM-solubilized protein. Enzyme activity was verified by an in vitro ligation assay [59]. Buffer exchange yielded stock solutions containing 100 μM WaaL, 0.05% (wt/vol) DM, 150 mM sodium chloride, 12.5 mM imidazole, 10% glycerol, and 50 mM sodium phosphate (pH 8).

2.2 Oxidative Labeling

Pulsed ⋅OH labeling was performed using a two-syringe continuous-flow device similar to that described previously [61]. Syringe 1 contained 20 μM WaaL in 0.05 % DM and 150 mM NaCl, 2.5 mM imidazole, 50 mM phosphate buffer (pH 8), 2% glycerol, and 30 mM glutamine, while syringe 2 contained 0.3% (vol/vol) H2O2 and 0.05% DM in phosphate buffer (pH 8). Both syringes were advanced simultaneously at 20 μL min–1 using a syringe pump (Harvard Apparatus, Boston, MA,, USA). A KrF excimer laser (EX 50IN; GAM Laser, Orlando, FL, USA) producing 18 ns pulses at 248 nm, 32 Hz, and 37 mJ pulse–1 was used to generate hydroxyl radicals by H2O2 photolysis within the reaction capillary made of a 100 μm i.d. fused silica. The beam diameter at the irradiation point was around 1 mm. Glutamine acts as a radical scavenger that quenches the labeling reaction on the time scale of 1 μs [48].

Capillary outflow aliquots of ~120 μL were collected in microcentrifuge tubes containing 10 μL of 200 mM methionine and 10 μL 2 μM catalase (pH 7) for deactivating residual H2O2 and other oxidizing agents that otherwise might contribute to undesired “secondary” oxidation [6265]. All samples were lyophilized and subsequently redissolved in 60 μL of 100 mM ammonium bicarbonate buffer (pH 8) containing 0.2% (wt/vol) RapiGest. The resulting solutions were digested with trypsin for 24 h at 37 °C using a 1:10 (wt/wt) enzyme:protein ratio. Tryptic peptides were analyzed by UPLC/ESI-MS employing a C18 BEH 130 column (2.1 × 100 mm, particle size 1.7 μm, Waters) and a Waters Q-TOF Ultima mass spectrometer. For each injection, 8 μL of digested sample were loaded onto the UPLC column without surfactant removal. Chromatographic separations were carried out at 40 °C and at a flow rate of 80 μL min–1. Solvent A was 0.1% aqueous formic acid, and solvent B consisted of 50:50 (vol/vol) acetonitrile/isopropanol with 0.1% formic acid. Elution started with isocratic flow (4 min, 2% B for desalting), followed by a 50 min linear gradient from 15% to 90% B in 50 min for peptide separation. After each run, 20 μL of formic acid were injected onto the column, and the column was washed with 90% B for 30 min. Confirmation of peptide identities and localization of oxidation sites was performed by collision-induced dissociation (CID) tandem MS conducted in data-dependent acquisition mode. The resulting MS/MS data were analyzed manually using the MassLynx software package supplied by the instrument manufacturer.

2.3 Data Analysis

The extent of oxidation for each peptide is reported as “fraction unmodified,” F u , defined as

$$ {F_u} = {A_u}/\left( {{A_u} + {A_{ox}}} \right) $$
(1)

where A u and A ox are the integrated MS peak areas of the unmodified species and its oxidation products, respectively [31]. The application of this equation rests on the assumption that unmodified peptides and their corresponding oxidation products exhibit similar ionization efficiencies. Previous studies have confirmed that this assumption is justified for the types of modifications discussed here [31, 44]. Fortuitously, none of the WaaL tryptic peptides was found to contain more than one oxidized residue (as discussed below). Hence, the peptide F u values reported in this work directly reflect the oxidation behavior of individual residues.

Even in the absence of laser exposure, the protein exhibits a low amount of background oxidation. This effect can be caused by the presence of H2O2 in the reaction mixture, but some oxidation may also occur during sample handling and storage [6265]. Following procedures established previously [54], all F u values were therefore subjected to background correction according to

$$ {F_u} = \frac{{F_u^{app}}}{{F_u^{bgr}}} $$
(2)

where \( F_u^{app} \) is the “apparent” fraction unmodified, obtained by applying eq 1 directly to data acquired after labeling. \( F_u^{bgr} \) represents the fraction unmodified for non-irradiated control samples. F u values reported below represent an average of three independent measurements, each with its own background correction. Error bars reflect the maximum deviation from the average value.

3 Results and Discussion

3.1 Oxidative Labeling of WaaL

Laser-induced oxidative labeling of DM-solubilized WaaL results in several +16 Da modifications. Using manual inspection of the experimental data, we did not find evidence for reaction products associated with any other mass shifts. Tryptic peptide mapping was employed to identify oxidation sites. UPLC/MS allowed the detection of 21 tryptic peptides, corresponding to ~93% sequence coverage (Table 1). Significant oxidative labeling was found only for four peptides: T3-5, T13, T22, and T25. Oxidation of these protein segments is evident from intense +16 Da signals in the spectra of the laser-irradiated samples (Figure 2b, d, j, and l). These oxidation signals are virtually absent in the spectra of the non-irradiated controls (Figure 2a, c, i, and k). All other peptides show mass distributions that are virtually indistinguishable before and after ⋅OH exposure. Figure 2 exemplifies this lack of oxidative labeling for T14-15 (Figure 2e, f), T17-18 (Figure 2g, h), and T28 (Figure 2m, n). Each of the peptides included in Figure 2 contains at least one highly reactive sulfur-containing residue (Cys or Met). Unfortunately, residues M1 and C203 were not covered by our mapping approach because the corresponding peptides (T1 and T16) were not detectable under the conditions used here (Table 1).

Table 1 Identified tryptic peptides of WaaL. Methionine and cysteine residues are underlined. T2, T6, etc. denotes the peptide location within the WaaL sequence, numbers in brackets represent the amino acid range. Species such as T3-5 represent peptides with missed cleavages. Peptides T1 (1-6) MLTSFK, and T16 (193-215) NHPFLFLLNSCAVLYVLALTQTR were not detected
Figure 2
figure 2

Illustrative UPLC/ESI mass spectra of tryptic WaaL peptides that contain potential oxidation sites (Met and Cys). The panels are arranged in pairs (a/b, c/d, etc.), depicting data for unlabeled control samples and for oxidatively labeled WaaL, respectively. Potential oxidation sites are indicated along the top. Oxidized peptides are denoted as “+ 16”

The four oxidized peptides (T3-5, T13, T22, and T25) were subjected to MS/MS for pinpointing the exact locations of modification sites. Figure 3 shows partial tandem mass spectra of T3-5 for a non-irradiated control (panel a), and T3-5 +16 after oxidative labeling (panel b). In both spectra b 2 ions appear at their expected m/z ratios. Starting from b 3 , the signals originating from the oxidized sample are shifted by 16 Da. This observation identifies M19 as the site of oxidation in T3-5 + 16. The absence of unmodified b n ions with n > 2 in the MS/MS data of Figure 3b reveals that M19 is the only oxidized residue in T3-5 + 16. This finding is notable, because there are three additional highly reactive residues in this peptide (C29, M33, and C54). The lack of oxidation at these three other residues implies that they are not solvent-accessible. MS/MS experiments on the peptide T25 (Figure 3c, d) reveal that C-terminal fragment ions up to y 9 appear at the same m/z values before and after oxidative labeling. The same is true for b 3 , whereas b 4 exhibits a 16 Da shift for the labeled sample. The oxidation site in T25 +16 can therefore be identified as M316 (Figure 3c, d). Similar MS/MS experiments were conducted for T13 and T22. In addition to M19 and M316, those measurements identified M151 and M291 as being oxidized after ⋅OH exposure of WaaL (data not shown).

Figure 3
figure 3

Partial MS/MS spectra of tryptic peptide T3-5 from unlabeled WaaL (precursor m/z 1343.74, (a), and for the peptide T3-5 +16 after laser-induced oxidative labeling (precursor m/z 1347.74, (b). These MS/MS data identify M19 as the only oxidation site in this peptide. Also shown (along the top) is the amino acid sequence of T3-5. Similarly, the bottom two panels illustrate the MS/MS behavior of T25 from an unlabeled sample [precursor m/z 1482.72, (c)] and for oxidatively labeled WaaL [precursor m/z 1490.72, (d)]. These data reveal that M316 is the only oxidation site in T25 + 16

The results discussed so far reveal that laser-induced ⋅OH labeling of WaaL results in four detectable oxidative modifications. All of these correspond to +16 Da mass shifts at Met residues, attributable to methionine sulfoxide (MetO) formation [31]. Oxidation of these residues reveals that they are solvent-accessible. In contrast, methionines M33, M177, M351 are not oxidized and hence not solvent-accessible. Cysteine is known to have an intrinsic ⋅OH reactivity that is even higher than that of methionine [53]. Cys oxidation should give rise to mass shifts of +32 Da (sulfinic acid) or +48 Da (sulfonic acid) [31, 55], and possibly other oxidation products such as sulfenic acid (+16 Da). Yet, none of the cysteines we mapped were oxidized, implying that C29, C54, and C225 are not solvent-accessible. Consistent with their lower intrinsic reactivities [53], none of the other 18 types of residues were found to be oxidizable under the conditions of our work. This chemical selectivity is reminiscent of data previously obtained for other membrane proteins [33, 54, 55]. The effect is likely caused by the presence of surfactants and other additives that act as ⋅OH scavengers and that outcompete less reactive oxidation pathways. As a result, any oxidation at non-sulfur residues is suppressed [48, 54].

3.2 Quantitative Analysis of Oxidation Levels

To quantitatively express the extent of oxidative labeling, we calculated F u (fraction unmodified) values as outlined in equation 2 [31]. Cys or Met residues that do not undergo oxidation are characterized by F u  ≈ 1, reflecting a lack of solvent access. In contrast, solvent-accessible reactive side chains show significantly lower F u values. Roughly 35% of the protein molecules in the reaction mixture undergo hydroxyl radical exposure for the conditions used here [51]. This implies that F u values around (1 – 0.35) = 0.65 are expected for residues that are completely accessible. Values between 0.65 and 1 signify partial solvent access.

Tryptic mapping provides coverage for 10 of the 12 sulfur-containing residues in WaaL. F u values of ~0.65 were observed for M19, M291, and M316, revealing that these three residues are completely solvent-accessible (Figure 4). M151 shows an F u value of 0.88, indicating that this residue is only partially solvent-accessible. Residues that undergo complete or partial labeling are highlighted in red (Figure 4). Complete protection with F u  ≈ 1 is seen for C29, M33, C54, M177, C225, and M351 (blue in Figure 4), implying that these residues are not solvent accessible.

Figure 4
figure 4

Oxidative labeling behavior of WaaL depicted for all Met and Cys residues that were covered by tryptic peptide mapping. Shown in this figure is the “fraction unmodified” (F u ) of potential oxidation sites (listed along the bottom) after hydroxyl radical exposure. Residues that undergo oxidation are indicated in red, while those that are not oxidizable are shown in blue. The color scheme used here mirrors that used in Figures 1 and 5

3.3 Topology Models and Oxidative Labeling Behavior

Comparison of the WaaL oxidative labeling data (Figure 4) with the predicted topology (Figure 1) [13] reveals that all six non-oxidized Cys and Met residues are in putative transmembrane helical regions. Three of the oxidized Met side chains, all with F u values of ~0.65, are situated in regions that are predicted to be solvent-accessible. Specifically, M19 is located at the cytoplasmic membrane surface, and M291/M316 are positioned in the extramembrane domain EL5. The behavior of these nine residues is consistent with the topology model of Figure 1. However, the partial labeling of M151 is difficult to reconcile with the proposed position of this residue in transmembrane helix 5. Other side chains that are predicted to occupy buried locations at a similar depth (e.g., M33 in helix 1) are completely protected from oxidation. Partial oxidation of M151, therefore, points to some inaccuracies of the originally proposed topology (Figure 1) [13].

The model of Figure 1 had been developed in 2008 [13] using the topology prediction routine TMHMM [60] (http://www.cbs.dtu.dk/services/TMHMM/), which employs hydropathy analyses in combination with a hidden Markov model to distinguish transmembrane helices from extramembrane loops. TMHMM generally provides models that quite closely resemble the actual topology of IMPs [60]. PolyPhobious (http://phobius.sbc.su.se/poly.html) is a program that operates on similar principles, but it additionally incorporates global sequence alignment data. Thus, PolyPhobious provides improved model predictions as verified in tests on reference IMPs of known topology [9]. Not surprisingly, subjecting the WaaL sequence to PolyPhobious results in a somewhat modified topology model (Figure 5).

Figure 5
figure 5

Modified WaaL topology model obtained using PolyPhobious [9]. The notation used is identical to that of Figure 1. Red circles indicate residues that undergo oxidative labeling. Non-oxidizable sites are highlighted by blue squares. As discussed in the text, this modified model is in better agreement with our oxidative labeling data than the original model of Figure 1

The overall features of the initial WaaL model (Figure 1) [13] and the modified version (Figure 5) are very similar. Both models predict 12 transmembrane helices, with a large periplasmic EL5 domain. Close comparison, however, reveals some differences in the boundaries of transmembrane helices relative to their connecting extramembrane loops. Importantly, M151 in helix 5 is located closer to the periplasmic membrane surface in the modified model (position 3 in helix 5 [Figure 5], versus position 6 [Figure 1]). This modified position is in better agreement with the oxidation behavior of M151 (Figure 4). In particular, the observation of partial labeling (F u  = 0.88) for M151 is consistent with the topology description of Figure 5 where this residue is placed close to the membrane interface, but not in full contact with the aqueous periplasmic space. One can contrast this behavior with that of M19, which is predicted to be directly at the interface (helix 1, Figure 5), consistent with its more extensive oxidation (F u  = 0.66, Figure 4).

M351 represents another residue that has been placed closer to the periplasmic surface in the revised model (helix 10, Figure 5). Interestingly, this residue is inaccessible (F u  ≈ 1, Figure 4). This labeling behavior of M351 is consistent with molecular dynamics simulations that predict a tightly folded globular structure for the extramembrane domain EL5 [13]. Helices 9 and 10 form the “stem” of EL5. Our results strongly suggest that this globular domain is packed tightly against the periplasmic interface of WaaL, thereby shielding the terminus of helix 10 (including M351) from solvent access in a lid-like fashion. The oxidation behavior of M151 indicates that this shielding does not extend to helix 5.

Overall, our laser-induced oxidative labeling experiments provide information on the solvent-accessibility of 10 highly reactive residues (Cys and Met) in WaaL. These data were used to test the accuracy of computer-generated topology maps. Most features of the measured protection pattern (Figure 4) are consistent with the initially developed model (Figure 1) [13]. However, the agreement between experiment and model can be further improved by employing the more comprehensive PolyPhobious algorithm that yields a slightly revised topology (Figure 5). Notably, the revised topology remains consistent with all of the mutational assays and fusion protein data that had originally been used to support the model of Figure 1 [13].

4 Conclusions

Computer-generated topology maps are widely used for low-resolution IMP structure predictions. In the absence of experimental data to support the resulting models, however, the value of such predictions remains questionable [10]. It is common practice to use various biochemical and mutational assays for the validation of computer-generated topology models [6, 1016]. Those experimental techniques, however, are labor intensive and may induce structural artifacts.

The current work highlights an approach that can complement (or even replace) some of those traditional validation tools. We demonstrate that laser-induced oxidative labeling is suitable for rapidly testing computer-generated topology predictions. The experiments are based on the premise that differential protection of extramembrane regions and transmembrane helices will be reflected in the ⋅OH labeling levels. In cases where different software algorithms provide diverging models, oxidative labeling offers a stringent selection tool. Indeed, we demonstrate that it is possible to differentiate residues that exhibit different degrees of labeling, ranging from completely buried in the bilayer, to partially buried at the membrane interface, to fully solvent-accessible.

We anticipate the possibility to include oxidative labeling data as a priori constraints in future topology prediction programs. Such an approach may be preferable to the use of oxidative labeling data for the post-processing assessment of competing models. The mutational introduction of additional easily oxidizable residues in strategic locations will further enhance the resolution of the approach used here [61]. In addition, it may be possible to conduct labeling experiments on IMPs in their natural bilayer environment (possibly even in vivo), instead of relying on purified proteins. Encouraging initial steps in this direction have recently been reported [34, 54].