Introduction

Monoclonal antibodies (mAb) are among the most common protein therapeutics used to alleviate disease by specifically binding a foreign target (antigen) through a high-affinity interaction between the antibody paratope and antigen epitope interface. Antibodies and their fragment antigen binding regions (i.e., Fab-1) are being developed as therapeutics because they have highly specialized functions [1]. Driven by the need to expedite the development of antibody therapeutics and a better understanding of their mechanism of action, fast and sensitive approaches for epitope mapping are required.

Protein therapeutics typically have molecular weights spanning over 50 kDa [2], and unlike traditional small-molecule therapeutics (<500 Da), they can be difficult to characterize using the traditional high-resolution tools of NMR and X-ray crystallography. Such macromolecules undergo a complex folding process to form higher order structures (HOS), which are the basis for their activity and function. Misfolding of mAbs and other changes in the HOS may result in loss of biologic or therapeutic activity, and/or could potentiate immunogenicity and toxicity [3]. Thus, more rapid and sensitive structural characterization approaches are needed to complement traditional biophysical techniques.

Mass spectrometry (MS)-based techniques such as native top-down MS and bottom-up MS (protein footprinting) provide structural information with high sensitivity, fast turnaround, and small sample consumption. These protein MS tools are also very powerful for understanding the overall conformational stability of the therapeutic in question, especially when high-resolution structures are unachievable or time-consuming to obtain. Furthermore, such methods also have the advantage of elucidating structural features located at the binding interface and distal from the direct interaction, revealing conformational mechanisms that may not be resolved by static techniques alone. To understand better the value of both native top-down and bottom-up MS technologies for the structural characterization and epitope/paratope mapping of biotherapeutics, we have been systematically evaluating multiple MS technologies by using the well-characterized Fab-1:VEGF complex, for which a high resolution structure and comprehensive alanine-scanning are available [4].

Using native top-down MS with electron capture dissociation (ECD), we previously determined that the binding stoichiometry is 2:2 Fab-1:VEGF, as seen in the crystal structure [5]. In addition, native top-down MS coupled with multiple types of fragmentation identified highly flexible regions in VEGF that were not found in crystal structure. Thus, we rationalized that certain regions with high flexibility can hamper crystallization and, therefore, may be purposely truncated to facilitate crystallization [5]. In parallel, we utilized the bottom-up MS approach of carboxyl-group footprinting for epitope mapping, which uses carbodiimide/glycine ethyl ester (GEE) to label solvent-accessible carboxylic acid moieties found in proteins (i.e., aspartic acid (D), glutamic acid (E), and the C-terminus) [6]. With carboxyl-group footprinting, we identified multiple D and E residues involved in the Fab-1:VEGF binding interface [7], and the results agree with observations obtained by alanine-scanning and crystallography [4].

In the current work, we expanded on our bottom-up MS approaches by examining the Fab-1:VEGF complex by using fast photochemical oxidation of proteins (FPOP) [8]. This approach utilizes a high-power laser to photolyze hydrogen peroxide into hydroxyl radicals (·OH), facilitating the sub-millisecond labeling (oxidation) of proteins. Similar to carboxyl-group footprinting, FPOP is an irreversible labeling technique that, when coupled to protein digestion and LC-MS/MS analysis, enables the identification of solvent accessibility of protein side chains within a protein or protein complexes. However, ·OH radicals react relatively non-specifically, allowing modification of 14 of the different 20 standard amino acids [9] and potentially increasing the resolution of the site-specific labeling compared with carboxyl-group footprinting [7] (limited to D and E residues and the C-terminus). Similar to carboxyl-group footprinting, FPOP is best utilized in a comparative analysis in which the extent of labeling for a given peptide/residue is determined under two or more different conditions, such as an unbound versus a bound state.

Here, we report epitope and paratope mapping by determining the extent of labeling for the unbound Fab-1 and VEGF, and compare this information to the Fab-1:VEGF complex, using our recently improved FPOP format [10]. We observed several tryptic peptides with significantly reduced modification in the bound state and used high-resolution MS2 to obtain residue-level information. When mapped to the crystal structure, much of the observed reduction in solvent accessibility in the complex is indeed located at the predicted epitope and paratopes (on VEGF and Fab-1, respectively) [4]. We also observed residues with reduced modifications that are distal to the binding region, indicative of remote conformational changes that occur in complex. The data for both the binding region and distal region showing a remote conformational change are consistent with our previously published carboxyl group footprinting results [7]. We also observed that regions with some of the highest levels of modifications in the complex are located at the regions with relatively high flexibility according to our native top-down MS analysis [5], suggesting a correlation between protein flexibility and side-chain solvent accessibility.

Experimental

Chemicals and Materials

H2O2 (30%), trifluoroacetic acid (TFA), formic acid (FA), L-glutamine, L-methionine, cytochrome c, Trizma base, catalase, dithiothreitol (DTT), sodium iodoacetate (IAA), and all HPLC-grade solvents (H2O and acetonitrile, ACN) were obtained from Sigma-Aldrich (St. Louis, MO, USA) at the highest purity available.

Protein Expression, Fab-1:VEGF Preparation, and Purification

VEGF and Fab-1 were produced in E. coli, purified, and formulated as bulk drug substances. A 2:1 molar ratio of Fab-1:VEGF was incubated at 37 °C for 15 min. The complex and the individual proteins were purified using a Tosoh Bioscience (San Francisco, CA, USA) size-exclusion chromatography (SEC) TSKgel G2000SWXL (7.8 × 300 mm, 5 μm) column on an Agilent (Santa Clara, CA, USA) 1100 HPLC system equipped with a fraction collector. Separation was performed using an isocratic run time of 30 min at 0.5 mL/min using the mobile phase buffer (0.2 M K2HPO4, 0.25 M KCl, pH 6.2) at ambient temperature. All fractionated samples were purified to >98% purity as previously described [7] and were concentrated to approximately 40 g/L. Samples were buffer exchanged into an arginine-succinate buffer.

FPOP Labeling

FPOP was performed as described previously [10]. A 248 nm KrF excimer laser (GAM Laser Inc., Orlando, FL, USA) adjusted to approximately 30 mJ/pulse was used to irradiate the flowing sample solution. The laser was focused through a convex lens (Edmunds Optics, Barrington, NJ, USA) onto a 150 μm i.d. fused-silica tubing (Polymicro Technologies, Pheonix, AZ, USA), giving a 2.5–3.0 mm irradiation window. The laser-pulse frequency was controlled by an external pulse generator (B&K Precision, Yorbal Linda, CA, USA). H2O2 and the scavenger (glutamine or the formulation buffer) in a Tris buffer were mixed with the protein solutions in the flow system just prior to the FPOP irradiation window. The mixing involved inserting the silica tubing into a micro-tee mixer (Cobert Associates Lab, St. Louis, MO, USA), giving thorough and rapid mixing by using different i.d. silica tubing in the tee (Polymicro Technologies) at different flow rates. The oxidatively modified sample was collected in low protein binding tubes containing 10 μL of 50 nM catalase and 200 mM methionine to decompose residual H2O2.

Proteolysis

Eight identical samples of the proteins of interest (25 μg in 60 μL) were subjected to FPOP labeling, collected and pooled together, and concentrated by an MWCO PES cutoff filter (Vivaproducts, Littleton, MA, USA) to ~150 μL (200 μg total protein for each digestion). The size of the membrane filter was chosen on the basis of the molecular weight (MW) of the proteins (i.e., 3 kDa for VEGF and 10 kDa for the Fab-1 and the complex). All samples were run in triplicate as separate FPOP experiments. Trypsin digestion of the proteins was carried out as described previously [7]. Briefly, the concentrated samples following FPOP treatment were diluted with denaturing 360 mM Tris buffer (pH 8.6) containing 6 M guanidinium hydrochloride (GdnCl) and 0.1 M EDTA. Subsequently, the protein samples were reduced with DTT (10 mM at 37 °C for 1 h), alkylated with IAA (25 mM at ambient temperature in the dark for 30 min), and quenched with DTT (50 mM at ambient temperature for 5 min). The alkylated samples were then loaded onto a NAP-5 desalting column (GE Healthcare Bio-Sciences, Pittsburgh, PA, USA), eluted with 25 mM Tris buffer, and collected in 1.5 mL, low-binding tubes. The desalted samples were digested at 37 °C for 1.5 h with sequencing grade trypsin (Roche, Indianapolis, IN, USA) at an enzyme:protein ratio of 1:20. The digestion was quenched by adding TFA to a 0.3% final concentration.

Mass Spectrometry Analysis

Tryptic peptides (20 μg) were separated using an Agilent 1200 HPLC with a Waters (Milford, MA, USA) BEH300 C18 (1.7 μm, 2.1 × 150 mm) column. Peptide elution was performed using a gradient from 100% solvent A (water, 0.1% TFA) to 55% solvent B (acetonitrile, 0.08% TFA) over 45 min at a flow rate of 0.3 mL/min and column temperature of 77 °C. Samples were analyzed on an Orbitrap Elite (Thermo Fisher, Bremen, Germany). A full mass scan in the positive-ion mode (60,000 mass resolving power at m/z 400 from m/z 400 to 2000) was performed using the Orbitrap analyzer. Data-dependent acquisition (DDA) MS2 analysis was performed in the ion trap (IT) analyzer for the product-ion analysis of the six most abundant ions.

Peptide Identification and Data Analysis

The .raw data files were converted to .mgf using MassMatrix Mass Spectrometric Data File Conversion Tools (ver. 3.9) [11], submitted to Mascot (Matrix Science, London, UK), and then searched against a custom-built database containing sequences of both VEGF and Fab-1 for peptide identification. For the search, all known oxidation-induced modifications [12] were considered as variable modifications. The search criteria include peptide MS1 and MS2 mass tolerances of 10 ppm and 0.8 Da, respectively. The Mascot search was also performed against a decoy (reverse) sequence, and ambiguous identifications were rejected.

For quantitative analysis of oxidative modification at the peptide level, only the intensities of the extracted ion chromatogram (XIC) that were identified as corresponding to either unmodified peptides or related modified peptides from VEGF or Fab-1 were used to report extent of modification [10].

All the assignments were based on MS2 verification. In a few cases, MS2 assignment of a modification at a single residue was ambiguous owing to co-eluted peptides or incomplete fragmentation information. In such cases, the extent of modification was assigned to a range of residues.

Results and Discussion

Normalization of the Oxidative Potential by Varying Scavenger and Protein Amounts

In a typical FPOP experiment, the lifetime of the highly reactive ·OH is primarily controlled by the addition of a scavenger (e.g., simple amino acid), but ·OH radicals are also consumed by the target protein, recombination (or self-quenching), and by potentially any other excipients in solution (e.g., buffer salts, cryo-protectants, surfactants). Thus, the total oxidative potential of the solution should be kept constant for all samples tested, which is typically achieved by using an inert buffer system such as phosphate-buffered saline (PBS) with no other components but scavenger and protein. However, biotherapeutic proteins are typically formulated for therapeutic use and typically contain many components that may affect the oxidative potential of the solution. Therefore, a direct comparison of two different buffer systems may be difficult if the oxidative potentials are not consistent. To investigate the feasibility to normalize the oxidative potential through sample preparation conditions, we compared the global oxidation of our current conditions of 2.8 mM Arg in 10 mM Tris buffer, to that of previous FPOP conditions, which used 20 mM Gln in PBS [10]. We were able to achieve the same level of global oxidation by using 20 mM Gln in PBS [10] (previous data = 73.7 ± 0.6%), compared with our current condition of 2.8 mM Arg in 10 mM Tris buffer (current data = 75.7 ± 0.7%) (Supplementary Figure S1). In addition, the same level of oxidation is observed for different charge states (Supplementary Figure S2), indicating normalization of oxidative potential may be an appropriate strategy for comparing two different buffer systems.

In addition to buffer components, the amount of protein in solution will contribute to the overall oxidation potential. Previous work using gamma-ray radiolysis of water demonstrated that increasing protein concentration (from 11 to 44 μM) resulted in a subsequent decrease in total protein oxidation [13]. This is because at higher protein concentration there is less total oxidative events per protein molecule. However, unlike FPOP, gamma-ray radiolysis of water does not utilize the addition of a scavenging reagent to control the lifetime of the generated hydroxyl radicals. The chemical kinetics of FPOP relies on the principles that the dominating chemistry for controlling the lifetime of the radical is the scavenger. Therefore, to investigate if total protein oxidation dependent on protein concentration, we compared the oxidative footprint of Fab-1 at our experimental conditions (~0.5 g/L, 10 μM protein) and a 4-fold increase (2 g/L, 40 μM). We observed that protein concentration did not significantly affect total oxidation for the individual peptides of Fab-1 (Supplementary Figure S3). These data demonstrate that protein concentration does not affect the total of oxidative events per molecule under conditions similar to that of gamma-ray radiolysis. These findings indicate that the total protein in solution (up to 2 g/L) does not contribute to the overall oxidative potential under our FPOP experimental conditions.

Mapping an epitope by FPOP yet requires identifying peptides with significantly less modification in the bound versus unbound states. Although we demonstrated that the total protein concentration does not affect the oxidative potential of our FPOP experiments, we employed an “equal-weight” strategy in which each sample had the same amount of total protein (g/L), scavenger, and H2O2 to further ensure similar oxidative potential for the free proteins (Fab-1 ~50 kDa, VEGF dimer ~40 kDa) and the complex (Fab-1:VEGF ~140 kDa). All of the samples have a different molar amount of protein, but they all contain 25 μg/60 μL (0.42 g/L) of footprinted sample (Fab-1 = 8.4 μM, VEGF = 10.5 μM, complex = 4.7 μM) described in the Supporting Information. Based on the results shown in Supplementary Figure S3, the total amount of protein for each sample in our epitope mapping experiment (~0.5 g/L) does not affect the total number of oxidative events per molecule, and thus Fab-1 and VEGF should be equally reactive in unbound state as the bound state.

Epitope and Paratope Mapping for VEGF and Fab-1 at the Peptide Level

To obtain the epitope and paratope maps, we digested the protein samples in the unbound and bound states, after being subjected to hydroxyl radical labeling, with trypsin, and analyzed the peptide mixtures by LC-MS/MS. Based on Mascot database searching and manual analysis, we obtained approximately 88%, 90%, and 86% sequence coverage for VEGF, Fab-1 heavy chain (HC), and light chain (LC), respectively. The missing peptides were small and hydrophilic, ranging from one to five amino acids that eluted at or prior to the void volume. The modification extents of peptides were calculated based on integrating the XIC corresponding to both unmodified and modified peptides, as previously described [11, 12]. All peptides chosen for interpretation satisfied two criteria: accurate mass within 10 ppm and a correct MS2 assignment by manual inspection after evaluation of the Mascot results. It is well known that methionine residues, for example, can be oxidized during storage and sample preparation. Such background (non-FPOP) oxidation can be estimated from the analysis of the non-laser control samples, the results of which are background-subtracted from laser-exposed samples. Figure 1 shows example extracted ion chromatograms for a VEGF-containing tryptic peptide in the unbound and bound state.

Figure 1
figure 1

MS (a), XIC peaks (b), and product-ion (MS/MS) (c) spectra of peptide V17-23 of VEGF, both unmodified (black traces) and modified (blue and red traces). Blue traces in (a) represent mass spectra of oxidized peptide. Blue and red traces in (b) and (c) represent XIC peak or product-ion spectra of peptide oxidation on residue M and Y, respectively

For VEGF, we identified 10 tryptic peptides by LC-MS/MS. The modification extents varied (Figure 2) from 0% for several peptides to ~19% for the N-terminal peptide (V1-16). Comparing the differences in modification extents between the unbound and Fab-1-bound VEGF, we found seven peptides that showed no significant difference (single variable t-test using a 95% confidence interval), indicating that solvent accessibility and, hence, conformation in these regions do not change upon VEGF binding. Three peptides (V17-23, V57-82, and V83-101) of VEGF, however, were significantly less modified upon Fab-1 binding, all three of which contain residues involved in binding, according to alanine-scanning results [4].

Figure 2
figure 2

Extent of modification of free VEGF (red bars) compared with Fab-1-bound VEGF (blue bars) at the peptide level (data from triplicate sample preparations and injections). Regions with significant differences (p < 0.05) between the two states are shown by red asterisks

For Fab-1, we identified 10 tryptic peptides from the HC (Figure 3a) and nine peptides for the LC (Figure 3b). We were unable to detect several peptides of less than five amino acids located at the C-terminus of the HC. Four peptides from HC (HC20-38, HC44-65, HC77-87, and HC99-127) and three from LC (LC19-42, LC46-61, and LC62-103) were significantly less modified upon VEGF binding, indicating reduced solvent accessibility or perhaps lower flexibility in the bound state. The crystal structure [4] indicates that three loops from both Fab-1 HC and LC comprise the binding region (CDR1, CDR2, and CDR3) of Fab-1. Peptides LC19-42, LC46-61, and LC62-103 undergo reduced modification in the bound versus unbound states, and they cover the CDR1, CDR2, and CDR3 regions of Fab-1 LC, respectively. Likewise, for Fab-1 HC, peptides HC20-38, HC44-65, and HC99-127 cover the CDR1, CDR2, and CDR3 regions, respectively. Interestingly, peptide HC77-87 is not associated with the binding interface, as determined by X-ray crystallography, yet also shows reduced solvent accessibility in Fab-1 upon VEGF binding.

Figure 3
figure 3

Extent of modification of (a) HC from free (red bars) compared with VEGF-bound Fab-1 (blue bars) and (b) LC from free (red bars) compared with VEGF-bound Fab-1 (blue bars) at peptide level (data in triplicate). Regions with significant differences (p < 0.05) between the two states are shown by red asterisks

In addition to observing peptides with significantly less modification in the bound versus unbound state, we observed many peptides in both Fab-1 and VEGF that remain unchanged. These peptides not only indicate regions that are not affected by the binding interactions but also strongly suggest that the overall oxidative potential of bound and unbound samples is equivalent.

Epitope and Paratope Mapping for VEGF and Fab-1 on the Amino-Acid Residue Level

In general, peptide-level analysis provides adequate information to demonstrate the regions of a protein with changes in solvent accessibility from the unbound versus bound states [14]. However, with high quality MS2 data, residue-level information can be achieved with certain peptides. To demonstrate this, we analyzed all oxidized residues by manually checking all available product-ion spectra of the modified peptides, as demonstrated in Figure 1. We extracted residue-level information from the XICs for various modifications (e.g., +16, peptides usually eluting earlier than the unmodified) and assigned each modification site on the basis of its product-ion spectrum.

For VEGF, three residues (Y21, M81, and M94, from peptides V17-23, V57-82, and V83-101, respectively) show significant decreases in modification upon Fab-1 binding (Figure 4). In fact, mutation of M81 to alanine results in over 70-fold decrease in binding affinity [4]. These three residues lie in regions that contact Fab-1, and their side chains point toward the binding interface.

Figure 4
figure 4

Extent of modification of free VEGF (red bars) compared with Fab-1-bound VEGF (blue bars) at the residue level (data in triplicate). Residues with significant differences (p < 0.05) between the two states are shown by red asterisks

In addition to mapping the epitope on VEGF, we identified residues that appear to be involved in binding on the Fab-1 paratope. We found four specific residues of Fab-1 HC (H31 and M34 from peptide HC20-38, Y54 from peptide HC44-65, and M83 from peptide HC77-87) and two specific residues of Fab-1 LC (F50 and L54 from peptide LC46-61) that displayed significantly less modification upon VEGF binding (Figure 5). In addition, both the Fab-1 HC and Fab-1 LC each contained several peptides that appear be associated with binding, but our MS2 data were unable to identify the exact modified residue. The modifications for these several peptides are denoted by a range of residues within the peptide (Figure 5).

Figure 5
figure 5

Extent of modification of (a) HC from free Fab-1 (red bars) compared with VEGF-bound Fab-1 (blue bars), and (b) LC from free Fab-1 (red bars) compared with VEGF-bound Fab-1 (blue bars) at residue level (data in triplicate). Residues with significant differences (p < 0.05) between the two states are shown with red asterisks

As expected, the HC-containing residues M34 and Y54 displayed decreases in binding affinity when mutated to alanine (6.3-fold and 9.4-fold, respectively), but surprisingly we found no significant effect on affinity for the LC-containing residues F50 and L54 (loss of 1.4-fold and 0.83-fold, respectively). Although the affinity data from the latter residues are consistent with the notion that LC CDR2 is often not required for binding [15], our data demonstrate that these residues are, in part, affected by binding.

Three of the four residues of the Fab-1 HC (H31, M34, and Y54) showed significant decreases in modification upon binding, and all lie on the paratope region in the Fab-1:VEGF structural model. Only residue M83 is remote from the paratope region shown in the X-ray structure, and the change in modification may be caused by a remote conformational change upon binding. In addition, we observed several isomeric modified peptides at residues F132 (HC128-139), F50 (LC46-61), F116 and F118 (LC109-126), L154 (LC150-196), and L182 (LC170-183). The isomers of each residue are of comparable abundance and display significantly lower modification levels in the bound state (Figure 5). Peptides in which the MS2 data were unable to identify the exact modified residue are denoted as a range of residues for VEGF (Figure 4) and the Fab-1 HC and Fab-1 LC (Figure 5). Mapping of our FPOP data onto the crystal structure reveals a complete picture for the Fab-1:VEGF binding interface (Figure 6).

Integrative Mapping by Multiple MS-Based Approaches

Mapping antibody–antigen interfaces and monitoring the HOS of proteins are important applications of MS in biotechnology and therapeutic protein development. Multiple approaches from both top-down and bottom-up MS analysis can provide comprehensive structural information for the target systems. We previously showed that native top-down MS with ECD fragmentation can probe the flexible region of Fab-1:VEGF complexes [5], assisting in the identification of protein regions that may hinder crystallization. In addition, we demonstrated that our bottom-up approach using site-specific carboxyl-group labeling successfully identified some of the peptides/residues at the interface between Fab-1:VEGF [7]. Here, we show that FPOP provides higher structural resolution (i.e., more residue-level information) of the binding interface compared with carboxyl-group footprinting and confirms the suspected remote conformational changes within the complex detected in our previous data. Each of these methods has its own utility and methodological benefits for deciphering solution-state structural interactions (described below).

The site-specific carboxyl-group labeling of Fab-1:VEGF is a simpler bench-top footprinting method that does not require the laser platform of FPOP. In addition, the site-specific labeling of carboxyl groups makes data analysis more straightforward than FPOP analysis [7]. However, this method is limited by the location of the carboxyl groups, which may not be in the sites of interest (i.e., binding interfaces), and it produces a slower “snapshot” (longer exposure) than does FPOP. Comparing the two techniques, we previously showed using carboxyl-group footprinting that only one residue (E93) of VEGF and two residues of Fab-1 HC (D28 and E57) in the binding region displayed significantly reduced rates of labeling upon Fab-1 binding [7] (Table 1). These three residues are located in the same peptides in our FPOP experiments that displayed significant reduction in ·OH labeling, demonstrating the complementarity of these methods (Figure 6). In addition, we observed that residue E89 in the Fab-1 HC (HC88-98), which is distal from the Fab-1 CDR, also displayed significant reduction in carboxyl-group rate of labeling in the bound state. If the crystal structure were not available, the conformational effects on E89 may have been mistakenly assigned as involved in binding. Nonetheless, these data correlate with our FPOP data, demonstrating a different Fab-1 HC tryptic peptide (HC77-87) that is also distal to the binding interface and showing a significantly lower modification in the presence of VEGF. Further residue-level analysis assigned M83 as the primary modified residue in this peptide, and although M83 and E89 are associated with different tryptic peptides, these residues map to a similar region in the presence of VEGF (Figure 6). Thus, we suggest that either of these two complementary footprinting approaches can support structural information about the conformation of a binding interface and also provide evidence for identifying binding-induced changes remote from the interface.

Table 1 Comparison of the Fab-1:VEGF Complex Higher Order Structure Data Obtained from Different Techniques
Figure 6
figure 6

FPOP data mapped onto a previously published X-ray structure (PDB: 1BJ1). (a) Full view of the Fab-1:VEGF complex represented as cartoon structures for VEGF (orange), Fab-1 HC (blue), and Fab-1 LC (green). (b) Expanded view of the epitope and paratope regions. Residues with positive identification by MS2 with reduced solvent accessibility in the Fab-1:VEGF complex are represented by sticks in cyan (VEGF) and red (Fab-1 HC and Fab-1 LC). Peptide regions in the Fab-1 HC with reduced solvent accessibility in the Fab-1:VEGF complex in which MS2 could not identify the exact modified residues are represented by sticks in magenta (Y99-103) and orange (H107-Y109). Residues previously identified with a reduced rate of GEE labeling in the Fab-1:VEGF complex are represented by sticks in brown [7]. Peptide regions previously identified as flexible using Native Top-Down MS with ECD are colored in yellow for Fab-1 HC and LC and grey for VEGF [5]

By comparing the structural data that we obtained from our previous native top-down MS study on the Fab-1:VEGF complex [5] to the current FPOP data, we can further demonstrate the utility of combining MS-based approaches for biotherapeutic HOS characterization (Table 1). Using native top-down MS with ECD fragmentation, we identified multiple regions of the complexes of VEGF, Fab-1 HC, and Fab-1 LC that appear to have conformational flexibility: (1) the N-terminus of VEGF (from A1 to H12), (2) both N- and C-termini of Fab-1 HC (from E1 to G10 and from S225 to L231, respectively), and (3) the N-terminus of Fab-1 LC (D1-S9). We hypothesize that these regions may be more solvent-accessible, and indeed we observed a higher extent of FPOP labeling for the N-terminal peptide of VEGF (V1-16). This peptide was not present in the crystal structure, suggesting on the basis of our native top-down MS data that this peptide was purposely omitted during expression of VEGF to facilitate crystal formation. Interestingly, the N-terminus for both the Fab-1 HC and LC do not appear to be the most solvent accessible compared with the rest of the Fab-1 molecule, even though our native top-down MS analysis indicates these regions may have more conformational flexibility, most likely due to the lack of residues with high reactivity towards oxidative labeling, such as methionine. Therefore, although there does appear to be a correlation between conformational flexibility and solvent accessibility, further experiments are required to strengthen the interpretation of the data yielded by these technologies. This work demonstrates that combining top-down and bottom-up MS-based approaches significantly improves our understanding of the higher order structure of complex protein–protein binding interactions. To assess further our HOS toolbox, we are currently investigating the Fab-1:VEGF complex using hydrogen/deuterium exchange analysis. However, the disulfide bonds found in VEGF have proven to be difficult using our current online reduction strategy.

Conclusion

Hydroxyl radical labeling techniques such as FPOP provide insight about the changes in solvent accessibility of side chains. Using an “equal-weight” strategy, we successfully analyzed two different proteins (with presumably two different rates of radical consumption) in their respective unbound and bound states, to obtain not only solution epitope and paratope mapping but also conformational stability and remote structural changes. We demonstrate that the data generated from FPOP are consistent with the predicted interactions at the binding interface as previously determined by crystallography and alanine-scanning. In addition, measuring the decrease in solvent accessibility by FPOP provides more information on the binding interface between Fab-1 and VEGF than does carboxyl-group footprinting. Nevertheless, we demonstrate the complementarity of FPOP, carboxyl-group footprinting, and native top-down MS analysis, and show that combining the data from these approaches can offer a more complete picture about the solution conformation of a protein and protein complex. Thus, we have a continued interest in understanding the complementarity within our HOS toolbox. By applying multiple solid-state and solution HOS technologies to a single system like the Fab-1:VEGF complex, we intend to demonstrate the value of each of these techniques for biotherapeutic development and characterization.