Investigating the Structural Compaction of Biomolecules Upon Transition to the Gas-Phase Using ESI-TWIMS-MS
- 1.4k Downloads
Collision cross-section (CCS) measurements obtained from ion mobility spectrometry-mass spectrometry (IMS-MS) analyses often provide useful information concerning a protein’s size and shape and can be complemented by modeling procedures. However, there have been some concerns about the extent to which certain proteins maintain a native-like conformation during the gas-phase analysis, especially proteins with dynamic or extended regions. Here we have measured the CCSs of a range of biomolecules including non-globular proteins and RNAs of different sequence, size, and stability. Using traveling wave IMS-MS, we show that for the proteins studied, the measured CCS deviates significantly from predicted CCS values based upon currently available structures. The results presented indicate that these proteins collapse to different extents varying on their elongated structures upon transition into the gas-phase. Comparing two RNAs of similar mass but different solution structures, we show that these biomolecules may also be susceptible to gas-phase compaction. Together, the results suggest that caution is needed when predicting structural models based on CCS data for RNAs as well as proteins with non-globular folds.
KeywordsIon mobility spectrometry Electrospray ionization Mass spectrometry Proteins RNAs
The advent of electrospray ionisation (ESI) transformed the field of mass spectrometry (MS) by providing the ability to routinely analyze not only large proteins but also noncovalently bound biomolecular complexes. In the three decades since this development, there has been a significant body of literature providing evidence of the native-like state of biomolecules measured by both ESI-MS and, more recently, ESI-ion mobility spectrometry-MS (ESI-IMS-MS) [1, 2, 3].
Ion mobility spectrometry (IMS) is a separation technique based on the gas-phase mobility of ions as they travel, under the influence of a weak electric current, through a drift tube filled with an inert gas [4, 5, 6]. Ions are separated based on their charge and shape: briefly, compact ions travel faster than extended ions carrying the same number of charges, whilst ions with a high number of charges travel faster than ions carrying a lower number of charges derived from the same precursor molecules. When coupled with MS, the data output is a 3D array of m/z versus intensity versus IMS drift time. The IMS drift time for ions can be converted to collision cross-section (CCS) directly if the IMS drift tube is a linear one [6, 7, 8], or indirectly following a calibration procedure [9, 10, 11, 12] if the drift tube is of a traveling wave (TW)  design. The CCS of an ion corresponds to the averaged rotational 2D projection of the biomolecule’s 3D structure. Hence, ESI-IMS-MS is a unique and powerful tool that can separate and characterize biomolecules, providing both mass and shape (via CCS) information on individual species within an ensemble in a single, rapid, experiment. Indeed, ESI-IMS-MS has been employed to study the 3D architecture and conformational properties of many proteins and noncovalently bound biomolecular complexes [4, 5, 6, 7, 8, 9, 14, 15, 16, 17, 18, 19, 20, 21, 22].
In 1997, Joseph Loo stated there are three camps of opinion concerning the retention of native protein structure upon transition into the gas-phase: “believers, nonbelievers, and undecided” , and quite possibly he was correct to hint at caution because despite the high number of successes reported, there has been a slow, low level emergence of literature demonstrating the “collapse” of certain proteins upon transition into the gas phase [23, 24, 25], one key example being antibodies [26, 27, 28]. Here, by systematic analysis of different non-globular proteins and RNA molecules using ESI-TWIMS-MS, we provide evidence of compaction in the gas-phase, highlighting a potential caveat in studying these specific biomolecules using this technique. The degree of compaction has been revealed by comparing the CCS values estimated from the ESI-IMS-MS data with CCS values calculated from the PDB structures of these biomolecules and also, in the case of the proteins, with in vacuo Molecular Dynamics (MD) simulations.
The monoclonal antibody, mAb1, was supplied by Medimmune (Cambridge, UK); (I27)5 was expressed recombinantly as described elsewhere  with amino acid linkers in between each subunit (see Supporting Information, Supplementary Table S1); POTRAs were cloned and expressed recombinantly as described elsewhere ; a SasG construct (G51-G57-Strep-CysCys) was produced as described previously . The RNA molecules were purchased from Integrated DNA Technologies BVBA, Leuven, Belgium.
Protein Mass Spectrometry Analyses
All nanoESI-TWIMS-MS protein measurements were carried out using a Synapt HDMS mass spectrometer (Waters Corp., Wilmslow, UK). Samples were introduced to the mass spectrometer using in-house pulled borosilicate capillaries (Sutter Instrument Co., Novato, CA, USA) coated with palladium using a sputter coater (Polaron SC7620; Quorum Technologies Ltd., Kent, UK). All protein samples were analyzed in positive ESI mode. The m/z scale was calibrated using 10 mg/mL aqueous caesium iodide (CsI) clusters across the acquisition range (typically m/z 500–15,000).
Protein samples were dialyzed into 150 mM aqueous ammonium acetate before being infused into the Synapt HDMS instrument. nESI-MS and nESI-TWIMS-MS experiments were conducted under the following settings: capillary voltage, 1.5 kV; sample cone, 30 V; extraction cone, 4 V; source temperature, 60–80 °C; backing pressure, 3.0–5.0 mBar; trap voltage, 10–40 V; trap/transfer gas flow, 1.5 mL/min; IMS nitrogen gas flow, 20 mL/min, IMS wave height (ramped), 5–30 V; and traveling wave speed, 300 ms.
All data were processed and analyzed with the MassLynx v4.1 and Driftscope software, supplied with the mass spectrometer.
ESI-TWIMS-MS CCS Calibrations for Proteins
ESI-TWIMS-MS experiments were carried out on a Synapt HDMS mass spectrometer using traveling wave IMS. Calibration of the traveling wave drift cell was carried out using a previously published method . The calibrant proteins used were: beta-lactoglobulin, concanavalin A, alcohol dehydrogenase, and pyruvate kinase, taken from the Clemmer/Bush database . Calibrant proteins were dissolved at a concentration of 10 μM in 200 mM ammonium acetate before being analyzed under the same conditions as the protein analytes.
The new corrected drift time (t′′D) was then plotted against the cross-section of the calibrant proteins (taken from the database ) to generate the calibration plots (see Supporting Information, Supplementary Figures S1, S2, and S3).
RNA Mass Spectrometry Analyses
All nanoESI-TWIMS-MS RNA measurements were carried out using a Synapt G2-S mass spectrometer (Waters Corp., Wilmslow, UK). Samples were introduced to the mass spectrometer using in-house pulled borosilicate capillaries (Sutter Instrument Company, Novato, CA, USA) coated with palladium using a sputter coater (Polaron SC7620; Quorum Technologies Ltd., Kent, UK). All RNA samples were analyzed in negative ESI mode. The m/z scale was calibrated using 10 mg/mL aqueous caesium iodide (CsI) clusters across the acquisition range (typically m/z 500–15,000).
On receipt, the two 35-nucleotide RNAs (PDB structures 2PCV and 2DRB) were diluted to 200 μM in RNAse-free MilliQ water (Millipore UK, Watford, UK), separated into 100 μL aliquots and stored at –80 °C. When ready for analysis, these samples were diluted with 50 mM ammonium acetate to a concentration of 40 μM, desalted using spin columns (BioRad, Hemel Hempstead, UK) and then diluted to 8 μM with 50 mM ammonium acetate. nESI-TWIMS-MS experiments were conducted under the following settings: capillary voltage, –0.8 kV; sample cone, 30 V; source temperature, 60–80 °C; IMS sheath gas, He 2.42e-2 bar; IMS buffer gas, Ar 3.01 bar; IMS wave height 40 V; IMS wave velocity 400–750 m/s per scan.
ESI-TWIMS-MS CCS Calibration for RNAs
ESI-TWIMS-MS experiments for the RNAs were carried out on a Synapt G2-S mass spectrometer using traveling wave IMS with negative ionisation electrospray. Calibration of the traveling wave drift cell was carried out using the method described previously in this document for protein samples but with an enhanced duty cycle delay coefficient (CEDC) of 1.41. The calibrant was a DNA polythymine of 10 nucleotides (d[T]10), the CCS of which had been measured and reported by Clemmer  (see Supporting Information, Supplementary Figure S4).
Theoretical CCS Calculation
In Vacuo Molecular Dynamics (MD) Simulations
MD simulations were run using the NAMD software (NAMD 2.9), using the CHARMM force field . Structures were simulated in a solvent-free system. For the simulation, a constant temperature of 300 K with Langevin thermostat was used and a time-step of 2.0 fs with a radial cut-off distance of 12 Å used throughout. Energy minimization in vacuo was implemented for a total of 0.5 ns before an equilibration of 10 ns; the cut-off distance, force field, and time step remained as described above throughout the simulation. Visual molecular dynamics (VMD)  was then used to visualize the simulation, and individual frames were saved as PDB coordinates in order to compute the CCS using MOBCAL. The VMD software was also used to calculate the root mean square deviation (RMSD) and radius of gyration (Rg). Analysis of the RMSD revealed whether a protein had equilibrated by the end of the 10 ns simulation; any sample that had not finished equilibrating was resubmitted for a further 10 ns until equilibration was reached. The NAMD and VMD software was operated under a Linux operating system.
Results and Discussion
Insights into the Gas-Phase Collapse of Monoclonal Antibodies
Molecular modeling is a useful tool to aid the study of biomolecules in the gas-phase. Although CCSs measured using ESI-TWIMS-MS methods can be compared directly to solved X-ray crystal or NMR structures from the Protein Data Bank (PDB), it is becoming clearer that this is not suitable for all proteins. For example, the conditions used to crystallize proteins can be very different to the conditions used for mass spectrometric analysis. Further, some proteins are inherently flexible or disordered, and may not have a PDB structure with which to compare the measured CCS, and additionally a subset of structures in the PDB consist only of fragments of the full protein in question. In vacuo modeling, therefore, allows us to achieve a glimpse of how such proteins may behave within the gas-phase. Adopting a similar in vacuo MD simulation approach as used by , we also observe a collapse around the hinge region of mAb1 such that the measured CCS of the mAb is substantially less than both the predicted CCS from its crystal structure  and the in vacuo MD simulation (Figure 1a iii).
To understand the role of the hinge region and determine whether this flexible linker was the main attributor to the collapse observed, we released the Fab and Fc regions of mAb1 using Lys-C proteolysis and analyzed the two fragments independently (Figure 1b, c). The CCS values determined by ESI-TWIMS-MS, estimated from the PDB coordinates, and indicated by in vacuo MD simulations are compared for both the Fab and the Fc regions (Figure 1b and c, respectively). The MD simulations indicated that both proteins collapsed to some extent in vacuo compared with their crystal structures, with the Fab region collapsing 11% compared with the 17% collapse of the Fc region. Furthermore, the CCS of the Fab region measured by ESI-TWIMS-MS was closer in agreement to the CCS predicted from its PDB structure than with its equilibrated collapsed MD structure, whereas the CCS of the Fc region measured by ESI-TWIMS-MS was closer to that of its collapsed MD structure than with its crystal structure. Although both of these fragments consist of four Ig domains, the Fc region retains the majority of the hinge region, supporting the notion that the flexible hinge plays a prominent role in the gas-phase collapse observed.
Investigating the Gas-Phase Collapse of Other Nonglobular Proteins
The mass spectrum of (I27)5 indicated a narrow charge state distribution (13+ to 16+ ions) (Figure 2a i). As neither a crystal nor NMR structure was available for (I27)5, a model was built based on the solution structure of the I27 monomer (1TIT, ) and building in the 4–6 residue linker regions (see Supporting Information) (Figure 2a i). This enabled a theoretical CCS for the five-domain construct to be established and formed the starting point for the in vacuo MD simulations (see Supporting Information). The measured CCS for (I27)5 (39.8 nm2)  is lower than both the modeled value predicted for the native structure (63.1 nm2) and the MD simulation end point (49.4 nm2) (Figure 2a ii). Upon in vacuo minimization and equilibration, the protein undergoes compaction, then collapses around the flexible linker regions between the individual subunits, which is reflected by the CCS at the end of the simulation.
ESI-TWIMS-MS analysis of the combined POTRA domains from BamA again produced a mass spectrum with a narrow charge state distribution (12+ to 16+ ions) (Figure 2b i). The ESI-TWIMS-MS data indicate that the CCS (35.1 nm2) obtained for the lowest charge state ions (12+) is closer to the predicted CCS of the in vacuo-equilibrated structure (37.1 nm2) than the CCS value predicted from the crystal structure (5D0O; 45.3nm2) (Figure 2b ii). The MD collapse observed for the POTRA domains is attributable to compaction around the short hinge regions between the individual domains, as well as to an overall collapse with POTRA1 moving towards POTRA5, resulting in a more ring-like structure in the equilibrated molecule (Figure 2b iii).
Together, the data presented for mAb1, (I27)5, and the POTRA domains suggest that non-globular proteins with flexible linker or hinge regions are susceptible to gas-phase collapse. To determine how linear, elongated molecules without any distinct linker regions behave upon transition to the gas phase, we studied the protein SasG (Figure 2c). SasG consists of repeats of two domains (G5 and E), in which the C-terminus of any given domain is directly connected to the N-terminus of the subsequent domain (Figure 2c). Furthermore, SasG (G51–G57) has been shown to form long, elongated fibrillary structures that maintain a highly extended conformation in solution, with no evidence of compaction . The ESI-MS data indicate a native-like conformation, centered on the 20+ and 21+ charge state ions, together with a highly charged, more unfolded conformation (centered on the 48+ charge state ions) (Figure 2c i). The ESI-TWIMS-MS CCS of the compact conformation was measured at 57.7 nm2 (18+ ions). In comparison, the predicted CCS based on the structure obtained from SAXS data  was 137.8 nm2, whereas the in vacuo MD simulations indicate that the protein collapses in the absence of solvent to a species with a CCS of 80.7 nm2 (Figure 2c ii, iii). Thus, an elongated linear protein, with no obvious linker or hinge regions, can also undergo significant compaction in the gas-phase.
Gas-Phase Collapse of Other Biomolecules
The Molecular Masses, Sequences, and Predicated CCSs of the Two 35-Nucleotide RNAs: 2PCV and 2DRB
5′- GGA CCC GCC ACU GCA GAG AUG CAA UCC AGU GGU CC -3′
5′- GGC CCG GGG CGG UUC GAU UCC GUU CUG GGC CAC CA -3′
ESI-TWIMS-MS analysis of the RNAs yielded identical CCSs for all of the corresponding charge state ions (4- to 7-ions; CCS ~10–11 nm2) (Figure 3b). Comparing the TWIMS CCS values with the CCSs estimated from the PDB structures, the TWIMS CCS data were significantly lower than the predicted values for either 2PCV (14.45 nm2) or 2DRB (11.46 nm2). For example, in the case of the 5-ions, TWIMS CCSs of 10.21 nm2 for 2PCV and 10.16 nm2 for 2DRB were measured, thus indicating both RNAs undergo gas-phase collapse. It may be argued that the ESI-MS solution conditions (50 mM aqueous ammonium acetate) differ from the crystallography conditions used for 2DRB (50 mM HEPES, 80 mM ammonium sulfate [n.b. some crystals were detected in the absence of the sulfate ions], 0.2 M tri-lithium citrate, and 20% PEG4000 ) and from the NMR solution conditions used for 2PCV (5 mM cacodylate, 50 mM NaCl, and 0.1 mM EDTA ), and that this may have affected the CCS values obtained from the three biophysical techniques. Although beyond the scope of this study, a systematic analysis of the effects of counter-ions, pH, oligonucleotide length, and sequence on collapse in the gas-phase with parallel MD simulations [46, 50] could be informative to cast more light on the response of RNA molecules in the gas-phase in general. However, here the collapse of both of the RNAs to a similar degree in the gas-phase is evident.
The question remains: can the solution structure of proteins be retained upon transfer into the gas phase? For stable, globular proteins, the answer is undoubtedly “yes,” backed by an impressive number of literature examples. However, here we have presented a small number of protein examples from our 14 years’ experience with ESI-IMS-MS where we have found that the CCS values measured underestimate the physical size of the solution structure and modeled data of the biomolecule under scrutiny. This phenomenon has been reported elsewhere in the case of antibodies [26, 27, 28], but here we have shown by studying isolated regions of an antibody that the Fc region, which contains the majority of the flexible hinge region, is more prone to gas-phase compaction than the Fab region. Other proteins we have identified that undergo gas-phase compaction include those with flexible hinge regions in between more structured domains, such as an engineered concatamer, (I27)5, in addition to the BamA complex with its extended array of POTRA domains. Other non-globular proteins such as SasG, an elongated linear protein, can also exhibit this behavior. Gas-phase compaction is not limited to proteins, as illustrated with reference to two 35-nucleotide RNA molecules of similar mass but different shape. Both RNAs appeared from the ESI-TWIMS-MS data to be significantly smaller than expected from their 3D crystal or solution structures.
We do not intend this report to be perceived as a negative message to the use of ESI-TWIMS-MS. Indeed, the advantages of this technique far outweigh any disadvantages. However, there are certain classes of biomolecules for which due caution should be employed when interpreting the results.
The authors thank the Biotechnology and Biological Sciences Research Council (BBSRC) and Medimmune for funding P.W.A.D. (BB/J011819/1), and the Engineering and Physical Sciences Research Council (EPSRC) and GSK for funding H.C.F. (EP/I501495/1). The Synapt HDMS mass spectrometer was funded by the BBSRC (BB/E012558/1) and the Synapt G2-S donated by Drs. M. Morris and K. Giles, Waters UK Ltd. S.E.R. and A.E.A. also acknowledge funding from the European Research Council under the European Union’s Seventh Framework Programme (FP7/2007-2013; ERC grant 322408). The authors thank Drs. D. R. Brockwell and B. Schiffin for providing (I27)5 and the BAM POTRA domains.
- 14.Bernstein, S.L., Dupuis, N.F., Laz, N.D., Wyttenbach, T., Condron, M.M., Bitan, G., Teplow, B.D., Shea, J., Ruotolo, B.T., Robinson, C.V., Bowers, M.T.: Amyloid-β protein oligomerization and the importance of tetramers and dodecamers in the aetiology of Alzheimer's disease. Nat. Chem. 1, 326 (2009)CrossRefGoogle Scholar
- 26.Campuzano, I.D.G., Larriba, C., Bagal, D., Schnier, P.D.: Ion mobility and mass spectrometry measurements of the humanized IgGk NIST monoclonal antibody. Chapter 4, ACS Symposium Series, Vol. 1202, State-of-the-Art and Emerging Technologies for Therapeutic Monoclonal Antibody Characterization Volume 3. Defining the Next Generation of Analytical and Biophysical Techniques, 75–112 (2015)Google Scholar
- 27.Debaene, F., Boeuf, A., Wagner-Rousset, E., Colas, O., Ayoub, D., Corvaia, N., Van Dorsselaer, A., Beck, A., Cianferani, S.: Innovative native MS methodologies for antibody drug conjugate characterization: high resolution native MS and IM-MS for average DAR and DAR distribution assessment. Anal. Chem. 86, 10674–10683 (2014)CrossRefGoogle Scholar
- 31.Gruska, D.T., Whelan, F., Farrance, O.E., Fung, H.K., Paci, E., Jeffries, C.M., Svergun, D.I., Baldock, C., Baumann, C.G., Brockwell, D.J., Potts, J.R., Clarke, J.: Cooperative folding of intrinsically disordered domains drives assembly of a strong elongated protein. Nat. Commun. 6, 7271 (2015)CrossRefGoogle Scholar
- 35.Brooks, B.R., Bruccoleri, R.E., Olafson, B.D., States, D.J., Swaminathan, S., Karplus, M.: CHARMM: a program for macromolecular energy, minimization, and dynamics calculations. J. Comput. Chem. 4, (1983).Google Scholar
- 41.von Castelmur, E., Marino, M., Svergun, D.I., Kreplak, L., Ucurum-Fotiadis, Z., Konarev, P.V., Urzhumtsev, A., Labeit, D., Labeit, S., Mayans, O.: A regular pattern of Ig super-motifs defines segmental flexibility as the elastic mechanism of the titin chain. Proc. Natl. Acad. Sci. U. S. A. 105, 1186–1191 (2008)CrossRefGoogle Scholar
- 46.Arcella, A., Dreyer, J., Ippoliti, E., Ivani, I., Portella, G., Gabelica, V., Carloni, P., Orozco, M.: Structure and dynamics of oligonucleotides in the gas phase. Angew. Chem. Int. Ed. Engl. 54, 467–471 (2015)Google Scholar
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.