Introduction

Since the determination of the three-dimensional structure in 1957 [1], myoglobin (Mb) has been extensively studied and is commonly used as a protein model to better understand the mechanisms involved in protein folding [25]. In particular, Mb functionality (and hence the tertiary structure) results from the presence of a prosthetic group, Fe protophorphyrin IX, which reversibly binds molecular oxygen for transportation and delivery to muscle tissue [6]. The heme-cavity interactions play an important role during Mb folding via transitions from native (N) to molten globule (MG) to acid-unfolded (UA) forms [7, 8]. Although the Mb tertiary structure in the N state is generally well accepted, during the unfolding process there is no defined tertiary structure in the MG state [9] before it further unfolds to the rod-like, completely denatured state [7].

The multiple kinetic intermediate states of Mb (and other globular proteins) have triggered multiple experimental approaches to better understand the folding process, including optical spectroscopy (e.g., circular dichroism and time-resolved fluorescence and infra-red spectroscopy), hydrogen-deuterium exchange combined with nuclear magnetic resonance and ion mobility spectrometry (IMS), electron capture dissociation and hydrogen-deuterium exchange (HDX) combined with mass spectrometry [1017]. In particular, ESI-MS experiments have shown non-cooperative unfolding for Mb, which involves multiple protein conformations [14, 18, 19]. Moreover, changes in the H/D exchange rate for holo and apo charge states have also been identified as a function of time, providing general characteristics pertaining to structural features [20]. IMS has been widely used to study the conformational properties of globular proteins, and when complemented with molecular dynamics, it provides a powerful tool for structural biology [2135]. Recently, HDX experiments in a traveling wave ion guide at short labeling times (0.1–10 ms) enabled the interrogation of apo Mb conformers [36].

With the recent development of trapped ion mobility spectrometry (TIMS) [37, 38], higher mobility separation [39, 40] and the possibility to interrogate and simultaneously measure the ion-neutral collision cross section (CCS) as a function of the time after the molecular ion formation (gas-phase time after desolvation or trapped time in the TIMS) has permitted kinetic studies of molecular ions in the millisecond to second time scale as well as the influence of the collision partner on the conformational space accessible to the molecular ions in the gas phase [41, 42]. In the current study, the unique potential of TIMS to hold ions while interacting with bath gas molecules (“TIMS” thermostat) is utilized to study the folding motifs of holo- and apoMb by measuring the CCS and HDX under the same folding conditions (e.g., solvent conditions and time after molecular ion formation). A novel candidate structure generation algorithm was applied for a more efficient sampling of the conformational space of proteins and the generation of candidate structures for each IMS bands with candidate filtering using the HDX data [43]. A folding pathway is proposed for the native (N) to molten globule (MG) to acid-unfolded (UA) forms including (Xn) kinetic intermediates.

Experimental

Materials and Reagents

Equine holoMb was purchased from Sigma-Aldrich (St. Louis, MO, USA) and used as received. All solvents and ammonium acetate salts used in these studies were analytical grade or better and purchased from Fisher Scientific (Pittsburg, PA, USA). Low concentration tune mix (G2421A; Agilent Technologies, Santa Clara, CA, USA) was used as a mobility calibration standard and used as received. HoloMb and apoMb samples were prepared by diluting an equine holomyoglobin stock, 100 μM, in 10 mM ammonium acetate (pH 6 and pH 4.5) to a final concentration of 1 μM. The pH was adjusted using acetic acid. Deuterated solutions of Mb were prepared using analytical grade D2O (Sigma-Aldrich) in 10 mM ammonium acetate (pH 6 and pH 4.5) to a final concentration of 1 μM for HDX back exchange experiments.

HDX-TIMS-MS Analysis

Details regarding the TIMS operation and specifics compared with traditional IMS can be found elsewhere [3739, 41, 42]. Briefly, TIMS mobility separation is based on holding the ions stationary using an electric field against a moving gas. The separation in a TIMS device can be described by the center of the mass frame using the same principles as in a conventional IMS drift tube [44]. Since mobility separation is related to the number of ion-neutral collisions (or drift time in traditional drift tube cells), the mobility separation in a TIMS device depends on the bath gas drift velocity, ion confinement, and ion elution parameters. Using the same principle, HDX-TIMS-MS provides the possibility to measure the rate of back exchange by varying the time the molecular ions spend in the TIMS cell. The mobility, K, of an ion in a TIMS cell is described by:

$$ K=\frac{v_g}{E}=\frac{A}{V_{elution}-{V}_{base}} $$
(1)

where vg, E, Velution, and Vbase are the velocity of the gas, applied electric field, elution, and base voltages, respectively. The constant A can be determined using calibration standards of known reduced mobilities (e.g., from low concentration tune mix: m/z = 322 Ko = 1.376 cm2V−1 s−1, m/z = 622 Ko = 1.013 cm2V−1 s−1, and m/z = 922 Ko = 0.835 cm2V−1 s−1). In TIMS operation, multiple geometric isomers/conformers are trapped simultaneously at different E values resulting from a voltage gradient applied across the TIMS tunnel. After thermalization, geometric isomers/conformers are eluted by decreasing the electric field in stepwise decrements (referred to as the “ramp”). Each isomer/conformer eluting from the TIMS cell can be described by a characteristic voltage (i.e., Velution – Vbase). Eluted ions are then mass-analyzed and detected by a maXis impact Q-ToF mass spectrometer (Bruker Daltonics Inc., Billerica, MA, USA).

In a TIMS device, the total analysis time can be described as:

$$ \mathrm{To}\mathrm{tal}\ \mathrm{I}\mathrm{M}\mathrm{S}\ \mathrm{time} = {\mathrm{T}}_{\mathrm{trap}} + \left({\mathrm{V}}_{\mathrm{elut}\mathrm{ion}}/{\mathrm{V}}_{\mathrm{ramp}}\right)*{\mathrm{T}}_{\mathrm{ramp}} + \mathrm{T}\mathrm{o}\mathrm{F} = {\mathrm{T}}_{\mathrm{o}} + \left({\mathrm{V}}_{\mathrm{elut}}/{\mathrm{V}}_{\mathrm{ramp}}\right)*{\mathrm{T}}_{\mathrm{ramp}} $$
(2)

where, Ttrap is the thermalization/trapping time, ToF is the time after the mobility separation, and Vramp and Tramp are the voltage range and time required to vary the electric field, respectively. The elution voltage can be experimentally determined by varying the ramp time for a constant ramp voltage. This procedure also determines the time ions spend outside the separation region To (e.g., ion trapping and time-of-flight). The same rf (880 kHz and 200 Vp-p) was applied to all electrodes including the entrance funnel, the mobility separating section, and the exit funnel. An electrospray ionization source (ESI Apollo II design, Bruker Daltonics, Inc., MA, USA) was used for all the analyses. TIMS separation was performed using nitrogen as a bath gas at ca. 300 K, and the gas flow velocity was controlled by the pressure difference between entrance funnel P1 = 2.6 mbar and the exit funnel P2 = 1.0 mbar. P1 and P2 values were held constant for all experiments. The TIMS cell was operated using a fill/trap/ramp/wait sequence of 1–10/1–10/5–1950/50 ms. Average mobility resolution at 10/10/500/50 was 160–190.

Reduced mobility values (K0) were correlated with CCS (Ω) using the equation:

$$ \varOmega =\frac{{\left(18\pi \right)}^{1/2}}{16}\frac{ze}{{\left({k}_BT\right)}^{1/2}}{\left[\frac{1}{m_{{}_I}}+\frac{1}{m_b}\right]}^{1/2}\frac{1}{K_0}\frac{1}{N^{*}} $$
(3)

where z is the charge of the ion, kB is the Boltzmann constant, N* is the number density and mI and mb refer to the masses of the ion and bath gas, respectively [44].

Theoretical

A novel candidate structure generation algorithm was used for a more efficient sampling of the conformational space of proteins and the generation of candidate structures for each IMS band with candidate filtering using the HDX data (more details in [43]). The 1YMB protein data bank entry for Mb was utilized as the starting structure with the presence or absence of the heme, for the holo and apo theoretical calculations, respectively [45]. Briefly, molecular dynamics simulations were used to reproduce the experimental conditions (e.g., “TIMS” thermostat) and to generate the identity vectors that define Mb conformational space. This approach is similar to that previously described in reference [46], with the main characteristic that the initial search targets the generation of the identity vectors, followed by charged assignment (based on the accessible surface area), and energy minimization. Once the candidate structures are identified for each IMS band (a minimum of four CM was considered per IMS band [43]), charge assignment is performed by scoring the accessible surface area based on the basicity/acidity of the chemical groups. That is, charge assignment is based on the score of the amino acid residues from each tri-dimensional structure [47, 48]. For example, solvent accessibility and the pka of the acidic (aspartic acid and glutamic acid) and basic (arginine, histidine, and lysine) amino acid residues are primarily utilized to assign the protonation and deprotonation sites. It is known that charge localization can influence electrostatic interactions and, therefore, the conformational dynamics of molecular ions [49, 50]. To account for the charge state influence on the CCS, energy optimization steps are performed after the charge site assignments for all candidate structures. All simulations were performed using an NVT thermostat using AMBER03 force field in YASARA software (http://www.yasara.org). The molecular dynamic simulations yielded identity vectors that cover the transition from the N to the denatured UA states.

Theoretical CCSHe were calculated using the MOBCAL software for helium [51]. To account for the reported 15% underestimation of CCS that occurs when using the PA algorithm [17, 52], all CCS calculations were scaled accordingly. Correlation between CCSHe and CCSN2 was performed using previously reported CCS values for denatured and native proteins [24]. That is, CCS(N2) = 0.9916*CCS(He) + 524.66 (native) and CCS(N2) = 1.174 CCS(He) + 466.34 (denatured) equations were used for holoMb and apoMb conversion, respectively.

Results and Discussion

Confirmation of the presence of the holo and apo forms of Mb in the starting solution was performed using mass spectrometry (see Supplemental Figure 1). An acid-induced transition from holoMb to apoMb was used to generate both forms, which was easily confirmed by the mass shift related to the loss of the prosthetic group in apoMb. The acid-induced transition is also accompanied by protein unfolding and yields a characteristic charge state distributions (in good agreement with previous experiments [14, 18]). In particular, the loss of the heme, which influences overall protein flexibility, accompanied by the deviation from physiological conditions results in a large distribution of charge states for apoMb (e.g., [M + 9H]+9–[M + 19H]+19). Changes in the protein flexibility as a function of the charge state for the holo and apoMb were easily detected by TIMS-MS (see Figure 1 for IMS projections of the holo [M + 8H]+8–[M + 9H]+9 and apo [M + 9H]+9–[M + 19H]+19 forms of Mb). Previous IMS studies of Mb reported IMS bands with a similar trend [19, 53, 54]; however, a larger number of mobility bands was observed as a result of the higher mobility resolution of the TIMS analyzer. The presence of the heme, in particular the mobility band corresponding to the [M + 8H]+8 of the holoMb, illustrates the structural rigidity that can inhibit the adoption of multiple conformers. To simplify the discussion, the peak width of the holoMb [M + 8H]+8 was used as the criterion to propose the minimum width of an IMS band corresponding to a single resolved conformation, or a population of conformations indistinguishable by IMS. The experimental data also illustrates that larger CCS values were observed for ions of higher charge state (see Table 1). In addition, inspection of Figure 1 shows that the charge state has a large influence on the protein stability. For example, as the charge state increases, the holoMb changes from a single to a bimodal IMS distribution (in good agreement with [53]). In the case of the apoMb, the loss of the heme destabilizes the interaction between the A, G, and H helices and makes the charge state effect more pronounced.

Figure 1
figure 1

Experimental TIMS-MS data for (a) apo and (b) holo myoglobin. Conformations for each charge state have been labeled. The number of conformation bands have been identified using mobility band criteria of the [M + 8H]+8 holo form of the protein

Table 1 Holo and apo Mb Experimental and Theoretical CCS Radius and Gyration and number of H/D Exchanges Observed per Charge State

IMS data reflect changes in the protein folding (average of size) and can provide an indirect measurement of the solution states; however, when complemented with theoretical calculations, a more detailed description can be achieved (e.g., discrimination between a protein expansion preserving a folded state or an extended state is possible). That is, changes in the CCS values can be interpreted in a similar fashion to the radius of gyration in order to describe a change in a protein conformation, typically an increase in size [55]. Deviations in the radius of gyration for N and MG states of the holoMb (19 ± 1 and 23 ± 2, respectively) are in good correspondence with the small CCS variation for the [M + 8H]+8 and [M + 9H]+9 holoMb conformers; this small change in CCS is in good agreement with a previous observation of similar folding motifs for the N and MG states [56]. In the case of apoMb, larger differences in CCS from the [M + 9H]+9 suggest the presence of several intermediate conformations of varying degrees of unfolding that occur in the N → MG → UA transition pathway. Here, the increase in CCS observed across the apoMb charge state distributions, particularly [M + 10H]+10–[M + 14H]+14 suggests Xn intermediate conformations, where Xn corresponds to intermediate unfolded states that occur in the process of Mb unfolding prior to reaching the rod-like chain characteristic of the UA state. Here the Xn notation refers to conformations that are not characteristic of previously identified transitions (i.e., N → MG → UA) and are, therefore, intermediates. Previous studies have identified specific structure intermediates [7, 57, 58]. Our experimental observations suggest that the process of Mb unfolding occurs via several transitions, N → MG → Xn → UA, where n refers to several conformational transitions as illustrated by the apoMb CCS distribution. That is, in the absence of the heme, it is expected that Mb will exist in several intermediate denatured states before complete unfolding to the UA form. Conformers corresponding to lowest charge state of the apoMb form, [M + 9H]+9, are then most likely attributed to the MG state or intermediate unfolded states that can occur via the MG → UA transition. The progressive increase in CCS with ion charge state suggests that unfolding is the result of the loss of the stabilizing intramolecular protein interactions via the absence of the porphyrin heme that have been reported as essential to both maintaining and allowing refolding of Mb [2]. CCS increase with charge state is also expected since it may/can be accompanied by stronger Coulombic repulsion from the protonated residue side chains. For higher charge states of apoMb, [M + 14H]+14–[M + 19H]+19, the number of IMS bands decreases, suggesting that further protein unfolding is not energetically favorable and/or the unfolding intermediates adopt comparable conformation, resulting in similar CCS values. This trend may be related to a point of stability in the Xn → UA transition; that is, the protonation sites are distant and minimize the Coulombic repulsion that was the driving force resulting in an increase in CCS. UA structures have been reported when harsher conditions are employed, but the corresponding CCS is much larger (>4300 Å2) than experimentally obtained in the current study [7].

H/D back exchange experiments were conducted under the same conditions as the CCS measurements. The number of IMS bands did not show any variation over time; however, in a 5–2000 ms time frame, the large number of collisions in the TIMS cell permitted the study of the H/D back exchange rate (Figure 2 and Table 1). No major changes were observed for the holoMb and apoMb [M + 9H]+9 HDX distribution (rate and number of exchanges), which suggests that despite the absence of the heme, no major changes in the protein conformation occurs; that is, the porphyrin cavity is preserved. There is, however, a general expansion of the protein following the loss of the porphyrin heme and/or a loosening of the porphyrin heme cavity, confirmed by the increase in CCS. This result can be interpreted as the transition from holoMb to apoMb resulting in the adoption of the MG conformation from the N state. That is, this change in conformation does not yield a difference in the extent of the unfolding [9] and, therefore, should not be accompanied by additional hydrogen-deuterium exchange. This information was used in the candidate assignment of the [M + 9H]+9 apoMb potential candidates (see Supplemental Material, which includes both compact and partially unfolded conformations).

Figure 2
figure 2

Rate of deuterium back exchange for the [M + 9H]+9 charge state of the apo and holo myoglobin (inset) and [M + 10H]+10–[M + 19H]+19 charge states of apo myoglobin. HDX experiments were conducted on the order of 5 ms to 2 s time frame

A comparison in the HDX exchange observed for the higher apoMb charge states, [M + 10H]+10–[M + 19H]+19 (Figure 2), yields similar rates, suggesting a commonality in the unfolding pathway. The increase in the number of exchanges for each of the charge states can be related to an unfolding of the protein exposing additional inner cavity amino acid residues.

Candidate conformers have been proposed for all IMS bands (see Figure 3 for the most stable conformation for each identified IMS band). Inspection of the candidate structures pertaining to the protein unfolding observed in Mb suggest that unfolding occurs via the N → MG → Xn transition. That is, several intermediates (Xn) are observed in the [M + 10H]+10 – [M + 19H]+19 apoMb charge states, which lack a native tertiary structure as a result from the loss of the interior cavity structure (shown in red in Figure 3) with further unfolding accompanying each additional protonation. Theoretical CCS calculations also suggest that the experimental conditions (e.g., charge state distribution) utilized here are not harsh enough to populate the UA state (e.g., pH 2 would yield unfolding to the rod-like UA state). Close similarity between the candidate structures for the [M + 14H]+14 –[M + 19H]+19 apoMb charge states suggests that intramolecular Coulombic repulsion aides in stabilizing the conformational motifs presented (Figure 4). This can be corroborated by the pairwise RMSD comparisons of conformations that occur initially via the N → MG → Xn transition (i.e., [Mholo + 9H]+9 → [MApo + 9H]+9 → [MApo + 10H]+10]). For example, the increase in RMSD observed for these transitions in comparison to RMSD within a charge state infers larger differences in the three-dimensional structure that is accompanied by the loss of important protein stabilizing interactions. This can be further corroborated by a comparison of the secondary structure composition of the conformations; an increase in charge state results in an increase in random coil contribution in place of α-helix content (see Supplemental Information for more details).

Figure 3
figure 3

Comparison of the most stable candidate structures corresponding to each identified conformation from Figure 1 for both the holo and apo forms of myoglobin. Heme cavity is depicted in red

Figure 4
figure 4

Pathway of protein unfolding following the transition from holo to apomyoglobin. Initial loss of the heme results in a loosening of the heme cavity, while additional unfolding requires the loss of essential interactions between the alpha helices. The alpha helices have been depicted in purple (A, residues 4–19), blue (B, residues 21–37), cyan (C, residues 40–44), green (D, residues 52–59), light green (E, residues 61–78), yellow (F, residues 83–97), orange (G, residues 101–120), and red (H, residues 127–151)

Experimental and theoretical results did not result in compact denatured structures as previously reported for Mb intermediate states [7]. On the contrary, our results suggest that only unfolded, elongated intermediates correspond to the [M + 10H]+10–[M + 19H]+19 apoMb states with varying degrees of unfolding related to the loss of the interactions between helices A and G that are important to maintaining the globular shape [59]. This loss takes place upon the MG → Xn transition for the [M + 10H]+10 apoMb charge state. Helices A, G, and H become stable early in the protein folding process [60, 61] and the loss of the interaction of the helices may reflect the existence of multiple tertiary structures (or structural diversity by the lack of constraints).

Conclusion

The potential of simultaneous interrogation of protein conformations using IMS combined with HDX has been applied to the study of Mb kinetic intermediates. In particular, the unique capability of TIMS to sample the molecular ions after their formation over a large time scale (1–2000 ms) and simultaneously acquire CCS and HDX data permits a better screening of candidate structures. This translates into the possibility to postulate or confirm folding pathways and intermediates during Mb protein unfoldings: N → MG → Xn → UA. That is, results confirm the existence of intermediate denatured states, Xn, which occur prior to completely unfolding to UA. It was shown that the Xn intermediates are a consequence of the destabilization of the interactions between A and G helices. The ability to simultaneously study multiple conformation (using HDX and IMS) using a TIMS analyzer offers a unique advantage for time-resolved conformational studies of biomolecules.