# Collidoscope: An Improved Tool for Computing Collisional Cross-Sections with the Trajectory Method

## Abstract

Ion mobility-mass spectrometry (IM-MS) can be a powerful tool for determining structural information about ions in the gas phase, from small covalent analytes to large, native-like or denatured proteins and complexes. For large biomolecular ions, which may have a wide variety of possible gas-phase conformations and multiple charge sites, quantitative, physically explicit modeling of collisional cross sections (CCSs) for comparison to IMS data can be challenging and time-consuming. We present a “trajectory method” (TM) based CCS calculator, named “Collidoscope,” which utilizes parallel processing and optimized trajectory sampling, and implements both He and N_{2} as collision gas options. Also included is a charge-placement algorithm for determining probable charge site configurations for protonated protein ions given an input geometry in pdb file format. Results from Collidoscope are compared with those from the current state-of-the-art CCS simulation suite, IMoS. Collidoscope CCSs are within 4% of IMoS values for ions with masses from ~18 Da to ~800 kDa. Collidoscope CCSs using X-ray crystal geometries are typically within a few percent of IM-MS experimental values for ions with mass up to ~3.5 kDa (melittin), and discrepancies for larger ions up to ~800 kDa (GroEL) are attributed in large part to changes in ion structure during and after the electrospray process. Due to its physically explicit modeling of scattering, computational efficiency, and accuracy, Collidoscope can be a valuable tool for IM-MS research, especially for large biomolecular ions.

## Keywords

Ion mobility Native mass spectrometry Native IM-MS Collisional cross-section Computational theory Noncovalent complexes Trajectory method## Introduction

Native ion mobility-mass spectrometry (IM-MS) can provide a wealth of information about the size, shape, charge state, mass, stoichiometry, and/or topology of ionized, native-like biomolecules and biomolecular complexes, in addition to other properties [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30]. IM spectrometry separates ions upon passage through a buffer gas (typically helium, argon, or nitrogen) under the influence of an electric field as a result of the size- and conformation-dependent drag force induced by repeated collisions between the ion and buffer gas. In native IM-MS, ion mobility values in the low-field limit are typically measured using drift tube, traveling wave ion guide, or, more recently, trapped ion mobility instrumentation before subse-quent mass spectrometry analysis [10, 31]. Structural informa-tion is typically obtained by converting measured drift times or ion mobility values to collisional cross-sections (CCSs) and comparing these to other experimental data or CCSs computed for model structures. However, accurately calculating CCSs with explicit computation of the scattering process can be very challenging and time-consuming due to contributions of long-range polarization interactions, orbiting, multiple scattering, and diffuse scattering, among other interactions. Due to the computational difficulty of modeling CCSs in a physically explicit manner, ions with masses greater than ~10 kDa are commonly modeled using non-explicit methods (the “projection approximation”, PA [32, 33, 34]; the “projected superposition approximation”, PSA [35]; and the “local collision probability approximation”, LCPA [36]), or using hard-sphere collisions (the “exact” and “diffuse hard spheres scattering” methods, EHSS [33, 37] and DHSS [9, 38, 39]). Although physically explicit methods (the “trajectory method”, TM [33], and the “diffuse trajectory method”, DTM [9, 38, 39]) can be highly accurate, they can be prohibitively slow for large ions as currently implemented.

Features Present (+) or Absent (−) in Various CCS Calculation Tools. Methods in Bold Type use Two-Dimensional Projections rather than Explicit Three-Dimensional Scattering for CCS Computations. Asterisks (*) Indicate that a Geometry-Dependent Correction Factor is used to Partially Account for 3-Dimensional Scattering

Computational suite | Method | Momentum transfer type | Explicit trajectories | Temperature/charge state-dependent | 3D geometry-dependent | Collision gas |
---|---|---|---|---|---|---|

| − | − | − / − | − | He | |

EHSS | Elastic | + | − / − | + | He | |

TM | Elastic | + | + / + | + | He | |

WebPSA [35] | | − | − | + / + | + | He/N |

[36] | LCPA | − | − | + / + | + | He/N |

| − | − | − / − | – (+) | He | |

DTM | Elastic and inelastic | + | + / + | + | He/N | |

DHSS | Elastic and inelastic | + | + / − | + | He/N | |

Collidoscope | TM | Elastic | + | + / + | + | He/N |

“Ideal” | Exact | + | + / + | + | all |

The TM evaluates an ion’s CCS by approximating the solution to the non-equilibrium Boltzmann Transport Equation [40, 41] as the momentum transfer integral *Ω* _{ avg } ^{(1,1)} . A common choice of a potential energy surface for the TM is the sum of Lennard-Jones 6-12 (L-J 6-12) potential energy surface of each of the atoms in the ion for the buffer gas of interest, and, at added computational expense, an ion-induced dipole (“*C* _{ 4 }”) term [33, 39]. In principle, CCSs from the TM account for long-range scattering, multiple scattering, the ion’s charge, and temperature effects. However, final computed CCSs for large biomolecules and assemblies using elastic collisions are typically about 5% higher than experimentally measured values [3]. This is believed to result in part from gas-phase compaction that occurs due to self-solvation of the biomolecular ion during the electrospray ionization process, in addition to any inherent errors in the approximation method, or in converting experimental drift times to CCSs [30].

Here, we introduce a new open-source program, Collidoscope, which approximates CCSs via the TM, using parallelized code to optimize performance. Collidoscope also contains an option to compute CCSs with N_{2} as the buffer gas, using optimized L-J 6-12 parameters that result in CCSs closely matching experimental values. The essential features of the computational strategy and optimization procedure used for programming Collidoscope are described below, and results from Collidoscope are compared with experimental and other computational data. The utility of Collidoscope for interpreting structures of very large native-like biomolecular ions (with masses approaching ~1 MDa) is also discussed.

## Theory

### Physical Model

*Ω*

_{ avg }(

*T*), which arises in the solution to the Boltzmann Transport Equation. Collisions are simulated by integrating trajectories along the total ion-collision particle potential energy surface, approximated as the sum of L-J 6-12 potentials centered at each atom of the ion plus a “

*C*

_{ 4 }” term to account for ion-induced dipole effects. According to classical scattering theory,

*Ω*

_{ avg }(

*T*) can be approximated in the low-field limit as the binary collision integral

*g*is the initial speed of the collision particle relative to the ion;

*b*is the impact parameter (defined as the initial distance of the approaching particle to the collision axis, see Figure 1);

*θ*,

*φ*, and

*γ*are the angles that define the relative orientation of the ion and collision particle;

*k*

_{ B }is the Boltzmann constant;

*T*is the absolute temperature;

*χ*is the scattering angle; and

*μ*is the reduced mass of the system [33, 40, 41]. (Note that the

*g*

^{5}term reflects laboratory-frame orientational averaging of the particle and ion velocities, with probabilities given by Maxwell-Boltzmann distributions, as well as momentum transfer.)

*θ*and

*φ*define the direction of the collision axis, whereas

*γ*describes the ion’s relative rotation about that axis. Since the

*γ*and

*b*parameters together describe a plane perpendicular to the collision axis, the integral over

*bdbdγ*is equivalent to an integral over the differential area,

*dA*. In addition, the double integral over

*sinφdφdθ*can be written as an integral over differential solid angle,

*dω*. Making the above substitutions results in the following equation:

*dA*is the differential area of a plane representing the origin of the collision particles before scattering (the “plane of origin”, see Figure 1).

In Equation 2, *x* and *y* describe the initial position of the collision particle in the plane of origin, whereas *θ* and *φ* describe the orientation of the plane of origin relative to the ion, and the denominator is a normalization factor. Collidoscope uses judicious choices of initial parameters to achieve uniform sampling of *dω* and *dA* as well as a Maxwell-Boltzmann-weighted sampling of *g*. These sampling choices, as well as the choice of potential surface and trajectory integration method, are described in greater detail below.

### Sampling of Relative Orientation

Orientational averaging is achieved using a user-defined set of “vantage points” that determine a set of collision axes and corresponding planes of origin for the scattering simulations. To accurately approximate the integral over *dω* in Equation 2, these vantage points should ideally be sampled uniformly over the sphere of possible ion orientations with high density; otherwise, a poor estimate of *Ω* _{ avg }(*T*) may be obtained. Vertices of regular or quasi-regular polyhedra are thus reasonable choices for the sets of vantage points.

### Sampling of Impact Parameter

Within each plane of origin, trajectories are initiated from points arranged in a uniformly-spaced square grid. Thus, *dA* in Equation 2 is approximated as the square of the distance between neighboring grid points for each trajectory. Scattering trajectories are computed only for the grid points for which both plane-of-origin coordinates (*x* and *y* in Figure 1) are smaller than the furthest distance of an atom in the ion to the center of mass, plus an additional “grid buffer” distance. Although the distance from the collision axis for which collision particles scatter strongly depends on the size and shape of the ion, use of a sufficiently large grid buffer distance ensures that significantly scattered trajectories are well-sampled at minimal computational expense.

For additional computational efficiency, Collidoscope performs a calculation to assess whether a trajectory within the region described above should be included in the computation of *Ω* _{ avg } ^{(1,1)} . The potential energy of a collision particle at its closest approach to any atom in the ion in the absence of any scattering is calculated. Scattering trajectories are then computed only if this potential energy is larger than a specified cut-off value because such trajectories are likely to scatter significantly. Effectively, for each plane of origin, the trajectories included in the CCS calculation are initiated from a region that broadly resembles an expanded silhouette of the ion in that plane. This procedure for choosing trajectories to run results in more efficient computation. The computation time as a function of the potential energy cut-off is shown in Supplementary Figure S1 in the Supporting Information.

### Sampling of Relative Velocities

*Ω*

_{ avg }

^{(1,1)}obtained for each collision particle kinetic energy is weighted by a probability factor to obtain

*Ω*

_{ avg }

^{(1,1)}(

*T*) (see Equation 2). Each initial kinetic energy state’s contribution is weighted by a corresponding analytic integral of the probability density function from Equation 2, \( {g}^5* \exp \left(\frac{-\mu {g}^2}{2{k}_B T}\right) \), over a range of energies containing that initial kinetic energy state (see Figure 2). The number of initial kinetic energy states and the minimum and maximum initial kinetic energies used can be input by the user. For each sampled initial kinetic energy, the bounds of integration, which determine the weighting, are spaced equally in kinetic energy, with the exception that the lowest and highest bounds are 0 and infinity, respectively. This assures that

*Ω*

_{ avg }

^{(1,1)}(

*T*) is not affected by energy state sampling in the case that it is strictly independent of temperature. (Indeed, temperature dependence of CCSs appears to be very weak for ions with a fixed conformation containing at least a few dozen atoms over the range ~200–400 K; see Figure 2 and Supplementary Figure S2). However, even the weak dependence of cross-section on temperature necessitates more densely sampled parameters for temperatures far from room temperature.

### Model Potential Energy Surface

*C*

_{ 4 }” potential attributable to the polarizability of the collision particle and the localized charges on the ion:

*ε*

_{ i }and

*r*

_{ min,i }are the L-J 6-12 parameters for well-depth and radius at minimum potential between atom

*i*and the collision particle, respectively, ε

_{0}is the permittivity of free space,

*α*is the effective isotropic polarizability volume of the collision particle,

*e*is the elementary charge,

*z*

_{ i }is the charge state of each atom, and

*r*

_{ x,i },

*r*

_{ y,i },

*r*

_{ z,i }is the vector between atom

*i*and the collision particle (with length

*r*

_{ i }). Collisions are modeled elastically, and the positions of the atoms and charges in the ions are held fixed throughout the scattering computations.

Ion-induced dipole interactions with He as the collision particle are modeled using its static electric dipolar polarizability [43]. N_{2} is also modeled as a quasi-spherical collision particle, using an effective polarizability that is equal to the arithmetic mean of its principal polarizabilities determined by ab initio computations [44]. Based on its moment of inertia, over the duration of a typical simulated trajectory (tens of ps), a molecule of N_{2} thermalized at room temperature would typically rotate about each axis a few dozen times. Thus, this quasi-spherical approximation assumes that the molecule is rotating rapidly enough that no significant rotational alignment effects occur due to collision particle polarization.

### Integration of Trajectories

Integration of trajectories in time in Collidoscope is presently performed using Euler (i.e., first-order Runge-Kutta) integration. Therefore, in order to minimize calculation inaccuracy due to trajectory integration errors, multiple checks are performed to determine the numerical validity of a trajectory. The total energy of the system must be conserved to within a threshold over the course of a trajectory for it to be included in computing the CCS. To enforce this, the center-of-mass-frame kinetic and potential energy of the collision particle are calculated at the beginning and end of a trial trajectory. If the total energy changes by more than the allowed amount, the trajectory is recalculated with a shorter time step, and energy conservation is checked again. Using a smaller time step promotes stricter energy conservation, so recalculated trajectories converge toward exact energy conservation as the time step is reduced.

If a particle loses energy due to integration error, it can occasionally become trapped near the ion by the attractive portion of the potential energy surface. Therefore, any trajectories exceeding a maximum number of steps are recalculated with a shorter time step. If a trajectory either does not conserve energy or exceeds the maximum number of time steps for five consecutive attempts, the trajectory is considered to have failed and is omitted from the CCS calculation.

### Parallelization

Significantly reduced computation time is achieved in Collidoscope through parallelization of trajectory simulations. At present, many common CPUs have 12 parallel hardware threads on which calculations can be performed. Each thread can calculate independent trajectories, decreasing the wall time when using 12 threads to ~9% of the wall time needed when using one thread at a time. By default, Collidoscope automatically determines the number of hardware threads available on a computer and maximizes the number of trajectories that are run simultaneously. A version of Collidoscope that utilizes GPU parallelism will be released soon.

### Ions Modeled

CCSs for molecules and ions ranging in mass from 12 Da to ~800 kDa with charge states between 0 and 73+ were calculated using Collidoscope and IMoS TM with He or N_{2} as the collision gas, and the MOBCAL TM, PA, and EHSS methods. Collidoscope and MOBCAL calculations were performed using the Linux OS, with Collidoscope parallelized over 12 threads, on an Intel X5650 CPU. IMoS computations were performed using the Windows OS, parallelized over seven threads on an Intel Core i7-4790 CPU. All computed values are reported using the notation CCS_{He} and CCS_{N2}, by analogy with the notation suggested by Barran and co-workers [45]. Sources for the atomic coordinates used in the computations as well as the charge states used are described in further detail in Supplementary Table S1 in the Supporting Information. Charge for small, non-protein ions was equally distributed among all atoms for CCS computations. For protein ions, charge configurations were generated using the charge placement algorithm described below.

### Charge Placement Algorithm for Protonated Protein Ions

Extending the charge placement algorithms of Williams [46, 47], Grandori [48], and Konermann [49], Collidoscope uses a Metropolis-Hastings-like [50, 51] charge placement algorithm for protonated protein ions based on the ion’s input atomic coordinates, a user-defined charge state, the calculated point-charge electrostatic repulsion energy of a given charge configuration, and the total intrinsic proton affinity of the ion. The N-terminal amine group as well as each residue is considered to be a possible charge site throughout the charge placement computation, with one possible charge per residue. Initially, the user-defined number of charges are placed in a random configuration on the defined residues, and a total apparent proton affinity, *PA* _{ app }, is calculated as the sum of the intrinsic proton affinities of all charged sites minus the electrostatic repulsion energy of the point-charge sites. This procedure is repeated 1000 times, and the charge configuration with the greatest *PA* _{ app } is selected for Metropolis-Hastings optimization. One of the charges from this configuration is picked at random and moved to a new, random location not already occupied by a charge. The updated *PA* _{ app } of the ion is calculated, and the new charge configuration is accepted if this value is greater than before the charge was moved. If the updated value of *PA* _{ app } is instead lower, it is accepted with a probability \( \exp \left(-\frac{\varDelta P{A}_{app}}{k_B T}\right) \). This assures that the algorithm can escape local energy minima to find a low-energy charge configuration. This procedure is repeated until the standard deviation of *PA* _{ app } is less than 6 *k* _{ B } *T* for the last 25% of the iterations and the average value of *PA* _{ app } increases by less than 0.1 *k* _{ B } *T* between the second-to-last and last 25% of the iterations. At this point, the optimization is considered to have converged, and the charge configuration with the highest *PA* _{ app } among all the iterations is used for CCS calculations. Intrinsic proton affinities used in Collidoscope are identical to those used by Konermann and co-workers [49], and a relative permittivity of 2.5 is used in calculating electrostatic repulsion for all proteins other than GroEL, for which a value of 4 is used. For the proteins investigated here, the main effect of the charge placement algorithm is to spread the charges out among basic sites near the surface of the ion. (Examples of convergence of the charge placement for GroEL^{73+} are illustrated in Supplementary Figure S3.)

## Results

### Default Parameter Optimization

Extensive tests were performed to optimize default values for parameters that minimize computation time while preserving accuracy of the resulting CCSs. Particularly important to the optimization process were the grid buffer distance and the impact parameter spacing. Tests were performed on species spanning four orders of magnitude in mass (water, ondansetron, melittin, and LF_{N}), and results were compared to determine if the parameter should be scaled as a function of the mass or charge state of the ion. Calculated CCSs tend asymptotically toward a “dense-sampling limit” as parameters are sampled more densely (impact parameter spacing, initial kinetic energies, time step, and vantage points) or as the computations are more extensive (grid buffer distance, initial collision particle-ion separation, and potential energy threshold). Each parameter was optimized such that calculations with the optimized value generally produced CCSs within 1% of the dense-sampling limit for that parameter. These optimization tests are described in further detail below and in the Supporting Information (Supplementary Figures S1 and S4–S13).

Impact parameter spacing in Collidoscope is optimized to scale with the number of atoms in the ion, such that the CCS of an ion with 100 atoms is computed with an impact parameter spacing equal to the van der Waals radius of a carbon atom (1.7 Å; see Supplementary Figure S8) [52]. The size of each square in the plane-of-origin grid is scaled by the number of atoms, so that the number of trajectories calculated scales approximately linear with the number of atoms in the ion, for ions of similar density.

The grid buffer distance is optimized to be 15 Å. Smaller grid buffer distances result in significant deviations for the CCSs of ions with masses < ~1 kDa, and a grid buffer distance of 15 Å results in CCSs within 0.004% of their dense-sampling limit for all four ions (see Supplementary Figure S9). This default grid buffer distance value ensures that some trajectories traveling through regions of low-magnitude potential energy are included in the calculation, but extraneous computation time and memory use are avoided by omitting many trajectories that would deflect negligibly. Note that the grid buffer affects the CCS less for large ions, so it can be significantly reduced or excluded (to reduce computation time) for large enough ions.

The initial distance from the plane of origin to the center of mass, the cut-off energy used to fine-grain omission of trajectories, the number and geometry of vantage points, the range and number of energy states computed, and the integration time step were all optimized similarly, but these parameters were found to have less significant impact on computed CCSs than the parameters described above. Results used for optimization of these parameters, along with sample scattering trajectories simulated for C_{60} ^{+}, melittin, and GroEL, are illustrated in the Supplementary Figures S4, S5, and S10–S14).

### Comparison of Collidoscope CCSs to Other Calculated and Experimental Values

_{He}for species listed in Supplementary Table S1 calculated using Collidoscope are compared with those obtained using IMoS in Figure 3. These Collidoscope CCS

_{He}agree with those obtained via IMoS’s TM to within 4% for all species larger than 1 kDa. Because Collidoscope and IMoS calculations were performed on different CPUs, their computation times are not directly comparable. However, Collidoscope calculations were completed with wall times typically ~2% of the equivalent TM calculations in MOBCAL, on the same CPU. This decrease in computation time is largely due to parallelism.

_{He}and the CCS

_{N2}for GroEL

^{73+}were computed with Collidoscope, respectively, in 3.5 and 6 d (see Figure 4). (CCS

_{He}were not computed for GroEL 14-mer ions using MOBCAL TM due to prohibitive computational expense, nor were they calculated using IMoS due to software restrictions on the size of molecule that can be used.) PA and EHSS CCS

_{He}for this ion were computed with MOBCAL, and these values are 26.8% lower and 1.2% higher than the TM CCS computed with Collidoscope, respectively. These results are in agreement with previous observations that EHSS values tend to be very close to TM values for large, globular ions in low-field IMS, whereas PA values tend to be somewhat lower. No noticeable trend is observed in the relative differences between CCS

_{He}calculated with either MOBCAL or IMoS and with Collidoscope as a function of CCS. It is therefore expected that Collidoscope can be used to accurately calculate TM CCSs of ions with masses between ~10 kDa and several MDa that may require prohibitive computation time or memory use with other methods.

A log-log plot of computation time for CCS_{He} and CCS_{N2} versus ion mass is shown in Figure 4. Interestingly, distinct scaling law behaviors are observed for small, dense ions and for larger biomolecular ions. In particular, computation time scales with ion mass, *m*, as *m* ^{0.97} (resp., *m* ^{0.92}) for low-mass ions and as *m* ^{1.46} (resp., *m* ^{1.36}) for larger biomolecular ions with He (resp., N_{2}) collision gas. This result is attributed to the different densities of these two types of ions, which in Collidoscope requires calculation of more trajectories per unit mass for large biomolecular ions than for smaller, denser ions. Furthermore, computations with N_{2} buffer gas require roughly two to three times as much time as with He (see Figure 4).

_{He}and CCS

_{N2}to experimental results [18, 21, 53] is shown in Figure 5. Collidoscope CCS

_{He}are typically ~1%–3% higher than IMoS, and both values are typically higher than experimental CCSs. CCS

_{He}calculated with Collidoscope are typically between 3% and 15% higher than experimental CCSs, with smaller relative errors for smaller ions (CCS < ~200 Å

^{2}) than for larger ions (Figure 5). By comparison, results from IMoS TM for this set of ions are typically between 1% and 13% higher than experimental CCSs.

For Collidoscope CCS_{N2} calculations, optimized L-J 6-12 parameters for the quasi-spherical N_{2} collision particle were determined by extensive parametrization tests for *r* _{ min } and *ε* _{ i } values for both carbon and hydrogen, using experimental data for an assortment of aromatic hydrocarbon ions (see Supplementary Table S2). The *r* _{ min } and *ε* _{ i } parameters for carbon and hydrogen were optimized simultaneously, and default values were chosen that agree maximally with experimental results. We note that these optimized L-J 6-12 parameters are in some cases somewhat different from the parameters used by IMoS. This result indicates that the TM can be robust to variation of the L-J 6-12 parameters for large ions. CCS_{N2} obtained using these optimized parameters are typically 1% to 18% higher than experimental values for all ions studied, for both Collidoscope and IMoS (excluding GroEL, for which discrepant values have been reported in the literature [6, 29, 53]; see Figure 5).

## Discussion

The overall approach used to optimize Collidoscope is to reduce computational time without sacrificing accuracy and precision, and to provide a flexible, open-source, and parallelizable code base in anticipation of future needs for researchers in the field of native IM-MS. For example, if a researcher needs to calculate a CCS for a very large ion in Collidoscope with greater precision with limited computational time, s/he may choose to change the impact parameter grid spacing or the number of energy states used. Because the computation time varies as the inverse square of the grid spacing and only linearly with the number of energy states, it is less computationally expensive to increase the number of energy states used rather than to decrease the grid spacing. However, care must be taken when increasing the number of energy states, in order to ensure that the *dg* integral (see Equation 2) is accurately estimated.

A major difference between Collidoscope and previous TM implementations is the type of sampling used for trajectory parameters. Instead of using Monte-Carlo sampling, Collidoscope samples all trajectory parameters uniformly and with sufficient density to obtain CCSs close to the dense-sampling limit. Computation times vary from a few seconds for small ions to a few days for ~MDa-sized complexes, such as the GroEL 14-mer. The above results indicate that Collidoscope computes TM CCS_{He} in close agreement with IMoS’s TM implementation.

Because it accounts for long-range polarization effects, multiple scattering, and collision gas temperature, Collidoscope inherently models the physics of the scattering process at a greater level of detail than the PA/PA* or EHSS methods. Computing TM results for large ions is useful for validating empirical “geometry corrections” of values obtained from computationally inexpensive, but less physically detailed, methods such as PA and EHSS. Notably, the EHSS CCS_{He} of GroEL computed with MOBCAL is very close to the TM CCS_{He} for GroEL^{66+} computed with Collidoscope, though both values are considerably higher than previously reported experimental results for this ion in N_{2} buffer gas [6, 29, 53]. The MOBCAL PA CCS_{He} for GroEL is *higher* than the experimental CCS_{N2}, though for geometrical reasons the PA CCS generally provides a lower bound for the experimental low-field CCS of a dense, native-like ion. For each ion studied here, CCS_{N2} is strictly *greater* than CCS_{He}, due to both the larger size of the N_{2} molecule and its greater polarizability. These results all strongly suggest that the experimental GroEL CCSs previously reported pertain to conformations that are significantly more collapsed than the X-ray crystal structure. Therefore, calibration of IM-MS data using published experimental GroEL CCS_{He} or CCS_{N2} should be undertaken with careful consideration of the structures adopted under the experimental conditions used. Collidoscope can thus be a valuable tool for evaluating and developing calibration protocols for IM-MS data and for studying the conformational space of large ions.

Based on the comparison between Collidoscope CCSs with experimental values shown in Figure 5, Collidoscope predicts CCSs for protein geometries derived from X-ray crystallography data that are consistently between 3% and 15% (respectively, 1% and 18%) higher than experimental IM-MS values for native-like ions using He (respectively, N_{2}) collision gas. This agrees with previous observations that many native-like biomolecular ions undergo some compaction during the electrospray ionization process due to removal of solvent and subsequent self-solvation [3]. Empirically, a scaling factor of 0.91 for Collidoscope CCS_{He} and CCS_{N2} obtained with default parameters using X-ray crystallography-derived geometries of proteins ions is recommended if the user desires to empirically “correct” for this compaction effect for native-like ions with masses between ~1 kDa and ~1 MDa. Based on the results reported here, after this correction, the method precision for the TM using both IMoS and Collidoscope is about 10% for protein ions.

_{He}computed using Collidoscope agree very closely with those computed using either the IMoS TM or MOBCAL EHSS (Figure 3). Although N

_{2}buffer gas is not implemented in MOBCAL without modification, CCS

_{N2}are in close agreement between IMoS and Collidoscope. In addition, ratios between the CCS

_{N2}and CCS

_{He}computed using Collidoscope agree very well with experimental ratios for these values (Figure 6). Both sets of ratios converge toward an asymptotic value (~1.15) similar to those previously calculated for organic macro-ions using a variety of computational methods [9]. Together, these results indicate that Collidoscope’s implementation of the TM for both He and N

_{2}gas typically provides accurate estimates of CCSs for the input structures used. Differences between Collidoscope CCSs and experimental values (see Figure 5) for large protein ions are therefore attributed predominantly to discrepancies in the structures they represent (i.e., the experimental gas-phase structures studied are likely different from the crystal structures used here). A detailed analysis of possible structures adopted in these experiments is beyond the scope of this paper, but the ability to greatly reduce computational time for TM CCSs with Collidoscope (via GPU parallelism and other optimizations) will facilitate future work investigating the gas-phase conformational landscape for large ions.

Although we recommend using the extensively optimized default parameters presented here, customization of Collidoscope to suit the needs of the user is possible and relatively simple because of its object-oriented programming. Lennard-Jones parameters, collision gas properties, and the trajectory integration algorithm are coded separately and may be modified as needed.

## Conclusions

Collidoscope is a computationally efficient tool for calculating CCSs of ions with a wide range of masses using the Trajectory Method that produces results in close agreement with measured low-field ion mobility values for both He and N_{2} collision gas. Computation time is significantly decreased relative to MOBCAL TM calculations due to parallelized computing and optimized sampling of trajectory parameters. A GPU-parallelized version of Collidoscope is currently under development and will further reduce computational time. Because TM CCSs for megadalton-sized ions can be calculated using Collidoscope in a few days, the program makes detailed IM-MS analysis of the conformational space of large ions tractable at a high level of sophistication. CCSs computed with Collidoscope can also be used in combination with modeled structures to inform calibrations of IM-MS data for very large ions for which conformations are not precisely known or can vary with instrumental conditions, such as GroEL.

In the future, a more rigorous treatment of N_{2} will be implemented in Collidoscope to account for its permanent quadrupole moment and full dipolar polarizability tensor. Higher-order Runge-Kutta integration methods will also be made available as an option to further reduce trajectory integration error. Finally, other buffer gases, including argon, will be implemented. Collidoscope is available for public use upon request from the authors.

## Notes

### Acknowledgements

Computations on the University of Oregon ACISS Supercomputing Cluster were supported by the National Science Foundation (grant OCI-0960354). The authors thank Elliott Ewing for helpful discussions.

## Supplementary material

## References

- 1.Maisser, A., Premnath, V., Ghosh, A., Nguyen, T.A., Attoui, M., Hogan, C.J.: Determination of gas phase protein ion densities via ion mobility analysis with charge reduction. Phys. Chem. Chem. Phys.
**13**, 21630–21641 (2011)CrossRefGoogle Scholar - 2.Alexeev, Y., Fedorov, D.G., Shvartsburg, A.A.: Effective ion mobility calculations for macromolecules by scattering on electron clouds. J. Phys. Chem. A
**118**, 6763–6772 (2014)CrossRefGoogle Scholar - 3.Jurneczko, E., Barran, P.E.: How useful is ion mobility mass spectrometry for structural biology? The relationship between protein crystal structures and their collision cross-sections in the gas phase. Analyst
**136**, 20–28 (2011)CrossRefGoogle Scholar - 4.Ruotolo, B.T., Benesch, J.L.P., Sandercock, A.M., Hyung, S.-J., Robinson, C.V.: Ion mobility-mass spectrometry analysis of large protein complexes. Nat. Protoc.
**3**, 1139–1152 (2008)CrossRefGoogle Scholar - 5.Bernstein, S.L., Dupuis, N.F., Lazo, N.D., Wyttenbach, T., Condron, M.M., Bitan, G., Teplow, D.B., Shea, J.E., Ruotolo, B.T., Robinson, C.V., Bowers, M.T.: Amyloid-beta protein oligomerization and the importance of tetramers and dodecamers in the aetiology of Alzheimer's disease. Nat. Chem.
**1**, 326–331 (2009)CrossRefGoogle Scholar - 6.Hogan, C.J., Ruotolo, B.T., Robinson, C.V., de la Mora, J.F.: Tandem differential mobility analysis-mass spectrometry reveals partial gas-phase collapse of the GroEL complex. J. Phys. Chem. B
**115**, 3614–3621 (2011)CrossRefGoogle Scholar - 7.Lanucara, F., Holman, S.W., Gray, C.J., Eyers, C.E.: The power of ion mobility-mass spectrometry for structural characterization and the study of conformational dynamics. Nat. Chem.
**6**, 281–294 (2014)CrossRefGoogle Scholar - 8.Zhou, M., Dagan, S., Wysocki, V.H.: Protein subunits released by surface collisions of noncovalent complexes: nativelike compact structures revealed by ion mobility mass spectrometry. Angew. Chem. Int. Ed.
**51**, 4336–4339 (2012)CrossRefGoogle Scholar - 9.Larriba-Andaluz, C., Fernandez-Garcia, J., Ewing, M.A., Hogan, C.J., Clemmer, D.E.: Gas molecule scattering and ion mobility measurements for organic macro-ions in He versus N-2 environments. Phys. Chem. Chem. Phys.
**17**, 15019–15029 (2015)CrossRefGoogle Scholar - 10.Ewing, M.A., Glover, M.S., Clemmer, D.E.: Hybrid ion mobility and mass spectrometry as a separation tool. J. Chromatogr. A
**1439**, 3–25 (2016)CrossRefGoogle Scholar - 11.Niu, S., Rabuck, J.N., Ruotolo, B.T.: Ion mobility-mass spectrometry of intact protein–ligand complexes for pharmaceutical drug discovery and development. Curr. Opin. Chem. Biol.
**17**, 809–817 (2013)CrossRefGoogle Scholar - 12.Pacholarz, K.J., Garlish, R.A., Taylor, R.J., Barran, P.E.: Mass spectrometry based tools to investigate protein-ligand interactions for drug discovery. Chem. Soc. Rev.
**41**, 4335–4355 (2012)CrossRefGoogle Scholar - 13.Beveridge, R., Chappuis, Q., Macphee, C., Barran, P.: Mass spectrometry methods for intrinsically disordered proteins. Analyst
**138**, 32–42 (2013)CrossRefGoogle Scholar - 14.Laganowsky, A., Reading, E., Allison, T.M., Ulmschneider, M.B., Degiacomi, M.T., Baldwin, A.J., Robinson, C.V.: Membrane proteins bind lipids selectively to modulate their structure and function. Nature
**510**, 172–175 (2014)Google Scholar - 15.Vahidi, S., Stocks, B.B., Konermann, L.: Partially disordered proteins studied by ion mobility-mass spectrometry: implications for the preservation of solution phase structure in the gas phase. Anal. Chem.
**85**, 10471–10478 (2013)CrossRefGoogle Scholar - 16.Morsa, D., Defize, T., Dehareng, D., Jérôme, C., De Pauw, E.: Polymer topology revealed by ion mobility coupled with mass spectrometry. Anal. Chem.
**86**, 9693–9700 (2014)CrossRefGoogle Scholar - 17.Freeke, J., Robinson, C.V., Ruotolo, B.T.: Residual counter ions can stabilise a large protein complex in the gas phase. Int. J. Mass Spectrom.
**298**, 91–98 (2010)CrossRefGoogle Scholar - 18.Campuzano, I., Bush, M.F., Robinson, C.V., Beaumont, C., Richardson, K., Kim, H., Kim, H.I.: Structural characterization of drug-like compounds by ion mobility mass spectrometry: comparison of theoretical and experimentally derived nitrogen collision cross sections. Anal. Chem.
**84**, 1026–1033 (2012)CrossRefGoogle Scholar - 19.Pukala, T.L., Ruotolo, B.T., Zhou, M., Politis, A., Stefanescu, R., Leary, J.A., Robinson, C.V.: Subunit architecture of multiprotein assemblies determined using restraints from gas-phase measurements. Structure
**17**, 1235–1243 (2009)CrossRefGoogle Scholar - 20.Quintyn, R.S., Yan, J., Wysocki, V.H.: Surface-induced dissociation of homotetramers with D2 symmetry yields their assembly pathways and characterizes the effect of ligand binding. Chem. Biol.
**22**, 583–592 (2015)CrossRefGoogle Scholar - 21.Salbo, R., Bush, M.F., Naver, H., Campuzano, I., Robinson, C.V., Pettersson, I., Jørgensen, T.J.D., Haselmann, K.F.: Traveling-wave ion mobility mass spectrometry of protein complexes: accurate calibrated collision cross-sections of human insulin oligomers. Rapid Commun. Mass Spectrom.
**26**, 1181–1193 (2012)CrossRefGoogle Scholar - 22.Trimpin, S., Tan, B., Bohrer, B.C., O'Dell, D.K., Merenbloom, S.I., Pazos, M.X., Clemmer, D.E., Walker, J.M.: Profiling of phospholipids and related lipid structures using multidimensional ion mobility spectrometry-mass spectrometry. Int. J. Mass Spectrom.
**287**, 58–69 (2009)CrossRefGoogle Scholar - 23.Sterling, H.J., Kintzer, A.F., Feld, G.K., Cassou, C.A., Krantz, B.A., Williams, E.R.: Supercharging protein complexes from aqueous solution disrupts their native conformations. J. Am. Soc. Mass Spectrom.
**23**, 191–200 (2012)CrossRefGoogle Scholar - 24.Shepherd, D.A., Marty, M.T., Giles, K., Baldwin, A.J., Benesch, J.L.P.: Combining tandem mass spectrometry with ion mobility separation to determine the architecture of polydisperse proteins. Int. J. Mass Spectrom.
**377**, 663–671 (2015)CrossRefGoogle Scholar - 25.Ferguson, C.N., Benchaar, S.A., Miao, Z., Loo, J.A., Chen, H.: Direct ionization of large proteins and protein complexes by desorption electrospray ionization-mass spectrometry. Anal. Chem.
**3**, 6468–6473 (2011)CrossRefGoogle Scholar - 26.Silveira, J., Fort, K.L., Kim, D., Servage, K.A., Pierson, N.A., Clemmer, D.E., Russell, D.H.: From solution to the gas phase: stepwise dehydration and kinetic trapping of Substance P reveals the origin of peptide conformations. J. Am. Chem. Soc.
**135**, 19147–19153 (2013)CrossRefGoogle Scholar - 27.Gabelica, V., Baker, E.S., Teulade-Fichou, M.P., De Pauw, E., Bowers, M.T.: Stabilization and structure of telomeric and c-myc region intramolecular G-quadruplexes: the role of central cations and small planar ligands. J. Am. Chem. Soc.
**129**, 895–904 (2007)CrossRefGoogle Scholar - 28.Uetrecht, C., Rose, R.J., van Duijn, E., Lorenzen, K., Heck, A.J.R.: Ion mobility mass spectrometry of proteins and protein assemblies. Chem. Soc. Rev.
**39**, 1633–1655 (2010)CrossRefGoogle Scholar - 29.van Duijn, E., Barendregt, A., Synowsky, S., Versluis, C., Heck, A.J.R.: Chaperonin complexes monitored by ion mobility mass spectrometry. J. Am. Chem. Soc.
**131**, 1452–1459 (2009)CrossRefGoogle Scholar - 30.Allen, S.J., Bush, M.F.: Radio-Frequency (rf) confinement in ion mobility spectrometry: apparent mobilities and effective temperatures. J. Am. Soc. Mass Spectrom.
**27**, 2054–2063 (2016)CrossRefGoogle Scholar - 31.Liu, F.C., Kirk, S.R., Bleiholder, C.: On the structural denaturation of biological analytes in trapped ion mobility spectrometry-mass spectrometry. Analyst
**141**, 3722–3730 (2016)CrossRefGoogle Scholar - 32.Marklund, E.G., Degiacomi, M.T., Robinson, C.V., Baldwin, A.J., Benesch, J.L.P.: Collision cross-sections for structural proteomics. Structure
**23**, 791–799 (2015)CrossRefGoogle Scholar - 33.Mesleh, M.F., Hunter, J.M., Shvartsburg, A.A., Schatz, G.C., Jarrold, M.F.: Structural information from ion mobility measurements: effects of the long-range potential. J. Phys. Chem.
**100**, 16082–16086 (1996)CrossRefGoogle Scholar - 34.Paizs, B.: A divide-and-conquer approach to compute collision cross sections in the projection approximation method. Int. J. Mass Spectrom.
**378**, 360–363 (2015)CrossRefGoogle Scholar - 35.Bleiholder, C., Contreras, S., Bowers, M.T.: A novel projection approximation algorithm for the fast and accurate computation of molecular collision cross -ections (IV). Application to polypeptides. Int. J. Mass Spectrom.
**354–355**, 275–280 (2013)CrossRefGoogle Scholar - 36.Bleiholder, C.: A local collision probability approximation for predicting momentum transfer cross sections. Analyst
**140**, 6804–6813 (2015)CrossRefGoogle Scholar - 37.Shvartsburg, A.A., Jarrold, M.F.: An exact hard-spheres scattering model for the mobilities of polyatomic ions. Chem. Phys. Lett.
**261**, 86–91 (1996)CrossRefGoogle Scholar - 38.Larriba, C., Hogan, C.J.: Ion mobilities in diatomic gases: measurement versus prediction with non-specular scattering models. J. Phys. Chem. A
**117**, 3887–3901 (2013)CrossRefGoogle Scholar - 39.Larriba, C., Hogan, C.J.: Free molecular collision cross section calculation methods for nanoparticles and complex ions with energy accommodation. J. Comput. Phys.
**251**, 344–363 (2013)CrossRefGoogle Scholar - 40.Kihara, T.: The mathematical theory of electrical discharges in gases. B. Velocity-distribution of positive ions in a static field. Rev. Mod. Phys.
**25**, 844–852 (1953)CrossRefGoogle Scholar - 41.Mason, E.A., Schamp, H.W.: Mobility of gaseous lons in weak electric fields. Ann. Phys.
**4**, 233–270 (1958)CrossRefGoogle Scholar - 42.Wyttenbach, T., von Helden, G., Batka, J.J., Carlat, D., Bowers, M.T.: Effect of the long-range potential on ion mobility measurements. J. Am. Soc. Mass Spectrom.
**8**, 275–282 (1997)CrossRefGoogle Scholar - 43.Pachucki, K., Sapirstein, J.: Relativistic and QED corrections to the polarizability of helium. Phys. Rev. A
**63**, 012504 (2000)CrossRefGoogle Scholar - 44.Spelsberg, D., Meyer, W.: Static dipole polarizabilities of N2, O2, F2, and H2O. J. Chem. Phys.
**101**, 1282–1288 (1994)CrossRefGoogle Scholar - 45.Surman, A.J., Robbins, P.J., Ujma, J., Zheng, Q., Barran, P.E., Cronin, L.: Sizing and discovery of nanosized polyoxometalate clusters by mass spectrometry. J. Am. Chem. Soc.
**138**, 3824–3830 (2016)CrossRefGoogle Scholar - 46.Schnier, P.D., Gross, D.S., Williams, E.R.: On the maximum charge state and proton transfer reactivity of peptide and protein ions formed by electrospray ionization. J. Am. Soc. Mass Spectrom.
**6**, 1086–1097 (1995)CrossRefGoogle Scholar - 47.Schnier, P.D., Gross, D.S., Williams, E.R.: Electrostatic forces and dielectric polarizability of multiply protonated gas-phase cytochrome
*c*ions probed by ion/molecule chemistry. J. Am. Chem. Soc.**117**, 6747–6757 (1995)CrossRefGoogle Scholar - 48.Marchese, R., Grandori, R., Carloni, P., Raugei, S.: A computational model for protein ionization by electrospray based on gas-phase basicity. J. Am. Soc. Mass Spectrom.
**23**, 1903–1910 (2012)CrossRefGoogle Scholar - 49.Popa, V., Trecroce, D.A., McAllister, R.G., Konermann, L.: Collision-induced dissociation of electrosprayed protein complexes: an all-atom molecular dynamics model with mobile protons. J. Phys. Chem. B
**120**, 5114–5124 (2016)CrossRefGoogle Scholar - 50.Hastings, W.K.: Monte Carlo sampling methods using Markov chains and their applications. Biometrika
**57**, 97–109 (1970)CrossRefGoogle Scholar - 51.Metropolis, N., Rosenbluth, A.W., Rosenbluth, M.N., Teller, A.H., Teller, E.: Equation of state calculations by fast computing machines. J. Chem. Phys.
**21**, 1087–1092 (1953)CrossRefGoogle Scholar - 52.Bondi, A.: van der Waals volumes and radii. J. Phys. Chem.
**68**, 441–451 (1964)CrossRefGoogle Scholar - 53.Bush, M.F., Hall, Z., Giles, K., Hoyes, J., Robinson, C.V., Ruotolo, B.T.: Collision cross-sections of proteins and their complexes: a calibration framework and database for gas-phase structural biology. Anal. Chem.
**82**, 9557–9565 (2010)CrossRefGoogle Scholar