Introduction

Metabolite identification studies play a critical part of the early pharmaceutical discovery process and are required to speed up lead optimization and candidate selection for development. However, a major limitation of these studies is that neither radiolabeled material nor analytical standards of drug metabolites are usually available for quantification of the amounts of the various metabolites formed. The ionization efficiency of even closely related molecules within the ion-source of a mass spectrometer may vary, and simple metabolic modifications to a drug can drastically alter the mass spectral response, prohibiting comparison of MS peak areas to estimate abundance of the identified metabolites. Furthermore, it is known that both molar absorptivity (ε) and the maximum absorption wavelength (λmax) may be altered as a drug is metabolically modified, making the comparison of UV-derived peak areas without correction factors ambiguous. Therefore, development of a sensitive and specific technique for the quantitation of drug metabolites without the use of synthetic analytical standards or radiolabel would represent a major advance in preliminary metabolism screening in drug discovery.

Efforts toward obtaining a universal quantitative detection have been focused upon several detectors such as ultraviolet (UV) [1], evaporative light scattering detector (ELSD) [1, 2], chemiluminescence nitrogen detector (CLND) [1, 3], and charged aerosol detector (CAD) [4], among others. All of these detectors, however, have advantages as well as limitations. Proton nuclear magnetic resonance (1H NMR) [1, 5] and Electronic REference To access In-vivo Concentrations (ERETIC) method [1, 6] has also been utilized as a “Gold Standard” to enable quantification by integrating proton signals. However, the ERETIC method is time-consuming and cost-ineffective [1].

The objective of this study is to develop a novel mass spectrometric quantitative detection strategy without a requirement for specific analyte standards to achieve a rapid, cost-effective, and readily automated quantification method. The major challenging aspect of this work is the correct identification of the most important parameters that influence analyte response during the electrospray ionization (ESI) process. Various solution-phase factors such as mobile-phase additives, solution pH, pKa, analyte concentration and solvent composition [713] and gas-phase reactions including proton affinity [14, 15] have been reported to have a significant effect on ESI response.

In order to establish bounds on concentrations of trace analytes, it would be substantially more convenient to rely on validated relationships between computed (i.e., in silico) molecular properties and ionization response factors, since many computed properties can be determined efficiently and quickly. Various workers have employed uni- or multivariate statistical models to identify correlations between electrospray ion intensities and various computed (or measured) molecular properties for alkaloid cations [16], polar metabolites [17], alcohols [15], small molecule pharmaceuticals [13], and tripeptides [18]. Key quantities identified as having potential utility in multivariate relationships include polar and nonpolar surface areas, gas-phase basicities/proton affinities, solvation free energies [13, 18], molecular volume, octanol-water distribution coefficient (log D), and absolute mobility [17].

Advances in electronic structure theory [19] and quantum mechanical continuum solvation models [20, 21] have made the computation of proton affinities [2224] and solvation free energies [25, 26] increasingly accurate and affordable (and molecular surface areas are trivially computed, of course). Importantly, insofar as the goal is to determine a statistically relevant relationship between ESI MS response factors and molecular properties, it is not necessarily critical to find computational models that are accurate in an absolute sense—all that is required is that they be systematically accurate in a relative sense (that is, trends in a computed property must be predicted accurately in order to identify correlations with experimental properties, but the absolute quantities themselves may be off so long as they are off by systematic amounts). This reduced demand on computational accuracy permits a wide range of efficient theoretical models to have potential utility in developing useful multivariate predictive models.

In this work, we report the assembly of a training set of 66 drugs and drug-like molecules for which we computed various chemometric quantities, including polar and molecular surface areas, aqueous solvation free energies, and gas-phase and aqueous proton affinities. We identify a statistically significant relationship between measured mass spectral ion counts at a specific concentration and the combination of aqueous proton affinity and total molecular surface area. When applied to a test set of 11 drug-like molecules, the derived protocol provides a good correlation of prediction with experiment, suggesting its potential utility for broader application.

Experimental

Chemicals and Materials

Unless otherwise indicated, reagents used were purchased from Sigma-Aldrich (St. Louis, MO, USA). N-acetyl mesalazine was purchased from Enamine (Monmouth Junction, NJ, USA). Novartis compounds were synthesized in-house. LC-MS grade acetonitrile and water were purchased through VWR International (Radnor, PA, USA).

Mass Spectrometry

A linear ion trap hybrid Orbitrap Elite (Thermo Scientific, San Jose, CA, USA) mass spectrometer interfaced with a Dionex UltiMate 3000 (Thermo Scientific) HPLC and a CTC PAL autosampler was used in this study. All m/z measurements were performed within the Orbitrap. Data was collected in full scan profile mode between m/z 100 and 1000 in positive ionization mode. The resolution was set at 30,000 at m/z 400 with an AGC target of 1.00e6. The following source parameters were used for all samples analyzed; sheath gas flow rate 40, aux gas flow rate 30, sweep gas flow rate 2 (all units arbitrary), spray voltage 4.00 kV, capillary temperature 275 °C, and S-Lens rf level of 60%.

Sample Preparation and Introduction

Stock solutions (1–10 mM corrected for salt, purity ≥99%) of 66-molecule training set in LC-MS grade acetonitrile were diluted to 10 μM in 1:1 acetonitrile:water with 0.1% formic acid. For flow injection analysis the HPLC was controlled using Chromeleon Xpress (Thermo Scientific), the mass spectrometer was controlled using Tune Plus (Thermo Scientific), and the autosampler using Xcalibur. Fifteen μL was injected with a flow rate of 700 μL/min with a mobile-phase composition of 50% acetonitrile, 50% water, and 0.1% formic acid. For flow injection analysis, each compound was injected three times on three different days. Data was analyzed using Xcalibur Qual browser (Thermo Scientific) and Microsoft Excel. Extracted ion chromatograms were generated using the theoretical monoisotopic masses, including any observed adducts or neutral losses ±5 ppm. Peak areas were used for all calculations. The intra- and inter-day variability was ≤5% (zonisamide) (intra-day) and ≤4% (acetaminophen) (inter-day) percent relative standard deviation (%RSD).

To examine data reproducibility, six commercially available compounds (acetaminophen, traxoprodil, N-acetyl mesalazine, zonisamide, verapamil, and dexamethasone) from the 66-molecule training set were carefully selected based on the diversity of their chemical structures and to cover a wide range of molecular weight (150–450 Da), aqueous proton affinity (–250 to –300 kcal/mol), and total molecular surface area (~203 to 600 Å2). To account for the effect of mobile composition on the MS ionization efficiency, all six compounds were well separated and spread out at the entire HPLC gradient system used for the standard biotransformation protocol. Compounds (10 μM) were spiked into 0.1M phosphate buffer pH 7.4 and vortex mixed. The sample was then diluted with three volumes of LC-MS grade acetonitrile. The samples were vortex mixed and centrifuged at 10,000 rpm for 10 min. Two hundred μL of the supernatant was transferred to a 96-well plate and the sample was dried to completion under nitrogen. The sample was redissolved into 200 μL of 95% water, 5% acetonitrile with 0.1% formic acid; 20 μL was injected onto a Waters (Milford, MA, USA) Symmetry C18 column (2.1 × 150 mm, 5 μm beads) and analyzed by LC-MS. Buffer A was LC-MS grade water with 0.1% formic acid and buffer B was LC-MS grade acetonitrile with 0.1% formic acid. The following gradient was used: 0–5 min 5% B, 15 min 40% B, 25 min 95% B, 25–30 min 95% B, 31 min 5% B, 31–36 min 5% B. The column eluent from the first 5 min was diverted to waste. To make sure the results were reproducible, a total of three injections were made. Data was analyzed using Xcaliber Qual browser and Excel. Extracted ion chromatograms were generated using the theoretical monoisotopic masses, including any observed adducts or neutral losses ±5 ppm. Peaks were manually picked and peak areas were used for all calculations.

Computational Procedures

For each studied molecule, the structure was first drawn in GaussView [27] (a visualization tool that accompanies the electronic structure suite Gaussian 09 [28], used for all density functional calculations described below) and saved as a Gaussian input file. That structure was then imported to the molecular mechanics software program PC Model [29]. Next, the GMMX search algorithm within the PC Model program was employed to search for the lowest energy conformer (typically 1000 steps of search with otherwise default selection of parameters; this search covers rotation about single bonds (including amide and ester bonds) and ring conformations) using the MMX force field that is part of the PC Model program. The lowest energy conformer from the MMX search was then optimized (gas phase) at the M06-L [30] level of density functional theory, using the MIDI! basis set [31, 32] and a density fitting basis set that improves efficiency when employed with a local density functional [19] (this level of theory is denoted M06-L/MIDI!), The optimized structure was re-imported into PC Model and its polar, saturated nonpolar, unsaturated nonpolar, and total molecular surface areas were computed using the standard van der Waals radii employed by the package (we note that the definitions of “polar,” “saturated nonpolar,” and “unsaturated nonpolar” surface areas are not universal, however, as we found no significance for these variables in our analyses, we do not consider their details further here).

The most basic site of the molecule was then identified (either through chemical intuition or, when necessary, through examination of multiple possibilities), and the molecule was protonated at that site. Again, the structure was optimized (gas phase) at the M06-L/MIDI! level of theory (see Figure 1).

Figure 1
figure 1

Representative electrostatic potentials (ESPs) for the optimized structures of carbaryl, aminoglutethimide, and captan, three compounds chosen for this study, showing a ball-and-stick representation of the optimized M06-L/MIDI! geometries with gray balls representing carbon, white hydrogen, blue nitrogen, red oxygen, green chlorine, and yellow sulfur. The most red site on the ESP represents the most negative electrostatic potential (attractive to a positive test charge) and was selected for protonation. Blue regions are repulsive while intermediate colors of the spectrum reflect the quantitative change from attractive to repulsive

Next, for the gas-phase geometries of both the conjugate base and acid forms (i.e., as single points), aqueous solvation free energies were computed using the SMD aqueous solvation model [33] (this level of theory is denoted SMD(aq)/M06-L/MIDI!//M06-L/MIDI!). The aqueous proton affinity is computed as the difference in the single-point energies in solution of the protonated species and the neutral species (typically in the range of –250 to –300 kcal/mol; note that this quantity is consistent with the typical ion convention in solution, which takes the free energy of the solvated proton to be zero [34]). In preliminary work, we examined whether reoptimization of structures in aqueous solution led to improved results. Such reoptimizations were found to have negligible statistical significance. As reoptimization in solution roughly doubles the computational time per molecule, we relied on single-point calculations alone to compute aqueous proton affinities (see also Results and Discussion section).

Results and Discussion

Measurement of Target Property

A total of 66 drug-like molecules including eight Novartis compounds (compounds A, B, C, D, E, F, G, and H) and commercially available drugs (training set) were chosen based on their availability and diverseness of molecular weight, structure (phase I and phase II metabolism), and physical chemical properties. Flow injection analysis demonstrated concentration-dependent peak areas between 10 and 100 μM (data not shown). A concentration of 10 μM was chosen because at that concentration the detector was not saturated and that concentration is the substrate concentration used in our standard biotransformation assays. Compounds used for data reproducibility (acetaminophen, traxoprodil, N-acetyl mesalazine, zonisamide, verapamil, and dexamethasone) eluted between 7 and 18 min and peak areas for all compounds were reproducible with a % RSD ≤ 7.4%.

Chemometric and Statistical Analysis

Beginning with a training set of data for 40 molecules (Table 1), we considered a number of different quantities, including the aqueous free energy of solvation of the neutral form of the solute, computed either from a single-point energy calculation at the gas-phase optimized geometry or including relaxation in aqueous solution (columns 2 and 3 in Table 1), the solute proton affinity, computed either in the gas phase or including solvation free energies from either single-point calculations at gas-phase geometries or including relaxation effects in solution (columns 4–6 in Table 1), as well as the saturated nonpolar, unsaturated nonpolar, polar, and total molecular surface areas for the M06-L/MIDI! optimized structures (columns 7–11 in Table 1).

Table 1 Chemometric Data for 40 Molecules

Insofar as the chemometric quantities in Table 1 are either free energies, or surface areas, which are likely to be correlated linearly with free energy quantities associated with surface tensions, volatilities, etc., their variations should be associated with log variations in concentration measures. In the case of ion counts, the observed quantity may be related to a concentration (while all analytes are at a common total concentration, their conjugate acid/base speciation may vary, which may influence ionization counts), but is also related to efficiency of ionization and other factors associated with the electrospray experiment. As the measured data may be expected to have some concentration-like character, and as they span nearly three orders of magnitude in ion counts (and no computed molecular quantity is likely to span so similarly large a range in linear space), we chose to seek a correlation with the common logarithm of the ion counts measured from experiments with 10 μM analyte concentrations (column 1 of Table 1). The last row of Table 1 indicates the single variable correlation coefficient R between the various individual tabulated quantities and the log10 ion counts. It is immediately apparent that the solvation free energy of the neutral solute is completely uncorrelated with the ion count data (R < 0.02), whereas strong correlation is observed with the gas-phase and aqueous proton affinities computed with or without solvent relaxation (R > 0.66), as well as with the saturated nonpolar and total surface areas (R > 0.70).

The observation of a good correlation with proton affinity (basicity) is consistent with prior multivariate analyses along similar lines [7, 8]. Considering the various proton affinity computational protocols, they provide data that are highly correlated (R > 0.94 between methods). As single-point solvated calculations include the important physical effect of aqueous solvation at very small cost, we elected to continue to explore further with this descriptor. With respect to the surface areas, the saturated nonpolar and total surface areas show the highest correlation with the target ion count data. Those two descriptors are also highly correlated with one another (R = 0.915). As the total surface area is much more unambiguously defined, we elected to carry this descriptor forward for further analysis as well.

Considering multilinear regressions that included, along with the single-point aqueous proton affinity and the total surface area, other descriptors not already strongly correlated with those two (i.e., neutral molecule solvation free energy, unsaturated nonpolar surface area, and polar surface area) failed to provide meaningful statistical improvements. However, the correlation coefficient between the proton affinity and total surface area descriptors themselves over the 40 molecules in Table 1 is R = 0.482, suggesting that the training set could be improved through addition of molecules reducing this cross-correlation.

To that end, we added an additional 26 molecules, computing only the necessary two descriptors (Table 2). When taken together over the 66 molecules in the expanded training set, the cross-correlation between the proton affinity and total surface area descriptors is reduced to R = 0.228, which we deem acceptable.

Table 2 Chemometric Data for an Additional 26 Molecules

Over the 66 molecules in the training set, bilinear regression of the log10 ion counts (IC) on the aqueous proton affinity (PA) and total molecular surface area (SA) provides

$$ { \log}_{10}\left(\mathrm{I}\mathrm{C}\right)=-4.824-\left(3.912\times {10}^{-2\ }{\mathrm{molkcal}}^{-1}\right)\cdotp \mathrm{P}\mathrm{A}+\left(3.682\times {10}^{-3}\kern0.35em {\mathring{\mathrm{A}}}^{-2}\right)\cdotp \mathrm{S}\mathrm{A} $$
(1)

with multiple R value 0.791, multilinear regression F ratio of 52.5, and p values for the intercept, c 1 and c 2 of 1.4 × 10–3, 4.3 × 10–10, and 2.5 × 10–6, respectively. The standard errors on the intercept, c 1 and c 2 are 1.447, 0.530 × 10–2 mol kcal–1, and 0.711 × 10–3 Å–2, respectively. Note that qualitatively, Equation 1 predicts that molecules having proton affinities (or, more loosely, basicities) that are larger in magnitude—recalling that the quantity itself is negative—will show increased ion counts, as will molecules that are larger in overall surface area. Thus, many of the molecules in Tables 1 and 2 with larger observed ion counts include basic amine-type functionality of one sort or another.

A plot of predicted versus experimental log data for a training set of 66 molecules (purple circles) and a test set of 11 molecules (brown squares, see below for further discussion) at nominal 10 μM concentration is provided in Figure 2. Note that the most significant outliers are at lower left, that is, compounds with very low measured ion counts but reasonably high predicted ion counts. The three lowest points are naproxen and two glucuronides.

Figure 2
figure 2

Log/log plot of predicted versus observed mass spectral ion counts for a training set of 66 molecules (purple circles) and a test set of 11 molecules (brown squares) at nominal 10 mM concentration. Expanded area of graph includes the 11 molecule test data set

If one treats these three compounds as outliers, regression on the remaining data leads to a significantly improved F ratio (67.5). However, the regression itself is affected primarily in the intercept, which is expected in the event of removing such outliers (that are over-predicted). On the whole, regression on the full data set seems likely to be the most general and useful in the absence of further expansion of the training set. While one could consider making the presence of a glucuronide group a variable (a constant), with only four in the training set, this seems statistically dubious. Over the full 66-molecule training set, the mean absolute deviation between experiment and prediction from Equation 1 is 0.44 log units.

To assess the robustness of our fit, we next considered a test set of 11 molecules, chosen from a proprietary list of Novartis compounds and having limited similarity to molecules used in the training set. Employing Equation 1, we predicted log ion count data that compare very well to experimental measurement, as illustrated in Figure 2. There is an excellent correlation of predicted values with the variation in the data, and the mean absolute deviation between the predicted and experimental values is 0.29 log units. This deviation is smaller than that observed over the full training set. It is also systematic (i.e., every prediction somewhat overestimates the observed value). We reconsidered whether a regression equation removing the lowest outliers mentioned above would have improved predictive performance on the 11-molecule test set, but it was essentially unaffected.

We further examined the robustness of our predictive protocol by pooling all 77 data and randomly assigning each point to one of seven 11-member subsets in order to perform a cross-validation analysis. We carried out bilinear regression to generate equations analogous to Equation 1 on the remaining 66 molecules associated with each subset, and we examined the utility of those equations for predicting log ion count data treating the left-out 11 molecules in each case as test sets. The results are summarized in Table 3. For the seven different bilinear regressions, we obtained intercepts of –4.299 ± 0.496, coefficients multiplying the aqueous proton affinity of (–3.768 ± 0.160) × 10–2 mol kcal–1, and coefficients multiplying the total surface area of (3.265 ± 0.181) × 10–2 Å–2. These values all fall within the standard error ranges associated with these parameters for Equation 1 itself. The multilinear regression F ratios predicted for the seven fits range from 58.7 to 80.2, whereas the Pearson correlation coefficients R (for the training set) range from 0.807 to 0.847.

Table 3 Results from Leave-11-Out Cross-Validation Studies on Combined Data

Considering the application of each regression to make predictions for its specific test set, the mean unsigned residuals over the various test sets range from 0.281 to 0.569, whereas the associated Pearson correlation coefficients R range from 0.681 to 0.886. In all cases but that of subset 1, R > 0.8; this reflects the presence of two of the outliers having very low ion counts in subset 1, which also contributes to this subset exhibiting the largest unsigned error in the residuals.

The good observed performance overall of the various bilinear regressions analogous to Equation 1 on the different randomized 11-molecule test sets is encouraging. We anticipate that this overall model may prove to be useful in the future for estimating concentrations of unknown analytes through the comparison of predicted ion count data to observed data.

Conclusions

We have demonstrated that semiquantitative information on drugs and metabolites can be obtained by an in silico approach and without using radiolabeled compounds and chemically synthesized metabolite standards. We propose that the reported methodology herein would allow a quick semiquantitative assessment of drugs and metabolites much earlier in drugs discovery, hence, identifying metabolism-related liabilities of key compounds and effectively managing the development of a lead series. Additionally, this semiquantitative approach could be applied in early development to assess whether humans produce metabolites that are adequately covered by preclinical species or form unique/disproportionate metabolites that require additional safety testing.