Introduction

Metabolism (biotransformation) refers to the biochemical conversion of endogenous and exogenous compounds from one form to another by specific enzymes aiming to increase their excretion from the body. Biotransformation studies are requested by the US Food and Drug Administration [1, 2], European Medicines Agency [3], or International Conference on Harmonization [4] to bring new drug candidates safely into patients. At discovery stage, metabolic liabilities within a drug candidate need to be investigated to optimize metabolic turnover. Objectives of in vitro and in vivo–conducted biotransformation studies at development stage cover (i) the identification of circulating metabolites in human and selected animal toxicity species, (ii) testing of their pharmacological activity, (iii) assessment of the presence of reactive or toxic metabolites, and (iv) mapping of metabolites into metabolic pathways to elucidate elimination pathways [5,6,7]. Liquid chromatography–high-resolution mass spectrometry (LC-HRMS) represents the key analytical technology used for biotransformation studies [8,9,10]. Although LC-HRMS analysis offers a high degree of specificity for metabolite profiling and identification due to obtained retention times, mass-to-charge (m/z) ratios of precursor ions, their mass fragmentation, and exact mass measurements, several challenges remain. For instance, isobaric metabolites or structural isomers cannot readily be differentiated from each other solely based on the beforementioned physicochemical parameters due to (i) co-elution, (ii) similar MS, or (iii) MS2 spectral data. Moreover, low abundant in vivo metabolites can hardly be identified when highly abundant co-extracted interferences from matrix components, co-medications, or residual formulation entities are present causing ion suppression and increased detection limits. Lastly, the absence of radiotracers at early drug development stage further impedes a reliable and complete metabolite elucidation. Fortunately, the combination of ion mobility spectrometry (IMS) with mass spectrometry (MS) analysis (IMS-MS) introduces an additional analytical dimension to overcome some of the previously stated hurdles. IMS-MS enables the separation of ions in the gas phase due to differences in size, shape, and charge [11,12,13]. In the perspective of biotransformation studies, the application of LC-IMS-MS is beneficial in several ways: (i) Elucidation of in vivo metabolites can significantly be facilitated allowing the effective elimination of background ions from matrix components in the MS spectrum [14]. (ii) Positional and structural isomers can readily be separated using either linear [15, 16] or cyclic IMS-MS devices [17,18,19] exhibiting an increased IMS resolution [20]. (iii) Conformational changes of enzymes upon ligand binding can be assessed [21]. The determination of drift time-derived collision cross section (CCS) values, however, represents the major benefit for metabolism studies [12, 22]. Such robust and analyte-specific parameters allow simple identification and tracking of analytes across investigated matrices, different species, and studies [22]. Such alignment is particularly important when drug development spans over multiple years, compounds have been in-licensed, or biotransformation studies were not entirely conducted in-house (same analytical lab or instrument). Ross et al. [23] previously reported changes in measured CCS values (CCSmeas) between parent drugs and their in vitro metabolites which created the foundation of our presented work. Their earlier finding agreed with our internal data from various drug development projects. However, we further observed repetitive shifts in relative CCSmeas values between parent drugs (or precursors) with their corresponding in vitro or in vivo metabolites. In this publication, we present the outcome of our investigation demonstrating if such relative shifts in CCSmeas are biotransformation-specific and highlighting how such relative shifts in CCSmeas further benefit biotransformation studies.

Material and methods

Chemicals, reagents, and reference material

Formic acid (FA), MS grade water, and acetonitrile (ACN) were obtained from Fisher Chemicals (Loughborough, UK). Adenosine-3′-phosphate 5′ phosphate lithium salt (PAPS), dimethyl sulfoxide, leucine enkephalin acetate salt, reduced L-glutathione (GSH), poly-DL-alanine, and sodium formate were purchased from Sigma-Aldrich (Buchs, Switzerland). Potassium phosphate buffer (0.5 M, pH 7.4) was provided by Thermo Fisher Scientific (Allschwil, Switzerland). Pooled liver S9 fractions were purchased from either BioIVT (West Sussex, UK) or Corning (Woburn, MA, USA). Reference material of investigated compounds was obtained either commercially from various vendors (SI, Table S1) or was synthesized in-house. Due to proprietary reasons, drug names, development codes, batch numbers, or obtained m/z values of utilized internal reference material of active development compounds cannot be disclosed. Corresponding material was denoted as either NVSx (parent drugs) or Mx (metabolites). Moreover, molecular structures of selected internal compounds can only be shown partially.

Stock and working solutions

Reference material of commercial or internal compounds was first dissolved in dimethyl sulfoxide to obtain stock solutions at 1 mM. Individual stock solutions were further diluted in water/ACN (95/5, v/v) to obtain working solutions at 10 µM containing up to five different compounds. The selection of compounds, which were pooled into individual working solutions, was based on physicochemical properties (molecular weights, polarities, and ionization profiles). Moreover, neither isobaric metabolites (e.g., 10- and 2-hydroxy imipramine) nor metabolites which potentially get back-converted to the parent drug/precursor due to in-source fragmentation (e.g., sulfates and glucuronides) were merged together into the same working solution. No further sample preparation was required for CCSmeas determination.

In vitro incubations

Several parent drugs or precursors were incubated in pooled liver S9 fractions from CD1 mice, Sprague–Dawley rats, cynomolgus monkeys, or humans to generate specific metabolites for which no reference material was available (neither commercially nor internally). Deep-frozen pooled liver S9 fractions were first thawed at room temperature and subsequently diluted in potassium phosphate buffer (100 mM, pH 7.4) containing either PAPS (4 mM) or GSH (5 mM) as a cofactor for sulfation or glutathione conjugation, respectively. Following preincubation on a Thermomixer C from Eppendorf (Hamburg, Germany) for 5 min at 37 °C while shaking at 600 rpm, incubations were initiated by adding either the parent drug or precursor (7-hydroxy coumarin, estradiol, raloxifene, and serotonin for sulfation while acalabrutinib, afatinib, branebrutinib, ibrutinib, rociletinib, and spebrutinib were used for GSH conjugation). The total incubation volume was 500 µL with a final protein and substrate concentration of 2 mg/mL and 10 µM, respectively. Following 2 h of incubation at 37 °C and 600 rpm (Thermomixer C), 100 µL of each incubate was drawn and the enzymatic activity was quenched with four volumes of ice-cold ACN. After centrifugation for 15 min at 4 °C and 18,000 g, 200 µL of each supernatant was transferred to a fresh 0.5 mL ProteinLobind tube (Eppendorf). The transferred supernatants were evaporated to dryness under a gentle stream of nitrogen (N2) and reconstituted in 50 µL of water/ACN (95/5, v/v). Following brief agitation, each reconstituted supernatant was centrifuged again for 5 min at 4 °C and 18,000 g prior to metabolite profiling.

Preclinical plasma samples of NVS1

NVS1 was administrated once daily either orally (100 mg/kg for 112 days) or intravenously (5 mg/kg for 111 days) to twenty Han Wistar rats or six purebred beagle dogs, respectively. In both cases, mixed-gender animals were used. Rats were obtained from Charles River Laboratories (Raleigh, NC, USA) whereas dogs were received from Marshall BioResources (North Rose, NY, USA). Blood samples (300 µL for rat and 1 mL for dog) were drawn at 0.083 (dog only), 0.25 (rat only), 0.5 (dog only) 1, 3, 7, and 24 h post-dose into K2 EDTA-containing tubes. Plasma was harvested following centrifugation for 10 min at 2700 rpm and 4 °C. Regardless of the species, plasma pools (AUC0-24 h) were generated according to the pooling strategy proposed by Hamilton et al. [24]. Metabolite profiling of the protein-precipitated plasma pools was conducted by LC-HRMS analysis (SI, Table S2).

LC-IMS-MS analysis

CCSmeas determination and metabolite profiling were conducted with an Acquity I-Class UPLC system from Waters (Milford, MA, USA) coupled to a Waters Synapt G2-Si HD QTOF high-resolution mass spectrometer. Depending on the sample and experiment, 2–10 µL were injected onto a Waters Acquity UPLC HSS T3 column (2.1 × 150 mm, 1.8 μm) being maintained at 40 °C. The mobile phases consisted of 0.1% FA in water (A) and 0.1% FA in ACN (B). The elution gradient with a flow rate of 0.4 mL/min was set as follows: 0.0–1.0 min, 5% B; 7.8–8.5 min, 95% B; 8.6–10.0 min, 5% B. The minor part of the post-column split (1/5, v/v) was directed towards the mass spectrometer. The capillary voltage of the electrospray ionization (ESI) was either 3 kV (positive mode) or 2 kV (negative mode). The remaining IMS-MS parameters were set as follows: source temperature 120 °C, sampling cone voltage 10 V, cone gas flow rate 20 L/h (N2), desolvation temperature 350 °C, desolvation gas flow rate 600 L/h (N2), nebulizer gas pressure 6 bar, IMS gas flow rate 90 mL/min (N2), IMS wave velocity and height at 500 m/s and 40 V, respectively. HDMSe data were acquired in resolution mode using the continuum data format: full-scan (m/z 100–1200) with a scan time of 0.5 s either without (function 1) or with collision energy ramp from 10 to 50 eV in the transfer cell (function 2) of the TriWave device for precursor ion TWCCSN2 determination. For product ion TWCCSN2 determination, fixed collision energies (either 12 or 25 eV) were applied in the trap cell of the TriWave device on the low instead of high-energy MS trace while maintaining the same IMS-MS parameters as stated above. Mass and TWCCSN2 calibrations were fully automated (Waters IntelliStart application in the instrument console) and were conducted with 0.5 mM sodium formate in ACN/water (1/1, v/v) containing 0.1% FA and polyalanine solution at 20.0 µg/mL in ACN/water (1/1, v/v) containing acetaminophen at 5.00 µg/mL, respectively. Utilized reference TWCCSN2 values used to convert measured drift times of individual ions into corresponding TWCCSN2 values were previously published by Bush et al. [25]. Those values are further provided in the supplementary information (SI, Table S3). The entire LC-IMS-MS system was operated and controlled under Waters Masslynx (v4.2).

Prediction and determination of measured TWCCSN2 values

Mol files of the molecular structure from each investigated compound were generated with ChemDraw (v19.1) from PerkinElmer (Waltham, MA, USA) prior to their upload to CCSonDemand (Waters), a machine learning algorithm for TWCCSN2 value prediction (TWCCSN2, pred) [26, 27]. Measured TWCCSN2 values (TWCCSN2, meas) were determined with Waters Unifi (v1.9.4). For this, accurate mass screening on LC-IMS-MS data was performed with a mass accuracy of 20 ppm. The [M + H]+ ion of leucine enkephalin at m/z 556.2766 and its drift time were further used for exact mass and CCS correction. Following completion of the post-acquisition data processing, a summary table including identified molecular ions, mass accuracy, retention time, peak intensity, and TWCCSN2, meas was provided for each analyte by Unifi.

Validation of TWCCSN2, meas values

A reference mix (SI, Table S4) was analyzed whenever a new data set was acquired to validate TWCSSN2, meas values. If several sample sets were acquired in a single run, then three reference mix injections were distributed across the entire sequence to ensure no drift in TWCCSN2, meas over time. If the mean TWCCSN2, meas of each compound in the reference mix was within ± 2% from its previously determined and published value [28], then TWCCSN2, meas determination of the entire sample set was considered acceptable. If not, then TWCCSN2 calibration and sample set(s) analysis were repeated.

Accuracy, precision, and mean absolute error of TWCCSN2, meas versus TWCCSN2, pred

The accuracy (% bias) was calculated by dividing the difference between the mean TWCCSN2, meas and TWCCSN2, pred by the TWCCSN2, pred which was further multiplied by a factor of 100. The precision of TWCCSN2, meas was determined by the coefficient of variation (CV): the standard deviation (SD) between individual TWCCSN2 measurements divided by the mean TWCCSN2, meas which was multiplied by a factor of 100. The mean absolute error was calculated by dividing the sum of absolute errors (difference between TWCCSN2, pred and mean TWCCSN2, meas) by the number of investigated compounds. Error bars displayed in subsequent figures correspond to the obtained SD.

Results and discussion

In total, 165 compounds were incorporated into our assessment, and their CCS values (TWCCSN2) were determined following LC-IMS-MS analysis (SI, Table S5). The entire data set, covering an overall mass (m/z) and TWCCSN2 range of m/z 146–862 and 120–280 Å2 respectively, was utilized to compare mean measured (TWCCSN2, meas) with predicted CCS (TWCCSN2, pred) values (see “Correlation between TWCCSN2, pred and mean TWCCSN2, meas values”). Moreover, a reduced data set (n = 139) was further used to verify if the previously observed repetitive shifts in relative TWCCSN2, meas between parent drugs (or precursors) and their metabolites were biotransformation-specific. The selection and inclusion of those compounds for such assessment was conducted based on the possibility of creating compound pairs (n = 86; SI, Table S6) covering three common types of phase I biotransformation [hydroxylation (n = 19), N-oxidation (n = 9), demethylation (n = 13)] and five common phase II biotransformations [O-glucuronidation (n = 14), N-glucuronidation (n = 9), sulfation (n = 10), glutathione conjugation (n = 7), and acetylation (n = 5)]. Although most investigated compounds represented already marketed drugs and their corresponding metabolites, additional internal compounds were included for our assessment to extend the number of compound pairs for certain types of biotransformation. The only prerequisite for internal compound inclusion was to have synthesized reference material with a nuclear magnetic resonance-confirmed structure.

Reproducibility of TWCCSN2, meas values

Similar to other published data [22, 29,30,31], the generated analyte-specific TWCCSN2, meas values were highly reproducible over the entire period of our assessment (8 months) which was not only demonstrated with the reference mix used for system suitability testing (SI, Table S4 and Fig. S1) but also with the entire dataset (SI, Table S5). For the latter, the SD between individually determined TWCCSN2, meas values (up to n = 3) was maximum 1.8 Å2 resulting in a precision ≤ 1.0%.

Correlation between TWCCSN2, pred and mean TWCCSN2, meas values

After achieving excellent reproducibility in TWCCSN2, meas values for our entire data set (n = 165), a better understanding of its accuracy was required. For this, mean TWCCSN2, meas values were further compared with their TWCCSN2, pred values (Waters CCSonDemand) which overall resulted in a good correlation (R2 = 0.9594) as depicted in Fig. 1a. The obtained linear regression with a slope of 0.9036 (95% confidence intervals ranging from 0.8748 to 0.9323) and an intercept at + 18.14 (95% confidence intervals ranging from 12.49 to 23.78) was also close to its optimum (dashed black line). On the individual level, the obtained bias between TWCCSN2, pred and mean TWCCSN2, meas ranged from − 6.5 to + 11.9% (SI, Table S5) while the mean absolute error over the entire data set was 5.1%. The bias was within ± 10% (red area/dashed lines in Fig. 1) for 164 of the investigated compounds (99.4%). By further narrowing the acceptance criterion, most of the investigated compounds (87.3%, n = 144) met the ± 5% threshold (orange area/dotted lines in Fig. 1) while only 64.8% (n = 107) met the most stringent acceptance criterion of ± 3% (green area/solid lines in Fig. 1). It is worth mentioning that additional efforts were conducted to elucidate the origin of obtained bias in our presented data set. Neither the molecular weight (data not shown) nor an increase in TWCCSN2 could be identified as a root cause. The latter demonstrated an equal distribution in negative (n = 93) and positive bias (n = 72) over the investigated TWCCSN2 range (Fig. 1b). Hence, additional investigations would be necessary to clarify why certain TWCCSN2 values could not be predicted as accurately as other ones. Besides CCSonDemand, other computational tools or algorithms, previously reporting excellent correlations between CCSpred and CCSmeas values [32, 33], could further be tested with our publicly shared TWCCSN2, meas values (SI, Table S5 or the separately provided.csv file in the online resources). Such assessment, however, was out of scope for this study. Overall, a high confidence was associated with the generated data set which was used for our assessment of biotransformation-specific relative shifts in TWCCSN2, meas as discussed next.

Fig. 1
figure 1

a Correlation between predicted (TWCCSN2, pred) and mean measured CCS values (TWCCSN2, meas) for 165 investigated compounds. b Bias distribution across investigated TWCCSN2 range. The colored areas or lines represent the deviation of ± 10% (red/dashed), ± 5% (orange/dotted), and ± 3% (green/solid) from the optimum (dashed black line)

Biotransformation-specific relative shifts in mean TWCCSN2, meas values

An almost linear relationship between shifts in mean TWCCSN2, meas values relative to the increase or decrease in mass was obtained for each type of investigated biotransformation (Fig. 2a). For instance, the elimination of a methyl group from the parent drug (− 14 Da) decreased the mean TWCCSN2, meas relative to the parent drug or precursor by - 6.5 ± 2.1 Å2 on average (Table 1). On the other hand, the mean TWCCSN2, meas relative to the parent drug or precursor increased by + 13.5 ± 1.9 Å2 on average following acetylation (+ 42 Da) while sulfation (+ 80 Da) increased the mean TWCCSN2, meas in comparison to the parent drug or precursor even further (+ 17.9 ± 4.4 Å2). Oxygenation as phase I biotransformation, including hydroxylation (+ 3.8 ± 1.4 Å2) and N-oxidation (+ 3.4 ± 3.3 Å2), also perfectly fitted into the almost linear relationship (Fig. 2a) which agreed with other published data [23]. However, both phase I biotransformations could not be differentiated from each other purely based on IMS-MS data since similar relative mean shifts in TWCCSN2, meas were observed (Fig. 2b). Fortunately, other experiments such as hydrogen–deuterium exchange [34, 35] or selective reduction of N-oxides to amines with titanium (III) chloride [36] enable the discrimination between both types of biotransformation. Secondary (n = 3), tertiary (n = 4), and quaternary N-glucuronides (n = 2) included in our assessment exhibited on average a slightly higher relative mean shift in TWCCSN2, meas (+ 41.7 ± 7.5 Å2) compared to the investigated O-glucuronides (+ 38.1 ± 8.9 Å2) with almost identical SDs (Table 1 and Fig. 2c). On some occasions, a much more discriminative relative shift in TWCCSN2, meas values was obtained when the parent drug was either O- or N-glucuronidated (see “Potential to elucidate structural relationships between metabolites”) while the tendency of N-glucuronides exhibiting higher relative shifts in TWCCSN2, meas values compared to O-glucuronides remained constant.

Fig. 2
figure 2

a Correlation between mean measured mass changes and relative mean TWCCSN2, meas shifts. Zoom into b hydroxylation and N-oxidation as well as c O- and N-glucuronidation. d Example for TWCCSN2 calculation (grey italics values) based on the obtained polynomial regression from a in comparison to TWCCSN2, meas values (black values)

Table 1 Obtained mean m/z and relative mean TWCCSN2 shifts based on measured or predicted CCS values

It was further observed that the variation in relative mean shifts in TWCCSN2, meas values was significantly increased for phase II compared to phase I biotransformations which was somehow expected: simple modifications following phase I reactions typically introduce relatively small changes in the molecular structure of the parent drug resulting in fairly consistent shifts in relative TWCCSN2, meas values (except for major dealkylations within the molecule). On the other hand, conjugation of a large and rather flexible residue such as glutathione (tripeptide, + 305 Da) alters the molecular structure significantly which resulted in a mean relative TWCCSN2, meas increase by + 49.2 ± 13.2 Å2 on average (Table 1). Two main factors were identified as a potential origin of the increased variation in relative mean TWCCSN2, meas shifts following phase II biotransformation: (i) bulky entities could exist in different gas phase conformations and (ii) the site of conjugation can further influence changes in relative mean TWCCSN2, meas values. For instance, glucuronic acid conjugated to a flexible aliphatic side chain could have a significantly different TWCCSN2, meas compared to its isobaric version where conjugation occurs on a rather rigid aromatic ring system (see example in “Potential to elucidate structural relationships between metabolites”).

Nonetheless, almost non-overlapping bands in relative mean TWCCSN2, meas shifts were obtained for each type of investigated biotransformation (Fig. 2a and Table 1). The plateau towards the upper end of the correlation between the mean measured mass and relative mean TWCCSN2, meas shift was most likely caused by the underestimation of the TWCCSN2, meas values for the glutathione conjugates which rather tend to be multiply charged instead of singly charged. It is noteworthy mentioning here that IMS and the CCS value of a molecule strongly depend on its charge state [29, 37]. For our work, however, the TWCCSN2, meas of a singly charged ion from a metabolite had to be compared with the one of its parent drug (see also “Limitations”). The obtained relative mean TWCCSN2 shifts based on TWCCSN2, meas values further agreed with the theoretical relative shifts when taking TWCCSN2, pred values into consideration (Table 1 and SI, Fig. S2).

Based on the obtained polynomial regression (y =  − 0.0005x2 + 0.3202x − 1.4506) between the mean measured mass changes and relative mean shifts in TWCCSN2, meas (Fig. 2a), TWCCSN2 values of metabolites could readily be calculated which is illustrated for an internal compound in Fig. 2d: the TWCCSN2 of the parent drug (INE963) following hydroxylation was calculated 207.2 Å2 (TWCCSN2, calc) which agreed with the actual TWCCSN2, meas for metabolite M13 (207.7 Å2). Even after subsequent N-acetylation and demethylation of M13, resulting in metabolite M8, the TWCCSN2, calc (215.4 Å2) was in excellent agreement with the TWCCSN2, meas (216.0 Å2). Hence, the discovered biotransformation-specific relative shifts in mean TWCCSN2, meas values (and the combination thereof) can also be considered for metabolite assignment/confirmation (rather than their absolute TWCCSN2, meas values) complementing the conventional approach where changes in m/z values are compared. Only for glucuronidation or glutathione conjugation, special care must be taken exhibiting increased variabilities and slightly overlapping bands in relative mean TWCCSN2, meas shifts (Fig. 2a and Table 1).

Similar biotransformation-specific shifts in relative TWCCSN2, meas obtained on fragment ion level

Identical biotransformation-specific shifts in relative TWCCSN2, meas were also obtained on fragment ion level as illustrated for hydroxylation, demethylation, and N-acetylation in Fig. 3. Phase II biotransformations such as sulfation and glucuronidation were not considered for this assessment since the major fragment ion from such metabolites is often only the parent drug or corresponding precursor following the characteristic neutral loss of the sulfate (80 Da, SO3) or the glycone (176 Da, C6H8O6), respectively [38,39,40,41]. One prerequisite for the assessment of biotransformation-specific shifts in relative TWCCSN2, meas on fragment ion level was the presence of common fragment ions between the metabolite and parent drug or precursor (with and without the biotransformation). The relative shift in TWCCSN2, meas for mephenytoin after hydroxylation, leading to 4-hydroxy mephenytoin, was + 4.1 Å2 on precursor ion level (Fig. 3a). An almost identical relative shift (+ 3.4 Å2) was obtained after comparing the TWCCSN2, meas values of the major fragment ion for mephenytoin at m/z 134 and 4-hydroxy mephenytoin at m/z 150 (Fig. 3b). For demethylation, the relative shifts in TWCCSN2, meas between diazepam and nordazepam on precursor and fragment ion level were again in the similar range: − 4.6 Å2 on precursor ion level (Fig. 3c) and − 3.8 Å2 on fragment ion level (Fig. 3d) considering the fragment ions at m/z 222 and m/z 208 for diazepam and nordazepam, respectively. The best agreement in biotransformation-specific relative shifts in TWCCSN2, meas determined on precursor (+ 8.1 Å2, Fig. 3e) and fragment ion level (+ 8.0 Å2, Fig. 3f) was obtained with an internal compound (NVS1) and its N-acetylated metabolite M7. The combination of fragment ion TWCCSN2, meas values, allowing the discrimination of ortho-, para-, or meta-conjugated metabolites due to minor changes in TWCCSN2, meas values [16] and the potential to utilize relative shifts in TWCCSN2,meas values for metabolite mapping purposes (see “Potential to elucidate structural relationships between metabolites”), would represent a powerful analytical tool for future metabolism studies to elucidate structural relationships between positional isomers and their subsequently conjugated metabolites, e.g., glucuronides.

Fig. 3
figure 3

Biotransformation-specific shifts in relative TWCCSN2, meas values obtained either on precursor ion (a, c, e) or fragment ion level (b, d, f) exemplified for hydroxylation, demethylation, and N-acetylation (proposals for charge localization and fragment ion structures are only tentative)

Potential to elucidate structural relationships between metabolites

One major benefit to considering relative shifts in TWCCSN2, meas values for biotransformation studies is the potential to elucidate structural relationships between metabolites in order to design metabolic pathways. During in vivo metabolism assessment of an internal compound (NVS1), several hydroxylated and glucuronidated metabolites were detected in pooled plasma samples (AUC0-24 h) following oral (rat) and intravenous administration (dog), respectively. In both studies, conventional MS fragmentation did not assist in structure elucidation and made the localization of the biotransformation impossible since similar product ion spectra were obtained (data not shown). In contrast, relative shifts in TWCCSN2, meas values provided hints about their structural relationships (Fig. 4). The direct O-glucuronide (M2) was conjugated to a rather flexible side chain whereas the N-glucuronide (M4) was conjugated to a rigid aromatic ring system. Consequently, both direct glucuronides exhibited different TWCCSN2, meas values (234.3 Å2 for M2 and 254.1 Å2 for M4). By knowing that hydroxylation of NVS1, leading to metabolite M1, increased the TWCCSN2, meas by + 6.2 Å2 relative to the parent drug, structural relationships of the detected hydroxylated and glucuronidated metabolites (M9, M10 and M11) relative to both direct glucuronides could readily be elucidated: the differences in TWCCSN2, meas of M10 (+ 25.2 Å2) and M11 (+ 25.4 Å2) were too high to originate from M2 (O-glucuronide). On the other hand, when the TWCCSN2, meas of M10 (259.5 Å2) and M11 (259.7 Å2) were compared to the N-glucuronide (M4), then the relative shifts in TWCCSN2, meas values matched with the expected one for NVS1 hydroxylation (+ 5.4 Å2 for M10 and + 5.6 Å2 for M11). Consequently, M10 and M11 were most likely structurally related to M4. On the other hand, M9 was most likely structurally related to M2.

Fig. 4
figure 4

Potential of relative shifts in TWCCSN2, meas values to elucidate structural relationships between metabolites as illustrated with an internal compound (NVS1) and its in vivo obtained hydroxylated and glucuronidated metabolites. Please note that the displayed mapping is based on relative changes in TWCCSN2, meas and not on enzymatic formation (hydroxylation with subsequent glucuronidation)

Limitations

Despite the above-highlighted benefits to consider relative shifts in TWCCSN2, meas values for biotransformation assignment/confirmation and metabolite mapping purposes, several limitations are also associated with the presented methodology: (i) biotransformation-specific relative shifts in TWCCSN2, meas values could only be obtained when TWCCSN2, meas values of the same ion species were compared with each other. It is not recommended to compare the TWCCSN2, meas of a protonated parent drug with one of its deprotonated metabolites. Moreover, TWCCSN2, meas comparison of a protonated parent drug with the sodium or potassium adduct of its metabolite will neither be successful. (ii) So far only singly charged ions were considered for our assessment. Nevertheless, this already allowed the comparison of a broad range of parent drugs/precursors with their corresponding metabolites since low molecular-weighted drugs with a mass of up to 500–600 Da tend to be mainly singly charged. For future activities, the proposed concept of biotransformation-specific relative shifts in TWCCSN2, meas values may also need to be verified for multiple charged ions. (iii) The current data set is still rather limited and needs to be further extended, e.g., by either adding more compound pairs to already investigated types of biotransformation or by incorporating additional types of biotransformation such as dealkylation reaction causing a significantly greater loss in mass.

Conclusions

The emerging role of highly reproducible CCS values, as an instrument and analytical method-independent physicochemical property of a molecule, becomes much more evident in various research domains. In our opinion, this should also be considered more frequently for biotransformation studies. Such analyte-specific parameters enable metabolite alignment between various biotransformation studies conducted at diverse analytical labs at different drug development stages. By accessing publicly available computational tools such as Waters CCSonDemand, accurate TWCCSN2 value prediction also seems nowadays possible eliminating the need to determine them experimentally.

More importantly, however, is to consider relative shifts in TWCCSN2, meas between a parent drug (or any precursor) and its corresponding metabolite rather than their absolute TWCCSN2, meas values: constant and discriminative relative mean shifts in TWCCSN2, meas values apparently exist as demonstrated for eight different phase I and II biotransformation which were not only obtained on precursor ion but also on fragment ion level. In the perspective of metabolism studies, such biotransformation-specific relative shifts in TWCCSN2, meas would mainly have two major benefits: (i) allowing metabolite identification/confirmation as an orthogonal approach to the conventional comparison of changes in m/z values. (ii) The mapping of metabolites in metabolic pathways can significantly be simplified. This would especially be important whenever isobaric metabolites are present, sharing multiple and similar combinations of phase I and II biotransformation but on different molecule locations, which unfortunately cannot be distinguished from each other due to identical MS fragmentation patterns. A simpler metabolite mapping process would be of particular importance for low-abundant circulating metabolites where no synthetic reference material is readily available due to either challenging biosynthesis or resource restrictions. Moreover, in perspective of regulatory guidelines from health authorities, e.g., EMA DDI guidance, demanding to elucidate 90% of detected metabolites in human excreta, the comparison of relative shifts in TWCCSN2, meas to map metabolites correctly would further reduce additional experiments to fully characterize relevant metabolites.