Abstract
The analysis of ceramide (Cer) and sphingomyelin (SM) lipid species using liquid chromatography–tandem mass spectrometry (LC–MS/MS) continues to present challenges as their precursor mass and fragmentation can correspond to multiple molecular arrangements. To address this constraint, we developed ReTimeML, a freeware that automates the expected retention times (RTs) for Cer and SM lipid profiles from complex chromatograms. ReTimeML works on the principle that LC–MS/MS experiments have pre-determined RTs from internal standards, calibrators or quality controls used throughout the analysis. Employed as reference RTs, ReTimeML subsequently extrapolates the RTs of unknowns using its machine-learned regression library of mass-to-charge (m/z) versus RT profiles, which does not require model retraining for adaptability on different LC–MS/MS pipelines. We validated ReTimeML RT estimations for various Cer and SM structures across different biologicals, tissues and LC–MS/MS setups, exhibiting a mean variance between 0.23 and 2.43% compared to user annotations. ReTimeML also aided the disambiguation of SM identities from isobar distributions in paired serum-cerebrospinal fluid from healthy volunteers, allowing us to identify a series of non-canonical SMs associated between the two biofluids comprised of a polyunsaturated structure that confers increased stability against catabolic clearance.
Similar content being viewed by others
Introduction
Sphingolipids (SLs) are a diverse family of structural and signalling lipids that comprise a broad range of biological functions crucial to normal physiology, cell signalling and trophic support1,2,3. SLs are defined by their sphingoid backbone, which in mammals consists predominantly of an 1, 3-dihydroxy, 18-carbon, mono-unsaturated sphingosine (d18:1), with variations to this long chain base also studied, including saturated dihydrosphingosine or sphinganine (d18:0), the di-unsaturated sphingadiene (d18:2), as well as mono- (m18:X) and trihydroxy (t18:X) configurations2,4.
Ceramide (Cer) is the central intermediate of the sphingolipid pathway, that consists of a sphingoid base amide-linked to a fatty acid of variable length, hydroxylation and degree of unsaturation5,6. Modifications to the C1 hydroxyl group of Cer allow for the formation of the more complex glycosylated SLs (e.g., cerebrosides, gangliosides), and the phosphorylated sphingomyelins (SM), which are the most abundant SL class in the plasma membrane of eukaryotes7. SM is frequently studied alongside Cer, owing to the functional importance of Cer-SM balance in cellular and inflammatory processes, as well as in ordered domain function (e.g., lipid rafts)8,9,10,11. Liquid chromatography-tandem mass spectrometry (LC–MS/MS) is the conventional approach for the analysis of Cer and SM lipids, as their fragmentation following positive ionisation ([M + H]+) generates characteristic ions of their sphingoid backbone (e.g., m/z 264 for d18:1) and side groups (e.g., m/z 184 for the choline headgroup of SM) to facilitate their identification (Fig. 1). Their fragmentation can also elucidate structural features of the fatty acyl and sphingoid base chain, including length and degree(s) of unsaturation (Fig. 1), which can help determine the precise position of carbon–carbon double bonds with specialised instruments4,12.
Specific precursor–product ion transitions can be employed in selective (SRM) or multiple (MRM) reaction monitoring experiments to improve the accuracy and reproducibility of SL analysis13,14,15, and are readily adapted into commercial (e.g., LipidSearch, LipidBlast, LipidAnnotator) and freeware toolboxes for the development of automated workflows (including setup, data acquisition and validation), thereby making their analysis amendable to non-experts16,17,18. However, resolving SL identities remains complex, as variations in headgroups, carbon backbone length and sites of unsaturation increase the probable number of variants with similar transitions, including isobaric and isomeric species19, that cannot be distinguished even with high-resolution instruments20. In addition, new structural species are continually being identified21,22, adding to this growing complexity.
Notwithstanding, the resolution of closely related SLs, including those with similar structures or nominal mass, can be achieved through LC separation, augmenting the accuracy of MS/MS assignments23,24,25. SL identities can be further validated by matching their chromatographic separation or retention time (RT) in complex samples against pure external standards or internal (deuterated) compounds (Fig. 2)26. Though this unequivocally resolves their identities, purchasing standards for all SLs of interest is not feasible, given many are not commercially available and the financial burden of purchasing copious numbers of SL variants.
Advancements in computational predictions have vastly improved RT estimation across various lipids (including SLs) and small molecules, for both targeted (SRM/MRM) and global lipid (untargeted) analyses27,28,29,30,31,32,33,34. However, numerous selective modifications to lipid separation (e.g., sample preparation, appropriate solvents, flow rate, chromatography matrix) are made to improve their detection and avoid batch effects. This can impose limitations on consistent RT estimations as modelling is often confined to comparable experimental conditions, including tissue type, potential class-specific biases and/or requires frequent retraining and validation30,33,34. This has precluded the capacity for robust or straightforward integration of RT estimates in lipid identification tools and experimental analyses.
To overcome these limitations in the assessment of SLs, Mass versus Relative Elution Time (MRET) profiling can be employed35, plotting two-dimensional data of the SLs nominated precursor mass (m/z), against their elution time (RT) (Fig. 3). Standards and controls (quality or internal) are utilised as “points of reference”, with the generated 2D plots used to visually extrapolate the position of other SLs within that class, using the knowledge of their known mass and characteristic fragmentation ions (i.e,. sphingoid backbone, C1 head group) as molecular descriptors to determine the RT of unknowns (Fig. 3). Recognising MRET patterns specific to a given SL family (Fig. 3B), as well as structural characteristics (e.g., degree of unsaturation, Fig. 3C,D), allows for the elimination of major interferences, resulting from ions of indistinguishable precursor and product fragmentation.
Presently, MRET profiling is manually performed. In this study, we aimed to automate the process by developing a bespoke, web-based tool entitled ReTimeML, aptly named to describe the task at hand—a calculated Remeasurement of the Time, on column, to predict the elution of (sphingo)lipids. ReTimeML was constructed through Machine-Learned regression of user RT assignments, collected from our database of LC–MS/MS analyses on Cer and SM lipids, and literature sources. Herein, we verified ReTimeML’s capacity to accurately annotate (> 99% accuracy) Cer and SM RTs, compared to expert-user assignments, across multiple tissues and LC–MS/MS experimental conditions. Notably, ReTimeML was successfully applied to aid the identification of noncanonical Cer and SM structures, resolve ion interferences, and guide the accurate annotations for Cer and SM expressional differences in cerebrospinal fluid (CSF) and paired serum collected from the same healthy volunteers (HVs).
Results
Regression model development and selection
We assessed the capacity of the nominated regression algorithms (Supplementary Data 1) to learn from descriptor information for identifying Cer and SM lipids (e.g., precursor mass, fragmentation) and user RT annotations of previous work (Supplementary Data 2). RT data was sequentially increased to determine the optimal training sample size, broken down by molecular features, for which regression model performance (evaluated on validation data) yielded coefficient of determination (R2) and root mean squared error (RMSE) values at the acceptance thresholds (R2 > 0.9, RMSE < 0.25), with the fulfilment of both criteria imperative. As anticipated, performance increased across all models when augmenting the training set size (Fig. 4). Lasso (alpha = 0.001) and ridge regression (alpha = 0.4) outperformed the other machine-learned models (Fig. 4), and were assigned as the optimal regression algorithms for Cer and SM RT estimations, respectively. Though comparable, lasso achieved a slightly more favourable R2 value for Cer (lasso: 0.930; ridge: 0.929, n = 9), while ridge regression appeared to be more beneficial for SM estimations (lasso: 0.915; ridge: 0.928, n = 6). Lasso yielded marginally lower RMSE values for both Cer (lasso: 0.091; ridge: 0.102, n = 9) and SM (lasso: 0.132; ridge: 0.178, n = 6), when applying the same number of datasets to concurrently meet our R2 > 0.9 prerequisites, though both were well-below the acceptable RMSE < 0.25 threshold (Fig. 4, Supplementary Data 2).
Performance evaluation and model robustness
The performance of ReTimeML’s lasso and ridge regression modelling was verified in four, independently performed, LC–MS/MS analyses on frequently observed d18:0, d18:1 and d18:2 Cer and SM lipid species (Tables 1 and 2). LC–MS/MS for Cer and SM species were performed on various tissues/fluids of rodents and humans, and different chromatography conditions (including an isocratic vs. gradient comparison). Cer and SM species for which experimental RTs were known (i.e., calibrators, internal/quality controls, experimentally determined), were assigned as points of reference (‘Train’), with the remaining unknown RTs extrapolated for each experiment (‘Test’). ReTimeML’s output provides users with an RT list for all Cer/SMs of interest, including those listed as references, which can be downloaded as a .csv file or directly copied into Excel or a similar spreadsheet. An MRET profile plot is also generated that displays the position of each calculated SL, organised into different degrees of SL structural unsaturation. Representative figures of ReTimeML’s output are illustrated in Fig. 5A and B. MRET plots were also manually constructed from user-validated assignments (Supplementary Fig. 1), with their eluting order supporting ReTimeML’s output, as well as previous literature on the separation of these SLs under reverse-phase conditions35,36,37. ReTimeML’s estimations displayed exceptional agreement when compared to user-determined RTs (Fig. 5C,D). ReTimeML predicted the RTs of 192 Cer and SM species, across the four LC–MS/MS experiments, with an average and median prediction error of 7.6 and 3.6 s, respectively, with each validation experiment achieving R2 > 0.99 when comparing ReTimeML estimations to experimentally determined RTs (Fig. 5E–H). Of the ReTimeML predicted RTs, 14 deviated more than 3% from user assignments, with errors of this magnitude occurring only when RTs were estimated under isocratic conditions (Tables 1 and 2). For gradient RT estimations, 23 of 142 deviated more than 1%, with only 2 of these estimates exceeding a 2% variance from user assignments (Tables 1 and 2).
Accuracy threshold assessment
ReTimeML’s performance in our verification studies prompted us to evaluate how its modelling accuracy responds to variations in the number and/or type (i.e., structure) of reference material employed. This was performed to ascertain the minimum requirement of reference material (‘Train’) that will achieve, on average, appropriate levels of accuracy (< 3% deviation from experimentally determined values). To investigate this potential constraint, we undertook a random sampling of our user-defined RTs from the verification LC–MS/MS experiments (Tables 1 and 2), which were then utilised as operational ‘Train’ values to evaluate their impact on ReTimeML’s subsequent extrapolation of unknowns. Nominated values were incrementally increased to assess the effects of fatty acyl chain length within a sphingoid base (e.g., d18:1/16:0 → d18:1/24:0), alongside structural variations from degree(s) of unsaturation (d18:0/XX:0, d18:1/XX:0, d18:1/XX:1, d18:2/XX:0 and d18:2/XX:1). In all random samplings, the internal control was maintained, with ReTimeML requiring a minimum of two points of reference to begin the estimation of unknowns.
Regardless of the selected RT material used, ReTimeML’s estimations on gradient LC–MS/MS experiments consistently outperformed isocratic measures, requiring fewer ‘Train’ points to achieve appropriate levels of accuracy (Fig. 6). ReTimeML estimations for gradient assessments were deemed suitable upon the employment of three structural variants (e.g., d18:0/XX:0, d18:1/XX:0 and d18:1/XX:1) as reference material, with average deviations from user-RTs ranging from 0.39 to 0.58% for Cer and 0.3–2.71% for SM (Fig. 6A–C,E–G). Increasing the number of reference points beyond this value does not greatly improve RT accuracy, with the exception of SMs assessed in mouse liver tissue where additional fatty acyl chains of the three structural variants improved accuracy to 1% and below (Fig. 6G). For isocratic LC–MS/MS, a larger number of references (n = 5–7) were required to achieve appropriate levels of accuracy (< 3%), with ReTimeML unable to attain a variance from user-RTs below ~ 1.5% (Fig. 6D,H). These results mirrored the similar numbers of ‘Train’ values and model accuracy achieved in our verification experiments (Tables 1 and 2). In all assessments, increasing the number of fatty acyl chain ‘Train’ points does not as drastically improve RT accuracy when compared to increasing the number of structural variants.
ReTimeML SL resolution in the absence of fragmentation distinguishment.
ReTimeML estimations help clarify the RTs of SLs where the occurrence of both isobaric and isotope distributions interferes with the correct peak identification in the total ion chromatogram (TIC), and cannot be resolved by supporting fragmentation data. This was particularly notable for di-unsaturated SMs that can correspond to either d18:1/XX:1 or d18:2/XX:0 in our MRM analyses, as the m/z 184 transition is insufficient to differentiate these isobars (Fig. 7A–C). To aid their resolution, secondary scanning of the m/z 262 (d18:2) and 264 (d18:1) sphingoid base was performed (Fig. 7D–I, Table 2). Sphingoid fragment selection proved useful in resolving SM(d18:2/XX:0) species (Fig. 7D–F) but presented complications in deriving the identities of SM (d18:1/XX:1) species, as the prominent peaks displayed with m/z 264 scans either did not align with MRET principles (Fig. 7G–H) or displayed multiple peaks, leading to further disambiguation of the correct identity (Fig. 7I). ReTimeML-guided RTs helped to either exclude (Fig. 7G,H) or correctly annotate (Fig. 7I) the SM (d18:1/XX:1) peaks in TICs. Peaks that did not align to ReTimeML estimates for m/z 264 transitions were attributed to isotope interference from mono-unsaturated hexosylceramides (HexCer (d18:1/XX:0), as illustrated (Fig. 7J–L). HexCer (d18:1/XX:0) species give rise to an [M + 1] isotopic ion that can interfere with the [M + H]+ ion of SM (d18:1/XX:1) (Fig. 7M–R). A similar comparison of the isotopic distribution and ReTimeML RT estimation helped to resolve SM (d18:2/XX:1) and SM (d18:1/XX:2), the latter of which were not assessed as their signal-to-noise ratios (S/N) were below the limit of detection (S/N < 3, data not shown).
Correlation of Cer and SM profiles between CSF and serum
ReTimeML-guided Cer/SM lipids were assessed in our paired CSF and serum samples, with both fluids collected from the same HVs. This resulted in the structural characterisation and quantification of 48 Cer and SM lipid species. Demographics for the 51 HVs included in this analysis are described in Supplementary Table 1 and their concentrations (pmol/mL) for each Cer and SM lipid quantified in both the CSF and serum are listed in Supplementary Data 3. Participants were recruited in Cologne, Germany and considered healthy at the time of body fluid withdrawal, with no close relatives having a psychiatric disorder. Participants were comprised of 30 women (58.8%) and 21 men (41.2%), average age of 27.3 (SD = 6.6) years, BMI of 23.0 (SD = 3.4) and were of Caucasian ethnicity, with the exception of two participants of African and Asian origin. Cer and SM profiles were comparable to previously reported CSF and serum concentrations of these SLs in healthy/control cohorts38,39,40,41,42. SMs displayed predominately higher concentrations compared to Cer, with d18:1 being the most prominent sphingoid backbone for both SL classes, regardless of body fluid type (Fig. 8A,B, Supplementary Data 3).
The comparative similarities in their CSF-serum profiles (Fig. 8A,B) prompted us to evaluate the respective associations between these two body fluids. First, pairings were treated independently by matching every SL concentration identified per individual in the CSF to that of its corresponding value in the serum. This resulted in a total of 606 individualised Cer, and 945 SM, CSF-serum pairings (Fig. 8C,D), with Pearson analysis revealing significant (p < 1 × 10−15) positive associations between the biofluids for Cer (r = 0.466) and SM (r = 0.844).
Mean CSF-serum correlations for each characterised Cer and SM structure were evaluated next. From the 48 Cer and SM structures identified, 41 recorded a sufficient number of CSF-serum parings (N > 10) to be subjected to a correlated coefficient analysis (r), with statistical values (p values) adjusted for multiple comparisons using the FDR approach of Benjamini, Krieger, and Yekutieli (Q < 0.05). As a result, 5 SLs were defined as significantly correlating between the CSF and serum (Fig. 8E). Notably, all of these significant associations were comprised of noncanonical, sphingadiene backbone, with 4 SMs displaying positive associations (SM (d18:2/16:1), SM (d18:2/18:1), SM (d18:2/20:1) and SM (d18:2/22:1), alongside Cer (d18:2/16:0) which showed a significant inverse correlation between the two body fluids. Correlation coefficients and statistical values for all Cer and SM associations are summarised in Supplementary Table 2.
Discussion
The objective of our study was to develop a user-friendly, analytically robust, tool for estimating RTs of the major d18:X SLs classes for Cer and SM, utilising our database of MRET behaviour in prior LC–MS/MS experiments, supplemented with literature sources, as the framework for building our model. Presently, no commercially available software or freeware incorporates RT data into their descriptor information to aid the identification of SLs in LC–MS/MS. However, it is strongly recommended that SL elution order be considered to reduce the likelihood of incorrect annotation in their identities16. The lack of RT incorporation is partly due to modelling overreliance on the specifics of the separation system used, with previous studies emphasising the need for future models to (1) be readily adaptable to differing experimental conditions and; (2) require as few reference values as possible28,32,43.
Applying MRET principles35, we demonstrate how ReTimeML is able to correctly assign various d18:0, d18:1 and d18:2 Cer/SM lipids based upon an understanding of their carbon chain length, degree of unsaturation and the C1 headgroup. We chose to focus on d18:X sphingoid bases as these comprise the most abundant SLs in mammalian organisms44. A similar approach has been developed for modelling the RTs of glycerophospholipids, assessing their equivalent carbon number (i.e., acyl chain composition), and expanding this across the different classes (e.g., phosphatidylcholine, phosphatidylethanolamine, phosphatidylserine)45. Both approaches have the distinct advantage of displaying no bias towards an MS-system (e.g., triple quadrupole (QqQ) or high-resolution orbitrap). This could have an added benefit for resolving Cer and SM lipids from global/untargeted LC–MS/MS analyses that employ the appropriate conditions for their separation46, particularly since high-resolution MS systems are without the added precision from MRM specifications to reduce TIC complexity. Though comparable, ReTimeML’s strength lies in its combination of learned material from previous experimental data, alongside the assignment of ‘real’ values (i.e., user-defined references, standards, internal controls) to enhance the decision-making process. This enabled ReTimeML to delineate common patterns for a given SL class, irrespective of LC–MS/MS methodology or experimental conditions, overcoming a major obstacle of RT adaptability for use in lipid identification software. Furthermore, ReTimeML is completely automated, making it accessible to those with limited knowledge of lipid biochemistry, and, thus, translatable to a base of researchers who may have shied away from the complex analyses.
Encouragingly, ReTimeML did not require an excessive number of references to estimate unknowns with a high degree of accuracy, meaning that users are not faced with the considerable costs of purchasing a copious number of standards. Moreover, for most experiments (excluding isocratic analyses) n ≥ 4 reference RTs did not drastically improve the accuracy of unknowns, and in certain instances increased deviations from user RTs, presumably due to data overfitting (Fig. 6). We recommend using a minimum of n = 3 reference points per SL class, be employed, comprised of different structural variants, as this provided the most accurate RT estimations (less than 1% variance for gradient and between 2.5 to 5.3% for isocratic, from user-defined measures, Fig. 6). ReTimeML’s lower accuracy of RT estimations in isocratic systems was attributed to the lesser data available for model training, as well as increases in peak-broadening from using single solvent systems that make analyte detection more difficult, particularly when lipids of interest span a wide polarity range or require the separation of closely related species47,48. Hence, isocratic measures are more suited to low numbers of SL analytes (n < 10) to reduce complexity but have the advantage of reduced run time with no requirement of column equilibration prior to subsequent measurements49.
For most SL analyses, n = 3 reference RTs would be achievable by pooling together a given LC–MS/MS experiments routine internal controls per lipid class (e.g., Cer, SM), or the closest structural/chemical equivalence, that control for lipid extraction inconsistencies and LC–MS/MS normalisation across sample cohorts50,51, alongside calibrators to externally quantify samples and/or QC mixture to control for instrument signal variability (e.g., ion suppression) and mitigate influences from matrix and batch effects during high throughput screening52,53.
Importantly, the accuracy of ReTimeML estimations was also adaptable to the reference material employed (Fig. 6), indicating that if a particular reference for a structure of interest (e.g., d18:2/XX:0) is not available, an alternate standard sourced from the same lipid class would suffice, provided our guidelines (n = 3 references, including structures) are retained. If standards cannot be procured, and we do acknowledge that certain classes of SLs (e.g., SM) are limited in their commercial availability, users can still choose to enter experimentally determined values from samples, though we strongly advise caution using this approach and recommend limiting this to RTs of SLs that are well-established or when signal interference is negligible to unequivocally resolve the correct RT on the TIC.
Herein we would also like to highlight ReTimeML’s proficiency at annotating RTs in complex TICs, particularly where ions of similar transitions interfere with correct peak annotation (Figs. 2 and 7), with their mass differentiation (~ 20 ppm) not achievable at the resolution of a QqQ system (~ 0.1–0.2 Da). ReTimeML correctly assigned RTs to the peaks of SM isobars (d18:1/XX:1 vs. d18:2/XX:0) using only the m/z 184 transition (Fig. 7A–C). This is particularly notable given this diagnostic ion is assigned only to detect the presence of a choline headgroup, with SM analyses requiring a secondary m/z 262, 264 or 266 scan(s) to determine the correct sphingoid backbone. This can be problematic in LC–MS/MS experiments for SMs as the choline headgroup is highly sensitive and its signal (up to 100× stronger) can suppress the detection of the sphingoid ion54, thereby limiting the structural information on SMs to the sum of its components (e.g., SM (36:2)) rather than at the fatty acyl/sphingoid base structure level (e.g., SM (d18:1/18:1))55. SM peak determination was further complicated by the presence of HexCer isotopic distribution, whose sphingoid transitions overlapped with di-unsaturated SMs (Figs. 7G–l). In the absence of RT annotation using ReTimeML, such signals could easily be misinterpreted, particularly if the m/z 184 was not in the MRM experiment or had already been assigned to a particular isobar (Fig. 7A,B).
Although a significant improvement in the resolution of SL variants, ReTimeML remains bound by the user’s LC setup. Should LC conditions not facilitate appropriate separation, interfering SL ions may be indistinguishable on the TIC. This has been proven to potentially cause artificial inflation of SL levels36, and represents a current constraint for ReTimeML to handle isomers of SLs (e.g., galactosyl vs glucosylceramide having been referred to as HexCer), given the chromatography conditions ReTimeML was trained on were incapable of their separation. Additionally, misidentifications from indistinguishable transitions of hydroxylated variants, even at trace levels, could artificiality inflate or cause miss-annotation (e.g., [M + H]+for Cer (d18:1/24:1) and [M + H–H2O]+ for Cer (t18:1/24:0) share the same 648.6 → 264.3 transition). Though not a component of this study, resolution of these hydroxylated variants can be achieved under suitable reverse-phase conditions56, nonetheless remain an important consideration when establishing LC conditions. Appropriate separation is also pivotal when considering the application of ReTimeML for processing SLs using high-resolution MS. As previously aforementioned, these untargeted measures provide no selective bias (i.e., MRM) towards SLs of interest, which could potentially increase the risk of overlapping or interfering ions from other lipid species. Updated versions of ReTimeML shall require training on additional setups that may circumvent these potential sources of interference, including normal-phase LC–MS/MS25 and next-generation ion mobility MS employing RT with collision cross-section57, capable of achieving additional SL class and structure interpretation.
ReTimeML was employed to aid the peak selection and structural annotation of Cer and SM lipids identified in HVs, providing both CSF and serum, allowing us to compare these body fluid profiles in the same subject. CSF is the closest anatomical fluid to the brain, likely to yield more applicable biomarkers for studying neuronal effects and conditions given its composition closely resembles that of the brain's extracellular space58,59,60. However, preconceived notions towards the invasiveness of the procedure (lumbar puncture) and the resulting distress to subjects have severely influenced its inclusion in clinical trials and broader use as a diagnostic fluid61,62. As SLs are enriched in the CNS and have exhibited a capacity to cross the blood–brain barrier63, their peripheral concentrations have the potential to act as surrogate markers in neurological and neuropsychiatric disorders64. To the best of our knowledge, this is the first reported CSF-serum SL comparison, and only the second paired blood-CSF lipid profiling investigation in HVs65.
Although Cer and SM concentrations in the CSF were found to be considerably lower than in serum, their relative structure-distribution patterns remained conserved and positively associated between the two body fluids (Fig. 8A–D). This is consistent with recently published CSF-plasma data65. Although Saito et al., reported contrasting results on overall lipid compositions, a closer inspection of their SM data revealed a similar conserved profile, with a positive correlation for the SM structural variants identified in their study (r = 0.811, average per lipid structure, n = 99; r = 0.760, individuals SL pairwise CSF-plasma pairings, n = 2,079; both p values < 0.0001, Supplementary Fig. 2). Their number of Cer identities was insufficient for an effective comparison (data not shown).
A handful of our SL identities exhibited highly stringent correlations, conserved between the CSF and serum (Q < 0.05, Fig. 8E). Interestingly, all the positively associated comparisons were categorised as belonging to the same SM structural variant, which included the presence of a sphingadiene backbone (d18:2, m/z 262), together with a mono-unsaturated fatty acyl chain. Though the presence of d18:2 on SLs was first identified in the late 60 s66,67,68, it has taken major advancements in LC–MS/MS sensitivity to enable routine assessments of these noncanonical structures, reviewed in69, meaning our understanding of their functional importance is largely undetermined. Sphingadiene backbones are exhibited in mouse kidney, brain, lung, and colon tissue SLs70, are reported to be the second most abundant sphingoid base in human plasma71, and a common constituent in plants and fungi72,73. Natural (soy) sphingadienes have been reported to inhibit intestinal tumoregensis in vivo, through disrupted Akt translocation74, and reduced Wnt transcriptional activity in colon cancer cells75. Clinical investigations have reported that d18:2 SLs may provide protection against the development of obesity and the risk of diabetes and cardiovascular disease76,77. Our own research observed a marked accumulation of d18:2 SLs (unpublished), following the ablation of sphingosine kinase 2 (SphK2) in a mouse model for Alzheimer’s disease (J20), which led to severe defects in myelin integrity78. However, we never resolved whether this shift towards sphingadiene-based SLs was a primary cause of myelin disruption. SphK2 is the major isoform for catalysing the phosphorylation of sphingosine into sphingosine 1-phosphate (S1P), the penultimate step in SL lysosomal catabolism via irreversible degradation by S1P lyase1,2. This pathway has been reported to be less efficient for the clearance of d18:2 SLs, over their d18:1 counterparts in vitro4,70, presumably a consequence of the angled nature of the cis-double bond79. Furthermore, it has been recently shown that d18:2 predominately converts to SMs over glycosphingolipids (i.e., HexCer)79. Hence, we speculate that stronger associations observed for SM (d18:2/XX:1) profiles between these peripheral systems may be rationalised by their preferential formation from sphingadiene precursors and the accompanied stability from cis-double bonds on both the sphingoid base and fatty acyl chain.
In this study, we set ourselves the objective of developing a freeware that could perform data transformation and feature engineering to estimate the RTs of Cer and SM identities from complex LC–MS/MS spectra, adaptable to the experimental conditions applied, and amenable to any level of LC–MS/MS experience and/or knowledge on the biochemistry of SLs. We believe that ReTimeML excelled at this objective, assisting with the identification process across multiple LC–MS/MS conditions, including structural annotation and the removal of interfering RTs from isobaric and isotopic measures. While we recognise the advantages from employing ReTimeML, we acknowledge the existence of unforeseen circumstances and have highlighted probable instances where ReTimeML may not be applicable to a user’s LC–MS/MS design. Hence, it is always advisable for users to conduct secondary analyses/measures to confirm their SL identities, particularly if the existence of isobaric compounds or ion interferences within the TIC are likely, albeit not directly observed.
Moving forward, our objective is the continued optimisation of ReTimeML, ensuring it grows with mass spectrometry development and refine its ability to guide RT annotations for further classes of SLs, including non-canonical variants. We also plan to expand ReTimeML assessments into other lipid classes, searching relevant data repositories (e.g., MetaboLights, Metabolomics Work Bench), and welcome support from users prepared to share their lipidomic data via the options provided (see ‘Data and Code availability’). In the long term, we envisage that with the incorporation of more machine-learned lipid class RT measures, ReTimeML could become an openly accessed and/or integrated function in current automated lipid identification software engines. In achieving routine RT annotations for SLs, we also drew attention to the physiological and pathophysiological importance of non-canonical SLs that are achievable with current mass spectrometric systems. It is hoped that our findings will further scrutinise their significance as molecular mediators in health and disease.
Materials and methods
Data integration
Regression models were fitted to Cer and SM data collated from our prior published material and that of the literature (Supplementary Data 2). Data incorporated met a minimum quantity of molecular and chromatography information, ensuring sufficient variance between descriptors to accurately fit with models. The information incorporated includes the sphingoid base/fatty-acyl naming for Cer and SM55,80 and/or chemical formula of the sphingolipid, m/z of the [M + H]+ precursor ion together, with a minimum of one fragmentation ion to aid the structural characterisation (e.g., m/z 264 for the sphingosine backbone), the chromatography system applied, stationary-phase column used, solvent conditions (gradient vs isocratic) and nominated flow rate. For each dataset, Cer and SM species were broken down into RTs that were either “user-defined” or “known” (e.g., standard), per study. The selected nominations were defined by the experimental datasets chosen controls (internal and QC), compounds used for calibration or to optimise the LC–MS/MS parameters during their method development.
Retention time learning algorithms
Linear and non-linear regression algorithms were assessed and evaluated based on their ability to predict RTs, alongside the required number of training samples to learn and achieve appropriate levels of accuracy. Variables associated with precursor mass, precursor mass squared, square root and the log of the precursor mass for each molecular sample were calculated for every included data point. Adopting the sphingoid base and fatty acyl notation55, a Python (version 3.10, Python Software Foundation) function was used to extract the molecular features, based on the nominated Cer/SM precursor mass and relevant fragmentation to deduce the carbon chain length, both on the sphingoid backbone and fatty acyl chain, the degree(s) of unsaturation, as well as specific modifications to the C1 head group (i.e., m/z 184 for choline phosphate of SM). Incorporated datasets were randomly divided into ‘training’ (70%) and ‘validation’ (30%), using Python’s inbuilt sklearn package, which was also used to train linear, lasso and ridge regression models. Python’s xgboost package was used to train an XGBoost regression algorithm. RMSE and R2 were used to evaluate the performance of each regression model’s RT estimations. The number of training, and hence corresponding validation data points were sequentially varied (± 3 datasets) to assess the minimum training size required for a given regression model to forecast RTs accurately (R2 > 0.95; RMSE < 0.25), with regression models ranked according to the number of training samples required to achieve these measurables. The complete list of regression models, along with R2 and RMSE scores per training size, are summarised in Supplementary Data 1.
Ceramide and sphingomyelin data acquisition
Top-ranked regression models for Cer and SM were selected for secondary verification across four, independently performed, LC–MS/MS assessments spanning human fluids (CSF and serum), mouse liver and rat brain homogenates (all unpublished analyses). Human CSF and serum were provided by our biobank at the Central Institute of Mental Health, Mannheim, with both donated from the same healthy participants (n = 51), originally recruited at the Clinic and Outpatient Clinic of Psychiatry and Psychotherapy, University of Cologne. The Ethics Committee of the Medical Faculty Cologne, University of Cologne, Germany (00-053) approved the use of these samples for this research. Rat brain tissue was procured from our prior animal investigation exploring behavioural changes following different tetrahydrocannabinol preparations81, approved by the regional authority State Agency for Nature, Environment and Consumer Protection of the State North Rhine-Westphalia (LANIUV-NRW). Only tissue from placebo-administered rats was assessed. SLs extracted from C57BL/6 mouse liver were in accordance with protocols (#2019-033), approved by the Research Ethics and Governance Office, Royal Prince Alfred Hospital, Sydney, Australia.
SLs from human fluids were extracted using the conventional Bligh and Dyer method82, while rodent liver and brain tissue SLs were extracted using single-phase methanol/butanol (1:1 v/v83) and two-phase methyl-tert-butyl ether (MTBE)/methanol/water (10:3:2.5, v/v/v4), respectively. Prior to extractions, all samples were loaded with Cer (d18:1/17:0) and SM (d18:1/12:0) as internal controls. All samples underwent MRM analysis performed on a TSQ Altis QqQ mass spectrometer (ThermoFisher), coupled to a Vanquish UHPLC system, as previous4. For SMs, secondary product ion scans for m/z 264 and 262 were included to help distinguish isobars (e.g., SM(d18:1/XX:1) and SM (d18:2/XX:0)) and the potentially conflicting phosphatidylcholine ions, which induce the same m/z 184 choline headgroup fragment. A complete list of Cer and SM lipids, together with their MRM transitions, are provided in Tables 1 and 2.
Chromatographic and stationary phase conditions were also varied between experiments. SLs extracted from human fluids were resolved on a 3 × 150 mm Agilent XDB-C8 column (5 μM pore size), using a modified Hejazi et al.35 binary gradient as follows: 0 min, 20:80 A/B; 2 min, 20:80 A/B; 7 min, 13:87 A/B; 14 min, 0:100 A/B; 20.5 min, 0:100 A/B; 21 min, 20:80 A/B; 24 min, 20:80 A/B. Mobile phase ‘A’ consisted of 0.2% formic acid, 2 mM ammonium formate in water; Mobile phase ‘B’: 0.2% formic acid, 1 mM ammonium formate in methanol. Total run time was 24 min, at a flow rate of 0.2 mL/min. Extracted SLs from mouse liver were separated on the same 3 × 150 mm Agilent XDB-C8 stationary phase, with modified gradient solvents and conditions as follows: 0 min, 20:80 A/B; 1 min, 20:80 A/B; 9 min, 5:95 A/B; 11 min, 0:100 A/B; 17.5 min, 0:100 A/B; 17.6 min 20:80 A/B, 20.5 min 20:80 A/B. Solvent ‘A’ comprised of 0.1% formic acid, 2 mM Ammonium acetate in water; Solvent ‘B’ composition of 0.1% formic acid, 2 mM Ammonium acetate in methanol. Total run time was 20.5 min at a flow rate of 0.3 mL/min. SL analysis of brains (rat) were resolved on a 2.1 × 100 mm Waters Acquity C18 UPLC column (1.7 µm pore size) under isocratic conditions, as previous84, using methanol with 0.2% formic acid as the mobile phase, at a flow rate of 0.25 mL/min for 15 min.
ReTimeML interface
ReTimeML’s pilot version is available as a free, open-source web interface powered by streamlit (https://mikeallwright23-retime-app-lipid3-021zpv.streamlit.app/). Users upload datasets in .csv format, consisting of the Cer/SM lipids of interest, alongside their precursor mass and whether the SL species included are a “Train” (reference with known RT included) or “Test” (unknown) value. Template .csv files are provided for users to test the interface (Supplementary Data 4 and 5), and can be adapted for their own SL analysis. Uploaded data (drag and drop option) triggers the automatic calculation of RTs, utilising the nominated RTs as points-of-reference (Train) to guide the extrapolated unknowns (Test), with users free to amend the number of train/test values. It is of note to mention a minimum of two ‘Train’ values is required for ReTimeML to extrapolate an output. Functions are applied within the web interface to automatically pre-process each data field, using regex functionality in Python, to feature engineer the number of carbon atoms and degree(s) of unsaturation for structural components on the sphingoid and fatty acyl chain, programming these as one hot encoded (ohe) variables, as well as the log of the mass, mass squared and square root of the mass. In addition, ReTimeML also provides an MRET profile output of estimations, annotated using second-order polynomial trendlines. The web interface also provides users the added option to voluntarily upload their own RT estimations, so that our team can evaluate and incorporate them into our working models.
Statistical analysis
Verified Human Cer and SM lipid datasets underwent peak integration using Xcalibur 4.4.16.14 software (ThermoFisher Scientific, San Jose, CA, USA), with Cer and SM species normalised as ratios to their class-specific internal control. A separate Cer/SM mixture, comprising various compounds (Supplementary Table 3), was run every 20 samples. This external mixture acted as a QC and provided additional RT references for Cer and SM. All Cer/SM mixtures were prepared in aqueous/organic proportions reflecting the starting conditions of the respective LC–MS/MS experiments. All Cer and SM values were first log-transformed (natural log) to obtain a normal distribution for Pearson correlations (r). For the assessment of individual lipids between CSF and serum, the resultant p values were adjusted for multiple comparisons using the Benjamini, Krieger, and Yekutieli false discovery rate (FDR) approach, with Q < 0.05 considered significant (GraphPad Prism software, Version 10.0.3, Dotmatics, Boston, MA, USA).
Data availability
All Cer and SM data supporting our findings has been provided in the manuscript and supplementary information. Datasets for ReTimeML training were collated from prior published material. Details for these studies, including the Cer and SM lipids assessed, has been provided in Supplementary Data 2. RT values for these studies can be made available by request to the corresponding authors. Source code for ReTimeML learned regression models is also available upon reasonable request. We encourage users of ReTimeML to voluntarily upload RT data for future lipid regression model development (.xls, .xlsx or .csv format), at either the following Google form https://forms.gle/p9fGuzuujZqDhcpD6 or Dropbox link: https://www.dropbox.com/request/kMOXwBe54IxGWeocE7WS.
References
Hannun, Y. A. & Obeid, L. M. Sphingolipids and their metabolism in physiology and disease. Nat. Rev. Mol. Cell Biol. 19(3), 175–191 (2018).
Merrill, A. H. Jr. Sphingolipid and glycosphingolipid metabolic pathways in the era of sphingolipidomics. Chem. Rev. 111(10), 6387–6422 (2011).
Giussani, P., Prinetti, A. & Tringali, C. The role of Sphingolipids in myelination and myelin stability and their involvement in childhood and adult demyelinating disorders. J. Neurochem. 156(4), 403–414 (2021).
Couttas, T. A. et al. A novel function of sphingosine kinase 2 in the metabolism of sphinga-4,14-diene lipids. Metabolites 10(6), 236 (2020).
Goni, F. M. & Alonso, A. Biophysics of sphingolipids I. Membrane properties of sphingosine, ceramides and other simple sphingolipids. Biochim. Biophys. Acta 1758(12), 1902–1921 (2006).
Hannun, Y. A. & Bell, R. M. Functions of sphingolipids and sphingolipid breakdown products in cellular regulation. Science 243(4890), 500–507 (1989).
van Meer, G., Voelker, D. R. & Feigenson, G. W. Membrane lipids: Where they are and how they behave. Nat. Rev. Mol. Cell Biol. 9(2), 112–124 (2008).
Fanani, M. L. & Maggio, B. The many faces (and phases) of ceramide and sphingomyelin I - single lipids. Biophys. Rev. 9(5), 589–600 (2017).
Taniguchi, M. & Okazaki, T. The role of sphingomyelin and sphingomyelin synthases in cell death, proliferation and migration-from cell and animal models to human disorders. Biochim. Biophys. Acta 1841(5), 692–703 (2014).
Maceyka, M. & Spiegel, S. Sphingolipid metabolites in inflammatory disease. Nature 510(7503), 58–67 (2014).
Kolesnick, R. The therapeutic potential of modulating the ceramide/sphingomyelin pathway. J. Clin. Investig. 110(1), 3–8 (2002).
Ryan, E. et al. Detailed structural characterization of sphingolipids via 193 nm ultraviolet photodissociation and ultra high resolution tandem mass spectrometry. J. Am. Soc. Mass Spectrom. 28(7), 1406–1419 (2017).
Bielawski, J. et al. Simultaneous quantitative analysis of bioactive sphingolipids by high-performance liquid chromatography-tandem mass spectrometry. Methods 39(2), 82–91 (2006).
Couttas, T. A. et al. Age-dependent changes to sphingolipid balance in the human hippocampus are gender-specific and may sensitize to neurodegeneration. J. Alzheimers Dis. 63(2), 503–514 (2018).
Sullards, M. C. & Merrill, A. H. Jr. Analysis of sphingosine 1-phosphate, ceramides, and other bioactive sphingolipids by high-performance liquid chromatography-tandem mass spectrometry. Sci. STKE 2001(67), pl1 (2001).
Peng, B. et al. LipidCreator workbench to probe the lipidomic landscape. Nat. Commun. 11(1), 2057 (2020).
Koelmel, J. P. et al. LipidMatch: an automated workflow for rule-based lipid identification using untargeted high-resolution tandem mass spectrometry data. BMC Bioinform. 18(1), 331 (2017).
Wong, J. W. et al. MMSAT: Automated quantification of metabolites in selected reaction monitoring experiments. Anal. Chem. 84(1), 470–474 (2012).
Merrill, A. H. Jr. & Sullards, M. C. Opinion article on lipidomics: Inherent challenges of lipidomic analysis of sphingolipids. Biochim. Biophys. Acta Mol. Cell Biol. Lipids 1862(8), 774–776 (2017).
Zullig, T. & Kofeler, H. C. High resolution mass spectrometry in lipidomics. Mass Spectrom. Rev. 40(3), 162–176 (2021).
Duan, J. & Merrill, A. H. Jr. 1-Deoxysphingolipids encountered exogenously and made de novo: Dangerous mysteries inside an enigma. J. Biol. Chem. 290(25), 15380–15389 (2015).
Sandhoff, R. Very long chain sphingolipids: Tissue expression, function and synthesis. FEBS Lett. 584(9), 1907–1913 (2010).
Merrill, A. H. Jr. et al. Sphingolipidomics: high-throughput, structure-specific, and quantitative analysis of sphingolipids by liquid chromatography tandem mass spectrometry. Methods 36(2), 207–224 (2005).
Sullards, M. C. et al. Analysis of mammalian sphingolipids by liquid chromatography tandem mass spectrometry (LC-MS/MS) and tissue imaging mass spectrometry (TIMS). Biochim. Biophys. Acta 1811(11), 838–853 (2011).
Zama, K. et al. Simultaneous quantification of glucosylceramide and galactosylceramide by normal-phase HPLC using O-phtalaldehyde derivatives prepared with sphingolipid ceramide N-deacylase. Glycobiology 19(7), 767–775 (2009).
Rana, N. A. et al. Qualitative and quantitative measurements of sphingolipids by mass spectrometry. In Bioactive Sphingolipids in Cancer Biology and Therapy (eds Hannun, Y. A. et al.) 313–338 (Springer, 2015).
Osipenko, S. et al. Machine learning to predict retention time of small molecules in nano-HPLC. Anal. Bioanal. Chem. 412(28), 7767–7776 (2020).
Stanstrup, J., Neumann, S. & Vrhovsek, U. PredRet: Prediction of retention time by direct mapping between multiple chromatographic systems. Anal. Chem. 87(18), 9421–9428 (2015).
D’Archivio, A. A. et al. Cross-column retention prediction in reversed-phase high-performance liquid chromatography by artificial neural network modelling. Anal. Chim. Acta 717, 52–60 (2012).
Fedorova, E. S. et al. Deep learning for retention time prediction in reversed-phase liquid chromatography. J. Chromatogr. A 1664, 462792 (2022).
Yang, Q. et al. Prediction of liquid chromatographic retention time with graph neural networks to assist in small molecule identification. Anal. Chem. 93(4), 2200–2206 (2021).
Aicheler, F. et al. Retention time prediction improves identification in nontargeted lipidomics approaches. Anal. Chem. 87(15), 7698–7704 (2015).
Bouwmeester, R., Martens, L. & Degroeve, S. Comprehensive and empirical evaluation of machine learning algorithms for small molecule LC retention time prediction. Anal. Chem. 91(5), 3694–3703 (2019).
Domingo-Almenara, X. et al. The METLIN small molecule dataset for machine learning-based retention time prediction. Nat. Commun. 10(1), 5811 (2019).
Hejazi, L. et al. Mass and relative elution time profiling: two-dimensional analysis of sphingolipids in Alzheimer’s disease brains. Biochem. J. 438(1), 165–175 (2011).
Huang, H. et al. LC–MS based sphingolipidomic study on A549 human lung adenocarcinoma cell line and its taxol-resistant strain. BMC Cancer 18(1), 799 (2018).
Huynh, K. et al. High-throughput plasma lipidomics: Detailed mapping of the associations with cardiometabolic risk factors. Cell Chem. Biol. 26(1), 71–84 (2019).
Fonteh, A. N. et al. Sphingolipid metabolism correlates with cerebrospinal fluid beta amyloid levels in Alzheimer’s disease. PLoS ONE 10(5), e0125597 (2015).
Hammad, S. M. et al. Plasma sphingolipid profile associated with subclinical atherosclerosis and clinical disease markers of systemic lupus erythematosus: potential predictive value. Front. Immunol. 12, 694318 (2021).
Hanamatsu, H. et al. Altered levels of serum sphingomyelin and ceramide containing distinct acyl chains in young obese adults. Nutr. Diabetes 4(10), e141 (2014).
Hammad, S. M. et al. Blood sphingolipidomics in healthy humans: Impact of sample collection methodology. J. Lipid Res. 51(10), 3074–3087 (2010).
Torretta, E. et al. Particular CSF sphingolipid patterns identify iNPH and AD patients. Sci. Rep. 8(1), 13639 (2018).
Ross, D. H. et al. Evaluating software tools for lipid identification from ion mobility spectrometry-mass spectrometry lipidomics data. Molecules 28(8), 3483 (2023).
Carreira, A. C. et al. Mammalian sphingoid bases: Biophysical, physiological and pathological properties. Prog. Lipid Res. 75, 100988 (2019).
White, J. B. et al. Equivalent carbon number and interclass retention time conversion enhance lipid identification in untargeted clinical lipidomics. Anal. Chem. 94(8), 3476–3484 (2022).
Marian, O. C. et al. Disrupted myelin lipid metabolism differentiates frontotemporal dementia caused by GRN and C9orf72 gene mutations. Acta Neuropathol. Commun. 11(1), 52 (2023).
Gritti, F. General theory of peak compression in liquid chromatography. J. Chromatogr. A 1433, 114–122 (2016).
Haidar Ahmad, I. A. Necessary analytical skills and knowledge for identifying, understanding, and performing HPLC troubleshooting. Chromatographia 80(5), 705–730 (2017).
Schellinger, A. P. & Carr, P. W. Isocratic and gradient elution chromatography: A comparison in terms of speed, retention reproducibility and quantitation. J. Chromatogr. A 1109(2), 253–266 (2006).
Holcapek, M., Liebisch, G. & Ekroos, K. Lipidomic analysis. Anal. Chem. 90(7), 4249–4257 (2018).
Kofeler, H. C. et al. Recommendations for good practice in MS-based lipidomics. J. Lipid Res. 62, 100138 (2021).
Pitt, J. J. Principles and applications of liquid chromatography–mass spectrometry in clinical biochemistry. Clin. Biochem. Rev. 30(1), 19–34 (2009).
Cheng, W. L. et al. Calibration practices in clinical mass spectrometry: Review and recommendations. Ann. Lab. Med. 43(1), 5–18 (2023).
Hsu, F. F. & Turk, J. Structural determination of sphingomyelin by tandem mass spectrometry with electrospray ionization. J. Am. Soc. Mass Spectrom. 11(5), 437–449 (2000).
Liebisch, G. et al. Shorthand notation for lipid structures derived from mass spectrometry. J. Lipid Res. 54(6), 1523–1530 (2013).
Hartler, J. et al. Automated annotation of sphingolipids including accurate identification of hydroxylation sites using MS(n) data. Anal. Chem. 92(20), 14054–14062 (2020).
Camunas-Alberca, S. M. et al. Integrating the potential of ion mobility spectrometry–mass spectrometry in the separation and structural characterisation of lipid isomers. Front. Mol. Biosci. 10, 1112521 (2023).
Blennow, K. et al. Cerebrospinal fluid and plasma biomarkers in Alzheimer disease. Nat. Rev. Neurol. 6(3), 131–144 (2010).
Lee, J. Cerebrospinal fluid biomarkers in various pediatric neurologic diseases. Clin. Exp. Pediatr. 65(2), 81–82 (2022).
Paraskevas, G. P. The role of cerebrospinal fluid biomarkers in dementia and other related neurodegenerative disorders. Brain Sci 12(5), 627 (2022).
Howell, J. C. et al. Research lumbar punctures among African Americans and Caucasians: Perception predicts experience. Front. Aging Neurosci. 8, 296 (2016).
Day, G. S. et al. Deciphering the factors that influence participation in studies requiring serial lumbar punctures. Alzheimers Dement. (Amst.) 12(1), e12003 (2020).
de la Monte, S. M. Triangulated mal-signaling in Alzheimer’s disease: Roles of neurotoxic ceramides, ER stress, and insulin resistance reviewed. J. Alzheimers Dis. 30(Suppl 2), S231–S249 (2012).
van Kruining, D. et al. Sphingolipids as prognostic biomarkers of neurodegeneration, neuroinflammation, and psychiatric diseases and their emerging role in lipidomic investigation methods. Adv. Drug Deliv. Rev. 159, 232–244 (2020).
Saito, K. et al. Profiling of cerebrospinal fluid lipids and their relationship with plasma lipids in healthy humans. Metabolites 11(5), 268 (2021).
Renkonen, O. & Hirvisalo, E. L. Structure of plasma sphingadienine. J. Lipid Res. 10(6), 687–693 (1969).
Polito, A. J., Akita, T. & Sweeley, C. C. Gas chromatography and mass spectrometry of sphingolipid bases. Characterization of sphinga-4,14-dienine from plasma sphingomyelin. Biochemistry 7(7), 2609–2614 (1968).
Panganamala, R. V., Geer, J. C. & Cornwell, D. G. Long-chain bases in the sphingolipids of atherosclerotic human aorta. J. Lipid Res. 10(4), 445–455 (1969).
Lam, B. W. S. et al. The noncanonical chronicles: Emerging roles of sphingolipid structural variants. Cell. Signal 79, 109890 (2021).
Jojima, K. et al. Biosynthesis of the anti-lipid-microdomain sphingoid base 4,14-sphingadiene by the ceramide desaturase FADS3. FASEB J. 34(2), 3318–3335 (2020).
Karsai, G. et al. FADS3 is a Delta14Z sphingoid base desaturase that contributes to gender differences in the human plasma sphingolipidome. J. Biol. Chem. 295(7), 1889–1897 (2020).
Sullards, M. C. et al. Structure determination of soybean and wheat glucosylceramides by tandem mass spectrometry. J. Mass Spectrom. 35(3), 347–353 (2000).
Aida, K. et al. Prevention of aberrant crypt foci formation by dietary maize and yeast cerebrosides in 1, 2-dimethylhydrazine-treated mice. J. Oleo Sci. 54(1), 45–49 (2005).
Fyrst, H. et al. Natural sphingadienes inhibit Akt-dependent signaling and prevent intestinal tumorigenesis. Cancer Res. 69(24), 9457–9464 (2009).
Kumar, A. et al. Chemopreventive sphingadienes downregulate Wnt signaling via a PP2A/Akt/GSK3beta pathway in colon cancer. Carcinogenesis 33(9), 1726–1735 (2012).
Chew, W. S. et al. Large-scale lipidomics identifies associations between plasma sphingolipids and T2DM incidence. JCI Insight 5(13), e126925 (2019).
Othman, A. et al. Plasma C20-Sphingolipids predict cardiovascular events independently from conventional cardiovascular risk factors in patients undergoing coronary angiography. Atherosclerosis 240(1), 216–221 (2015).
Lei, M. et al. Sphingosine kinase 2 potentiates amyloid deposition but protects against hippocampal volume loss and demyelination in a mouse model of Alzheimer’s disease. J. Neurosci. 39(48), 9645–9659 (2019).
Jojima, K. & Kihara, A. Metabolism of sphingadiene and characterization of the sphingadiene-producing enzyme FADS3. Biochim. Biophys. Acta Mol. Cell Biol. Lipids 1868(8), 159335 (2023).
Fahy, E. et al. Update of the LIPID MAPS comprehensive classification system for lipids. J. Lipid Res. 50(Supp 1), S9-14 (2009).
Rohleder, C. et al. Different pharmaceutical preparations of Delta(9)-tetrahydrocannabinol differentially affect its behavioral effects in rats. Addict. Biol. 25(3), e12745 (2020).
Bligh, E. G. & Dyer, W. J. A rapid method of total lipid extraction and purification. Can. J. Biochem. Physiol. 37(8), 911–917 (1959).
Alshehry, Z. H. et al. An efficient single phase method for the extraction of plasma lipids. Metabolites 5(2), 389–403 (2015).
Turner, N. et al. A selective inhibitor of ceramide synthase 1 reveals a novel role in fat metabolism. Nat. Commun. 9(1), 3165 (2018).
Acknowledgements
This research was supported by the Moyira Elizabeth Vine Fund, awarded to both T.A.C. (2023) and F.M.L (2020) for research into schizophrenia; Deputy Vice Chancellor-Start Up Funds from the University of Sydney (F.M.L); Medical Research Futures Fund, Australia (GA268519, B.G.); Brain and Intelligence Science (BISA) Sydney-Fudan University accelerator grant (B.G.); National Health and Medical Research Council Grant, Australia (GNT1162545, Y.Q.); and The Stanley Medical Research Institute (03-NV-003, F.M.L.). A.E.H. and B.J. were supported by Australian Government Research Training Program (RTP) Scholarships. We gratefully acknowledge subsidised access to the LC-MS/MS systems at Sydney Mass Spectrometry, University of Sydney.
Author information
Authors and Affiliations
Contributions
T.A.C. and M.A. conceived and designed the project. M.A. developed and uploaded the software tool under the supervision of B.G. Sample acquisition, analysis and interpretation of data was performed by A.E.H., B.J., L.H.C. and Y.C.J., under the supervision of F.M.L., Y.Q. and T.A.C. Writing of the manuscript was performed by T.A.C., with inputs from M.A., B.F.L.W. and A.E.H., C.R., A.S.D., B.G. and F.M.L. reviewed and edited the manuscript. All authors have read and agreed to the published version of the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests; F.M.L. is a shareholder of curantis UG (Ltd.) and received an Investigator Initiated Trial grant from Endosane Pharmaceuticals GmbH; C.R. is a shareholder of lero bioscience UG (Ltd.); M.A. is the director of Alchem-AI (Ltd.) and a trustee of MyWorld Creative Projects (Charity number: 1197162); B.G. is a director of Pacific Analytics (Pty Ltd.) and SMRTR (Pty Ltd.); B.G. is a an editor at Scientific Reports but was not involved in the peer review or editorial decision-making process for this submission; A.E.H. and C.R. are employees of Endosane Pharmaceuticals GmbH. The remaining authors have nothing to disclose. All research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Allwright, M., Guennewig, B., Hoffmann, A.E. et al. ReTimeML: a retention time predictor that supports the LC–MS/MS analysis of sphingolipids. Sci Rep 14, 4375 (2024). https://doi.org/10.1038/s41598-024-53860-0
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-024-53860-0
- Springer Nature Limited