Problems, principles and progress in computational annotation of NMR metabolomics data

Judge, Michael T.; Ebbels, Timothy M. D.

doi:10.1007/s11306-022-01962-z

Problems, principles and progress in computational annotation of NMR metabolomics data

Review Article
Open access
Published: 05 December 2022

Volume 18, article number 102, (2022)
Cite this article

Download PDF

You have full access to this open access article

Metabolomics Aims and scope Submit manuscript

Problems, principles and progress in computational annotation of NMR metabolomics data

Download PDF

3564 Accesses
5 Citations
14 Altmetric
Explore all metrics

Abstract

Background

Compound identification remains a critical bottleneck in the process of exploiting Nuclear Magnetic Resonance (NMR) metabolomics data, especially for ¹H 1-dimensional (¹H 1D) data. As databases of reference compound spectra have grown, workflows have evolved to rely heavily on their search functions to facilitate this process by generating lists of potential metabolites found in complex mixture data, facilitating annotation and identification. However, approaches for validating and communicating annotations are most often guided by expert knowledge, and therefore are highly variable despite repeated efforts to align practices and define community standards.

Aim of review

This review is aimed at broadening the application of automated annotation tools by discussing the key ideas of spectral matching and beginning to describe a set of terms to classify this information, thus advancing standards for communicating annotation confidence. Additionally, we hope that this review will facilitate the growing collaboration between chemical data scientists, software developers and the NMR metabolomics community aiding development of long-term software solutions.

Key scientific concepts of review

We begin with a brief discussion of the typical untargeted NMR identification workflow. We differentiate between annotation (hypothesis generation, filtering), and identification (hypothesis testing, verification), and note the utility of different NMR data features for annotation. We then touch on three parts of annotation: (1) generation of queries, (2) matching queries to reference data, and (3) scoring and confidence estimation of potential matches for verification. In doing so, we highlight existing approaches to automated and semi-automated annotation from the perspective of the structural information they utilize, as well as how this information can be represented computationally.

Good practices and recommendations for using and benchmarking computational metabolomics metabolite annotation tools

Article Open access 05 December 2022

NMRFinder: a novel method for 1D 1H-NMR metabolite annotation

Article 01 February 2021

MAW: the reproducible Metabolome Annotation Workflow for untargeted tandem mass spectrometry

Article Open access 04 March 2023

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

1.1 NMR in metabolomics and compound identification

Metabolomics has become a key component of modern biological and biomedical studies, providing rich information on an organism’s biological status in health and disease. However, to exploit its full potential, the field must address the fundamental problem of metabolite identification (Edison et al., 2021; Garcia-Perez et al., 2020; Monge et al., 2019). Metabolomic assays capturing the widest range of metabolites (untargeted approaches) yield many unidentified features (spectral elements defined across samples), each of which reports on one or more small molecules. Without identification, it is extremely difficult to investigate and understand the biological mechanisms at work, either by expert interpretation or by bioinformatic approaches such as pathway analysis.

NMR is widely accepted as one of the most powerful structural assignment tools available to the analytical chemist, yielding structural information on a wide range of levels for a huge range of molecules. Here, we focus on small molecules, but lipid and protein signals are also commonly detected. Despite these strengths, NMR is faced with several challenges, illustrated in Fig. 1, including overlap (two peaks occupying the same spectral region), peak shifting (due to pH or metal ions; Tredwell et al., 2016), relatively low sensitivity compared to mass spectrometry, spectral crowding, and complex peak shapes (discussed in more detail below). Further complications can arise from differences in resolution, field strength, and line shape across samples or studies. Full utilization of ¹H 1D data requires expert consideration of these variables.

As such, identification of compounds in metabolomic NMR spectra is a nontrivial process which is hard to automate. Unambiguous structural identification typically requires highly time-consuming examination by a field expert who can leverage the rich and nuanced theoretical concepts involved. In theory, any spectrum should be computable from first principles, and there is an excellent literature covering the identification of small molecules by NMR techniques for metabolomics and natural products research (Beniddir et al., 2021; Bingol et al., 2016; Dona et al., 2016; Garcia-Perez et al., 2020; Ellinger et al., 2013; Pauli et al., 2014; van der Hooft, Rankin 2016). We will not discuss these in detail here; instead we give an overview of the general annotation/identification process as shown in Fig. 2. Here we conceptualize a two-step process of generation of hypothesized annotations by comparing experimental and reference signatures, and hypothesis testing by manual, computational and experimental means. A common theme among annotation pipelines is that additional experimentation and final confirmation by comparison to an authentic standard are usually required for a positive and unique identification.

1.2 Annotation as a subprocess of identification

On the other hand, annotation, the assignment of putative candidates to an observed feature using physicochemical properties or spectral similarity and metabolite databases, has become an important step towards final identification (Eghbalnia et al., 2017; Everett, 2015; Monge et al., 2019; Sumner et al., 2007; Ulrich et al., 2019). While annotation does not provide unique and certain identifications, annotations are a first step and their confidence should be expressed on an appropriate scale (Joesten, Kennedy 2019; Sumner et al., 2007). Moreover, the information obtained by annotation methods is often suitable for large-scale biological hypothesis generation. Since this process is less stringent and scale is important, it is sensible to automate annotation when an acceptable balance between confidence and scalability exists.

1.3 Challenges in the automation of annotation

Annotation is also not easy to automate, however. Many difficulties can be traced to the complex nature of the mixtures analysed in metabolomics, as well as the relative lack of sensitivity and resolution of NMR compared to other analytical techniques. Once again, we point to excellent discussions of these issues in previous reviews (Beniddir et al., 2021). Instead, we aim to differentiate between the various computational approaches to automating annotation. Numerous additional difficulties and ambiguities in automation emerge from the implicit application of the knowledge and information the seasoned spectroscopist brings to a spectrum. We therefore find that it is helpful to delineate these approaches by the type(s) of spectral features used, the underlying structural information they utilize, and their computational characteristics. Note that in discussing these types of information we do not intend to replace long-standing terms used by the NMR community; rather, our intent is to suggest nomenclature which refers to how these elements are computationally derived and used in practical annotation. Furthermore, we will not attempt a comprehensive and rigorous assessment of available annotation tools, and point the reader to existing reviews documenting and carefully discussing existing tools (Beniddir et al., 2021; Misra, 2021).

2 Data, feature complexity, and structural information

Spectral elements (objects in a spectrum) are defined from multiple perspectives. First, the structural relationships revealed by different NMR experiments result in spectral elements of varying complexity (Table 1; Fig. 3). Next, these features are extracted and defined using characteristics which depend on that complexity; these in turn inform how features are represented computationally (box in Fig. 4). In most cases, the full range of computational characteristics are not used, and even complex features are reduced to one or two characteristics.

Table 1 Examples of features at different levels for butanone

Full size table

2.1 Features

In NMR, there are several levels of information. First, data points representing signal need to be separated from noise or background. This represents the most basic level of information. Likewise, information is added when a feature is defined; i.e. boundaries and characteristics are applied to a part of that signal, which is then recognized as a meaningful spectral element that can serve as an independent unit and as a basis for spectral comparison. For annotation in typical metabolomics studies, a feature must also be recognizable by one or more of its characteristics across more than one sample. Note that, in the case of a single spectrum, this mapping is implicit in spectral matching. Figure 3 illustrates examples of the types of structural relationships encoded by each feature type from different NMR experiments commonly used in metabolomics. Features gain complexity moving from left to right. As a concrete example using the molecule butanone, we give the common characteristic of chemical shift, or position, for features of each type in Table 1.

The simplest feature, a resonance, represents a Fourier-transformed signal from magnetically equivalent nuclei distributed about a frequency with a Lorentzian or Voigt-like shape (Marshall et al., 1997). Multiple resonances can result from chemically equivalent nuclei that exist in magnetically distinct environments because of spin–spin coupling (Hoye et al., 1994) (Fig. 3, pink ellipses). Resonances are typically described by five characteristics: frequency (at maximum; e.g. 2.42 ppm in Table 1), height (at maximum), Full-Width at Half-Max (FWHM), line shape and signal-to-noise ratio, but a frequency range is sometimes also given (see Fig. 4 for computational characteristics).

In 2D NMR, the basic feature is a crosspeak (2D resonance) resulting from the relationship between distinct nuclei (colored connections in Fig. 3). A crosspeak falls on the respective axes at the resonant frequencies of each nucleus being measured, and its existence directly indicates a specific molecular relationship between two nuclei depending on the type of experiment. The information provided by a crosspeak can be the association of two distinct nuclei across multiple bonds (HMBC and COSY), within a spin-coupling system (TOCSY), or between directly bonded nuclei (HSQC; INADEQUATE). A critical point is that the basic feature in 2D data marks a structural relationship between nuclei. Additionally, 2D relationships can be homonuclear or heteronuclear. Depending on the resolution and type of the experiment, the same resonance characteristics discussed above can be measured with the advantage of being resolved into an additional dimension.

2.2 Peaks and compound features

While 2D features report on structural relationships directly, comound features are needed to accomplish the same task in 1D data. To this end, there are two routes to describing more complex elements of a spectrum which correspond to two directions of information flow (Fig. 4). A spectroscopist can recognize a group of resonances as a peak if it is known to be a part of a signature, or using expert judgment. However, this implies a known connection between spectral elements and molecular structure and this information is often computationally inaccessible. Alternatively, clear patterns can be used to build compound features from simpler ones. On one hand, the features derived from higher levels of knowledge are more reliable because they make fewer assumptions about, for example, feature grouping. However, it may still be advantageous to build features of increasing complexity without this knowledge because they can then be described using more detailed characteristics. The terms ‘peak’, ‘resonance’, and ‘feature’ reflect these differences; their conflation causes a great deal of confusion in practice. Here we favor the NMR definition of a peak: a cluster of one or more resonances derived from chemically equivalent nuclei (e.g. colored protons in Fig. 3).

2.2.1 Simple peaks

Peak shapes are derived from splitting patterns of varying complexity (Hoye & Zhao, 2002; Hoye et al., 1994). Frequency differences (in Hz) between specific pairs of constituent resonances provide J-coupling constants, which are influenced by several aspects of the electromagnetic relationship between coupled nuclei, such as total bond distance, bond angle, and other effects. Coupling type, a description of the observed splitting pattern of a peak, is commonly reported (Wishart et al., 2022) as is the number of observed resonances comprising the peak (e.g. “triplet”). Idealized simple peaks exhibit predictable, symmetric resonance intensities. Additionally, peaks representing coupled nuclei share coupling constant(s), which can be used to confirm a coupling relationship between peaks in simple cases, and can act as a guide in more complex cases.

2.2.2 Complex peaks

Complex splitting can produce unique characteristics which can be utilized in annotation, including symmetry, splitting pattern complexity, and strong coupling effects (e.g. ‘roofing’). Often these peaks are characterized more broadly by center frequency, maximum height, and frequency range, or (most often) by a group of these measurements inherited from constituent resonances (e.g. a collection of frequencies). Lastly, peaks can exhibit specific positional variations (center frequency variation) due to several factors such as pH or metal ion concentration in the local chemical environment (Tredwell et al., 2016). Importantly, a peak which shifts due to these effects will shift as a whole unit with its shape unchanged (e.g., the apparent doublets in Fig. 1).

2.2.3 Peaks vs. compound features

From the computational perspective, when analyzing an unknown feature, peak-level information is rarely known and must be hypothesized based on patterns observable in the data itself. When peaks cannot be well-defined (due to intra- or extra-molecular overlap, or complex patterns), heuristics can still be used to derive hypothesized groupings of resonances for higher-order matching and assignment tasks (Cobas et al., 2013; Golotvin et al., 2002; Hoye & Zhao, 2002; Hoye et al., 1994). Here we will refer to the resulting spectral elements as compound features (Fig. 4; Table 1) to reflect their different origins from known peaks.

2.3 Signatures and metasignatures

The signature of a compound is its spectroscopic profile in a single spectrum for a given experiment type. In other words, it is what would be observed if a pure solution of the compound were measured. However, there are several ways to describe and derive a signature computationally, and these differences are important when comparing spectral profiles. For example, a hypothetical signature can be obtained from a STOCSY analysis (Cloarec et al., 2005), but this would carry different information compared to a reference spectrum of a pure compound, since signals from distinct compounds might share high statistical correlations. Furthermore, even with noise excluded, the full signature is not necessarily the ideal database query or descriptor. Complex overlap, and variable chemical shifts (∂) are not well-captured by a signature which is simply represented as a full-resolution vector. Instead, information needs to be extracted in the form of features at different levels to maintain flexibility when necessary, and the collection of these features together can form a more useful, derived signature (such as a set of “peak-picked” resonances, e.g. Table 1). This process involves data reduction, and needs to be done carefully so as to not exclude useful information. We use the term sub-signature to describe any subset of compound or simple features extracted from a signature.

Likewise, the concept of a signature can be extended to the collection of features attributed to a given compound across experiment types on a single or multiple samples. We refer to this as a meta-signature. Meta-signatures are a familiar but abstract concept, as they are implicitly produced when a spectroscopist aligns crosspeaks in 2D spectra to glean interatomic relationships (represented as a multigraph in the top right of Fig. 4). While a signature can be conceptualized as purely signal (in the case of pure compound spectra) or derived features, a meta-signature integrates the features observed across experiment types, and this integration necessitates feature definition and extraction at some level. This usually means relating features from different experiments based on common dimensions of their characteristics (e.g. aligning spectra based on a common ∂¹H axis). For example, crosspeaks representing ¹H–¹³C bonds in an HSQC can be linked using multi-bond connections observed in an HMBC (Fig. 4), or a COSY crosspeak can be used to link two coupled peaks (Table 1). Meta-signatures can therefore tie together components of a signature which may otherwise appear unrelated. We note that, although metasignatures are typically derived from 2D data, STOCSY analysis on multiple samples can also contribute feature connections in 1D data (Fig. 4; discussed below).

3 Extracting features, compound features, signatures, and subsignatures

Feature characteristics can be derived from different sources and approaches. Resonance characteristics, for example, are commonly extracted using peak-picking algorithms, binning/bucketing algorithms (Sousa et al., 2013), and approaches which attempt to deconvolute to individual resonances or reduce overlap in other ways (Zeng et al., 2020). While the latter would be preferable, deconvolution of NMR data is still an open problem in complex mixture analysis. As a result, extraction of all features should not be expected. Nonetheless, we detail some of the approaches used to extract characteristics for each feature type when possible.

3.1 Bottom-up: data-driven feature extraction

3.1.1 Simple features

In complex mixture data from experimental samples, features are usually extracted directly using data reduction approaches, including binning, peak-picking, and deconvolution. These approaches address common challenges. First, the issue of positional variation (the alignment or correspondence problem) is addressed by alignment algorithms (Vu & Laukens, 2013; Vu et al., 2011) and binning to combine a fixed or dynamic number of adjacent data points to capture the same resonance maximum across multiple spectra (Sousa et al., 2013). Binning yields frequency (∂, ppm) and intensity in the form of signal maximum or integrated area under the curve (AUC). If the spectra are well-aligned or grouped, resonances can be “peak-picked” using a variety of approaches such as local maximum above noise threshold (Koradi et al., 1998), or wavelet filters (Beirnaert et al., 2018; Du et al., 2006; Trbovic et al., 2005). These algorithms tend to be packaged with other utilities, or scripted in-house and do not often receive focused attention. Lastly, resonances can be obscured by overlap—a situation where two or more resonances are overlapped such that apparent resonance characteristics are altered or altogether masked (e.g. Fig. 1). This tends to be more of an issue in ¹H spectra due to the narrow dispersion of proton frequencies for small molecules (~ 0–12 ppm) compared to ¹³C frequencies (~ 0–200 ppm).

Deconvolution algorithms attempt to remedy this issue by decomposing a spectrum into its constituent features. However, the true number of resonances is usually unknown, and varying numbers of resonances can often fit the data equally well, requiring assumptions to be made about resonance characteristics to break this degeneracy (Cobas et al., 2013). A large amount of NMR signal collected in metabolomics experiments often goes unused as a result of inability to define features in overlapped regions. An interesting approach in this area is the complete reduction to amplitude-frequency table (CRAFT) approach, which extracts deconvoluted frequency and position characteristics from time-domain data and obviates several spectral processing steps (Krishnamurthy, 2013).

3.2 Compound features from 1D data

In 1D mixtures a fundamental issue for annotation is knowing if several features belong to the same molecule. In some 1D cases, compound features can be built up from resonances when differences in chemical shift are hypothesized to be J-coupling constants. Likewise, if splitting patterns are simple enough, then the expected signal ratio between peaks or overall peak shape (including symmetry, roofing, etc.) can be used to verify the connection between resonances and allow recognition of a compound feature. Early attempts at compound feature/sub-signature extraction emerged from Computer-Assisted Structure Elucidation (CASE) research, which focused on assignment of spectral features in combinatorial synthesis reactions with known structural motifs (Rossé et al., 2002). PROOFSTR recursively estimated splitting trees from 1D ¹H spectra while accounting for multiplet shape and symmetry (Golotvin et al., 2002). Likewise, MestreNova produced a J-coupling constant extraction tool (Cobas et al., 2005), and later developed an automatic assignment that extracts peak position, number of nuclei, and multiplet shape as a dimensionless “pure-shape characteristic” from the reference and experimental spectra (Cobas et al., 2013). However, these compound features must be considered cautiously, as the assignment of a compound feature to a physical mechanism (e.g. scalar coupling) cannot always be inferred with confidence from the mixture data alone.

3.3 Compound features from 2D data

For 2D multibond correlation experiments (COSY, HMBC), compound features can be built from basic spectral elements more reliably (Figs. 2 and 3, moving from left to right); that is, 2D peaks can be directly agglomerated into compound features by connecting pairs of crosspeaks which share a frequency on either axis. In heteronuclear spectra such as ¹H-¹³C HSQC, crosspeaks typically do not line up, as chemically distinct protons are less often bonded to the same carbon. Even though compound features obtained from 2D data are generally high-confidence because the data communicate bond information directly, orthogonal data are still typically needed to connect them in mixture data. Spin systems are often broken up in a molecule by a ‘silent center’, an atom lacking direct bonds to protons (Dona et al., 2016). In both 1D and 2D data, these silent centers prevent connecting nuclei across the entire molecule, meaning that their corresponding subsignatures are disjoint (e.g. the pink TOCSY signature in Fig. 3) and must be joined by data from other experiments. Alternatively, compound features can be derived from empirically or statistically sourced full signatures (sometimes obtained by a meta-signature or a collection of 1D spectra; see below).

3.4 Signatures and metasignatures are connecting points

Signatures themselves can be derived from four main sources. First, a spectrum of a pure compound can be acquired under controlled conditions, as found in reference databases. While such a signature is less complex than mixture data, features must still be extracted from it to do feature-based matching. These features (assuming no impurities or artefacts) are known to be in the same molecule, but, particularly in 1D cases, the difference between compound features and peaks typically remains unless such annotation is provided from another source (i.e. expert knowledge or other data).

Second, a predicted or computed spectrum can provide a range of signature information. In fact the Hierarchical Organization of Spherical Environments (HOSE) methods (Bremser, 1978) and others provide quite accurate ¹³C 1D spectrum predictions, and several approaches for ¹H 1D prediction have also been developed (Cobas et al., 2013; Dashti et al., 2017, 2018; Smurnyy et al., 2008). Prediction from structure is powerful in theory, because it does not rely as heavily on acquisition parameters, it is more scalable than collecting high-quality experimental data, and provides a first-principles, structural basis to link features to signatures or metasignatures. This offers a great deal of flexibility for matching. Recently, NMR spectrum-based chemical similarity networks which delineate substructure-subsignature releationships have been published (Egan et al., 2021; Flores-Bocanegra et al., 2022; Reher et al., 2020), and offer several opportunities. As these become more widely used, modeling steps could be re-used to expedite modeling of new compounds. Additionally, model updates and useful metadata could be propagated intelligently to compounds in the network, or to compounds in similar natural product compound networks (Kim et al., 2021).

Third, a metasignature can easily be decomposed into signatures. There are two key advantages to sourcing signatures from metasignatures: the signature is supported by orthogonal data types, and high-quality peak information may also be obtained. Previously disjoint compound features (e.g. blue HSQC connections in Figs. 2 and 3) can be linked together by a metasignature (combined connections in Figs. 2 and 3). Then, this information can be carried back down to the signature (i.e. single experiment) level as a high-quality signature (Fig. 4). This latter step is critical because most reference databases do not contain meta-signatures across multiple experiment types; rather, they contain signatures for a given experiment type derived either from pure compound spectra or integration into a metasignature (e.g. Bingol et al., 2012, 2014; Robinette et al., 2008). Meta-signatures themselves can be derived from 1 and 2D data, but generally not from 1D spectra alone. Potential exceptions, albeit much less common in metabolomics settings, include DEPT and ¹H-detected 1D HSQC/HMBC experiments, as bond information between nuclei is conferred in a 1D data type. For 2D multi-bond correlation experiments, such as long COSY (Dona et al., 2016; van der Hooft & Rankin, 2016) or HMBC (Bakiri et al., 2018), a complete signature can be built directly from the data when compound features share a chemical shift and all features are linked.

Lastly, in the case of complex mixtures, correlation-based dereplication approaches can extract compound signatures from a collection of NMR mixture data from different samples. The main statistical dereplication methods relevant to annotation include STOCSY (Cloarec et al., 2005), CLASSY (Robinette et al., 2009), SHOCSY (Zou et al., 2014), and STORM (Posma et al., 2012). For example, the Metabomatching suite uses three statistical methods for obtaining signatures, and then compares them (Khalili et al., 2019). Statistically derived signatures should be used with caution, as they commonly include artifacts or missing peaks, do not provide relative quantification, and are sensitive to overlap.

3.5 Top-down: high-quality subsignatures from signatures

Compound features must be taken as hypotheses when derived directly from complex mixture features and without incorporation of higher-level information. However, when a signature is available (either experimental, statistical, or computed from first principles) it can be broken down into related compound and simple features, or subsignatures typically based on spin systems. In simple 1D cases, modeling is often used to generate a signature and subsignatures. For example, the ChenomX suite includes a pH-dependent reference library which integrates subsignatures based on experimental data and modeled spin-coupling relationships (Mercier et al., 2011). Likewise, the GISSMO library provides a spin system matrix containing all pairwise coupling constants for each compound. Peaks and their profiles are then simulated for any field strength; furthermore, spin systems are compactly represented as frequency differences between resonances (Dashti et al., 2017, 2018). Alternatively, any method yielding compound features could also be applied to a signature.

2D signatures allow for more confident subsignature extraction. Demix demonstrated PCA-based demixing of 1D traces of TOCSY spectra with varied mixing times, where principal components yielded subsignatures corresponding to spin systems (Zhang & Brüschweiler 2004). This method connects data points into subsignatures on the basis of their co-occurrence in a meta-signature derived from statistical integration of multiple spectra collected on the same sample. On the other hand, DemixC decomposes single covariance-processed 2D ¹H-¹H TOCSY spectra into spin systems directly using a clustering approach, which allows it to be more robust against overlap but provides less-confident compound feature definition (Zhang & Brüschweiler, 2007). 2D signatures can also be projected to 1D for broader utility. Bingol et al. (2012) did this for ¹³C–¹³C TOSCY, 2D ¹³C–¹H HSQC-TOCSY, and 2D ¹H-¹H TOCSY data (Bingol et al., 2014), and subsignatures were then compiled into the TOCCATA (TOCSY Customized Carbon Trace Archive) database and ¹H(¹³C)-TOCCATA databases, respectively. Maximal clique-based subgraph extraction was also used to build spin systems from 2D ¹H-¹H TOCSY data, yielding complete spin–spin connectivity information (Li et al., 2017). Compound features, like those obtained using ¹H iterative full-spin analysis (HiFSA) simulations, require precise reporting (Pauli et al., 2014), but allow for more thorough feature characterization (e.g. Pauli et al., 2016) and even modularized structural subsignatures (Napolitano et al., 2015). Lastly, MADByTE utilizes metasignature information derived from TOCSY and HSQC data to pull out “spin-system features”, which are then compared (by chemical shift) with experimental data (Egan et al., 2021; Flores-Bocanegra et al., 2022).

4 Matching: leveraging computational characteristics

The information underlying different types of features determines the computational approaches that can be used for comparison with reference databases in the annotation process. We next give examples of the use of different types of information in database matching, and note important algorithms used. Our discussion is intended to be illustrative rather than comprehensive. Once information has been extracted from the data, spectral comparison to reference data can take place. It must be noted that currently available reference databases are extremely limited in their coverage of the space of all possible metabolites, and the representation of molecules tends to be biased towards those found in the most common sample types (e.g. blood serum, urine, or tissue extracts). Database search styles for spectral data can be divided into three main types, each distinguished by its output (Mohamed et al., 2015; Zürcher et al., 1988). Either (1) an exact match can be sought (with no flexibility), or (2) results most similar to or consistent with the query data can be returned in a ‘ranked’ approach; this is the search method most groups have pursued. Alternatively, (3) an ‘interpretative’ search can provide a set of subsignature-specific matches, which allows a compound structure to be deduced from the combination of subsignatures. (Mohamed et al., 2015) argue that, although an interpretative search is feasible for ¹³C data (Koichi et al., 2014), it is not tenable for 1D ¹H NMR because of overlap and other issues. However, several recent tools have challenged this notion by factoring in subsignatures, or subsignature-substructure relationships, during matching (Charris-Molina et al., 2020; Wang et al., 2019). Such approaches may hold promise in mitigating the issue of reference database coverage and bias by incorporating more information while allowing partial matches to subsignatures that may be shared between metabolites.

Similarly, matching algorithms leverage the advantages of different data types. There are different types of spectral comparisons that can be made to database entries (Mohamed et al., 2015). We point these out, and build on this framework to include different forms of information which can be extracted from common NMR experiments. Indeed, much of the variation between reference-based annotation programs results from the choice of information used for matching.

4.1 Comparisons between feature lists

Common search functions (Cui et al., 2008; Robinette et al., 2008) formulate matching as an assignment problem, and employ the Hungarian algorithm as the core computational workhorse. This classic bipartite weighted graph assignment algorithm permutes the cost (or peak distance) matrix calculated between lists of resonance frequencies to minimize its trace (Kuhn, 1955). One downside to this approach is that, when the query and reference lists have very different sizes, the overall score is not penalized by unmatched peaks. To address this, Robinette et al. applied the algorithm iteratively, eliminating previously matched peaks in each iteration to force (dummy-matched) peaks to be matched with worse scores. The sum total distance of a match is then used for ranking matches (Bingol et al., 2012, 2014; Robinette et al., 2008). This ranking strategy is well-suited for empirically-derived queries such as those obtained from 2D data using e.g. DemixC (Robinette et al., 2008) or TOCCATA (Bingol et al., 2012, 2014), or when reference peaks exceed the number of observed query peaks. However, it is not optimal for the peak lists obtained from statistical deconvolution/dereplication approaches, which are typically rich in false-positives and require more flexibility. In other words, though the match computation is the same, the meaning of a match is different when a query can be assumed to be devoid of false positive signals. Finally, once peaks are matched, the Tanimoto (Jaccard) coefficient can be used as a scoring function that accounts for the number of expected peaks vs. matched peaks (Mohamed et al., 2015). Although not extended yet to complex mixtures, DP4-AI uses an innovative probabilistic matching mechanism that is an extension of the Hungarian algorithm and gives an overall fit metric (Howarth et al., 2020).

4.2 Compound feature/subsignature-driven comparisons

Importantly, while the sum of an independent set of subsignatures and/or resonances looks the same as a signature, there is a difference between the two when it comes to matching. The question of how well a small set of linked subsignatures fits a small set of reference features is much more constrained than the question of which features, if any, correspond to that small set of reference features. This constraint has important implications for false positives as well as computational tractability. Positional variations require flexibility, which necessarily introduces degeneracy and increases the effective number of sub-hypotheses that need to be tested to assess a match. This increases the chance of false positives at the feature matching stage, which will translate to false positive annotations with opaque justification. Subsignatures, therefore, are perhaps the best basis for flexible matching. Compound features can also be used; they are less reliable, but may afford similar flexibility for compounds which only have 1D data in reference databases and do not benefit from high-quality peak information derived from expert annotation or 2D data.

Likewise, while resonance-level information is more granular, it often lacks information needed for dependable matching. Various aspects of 1D ¹H data have been used for matching, including peak position, but also line shape, symmetry, and J-coupling constants. These characteristics were incorporated into a single bespoke match factor for similarity to spectra predicted from structural motifs (Golotvin et al., 2006). Likewise, MestreNova developed an automatic assignment approach that checks annotations by comparing predicted and experimental multiplet pairs on the basis of peak position, number of nuclei, and the multiplet “pure-shape characteristic” (Cobas et al., 2013).

ChenomX fits peak subsignatures to experimental data and allows them to move independently in an interactive manual fitting process. The software also provides an automated fitting function driven by a genetic/simulated annealing algorithm that incorporates the shapes, positions, and resonance widths of each reference multiplet peak cluster for a given 1D ¹H spectrum (Mercier et al., 2011). However, it is important to note that fits are computed for individual experimental 1D spectra. Therefore, matching only relies on the data for one spectrum and does not utilize information from other sources, such as correlation dereplication methods (Note: the algorithms underlying the software have likely developed beyond this point, but we are unaware of literature detailing those updates).

The COLMARm query system uses subsignatures and their 1D projections for computational demixing and annotation (Bingol et al., 2012, 2014; Robinette et al., 2008). Users upload up to three types of 2D spectra for the same complex mixture sample, and receive automated annotation of those spectra based on several databases with minimal input. The large number of high-quality reference spectra, the accuracy of the annotations, and the interface provided make this a very popular tool set.

Other methods employ spin-system extraction approaches driven by 2D data (Li et al., 2017; Napolitano et al., 2015) allow comparison (by lists of chemical shifts) to database subsignatures, and even enable substructure spectra as building blocks for larger molecules (Napolitano et al., 2015).

4.3 Full signature comparisons

Full spectral signature comparison methods allow the most flexibility by capturing the “broader picture” of a signature. These can be integrated with information from lower levels during or after fitting (as filtering steps), and are broadly organized into those which handle reduced signatures (to linked subsignatures or features), and those which utilize full-resolution data directly. A signature as a set of linked features, can be compared using any of the characteristics of those features. However, the integration of this information can be difficult to interpret/control, and can be implicit in the scoring mechanism used, like the Tanimoto (Jaccard) measures (Mohamed et al., 2015).

For full-resolution treatment of signatures, (Mohamed et al., 2015) lay out three classes of spectral comparisons for numerical vectors: correlation-based, distance-based, and tree-based metrics. The latter is less commonly used, and is rooted in the idea of a balance of signal mass across the spectrum, to allow flexible global pattern matching. This approach leverages the intuition that similar molecules have generally similar spectral properties (Castillo et al., 2013; Zürcher et al., 1988), and would be useful for comparing signatures which have positional differences at the local level. However, criteria such as matching coupling constants may need to be screened downstream. Additionally, model-based metabolite fitting and quantification methods, such as BATMAN (Hao et al., 2014), BAYESIL (Ravanbakhsh et al., 2015), and ChenomX (Mercier et al., 2011) are inherently full signature comparisons, and they differ from the metrics discussed above.

Like the fitting methods, deep learning and pattern recognition-based (Dubey et al., 2015; Hubert et al., 2014; Napolitano et al., 2013; Wolfram et al., 2006) methods, in theory, use optimal weighted combinations of spectral information across all levels to satisfy the training goal, whether that be sample classification, molecular classification, identification of substructures, or distinguishing molecules and even isomers. SMART (2D, Zhang et al., 2017) and SMART 2.0 (1D, Reher et al., 2020) approaches use deep learning to learn substructures which distinguish compounds; however, the degree of integration of these as a whole signature is unclear. Finally, another deep learning tool which classifies natural products based on molecular substructures may assist in exploring this direction further (Kim et al., 2021). The major challenge here is the availability of high-quality labelled training data; however, this has been remedied by the Siamese network architecture, which performs well for data with few training examples per class (Zhang et al., 2017). Depending on the nature of the training data (complex or simple mixture data), deep learning methods could conceivably even account for extramolecular spectral cues such as matrix-related effects and biases. It is important to consider these potential influences as more deep learning and full-signature methods are employed.

Much of the recent work in signature-based comparison comes from advances in forward and reverse prediction of ¹³C reference spectra, particularly when the molecular formula is known. Imitation learning was used to generate molecular graph topologies consistent with chemical shifts and peak splitting in a ¹³C 1D NMR signature of an unknown (Jonas, 2019). Another, iterative algorithm builds a tree of structures consistent with the experimental signature using a set of constraints and a de-novo molecule generator. Reference spectra for the molecules are computed using quantum chemistry, and are compared to the signature using the Wasserstein distance (Zhang et al., 2020). In another approach, ¹³C NMR queries are matched directly to HOSE- and MPNN-predicted chemical shifts of candidate molecules using cosine similarity (Kwon et al., 2021). Finally, an ML-driven model was recently trained to recognize hundreds of substructures in 1D ¹³C NMR data, which can then be used for automated structure elucidation. The model uses different pooling layers with the idea of optimizing feature and compound feature characterization separately. The method can also work with ¹H data (Huang, 2021).

We are not aware of any platform that utilizes meta-signatures explicitly; this may be accomplished using multigraph methods or weighted combinations of independent match scores for each and may be a fruitful avenue of future research. However, (Joesten & Kennedy, 2019) outline a general formalized approach to using metasignature information for ranking putative annotations manually. Likewise, (Jonas, 2019) uses a Markov decision process approach to build molecular graph topologies from ¹³C chemical shifts and splittings. A few subsignature-based methods do simultaneously match resonances in multiple spectrum types, including COLMARm (Bingol et al., 2014), and MADByTE (Egan et al., 2021; Flores-Bocanegra et al., 2022), and graph-based comparisons are used in connectome research (Frigo et al., 2021).

5 Assessing and communicating confidence

5.1 Ranking hypotheses

Algorithms for matching an experimental signature to a reference database will almost always return matching results for a given query, as the matching process generally produces at least one (true or false positive) match when sufficiently large reference databases are used. However, for these results to be practically useful, users need an estimate of the matching confidence to filter and rank each potential annotation. This way a decision can be made on further use (e.g. in pathway analysis) or to go forward with confirmatory experiments. A simple proxy for confidence is the match score, which quantifies the goodness of fit between the query signature (e.g. list of chemical shifts) and the matching reference. So how can this be done for matches generated from features of different complexity, or feature lists of different sizes?

The simplest metrics for goodness of fit involve the proportion of observed to all possible features matched within a tolerance (typically ∂ at feature max); this is given by the Jaccard (or Tanimoto) index which has also been applied to chemical structure fingerprints (Bajusz et al., 2015). Total distance between the best-fitting features in lists can be used, where unevenly sized lists are penalized (Robinette et al., 2008). Intensity differences can also be included, but this introduces the issue of ensuring comparable intensities between the query and reference. Generally, a good starting concentration is the maximum allowed such that no reference signal exceeds a matching query signal (Hao et al., 2014).

Metrics useful for signatures and compound features allow for more detailed comparisons using higher-order information, such as peak shape and accounted-for signal. Spectral similarity metrics differ across studies. Strength of similarity can be used to rank annnotations; although in general, p-values from statistical tests associated with correlation-based metrics should not be taken as accurate as spectral data are not independent. It may be more useful to express confidence as signal accounted-for, e.g. using root-mean-square error (RMSE), or Wasserstein distance (Zhang et al., 2020). Composite scores can also incorporate other scores (e.g. proportion of expected peaks matched), each of which must be given an appropriate weight which is hard to determine. Furthermore, if compound features or subsignatures are matched independently, a partial signature match may still be worth reporting. Irrespective of the score used, it can be hard to decide between many highly ranked hits, and setting a threshold on the score for an acceptable match is extremely problematic.

5.2 Controlling false positives

One approach is to find a threshold which limits the rate of false positive annotations, by estimating the distribution of the match score under a suitable null hypothesis. The decoy database is a key idea in this direction, which has been widely used to estimate annotation confidence in mass spectrometry-based analysis (both proteomics and metabolomics) but has not yet received much attention in the NMR annotation field. In this approach, one generates a reference database composed of artificial entities (peptides, metabolites etc.) which are as similar as possible to the true reference database, while being clearly identifiable as incorrect hits. For example, in proteomics, one can use reversed sequences to produce a decoy database of peptides with similar size and amino acid distribution to the reference, yet where each entity cannot be a correct annotation. A simple approach in NMR could involve constructing decoys by picking peaks at random from the reference database, in such a way as to maintain the reference distribution of numbers of peaks per compound. This approach has been applied in MS metabolomics (Elias & Gygi, 2010; Scheubert et al., 2017) and improved upon by using fragmentation tree information and peak co-occurrence. This latter idea also seems straightforward to apply to other spectroscopic data, but it remains to be seen whether decoy databases can provide reliable confidence estimates for NMR annotation.

Whether one uses the decoy idea or not, it is clear that the nature of the reference database will heavily influence the confidence of any annotation. In any database matching problem, a larger reference database increases the chances of a false positive match. This is a well-known problem in sequence based bioinformatics (e.g. BLAST search) where p-values are converted to E-values to account for this effect. On the other hand, the chance of a false negative (i.e. lack of any match) is increased with smaller or incomplete databases, which will nearly always be the case given the vast chemical diversity of potential metabolites. Thus, for realistic matching strategies, there will always be a balance between recall (ability to make any match) and precision (accuracy of the match), and larger databases will require higher fidelity matching to avoid rampant false positives. Beyond just size, the composition of the database will also have a large impact. For example, a database focused on compounds known to be present in normal human blood may be excellent for annotation of plasma derived spectra, but contribute many false positives and negatives when used to annotate samples of a different type.

One aspect of database heterogeneity that can be useful in scoring or confidence estimation is the notion of peak uniqueness. A peak is said to be unique for a given database if there is no other compound with a peak at the same chemical shift. Any match to a unique peak could be considered strong evidence that the reference compound is present (assuming the other reference peaks are also observed). This uniqueness could be taken into account in match scoring or confidence estimation, perhaps by upweighting highly unique matches. The idea can be extended to unique patterns of multiple peaks which has a clear application when scoring reference compounds with identical signatures (Tulpan et al., 2011). It also has the advantage that the weighting can be calculated automatically for any database and does not depend on human interpretation. To summarize, it is currently the case that there is no standard way to estimate confidence of a match. However, there are several ideas which could be applied and which are independent of the matching algorithm itself.

6 Overall conclusions

Computational annotation accelerates the identification process by suggesting high-quality hypotheses that can then be tested computationally and experimentally. These processes fundamentally rely on the various layers of interconnected information contained within and across NMR spectra. Therefore, it is necessary to have a clear understanding of how data are interpreted for structural information, as well as how they are handled in computational comparisons. By viewing the wealth of annotation software available to the NMR community through these lenses, several key points emerge:

Reduction of spectra to individual sets of features may not be as straightforward as it first appears. The same data points can be interpreted in a range of ways.
The optimal comparison to a reference signature is likely to take place at similar information levels for both experimental and reference signatures/subsignatures. Matching at higher levels allows for flexibility and is expected to be more reliable as meaningful features (e.g. peak shapes) are being compared.
Matching algorithms must be appropriate for each data type, and scoring metrics can exert subtle influences on rankings and performance quality.
Significant progress has been made recently in the incorporation of powerful statistical methods like deep learning in the interpretation and prediction of NMR spectra, from features to signatures.

As the field progresses, reporting of annotations, and confidence in them, should be based on the structural information employed in the annotation process. Each level of information provides a different form of evidence for an annotation, and it is expected that this multifaceted approach to the problem will be increasingly used as the field moves forward.

Data availability

Data sharing not applicable to this article as no datasets were generated or analysed during the current study.

References

Bajusz, D., Rácz, A., & Héberger, K. (2015). Why is Tanimoto index an appropriate choice for fingerprint-based similarity calculations? Journal of Cheminformatics, 7(1), 20. https://doi.org/10.1186/s13321-015-0069-3
Article CAS Google Scholar
Bakiri, A., Hubert, J., Reynaud, R., Lambert, C., Martinez, A., Renault, J.-H., & Nuzillard, J.-M. (2018). Reconstruction of HMBC correlation networks: A novel NMR-based contribution to metabolite mixture analysis. Journal of Chemical Information and Modeling, 58(2), 262–270. https://doi.org/10.1021/acs.jcim.7b00653
Article CAS Google Scholar
Beirnaert, C., Meysman, P., Vu, T. N., Hermans, N., Apers, S., Pieters, L., Covaci, A., & Laukens, K. (2018). Speaq 2.0: A complete workflow for high-throughput 1D NMR spectra processing and quantification. PLOS Computational Biology, 14(3), e1006018. https://doi.org/10.1371/journal.pcbi.1006018
Article CAS Google Scholar
Beniddir, M. A., Kang, K. B., Genta-Jouve, G., Huber, F., Rogers, S., & van der Hooft, J. J. J. (2021). Advances in decomposing complex metabolite mixtures using substructure- and network-based computational metabolomics approaches. Natural Product Reports, 38(11), 1967–1993. https://doi.org/10.1039/D1NP00023C
Article CAS Google Scholar
Bingol, K., Bruschweiler-Li, L., Li, D.-W., & Brüschweiler, R. (2014). Customized metabolomics database for the analysis of NMR 1H–1H TOCSY and 13C–1H HSQC-TOCSY spectra of complex mixtures. Analytical Chemistry, 86(11), 5494–5501. https://doi.org/10.1021/ac500979g
Article CAS Google Scholar
Bingol, K., Bruschweiler-Li, L., Li, D., Zhang, B., Xie, M., & Brüschweiler, R. (2016). Emerging new strategies for successful metabolite identification in metabolomics. Bioanalysis, 8(6), 557–573. https://doi.org/10.4155/bio-2015-0004
Article CAS Google Scholar
Bingol, K., Zhang, F., Bruschweiler-Li, L., & Brüschweiler, R. (2012). TOCCATA: A customized carbon total correlation spectroscopy NMR metabolomics database. Analytical Chemistry, 84(21), 9395–9401. https://doi.org/10.1021/ac302197e
Article CAS Google Scholar
Bremser, W. (1978). Hose—a novel substructure code. Analytica Chimica Acta, 103(4), 355–365. https://doi.org/10.1016/S0003-2670(01)83100-7
Article CAS Google Scholar
Castillo, A. M., Uribe, L., Patiny, L., & Wist, J. (2013). Fast and shift-insensitive similarity comparisons of NMR using a tree-representation of spectra. Chemometrics and Intelligent Laboratory Systems, 127, 1–6. https://doi.org/10.1016/j.chemolab.2013.05.009
Article CAS Google Scholar
Charris-Molina, A., Riquelme, G., Burdisso, P., & Hoijemberg, P. A. (2020). Consecutive queries to assess biological correlation in NMR metabolomics: performance of comprehensive search of multiplets over typical 1D 1H NMR database search. Journal of Proteome Research, 19(8), 2977–2988. https://doi.org/10.1021/acs.jproteome.9b00872
Article CAS Google Scholar
Cloarec, O., Dumas, M.-E., Craig, A., Barton, R. H., Trygg, J., Hudson, J., Blancher, C., Gauguier, D., Lindon, J. C., Holmes, E., & Nicholson, J. (2005). Statistical total correlation spectroscopy: an exploratory approach for latent biomarker identification from metabolic 1H NMR data sets. Analytical Chemistry, 77(5), 1282–1289. https://doi.org/10.1021/ac048630x
Article CAS Google Scholar
Cobas, C., Seoane, F., Vaz, E., Bernstein, M. A., Dominguez, S., Pérez, M., & Sýkora, S. (2013). Automatic assignment of 1H-NMR spectra of small molecules. Magnetic Resonance in Chemistry, 51(10), 649–654. https://doi.org/10.1002/mrc.3995
Article CAS Google Scholar
Cobas, J. C., Constantino-Castillo, V., Martín-Pastor, M., & del Río-Portilla, F. (2005). A two-stage approach to automatic determination of 1H NMR coupling constants. Magnetic Resonance in Chemistry, 43(10), 843–848. https://doi.org/10.1002/mrc.1623
Article CAS Google Scholar
Cui, Q., Lewis, I. A., Hegeman, A. D., Anderson, M. E., Li, J., Schulte, C. F., Westler, W. M., Eghbalnia, H. R., Sussman, M. R., & Markley, J. L. (2008). Metabolite identification via the madison metabolomics consortium database. Nature Biotechnology, 26(2), 162–164. https://doi.org/10.1038/nbt0208-162
Article CAS Google Scholar
Dashti, H., Wedell, J. R., Westler, W. M., Tonelli, M., Aceti, D., Amarasinghe, G. K., Markley, J. L., & Eghbalnia, H. R. (2018). Applications of parametrized NMR spin systems of small molecules. Analytical Chemistry, 90(18), 10646–10649. https://doi.org/10.1021/acs.analchem.8b02660
Article CAS Google Scholar
Dashti, H., Westler, W. M., Tonelli, M., Wedell, J. R., Markley, J. L., & Eghbalnia, H. R. (2017). Spin System modeling of nuclear magnetic resonance spectra for applications in metabolomics and small molecule screening. Analytical Chemistry, 89(22), 12201–12208. https://doi.org/10.1021/acs.analchem.7b02884
Article CAS Google Scholar
Dona, A. C., Kyriakides, M., Scott, F., Shephard, E. A., Varshavi, D., Veselkov, K., & Everett, J. R. (2016). A guide to the identification of metabolites in NMR-based metabonomics/metabolomics experiments. Computational and Structural Biotechnology Journal, 14, 135–153. https://doi.org/10.1016/j.csbj.2016.02.005
Article CAS Google Scholar
Du, P., Kibbe, W. A., & Lin, S. M. (2006). Improved peak detection in mass spectrum by incorporating continuous wavelet transform-based pattern matching. Bioinformatics, 22(17), 2059–2065. https://doi.org/10.1093/bioinformatics/btl355
Article CAS Google Scholar
Dubey, A., Rangarajan, A., Pal, D., & Atreya, H. S. (2015). Pattern recognition-based approach for identifying metabolites in nuclear magnetic resonance-based metabolomics. Analytical Chemistry, 87(14), 7148–7155. https://doi.org/10.1021/acs.analchem.5b00990
Article CAS Google Scholar
Edison, A. S., Colonna, M., Gouveia, G. J., Holderman, N. R., Judge, M. T., Shen, X., & Zhang, S. (2021). NMR: Unique strengths that enhance modern metabolomics research. Analytical Chemistry, 93(1), 478–499. https://doi.org/10.1021/acs.analchem.0c04414
Article CAS Google Scholar
Egan, J. M., van Santen, J. A., Liu, D. Y., & Linington, R. G. (2021). Development of an NMR-based platform for the direct structural annotation of complex natural products mixtures. Journal of Natural Products, 84(4), 1044–1055. https://doi.org/10.1021/acs.jnatprod.0c01076
Article CAS Google Scholar
Eghbalnia, H. R., Romero, P. R., Westler, W. M., Baskaran, K., Ulrich, E. L., & Markley, J. L. (2017). Increasing rigor in NMR-based metabolomics through validated and open source tools. Current Opinion in Biotechnology, 43, 56–61. https://doi.org/10.1016/j.copbio.2016.08.005
Article CAS Google Scholar
Elias, J. E., & Gygi, S. P. (2010). Target-decoy search strategy for mass spectrometry-based proteomics. In S. J. Hubbard & A. R. Jones (Eds.), Proteome bioinformatics (pp. 55–71). Humana Press.
Chapter Google Scholar
Ellinger, J. J., Chylla, R. A., Ulrich, E. L., & Markley, J. L. (2013). Databases and software for NMR-based metabolomics. Current Metabolomics. https://doi.org/10.2174/2213235x11301010028
Article Google Scholar
Everett, J. R. (2015). A New Paradigm for known metabolite identification in metabonomics/metabolomics: Metabolite identification efficiency. Computational and Structural Biotechnology Journal, 13, 131–144. https://doi.org/10.1016/j.csbj.2015.01.002
Article CAS Google Scholar
Flores-Bocanegra, L., Al Subeh, Z. Y., Egan, J. M., El-Elimat, T., Raja, H. A., Burdette, J. E., Pearce, C. J., Linington, R. G., & Oberlies, N. H. (2022). Dereplication of fungal metabolites by NMR-based compound networking using MADByTE. Journal of Natural Products, 85(3), 614–624. https://doi.org/10.1021/acs.jnatprod.1c00841
Article CAS Google Scholar
Frigo, M., Cruciani, E., Coudert, D., Deriche, R., Natale, E., & Deslauriers-Gauthier, S. (2021). Network alignment and similarity reveal atlas-based topological differences in structural connectomes. Network Neuroscience (cambridge, Mass), 5(3), 711–733. https://doi.org/10.1162/netn_a_00199
Article Google Scholar
Garcia-Perez, I., Posma, J. M., Serrano-Contreras, J. I., Boulangé, C. L., Chan, Q., Frost, G., Stamler, J., Elliott, P., Lindon, J. C., Holmes, E., & Nicholson, J. K. (2020). Identifying unknown metabolites using NMR-based metabolic profiling techniques. Nature Protocols, 15(8), 2538–2567. https://doi.org/10.1038/s41596-020-0343-3
Article CAS Google Scholar
Golotvin, S. S., Vodopianov, E., Lefebvre, B. A., Williams, A. J., & Spitzer, T. D. (2006). Automated structure verification based on 1H NMR prediction. Magnetic Resonance in Chemistry, 44(5), 524–538. https://doi.org/10.1002/mrc.1781
Article CAS Google Scholar
Golotvin, S., Vodopianov, E., & Williams, A. (2002). A new approach to automated first-order multiplet analysis. Magnetic Resonance in Chemistry, 40(5), 331–336. https://doi.org/10.1002/mrc.1014
Article CAS Google Scholar
Hao, J., Liebeke, M., Astle, W., De Iorio, M., Bundy, J. G., & Ebbels, T. M. D. (2014). Bayesian deconvolution and quantification of metabolites in complex 1D NMR spectra using BATMAN. Nature Protocols, 9(6), 1416–1427. https://doi.org/10.1038/nprot.2014.090
Article CAS Google Scholar
Haug, K., Cochrane, K., Nainala, V. C., Williams, M., Chang, J., Jayaseelan, K. V., & O’Donovan, C. (2020). MetaboLights: A resource evolving in response to the needs of its scientific community. Nucleic Acids Research, 48(D1), D440–D444. https://doi.org/10.1093/nar/gkz1019
Article CAS Google Scholar
Howarth, A., Ermanis, K., & Goodman, J. M. (2020). DP4-AI automated NMR data analysis: straight from spectrometer to structure. Chemical Science, 11(17), 4351–4359. https://doi.org/10.1039/D0SC00442A
Article CAS Google Scholar
Hoye, T. R., Hanson, P. R., & Vyvyan, J. R. (1994). A practical guide to first-order multiplet analysis in 1H NMR spectroscopy. The Journal of Organic Chemistry, 59(15), 4096–4103. https://doi.org/10.1021/jo00094a018
Article CAS Google Scholar
Hoye, T. R., & Zhao, H. (2002). A method for easily determining coupling constant values: An addendum to “A Practical Guide to First-Order Multiplet Analysis in 1H NMR spectroscopy.” The Journal of Organic Chemistry, 67(12), 4014–4016. https://doi.org/10.1021/jo001139v
Article CAS Google Scholar
Huang, Z., Chen, M. S., Woroch, C. P., Markland, T. E., & Kanan, M. W. (2021). A framework for automated structure elucidation from routine NMR spectra. Chemical Science, 12(46), 15329–15338. https://doi.org/10.1039/D1SC04105C
Article CAS Google Scholar
Hubert, J., Nuzillard, J.-M., Purson, S., Hamzaoui, M., Borie, N., Reynaud, R., & Renault, J.-H. (2014). Identification of natural metabolites in mixture: A pattern recognition strategy based on 13C NMR. Analytical Chemistry, 86(6), 2955–2962. https://doi.org/10.1021/ac403223f
Article CAS Google Scholar
Joesten, W. C., & Kennedy, M. A. (2019). RANCM: A new ranking scheme for assigning confidence levels to metabolite assignments in NMR-based metabolomics studies. Metabolomics, 15(1), 5. https://doi.org/10.1007/s11306-018-1465-2
Article CAS Google Scholar
Jonas, E. (2019). Deep imitation learning for molecular inverse problems. In Advances in neural information processing systems, 32.
Khalili, B., Tomasoni, M., Mattei, M., Mallol Parera, R., Sonmez, R., Krefl, D., Rueedi, R., & Bergmann, S. (2019). Automated analysis of large-scale NMR data generates metabolomic signatures and links them to candidate metabolites. Journal of Proteome Research, 18(9), 3360–3368. https://doi.org/10.1021/acs.jproteome.9b00295
Article CAS Google Scholar
Kim, H. W., Wang, M., Leber, C. A., Nothias, L.-F., Reher, R., Kang, K. B., van der Hooft, J. J. J., Dorrestein, P. C., Gerwick, W. H., & Cottrell, G. W. (2021). NPClassifier: A deep neural network-based structural classification tool for natural products. Journal of Natural Products, 84(11), 2795–2807. https://doi.org/10.1021/acs.jnatprod.1c00399
Article CAS Google Scholar
Koichi, S., Arisaka, M., Koshino, H., Aoki, A., Iwata, S., Uno, T., & Satoh, H. (2014). Chemical structure elucidation from 13C NMR chemical shifts: Efficient data processing using bipartite matching and maximal clique algorithms. Journal of Chemical Information and Modeling, 54(4), 1027–1035. https://doi.org/10.1021/ci400601c
Article CAS Google Scholar
Koradi, R., Billeter, M., Engeli, M., Güntert, P., & Wüthrich, K. (1998). Automated Peak Picking and Peak Integration in Macromolecular NMR Spectra Using AUTOPSY. Journal of Magnetic Resonance, 135(2), 288–297. https://doi.org/10.1006/jmre.1998.1570
Article CAS Google Scholar
Krishnamurthy, K. (2013). CRAFT (complete reduction to amplitude frequency table)—robust and time-efficient Bayesian approach for quantitative mixture analysis by NMR. Magnetic Resonance in Chemistry, 51(12), 821–829. https://doi.org/10.1002/mrc.4022
Article CAS Google Scholar
Kuhn, H. W. (1955). The Hungarian method for the assignment problem. Naval Research Logistics Quarterly, 2(1–2), 83–97. https://doi.org/10.1002/nav.3800020109
Article Google Scholar
Kwon, Y., Lee, D., Choi, Y.-S., & Kang, S. (2021). Molecular search by NMR spectrum based on evaluation of matching between spectrum and molecule. Scientific Reports, 11(1), 20998. https://doi.org/10.1038/s41598-021-00488-z
Article CAS Google Scholar
Li, D.-W., Wang, C., & Brüschweiler, R. (2017). Maximal clique method for the automated analysis of NMR TOCSY spectra of complex mixtures. Journal of Biomolecular NMR, 68(3), 195–202. https://doi.org/10.1007/s10858-017-0119-4
Article CAS Google Scholar
Marshall, I., Higinbotham, J., Bruce, S., & Freise, A. (1997). Use of Voigt lineshape for quantification of in vivo 1H spectra. Magnetic Resonance in Medicine, 37(5), 651–657.
Article CAS Google Scholar
Mercier, P., Lewis, M. J., Chang, D., Baker, D., & Wishart, D. S. (2011). Towards automatic metabolomic profiling of high-resolution one-dimensional proton NMR spectra. Journal of Biomolecular NMR, 49(3), 307–323. https://doi.org/10.1007/s10858-011-9480-x
Article CAS Google Scholar
Misra, B. B. (2021). New software tools, databases, and resources in metabolomics: Updates from 2020. Metabolomics, 17(5), 49. https://doi.org/10.1007/s11306-021-01796-1
Article CAS Google Scholar
Mohamed, A., Nguyen, C. H., & Mamitsuka, H. (2015). Current status and prospects of computational resources for natural product dereplication: A review. Briefings in Bioinformatics, 17(2), 309–321. https://doi.org/10.1093/bib/bbv042
Article CAS Google Scholar
Monge, M. E., Dodds, J. N., Baker, E. S., Edison, A. S., & Fernández, F. M. (2019). Challenges in identifying the dark molecules of life. Annual Review of Analytical Chemistry (palo Alto Calif), 12(1), 177–199. https://doi.org/10.1146/annurev-anchem-061318-114959
Article CAS Google Scholar
Napolitano, J. G., Lankin, D. C., McAlpine, J. B., Niemitz, M., Korhonen, S.-P., Chen, S.-N., & Pauli, G. F. (2013). Proton fingerprints portray molecular structures: Enhanced description of the 1H NMR spectra of small molecules. The Journal of Organic Chemistry, 78(19), 9963–9968. https://doi.org/10.1021/jo4011624
Article CAS Google Scholar
Napolitano, J. G., Simmler, C., McAlpine, J. B., Lankin, D. C., Chen, S.-N., & Pauli, G. F. (2015). Digital NMR profiles as building blocks: assembling 1H fingerprints of steviol glycosides. Journal of Natural Products, 78(4), 658–665. https://doi.org/10.1021/np5008203
Article CAS Google Scholar
Pauli, G. F., Chen, S.-N., Lankin, D. C., Bisson, J., Case, R. J., Chadwick, L. R., Gödecke, T., Inui, T., Krunic, A., Jaki, B. U., McAlpine, J. B., Mo, S., Napolitano, J. G., Orjala, J., Lehtivarjo, J., Korhonen, S.-P., & Niemitz, M. (2014). Essential parameters for structural analysis and dereplication by 1H NMR spectroscopy. Journal of Natural Products, 77(6), 1473–1487. https://doi.org/10.1021/np5002384
Article CAS Google Scholar
Pauli, G. F., Niemitz, M., Bisson, J., Lodewyk, M. W., Soldi, C., Shaw, J. T., Tantillo, D. J., Saya, J. M., Vos, K., Kleinnijenhuis, R. A., Hiemstra, H., Chen, S.-N., McAlpine, J. B., Lankin, D. C., & Friesen, J. B. (2016). Toward structural correctness: aquatolide and the importance of 1D proton NMR FID archiving. The Journal of Organic Chemistry, 81(3), 878–889. https://doi.org/10.1021/acs.joc.5b02456
Article CAS Google Scholar
Posma, J. M., Garcia-Perez, I., De Iorio, M., Lindon, J. C., Elliott, P., Holmes, E., Ebbels, T. M. D., & Nicholson, J. K. (2012). Subset optimization by reference matching (STORM): An optimized statistical approach for recovery of metabolic biomarker structural information from 1H NMR spectra of biofluids. Analytical Chemistry, 84(24), 10694–10701. https://doi.org/10.1021/ac302360v
Article CAS Google Scholar
Ravanbakhsh, S., Liu, P., Bjordahl, T. C., Mandal, R., Grant, J. R., Wilson, M., Eisner, R., Sinelnikov, I., Hu, X., Luchinat, C., Greiner, R., & Wishart, D. S. (2015). Accurate, fully-automated NMR spectral profiling for metabolomics. PLoS ONE, 10(5), e0124219. https://doi.org/10.1371/journal.pone.0124219
Article CAS Google Scholar
Reher, R., Kim, H. W., Zhang, C., Mao, H. H., Wang, M., Nothias, L.-F., Caraballo-Rodriguez, A. M., Glukhov, E., Teke, B., Leao, T., Alexander, K. L., Duggan, B. M., Van Everbroeck, E. L., Dorrestein, P. C., Cottrell, G. W., & Gerwick, W. H. (2020). A convolutional neural network-based approach for the rapid annotation of molecularly diverse natural products. Journal of the American Chemical Society, 142(9), 4114–4120. https://doi.org/10.1021/jacs.9b13786
Article CAS Google Scholar
Robinette, S. L., Veselkov, K. A., Bohus, E., Coen, M., Keun, H. C., Ebbels, T. M. D., Beckonert, O., Holmes, E. C., Lindon, J. C., & Nicholson, J. K. (2009). Cluster analysis statistical spectroscopy using nuclear magnetic resonance generated metabolic data sets from perturbed biological systems. Analytical Chemistry, 81(16), 6581–6589. https://doi.org/10.1021/ac901240j
Article CAS Google Scholar
Robinette, S. L., Zhang, F., Brüschweiler-Li, L., & Brüschweiler, R. (2008). Web server based complex mixture analysis by NMR. Analytical Chemistry, 80(10), 3606–3611. https://doi.org/10.1021/ac702530t
Article CAS Google Scholar
Rossé, G., Neidig, P., & Schröder, H. (2002). Automated structure verification of small molecules libraries using 1D and 2D NMR techniques. In L. B. English (Ed.), Combinatorial library: Methods and protocols (pp. 123–139). Springer.
Chapter Google Scholar
Salek, R. M., Maguire, M. L., Bentley, E., Rubtsov, D. V., Hough, T., Cheeseman, M., Nunez, D., Sweatman, B. C., Haselden, J. N., Cox, R. D., Connor, S. C., & Griffin, J. L. (2007). A metabolomic comparison of urinary changes in type 2 diabetes in mouse, rat, and human. Physiological Genomics, 29(2), 99–108. https://doi.org/10.1152/physiolgenomics.00194.2006
Article CAS Google Scholar
Scheubert, K., Hufsky, F., Petras, D., Wang, M., Nothias, L.-F., Dührkop, K., Bandeira, N., Dorrestein, P. C., & Böcker, S. (2017). Significance estimation for large scale metabolomics annotations by spectral matching. Nature Communications, 8(1), 1494. https://doi.org/10.1038/s41467-017-01318-5
Article CAS Google Scholar
Smurnyy, Y. D., Blinov, K. A., Churanova, T. S., Elyashberg, M. E., & Williams, A. J. (2008). Toward more reliable 13C and 1H chemical shift prediction: A systematic comparison of neural-network and least-squares regression based approaches. Journal of Chemical Information and Modeling, 48(1), 128–134. https://doi.org/10.1021/ci700256n
Article CAS Google Scholar
Sousa, S. A. A., Magalhães, A., & Ferreira, M. M. C. (2013). Optimized bucketing for NMR spectra: Three case studies. Chemometrics and Intelligent Laboratory Systems, 122, 93–102. https://doi.org/10.1016/j.chemolab.2013.01.006
Article CAS Google Scholar
Sumner, L. W., Amberg, A., Barrett, D., Beale, M. H., Beger, R., Daykin, C. A., Fan, T. W. M., Fiehn, O., Goodacre, R., Griffin, J. L., Hankemeier, T., Hardy, N., Harnly, J., Higashi, R., Kopka, J., Lane, A. N., Lindon, J. C., Marriott, P., Nicholls, A. W., ... Viant, M. R. (2007). Proposed minimum reporting standards for chemical analysis. Metabolomics, 3(3), 211–221. https://doi.org/10.1007/s11306-007-0082-2
Article CAS Google Scholar
Trbovic, N., Dancea, F., Langer, T., & Günther, U. (2005). Using wavelet de-noised spectra in NMR screening. Journal of Magnetic Resonance, 173(2), 280–287. https://doi.org/10.1016/j.jmr.2004.11.032
Article CAS Google Scholar
Tredwell, G. D., Bundy, J. G., De Iorio, M., & Ebbels, T. M. D. (2016). Modelling the acid/base 1H NMR chemical shift limits of metabolites in human urine. Metabolomics, 12(10), 152. https://doi.org/10.1007/s11306-016-1101-y
Article CAS Google Scholar
Tulpan, D., Léger, S., Belliveau, L., Culf, A., & Čuperlović-Culf, M. (2011). MetaboHunter: An automatic approach for identification of metabolites from 1H-NMR spectra of complex mixtures. BMC Bioinformatics, 12(1), 400. https://doi.org/10.1186/1471-2105-12-400
Article CAS Google Scholar
Ulrich, E. L., Baskaran, K., Dashti, H., Ioannidis, Y. E., Livny, M., Romero, P. R., Maziuk, D., Wedell, J. R., Yao, H., Eghbalnia, H. R., Hoch, J. C., & Markley, J. L. (2019). NMR-STAR: Comprehensive ontology for representing, archiving and exchanging data from nuclear magnetic resonance spectroscopic experiments. Journal of Biomolecular NMR, 73(1), 5–9. https://doi.org/10.1007/s10858-018-0220-3
Article CAS Google Scholar
van der Hooft, J. J. J., & Rankin, N. (2016). Metabolite identification in complex mixtures using nuclear magnetic resonance spectroscopy. In G. A. Webb (Ed.), Modern magnetic resonance (pp. 1–32). Springer International Publishing.
Google Scholar
Vu, T. N., & Laukens, K. (2013). Getting your peaks in line: A review of alignment methods for NMR spectral data. Metabolites, 3(2), 259–276. https://doi.org/10.3390/metabo3020259
Article CAS Google Scholar
Vu, T. N., Valkenborg, D., Smets, K., Verwaest, K. A., Dommisse, R., Lemière, F., Verschoren, A., Goethals, B., & Laukens, K. (2011). An integrated workflow for robust alignment and simplified quantitative analysis of NMR spectrometry data. BMC Bioinformatics, 12(1), 405. https://doi.org/10.1186/1471-2105-12-405
Article Google Scholar
Wang, C., Zhang, B., Timári, I., Somogyi, Á., Li, D.-W., Adcox, H. E., Gunn, J. S., Bruschweiler-Li, L., & Brüschweiler, R. (2019). Accurate and efficient determination of unknown metabolites in metabolomics by NMR-based molecular motif identification. Analytical Chemistry, 91(24), 15686–15693. https://doi.org/10.1021/acs.analchem.9b03849
Article CAS Google Scholar
Wishart, D. S., Guo, A., Oler, E., Wang, F., Anjum, A., Peters, H., Dizon, R., Sayeeda, Z., Tian, S., Lee, B. L., Berjanskii, M., Mah, R., Yamamoto, M., Jovel, J., Torres-Calzada, C., Hiebert-Giesbrecht, M., Lui, V. W., Varshavi, D., Varshavi, D., ... Gautam, V. (2022). HMDB 5.0: the Human Metabolome Database for 2022. Nucleic Acids Research, 50(D1), D622-d631. https://doi.org/10.1093/nar/gkab1062
Article CAS Google Scholar
Wolfram, K., Porzel, A., & Hinneburg, A. (2006). Similarity Search for Multi-dimensional NMR-Spectra of Natural Products. Knowledge Discovery in Databases: PKDD 2006, Berlin, Heidelberg.
Zeng, Q., Chen, J., Lin, Y., & Chen, Z. (2020). Boosting resolution in NMR spectroscopy by chemical shift upscaling. Analytica Chimica Acta, 1110, 109–114. https://doi.org/10.1016/j.aca.2020.03.032
Article CAS Google Scholar
Zhang, C., Idelbayev, Y., Roberts, N., Tao, Y., Nannapaneni, Y., Duggan, B. M., Min, J., Lin, E. C., Gerwick, E. C., Cottrell, G. W., & Gerwick, W. H. (2017). Small molecule accurate recognition technology (SMART) to enhance natural products research. Scientific Reports, 7(1), 14243. https://doi.org/10.1038/s41598-017-13923-x
Article CAS Google Scholar
Zhang, F., & Brüschweiler, R. (2004). Spectral deconvolution of chemical mixtures by covariance NMR. ChemPhysChem, 5(6), 794–796. https://doi.org/10.1002/cphc.200301073
Article CAS Google Scholar
Zhang, F., & Brüschweiler, R. (2007). Robust Deconvolution of complex mixtures by covariance TOCSY spectroscopy. Angewandte Chemie International Edition, 46(15), 2639–2642. https://doi.org/10.1002/anie.200604599
Article CAS Google Scholar
Zhang, J., Terayama, K., Sumita, M., Yoshizoe, K., Ito, K., Kikuchi, J., & Tsuda, K. (2020). NMR-TS: De novo molecule identification from NMR spectra. Science and Technology of Advanced Materials, 21(1), 552–561. https://doi.org/10.1080/14686996.2020.1793382
Article CAS Google Scholar
Zou, X., Holmes, E., Nicholson, J. K., & Loo, R. L. (2014). Statistical homogeneous cluster spectroscopy (SHOCSY): An optimized statistical approach for clustering of 1H NMR spectral data to reduce interference and enhance robust biomarkers selection. Analytical Chemistry, 86(11), 5308–5315. https://doi.org/10.1021/ac500161k
Article CAS Google Scholar
Zürcher, M., Clerc, J. T., Farkas, M., & Pretsch, E. (1988). General theory of similarity measures for library search systems. Analytica Chimica Acta, 206, 161–172. https://doi.org/10.1016/S0003-2670(00)80839-9
Article Google Scholar

Download references

Acknowledgements

The authors thank Arthur S. Edison, Panteleimon Takis and Beatriz Jiménez for feedback on key ideas. This work was supported by the Biotechnology and Biological Sciences Research Council (BBSRC) (Grant Number BB/T007974/1).

Author information

Authors and Affiliations

Section of Bioinformatics, Division of Systems Medicine, Department of Metabolism, Digestion and Reproduction, Imperial College, 131 Sir Alexander Fleming Building, South Kensington Campus, London, UK
Michael T. Judge & Timothy M. D. Ebbels

Authors

Michael T. Judge
View author publications
You can also search for this author in PubMed Google Scholar
Timothy M. D. Ebbels
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

MTJ and TE: wrote the manuscript. All authors read and approved the manuscript.

Corresponding author

Correspondence to Timothy M. D. Ebbels.

Ethics declarations

Conflict of interest

The authors declare they have no competing interests.

Ethical approval

This article does not contain any studies with human and/or animal participants performed by any of the authors.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Judge, M.T., Ebbels, T.M.D. Problems, principles and progress in computational annotation of NMR metabolomics data. Metabolomics 18, 102 (2022). https://doi.org/10.1007/s11306-022-01962-z

Download citation

Received: 04 July 2022
Accepted: 18 November 2022
Published: 05 December 2022
DOI: https://doi.org/10.1007/s11306-022-01962-z

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Problems, principles and progress in computational annotation of NMR metabolomics data

Abstract

Background

Aim of review

Key scientific concepts of review

Similar content being viewed by others

Good practices and recommendations for using and benchmarking computational metabolomics metabolite annotation tools

NMRFinder: a novel method for 1D 1H-NMR metabolite annotation

MAW: the reproducible Metabolome Annotation Workflow for untargeted tandem mass spectrometry

1 Introduction

1.1 NMR in metabolomics and compound identification

1.2 Annotation as a subprocess of identification

1.3 Challenges in the automation of annotation

2 Data, feature complexity, and structural information

2.1 Features

2.2 Peaks and compound features

2.2.1 Simple peaks

2.2.2 Complex peaks

2.2.3 Peaks vs. compound features

2.3 Signatures and metasignatures

3 Extracting features, compound features, signatures, and subsignatures

3.1 Bottom-up: data-driven feature extraction

3.1.1 Simple features

3.2 Compound features from 1D data

3.3 Compound features from 2D data

3.4 Signatures and metasignatures are connecting points

3.5 Top-down: high-quality subsignatures from signatures

4 Matching: leveraging computational characteristics

4.1 Comparisons between feature lists

4.2 Compound feature/subsignature-driven comparisons

4.3 Full signature comparisons

5 Assessing and communicating confidence

5.1 Ranking hypotheses

5.2 Controlling false positives

6 Overall conclusions

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Ethical approval

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation