Skip to main content

Advertisement

Log in

NMRFinder: a novel method for 1D 1H-NMR metabolite annotation

  • Original Article
  • Published:
Metabolomics Aims and scope Submit manuscript

Abstract

Introduction

Methods for the automated and accurate identification of metabolites in 1D 1H-NMR samples are crucial, but this is still an unsolved problem. Most available tools are mainly focused on metabolite quantification, thus limiting the number of metabolites that can be identified. Also, most only use reference spectra obtained under the same specific conditions of the target sample, limiting the use of available knowledge.

Objectives

The main goal of this work was to develop novel methods to perform metabolite annotation from 1D 1H-NMR peaks with enhanced reliability, to aid the users in metabolite identification. An essential step was to construct a vast and up-do-date library of reference 1D 1H-NMR peak lists collected under distinct experimental conditions.

Methods

Three different algorithms were evaluated for their capacity to correctly annotate metabolites present in both synthetic and real samples and compared to publicly available tools. The best proposed method was evaluated in a plethora of scenarios, including missing references, missing peaks and peak shifts, to assess its annotation accuracy, precision and recall.

Results

We gathered 1816 peak lists for 1387 different metabolites from several sources across different conditions for our reference library. A new method, NMRFinder, is proposed and allows matching 1D 1H-NMR samples with all the reference peak lists in the library, regardless of acquisition conditions. Metabolites are scored according to the number of peaks matching the samples, how unique their peaks are in the library and how close the spectrum acquisition conditions are in relation to those of the samples. Results show a true positive rate of 0.984 when analysing computationally created samples, while 71.8% of the metabolites were annotated when analysing samples from previously identified public datasets.

Conclusion

NMRFinder performs metabolite annotation reliably and outperforms previous methods, being of great value in helping the user to ultimately identify metabolites. It is implemented in the R package specmine.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

Data availability

The proposed 1H-NMR annotation methods are present in the R package specmine (Costa et al., 2016), available in CRAN.

References

  • Alonso, A., Marsal, S., & Julià, A. (2015). Analytical methods in untargeted metabolomics: State of the art in 2015. Frontiers in BioEngineering and BioTechnology, 3, 23.

    Article  Google Scholar 

  • Benjamini, Y., & Hochberg, Y. (1995). Controlling the false discovery rate: A practical and powerful approach to multiple testing. Journal of the Royal Statistical Society Series B (Methodological), 57(1), 289–300.

    Article  Google Scholar 

  • Costa, C., Maraschin, M., & Rocha, M. (2016). An R package for the integrated analysis of metabolomics and spectral data. Computer Methods and Programs in Biomedicine, 129, 117–124.

    Article  Google Scholar 

  • Cui, Q., Lewis, I. A., Hegeman, A. D., Anderson, M. E., Li, J., Schulte, C. F., et al. (2008). Metabolite identification via the Madison metabolomics consortium database. Nature Biotechnology, 26(2), 162–164.

    Article  CAS  Google Scholar 

  • Filntisi, A., Fotakis, C., Asvestas, P., Matsopoulos, G. K., Zoumpoulakis, P., & Cavouras, D. (2017). Automated metabolite identification from biological fluid 1H NMR spectra. Metabolomics, 13, 146.

    Article  Google Scholar 

  • Hao, J., Astle, W., De Iorio, M., & Ebbels, T. M. (2012). BATMAN—An R package for the automated quantification of metabolites from nuclear magnetic resonance spectra using a Bayesian model. Bioinformatics, 28(15), 2088–2090.

    Article  CAS  Google Scholar 

  • Haug, K., Cochrane, K., Nainala, V. C., Williams, M., Chang, J., Jayaseelan, K. V., & O’Donovan, C. (2020). MetaboLights: A resource evolving in response to the needs of its scientific community. Nucleic Acids Research, 48(D1), D440–D444.

    CAS  PubMed  Google Scholar 

  • Kanehisa, M., & Goto, S. (2000). KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Research, 28(1), 27–30.

    Article  CAS  Google Scholar 

  • Lefort, G., Liaubet, L., Canlet, C., Tardivel, P., Père, M. C., Quesnel, H., et al. (2019). ASICS: An R package for a whole analysis workflow of 1D 1H-NMR spectra. Bioinformatics, 35(21), 4356–4363.

    Article  Google Scholar 

  • National Institute of Advanced Industrial Science and Technology (2020) SDBS—Spectral database for organic compounds. Retrieved from https://sdbs.db.aist.go.jp.

  • Ravanbakhsh, S., Liu, P., Bjordahl, T. C., Mandal, R., Grant, J. R., Wilson, M., et al. (2015). Accurate, fully-automated NMR spectral profiling for metabolomics. PLoS ONE, 10(5), e0124219.

    Article  Google Scholar 

  • Tenenbaum D. (2016) KEGGREST: Client-side REST access to KEGG. R package version 1(1).

  • Tulpan, D., Léger, S., Belliveau, L., Culf, A., & Čuperlović-Culf, M. (2011). MetaboHunter: An automatic approach for identification of metabolites from 1H-NMR spectra of complex mixtures. BMC Bioinformatics, 12(1), 400.

    Article  CAS  Google Scholar 

  • Ulrich, E. L., Akutsu, H., Doreleijers, J. F., Harano, Y., Ioannidis, Y. E., Lin, J., et al. (2007). BioMagResBank. Nucleic Acids Research, 36(suppl_1), D402–D408.

    Article  Google Scholar 

  • Villas-Boas, S. G., Nielsen, J., Smedsgaard, J., Hansen, M. A., & Roessner-Tunali, U. (2007). Metabolome analysis: An introduction (Vol. 24). Hoboken: Wiley.

    Book  Google Scholar 

  • Wishart, D. S., Feunang, Y. D., Marcu, A., Guo, A. C., Liang, K., Vázquez-Fresno, R., et al. (2018). HMDB 4.0: The human metabolome database for 2018. Nucleic Acids Research, 46(D1), D608–D617.

    Article  CAS  Google Scholar 

Download references

Funding

This study was funded by the PhD scholarship with reference SFRH/BD/138951/2018, awarded by the Portuguese Foundation for Science and Technology (FCT).

Author information

Authors and Affiliations

Authors

Contributions

SC and DC collected data and implemented the new peak lists library. SC and MR designed the algorithms. SC implemented the algorithms. SC, MM and MR designed the experiments and analysed the results. SC and MR wrote the manuscript draft. All authors read, reviewed and approved the final manuscript.

Corresponding author

Correspondence to Sara Cardoso.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Research involving human and/or animal participants

This article does not contain any studies with human and/or animal participants performed by any of the authors.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Cardoso, S., Cabral, D., Maraschin, M. et al. NMRFinder: a novel method for 1D 1H-NMR metabolite annotation. Metabolomics 17, 21 (2021). https://doi.org/10.1007/s11306-021-01772-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s11306-021-01772-9

Keywords

Navigation