Abstract
Introduction
Methods for the automated and accurate identification of metabolites in 1D 1H-NMR samples are crucial, but this is still an unsolved problem. Most available tools are mainly focused on metabolite quantification, thus limiting the number of metabolites that can be identified. Also, most only use reference spectra obtained under the same specific conditions of the target sample, limiting the use of available knowledge.
Objectives
The main goal of this work was to develop novel methods to perform metabolite annotation from 1D 1H-NMR peaks with enhanced reliability, to aid the users in metabolite identification. An essential step was to construct a vast and up-do-date library of reference 1D 1H-NMR peak lists collected under distinct experimental conditions.
Methods
Three different algorithms were evaluated for their capacity to correctly annotate metabolites present in both synthetic and real samples and compared to publicly available tools. The best proposed method was evaluated in a plethora of scenarios, including missing references, missing peaks and peak shifts, to assess its annotation accuracy, precision and recall.
Results
We gathered 1816 peak lists for 1387 different metabolites from several sources across different conditions for our reference library. A new method, NMRFinder, is proposed and allows matching 1D 1H-NMR samples with all the reference peak lists in the library, regardless of acquisition conditions. Metabolites are scored according to the number of peaks matching the samples, how unique their peaks are in the library and how close the spectrum acquisition conditions are in relation to those of the samples. Results show a true positive rate of 0.984 when analysing computationally created samples, while 71.8% of the metabolites were annotated when analysing samples from previously identified public datasets.
Conclusion
NMRFinder performs metabolite annotation reliably and outperforms previous methods, being of great value in helping the user to ultimately identify metabolites. It is implemented in the R package specmine.
Similar content being viewed by others
Data availability
The proposed 1H-NMR annotation methods are present in the R package specmine (Costa et al., 2016), available in CRAN.
References
Alonso, A., Marsal, S., & Julià, A. (2015). Analytical methods in untargeted metabolomics: State of the art in 2015. Frontiers in BioEngineering and BioTechnology, 3, 23.
Benjamini, Y., & Hochberg, Y. (1995). Controlling the false discovery rate: A practical and powerful approach to multiple testing. Journal of the Royal Statistical Society Series B (Methodological), 57(1), 289–300.
Costa, C., Maraschin, M., & Rocha, M. (2016). An R package for the integrated analysis of metabolomics and spectral data. Computer Methods and Programs in Biomedicine, 129, 117–124.
Cui, Q., Lewis, I. A., Hegeman, A. D., Anderson, M. E., Li, J., Schulte, C. F., et al. (2008). Metabolite identification via the Madison metabolomics consortium database. Nature Biotechnology, 26(2), 162–164.
Filntisi, A., Fotakis, C., Asvestas, P., Matsopoulos, G. K., Zoumpoulakis, P., & Cavouras, D. (2017). Automated metabolite identification from biological fluid 1H NMR spectra. Metabolomics, 13, 146.
Hao, J., Astle, W., De Iorio, M., & Ebbels, T. M. (2012). BATMAN—An R package for the automated quantification of metabolites from nuclear magnetic resonance spectra using a Bayesian model. Bioinformatics, 28(15), 2088–2090.
Haug, K., Cochrane, K., Nainala, V. C., Williams, M., Chang, J., Jayaseelan, K. V., & O’Donovan, C. (2020). MetaboLights: A resource evolving in response to the needs of its scientific community. Nucleic Acids Research, 48(D1), D440–D444.
Kanehisa, M., & Goto, S. (2000). KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Research, 28(1), 27–30.
Lefort, G., Liaubet, L., Canlet, C., Tardivel, P., Père, M. C., Quesnel, H., et al. (2019). ASICS: An R package for a whole analysis workflow of 1D 1H-NMR spectra. Bioinformatics, 35(21), 4356–4363.
National Institute of Advanced Industrial Science and Technology (2020) SDBS—Spectral database for organic compounds. Retrieved from https://sdbs.db.aist.go.jp.
Ravanbakhsh, S., Liu, P., Bjordahl, T. C., Mandal, R., Grant, J. R., Wilson, M., et al. (2015). Accurate, fully-automated NMR spectral profiling for metabolomics. PLoS ONE, 10(5), e0124219.
Tenenbaum D. (2016) KEGGREST: Client-side REST access to KEGG. R package version 1(1).
Tulpan, D., Léger, S., Belliveau, L., Culf, A., & Čuperlović-Culf, M. (2011). MetaboHunter: An automatic approach for identification of metabolites from 1H-NMR spectra of complex mixtures. BMC Bioinformatics, 12(1), 400.
Ulrich, E. L., Akutsu, H., Doreleijers, J. F., Harano, Y., Ioannidis, Y. E., Lin, J., et al. (2007). BioMagResBank. Nucleic Acids Research, 36(suppl_1), D402–D408.
Villas-Boas, S. G., Nielsen, J., Smedsgaard, J., Hansen, M. A., & Roessner-Tunali, U. (2007). Metabolome analysis: An introduction (Vol. 24). Hoboken: Wiley.
Wishart, D. S., Feunang, Y. D., Marcu, A., Guo, A. C., Liang, K., Vázquez-Fresno, R., et al. (2018). HMDB 4.0: The human metabolome database for 2018. Nucleic Acids Research, 46(D1), D608–D617.
Funding
This study was funded by the PhD scholarship with reference SFRH/BD/138951/2018, awarded by the Portuguese Foundation for Science and Technology (FCT).
Author information
Authors and Affiliations
Contributions
SC and DC collected data and implemented the new peak lists library. SC and MR designed the algorithms. SC implemented the algorithms. SC, MM and MR designed the experiments and analysed the results. SC and MR wrote the manuscript draft. All authors read, reviewed and approved the final manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Research involving human and/or animal participants
This article does not contain any studies with human and/or animal participants performed by any of the authors.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Cardoso, S., Cabral, D., Maraschin, M. et al. NMRFinder: a novel method for 1D 1H-NMR metabolite annotation. Metabolomics 17, 21 (2021). https://doi.org/10.1007/s11306-021-01772-9
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11306-021-01772-9