Skip to main content

Bio- and Chemoinformatics Approaches for Metabolomics Data Analysis

  • Protocol
  • First Online:
Metabolic Profiling

Part of the book series: Methods in Molecular Biology ((MIMB,volume 1738))

Abstract

Metabolomics data analysis includes several repetitive tasks, including data sorting, calculation of exact masses or other physicochemical properties, or searching for identifiers in different databases. Several of these tasks can be automated using command line tools or short scripts in different scripting languages like Perl, Python, or R. This chapter presents simple solutions and short scripts written in R that can be used for the interaction with specific web services or for the calculation of physicochemical properties or molecular formulae.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Protocol
USD 49.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 89.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 119.00
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 119.00
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Benton HP, Wong DM, Trauger SA et al (2008) XCMS2: processing tandem mass spectrometry data for metabolite identification and structural characterization. Anal Chem 80:6382–6389

    Article  CAS  Google Scholar 

  2. Smith CA, Want EJ, O'Maille G et al (2006) XCMS: processing mass spectrometry data for metabolite profiling using nonlinear peak alignment, matching, and identification. Anal Chem 78:779–787

    Article  CAS  Google Scholar 

  3. Müller C, Dietz I, Tziotis D et al (2013) Molecular cartography in acute chlamydia pneumoniae infections—a non-targeted metabolomics approach. Anal Bioanal Chem 405:5119–5131

    Article  Google Scholar 

  4. Stanstrup J, Gerlich M, Dragsted LO et al (2013) Metabolite profiling and beyond: approaches for the rapid processing and annotation of human blood serum mass spectrometry data. Anal Bioanal Chem 405(15):5037–5048

    Article  CAS  Google Scholar 

  5. Kind T, Fiehn O (2007) Seven golden rules for heuristic filtering of molecular formulas obtained by accurate mass spectrometry. BMC Bioinformatics 8:105

    Article  Google Scholar 

  6. Tziotis D, Hertkorn N, Schmitt-Kopplin P (2011) Kendrick-analogous network visualisation of ion cyclotron resonance Fourier transform mass spectra: improved options for the assignment of elemental compositions and the classification of organic molecular complexity. Eur J Mass Spectrom 17:415–421

    Article  CAS  Google Scholar 

  7. Witting M, Lucio M, Tziotis D et al (2015) DI-ICR-FT-MS-based high-throughput deep metabotyping: a case study of the Caenorhabditis Elegans–Pseudomonas Aeruginosa infection model. Anal Bioanal Chem 407:1059–1073

    Article  CAS  Google Scholar 

  8. Treutler H, Neumann S (2016) Prediction, detection, and validation of isotope clusters in mass spectrometry data. Meta 6:E37

    Google Scholar 

  9. Kerber A et al (1998) MOLGEN 40 Match-communications in mathematical and in computer. Chemistry 37:205–208

    CAS  Google Scholar 

  10. Peironcely JE et al (2012) OMG: Open Molecule Generator. J Cheminformatics 4:21

    Article  CAS  Google Scholar 

  11. Jaghoori MM et al (2013) PMG: multi-core Metabolite Identification. Electronic Notes in Theoretical Computer Science 299:53–60

    Article  Google Scholar 

  12. Kind T, Scholz M, Fiehn O (2009) How large is the metabolome? A critical analysis of data exchange practices in chemistry. PLoS One 4:e5440

    Article  Google Scholar 

  13. Wohlgemuth G et al (2010) The chemical translation service—a web-based tool to improve standardization of metabolomic reports. Bioinformatics 26:2647–2648

    Article  CAS  Google Scholar 

  14. Steinbeck C et al (2003) The chemistry development kit (CDK): an open-source java library for chemo- and bioinformatics. J Chem Inf Comput Sci 43:493–500

    Article  CAS  Google Scholar 

  15. Willighagen EL, Mayfield JW, Alvarsson J, Berg A, Carlsson L, Jeliazkova N, Kuhn S, Pluskal T, Rojas-Chertó M, Spjuth O, Torrance G, Evelo CT, Guha R, Steinbeck C (2017) The Chemistry Development Kit (CDK) v2.0: atom typing, depiction, molecular formulas, and substructure searching. J Cheminform 9:33. https://doi.org/10.1186/s13321-017-0220-4

    Google Scholar 

  16. Cao M et al (2014) Predicting retention time in hydrophilic interaction liquid chromatography mass spectrometry and its use for peak annotation in metabolomics. Metabolomics:1–11

    Google Scholar 

  17. Peironcely JE et al (2012) OMG: Open Molecule Generator. J Cheminformatics 4:1–13

    Article  Google Scholar 

  18. Wolf S et al (2010) In silico fragmentation for computer assisted identification of metabolite mass spectra. BMC Bioinformatics 11:148

    Article  Google Scholar 

  19. Gerlich M, Neumann S (2013) MetFusion: integration of compound identification strategies. J Mass Spectrom 48:291–298

    Article  CAS  Google Scholar 

  20. Kanehisa M, Goto S (2000) KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res 28:27–30

    Article  CAS  Google Scholar 

  21. Kanehisa M et al (2006) From genomics to chemical genomics: new developments in KEGG. Nucleic Acids Res 34(suppl 1):D354–D357

    Article  CAS  Google Scholar 

  22. Wishart DS et al (2012) HMDB 3.0—the human metabolome database in 2013. Nucleic Acids Res 41((Database issue)):D801–D807. https://doi.org/10.1093/nar/gks1065

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  23. Wishart DS et al (2009) HMDB: a knowledgebase for the human metabolome. Nucleic Acids Res 37(Database):D603–D610

    Article  CAS  Google Scholar 

  24. Sud M et al (2007) LMSD: LIPID MAPS structure database. Nucleic Acids Res 35(suppl 1):D527–D532

    Article  CAS  Google Scholar 

  25. Caspi R et al (2008) The MetaCyc database of metabolic pathways and enzymes and the BioCyc collection of pathway/genome databases. Nucleic Acids Res 36(suppl 1):D623–D631

    CAS  PubMed  Google Scholar 

  26. David S. Wishart, Yannick Djoumbou Feunang, Ana Marcu, An Chi Guo, Kevin Liang, Rosa Vázquez-Fresno, Tanvir Sajed, Daniel Johnson, Carin Li, Naama Karu, Zinat Sayeeda, Elvis Lo, Nazanin Assempour, Mark Berjanskii, Sandeep Singhal, David Arndt, Yonjie Liang, Hasan Badran, Jason Grant, Arnau Serra-Cayuela, Yifeng Liu, Rupa Mandal, Vanessa Neveu, Allison Pon, Craig Knox, Michael Wilson, Claudine Manach, Augustin Scalbert; HMDB 4.0: the human metabolome database for 2018, Nucleic Acids Research, gkx1089, https://doi.org/10.1093/nar/gkx1089

  27. Juty N, Le Novère N, Laibe C (2012) Identifiers.org and MIRIAM registry: community resources to provide persistent identification. Nucleic Acids Res 40:D580–D586

    Article  CAS  Google Scholar 

  28. Beisken S et al (2013) KNIME-CDK: workflow-driven cheminformatics. BMC Bioinformatics 14:257

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Michael Witting .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Science+Business Media, LLC, part of Springer Nature

About this protocol

Check for updates. Verify currency and authenticity via CrossMark

Cite this protocol

Witting, M. (2018). Bio- and Chemoinformatics Approaches for Metabolomics Data Analysis. In: Theodoridis, G., Gika, H., Wilson, I. (eds) Metabolic Profiling. Methods in Molecular Biology, vol 1738. Humana Press, New York, NY. https://doi.org/10.1007/978-1-4939-7643-0_4

Download citation

  • DOI: https://doi.org/10.1007/978-1-4939-7643-0_4

  • Published:

  • Publisher Name: Humana Press, New York, NY

  • Print ISBN: 978-1-4939-7642-3

  • Online ISBN: 978-1-4939-7643-0

  • eBook Packages: Springer Protocols

Publish with us

Policies and ethics