El-MAVEN: A Fast, Robust, and User-Friendly Mass Spectrometry Data Processing Engine for Metabolomics
Analysis of large metabolomic datasets is becoming commonplace with the increased realization of the role that metabolites play in biology and pathophysiology. While there are many open-source analysis tools to extract peaks from liquid chromatography-mass spectrometry (LC-MS), gas chromatography-mass spectrometry (GC-MS), and tandem mass spectrometry (LC-MS/MS) data, these tools are not very interactive and are suboptimal when a large number of samples are to be analyzed. El-MAVEN is an open-source analysis platform that extends MAVEN and provides fast, powerful, and interactive analysis capabilities especially for datasets containing over 100 samples. The El-MAVEN workflow is easy to use with just four steps from loading data to exporting of the results. Advanced analysis and software techniques such as multiprocessing, machine learning, and reduction of memory leaks are implemented so as to provide a seamless and interactive user experience. Results from El-MAVEN can be exported in a range of formats allowing continued analysis on other platforms. Additionally, El-MAVEN is also fully integrated with Polly™, a cloud-based analysis platform that provides a range of tools for flux analysis and integrative-omics analysis. El-MAVEN is a powerful tool that enables fast and efficient analysis of large metabolomic datasets to accelerate the process of gaining insight from raw data.
KeywordsMass spectrometry Data processing Metabolomics Bioinformatics Data analysis Metabolic pathways Liquid chromatography-mass spectrometry MAVEN
El-MAVEN is an extension of MAVEN, and as such we would like to acknowledge the creators of MAVEN. We would also like to acknowledge El-MAVEN contributors and community on GitHub, especially Victor Chubukov, Lance Parsons, and Eugene Melamud for their help in identifying bugs and fixes. The content and figures in this paper were structured, written, edited, and formatted by Chandni Valiathan at Illumetis, LLC.
- 2.Mathew AK, Padmanaban VC (2013) Metabolomics: the apogee of the omics trilogy. Int. J. Pharm. Pharm. Sci. 5:45–48Google Scholar
- 9.Clasquin MF, Melamud E, Rabinowitz JD (2012) LC-MS data processing with MAVEN: a metabolomic analysis and visualization engine. Curr Protoc Bioinforma. https://doi.org/10.1002/0471250953.bi1411s37
- 14.Pluskal T, Castillo S, Villar-Briones A, Orešič M (2010) MZmine 2: modular framework for processing, visualizing, and analyzing mass spectrometry-based molecular profile data. BMC Bioinformatics 11. https://doi.org/10.1186/1471-2105-11-395
- 17.Myers OD, Sumner SJ, Li S et al (2017) Detailed investigation and comparison of the XCMS and MZmine 2 chromatogram construction and chromatographic peak detection methods for preprocessing mass spectrometry metabolomics data. Anal Chem 89:8689–8695. https://doi.org/10.1021/acs.analchem.7b01069CrossRefPubMedGoogle Scholar
- 18.Myers OD, Sumner SJ, Li S et al (2017) One step forward for reducing false positive and false negative compound identifications from mass spectrometry metabolomics data: new algorithms for constructing extracted ion chromatograms and detecting chromatographic peaks. Anal Chem 89:8696–8703. https://doi.org/10.1021/acs.analchem.7b00947CrossRefPubMedGoogle Scholar
- 19.Libiseller G, Dvorzak M, Kleb U et al (2015) IPO: a tool for automated optimization of XCMS parameters. BMC Bioinformatics 16. https://doi.org/10.1186/s12859-015-0562-8