Abstract
Recent advances in metabolic profiling techniques allow global profiling of metabolites in cells, tissues, or organisms, using a wide range of analytical techniques such as nuclear magnetic resonance (NMR) spectroscopy and mass spectrometry (MS). The raw data acquired from these instruments are abundant with technical and structural complexity, which makes it statistically difficult to extract meaningful information. Preprocessing involves various computational procedures where data from the instruments (gas chromatography (GC)/liquid chromatography (LC)-MS, NMR spectra) are converted into a usable form for further analysis and biological interpretation. This chapter covers the common data preprocessing techniques used in metabonomics and is primarily focused on baseline correction, normalization, scaling, peak alignment, detection, and quantification. Recent years have witnessed development of several software tools for data preprocessing, and an overview of the frequently used tools in data preprocessing pipeline is covered.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Goodacre R, Vaidyanathan S, Dunn WB et al (2004) Metabolomics by numbers: acquiring and understanding global metabolite data. Trends Biotechnol 22:245–252
Craig A, Cloarec O, Holmes E et al (2006) Scaling and normalization effects in NMR spectroscopic metabonomic data sets. Anal Chem 78:2262–2267
Katajamaa M, Oresic M (2007) Data processing for mass spectrometry-based metabolomics. J Chromatogr A 1158:318–328
Castillo S, Gopalacharyulu P, Yetukuri L et al (2011) Algorithms and tools for the preprocessing of LC–MS metabolomics data. Chemometr Intell Lab Syst 108:23–32
Giskeodegard GF, Bloemberg TG, Postma G et al (2010) Alignment of high resolution magic angle spinning magnetic resonance spectra using warping methods. Anal Chim Acta 683:1–11
Pearce JT, Athersuch TJ, Ebbels TM et al (2008) Robust algorithms for automated chemical shift calibration of 1D 1H NMR spectra of blood serum. Anal Chem 80:7158–7162
Davis RA, Charlton AJ, Godward J et al (2007) Adaptive binning: an improved binning method for metabolomics data using the undecimated wavelet transform. Chemometr Intell Lab Syst 85:144–154
De Meyer T, Sinnaeve D, Van Gasse B et al (2008) NMR-based characterization of metabolic alterations in hypertension using an adaptive, intelligent binning algorithm. Anal Chem 80:3783–3790
Dieterle F, Ross A, Schlotterbeck G et al (2006) Probabilistic quotient normalization as robust method to account for dilution of complex biological mixtures. Application in 1H NMR metabonomics. Anal Chem 78:4281–4290
Goodacre R, Broadhurst D, Smilde A et al (2007) Proposed minimum reporting standards for data analysis in metabolomics. Metabolomics 3:231–241
van den Berg RA, Hoefsloot HC, Westerhuis JA et al (2006) Centering, scaling, and transformations: improving the biological information content of metabolomics data. BMC Genomics 7:142
Savorani F, Tomasi G, Engelsen SB (2010) icoshift: A versatile tool for the rapid alignment of 1D NMR spectra. J Magn Reson 202:190–202
Sugimoto M, Kawakami M, Robert M et al (2012) Bioinformatics tools for mass spectroscopy-based metabolomic data processing and analysis. Curr Bioinformatics 7:96–108
Ellinger JJ, Chylla RA, Ulrich EL et al (2013) Databases and software for NMR-based metabolomics. Curr Metabolomics 1, doi: 10.2174/2213235X11301010028
Kuhn T, Willighagen EL, Zielesny A et al (2010) CDK-Taverna: an open workflow environment for cheminformatics. BMC Bioinformatics 11:159
Hoffmann N, Stoye J (2009) ChromA: signal-based retention time alignment for chromatography-mass spectrometry data. Bioinformatics 25:2080–2081
Wang SY, Ho TJ, Kuo CH et al (2010) Chromaligner: a web server for chromatogram alignment. Bioinformatics 26:2338–2339
Izquierdo-Garcia JL, Rodriguez I, Kyriazis A et al (2009) A novel R-package graphic user interface for the analysis of metabonomic profiles. BMC Bioinformatics 10:363
Lommen A (2009) MetAlign: interface-driven, versatile metabolomics tool for hyphenated full-scan mass spectrometry data preprocessing. Anal Chem 81:3079–3086
Tautenhahn R, Patti GJ, Kalisiak E et al (2011) metaXCMS: second-order analysis of untargeted metabolomics data. Anal Chem 83:696–700
Smith CA, Want EJ, O’Maille G et al (2006) XCMS: processing mass spectrometry data for metabolite profiling using nonlinear peak alignment, matching, and identification. Anal Chem 78:779–787
Benton HP, Wong DM, Trauger SA et al (2008) XCMS2: processing tandem mass spectrometry data for metabolite identification and structural characterization. Anal Chem 80:6382–6389
Acknowledgments
The author thanks Tone F. Bathen, Guro F. Giskeødegård, and Maria D. Cao for their valuable suggestions and inputs while preparing the manuscript.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer Science+Business Media New York
About this protocol
Cite this protocol
Vettukattil, R. (2015). Preprocessing of Raw Metabonomic Data. In: Bjerrum, J. (eds) Metabonomics. Methods in Molecular Biology, vol 1277. Humana Press, New York, NY. https://doi.org/10.1007/978-1-4939-2377-9_10
Download citation
DOI: https://doi.org/10.1007/978-1-4939-2377-9_10
Published:
Publisher Name: Humana Press, New York, NY
Print ISBN: 978-1-4939-2376-2
Online ISBN: 978-1-4939-2377-9
eBook Packages: Springer Protocols