Abstract
One-dimensional 1H nuclear magnetic resonance (1D 1H-NMR) has been used extensively as a metabolic profiling tool for investigating urine and other biological fluids. Under ideal conditions, 1H-NMR peak intensities are directly proportional to metabolite concentrations and thus are useful for class prediction and biomarker discovery. However, many biological, experimental and instrumental variables can affect absolute NMR peak intensities. Normalizing or scaling data to minimize the influence of these variables is a critical step in producing robust, reproducible analyses. Traditionally, analyses of biological fluids have relied on the total spectral area [constant sum (CS)] to normalize individual intensities. This approach can introduce considerable inter-sample variance as changes in any individual metabolite will affect the scaling of all of the observed intensities. To reduce normalization-related variance, we have developed a histogram matching (HM) approach adapted from the field of image processing. We validate our approach using mixtures of synthetic compounds that mimic a biological extract and apply the method to an analysis of urine from rats treated with ethionine. We show that HM is a robust method for normalizing 1H-NMR data and propose it as an alternative to the traditional CS method.
Similar content being viewed by others
References
Craig, A., Cloarec, O., Holmes, E., Nicholson, J. K., & Lindon, J. C. (2006). Scaling and normalization effects in NMR spectroscopic metabonomic data sets. Analytical Chemistry, 78, 2262–2267.
Dieterle, F., Ross, A., Schlotterbeck, G., & Senn, H. (2006). Probabilistic quotient normalization as robust method to account for dilution of complex biological mixtures. application in 1H NMR metabonomics. Analytical Chemistry, 78, 4281–4290.
Gonzalez, R. C., & Woods, R. E. (2001). Digital image processing. Upper Saddle River, NJ: Prentice-Hall.
Lemmerling, P., Vanhamme, L., Romano, R., & Van Huffel, S. (2002). A subspace time-domain algorithm for automated NMR spectral normalization. Journal of Magnetic Resonance, 157, 190–199.
Lewis, I. A., Schommer, S. C., Hodis, B., et al. (2007). Method for determining molar concentrations of metabolites in complex solutions from two-dimensional 1H-13C NMR spectra. Analytical Chemistry, 79, 9385–9390.
Lindon, J., Holmes, E., & Nicholson, J. (2006). Metabonomics techniques and applications to pharmaceutical research & development. Pharmaceutical Resarch, 23, 1075–1088.
Rosenfeld, A., & Kak, A. C. (1976). Digital picture processing. Orlando: Academic Press.
Silverman, B. W. (1986). Density estimation for statistics and data analysis. London: Chapman and Hall.
The Mathworks, I. (2007). Matlab, Ver. 7.4.0.287 (R2007a). Natick, MA.
Torgrip, R. J. O., Lindberg, J., Linder, M., et al. (2006). New modes of data partitioning based on PARS peak alignment for improved multivariate biomarker/biopattern detection in 1H-NMR spectroscopic metabolic profiling of urine. Metabolomics, 2, 1–19.
Wu, W., Daszykowski, M., Walczak, B., et al. (2006). Peak alignment of urine NMR spectra using fuzzy warping. Journal of Chemical Information Modeling, 46, 863–875.
Acknowledgments
We thank Astra Zeneca for co-funding the BioSysteMetrics Group at Stockholm University and Dr. Frank Dieterle for the source code for the PQN algorithm. We also gratefully acknowledge Ian Lewis, Department of Biochemistry, University of Wisconsin-Madison, for sharing the Semi-Synthetic dataset. The spectra of the synthetic mixtures were collected at the National Magnetic Resonance Facility at Madison (NMRFAM; NIH grants P41 RR02301 and P41 GM GM66326).
Author information
Authors and Affiliations
Corresponding author
Appendix
Appendix
1.1 Algorithm parameters
1.1.1 Histogram matching
Full spectra are used. N = 60.
1.1.2 Probabilistic quotient normalization
Plant data, used interval 0.02–6.00 ppm. Urine data, used interval 0.02–10.0 ppm, the water peak region (4.65–4.95 ppm) was set to zero.
1.1.3 Normalization to CS
Plant data, used interval 0.02–6.00 ppm. Urine data, used interval 0.02–10.0 ppm, the water peak region (4.65–4.95 ppm) was set to zero.
Rights and permissions
About this article
Cite this article
Torgrip, R.J.O., Åberg, K.M., Alm, E. et al. A note on normalization of biofluid 1D 1H-NMR data. Metabolomics 4, 114–121 (2008). https://doi.org/10.1007/s11306-007-0102-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11306-007-0102-2