1 Introduction

Oxidative stress is commonly defined as a state in which the levels of reactive oxygen and nitrogen species, which oxidize lipids, proteins and nucleic acids in cells, and the relevant neutralizing substances are imbalanced (Opara 2006). The interest for studying the oxidative stress is mainly due to its probable role in many pathological conditions, for example in coronary artery disease and cancer (Delimaris et al. 2007; Serdar et al. 2006).

Human serum contains a variety of lipids of which the polyunsaturated fatty acids (PUFAs) in cholesterol esters, phospholipids, and triglycerides are subject to free radical-initiated oxidation (Niki 2009; Schneider 2009; Uchida 2003). The free radical-mediated lipid peroxidation proceeds by chain mechanism yielding conjugated dienic lipid hydroperoxides as primary reaction products (Niki 2009; Pierini and Punta 2006). These hydroperoxides can be reduced to corresponding hydroxy fatty acids or decomposed either into various aldehydes or, if the original fatty acid is arachidonic acid, into isoprostanes (Dotan et al. 2004; Mlakar and Spiteller 1996).

The methods to assess oxidative stress can be divided into three categories (Dotan et al. 2004). Firstly, there are assays that measure the concentrations of antioxidants (Mastaloudis et al. 2001; Sies and Krinsky 1995; Finckh et al. 1995; Dhariwal et al. 1990) or oxidation products of lipids (Roberts and Morrow 2000; Esterbauer et al. 1989), proteins (Fukagawa et al. 2001; Buss et al. 1997) and DNA (Shen and Ong 2000; Halliwell 2000). These methods require the measurements of several different compounds, which may exist in very low concentrations, be unstable or partly unknown. Secondly, there are various assays that assess the reductive capacity of body fluids, as well as the capacity of these fluids to inhibit oxidation (Re et al. 1999; Winston et al. 1998; Benzie and Strain 1996; Wayner et al. 1985). However, various antioxidants react differently in each assay, and thus, the interpretation of the results is not straightforward (Young 2001). Thirdly, there are assays that evaluate the oxidation susceptibility of lipids from serum or separated lipoproteins by exposing the sample under oxidative conditions (Delimaris et al. 2008; Smith et al. 2002; Schnitzer et al. 1995). The separation of lipoproteins is time consuming, and also some loss of water-soluble antioxidants may occur during the separation procedure. The methods measuring the oxidation susceptibility of whole serum are faster, and they also take the effects of serum antioxidants into account.

Oxidation susceptibility of lipids from whole serum is usually determined by initiating the oxidation with copper, which leads to the formation of conjugated dienic hydroperoxides that can be monitored spectrophotometrically since they absorb in the UV range (Delimaris et al. 2007; Schnitzer et al. 1998; Spranger et al. 1998). The result is expressed either as the lag time preceding oxidation, which describes the time before the maximal reaction rate of diene accumulation has been reached, or as the maximal rate of accumulation of absorbing products (Delimaris et al. 2008). However, the serum must be considerably diluted (up to 150-fold) to enable the spectrophotometric measurements of lipid oxidation products. Furthermore, also albumin and the other compounds of serum absorb in the UV range, thus, disturbing the detection of conjugated dienes (Schnitzer et al. 1995, 1998).

The objective of this work was to develop a method to study the oxidation susceptibility of serum lipids by proton nuclear magnetic resonance (1H NMR) spectroscopy. This measurement can be conveniently combined with the NMR measurement of the serum lipid profile (Tukiainen et al. 2008). Previously, NMR spectroscopy has been applied to study lipid peroxidation of low density lipoproteins (LDL) (Soininen et al. 2007; Corso et al. 1997; Lodge et al. 1995; Lodge et al. 1993), and the behavior of the signal arising from the bisallylic protons has shown to be a good measure for the peroxidation state of LDL particles (Lodge et al. 1995). However, NMR has not been used to study the oxidation susceptibility of lipids from whole serum.

The developed NMR oxidation assay was applied to serum samples from the study subjects who had consumed white (WC), dark (DC) or high-polyphenol (HPC) chocolate (Mursu et al. 2004) daily for 3 weeks, and the results were compared with the lag time values obtained using the spectrophotometric method. Random forest (RF) approach was used to find out the metabolic changes induced by the chocolate consumption and to assess the determinants for the oxidation susceptibility of serum lipids.

2 Materials and methods

2.1 Subjects and study setting

The oxidation method was developed using serum obtained from three healthy volunteers. The blood samples were drawn a few days before the first experiments, and the serum was stored at −70°C. The developed oxidation protocol was applied to the chocolate study including fasting serum samples from the subjects who had consumed 75 g of WC, DC or HPC chocolate daily for 3 weeks (Mursu et al. 2004). The blood samples were taken before and after the chocolate consumption. The serum samples of the chocolate study were stored at −70°C and not thawed before. The study population consisted of 45 nonsmoking volunteer men (n = 12) and women (n = 33) that fulfilled all the following criteria: (i) BMI < 32 kg/m2; (ii) no regular use of any drugs or supplements with antioxidative (β-carotene, vitamins C or E) or lipid-lowering properties; (iii) no chronic diseases such as diabetes, coronary heart disease, or other major illnesses; and (iv) willingness to consume 75 g of study chocolate daily for 3 weeks. The study protocol was approved by the Research Ethics Committee, Hospital District of Northern Savo, Finland.

2.2 Sample preparation

The samples to determine the oxidation susceptibility of serum lipids were prepared as follows. First, 0.3 ml of serum, 0.3 ml of NMR buffer and 0.3 ml of 0.15 M sodium chloride solution were combined in a 2 ml eppendorf tube, and the solution was warmed in an incubator at 37°C for 10 min. In the sample preparation, the same volumes of serum and NMR buffer were used as for the serum NMR metabonomics measurements (Soininen et al. 2009) so that these two protocols can be combined in future. Oxidation was initiated by adding 0.1 ml of 5, 10, 20 or 40 mM CuSO4 to the prewarmed solution to yield a copper concentration of 0.5, 1.0, 2.0 or 4.0 mM, respectively. To determine the initial concentration of PUFAs in each sample, also a reference sample without copper was prepared. The copper containing samples were incubated in an end-over-end mixer at 37°C for 1.5, 3, 4.5 or 6 h.

The oxidation reaction was terminated and the lipids were extracted by using a modified Folch method (Folch et al. 1957). Briefly, 10 ml of 1:2 (v/v) methanol/dichloromethane solution and 10 ml of 0.15 M sodium chloride solution were added to the incubated sample. The sample was mixed in a vertical shaker for 5 min and centrifuged (Hettich Rotina 46R, 2400×g, 10 min, at +4°C). The lower dichloromethane phase was recovered and the aqueous phase was extracted again with 5 ml of dichloromethane. The separated organic phases were combined and evaporated to dryness. The lipids were redissolved into 0.6 ml of CDCl3 containing 0.03% tetramethylsilane for chemical shift reference.

To evaluate the repeatability of the method, five replicate samples were prepared within period of 2 months in random order. The chocolate study samples were analyzed in triplicates (WC and HPC groups) or duplicates (DC group).

2.3 NMR spectroscopy

1H NMR spectra were recorded on a Bruker AVANCE 500 DRX (Bruker, Karlsruhe, Germany) spectrometer operating at 500.13 MHz equipped with a 5 mm QNP probe. Shimming and tuning of the samples were performed manually. The spectra were measured at 22°C, and a standard Bruker proton pulse sequence (zg) with a 90° pulse was employed. The number of scans was 64, the number of dummy scans 2, the spectral width 4.4 kHz, and the number of data points 64 K. The total acquisition time was 7.45 s with the delay time (d1) of 0.5 s.

The spectra were processed using PERCH NMR Software (Laatikainen et al. 2011). The measured free induction decays were multiplied by an exponential window function (LB = 0.5 Hz) to increase the signal-to-noise ratio.

2.4 Quantification of lipids and calculation of the NMR oxidation susceptibility

The areas of the lipid resonances were determined using lineshape fitting analysis (Tukiainen et al. 2008). The structures of some multiplets (for example, see Fig. 1), which were defined by the coupling constants, were used as constraints that enabled the quantitative analysis of severely overlapping peaks (Soininen et al. 2005). The signal areas of the bisallylic protons from PUFAs (APUFA) at 2.74–2.88 ppm were used to determine the NMR oxidation susceptibility. First, the difference between the amounts of PUFAs before (REF) and after (OX) the oxidation reaction was calculated. Then the difference was divided by the signal area of PUFAs from the spectrum of non-oxidized sample and converted to a percentage value ((APUFA(REF) − APUFA(OX))/APUFA(REF) × 100%). The obtained value describes the amount of oxidized PUFAs, and thus, the oxidation susceptibility of serum lipids. The signal areas were scaled so that the signal area of total cholesterol C(18)H3 protons was the same both in the spectra of non-oxidized and oxidized serum. It should be noted that many of the cholesterol oxidation products can be identified from 1H NMR spectrum since their C(18)H3 signals resonate at 0.61–0.69 ppm (Bradamante et al. 1992), next to the non-oxidized cholesterol C(18)H3 signals (0.676 and 0.678 ppm). Thus, it is important to take all these signal areas into account when calculating the total cholesterol amount that is used as a reference in the NMR protocol.

Fig. 1
figure 1

A 1H NMR spectrum of extracted serum with signal assignments (a). Some essential parts of the spectra (be) before and after the oxidation are shown at the top of the figure. The signals arising from the double bond protons (b) and the bisallylic protons from PUFAs (c) decrease during oxidation. There are also changes in the amounts of different fatty acids (d). Cholesterol oxidizes slightly under the conditions used and these oxysterol C(18)H3 signals resonate at 0.61–0.69 ppm. The signal areas are referenced to total cholesterol C(18)H3 signals including also the oxidized forms (e). EC esterified cholesterol, FA fatty acid, FC free cholesterol, PC phosphatidylcholine, PG phosphoglyceride, sat saturated, TC total cholesterol, TG triglyceride

2.5 Statistics

The results are expressed as means (±standard deviations). Changes between the baseline and end-point values within the study groups were tested by paired t-test. Means were compared across the study groups by the analysis of variance (ANOVA), and post hoc Tukey’s test was used when a statistically significant heterogeneity between the groups was shown by the ANOVA. Correlations were estimated by Pearson’s or Spearman’s correlation coefficients. Differences with P values of 0.05 or less were considered significant. SPSS software (version 14.0; SPSS, Inc., Chicago, IL, USA) was used for the statistical analyses.

Random forest approach was used to study the chocolate induced metabolic effects as well as to explore the descriptors for the oxidation susceptibility. This method, introduced by Breiman (Breiman 2001) and not common at the time of the original work (Mursu et al. 2004), allowed us to gain more insights into the previously obtained results. Random forests are ensembles of unpruned decision trees. For classification, the trees are combined using majority voting, one vote for each tree. For regression, predictions are generated by averaging over individual trees. In general, the performance of random forests compares favorably with modern machine learning methods such as support vector machines or neural networks (Hastie et al. 2008). Among multivariate methods, random forests are particularly resistant to overfitting, including only two adjustable parameters; the number of trees (Ntree) and the number of variables to be tried in each split (Mtry). Ntree is typically set at 500–2000, as a larger number will provide no extra gain. For Mtry, a value of sqrt(Nvar) is recommended for classification and Nvar/3 for regression; here Nvar is the number of variables. The optimization of Mtry usually provides a slight improvement only. Distinctively, random forests come with a build-in cross-validation using out-of-bag data (usually 30% of the samples). Herewith an estimate for the prediction error is achieved as a by-product, and so there is no actual need for cross-validation with separate train and test sets. There are also two heuristic ways of measuring variable importance in classification, mean decrease in classification accuracy and mean decrease in Gini index. These standard performance indices usually provide slightly different results, and there is no consensus opinion which one should be preferred. The reliability of the prediction can simply be estimated by counting the votes for each class—a sharp voting profile, favoring strongly a certain class, is usually a hallmark of reliability. For regression, the corresponding performance measures are increase in mean square error and increase in node purity. The calculations were performed with the R (R Development Core Team 2008) program package randomForest (Liaw and Wiener 2002).

3 Results and discussion

3.1 Development and optimization of the NMR oxidation assay

The oxidation of PUFAs can be seen from a 1H NMR spectrum of extracted serum by following the signal arising from the bisallylic protons at 2.74–2.88 ppm that decreases when the oxidation reaction proceeds (Fig. 1). In addition, the signals from the double bond protons (5.30–5.45 ppm) as well as the ω-3 (0.98 ppm) and ω-6 and ω-7 fatty acid methyl proton signals (0.89 ppm) decrease during oxidation whereas the ω-9 and saturated fatty acid signal (0.88 ppm) increases (Fig. 1).

Several copper concentrations (0.5, 1.0, 2.0 and 4.0 mM) as well as incubation times (1.5, 3, 4.5 and 6 h) were tested in order to find out the copper concentration and the time point that yield maximal amount of information about the oxidation susceptibility of serum lipids as well as the largest variance between individuals. As can be seen from Fig. 2a, the oxidation reaction proceeds rapidly when copper concentrations of 1–4 mM are used. The copper concentration of 0.5 mM provides milder oxidation conditions and the oxidation proceeds slower, which also results in larger variance between individuals. The Fig. 2b shows that the largest variance between the percentage values describing the amount of oxidized PUFAs of the three subjects is obtained using incubation time of 6 hours. In addition, at this point, the speed of the oxidation reaction has slowed down allowing slight differences in the incubation times without affecting the results.

Fig. 2
figure 2

The amounts of oxidized PUFAs during the copper induced oxidation with four different copper concentrations (a) and with a copper concentration of 0.5 mM for three subjects (b). Each data point in b is a mean of five determinations measured within period of 2 months

The repeatability of the assay was studied by oxidizing five replicate serum samples from three volunteers within period of 2 months in random order (Fig. 2b). At the time points 1.5 and 3 h, the deviations in the oxidation susceptibility range between 0.4 and 2.8 percentage units. At the later time points (4.5 and 6 h), the standard deviations are within 0.5 and 4.3 percentage units.

3.2 Application of the NMR oxidation assay to chocolate study samples

The optimized oxidation assay (0.5 mM CuSO4 and 6 h incubation) was applied to fasting serum samples from the subjects who had consumed 75 g of WC, DC or HPC chocolate for 3 weeks. For the samples, several factors, including the oxidation susceptibility of serum lipids determined spectrophotometrically, were determined earlier (Mursu et al. 2004).

The amounts of oxidized PUFAs before and after the chocolate consumption for each of the study subject are illustrated in Electronic Supplementary Fig. S1. The NMR oxidation assay showed that the oxidation susceptibility of serum lipids decreased in the HPC group (P = 0.016) after the chocolate consumption but there were no significant changes in the DC (P = 0.407) or WC (P = 0.758) groups. The change in the NMR oxidation susceptibility was significantly different between the WC and HPC groups (P = 0.031). As can be seen from Table 1 and Electronic Supplementary Fig. S1, the baseline values of the HPC group are lower than in the WC and DC groups. Thus, the analyses were also performed by adjusting the baseline values of the variables. However, the changes between the groups remained significantly different even after the adjustment (P = 0.043).

Table 1 The NMR oxidation susceptibility (%) and the lag time (min) before (baseline) and after (change) the consumption of study chocolates for 3 weeks

3.3 Correlation between the NMR oxidation assay and the spectrophotometric method

The baseline and end-point oxidation susceptibility values determined by 1H NMR were plotted against the corresponding lag time values, and the strong correlation between these two methods (Spearman’s correlation coefficient −0.825, P < 0.001) is illustrated in Fig. 3. If a sample is very susceptible to oxidation, the lag time is short. By using the NMR method, this is seen as a high amount of oxidized PUFAs. However, even though the baseline and end-point values of the spectrophotometric and NMR method correlate, the change values (end-baseline), which are used in the ANOVA analysis, do not correlate (Pearson’s correlation coefficient −0.158, P = 0.317), and only the NMR oxidation assay reveals significant differences between the chocolate groups.

Fig. 3
figure 3

The amounts of oxidized PUFAs after the copper induced oxidation determined by the 1H NMR method plotted against the corresponding lag time values of the oxidation monitored spectrophotometrically from the chocolate study samples. The regression line is y = −0.2665x + 100.4, r 2 = 0.7308

The different results obtained by the NMR and spectrophotometric assays can be, at least partly, explained by the different settings of these experiments. Firstly, the spectrophotometric method measures the time before the maximal reaction rate of diene accumulation has been reached, and the NMR method gives the amount of oxidized PUFAs after 6-h oxidation. Thus, the NMR method observes the overall oxidation reaction considering both the early and later stages of the reaction whereas the spectrophotometric assay focuses on the early stages of the oxidation reaction. Secondly, there are different incubation temperatures (37°C for the NMR protocol and 30°C for the spectrophotometric method), which have been shown to affect to the kinetics of LDL oxidation (Ramos et al. 1995). Also different copper-to-serum ratios (0.5 mM/300 μl for the NMR method and 0.05 mM/13.4 μl for the spectrophotometric assay) may have some effect on the oxidation reactions. Thirdly, the NMR method is more specific since there are no disturbing signals in the spectrum overlapping with the signals arising from PUFAs whereas in the spectrophotometric assay also albumin and cholesterol oxidation products, mainly 7-ketocholesterol, absorb at the same wavelength as conjugated dienes (Schnitzer et al. 1998).

3.4 Variables used for the random forest analyses

Random forest method was used to study the chocolate consumption induced metabolic changes as well as to assess the determinants for the oxidation susceptibility of serum lipids. The variables included into the analyses consisted of previously measured variables (Mursu et al. 2004) and a total of 23 lipid variables obtained by NMR, which are presented in Table 2. Some of the variables were determined using two different methods. For example, the concentrations of triglycerides, linoleic acid and docosahexaenoic acid measured using the NMR method and an enzymatic colorimetric test or gas chromatography were comparable; the correlations between the concentrations determined with the two different methods were very strong (correlation coefficients 0.964, 0.966 and 0.959, respectively, P < 0.001 for all).

Table 2 The variables included into the RF analyses

3.5 Metabolic profiling

The chocolates administered had similar amounts and profiles of fatty acids, excluding behenic acid, but differed in the amounts of cocoa and polyphenols (mainly catechin, epicatechin and procyanidins). The cocoa content of the DC and HPC chocolates was rather similar but the HPC chocolate contained 557 mg polyphenols in 100 g chocolate whereas the corresponding amount in the DC chocolate was 366 mg.

The RF method allowed clustering of the WC, DC and HPC groups with six variables (Fig. 4), and the out-of-bag error was 22%. Into the analysis, the change (end-baseline) values of the variables shown in Table 2, except behenic acid, were included. Since the HPC chocolate contained tenfold amount of behenic acid (0.35 g/100 g chocolate) compared with the WC (0.048 g/100 g chocolate) and DC (0.041 g/100 g chocolate) chocolates, the increase in behenic acid concentration after the consumption of the HPC chocolate is not a consequence of endogenous metabolic effects. The inclusion of behenic acid into the analysis would elevate it to the most important variable causing the clustering (Electronic Supplementary Fig. S2), but, at the same time, it would prevent the discovery of the other metabolically more relevant variables.

Fig. 4
figure 4

RF clustering of the chocolate study samples [WC (W), DC (D) and HPC (P) groups] with the change variables shown in Table 2 excluding behenic acid. The clustering was obtained with six variables and the variable importance measures, mean decrease in accuracy and mean decrease in Gini index, are shown for each of these variables. 24:1 nervonic acid; 22:6 docosahexaenoic acid, PC phosphatidylcholine, SM sphingomyelin, 14:0 myristic acid

The variable profiles for each chocolate group are shown in Fig. 5, and it appears that the HPC group has different variable profiles than the WC and DC groups. These results suggest that the consumption of the HPC chocolate induces different changes to the amounts of HDL, nervonic acid, docosahexaenoic acid, phosphatidylcholine, sphingomyelin and myristic acid than the consumption of the WC or DC chocolates. Since the main difference between the study chocolates was the amount of polyphenols, it is likely that they have induced the metabolic changes in the HPC group but on the grounds of this study it is impossible to identify the mechanisms of action.

Fig. 5
figure 5

The profiles of the variable changes for the WC (white), DC (light grey), and HPC (dark grey) groups. The values are expressed as standard deviation units from the mean of the whole study population. 24:1 nervonic acid, 22:6 docosahexaenoic acid, PC phosphatidylcholine, SM sphingomyelin, 14:0 myristic acid

Previously, the effects of cocoa and chocolate have been studied in cell-cultures, in laboratory animals and in humans, and the results are reviewed by Rimbach and co-workers (Rimbach et al. 2009). There have been a variety of study settings, and also the outcomes are diverse. The effects obtained from human interventions include, e.g., decrease in LDL oxidation (Mathur et al. 2002), decrease of blood pressure (Faridi et al. 2008), increased HDL concentration (Baba et al. 2007), improved endothelial function (Schroeter et al. 2006), and increased insulin sensitivity (Grassi et al. 2005). Also, there are studies with rats which have shown that flavonoids can act by modulating fatty acid metabolism (Dihal et al. 2008; de Boer et al. 2006; Ohta et al. 2006). For example, quercetin, which is also present in cocoa (Sanbongi et al. 1998), up-regulates genes involved in fatty acid catabolism and lowers the fatty acid levels in plasma (de Boer et al. 2006).

3.6 Determinants of oxidation susceptibility of serum lipids

Random forest approach was also used to assess the determinants for the oxidation susceptibility of serum lipids for the chocolate study samples. The analysis included the baseline and end-point values of the variables (Table 2). The correlation coefficients between the baseline and end-point variables (VARBASE+END) and the corresponding NMR oxidation susceptibility values (NMROXBASE+END) as well as between the variables (VARBASE+END) and the lag time values (LAGBASE+END) are shown in Fig. 6. Due to the inverse correlation of the values of the NMR oxidation method and the spectrophotometric assay, the signs of the correlation coefficients are opposite for these two methods. To ease the comparison, the signs of the coefficients of the spectrophotometric method were inversed. As can be seen from Fig. 6, the correlation coefficients for the NMR oxidation method and the spectrophotometric assay are very similar and most of the correlations are relatively weak (less than |0.4|). The NMROXBASE+END values have the strongest correlations with gamma-glutamyl transferase (γ-GT), arachidonic acid and hemoglobin (Spearman’s correlation coefficients −0.498, 0.477 and −0.449, respectively, P < 0.001 for all) whereas the LAGBASE+END values correlate strongest with phosphatidylcholine, HDL and γ-GT (Spearman’s correlation coefficients 0.544, 0.528 and −0.525, respectively, P < 0.001 for all). For example, the positive correlation coefficient of arachidonic acid means that the NMR oxidation susceptibility increases when the amount of arachidonic acid increases. For the spectrophotometric method, the high amount of arachidonic acid corresponds to a short lag time value.

Fig. 6
figure 6

The correlations between the variables (VARBASE+END) (shown in Table 2) and the NMR oxidation susceptibility (NMROXBASE+END) values (black) and the lag time (LAGBASE+END) values (grey). Since the NMR oxidation susceptibility values and the lag time values are inversely correlated, the inverse correlation coefficients for the spectrophotometric method are presented in order to ease the comparison. P-values are indicated with one (0.001 < P ≤ 0.05) or two (P ≤ 0.001) asterisks

The purpose of the RF analysis was to find out if the oxidation susceptibilities could be explained with the determined variables (Table 2). However, the RF analysis was not straightforward. As can be expected from the relatively weak correlations (Fig. 6), the performance of the regression model is weak and the picture of the mutual importance of the variables remains somewhat unclear since it may change from case to case considerably. The reason for this behavior is that the differences in importance are usually very small for most of the variables. This, together with the small number of samples (only 15 for each chocolate group), tends to make the RF regression procedure unstable. In these conditions the proper variable selection is not a trivial task.

In order to cope with this problem, the following strategy was applied. First, we derived a large number of RF models (n = 200) for both the NMROXBASE+END and LAGBASE+END values, and then the importance values were scaled to the maximum and the data sets were combined. After that an average importance was calculated for each variable (Fig. 7), and the variables with the highest average importance values were selected for the RF models trying to explain the NMROXBASE+END and LAGBASE+END values.

Fig. 7
figure 7

Average variable importances for the variables shown in Table 2. The numbering on the x-axis follows the numbering of the variables in Table 2

RF regression was able to explain 60% of the NMROXBASE+END values and 62% of the LAGBASE+END values with nine variables (Fig. 8). Most of these descriptors are lipid variables including arachidonic acid, docosahexaenoic acid, docosapentaenoic acid, phosphatidylcholine, HDL, choline containing phospholipids and palmitic acid. Previously, the decreased oxidation susceptibility has been related to low amounts of PUFAs (Eritsland 2000; Bonanome et al. 1992), and also here, arachidonic and docosahexaenoic acids are positively correlated with the oxidation susceptibility (Fig. 6). The amount of docosapentaenoic acid acts differently but it is a less abundant fatty acid in serum [approximately 0.2 mol% from all serum fatty acids (Hodson et al. 2008)], and thus, has less effect on overall oxidation susceptibility than arachidonic and docosahexaenoic acids. As can be seen from Fig. 6, the increased oxidation susceptibility is related to increased amounts of phosphatidylcholine and total choline containing phospholipids. This high oxidizability of these phospholipids is probably due to the fact that they contain PUFAs at their sn-2 position of the glycerol backbone. Also the correlation of high HDL concentrations with high oxidation susceptibility (Fig. 6) is in line with the previous findings (Shimonov et al. 1999; Nyyssönen et al. 1997). HDL particles contain high amounts of lipid hydroperoxides (Nyyssönen et al. 1997), and thus, shorten the lag time and increase the amount of oxidized PUFAs.

Fig. 8
figure 8

Increase in mean square error (IncMSE) (%) and increase in node purity (Inc node purity) values for each variable obtained from the RF regression when modeling the oxidation susceptibility values (NMROXBASE+END explained with VARBASE+END) and the lag time values (LAGBASE+END explained with VARBASE+END). 16:0 palmitic acid, 20:4 ω-6 arachidonic acid, 22:5 docosapentaenoic acid, 22:6 docosahexaenoic acid, HB hemoglobin, PC phosphatidylcholine, tot chol total choline phospholipids

In addition to the lipid variables, also hemoglobin and serum γ-GT are descriptors of the NMROXBASE+END and LAGBASE+END values, and they are inversely correlated with the oxidation susceptibility (Fig. 6). It has been shown that hemoglobin can catalyze various oxidative reactions (Alayash et al. 2001). Since hemoglobin containing red blood cells have been removed from the serum samples, the possible oxidative reactions have been occurred prior to the blood separation. Elevated serum γ-GT activity has usually been used as a marker for excessive alcohol consumption but it has also been suggested a marker for oxidative stress (Lee et al. 2004). However, our findings propose opposite. There is evidence of the antioxidative role of cellular γ-GT but the relationship between serum γ-GT and cellular γ-GT is not known (Lee et al. 2004). Serum γ-GT may also be an indirect marker of some other mechanism related to oxidation susceptibility.

The variance importances for the descriptors of the RF regression models for the NMROXBASE+END and LAGBASE+END values are shown in Fig. 8. Arachidonic acid and docosahexaenoic acid seem to be more important determinants for the NMROXBASE+END values than for the LAGBASE+END values, whereas phosphatidylcholine and HDL have higher variable importance values in the model explaining LAGBASE+END values. Since the NMR oxidation protocol measures the amount of oxidized PUFAs, it is logical that the PUFA variables are highlighted among the descriptors.

Previously, the determinants of serum lipid oxidation resistance have been studied by multivariate-regression analysis (Nyyssönen et al. 1997). In that study, vitamin C and urate were proposed to be the strongest determinants of plasma antioxidant capacity (Nyyssönen et al. 1997). Also, α-tocopherol, albumin and HDL concentration have been shown to have a strong impact on serum lipid resistance to oxidation (Nyyssönen et al. 1997). Among our RF results, only HDL cholesterol of these was recognized as a determinant of oxidation susceptibility. However, considering that in our data serum urate and albumin concentrations were lacking and that the vitamin C and α-tocopherol values were based on a 4 day food record, the explanation percentages of the models trying to explain the NMROXBASE+END and LAGBASE+END values (60 and 62%, respectively) are quite high. We also suggest that both the NMR oxidation and spectrophotometric methods yield invaluable measures that contain information on the metabolic state that is related to oxidation susceptibility, which can vary greatly between individuals.

4 Concluding remarks

In this study, a repeatable NMR-based protocol for studying the oxidation susceptibility of serum lipids was developed and assessed. In addition to the oxidizability of serum lipids, the NMR protocol yields vast amount of metabolic descriptors, if combined with the NMR metabonomic analysis of serum. The NMR protocol was applied to the serum samples of the chocolate study, and the oxidation susceptibility decreased significantly in the HPC group. This could not be detected by using the spectrophotometric method in the original study (Mursu et al. 2004), even though the values obtained by these two oxidation methods correlated (r 2 = 0.73). The random forest analysis revealed that the consumption of the HPC chocolate induced changes to the amounts of HDL, phosphatidylcholine, sphingomyelin, and nervonic, docosahexaenoic and myristic acids. Furthermore, the RF analysis was able to explain about 60% of the oxidation susceptibility values with nine descriptors, including arachidonic, docosahexaenoic, docosapentaenoic and palmitic acids, gamma-glutamyl transferase, hemoglobin, HDL, phosphatidylcholine and choline containing phospholipids. Since the administered chocolates had similar amounts and profiles of fatty acids, except behenic acid, the polyphenols could have altered the metabolism of the study subjects and induced the changes to the oxidation susceptibilities.