Background

There are two core methods to explore human brain function: direct measurement of the electrical activity, as with the electro-encephalogram (EEG), or measurement of the vascular response that is indirectly related to the neuronal activity, as in functional MRI (fMRI) [1]. Because both approaches have complementary advantages, attempts have been made to fuse the high temporal resolution of EEG, with the high spatial resolution of fMRI. Although animal data have made it possible to elucidate some of the relationships between neuronal activity and the Blood Oxygen Level-Dependant contrast (BOLD) [25], there is much left to be worked out about the actual relationship between EEG and fMRI. This paper aims at presenting some data showing, through differential reactivity, that not all kinds of EEG responses might be related to the BOLD contrast.

It has been reported that variations in BOLD contrast in response to the presentation of two kinds of rare events, i.e. oddballs, presented a differential reactivity opposite to that observed using Evoked Potentials (EPs). Oddballs are known to evoke a P300, a positive EP around 300–400 ms [6]. EPs are computed by averaging the EEG signal time-locked to the presentation of a stimulus. Activity time-locked to the stimulus emerges from background EEG activity, which is not time-locked and averages to zero [7]. The P300 is mainly perceived on the medial electrodes (Fz, Cz, Pz), although it originates from the temporo-parietal junction and the inferior frontal gyrus [810]. When an oddball has to be detected, i.e. a Target, this potential is of lower amplitude than when it is presented as an event unrelated to the task, i.e. a Novel [6, 11, 12]. On the other hand, there are fMRI studies in which no Novel-related activation has been reported although the temporal-parietal junction and the inferior frontal gyrus were activated by Targets [1316].

This study attempts to replicate these findings using combined EEG and fMRI measurements so that both recordings relate to the same brain activity [17, 18]. Moreover, Event-Related Oscillations (EROs) were computed in order to reveal brain responses not necessarily time-locked to the stimulus. EROs are computed by averaging the time-frequency decomposition of each peristimulus signal. Accordingly, a small jitter in the initiation of an oscillatory response will not average to zero, which is especially valuable in the high-frequency components [1921]. EROs in response to auditory oddballs have been described in the gamma band (20–100 Hz), mainly in the 37–40 Hz interval, 360 ms after a Target is presented [2225]. But the reactivity of EROs to Novels remains to be assessed.

Five subjects were presented with Targets (the letter "X") and Novels (various pictures) mixed with frequent distractors (the letter "O") while EEG and fMRI data were simultaneously recorded. Results are provided as a fixed effect one-tail t-test, and as multi-subject conjunction analysis to ascertain that all the subjects did present the same differential reactivity. The present study replicates the differential reactivity of EPs and BOLD contrast, with respect to Targets and Novels: the P300 was of significantly greater amplitude in response to Novels as opposed to Targets. Oppositely, Targets elicited much more fMRI activation than Novels. In accordance with fMRI results, Target-related gamma oscillations were more intense than their Novel-related counterparts, around 300 ms. The reasons for such opposite differential reactivity between oscillations / fMRI on the one hand, and evoked potentials on the other, are discussed below. Our results support evidence demonstrating that the BOLD signal is better correlated with high-than with low-frequency oscillations.

Results

Behavioral results

the subjects performed the task with an accuracy rate of 98 ± 2%, with a response time of 474 ms ± 71 ms.

fMRI results

The Target condition is related with activations especially in regions said to be the sources of the P300: the inferior frontal gyrus bilaterally and the right temporo-parietal junction (posterior supramarginal gyrus). Other regions were also activated: the right antero-basal frontal region (BA 10), the anterior cingulate and the supplementary motor areas, bilaterally, the inferior parietal lobule and intraparietal sulcus, and the left central sulcus and thalamus (pulvinar and dorso-median nuclei) (table 1 – see Additional file 1, fig. 1). Deactivated regions in the Target condition, were the superior frontal sulcus bilaterally, the left inferior frontal gyrus, the right posterior insula, the precuneus and the posterior cingulate bilaterally, the posterior part of the superior temporal sulcus, and infero-temporal areas (table 1 – see Additional file 1, fig. 1).

Figure 1
figure 1

fMRI resultsLateral views of the normalized brain of 1 subject, colored as a function of contrast: Target-related activation (red and "X" marks), Target-related deactivation (green and "no-smoking" marks), and Novel-related activation (blue) (thresholds p ≤ 0.001, 100 voxel). The network that deactivated on target presentation comprised the superior frontal sulcus (SFS), the parietal cortex and the inferior frontal sulcus (IFS). The latter is supposed to be related with distractor inhibition since its posterior part is over-activated by Novels together with the posterior parietal cortex (PP). The network activated by targets comprised the supra-marginal gyrus (SMG) and the inferior frontal gyrus (IFG). Note that the TPJ is composed of the SMG (BA 40) in its upper part, and of the superior temporal gyrus (BA 39) in its lower part. The peristimulus BOLD signals are displayed for each relevant region for Targets (red), Novels (blue) and frequent distractors (gray). The curves are computed by simple averaging after regressing the other condition effect and removing high- and low-frequency components. The variation in signal intensity is indicated as a percentage of the MRI signal, and the scale is similar for all except the parietal area. Notice the balance between the anterior cingulate area (ACA) and the posterior cingulate area (PCA). Those charts also exclude the possibility for a threshold effect to account for the absence of Novel-related activation of the EXO network.

The Novel condition induced activation in the left inferior frontal sulcus and the occipital regions bilaterally extending to the left lateral temporal and posterior parietal areas (table 1 – see Additional file 1, fig. 1). The graphs in figure 1 represent the peristimulus BOLD signal for each condition in relevant regions after regressing the effect of the other conditions. The absence of Novel-related activation in the regions said to be the sources of the P300, as shown by peristimulus BOLD responses, allowed to ascertain that this observation, was neither due to a threshold effect nor to an inappropriate statistical analysis model.

EEG results

The N200 and P300 potentials were recorded for both Target and Novel conditions on the medial electrodes (Fz, Cz, Pz), with extension on both centrals (C3, C4). Significant differences in amplitude were found at the local, time-cluster and global levels, that were even greater in the Novel than in the Target condition for Cz, Pz, C3 and C4 (fig. 2). This was evidenced in all the subjects on electrode Cz, with a conjunction probability of p = 0.000025 (see table 2 for subject by subject results at 400 ms).

Figure 2
figure 2

Evoked potentials and event-related oscillations for each electrode and for all the subjects. The upper part presents three different views of the average electrode position relative to the areas with BOLD activation (threshold p ≤ 0.001, 100 voxels, Target-related activation – red, Target-related deactivation – green, and rare Distractor (Novel) activation – blue). It is provided for better appreciation of electrical signals, considering fMRI activation. The lower part displays the evoked potentials (EPs) and event-related oscillations (EROs) for each electrode, for the 5 subjects. For C3, Cz, C4, Pz, the "no-smoking" symbol, which is an instance of Novels (blue), yielded larger EPs than Targets (red). Distractors are represented in gray. For all the electrodes, the reverse is true as far as EROs are concerned, with significantly more oscillations around 300–400 ms, regarding Targets. In the case of EPs, signals have been plotted from -325 to +625 ms and the voltage range was kept constant from -9 to +9 μV. The shaded region around 474 ms corresponds to the response time and its standard deviation. The statistics for the comparison of Target-related and Novel-related EPs are computed every 10 ms, between 100 ms and 600 ms, and represented by the gray and black stars above the curves (1 star for pnc ≤ 0.05, 2 starsfor pnc ≤ 0.01). The time-cluster significance, corrected for multiple comparison is indicated by the color of the stars: black if above pc ≤ 0.05, gray if not significant at the corrected time-clusterlevel. The set level is given for each electrode by the number of stars beside the electrode label (1 star for p ≤ 0.05, 2 stars for p ≤ 0.01). For EROs, only the statistics comparing Targets and Novels is given for the 24- to 44-Hz band, and between -320 to 600 ms. The solid and dotted black lines correspond to the response time and standard deviation, respectively. The color code for the statistics is shown on the upper right hand side, and represents the cumulative p distribution stated in percent. Cold colors code for a higher Novel-related power level, and hot colors code for greater Target-related power intensity. Regions above the .05 and .01 thresholds are contoured in black (notice that the numbers 1, 5, 95 and 99 represent the cumulative p values stated in percent).

Table 2 Subject by subject EPs and EROs results

EROs also presented a statistical difference between the Target and Novel conditions, but in an opposite manner. Figure 2 shows the statistical comparison map between Targets (red) and Novels (blue). Targets evoked a greater increase in power than Novels between 200 and 500 ms in the 32-Hz band in F7, Cz, C3, C4 and Pz, and in the 34-Hz band in Fz and F8. Figure 3 (top) shows the time frequency chart for all the electrodes, for Targets (right) and Novels (left), with color-coded standard deviation relative to the reference distribution (-320 to 0 ms). There was a greater difference between the Target and Novel conditions in the 200 to 500 ms interval for the 32- to 38-Hz band. This was confirmed by the statistical map used to compare the two conditions (fig. 3 lower part), with two significant time-frequency clusters between 200 and 500 ms at 32 Hz (p = 0.002) and 38 Hz (p = 0.001) and a global significance of p < 0.0005. There was no significant time-frequency point for the reverse statistics (Novels > Targets). Subject-by-subject conjunction analysis produced the same results, with a multi-subject conjunction probability of p = 0.0014 on electrode Cz at 38 Hz and 320 ms (see table 2 for subject by subject results). It could be argued that EROs were found to be larger in the Target condition, relative to the Novel condition, owing to the limited frequency range explored, so that we performed further analysis in the 25- to 200-Hz range, every 5 Hz. Between 100 and 400 ms, and considering an uncorrected p ≤ 0.01 for report, Novels failed to present larger EROs than Targets, whereas the latter presented larger EROs at 35, 95, 110 and 140 Hz (additional material available on request).

Figure 3
figure 3

Event-related oscillations for all the electrodes.The upper part presents the event-related oscillations (EROs) time-frequency chart for Targets on the left (X-mark), and rare distractors (Novels) on the right ("no-smoking" symbol). It is shown for the 24- to 44-Hz band, and between -320 to 600 ms, summing up all 9 electrodes for all the subjects. The color scale represents the relative amount of power in standard deviation (reference period from -320 to 0 ms). This illustrates that the 32- to 38-Hz EROs in response to Targets are quite low in response to Novels. It thus does not come as a surprise that the statistical comparison of Target-related vs. Novel-related power for all electrodes is significant (lower time-frequency chart). Cold colors code for a trend towards greater power in the Novel condition, whereas hot colors represent greater power level in the Target condition. The scale represents the cumulative p distribution stated in percent. Regions above the .05 and .01 threshold are contoured in black. The numbers 1, 5, 95 and 99 represent the cumulative p value in percent, respectively equivalent to.01, .05 for Novels and .05, .01 for Targets. The solid and dotted black lines correspond to the response time and standard deviation, respectively. The result table provides the statistics for the time interval from 200 to 500 ms, and from 24 to 44 Hz, using a threshold p ≤ 0.01. Volume and time-frequency cluster p values are given corrected for multiple comparisons. Note that the ensemble statistics represent the probability to have the given amount of time-frequency points above the threshold (12 over 165 time-frequency points), and not the number of clusters as in EPs or fMRI. At the same threshold, there were no significantly larger power emissions in the Novel than in the Target condition (not even at the .05 threshold).

Discussion

After confronting these results with those of prior reports, we propound physiological hypotheses to account for this discrepancy between EPs and fMRI/EROs.

In accordance with previous experiments, Novels yielded a P300 of significantly greater amplitude than Targets [6]. The relatively low amplitude of the P300, about 8 μV, compared to the expected 10 to 20 μV [6], might well be related to the use of a common reference [26]. However, we cannot rule out the possibility that part of the P300 could have been regressed with the electro-cardio-balisto-graphic artifact in some trials. The principal component of the pulse artifact looks like a damped oscillation around 7.5 Hz [27] (Otzenberger et al. submitted) which includes part of the theta component of the P300 [6, 28]. However discarding trials in which the largest part of the pulse artifact covered the 300 to 400-ms post-stimulus period in two subjects did not modify their results. Even so, the pulse artifact should equally affect Target- and Novel-related P300 without affecting the difference between them.

Our fMRI results are also in line with the literature [13, 14, 16, 2936]. The putative sources of the P300, the temporal-parietal junction and the inferior frontal gyrus [8], were only activated in the Target condition by all the subjects. Novels activated the left inferior frontal sulcus that was part of a network deactivated by Targets. This region might well support response inhibition [3744].

Our EROs results not only nicely reproduce results reported in previous studies, regarding the frequency (38 Hz) and the delay (360 ms) [2225], but go one step further by showing that over a large frequency range, and between 100 and 400 ms, Targets triggered larger EROs than Novels. Accordingly, the differential reactivity of Target- and Novel-related EROs was in line with that of fMRI. An important limitation of this study however, is that, owing to the limited number of electrodes, we were unable to ascertain whether EROs arose from the same sources as EPs. Yet, the differential effect seems to take place on the same electrodes, as if the dipoles had the same scalp projection. Subsequent intra-cranial or inverse solution experiments are required to establish this point.

From a cognitive point of view, this discrepancy between EPs on one hand and EROs and fMRI on the other hand, may originate from several, possibly interacting, processes: i) different processing for Targets and Novels, related with differences in attentional requirement, ii) different visual processing for the letter "X" and pictures, iii) a different susceptibility to stimulus repetition. Further studies are required to clarify this point, but the interesting fact is that EPs react in an opposite way to fMRI, whereas EROs responds in the same way.

Such discrepancy between EPs and the fMRI signal may not be unique. There is at least one other condition where EPs and fMRI may not present the same differential reactivity. In episodic memory, the contrast between correct recognition of old items vs. correct rejection of distractors activates the left frontal lobe in fMRI [4547]. On the other hand, performing a source analysis based on EPs from EEG and magneto-encephalographic (MEG) data fails to evidence activity in this region [21]. Interestingly enough, in this last study, the time frequency analysis showed a difference in the gamma range projecting on the left fronto-temporal electrodes / MEG sensor position [21].

Our study provides direct evidence that gamma oscillations better match the BOLD signal than EPs do. This observation is well in line with other works on animals. Logothetis and co-workers successfully recorded the BOLD response together with the equivalent of local field potentials in the visual cortex of anesthetized monkeys. They observed that the best correlation was achieved when taking frequencies in the gamma range [5, 48].

The reason why EROs seem to correlate better with the BOLD signal than EPs do, remains, as yet, uncertain. The EEG and BOLD signals both reflect synaptic activity [25, 49, 50]. But EEG and fMRI integrate synaptic activity over different time windows [25]. The former reflects synchronous firing of synapses within milliseconds [49, 50]. On the other hand, fMRI represents the integration of synaptic activity over several hundreds of milliseconds [1]. Thus a small jitter of the neuronal response relative to a stimulus e.g. tens of milliseconds, will dramatically affect the averaging of EPs [20] but will only have a limited impact on the averaging of BOLD responses. On the other hand, EROs do not require to be time-locked to the stimulus so precisely, and will average well despite such jitter [20]. Taking this jitter into account might even become crucial as one considers higher cognitive tasks. With the involvement of decisions requiring longer response times, uncertainty about the time of involvement of some areas might well increase [21]. This however only provides partial explanation, as there is no reason to believe that the jitter affecting Target-related P300 should exceed that affecting Novel-related P300.

A second reason could be related to the fact that EEG and BOLD signals reflect the activity of different cell populations. EEG reflects the synaptic input function of pyramidal cells only, i.e. post-synaptic potential, either excitatory or inhibitory [49, 50], whereas fMRI reflects the synaptic activity of all neural cells [25]. There is no particular reason for EPs to rely on anything else than pyramidal cell synaptic activity, but it has been demonstrated that synchronous oscillations require the involvement of inhibitory interneurons together with pyramidal cells [51, 52]. Accordingly, EPs and EROs might reflect different kinds of neuronal activity, and the BOLD signal might be more sensitive to EROs because of the larger amount of contributing cells and synapses. Animal works illustrate how those two kinds of activity emerge in a task similar to the one we used. If a behaving monkey is presented with an object, but attends to another (analogous to the Novel condition), one only observes a phasic rise in spiking activity lasting for about 50 ms that could well go with an EP on the scalp [53, 54]. But if the object attended to is presented (analogous to the Target condition), the phasic rise is followed by gamma oscillations for 300 ms [55, 56].

Last, it must be noted that positive correlation with the BOLD signal seems only to apply to fast oscillations (> 15-Hz), whereas slow waves, below 12 Hz, were reported to be negatively correlated with the fMRI signal [57] (Foucher et al. submitted). This should not be surprising as low and high frequencies have been described to correlate negatively [58]. The discrepancy observed between EROs and EPs, might be considered not so much as a matter of whether activity is phase-locked or not, but rather as a matter of frequency. Should this be the case, the P300, which ranges in the theta frequency spectrum (4-8Hz) [6], may not be well suited to be positively correlated with BOLD. Unfortunately our methodology did not allow us to test whether or not theta EROs differed from the P300 reactivity (the 7-cycle wavelet would not have fitted in the EEG window limited by gradient artifacts). However, it has been shown in different protocols, that EP frequencies are not necessarily enhanced on the EROs analysis [59]. This calls into question the kind of neuronal activity responsible for the generation of an evoked potential. It has been suggested that part of an EP could correspond to the simple phase resetting of ongoing cerebral activity [59, 60]. Should this turn out to be true, since phase resetting should not consume much energy, it could be another reason why EPs and BOLD signals are not well correlated.

Conclusion

This is the first direct demonstration of an inverse differential reactivity between EPs and the BOLD signal arising from the same brain activity. This observation has to be taken into account for the integration of the EEG and the fMRI data. In the light of this attempt, these results plead for taking into account the EEG oscillatory response, as it was shown to share the same differential reactivity as the BOLD signal. Further explorations are called for to work out the reasons underlying this phenomenon, but it can be suggested that this is because the BOLD signal and EROs are less sensitive to random jitter of the brain response, and because they rely on neural activity of another kind than EPs, involving different population of cells. Differential reactivity is an interesting paradigm to explore the relationship between the BOLD and EEG signals, since the level of noise makes direct correlation between EPs, EROs and BOLD difficult. Further studies, involving different tasks and stimuli, should also consider the localization of the electrical activity using a larger amount of electrodes.

Methods

Subjects and task

Five subjects with no prior history of neurological injury (4 right-handed, 1 left-handed, 3 females, aged 18 to 23 years) gave informed consent prior to participating in this study approved by the local ethical committee. Visual stimuli were presented to them, consisting in series of Frequent Distractors (repeating the same letter "O", 47%), Novels (7%, black and white namable pictures, never repeated), and rare Targets (always the same letter "X", 7%). The proportion of white pixels was kept in the same range for all stimuli to avoid stimulus intensity changes (~20%). Although pictures were kept as simple as possible (pictograms such as a heart, flash of lightning) some were more complex than the letters (e.g. no-smoking, danger or nuclear signs). All the stimuli were square pictures covering ~5° of the visual field, which were presented for 70 ms using the software expe6 [61]. The subjects were instructed to signal Targets by pressing a response key with the right forefinger, and to ignore other stimuli. The remaining 39% were null events (empty screen). All the stimuli were presented pseudo-randomly to maximize the contrasts [62]. Inter-stimuli intervals were homogeneously distributed from 0.4 to 3.6 s (median of 2 s) and intervals between two Targets ranged from 5.2 to 96 s [63]. Each subject performed two sessions involving this 3-stimulus paradigm.

fMRI acquisition

In order to record gradient artifact-free EEG data simultaneously with fMRI, the volumes were separated by a 1,6-s period during which gradients were not commuted [17, 18]. Single-shot gradient echo, echo planar imaging was used for the functional study, relying on the BOLD effect (Bruker 2T / 270 volumes preceded by 5 dummy scans for steady state of T1 partial saturation effect / TR = 4 s / Imaging time = 2.6 s / flip angle of 90° / TE = 43 ms / 4-mm in-plane resolution, FOV = 256 mm, gridding 642, 4-mm slice thickness / 24 slices covering the whole brain except for the cerebellum). Thirty-five rare stimuli out of 40 were presented during the no-gradient period for each condition. This asymmetry escaped all participants' notice. A RARE sequence was used for the acquisition of anatomical images including electrode positioning, thanks to accompanying MRI markers (2T / TR = 15 s / TE = 73.8 ms / 2-mm in-plane resolution, FOV = 256 mm, gridding 1282, 2-mm slice thickness / 60 slices).

fMRI processing

The volumes were realigned, normalized (voxel size after re-sampling : 2 mm isotropic) and smoothed with an 8-mm kernel under SPM99 [64]. The data were further processed through pass-band filtering (the hemodynamic response for low-pass filter, and 47-s cut-off for high-pass filter) and the voxel intensity was scaled relative to the global volume. For the purpose of the statistical analysis, the brain responses to Targets, Novels and Distractors were modeled as stick functions convolved with a hemodynamic response and its temporal derivative. The analysis was looking for regions significantly correlated with those functions in the context of the general linear model [65]. The statistics are provided in two ways: i) a fixed effect, one-tail t-test, to enable comparison between fMRI and EEG results (p ≤ 0.001 uncorrected, cluster size > 100 voxels = 800 mm3), and ii) multi-subject conjunction analysis to allow the extrapolation of the results obtained to the population [66, 67]. The results are given using a threshold p ≤ 0.05 uncorrected for each individual t-map, which is equivalent to a conjunction probability of p ≤ 3.1 10-7 (uncorrected). For all maximally activated voxel per cluster, the minimal percentage of the population that should display activity at p ≤ 0.05 is given uncorrected, using a 95% confidence interval, and assuming the test sensitivity to be 100% (formula n°3 in [67]).

EEG acquisition

The EEG was recorded using 10 shielded electrodes at the F7-Fz-F8-C3-Cz-C7-Pz-O1-O2-A2 sites according to the international 10/20 standard [68], with ground on the nasion. Data were recorded with a referential montage to a common reference, using an MRI-compatible device (Schwarzer – GE). Conductances were tested out of the magnet and remained below 5 kOhm, except for C3 in 2 sessions (data excluded from the analysis). The ECG was amplified separately (Bruker – GE). Signals were sampled at 1000 Hz with pre-amplification filters set from 0.003 to 300 Hz.

EEG processing

The electro-cardio-balisto-graphic artifact [69] (or pulse artifact) was removed by regression of the first eigenvector computed from a set of pulse artifacts taken as a reference at the beginning of the experiment (~80 of 700 ms each, the first eigenvector accounted for more than 30 % of the variance). The signal was further band filtered from 0.1 to 20 Hz using a Fourier-based method [70], and EPs were computed by averaging the signals between -325 and +625 ms, after having discarded the trials whose amplitude variation exceeded 100 μV (there remained 26 ± 4 trials out of 35, per condition and per session) [63]. EROs were computed from the same data set without prior filtering. For each trial and electrode, the signal was first filtered in the ± 2-Hz band every 2 Hz between 24 and 44 Hz, then convoluted with a complex 7-cycle Morlet wavelet [7173]. The module of the convolution product represents the power integrated over a window equivalent to 7 cycles, thus adapted to each frequency [71, 73]. The result was normalized for each frequency using the first 320-ms period before stimulus presentation, and down-sampled to 50 Hz (20-ms period).

The statistical comparison of Target- and Novel-related EPs and EROs was carried out using a randomization approach [74]. First, the effective difference between Targets and Novels was computed. Next, Target and Novel trials were put together and randomized, subsequently separated into two groups from which a new difference was computed. By repeating this procedure 2000 times, a reference distribution of the differences was established every 10 ms for EPs and every 50 ms for EROs. The effective difference was compared to this distribution for each electrode and each time point, using a threshold of 0.05 and 0.01 (one-tail). In order to take into account multiple comparisons, the probability to obtain a given time-cluster size and the total amount of significant points (ensemble level) was assessed from the same randomized set between 200 and 500 ms for each electrode (in the 24- to 44-Hz band for EROs) and using a threshold of 0.01 for report (one-tail). The same statistics were also used to perform subject-by-subject conjunction analysis on electrode Cz to verify that the results thus obtained were not due to a few subjects out of the five, and to allow extrapolation to the population.