Influence of Time-Series Normalization, Number of Nodes, Connectivity and Graph Measure Selection on Seizure-Onset Zone Localization from Intracranial EEG

van Mierlo, Pieter; Lie, Octavian; Staljanssens, Willeke; Coito, Ana; Vulliémoz, Serge

doi:10.1007/s10548-018-0646-7

Influence of Time-Series Normalization, Number of Nodes, Connectivity and Graph Measure Selection on Seizure-Onset Zone Localization from Intracranial EEG

Original Paper
Open access
Published: 26 April 2018

Volume 31, pages 753–766, (2018)
Cite this article

Download PDF

You have full access to this open access article

Brain Topography Aims and scope Submit manuscript

Influence of Time-Series Normalization, Number of Nodes, Connectivity and Graph Measure Selection on Seizure-Onset Zone Localization from Intracranial EEG

Download PDF

Pieter van Mierlo ORCID: orcid.org/0000-0003-1650-5401^1,2,4,
Octavian Lie³,
Willeke Staljanssens¹,
Ana Coito² &
…
Serge Vulliémoz^2,4

2747 Accesses
13 Citations
Explore all metrics

Abstract

We investigated the influence of processing steps in the estimation of multivariate directed functional connectivity during seizures recorded with intracranial EEG (iEEG) on seizure-onset zone (SOZ) localization. We studied the effect of (i) the number of nodes, (ii) time-series normalization, (iii) the choice of multivariate time-varying connectivity measure: Adaptive Directed Transfer Function (ADTF) or Adaptive Partial Directed Coherence (APDC) and (iv) graph theory measure: outdegree or shortest path length. First, simulations were performed to quantify the influence of the various processing steps on the accuracy to localize the SOZ. Afterwards, the SOZ was estimated from a 113-electrodes iEEG seizure recording and compared with the resection that rendered the patient seizure-free. The simulations revealed that ADTF is preferred over APDC to localize the SOZ from ictal iEEG recordings. Normalizing the time series before analysis resulted in an increase of 25–35% of correctly localized SOZ, while adding more nodes to the connectivity analysis led to a moderate decrease of 10%, when comparing 128 with 32 input nodes. The real-seizure connectivity estimates localized the SOZ inside the resection area using the ADTF coupled to outdegree or shortest path length. Our study showed that normalizing the time-series is an important pre-processing step, while adding nodes to the analysis did only marginally affect the SOZ localization. The study shows that directed multivariate Granger-based connectivity analysis is feasible with many input nodes (> 100) and that normalization of the time-series before connectivity analysis is preferred.

Effective Brain Connectivity from Intracranial EEG Recordings: Identification of Epileptogenic Zone in Human Focal Epilepsies

Transient seizure onset network for localization of epileptogenic zone: effective connectivity and graph theory-based analyses of ECoG data in temporal lobe epilepsy

Article 25 January 2019

Quantifying Functional Connectivity Network Through Synchronization and Graph Theory Approaches for Seizure Prediction

Article 03 January 2023

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

In approximately 15–25% of the epilepsy patients in the presurgical evaluation, intracranial EEG (iEEG) monitoring is necessary to obtain additional localization information about the seizure-onset zone (SOZ) and eloquent cortex (Carrette et al. 2010). Intracranial EEG is recorded with stereo-EEG or depth electrodes inserted in the brain’s parenchyma, or with strip and grids placed on the top of the cortex. Because of the limited spatial sampling of iEEG, a clear hypothesis about the EZ must be available prior to electrode implantation. In clinical practice, the identification of the SOZ from iEEG is done visually by the epileptologist. This is a time consuming, labor intensive task that requires much expertise and suffers from interpreter dependency.

Functional brain connectivity is defined as the study of temporal correlations between spatially distinct neurophysiological events (Friston et al. 1993). Functional brain connectivity measures have been shown to localize the SOZ from iEEG recordings in an objective manner (van Mierlo et al. 2014). Multivariate connectivity measures based on the concept of Granger causality (Granger 1969), have been particularly successful in the context of SOZ localization. Granger causal modeling estimates whether a time series is useful to predict another. Accordingly, a signal x₁ Granger-causes another signal x₂, if the inclusion of the past values of x₁ help to predict x₂ beyond the information present in the past values of x₂. Granger-causality measures use autoregressive (AR) models to parametrize the signal with a set of AR coefficients encoding the linear contribution of its recent past.

The two most commonly Granger-based measures used to localize the SOZ are the Directed Transfer Function (DTF) (Kaminski and Blinowska 1991) and the Partial Directed Coherence (PDC) (Baccalá and Sameshima 2001). They are multivariate estimators of the network connections between the EEG signals in the frequency domain, and thus, they require multivariate autoregressive (MVAR) models to estimate the present of each signal as a linear contribution of the past values of all signals. In 1994 and 1998, Franaszczuk et al. proposed that the SOZ can be localized in temporal lobe epilepsy patients based on visual interpretation of the propagation patterns derived from DTF analysis of ictal iEEG recordings (Franaszczuk and Bergey 1998; Franaszczuk et al. 1994). More recently, It has been shown that nodes with high outflow based on DTF calculations were highly correlated with the clinically identified foci in pediatric patients with neocortical epilepsy (Wilke et al. 2010a) and with Lennox-Gastaut syndrome (Jung et al. 2011). Furthermore, it has been shown that EZ estimation using DTF analysis mapped better the surgical resection area in patients with successful surgical outcome than those with unsuccessful outcome (Kim et al. 2010). Using the PDC, Baccalá et al. (2004) showed that correct epileptogenic focus localization was obtained by analyzing strongly connected subgraphs of scalp EEG and that the EZ can be localized from stereo-EEG in patients with epilepsy secondary to type II focal cortical dysplasia (Varotto et al. 2012).

All the above methods assumed stationary EEG signals in the analyzed window and therefore did not track the evolution of the connectivity pattern over time. Furthermore, time-varying methods allow modeling non-stationary signals such as the onset of a seizure. Two methods have been proposed for multivariate time-varying connectivity analysis: the short time DTF (SDTF) (Ding et al. 2000), where the DTF is calculated in a short time sliding window, and the adaptive (time-varying) DTF (ADTF) or PDC (APDC) (Astolfi et al. 2008; Wilke et al. 2008) based on Kalman-filter AR models. Mullen et al. (2011) used the first approach to analyze the connectivity between ICA components of ictal iEEG epochs, showing a shift in connectivity during the seizure. Wilke et al. (2010b) used the second approach to investigate the ability of the time-varying ADTF coupled to graph analysis measures to identify critical network nodes during the interictal states and compared this with the critical nodes identified with the DTF and graph analysis during ictal and resting interictal periods. They found that one graph measure, the betweenness centrality, correlated with the resected area in patients who were rendered seizure-free after surgery for all three investigated periods: ictal, resting interictal and interictal spikes. Furthermore, the gamma band was identified as the most important band to be studied because the betweenness centrality in this band correlated most with the resected area. In a previous study, we applied the ADTF coupled to outdegree to estimate the SOZ based on connectivity pattern changes/propagation in 8 patients rendered seizure-free by surgery, and found correspondence between these estimates, the visual iEEG analysis by the epileptologist, and the resected regions (van Mierlo et al. 2013). Furthermore, the obtained time-varying connectivity patterns were consistent in each patient over multiple seizures. For a more extensive overview of studies that localized the epileptogenic focus using functional connectivity we refer the readers to (van Mierlo et al. 2014).

Some important limitations apply to these previous studies. First, because of computational limitations, all previous studies using adaptive AR approaches have been applied to a limited number of iEEG channels/connectivity nodes (< 50–60), often selected based on visible involvement in the course of a seizure. Second, since the mathematical formulation of most AR models used has assumed white noise stationary processes with zero mean, normalization of iEEG time series by z-scoring has been suggested in several studies (Blinowska 2011; Kaminski and Blinowska 2014), but is not always done. The behavior of connectivity estimations using no or other normalization approaches has not been tested so far. Third, a comparison of the performance of several APDC and ADTF measures and graph theory measures used in the literature to localize the SOZ has not been carried out yet.

In this study, we assessed whether localizing the SOZ from a high number of iEEG channels is feasible. We investigated the influence of incorporating more nodes in the functional connectivity analysis. Furthermore, we studied how if we should normalize the time-series and which functional connectivity measure coupled to which graph measures are optimal to localize the SOZ. First, we performed simulations to quantify the performance of the different connectivity analyses. Second, we mapped the SOZ recorded in a high number of iEEG channels in an epilepsy patient and compared the obtained SOZ localization to the resection that rendered the patient seizure-free.

Methods

In this section, we describe how we simulated seizures recorded with intracranial EEG electrodes. Later, we introduce the pipeline used to calculate the time-varying connectivity pattern and how we can localize the SOZ using graph theory measures. Afterwards, we explain how we have quantified the performance of the different connectivity measures coupled with the different graph measures. Finally, we describe how we have analyzed a 113-channel seizure recorded in a patient.

Simulations

We simulated a seizure as recorded with 128 channels iEEG. The simulated iEEG is 5 s long, 2 s baseline followed by 3 s seizure. After the baseline duration, the seizure starts at a randomly chosen channel, the SOZ channel: chan_SOZ. The seizure spreads to 31 other randomly chosen channels with the following parameters: maximal spreading from one channel to three other channels, random onset delay, i.e. the time a node gets involved in the seizure, between 1 and 250 ms and sample delay, i.e. the number of samples the signal of the sending node is delayed with respect to the receiving node, randomly chosen between 1 and 5 samples. In total, 32 channels will show seizure activity. Figure 1 shows a possible seizure network and the ictal iEEG signals. The baseline activity was modeled by 1/f noise and the seizure activity as a time-varying sinusoid with frequency equal to 12 Hz at the start of the seizure (at 0 s) and equal to 8 Hz at the seizure end (at 3 s) plus its first harmonic (van Mierlo et al. 2011). This way we want to mimic the rhythmic phase of the seizure, i.e. the period with periodic ictal spiking. The simulations were inspired by the oscillatory patterns seen in a real seizure originating from the temporal lobe (van Mierlo et al. 2011). The signal-to-noise ratio (SNR) of the seizure activity compared to the baseline activity was set to − 5, 0, 5 and 10 dB. The amplitude of the different channels was chosen randomly between 25 and 100 mV to mimic amplitude changes observed in the iEEG. An overview of the parameters used to simulate the iEEG seizures is shown in Table 1.

Table 1 Simulation parameters

Full size table

From iEEG to SOZ Localization

The complete pipeline to localize the SOZ from the iEEG recordings is introduced below. It comprises three subparts: pre-processing, connectivity calculation and graph analysis. Below we give a detailed overview of all subparts.

Pre-processing

The pre-processing consisted of two steps: a channel selection step and a time series normalization step. In the channel selection step, 32, 64, 96 or all 128 channels were chosen to investigate the effect of adding more nodes to the functional connectivity analysis. The 32 ictal channels were included in all channel selections. In the normalization step, the different options were: no normalization, z-scoring of the complete 5 s time series, sliding window z-scoring with a window length equal to 1 s and baseline z-scoring, where only the first 2 s are used to calculate the mean and standard deviation to perform z-scoring. Due to the preprocessing, each simulation resulted in 16 different datasets that were processed further as indicated below.

Multivariate Time-Varying Connectivity Analysis

In this section, we introduce the different time-varying functional connectivity measures that are used to estimate the dynamic information flow between the considered signals. All measures used in this study are based on the concept of Granger causality. A common method to study Granger causality is to use autoregressive models in which the influence of the past of the signals on the present is estimated. The time-varying bivariate autoregressive (TVAR) model of signal x₁ and x₂ can be described as:

$${x_1}\left( t \right)=\mathop \sum \limits_{{m=1}}^{p} {a_{11,m}}\left( t \right){x_1}\left( {t - m} \right)+\mathop \sum \limits_{{m=1}}^{p} {a_{12,m}}\left( t \right){x_2}\left( {t - m} \right)+~{e_1}\left( t \right)~~$$

(1)

$${x_2}\left( t \right)=\mathop \sum \limits_{{m=1}}^{p} {a_{21,m}}\left( t \right){x_1}\left( {t - m} \right)+\mathop \sum \limits_{{m=1}}^{p} {a_{22,m}}\left( t \right){x_2}\left( {t - m} \right)+~{e_2}\left( t \right)$$

(2)

where a₁₂, a₁₂, a₂₁ and a₂₂ are the TVAR coefficients and e₁ and e₂ the residuals of signals x₁ and x₂.

The bivariate case can be extended to the multivariate case with K channels as follows:

$${\varvec{X}}\left( t \right)=\mathop \sum \limits_{{m=1}}^{p} {{\varvec{A}}_m}\left( t \right){\varvec{X}}(t - m)+{\varvec{E}}(t)$$

(3)

where X(t) is the iEEG matrix, A_m(t) are the model coefficients matrix for delay m and E(t) is the residual matrix. The coefficients of the TVAR model were estimated using a Kalman filter with update coefficient equal to 10⁻³ and Kalman smoothing term equal to 100 (van Mierlo et al. 2011). To investigate the information flow between the nodes in the spectral domain, we apply the Fourier transform to the coefficient matrices at each time point t as follows:

$${\varvec{A}}\left( {f,t} \right)={{\varvec{I}}_{\varvec{K}}} - \mathop \sum \limits_{{m=1}}^{p} {{\varvec{A}}_m}\left( t \right){\text{exp}}\left( { - i2\pi \frac{f}{{{f_s}}}m} \right)$$

(4)

where I_K is the K times K identity matrix. From the Fourier transform of the coefficient matrices A(f,t) the transfer matrix H(f,t) is calculated:

$${\varvec{H}}\left( {f,t} \right)={\varvec{A}}{(f,t)^{ - 1}}$$

(5)

The element A_ij(f,t) and H_ij(f,t) depict the information flow from node j to node i at frequency f and time t. In H(f,t) the cascade flow is modeled, while in A(f,t) only the direct flows are modeled. This means that if there is a connection from node 1 to node 2 and from node 2 to node 3 at frequency f and time t, elements A₂₁(f,t) and A₃₂(f,t) will have a high value, while H₂₁(f,t) and H₃₁(f,t) will be high.

Different normalizations of the A(f,t) and H(f,t) matrices are performed to scale the connectivity values between 0 (no connection) and 1 (high connection). Because in the next step, we will examine the outflow of the nodes, we use the Adaptive Partial Directed Coherence (APDC) and the Adaptive Directed Transfer Function (ADTF) normalized with respect to the inflow at each time t and frequency f as suggested by (Coito et al. 2015; Plomp et al. 2015):

$$APD{C_{ij}}\left( {f,t} \right)=\frac{{{{\left| {{{\varvec{A}}_{ij}}(f,t)} \right|}^2}}}{{\mathop \sum \nolimits_{{k=1}}^{K} {{\left| {{{\varvec{A}}_{ik}}(f,t)} \right|}^2}}}$$

(6)

$$ADT{F_{ij}}\left( {f,t} \right)=\frac{{{{\left| {{{\varvec{H}}_{ij}}(f,t)} \right|}^2}}}{{\mathop \sum \nolimits_{{k=1}}^{K} {{\left| {{{\varvec{H}}_{ik}}(f,t)} \right|}^2}}}$$

(7)

The following normalization holds for both ADTF and APDC:

$$\mathop \sum \limits_{{j=1}}^{K} APD{C_{ij}}\left( {f,t} \right)=1$$

(8)

The ADTF and APDC can be integrated in the frequency band of interest to estimate the connections over time in the considered frequency band, which results in the integrated APDC and ADTF (iAPDC and iADTF) and the full-frequency APDC and ADTF (ffAPDC and ffADTF):

$$iAPD{C_{ij}}\left( t \right)=\frac{1}{{{f_2} - {f_1}}}\mathop \sum \limits_{{f={f_1}}}^{{{f_2}}} \frac{{{{\left| {{{\varvec{A}}_{ij}}\left( {f,t} \right)} \right|}^2}}}{{\mathop \sum \nolimits_{{k=1}}^{K} {{\left| {{{\varvec{A}}_{ik}}\left( {f,t} \right)} \right|}^2}}}$$

(9)

$$iADT{F_{ij}}\left( t \right)=\frac{1}{{{f_2} - {f_1}}}\mathop \sum \limits_{{f={f_1}}}^{{{f_2}}} \frac{{{{\left| {{{\varvec{H}}_{ij}}(f,t)} \right|}^2}}}{{\mathop \sum \nolimits_{{k=1}}^{K} {{\left| {{{\varvec{H}}_{ik}}(f,t)} \right|}^2}}}$$

(10)

$$ffAPD{C_{ij}}\left( t \right)=\mathop \sum \limits_{{f={f_1}}}^{{{f_2}}} \frac{{{{\left| {{{\varvec{A}}_{ij}}(f,t)} \right|}^2}}}{{\mathop \sum \nolimits_{{f'={f_1}}}^{{{f_2}}} \mathop \sum \nolimits_{{k=1}}^{K} {{\left| {{{\varvec{A}}_{ik}}(f',t)} \right|}^2}}}$$

(11)

$$ffADT{F_{ij}}\left( t \right)=\mathop \sum \limits_{{f={f_1}}}^{{{f_2}}} \frac{{{{\left| {{{\varvec{H}}_{ij}}(f,t)} \right|}^2}}}{{\mathop \sum \nolimits_{{f'={f_1}}}^{{{f_2}}} \mathop \sum \nolimits_{{k=1}}^{K} {{\left| {{{\varvec{H}}_{ik}}(f',t)} \right|}^2}}}$$

(12)

The iAPDC, iADTF, ffAPDC and ffADTF are normalized with respect to incoming information flow at each time point t. This means that the following normalization holds for all measures:

$$\mathop \sum \limits_{{j=1}}^{K} iAPD{C_{ij}}(t)=1$$

(13)

The ADTF measures are able to reveal cascade connections, while the APDC measures reveal the direct connections. The ADTF can be used to identify the origin of information flow and the APDC to identify the direct connections.

Graph Measures to Localize the SOZ

Two graph measures are calculated for each time-varying connectivity measure to localize the SOZ: the outdegree and the shortest path length. The outdegree calculates the sum of the outflow from one node to all the other nodes, while the shortest path length defines the shortest paths from one node to another.

The global outdegree was defined as the out-degree during the seizure for each node:

$${\omega _j}=\mathop \sum \limits_{{t={t_1}}}^{{{t_2}}} \mathop \sum \limits_{{k=1}}^{K} {C_{kj}}(t)$$

(14)

here, t₁ is the seizure onset and t₂ is the seizure end, K is the number of channels and C_kj is the connection from node j to node k calculated with the above described time-varying connectivity measures, meaning that C_kj is equal to iADTF_kj, ffADTF_kj, iAPDC_kj or ffAPDC_kj. The node with maximal global outdegree is chosen as SOZ, because this node has maximum outflow during the seizure in the estimated time-varying network.

The second considered graph measure is the global shortest path length that is defined as the sum of shortest path lengths from one node to all other nodes:

$${\varphi _j}=\mathop \sum \limits_{{t={t_1}}}^{{{t_2}}} \mathop \sum \limits_{{k=1}}^{K} {\sigma _{kj}}(t)$$

(15)

where, σ_kj is the shortest path length from node j to node k in the graph where the edge weights between the nodes were set to 1/C_kj. This means that when there is a high connection from j to k, the edge weight will be low and therefore the shortest path will be low. The node with the minimal global shortest path was identified as the SOZ, because it has the overall shortest path to all other nodes.

Evaluation of the Simulations

We use two evaluation measures to evaluate the importance of the processing steps, namely the area under the curve (AUC) and the number of correctly localized SOZ.

AUC Analysis

For each simulated high dimensional intracranial EEG of a seizure, 256 time-varying connectivity matrices were calculated. One for each combination of the following factors:

Channel selection: 32, 64, 96 or 128
SNR: − 5, 0, 5 or 10 dB
Normalization: none, z-score, sliding z-score or baseline z-score
Connectivity measures: iADTF, ffADTF, iAPDC or ffAPDC

The order of the TVAR model to calculate the connectivity measures was set to 5, the update coefficient to 0.001 and the Kalman smoothing term to 100 (van Mierlo et al. 2011). The frequency band of interest was set to 3–30 Hz.

Each estimated time-varying connectivity matrix was compared to the true time-varying connectivity matrix of the simulation. This was done by calculating the true positives (TP), false positives (FP), true negatives (TN) and false negatives (FN) when comparing the thresholded estimated time-varying connectivity matrix with the ground truth. For the APDC measures the direct edges of the simulation were considered as ground truth, while for the ADTF the cascade connections were considered as ground truth. We let the threshold range from 0 to 1 in steps of 0.01 to calculate the TP, FP, TN and FN. Afterwards we computed the sensitivity and the precision, also known as the positive predictive value.

From the sensitivity and precision values we derived the AUC. Afterwards, we performed a univariate ANOVA with AUC as dependent variable and four factors: the SNR, connectivity measure, normalization and channel selection. Bonferroni correction was applied to correct for multiple comparisons.

SOZ Localization

Based on each time-varying connectivity measure we localized the SOZ based on the outdegree and shortest path. For the 100 simulations we assessed the percentage of correctly identified seizure onset zones. We compared the identified SOZ based on the outdegree and on the shortest path length with the simulated SOZ (chan_SOZ).

High Dimensional Intracranial Recordings in Patients

We investigated an ictal epoch recorded with 113 electrodes in the University of Texas Health Center at San Antonio. The patient had a MRI-positive focal cortical dysplasia type II b in the right perisylvian areas. The implantation scheme of the electrodes is shown in Fig. 2. An 8 × 8-contact inferior frontal grid (IFG, interelectrode distance: 5 mm) overlaid the lesion. An additional 2 × 5 superior frontal grid (interelectrode distance: 1 cm) and multiple strips were placed around IFG and interhemispherically. A 2-contact recording reference (G) was placed over the anterior mesial portion of the right superior frontal gyrus, remote from areas of high cortical irritability. The iEEG was sampled at 500 Hz. Visual analysis of the iEEG showed that the lesion was causing the epilepsy. Removal of the lesion rendered the patient seizure free and the patient has been seizure free since 3 years. The intracranial EEG signals of the analyzed seizure and their spectrogram can be found in the supplementary material.

Evaluation of the Patient Data

We performed the proposed methods to localize the SOZ on the high dimensional iEEG epoch that consisted out of 5 s pre-ictal and 30 s ictal activity identified by visual analysis of the epileptologist (OL). We analyzed the full 113-channel iEEG and two subsets: one containing all 64 grid electrodes and one containing 25 electrodes closest to the visually identified SOZ. This was done to investigate the variation of SOZ localization using the proposed method when nodes are added to the connectivity analysis. We performed the four types of normalization and downsampled the time series to 250 Hz. Afterwards we calculated the iADTF, ffADTF, iAPDC, ffAPDC in combination with the outdegree and the shortest path length to localize the SOZ. The model order of the TVAR model was set to 8, update coefficient to 0.001 and kalman smoothing parameter to 100 (van Mierlo et al. 2011). The frequency band of interest was 2–30 Hz since the harmonic seizure frequency lied within this band. We compared the localized SOZ with the surgical resection that rendered the patient seizure free and investigated the influence of the normalization, the number of nodes and the used connectivity measures in combination with the graph measure.