Introduction

The worldwide trend of demographic aging highlights the progress made in healthcare over the last century [1]. Nonetheless, this aging population encounters diverse health issues, notably Alzheimer’s Disease (AD), pronounced in those aged 65 or older [2]. AD, the primary form of dementia (contributing 60–70% of cases) affects 55 million worldwide, with projections estimating over 150 million affected individuals by 2050 [3].

Mild Cognitive Impairment (MCI) is a syndrome marked by cognitive decline that exceeds what is typical for a person’s age and educational level, yet doesn’t significantly disrupt their daily life activities [4]. Occurring between normal cognitive function and dementia, it exhibits an annual progression rate to dementia ranging from 8 to 15% and is estimated to affect 15–20% of persons aged 60 and above [5]. The increasing prevalence of MCI presents a considerable health and economic challenge worldwide, thus the urgent necessity for its detection [6, 7]. Timely identification facilitates effective disease management, enabling prompt therapeutic intervention and the implementation of preventive healthcare measures. Such an approach can halt disease progression and alleviate the emotional and financial strains faced by patients and caregivers [8].

One method to understand neural activity and cognitive performance is the use of electroencephalography (EEG). It’s a crucial tool for exploring indicators of cognitive impairments [9, 10], primarily by analyzing event-related potentials (ERPs) - the brain’s responses to sensory stimuli or tasks involving motor and cognitive functions [11]. This allows researchers to observe the sequential unfolding of cognitive processes before sensory information reaches the peripheral nervous system, continuing after the execution of a behavioral response [12].

Various ERP components serve as indicators for sensory, attentional, and cognitive processes. One such component, the P200, occurring around 200 ms after stimulus onset, reflects exogenous sensory attention or the sensation-seeking behavior of an individual [13, 14]. Studies have shown delayed P200 latency in patients with AD [15, 16], indicating disruptions in sensory attention processes in these individuals. Conversely, the P300 component is associated with attention allocation, and engagement of working memory, and reflects novel information processing and revision of memory representations in the central nervous system (CNS) [17, 18]. Discrepancies in these ERP components, observed during basic stimulus discrimination tasks, offer valuable insights into individual differences in sensory attention capabilities [15], cognitive processing efficiency, and speed [18]. This makes them essential tools for cognitive assessment, enabling the identification and tracking of the onset and progression of neurodegenerative diseases [19].

An essential part of the conventional ERP methodology includes averaging numerous segments of EEG signals for repeated stimulus events. This presupposes that the fundamental stimulus or response-locked signal remains consistent across trials, representing an unchanging psychological process [20, 21]. Nonetheless, it is highly improbable for any psychological process of interest to be evoked in a precisely identical manner across trials due to the differences in stimulus or response properties, the effects of learning or habituation, fatigue during extended experimental tasks, or random variations in engagement. Furthermore, the process of averaging ERPs can mask important complementary functional information, such as variations in information processing observed in single-trial ERPs which could be crucial for understanding the nuances of cognitive functioning [21]. Therefore, analyzing ERP variability becomes indispensable in understanding the fluctuations within psychophysiological processes of interest.

Researchers have explored intra-individual variability through brain imaging, reaction time (RT), sensorimotor, and cognitive performances [22,23,24]. By analyzing fluctuations within a subject across successive trials [26], the stability of task performance can be assessed, where lower variability suggests superior performance and higher variability indicates poorer performance [22,23,24]. This variability is thought to reflect relatively consistent endogenous factors, such as the integrity of the CNS [27], with its increase connected to processes mediated by the frontal cortex, such as attentional lapses [28] and variations in executive control [29].

One method for analyzing variability is the response variance curve (RVC) [30]. It quantifies variance from individual trial ERPs within the average ERP, assessing data point variability within a specific timeframe. This approach addresses the challenges of analyzing single-trial variability (as used by [31]), especially when a poor signal-to-noise ratio complicates individual trial analysis [30]. Moreover, it would be appropriate in conditions that diminish neural plasticity like in late-stage MCI [32], which could contribute to a low signal-to-noise ratio [33].

This method has been utilized in distinguishing individuals with Attention Deficit and Hyperactivity Disorder (ADHD) [21] and Schizophrenia [34, 35] from those without these conditions. This distinction is based on the belief that ADHD and Schizophrenia are characterized by attention deficits linked to the variability in the CNS. To date, scanty studies have investigated ERP variability in AD/MCI and cognitively healthy elderly. Some, like [36] have used the trial-to-trial variability approach while others such as [37] have attempted to investigate variability based on the test-retest reliability of the ERPs. Patterson et al. [36] analyzed latency variability in auditory ERPs (N1, P2, N2, and P3 components) revealing that people with dementia had extended P3 latencies and increased P3 variability compared to healthy individuals.

While exploring the trial-to-trial variability, we also seek to understand if these variabilities are asymmetrical in the two prefrontal hemispheres. Right–left asymmetry of the human brain is a fundamental characteristic, albeit complex and influenced by numerous confounding factors [39], though the extent to which it varies with these factors like age, sex, handedness, brain size, and heredity, is still questionable [40]. Due to increased activity in the right prefrontal lobe and withdrawal reactions to unpleasant stimuli, frontal asymmetry is an indicator of depression [41, 42].

Measures of EEG-based asymmetries, such as Frontal Alpha Asymmetry (FAA) are linked to depression and mood studies [43], while other research endeavors have delved into auditory ERP asymmetries in sound perception [44]. Given depression’s prevalence in MCI patients [45], investigating trial-to-trial variability asymmetry could prove a potential predictive feature for MCI. Recent studies, such as [46], showcase the viability of modern low-density channel EEG devices in primary care and outpatient contexts due to their portability, cost-efficiency, and accessibility. Our prior investigations [47,48,49,50], utilizing a portable 2-channel EEG device positioned at Fp1 and Fp2 according to the 10–20 setup, consistently underscored the promising potential of portable EEG technology in the realm of AD/MCI diagnosis and assessment.

This study aimed to assess the practicality of utilizing trial-to-trial variability and its asymmetry from a portable EEG system in the detection of MCI, focusing on both non-task and task-relevant neural responses in the P200 and P300 ERP components. We hypothesized that increased ERP variability is indicative of MCI-related fundamental neurological destabilization and deficits which result in unstable cognitive processing and decreased task performance.

Primarily, this study seeks to examine: (1) if measures of ERP variability in responses to target and non-target stimuli can distinguish between elderly cognitively healthy and patients with MCI; (2) if ERP variability asymmetry can distinguish the two groups: and (3) the relationships between demographic and neuropsychological measures, ERP variability and its asymmetry measures, and the prevalence of MCI.

To the best of our knowledge, this is the only study that has investigated trial-to-trial ERP variability and its asymmetry between the MCI and cognitively healthy elderly in a relatively larger sample size using a portable EEG device.

Materials and methods

Participants

This study comprised 1,616 participants enlisted from October 2018 to December 2021 at the Gwangju Alzheimer’s Disease and Related Dementias (GARD) center in Gwangju City, South Korea. For analysis, 348 participants were excluded for various reasons: not falling into the CN or MCI categories [n = 85], had no trials with amplitudes within ± 100 µV in either target or non-target responses in any of the two channels [n = 14], contained less than 50% of clean trials [n = 64] or contained extreme variability values [n = 61], having incomplete neuropsychological information [n = 19], and lacking discernable peaks in the RVC curves [n = 105] (See Figure S1 in the supplementary material).

The study involved aged-matched participants categorized into two groups CN and MCI. The categorization was performed following the methodology outlined in [51], which stated, “A comprehensive clinical interview was conducted for all participants, assessing their Clinical Dementia Rating (CDR). CN participants demonstrated a CDR score of 0, indicating typical cognitive function, and exhibited no signs of brain atrophy, white matter changes, lacunae, infarction, or other focal brain lesions in their magnetic resonance imaging (MRI) scans. In contrast, participants diagnosed with MCI met the criteria outlined by [52] and showed a CDR score of 0.5. Their performance on neuropsychological tests fell below − 1.5 standard deviations on at least one of the five domain tests, adjusted for age, education, and sex-specific norms”.

For the final analysis, the CN group consisted of 878 participants, comprising 386 men and 492 women, while the MCI group comprised 390 participants, including 194 men and 196 women.

Neuropsychological battery

The cognitive abilities of participants were assessed using the latest edition of the SNSB (SNSB II) [53, 54]. It encompasses five cognitive domains: attention, language, memory, visuospatial skills, and frontal/executive functions. Widely recognized in South Korea, this neuropsychological screening battery is a standard tool frequently utilized to evaluate cognitive function in individuals with MCI or dementia. Furthermore, the primary screening tool for cognitive assessment was the Korean version of the Mini-Mental State Examination (K-MMSE).

ERP recording

The NeuroNicle FX2 (LAXTHA, Daejeon, South Korea) was utilized to record EEG. Based on the 10–20 system, two monopolar scalp electrodes were placed at Fp1 and Fp2, utilizing the right earlobe as the reference. A bandstop filter ranging from 55 to 65 Hz was utilized, and the contact impedances for all EEG electrodes were kept below 10 kΩ. With a 250 Hz sampling frequency and a 15-bit resolution, data were digitized in continuous recording mode.

During recording, participants were instructed to sit comfortably with their eyes closed, while qualified operators monitored both participant sleepiness and EEG traces to minimize artifacts from muscle and eye movements. ERPs were elicited using an active auditory oddball task, where 64 rare random target stimuli at 2,000 Hz (1/5 ratio) and 256 standard auditory stimuli at 750 Hz (4/5 ratio) were presented [47, 48, 55]. The EEG signals were acquired in a sequence of conditions of 5 min of resting state, 8 min of sensory-evoked potentials, and 5 min of selective attention tasks. More on the EEG acquisition procedure is described in our previous study [47].

In this study, the focus was solely on selective attentional ERPs. Before the experiment, participants were evaluated for auditory hearing capabilities for the rare (2,000 Hz) and standard (750 Hz) tones. Moreover, their ability to discern between the tones was assessed using earphones with a volume level fixed at 70 dB. The participants were further told to press a response key upon identifying a rare tone. To maintain a controlled environment during data collection, all the experiments were conducted in a quiet room with standardized illumination.

EEG data processing

Tailored Python scripts (version 3.8.16) were used to process the data. The EEG data underwent the following preprocessing procedures to extract the artifact-free trials; data segmentation [− 200, 800] ms (with 0 ms as the stimulus onset), baseline correction [− 200, 0] ms, artifact removal [± 100 µV as the threshold], and the final ERP traces were subjected to a nine-order moving average filter. These procedures were applied specifically to correct trials (standard stimuli that received no response and target stimuli that were accurately identified).

To consolidate the data, extracted trial-to-trial variability measures from the prefrontal channels Fp1 and Fp2 were averaged, creating a unified dataset for subsequent analysis. Furthermore, participants who didn’t meet the minimum threshold of 50% of clean trials in the standard and odd neural responses in the two channels were not considered for further analysis.

Response variance curve (RVC) measures

First, we derived the RVC [30, 56] (See Fig. 1) and obtained the ERP variability measures of the P200 and P300 ERP components in two respective ERP windows of 150–300 ms and 300–600 ms after stimulus onset for both the target and non-target neural responses.

Fig. 1
figure 1

The ERP and RVC derivation process

The RVC measures of variabilities in the Peak Amplitude (AMPV), Latency (LATV), 50% Fractional Area Latency (FALV), and the Area Under the Curve (AUCV) were extracted.

RVC asymmetry measures

From the variability measures extracted above, we derived corresponding normalized asymmetry measures for each of the RVC measures of AMPV, AUCV, LATV, and FALV based on (Fp2-Fp1)/(FP2 + Fp1). We divided by the sum to minimize the consequences for the degree of negativity of asymmetry in both groups [57]. We thus, obtained asymmetry measures of AMPV_Asym, AUCV_Asym, LATV_Asym, and FALV_Asym (See Table S1 in supplementary for the asymmetry formula and detailed description).

Statistical analysis

The statistical analyses were performed using R Studio (version 2022.07.2 + 576), running on R (version 4.1.3) for Windows. The analysis included the use of packages such as gtsummary (version 1.6.1), ggplot2 (version 3.4.0), and corrplot (version 0.92) [58,59,60,61]. The significance level of α for all tests was set at 0.05. Independent sample t-tests were conducted using Student’s t-test and Welch’s t-test for continuous variables, while chi-squared tests were employed for categorical variables. Univariate and multiple logistic regression analyses were conducted to compute the odds ratios associated with MCI for each ERP variability and asymmetry measure. Covariates such as age, sex, and education level were controlled for in the analyses. The MMSE score was included as an additional covariate to evaluate the independent relationship between ERP variability, the corresponding asymmetry measures, and MCI.

Results

Participant characteristics

Table 1, consists of demographic and neuropsychological measures of the participants under consideration for analysis. The study included 390 patients with MCI and 878 CN. The patients with MCI had 50% women and 50% men while the CN group had 56% women and 44% men. Patients with MCI were older than CN individuals, with mean age ± standard deviation of 73.82 ± 6.54 and 72.06 ± 6.36 years (p < 0.001) respectively. Expectedly, patients with MCI had lower MMSE scores than CN, with 26.31 ± 2.68 and 27.68 ± 1.90 scores (p < 0.001) respectively. Furthermore, patients with MCI had lower scores in all the SNSB II domains; attention [8.54 ± 1.94], language [− 0.09 ± 0.47], visuospatial [0.25 ± 1.89], memory [− 0.44 ± 0.63], and frontal [− 0.24 ± 0.70] compared to CN individuals [9.71 ± 2.19, 0.21 ± 0.28, 0.54 ± 0.37, 0.30 ± 0.58, and 0.23 ± 0.57] (p < 0.001) respectively. No statistically significant differences were observed in the sex and years of education between the two groups.

Table 1 Demographic characteristics and neuropsychological test domain scores

P2 measures

Standard responses

Patients with MCI exhibited significantly higher variability in amplitudes and AUC values in the standard responses. Specifically, larger measures of AMPV [t = − 3.14, p = 0.002] and AUCV [t = − 3.08, p = 0.002] were observed in the MCI when compared to CN individuals. Nonetheless, no significant differences were observed in the latency variability measures for LATV and FALV among participants in either group (Fig. 2(A) and supplementary Table S2).

Fig. 2
figure 2

Boxplots showing group differences in the trial-to-trial variability measures based on t-tests: (A) P200 non-target (standard) neural responses; (B) P200 target(oddball) neural responses; (C) P300 non-target (standard) neural responses; (D) P300 target (oddball) neural responses; The dotted black horizontal line is the overall mean and the red triangles are the group means; Significance levels are denoted as follows: *** for p < 0.001, ** for p < 0.01, * for p ≤ 0.05, and ns for not significant; Detailed statistical values can be found in Table S2 of the supplementary material

Target responses

As observed in the standard neural responses, patients with MCI exhibited significantly higher variability in amplitudes and AUC in the target neural responses i.e., AMPV [t = − 2.60, p = 0.009], and AUCV [t = − 2.61, p = 0.009] respectively, compared to CN individuals. They also had a higher LATV [t = − 2.35, p = 0.019]. Nonetheless, no significant differences were observed in the latency variability measure of FALV among participants in either group (Fig. 2(B)).

P3 measures

Standard responses

Like the P2 measures, patients with MCI exhibited significantly higher variability in amplitudes and AUC values for the non-target neural responses. That is, larger AMPV [t = − 2.68, p = 0.008] and AUCV [t = − 2.60, p = 0.009] were observed when compared to CN individuals. Nonetheless, no significant differences were observed in the latency variability measures for LATV and FALV among participants in either group (Fig. 2(C)).

Target responses

In the target neural responses, patients with MCI exhibited significantly higher variability in amplitudes and AUC values; AMPV [t = − 2.55, p = 0.011] and AUCV [t = − 2.36, p = 0.018], when compared to CN individuals. Nonetheless, no significant differences were observed in the latency variability measures for LATV and FALV among participants in either group (Fig. 2(D)).

P3 asymmetry measures

Patients with MCI exhibited significantly higher asymmetry in the variability of amplitudes and AUC values in the target responses. Specifically, larger measures of AMPV_Asym [t = − 2.32, p = 0.021] and AUCV_Asym [t = − 2.24, p = 0.025] were observed in the MCI when compared to CN individuals. Nonetheless, no notable differences were observed in the latency asymmetry variability measures of FALV_Asym and LATV_Asym among participants in either group (Fig. 3(B)).

Fig. 3
figure 3

Boxplots showing group differences based on t-tests in the asymmetry measures of trial-to-trial variability : (A) P300 non-target neural responses; (B) P300 target neural responses; Significance levels are denoted as follows: *** for p < 0.001, ** for p < 0.01, * for p ≤ 0.05, and ns for not significant; Detailed statistical values can be found in Table S2 of the supplementary material

Logistic regression models

P2 measures

Standard responses

In the crude or unadjusted model, specific variables showed odds ratios and corresponding 95% confidence intervals that were significantly different from 1, suggesting a possible connection with the risk of MCI. Specifically, the AMPV [OR = 1.22, p = 0.001] and AUCV [OR = 1.21, p = 0.001] demonstrated this potential association. However, measures of latency variability, LATV, and FALV did not demonstrate any significant connection with the risk of MCI.

Upon consideration of demographic characteristics such as sex, age, and education level, AMPV [OR = 1.22, p < 0.001] and AUCV [OR = 1.23, p < 0.001] persisted as MCI predictors, thus underscoring their independence from demographic characteristics in predicting MCI.

On further refinement by incorporating MMSE scores, AMPV [OR = 1.20, p = 0.004] and AUCV [OR = 1.20, p = 0.005] continued to be predictors for MCI. Specifically, a 1 µV2 increase in AMPV and a 1 µV2ms increase in AUCV augmented the risk of MCI by 20%. These findings affirmed the genuine independence of these variability measures of the P200 ERP component from both demographic and MMSE measures as reliable MCI predictors (Fig. 4(A)).

Fig. 4
figure 4

The odd ratios (OR) for the Logitistic regression (LR) models for the various trial-to-trial variability measures: (A) P200 non-target neural responses; (B) P200 target neural responses; (C) P300 non-target neural responses; (D) P3 target neural responses; Model 1 is unadjusted; Model 2 was adjusted for demographic characteristics and Model 3 was adjusted with MMSE score included; Significance levels are denoted as follows: *** for p < 0.001, ** for p < 0.01, * for p ≤ 0.05, and blank (no stars) for not significant; Actual p-values, Confidence Intervals (CI) and OR can be found in Table S3 of the supplementary material

Target responses

In the crude model, specific variables showed odds ratios and corresponding 95% confidence intervals that were significantly different from 1, suggesting a possible connection with the risk of MCI. Specifically, the AMPV [OR = 1.17, p = 0.010], LATV [OR = 1.15, p = 0.019], and AUCV [OR = 1.17, p = 0.010] demonstrated this potential connection. Upon incorporating demographic characteristics such as sex, age, and education level, the AMPV [OR = 1.19, p = 0.005], LATV [OR = 1.16, p = 0.019], and AUCV [OR = 1.19, p = 0.004] remained MCI predictors. On further refinement by incorporating MMSE scores, AMPV [OR = 1.18, p = 0.011], LATV [OR = 1.15, p = 0.033], and AUCV [OR = 1.18, p = 0.012] persisted as MCI predictors. A 1 µV2 increase in AMPV and a 1 µV2ms increase in AUCV augmented the risk of MCI by 18%, while a 1 ms increase in LATV augmented the risk of MCI by 15% (Fig. 4(B)).

P3 measures

Standard responses

In the crude model, specific variables showed odds ratios and corresponding 95% confidence intervals that were significantly different from 1, suggesting a possible connection with the risk of MCI. Specifically, AMPV [OR = 1.18, p = 0.006] and AUCV [OR = 1.17, p = 0.007] demonstrated this potential association. However, latency variability measures of LATV and FALV had no significant connection with the risk of MCI.

Upon accounting for demographic factors such as sex, age, and education level, AMPV [OR = 1.18, p = 0.007] and AUCV [OR = 1.18, p = 0.007] persisted as MCI predictors, thus underscoring their independence from demographic characteristics in MCI prediction.

On further refinement by incorporating MMSE scores, AMPV [OR = 1.18, p = 0.011] and AUCV [OR = 1.16, p = 0.018] persisted as MCI predictors. These findings affirmed the genuine independence of these variability measures of the standard responses of the P300 ERP component from both demographic and MMSE measures as reliable MCI predictors (Fig. 4(C)).

Target responses

In the crude model, specific variables showed odds ratios and corresponding 95% confidence intervals that were significantly different from 1 suggesting a possible connection with the risk of MCI. Specifically, AMPV [OR = 1.16, p = 0.012], and AUCV [OR = 1.15, p = 0.020] demonstrated this connection. However, latency variability measures of LATV and FALV did not demonstrate any significant association with the risk of MCI. Upon consideration of demographic characteristics such as sex, age, and education level, AMPV [OR = 1.17, p = 0.008], and AUCV [OR = 1.17, p = 0.011] persisted as MCI predictors, thus underscoring their independence from demographic characteristics in MCI prediction.

Further adjustment by incorporating MMSE scores, AMPV [OR = 1.18, p = 0.011], and AUCV [OR = 1.16, p = 0.018] persisted as MCI predictors. These findings affirmed the genuine independence of these variability measures of the P300 ERP component from both demographic and MMSE measures as reliable MCI predictors (Fig. 4(D)).

P3 asymmetry measures

In the crude model, specific variables showed odds ratios and corresponding 95% confidence intervals that were significantly different from 1, suggesting a possible connection with the risk of MCI. Specifically, AMPV_Asym [OR = 1.15, p = 0.021], and AUCV_Asym [OR = 1.15, p = 0.025] demonstrated this connection. However, LATV_Asym and FALV_Asym did not demonstrate any significant association with the risk of MCI.

Upon incorporating demographic factors such as sex, age, and education level, AMPV_Asym [OR = 1.15, p = 0.021], and AUCV_Asym [OR = 1.15, p = 0.024] persisted as MCI predictors, thus underscoring their independence from demographic characteristics in MCI prediction.

Further adjustment by incorporating MMSE scores revealed that AMPV_Asym [OR = 1.16, p = 0.020], and AUCV_Asym [OR = 1.16, p = 0.024] persisted as predictors for MCI. These findings affirmed the genuine independence of AMPV_Asym and AUCV_Asym from demographic and MMSE measures as reliable MCI predictors (Fig. 5(B)).

Fig. 5
figure 5

The OR for the LR models for the various variability asymmetry measures: (A) P300 non-target neural responses; (B) P300 target neural responses; Model 1 is unadjusted; Model 2 was adjusted for demographic characteristics and Model 3 was adjusted with MMSE score included; Significance levels are denoted as follows: *** for p < 0.001, ** for p < 0.01, * for p ≤ 0.05, and blank (no stars) for not significant; Actual p-values, CI and OR can be found in Table S3 of the supplementary materials

Discussion

This study aimed to assess the potential of utilizing ERP variability and related asymmetry measures from a portable EEG system to detect MCI. Through the analysis of ERP variability and its asymmetry from an auditory oddball paradigm, a comparison was made between individuals with MCI and CN. Furthermore, the research delved into understanding the link between ERP variability plus its asymmetry and MCI prevalence taking into account the demographic and neuropsychological scores.

The trial-to-trial variability analysis revealed distinct patterns in MCI patients compared to CN individuals. The patients with MCI exhibited unstable neural activity, i.e., increased variability measures of AMPV and AUCV in standard and target responses, and had more variability asymmetry compared to the CN. This association with MCI persisted even after incorporating demographic characteristics such as age, sex, and education level, and subsequently the MMSE score. This suggests that they could potentially serve as reliable predictors for MCI, irrespective of demographic and neuropsychological measures.

First, in the analysis of the variability measures of ERP components, as hypothesized, we observed that patients with MCI had more variability compared to the CN individuals, i.e., more variability in the amplitudes (AMPV) and area under the curve (AUCV). This neural variability was observed during perceptual processing (P2) and memory updating processes (P3) in both the non-target and target neural responses. The amplitude is known to provide insights into the level of cognitive processing during a task [62] and its variability in a specified period could be an indicator of the underlying deficit in attention associated with neurogenerative diseases [63]. Amplitude also correlates with attention, stimulus identification, and memory [64], and its variability could reflect the efficiency and stability of neural networks necessary for reliable encoding and retrieval of memories [38] and so our results reveal instability and inefficiency in neural processing in the patients with MCI.

The ERP variability measures were derived from the response variance curve by assessing the variance originating from trial-to-trial ERPs utilized in the conventional average ERP [30]. This approach evaluates the variability of data points within a specific time frame, providing insights into the fluctuations in neural responses, and has been particularly useful for distinguishing conditions characterized by attention deficits linked to variability in the CNS. In the research focused on ADHD [21] and Schizophrenia [34], it was observed that patients with these conditions displayed higher variability in amplitudes when compared to individuals without these disorders. This variation was attributed to the attention deficits that are typical of both ADHD and Schizophrenia. Despite scanty studies on the use of trial-to-trial ERP variability for AD or MCI distinction from normal individuals, our study agrees with [36, 38] who reported increased ERP variability in participants with neurodegenerative diseases compared to healthy individuals. These results reveal unstable perceptual and cognitive processing in the patients with MCI compared to cognitively healthy individuals. These could be attributed to several factors: First, there could be variability in the severity and progression of cognitive decline in patients with MCI due to their heterogeneity [52, 65], thereby influencing heightened disparities observed in ERP patterns in these individuals. Additionally, due to the larger proportion of older individuals with MCI in our dataset (mean age approximately 74 years), we could attribute the heightened neural variability in MCI to diminished neural plasticity, a characteristic feature of late-stage MCI [32]. Moreover, the increased variability could potentially have maladaptive effects on perception, likely stemming from increased internal noise [33]. After accounting for demographic factors such as age, sex, and education level, as well as the MMSE score, the neural variability measures showed a notable association with MCI. This implies that they may serve as dependable predictors for MCI, regardless of demographic and neuropsychological factors.

Next, in the analysis of neural variability asymmetry, we observed heightened measures in the MCI compared to the CN. Because, a greater peak amplitude or AUC of P3 ERP corresponds to more neural activity, a positive asymmetry corresponds to more activity in the right hemisphere while a negative asymmetry measure corresponds to more activity in the left hemisphere. Extending this interpretation to the asymmetry result indicates that there was more neural variability in the right hemisphere compared to the left hemisphere in both groups, with it being pronounced in the MCI compared to the CN group. The greater ERP variability in the right hemisphere compared to the left hemisphere in both groups and generally in MCI compared to the CN may not be surprising as there is a reported prevalence of depression in patients with MCI of approximately 32% [45], with depression linked to increased activity in the right prefrontal lobe and withdrawal reactions to unpleasant stimuli, frontal asymmetry [41, 42]. According to [66] the asymmetries observed in late ERP components, such as the P300, are likely contingent upon the nature of cognitive processing and the dominant hemisphere of the participant, underscoring that these asymmetries are more connected to asymmetrical cognitive processes than to fixed structural distinctions in the hemispheres. Studies like [43, 66] have explored asymmetries based on the EEG/ERP, however, to the best of our knowledge we are not aware of studies that have explored asymmetry of trial-to-trial variability. The corresponding asymmetry measures of neural variability displayed a significant association with MCI, even after adjusting for demographic factors such as age, sex, and education level, as well as the MMSE score. This reaffirms their potential as reliable predictors for MCI, independent of demographic and neuropsychological factors.

Therefore, trial-to-trial variability and its associated asymmetry measures can be useful markers differentiating MCI, independent of neuropsychological screening tests like the MMSE, and are promising complements to the MMSE. They could also serve as valuable complements to the widely used ERP amplitude and latency measures. We hope that these features can be leveraged to improve EEG/ERP-based models, similar to those discussed in studies [48, 67,68,69,70,71], for screening MCI/AD.

This study had some limitations. Firstly, variability in neural responses could be due to some factors for instance the effects of learning or habituation, fatigue during extended experimental tasks, or random variations in engagement, etc., which were uncontrollable factors in our study design. Secondly, it’s important to note that our findings might have limited generalizability since our data is composed of exclusively ethnically Korean participants. Thirdly, within the group of MCI participants, there was a mix of phenotypes (amnestic and non-amnestic) with unaccounted heterogeneity within these phenotypes. This lack of distinction might impact the interpretation of our results. Finally, it’s worth noting that the discontinuity of the ERPs may have been caused by the elimination of noisy epochs, as described in our methodology. Regrettably, this aspect was not accounted for in our analysis and interpretation of results.

In conclusion, this study showcased the potential utility of intra-individual ERP variability and its associated asymmetry measures obtained from a portable EEG device in differentiating MCI from CN elderly. We extensively described these variability and asymmetry measures and explored their connection with MCI. Our results indicated that individuals with MCI exhibited increased ERP variability and higher variability asymmetry compared to CN individuals. The logistic regression analysis unveiled that ERP variability and its related asymmetry measures retained statistical significance despite adjusting for demographic and neuropsychological measures. This suggests that ERP variability could not only complement traditional neuropsychological tests but also enhance conventional ERP measures of amplitude and latency in screening mild cognitive deficits. To the best of our knowledge, no prior studies have investigated trial-to-trial variability in MCI, especially using a relatively larger sample size.

In future research endeavors, it would be crucial to substantiate these results by expanding the study to include a diverse ethnic population given that the current investigation was limited to a Korean population. Moreover, investigating these findings by focusing on a relatively homogeneous MCI subgroup, such as amnestic MCI patients, would help mitigate the influence of MCI patient heterogeneity on the present results. Furthermore, to enhance predictive models for MCI, it is essential to consider integrating these ERP variability measures alongside conventional ERP measures and/or neuropsychological test scores. This comprehensive approach could potentially lead to more accurate and reliable assessments of cognitive deficits and improve our understanding of related neurobiological processes.