Introduction

The conventional pattern-reversal visual evoked potential (VEP) technique is used as an objective method of evaluating the visual pathway. Electrodes placed in standardized positions detect electrical signals generated by the visual cortex while the patient fixates at the center of a phase reversing checkerboard pattern. Since the VEP technique evaluates the final output of the entire visual system, it may be potentially useful for the detection of any type of visual abnormality [1]. However, problems such as positioning of electrodes, limited repeatability, subjective analysis of the waveforms, and long test duration have limited its use in clinical practice. Improvements on the original technique that could eliminate some or all of these problems have the potential to increase the clinical applicability of VEP as a screening tool for a range of eye diseases and visual conditions.

The Diopsys Enfant™ system (Diopsys, Inc., Pine Brook, New Jersey, USA) is a VEP technology that has been used as a screening tool for detecting amblyopia and other visual deficits in young children [2]. We modified this system in order to decrease test duration by means of a more efficient signal acquisition technique followed by a post-processing algorithm that provides less subjectivity in waveform assessment. Such improvements could possibly result in low test–retest repeatability that could compromise its clinical usefulness. We aimed to evaluate the within-session and inter-session repeatability of this modified VEP technique in normal individuals.

Methods

The study was approved by the Institutional Review Board of New York Eye and Ear Infirmary and The Pennsylvania College of Optometry of Salus University and informed consent was obtained from all participants.

Thirty normal individuals were prospectively enrolled. Subjects underwent complete ophthalmologic examination, including slit-lamp biomicroscopy, dilated fundus examination and visual field testing (24-2 SITA-Standard strategy, Carl Zeiss Meditec, Inc., Dublin, CA). All individuals had 20/20 best corrected visual acuity, normal biomicroscopic examination and normal field tests.

Apparatus

Transient VEPs (tVEP) were generated using a Diopsys Enfant™ System (Diopsys, Inc., Pine Brook, New Jersey, USA), with software modifications. Because the test duration was shortened considerably by the modifications to the system, we have termed this testing paradigm short-duration tVEP (SD-tVEP).

The stimulus was presented on a gamma-corrected Phillips 170B7 17-inch LCD monitor, running at 75 Hz. Luminance output over time was verified using a luminance meter (MavoSpot 2 USB, Gossen Gmb, Nuremberg, Germany). The monitor was set to operate in the linear luminance region with less than 4% measured residual flicker. Gold cup electrodes (Grass Model F-E5GH, Astro-Med, Inc., West Warwick, RI) (10 mm) and commercially available skin preparation and EEG paste were used for recording of the SD-tVEP. Synchronized single-channel SD-tVEP waves were recorded and generated a time series of 240 data points per analysis window.

Stimulus

The circular black/white checkerboard pattern reversal stimulus had a diameter of 22 cm with a red circular outline used as fixation target. The diameter of this target was 1 cm with a thickness of 1 mm. The check size was 28.87 min of arc. Two pattern contrasts were used in the study, based on previous studies that suggested that differential contrast stimulation could affect the VEP waveforms [3, 4]. The first pattern had white checks of 122.9 cd/m2 and black checks of 101.1 cd/m2 resulting in a Michelson contrast of 10% and mean luminance of 112 cd/m2. The second pattern had white checks of 122.9 cd/m2 and black checks of 9.59 cd/m2 resulting in a Michelson contrast of 85% and mean luminance of 66.25 cd/m2. The two patterns will be referred to as 10% contrast and 85% contrast. In all cases, the display was viewed through natural pupils with optimal refractive correction in place. The viewing distance was set to 1 meter, yielding a total display viewing angle of 12.54 degrees.

During a recording session, the luminance of each stimulus check was temporally modulated in counter phase at 1 Hz (two reversals per second). The 10 and 85% contrast stimuli were presented in a sequence of OS, OD, and OU for a minimum of 40 pattern reversals each. If a motion or blink artifact was detected, that epoch was ignored and repeated automatically. During monocular testing, the fellow eye was covered. The sequence was repeated five times with a 3-min break between each sequence. The five sequences were first done at 10% contrast and then repeated at the 85% contrast level.

Since an LCD display will cause apparent (artificial) delay in the P100 latency due to the input lag of the monitor [5], software modifications were made to compensate for the lag. This lag is caused by the finite time the LCD display takes to process the input video signal and display the pattern. This inherent delay of the LCD display was calculated using an optical feedback circuit. The standard filter/amplifier circuit and electrodes that would normally record the SD-tVEP was replaced with a photo detector circuit. The photo detector circuit generated a distinctive signal every time the pattern on the monitor was phase reversed. This signal was compared to the monitor’s input signal, and the length of the delay was calculated.

Five normal subjects were tested on both an LCD and CRT monitor to test for the effects of the asymmetrical luminance onset/offsets (flicker) of the LCD monitor. A Phillips 170B7 LCD monitor and a Mitsubishi SpectraView 7000 were used for the comparison study. Testing only OD, 15 tests (20 s test time, 2 reversals per second, 85% contrast) were performed with a 10-min break between monitors. The P100 latency and delta amplitude were compared between the monitors, and the results analyzed using a paired sample t test. For all patients tested, there was no significant difference between the two groups (alpha = 0.05) for P100 latency and delta amplitude.

Recording procedures

One EEG channel was recorded using gold cup electrodes. The electrodes were placed 4-cm above the inion (active) and on the frontal tuberosity centered on the midline (reference). While this deviates from the International Society for Clinical Electrophysiology of Vision (ISCEV) protocol of 10% of the inion-nasion distance, an approximation of 4 cm was used to decrease overall test preparation time. The left side of the forehead just in front of the temple served as ground.

In preparation for recording, the skin at each electrode site was scrubbed with Nuprep (D.O. Weaver & Co., Aurora, CO) on a cotton-tipped wooden swab. Electrodes were fixed in position with Ten 20 conductive paste (D.O. Weaver & Co., Aurora, CO) and secured with a small gauze pad with conductive paste applied. Electrode impedance was maintained below 10k ohms in all cases and was usually below 5k ohms. Analog signals were amplified by 104 (Diopsys Enfant Amp 100, Diopsys, Inc., Pine Brook, New Jersey, USA), band-pass filtered with cut-off frequencies of 0.5–100 Hz and sampled at 600 Hz using the Enfant System.

Data analysis

The stored SD-tVEP data for each subject was exported from the device’s relational database to an external binary file to be processed by an external signal processing algorithm [6]. All subjects had their data imported into the signal processing algorithm. The data had the N75-P100-N135 complex temporal epoch identified by windowing and bilateral band pass filtering utilizing Derr’s extraction method [6]. The low pass cutoff frequency was chosen to be 30 Hz. The high pass frequency was set to 5.1 Hz so as to minimize low frequency drift. Reducing this drift enhances an automated method for isolating the N75-P100-N135 complex. The following information was identified from the filtered N75-P100-N135 complex: N75 amplitude, N75 latency, P100 amplitude, P100 latency, and Delta Amplitude (N75-P100) amplitude.

Statistical analysis

The mean (μ) and standard deviations (SD) for the five SD-tVEP, for each OD and OS at each contrast level were calculated, and their ratio (SD/μ) defined the within-subject coefficient of variation (Sw). The standard deviations for OD and OS were grouped into two data sets based on contrast level. Intraclass correlation coefficients (ICC) were calculated for the Delta Amplitude (N75-P100) amplitudes and P100 latency for one eye. To test the degree of inter-eye symmetry in the responses, we calculated the coefficients (r) based on Pearson’s correlation between the average values of OD and OS. Bland–Altman plots were created to evaluate the inter-eye agreement, as well as the agreement between the first and last t-VEP run for each eye at each contrast level. To test the inter-session repeatability, a subset of ten patients (one eye per patient) of this database had their tests repeated within 1–2 months of the initial visit. Data from day 1 and day 2 were compared. The results from OU runs were not included in the statistical analysis.

Results

Thirty normal subjects were enrolled. The total number of tests was 900 [30 patients × 3 (OD, OS, OU) × 5 (re-tests) × 2 (10 and 85% contrast)]. The mean age was 27.3 ± 5.2 years; 66% were men and 53% were of European ancestry. Figures 1 and 2 show an overlay of five consecutive tests of the right eye of one the subjects.

Fig. 1
figure 1

An overlay of five OD 10% contrast tests

Fig. 2
figure 2

An overlay of five OD 85% contrast tests

The median, first and third quartiles, and inter-quartile range (IQR) for each parameter are shown in Table 1 (A and B). Considering the N75-P100 amplitudes at 10% contrast, the median values for this population were 4.63, 4.11, and 5.55 uV for OD, OS, and OU, respectively. Using 85% contrast, the Delta Amplitude (N75-P100) amplitude median values were 7.13, 6.98, and 7.2 uV. The median P100 latencies at 10% contrast were 115.17, 114.83, and 109.33 ms for OD, OS, OU, respectively. At 85% contrast, the median P100 latencies were 99.52, 100.89, and 98.50 ms. Table 2 shows the repeatability indices for each parameter. The ICC values at 10% contrast P100 latency and Delta Amplitude (N75-P100) amplitudes were 0.64 and 0.92, respectively. At 85% contrast, the values were 0.65 and 0.98, respectively. The within-subject coefficients of variation at 10% contrast were 5 and 30%, and at 85% they were 3 and 15%, respectively. Bland–Altman plots showed good agreement between the first and fifth test sessions. At 85% contrast, Delta Amplitude (N75-P100) mean difference was 0.48 uV, with 95% confidence interval (CI) between −0.18 and 1.12; for P100 latency, the mean was −0.82 ms, 95% CI, −3.12–1.46. At 10% contrast, Delta Amplitude (N75-P100) mean difference was 0.58 uV, 95% CI, −0.27–1.45; and for P100 latency delay, the mean difference was −2.05 uV, 95% CI, −5.12–1.01 (Fig. 3a–d).

Table 1 Summary of test–retest repeatability of different parameters using SD-tVEP
Table 2 Intraclass correlation coefficients (ICC), standard deviation (SD), and within-subject coefficient of variation (Sw) of the tested parameters
Fig. 3
figure 3

Bland–Altman plots showing the agreement between mean values of OD and OS. a 85% contrast Delta Amp. (N75-P100) (uV), b 10% contrast Delta Amplitude (N75-P100) (uV), c 85% contrast P100 latency (ms), d 10% P100 contrast latency (ms)

For the 85% contrast tests, there was a strong and significant correlation of both delta amplitudes and P100 latencies between fellow eyes (r = 0.93 and r = 0.73, P < 0.01, respectively). For the 10% contrast tests, the correlation was also strong and significant (r = 0.89 and r = 0.62, P < 0.01, respectively). The Bland–Altman plots are presented in Fig. 4a–d. There was a good overall agreement between fellow eyes for both Delta Amplitude (N75-P100) amplitudes and P100 latencies at 10 and 85% contrast. The mean difference between fellow eyes ranges from 0.1 to 0.3 uV for amplitudes and 0.5 ms for latency.

Fig. 4
figure 4

Bland–Altman plots showing the within-session agreement between first and fifth run of the short duration t-VEP. a 85% contrast Delta Amp. (uV), b 10% contrast Delta Amplitude (N75-P100) (uV), c 85% contrast P100 latency (ms), d 10% contrast P100 latency (ms)

Due to the length of testing protocol in this study, there was a need to investigate the effects of patient fatigue. All patient data from individual trials were pooled together and analyzed for trends of decreasing amplitude and/or increasing latency. The results indicated minimal effects of fatigue. The ranges for the median values of the amplitude and latency at 10% were (4.14–5.47 uV) and (114.17–117.50 ms), respectively. The 85% values were (6.60–7.87 uV) and (103.30 and 105.00 ms).

The inter-session repeatability ΔICC values for 10% latency and amplitudes were 0.86 (0.46–0.96) and 0.85 (0.40–0.96), respectively. For 85% latency and amplitude, the values were 0.71 (95% CI = 0.15–0.92) and 0.81 (0.23–0.95), respectively. Bland–Altman plots comparing the results of days 1 and 2 are shown in Fig. 5a–d.

Fig. 5
figure 5

Bland–Altman plots showing the inter-session agreement between days 1 and 2 of the short duration t-VEP. a 85% contrast Delta Amp. (N75-P100) (uV), b 10% contrast Delta Amplitude (N75-P100) (uV), c 85% contrast P100 latency (ms), d 10% contrast P100 latency (ms)

Discussion

We evaluated the within-session and inter-session repeatability of a new SD-tVEP technique in normal subjects. Using 10 and 85% contrast, the responses to checkerboard contrast reversal stimuli showed good repeatability. There was a strong agreement between the first and fifth run of the subjects, as well as between two separate visits. In addition, we found a strong and significant inter-eye agreement and correlation among subjects. Such results were obtained during a test run time of 20 s per eye (40 reversals) and still allowed straightforward determination of the P100 peak latencies and amplitudes.

The within-subject repeatability of visual evoked potentials has been investigated in different populations using various modalities of the technique [711]. Each technique has its own characteristics, and comparisons should be made within the same technology. Also, the way repeatability is tested and measured may vary according to the study. Using multifocal VEP technique, Chen et al. [10] found strong correlations (Pearson’s r = 0.88–0.92) of amplitudes obtained between two sessions in the same day, and SD values were 1.1–1.5 (measured in root-mean square). Lauritzen et al. [11] found a mean coefficient of variation of swept visual evoked potential (SWEEP-VEP) amplitudes of 16 ± 7% among seven healthy adults at 80% contrast. Future studies ought to replicate our findings using the same technique.

Derr et al. [Derr P, Tello C, De Moraes CG, Patel J, Prata TS, Sigfried J, Liebmann JM, Ritch R (2009) Fast transient VEP parameter identification algorithm for evaluating glaucoma eyes. Invest Ophthalmol Vis Sci 50: E-Abstract 5886] showed an improved performance of the herein described algorithm in identifying the N75-P100-N135 complex both in normal subjects (n = 60) and glaucomatous eyes (n = 84). Compared to visual inspection of the raw VEP waveforms, the method allowed better discrimination between glaucomatous and healthy eyes. This information combined with the findings of the present study support a potential applicability of this technique as an adjunctive technology in eye care, similarly to what has been reported using other VEP technologies [1218].

Even though we did not aim to compare patient performance at each contrast level, we found lower amplitudes and increased latencies using 10% contrast than 85% contrast. Some of these differences may have been due to the results of the change in luminance between contrast levels. Moreover, at 85% we found narrower confidence intervals both for the median values and within-session variation than at 10% contrast. The influence of the stimulus contrast on the VEP parameters was first observed by Duwaer and Spekreijse [19] in patients with multiple sclerosis: with lower contrast, the amplitude becomes smaller and the latency may increase. Changes in VEP responses associated with luminance contrast stimuli have also been observed in other diseases such as glaucoma [3, 4]. Using different contrast stimuli in patients could be employed in the future to try to evaluate selective visual dysfunction in various ocular diseases. This device’s hardware would require modifications to the display as to maintain constant luminance for each contrast level.

It is important to note that our sample was a relatively young population and patient age should be taken into account when interpreting VEP waveforms [2022]. Shaw and Cant [20] showed that the amplitudes are highest in childhood, decrease up to the fourth decade, increase again, but decrease once more after the sixth decade. The latency may also increase in later life. Thus, the normative values found in our study should be interpreted cautiously when compared with older individuals. This factor notwithstanding, the new device showed good within-session and inter-session repeatability in a young healthy sample.

We found a significant inter-eye correlation and agreement with this device. Interocular variation should be more precise when used to detect abnormalities than a comparison with a group of normal controls [23], especially in asymmetric diseases (e.g.: amblyopia, glaucoma, age-related macular degeneration, optic neuritis). By comparing the differences in amplitude or latency between fellow eyes, one can correlate with other clinical findings (such as fundus examination, intraocular pressure, and refractive error) and increase the likelihood of detecting eye diseases which are often asymmetric. This may be particularly helpful in subtle or early stage conditions in which the clinical examination provides insufficient information to establish a diagnosis based upon the assessment of a single eye alone compared to a normative database. The good inter-eye agreement and correlation found in our study could therefore be further investigated regarding the ability of the SD-tVEP in detecting early asymmetric eye diseases. Such potential use has been successfully demonstrated in patients with glaucoma using multifocal technology, for instance [24].

The observation that this test also presented good inter-session repeatability may have significant implications. It is known that the simple act of removing the electrodes and immediately repositioning them could cause variation in the measured electrical responses. Such inter-session variability could be a limitation for its use in certain conditions, especially for those requiring longitudinal evaluation of disease progression and/or response to treatment. As demonstrated in our results, there was a reasonable inter-session agreement when patients were tested on different days. With a larger and age-matched database, this could be used for follow-up of chronic diseases such as diabetic maculopathy and glaucoma, for instance.

The study’s objective was to determine the within-session and inter-session repeatability of the proposed apparatus, using a specified screening protocol in conjunction with post-processing the signal-averaged VEP with a test time of 20 s. This short-duration test time was not an issue for the normal population, but might not be adequate for all populations. Methods analyzing the signal to noise ratio with a stop criterion might prove useful in patients with ophthalmological disease.

One limitation of the conventional VEP technique is its high variability between individuals. The anatomy of the striate cortex (primary visual cortex or V1 area) varies greatly among individuals [2528] and could also be influenced by the variation of extrastriate areas of the cortex [2931]. There is also variation of the recorded responses dependent on patients’ attention and accommodation [32], which we tried to minimize using a central target for fixation.

The term “short duration transient VEP” (SD-tVEP) was named so as to reflect the time required to perform a testing protocol on a patient. It is important to note that the time to perform the test includes acquiring forty 500 ms epochs of tVEP data and performing objective analysis on the resulting signal-averaged response. The analysis, based on the Derr algorithm [6] automatically extracts a reproducible N75-P100-N135 complex from 20 s of VEP data and calculates all parametric values. As a result of the objective analysis, the protocol or a modified protocol may allow for VEP technology to become a viable screening device. It should also be noted that the devices’ proprietary synchronization of pattern presentation and data collection adds to a cleaner signal. Combining the system’s ability to lift the VEP signal out of the noise in 40 epochs, and the automatic extraction algorithm being able to detect this signal allows for a 20-s run.

In summary, our study showed that a new VEP technique with shorter test duration and simplified waveform extraction showed good within-session and inter-session repeatability, as well as strong inter-eye correlation. Further investigation to determine its usefulness as a screening tool for abnormalities of the visual pathways is warranted.