Introduction

Background and purpose

Clinical electrophysiology of vision comprises non-invasive tests which assess visual function including different types of electroretinogram (ERG), visual evoked potential (VEP) and electro-oculogram (EOG) [1]. The International Society for Clinical Electrophysiology of Vision (ISCEV) establishes and regularly updates clinical Standards and approves extended protocols for clinical electrophysiology of vision tests, to promote reliable diagnosis and monitoring with comparable results across testing centres [https://iscev.wildapricot.org/standards]. ISCEV also publishes Guidelines that provide context for the ISCEV Standards and protocols. The ISCEV Guide to visual electrodiagnostic procedures [1] introduces the principles and common clinical applications of ISCEV Standard tests. These Guidelines to calibration and verification aim to provide concise information for practitioners and instrument manufacturers, for the calibration, measurement and verification of instruments and protocol settings; this updates and supersedes the 2003 Guidelines for calibration of stimulus and recording parameters used in clinical electrophysiology of vision [2].

ISCEV standards for the full-field ERG [3], EOG [4], VEP [5], pattern ERG (PERG) [6], multifocal ERG (mfERG) [7], as well as eight ISCEV extended protocols [8,9,10,11,12,13,14,15], specify stimulus and recording parameters. For all protocols, the electrical potentials of the retina or cortex evoked by visual stimuli are detected from surface electrodes, amplified, filtered and digitised, then measured, typically with averaging to extract the signal associated with the stimuli. As both stimulus and acquisition system characteristics have substantial effects on the waveform, peak time and amplitude of responses, meaningful interpretation requires stimulators and acquisition systems to be accurate. Regular verification and periodic calibration of stimulus and recording parameters is essential to ensure the reliability and stability of equipment and stimuli, for reliable sequential monitoring and comparison with reference data.

This document, including “Appendices 15”, provides guidance for calibration and verification of stimulus and recording systems specific to clinical electrophysiology of vision.

Clarification of terms

The authors understand that the terms calibration, adjustment and verification have strict metrological definitions [16]. Given the importance of accessible Guidelines, we use the terms here with their more broadly understood but colloquial meaning. ‘Verification’ is used here to mean the process of measuring a parameter, for example flash strength, to ensure it remains within the tolerance specified by the appropriate Standard. ‘Adjustment’ is used here to mean the process of altering a parameter which has drifted beyond the measurement uncertainty of the instrument. ‘Calibration’ is the process which encompasses verification measurements and any required adjustments compared with a reference standard such as the calibration undertaken by manufacturers.

“Appendix 5” is an additional reference list selected by ISCEV members to provide broader information on topics such as physiologic amplifiers, signal processing, physiological band-pass filters, visual display systems and/or photometry. As for all other ISCEV Standards and Guidelines, this document does not contain safety standards, standards for clinical care and management, or advice for patient privacy or data protection; users of this Guideline must choose instruments approved for clinical use in their jurisdiction and follow local requirements for clinical safety, care, and data protection. This Guideline was adopted by the ISCEV Board of Directors as an official publication on 1 March 2023.

Stimulus calibration and verification

Background

Three types of stimuli are used in ISCEV Standard tests and extended protocols. Luminance flash stimuli provide uniform stimulation across retina; these are used in combination with full-field, uniform, steady backgrounds for light-adapted testing. Pattern stimuli for ISCEV Standard tests are high contrast black-and-white checkerboards presented as either contrast reversal or as onset/offset from a uniform grey field that matches the space-averaged luminance and colour of the checkerboard. Stimuli for multifocal testing comprise an array of hexagonal elements that change rapidly between dark and light (on or off) driven by a predetermined pseudorandom binary sequence (m-sequence).

The light stimuli used in ISCEV protocols are measured using photometric quantities, a system that weights electromagnetic radiation by the nominal spectral sensitivity of a standard human observer. Luminance is the photometric quantity pertinent to extended sources such as a visual display screen and its pattern elements and to uniform backgrounds presented in an integrating sphere (ganzfeld). For brief flashes, the pertinent quantity is time-integrated luminance. The strength of ISCEV stimuli are specified in photopic units which are based on the relative sensitivity of the light-adapted cone system; in addition, for dark-adapted recordings of full-field ERGs, both photopic and scotopic units may be specified. Current ISCEV Standards stipulate achromatic stimuli. Pattern stimuli are specified as black-and-white; brief flashes and backgrounds are specified as visibly white, with a scotopic to photopic ratio of approximately 2.5 (see [3]). Chromatic (non-white) flashes and backgrounds are used in several of the ISCEV extended ERG protocols [8,9,10, 13]. When testing under dark-adapted, rod dominated conditions, it is necessary to comply with scotopic specifications for both achromatic and chromatic stimuli because the scotopic to photopic ratios of stimuli vary markedly with the spectral content of the light.

Visual stimulus generators

Flash and pattern stimulus generators for clinical electrophysiology of vision testing should be capable of matching the specifications of the Standard protocols and preferably allow generation of a broader range of stimuli to accommodate ISCEV extended protocols and other, customised testing. Stimuli should be calibrated and remain stable over time, with regular verification. Adjustments are required when a visual stimulus generator fails to provide a stable output. The limits specified by the various Standards are designed to be inclusive of a range of instruments and established laboratory reference data; within laboratories instruments should be adjusted to remain within the measurement uncertainty of the instruments. We encourage manufacturers to include automated protocols for verification and adjustment and for laboratories to use these frequently (i.e. daily or before each patient) when these are available.

Most manufacturers offer stimulus calibration with a known measurement standard as part of ongoing maintenance or service contracts. If practitioners do not undertake their own photometric calibrations, they should obtain manufacturer calibrations. An earlier version of this Guideline recommended verification of calibrations at a maximum interval of 6 months [2]. However, stability of stimuli is highly dependent on the technology used. The current maximum intervals for verification based on technology are tabulated in “Appendix 1”. More frequent checks may be required to check stability if values change substantially between successive measurements. The onus is on practitioners to ensure adequate verification and any necessary calibration of stimuli and to interpret recordings with appropriate consideration of stimulus accuracy and stability.

When automated systems are not available, measurement for verification and adjustment, requires a photometer(s) capable of measuring the luminance of both extended sources, including small areas (pattern elements) and the time-integrated luminance of brief flashes. The photometer should be capable of accurate measurement of low luminance levels, e.g. the pattern VEP dark check of approximately 0.1 cd m−2. It should be equipped to measure both photopic and scotopic quantities. The photometer field of view should be several degrees and preferably tens of degrees to measure ganzfeld stimuli. The time-integrating function must be capable of measuring a single brief flash as well as multiple brief flashes over several seconds and should be sufficiently sensitive to measure weak flashes (0.01 phot cd s m−2, 0.025 scot cd s m−2). Photometers require calibration, typically annually, which should use a source traceable to an international standard.

Measurement of luminance stimuli

For continuously lit or reflecting sources such as display screens and ganzfeld backgrounds, the appropriate measure is luminance (units cd m−2). This is a measure of the steady-state luminance level of the surface. For brief flashes, the appropriate measure is time-integrated luminance (units cd s m−2). This is a measure of the total luminance delivered by the flash. For clinical electrophysiology of vision, brief flashes range from less than 100 microseconds for discharge lamps up to 5 ms for LEDs and other sources. “Appendix 2” describes methods for measuring brief flashes, flicker stimuli and backgrounds used in light-adapted full-field ERGs, flash VEPs and EOG tests.

Measurement of pattern stimuli

Visual display units (VDUs) or other optical imaging systems generate pattern stimuli used in clinical electrophysiological tests, including VEPs, PERGs and mfERGs. “Appendix 1” lists the main types of VDUs with information about their application to clinical electrophysiology of vision and calibration frequency. Pattern stimuli require calibration and verification of their mean luminance, contrast, pattern element size and the field size.

Mean luminance: The mean luminance is of the utmost importance for pattern stimuli as it affects amplitudes and peak times, particularly for PERGs and mfERGs. Mean luminance is expressed as:

$$ \frac{{L_{\max } + L_{\min } }}{2} $$
(1)

where Lmax and Lmin are luminances of light and dark pattern elements, respectively, when these are equal in size and duty cycle (relative time of presentation). The ISCEV Standards require mean luminances of 40–60 cd m−2 for the VEP [5], > 40 cd m−2 for the PERG [6] and > 50 cd m−2 for the mfERG [7].

Luminance transients: Pattern reversal or pattern onset/offset stimuli must not have transient changes in mean space-averaged luminance of the whole display. Pattern onset/offset stimuli are particularly prone to unwanted luminance artifacts, where the uniform grey field does not match the patterned field in mean space-averaged luminance. Any transient or step change in luminance with each pattern change creates a luminance artifact that will contaminate the pattern response with a luminance response. A qualitative method of checking for luminance transients can be achieved by observation. With room lights off to enhance sensitivity, a sheet of white paper is held in front of, and parallel to, the plane of the stimulus at a distance of approximately 30 cm and viewed by an observer standing behind the display while the stimulus is delivered. The diffused reflection of the display on the paper should appear as a constant brightness. If any change is perceived, the stimulus must be adjusted to remove it.

Contrast: The Michelson contrast of pattern stimuli is defined as:

$$ {\text{Contrast}}\,(\%) = \frac{{L_{\max } - L_{\min } }}{{L_{\max } + L_{\min } }} \times 100 $$
(2)

where Lmax and Lmin are the luminances of the light and dark elements, respectively. Thus, contrast ranges from 0% for a homogeneous field to 100% when the dark checks have a luminance of zero. Although ISCEV Standard VEPs require a minimum contrast of 80% [5], VEP waveforms change very little for contrast levels above approximately 50%. However, mfERG and PERG amplitudes increase linearly with contrast so that stable, high levels of contrast are critical; minimum contrast levels are 90% for mfERGs and 80% for PERGs. It is advisable to protect manual controls for ‘brightness’ and ‘contrast’ to prevent accidental changes. Methods for measurement of pattern mean luminance and contrast are presented in “Appendix 3”.

Element and field size: Pattern evoked potentials show spatial tuning and thus are affected by the size (angular subtense) of the pattern elements. For guidance, a pattern element of 1 cm subtends approximately 1° (60 min of arc) at a viewing distance of 57 cm. The elements in checkerboard stimuli are specified by check width in degrees or minutes of arc. The hexagons in mfERG stimuli are scaled as specified in the mfERG standard with a smaller central hexagonal element surrounded by progressively larger elements. The field sizes for pattern stimuli are specified by the angular subtense of both height and width. Procedures for calculating visual angles for fixed viewing distances and for choosing a viewing distance to obtain a desired visual angle are given in “Appendix 3”.

Triggers and temporal measurements

Stimulus frequency: Electronic instruments typically use precise crystal oscillators to regulate timing. These are accurate within nanoseconds and more accurate than physiologically relevant values. Thus, temporal frequency of stimulus presentation rarely requires validation and typically does not change within an instrument. It is important, however, that stimulus presentation is synchronised with a trigger for signal averaging to be effective and for the accurate measurement of peak times.

Stimulus duration, onset latency and display rise times: ISCEV stimuli vary from very brief flashes such as those from discharge lamps lasting a few nanoseconds to patterns that are sustained for half a second or more. Measurements from stimulus onset to the peaks in the response waveform are typically the most sensitive and specific diagnostic measures in clinical electrophysiology. ISCEV protocols presume that the trigger and the measurement point for stimulus onset are coincident; any delays introduced by the hardware or software of an instrument should be corrected before reporting peak times.

‘Stimulus onset’ is not instantaneous. Displays differ in the time taken to reach peak luminance (display rise time). Although CRT screens have rapid onset (within 1 ms) the onset is not simultaneous across the screen; the stimulus trigger for VEPs should be set at the midpoint of the raster or compensation made for the delay [5]. For PERGs the convention based on the 2013 ISCEV Standard is to set the trigger at the beginning of the raster scan or to note any differences [6]. For other displays (“Appendix 1”), the default assumption is that the onset trigger is set at the midpoint of the onset; any deviations or corrections for onset triggers should be noted.

When offset time is relevant, such as for mfERG recording, it is important that the total response time (onset plus offset time) is considerably less than the duration of an m-sequence step (e.g. < 13.3 ms for a step duration of one frame at 75 Hz frame rate) [7]. The onus is on the manufacturers and users to ensure that the onset timing of the instruments in use match those of the normative data. Care should be taken when changing instruments to correct for any changes in the timing and triggering of stimuli.

Acquisition systems

Background

Electrophysiological signals produced by the retina and visual cortex typically have small amplitudes in comparison to unwanted sources such as electrical signals at the line frequency (50 or 60 Hz) or from electronic equipment (often 100 or 120 Hz) and physiological ‘noise’ generated by muscle activity, electrical activity in the brain and eye movements. Differential amplifiers are used to amplify the difference between the two inputs, rejecting signals common to both inputs. Electrodes placed near the relevant signal source, distant from it, and at an indifferent point are connected to the positive (active), negative (reference), and common (ground) inputs of the acquisition system, respectively. High amplitude artifacts, such as those generated by eye movements or blinks, are typically excluded from the signal on-line or post hoc using rejection criteria based on amplitude; for example, the PERG Standard [6] suggests a ± 100 µV rejection threshold. Stimulus-locked signal averaging increases the signal to noise ratio; the choice of the number of averages should be governed by signal to noise conditions and meet the requirements of the specific Standards and extended protocols.

The amplified signal is digitised into arrays of time and voltage values using an analogue-to-digital convertor, and processing such as filtering and averaging reduces the noise further. Sampling frequency and some filter characteristics are specified in ISCEV Standards and extended protocols. Analogue filters and digital emulations introduce frequency-dependent phase changes and can alter waveforms and peak times which should be considered if filter settings differ, for example between serial recordings or between a recording and reference data. Amplifier gain should be adjustable and should ensure high amplitude resolution, and avoid any nonlinearities at the extremes of the range which could distort signals (e.g. clipping of the peaks).

Measurement of electrode impedance

Electrode impedance: ImpedanceFootnote 1 is measured by passing a low amplitude (≤ 1 µA) alternating current (AC, 10–100 Hz) through the tissue between a pair of electrodes. Devices used to measure impedance of physiological electrodes in situ must be specifically designed to do so and approved for medical use. Impedance depends on frequency but typically varies little between 10 and 100 Hz, the frequency range of interest in most physiological recordings. DC ohmmeters must not be used as these typically apply relatively high currents and will polarise electrodes, resulting in unreliable measures. Each ISCEV standard and protocol specifies the upper limits of inter-electrode impedance differences; low, matched input impedances will improve the signal-to-noise ratio of response waveforms. These specifications are pertinent to ‘passive’ electrodes: no specifications are made for ‘active’ electrodes which incorporate pre-amplification, as they have less stringent impedance requirements.

Measurement of amplification systems

Calibration and verification of amplifier gain is assessed by passing a known signal, with amplitude and timing in the expected range of the physiological signals (e.g. 1 µV to 1 mV), through the amplification system. The known signal should pass through the entire system, including any pre-amplifiers incorporated into the electrode connection box or the electrodes themselves. The amplitude of the output should closely resemble that of the input multiplied by the gain and adjusted for the effect of any applied filters. To undertake verification of the amplifiers, the signal generator should be specifically designed and calibrated to ensure accuracy, stability and to avoid damage to the amplifiers. (See “Appendix 4”).

Most manufacturers offer calibration of acquisition systems initially and as part of ongoing maintenance or service contracts. If practitioners do not undertake their own calibrations, they should obtain manufacturer calibrations. ISCEV encourages manufacturers to provide methods for practitioners to perform interim verification of the amplification systems, for example to investigate suspected malfunction. A useful check that all amplifier channels are working similarly is to pass an identical signal through each channel with identical settings. If amplifiers are not performing as expected, or if distortions of signals are observed, the equipment needs to be repaired.

Calibration of amplification and filtering should be performed as specified by the manufacturer based on the known stability of the system. If there are no specified intervals, we recommend a maximum interval of 1 year. The onus is on practitioners to perform or commission adequate calibration of the system.