Abstract
Interaural time difference (ITD) sensitivity with cochlear implant stimulation is remarkably similar to envelope ITD sensitivity using conventional acoustic stimulation. This holds true for human perception, as well as for neural response rates recorded in the inferior colliculus of several mammalian species. We hypothesize that robust excitatory-inhibitory (EI) interaction is the dominant mechanism. Therefore, we connected the same single EI-model neuron to either a model of the normal acoustic auditory periphery or to a model of the electrically stimulated auditory nerve. The model captured most features of the experimentally obtained response properties with electric stimulation, such as the shape of rate-ITD functions, the dependence on stimulation level, and the pulse rate or modulation-frequency dependence. Rate-ITD functions with high-rate, amplitude-modulated electric stimuli were very similar to their acoustic counterparts. Responses obtained with unmodulated electric pulse trains most resembled acoustic filtered clicks. The fairly rapid decline of ITD sensitivity at rates above 300 pulses or cycles per second is correctly simulated by the 3.1-ms time constant of the inhibitory post-synaptic conductance. As the model accounts for these basic properties, it is expected to help in understanding and quantifying the binaural hearing abilities with electric stimulation when integrated in bigger simulation frameworks.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
INTRODUCTION
Bilateral cochlear implant (CI) users are able to coarsely localize sound sources. They are better at this than unilateral CI users, but at the same time much worse than normal-hearing (NH) listeners. With their clinical devices, they can exploit interaural level differences (ILDs), while obtaining little to no benefit from interaural time differences (ITDs) (Seeber and Fastl 2008). However, when stimulating a single left and a single right CI electrode at a low rate, almost all bilateral CI users have demonstrated some degree of ITD sensitivity (for review, Kan and Litovsky 2015; Laback et al. 2015). The median detection thresholds are 144 µs, 12 times larger than those of young and well-trained NH listeners (Laback et al. 2015). In addition, ITD sensitivity is limited to stimulation rates below 300–500 pulses per second (pps) (Ihlefeld et al. 2015; Laback et al. 2007; van Hoesel 2007), in contrast to the NH limit near 1400 Hz for pure tones (Brughera et al. 2013). This moderate level of ITD sensitivity is very similar to the envelope ITD sensitivity of NH listeners (Dietz 2016).
At the level of the auditory nerve (AN), the trend is opposite: phase locking to electric stimulation is generally better than with acoustic stimulation (Dynes and Delgutte 1992; Hartmann et al. 1984; Shepherd and Javel 1997). Some AN fibers can phase lock to electrical pulse trains at rates of 5000 pps (e.g., Miller et al. 2008) or even above 5000 pps (e.g., Dynes and Delgutte 1992). At the level of the inferior colliculus (IC), the picture is less consistent. Some studies report ITD sensitivity with electric stimulation similar to acoustic stimulation (Rosskothen-Kuhl et al. 2021; Vollmer 2018), some report a slightly lower rate limit and less robust tuning (Smith and Delgutte 2007), and some report a low rate limit similar to the perceptual limit (Chung et al. 2016; Hancock et al. 2013). It can be inferred that the temporal information present at the level of the AN cannot be exploited by the brain to the same degree, but experimental challenges have so far prevented deeper insights. Moreover, significant differences between related studies at each stage impose additional obstacles on theory development. A variety of factors, such as species differences, deafening procedure, duration of deafness, implantation, anesthetics, stimulation procedure, neuron search techniques, and selection criteria, and an equally long list of differences in the human psychoacoustic experiments, severely limit across-study comparisons.
Computational simulation of the whole process can improve comparability and deepen our understanding of the functional relations. Specifically, it can quantify the influence of some procedural differences, and it can be instrumental in developing and consolidating hypotheses about the partial loss of temporal information. Here, we propose a simulation framework for both acoustic and electrical stimulations of binaurally sensitive model neurons. It is used to quantify the influence of stimulus types and parameters, and for comparing between acoustic and electrical stimulations.
Previous studies on this topic simulated excitatory-excitatory neurons (e.g., Chung et al. 2015; Colburn et al. 2009). They were able to reproduce peak-type rate-ITD functions (from e.g., Smith and Delgutte 2008) even at high rates such as 1000 pps. However, most human bilateral CI users cannot exploit ITDs at these rates and rather show an ITD sensitivity similar to NH envelope ITD sensitivity, e.g., a limit around 300–500 pulses or cycles per second. Envelope ITD sensitivity is commonly associated with LSO processing (Tollin 2003), which is why Dietz (2016) hypothesized that the LSO pathway may be crucial for bilateral CI users. Therefore, a goal of the present study is to complement the existing functional analysis of EE-type models with LSO-type EI processing and to discuss if this pathway has potential for future studies that model behavioral ITD sensitivity in bilateral CI users.
The general philosophy of the present study was to use “off the shelf” model components with default parameters, rather than fitting parameters to isolated neurons or data sets. The focus was on three main factors: stimulation level, stimulation rate, and stimulation type. The latter describes different amplitude-modulation shapes presented acoustically, unmodulated pulse trains presented electrically, and high-rate pulse trains with sinusoidal amplitude modulation presented electrically. Additionally, as an application example, we varied the inhibitory post-synaptic conduction time constant to demonstrate its influence on the ITD rate limit.
METHODS AND MATERIALS
Modeling
In order to compare the model outputs with acoustic stimulation and with electrical stimulation, the model framework of Klug et al. (2020) was adopted in this study. In a nutshell, it is an EI-model neuron receiving bilateral input from either an acoustic or an electrical model of the auditory periphery. The model code and data for reproducing the results are freely available on Zenodo (Hu et al. 2021).
Model of the Acoustically Stimulated Auditory Nerve
The periphery model of Bruce et al. (2018) was applied in the same fashion as in Klug et al. (2020) for simulations of acoustic hearing. It transforms the acoustic stimuli into spiking patterns of medium-spontaneous-rate (MSR, representative spontaneous rate is 0.5–18 spikes per second) AN fibers arrayed along the tonotopic axis. The input to the model is a stimulus in the form of a pressure waveform. The output of the model is given by a spike generator that produces a series of AN spikes. All simulated AN fibers had a characteristic frequency (CF) equal to the stimulus carrier frequency (CF = 8000 Hz unless otherwise stated). The model parameters were kept unchanged from (Bruce et al. 2018; Zilany et al. 2014, 2009).
Model of the Electrically Stimulated Auditory Nerve
To simulate electrical hearing, the auditory periphery model was substituted by the AN model of Hamacher (2004) in the implementation of Fredelake and Hohmann (2012). This model consists of four stages: cell membrane, membrane noise, refractory period, and latency and jitter. The response probability of the models only depends on two parameters: (1) The membrane voltage after the first pulse phase. It is calculated as a deterministic, leaky, integrate-and-fire model (Gerstner and Kistler 2002) extended with a zero-mean Gaussian noise source (membrane noise) to simulate stochastic behavior of the AN fibers (Hamacher 2004) influenced by the threshold potential, stimulation current, and the first-phase duration. (2) The time difference between the onset of the current pulse and the last action potential. The effective firing time depends on the stimulation time, the discharge time of the membrane capacitance when an action potential is generated, the latency, and the jitter. The mean and standard deviation of the latency and jitter were derived from the data of Miller et al. (1999a; 1999b). All parameters were the same as in (Fredelake and Hohmann 2012; Hamacher 2004). It should be noted that the AN model doesn’t include adaptation and assumes the same threshold level for all AN fiber inputs, which is unlikely to be the case in a real system. Tonotopic considerations and spread of excitation are irrelevant for the purpose of this study. In other words, the 1D model is independent of the location of the stimulated CI-electrode along the basilar membrane and only those AN fibers distributed at the corresponding position were analyzed and passed onto the binaural interaction stage.
Binaural Neuron Model
At the binaural interaction stage, a single EI-model neuron as in Klug et al. (2020), modified from the coincidence-counting LSO model of Ashida et al. (2016), was used. Unless otherwise stated, the default EI-model parameters of Klug et al. (2020) were used. Figure 1 shows the structures of acoustic- (Fig. 1(A)) and electric- (Fig. 1(B)) model diagrams. Briefly, it receives excitatory synaptic inputs from ipsilateral AN fibers and inhibitory inputs from contralateral AN fibers (see Klug et al. 2020 for a more detailed description of the model). In this study, for a specific CF or CI electrode, 20 MSR AN fibers were simulated on the ipsilateral ear to generate excitatory input and 8 on the contralateral ear for generating the inhibitory input to the EI-model. The length of the rectangular excitatory coincidence window and the rectangular inhibitory window (Winh) was 1.1 ms and 3.1 ms, respectively. The response threshold was 3, the inhibitory gain for increasing the weight of inhibitory inputs was 2, and the length of the refractory period was 1.6 ms.
Stimuli
Acoustic Stimuli
To coarsely replicate the AN excitation patterns from CI stimulation in NH listeners, a pulse-mimicking envelope is often multiplied on a high-frequency carrier (Goupell et al. 2013). The AN can safely be assumed to phase-lock only to the pulse-mimicking envelope, generating comparable temporal information at least at the level of the AN and at pulse rates below the auditory-filter bandwidth. For example, past studies have employed Gaussian envelope (GE) tones (Bernstein et al. 2018; Ehlers et al. 2016; Goupell et al. 2013) or filtered clicks (e.g., Baumgärtel et al. 2017; Hu et al. 2017; Majdak and Laback 2009). Such stimuli have a wider spectrum than the sinusoidally amplitude-modulated (SAM) tones and the transposed tones (Goupell et al. 2013) commonly used in NH studies (e.g., Bernstein and Trahiotis 2002, 2003; Blanks et al. 2008; Ehlers et al. 2016; Monaghan et al. 2015). Among these studies, ITD sensitivity has been compared in Goupell et al. (2013) using GE tone pulses and non-Gaussian-shaped pulses, and in Ehlers et al. (2016) using GE tones and transposed tones in the same subjects. However, no study systematically tested more than two stimulus types on the same subjects, and it is still unclear whether ITD sensitivity differs for different CI simulating stimuli and which one may be most similar to actual electrical pulse trains. Thus, in the current study, four types of acoustic stimuli were selected. The duration of each stimulus was set to 1 s, with a 10-ms sin2 gating. In order to obtain the envelope ITD (ITDENV) tuning curves for different modulation frequencies (fm), a range of ITD within [− 4, 4] ms or IPD within one period ([\(-\pi,\pi\)]), with a step size of 0.1 ms, 0.2 ms, or 0.05\(\pi\), was tested at different stimulation levels. Positive ITDs represent contralateral-leading. Consistent with Klug et al. (2020), the ITDs were inserted on the acoustic AN spike trains to reduce computational demand, and responses within the first 200 ms were discarded.
SAM Tones
As one of the most commonly used stimulus classes, high-frequency SAM tones were generated digitally as in Bernstein and Trahiotis (2012) according to Eq. (1):
Unless otherwise stated, the default test carrier frequency was fc = 8000 Hz. Different modulation frequencies fm were tested. Figure 2 shows an example of an SAM tone (top row column 1).
Transposed Tones
Transposed stimuli were designed by van de Par and Kohlrausch (1997) to better mimic low-frequency tonal responses in high-frequency AN fibers. To simulate the functional role of hair cells, a low-frequency base stimulus is half-wave rectified and subsequently low-pass filtered. The output serves as the modulator and is multiplied on a high-frequency carrier. Transposed tones clearly have sharper envelopes than SAM tones and have been extensively used in binaural studies with NH adult listeners (Bernstein and Trahiotis 2002, 2003). Here, the transposed tones were generated as in (Bernstein and Trahiotis 2002), and the default carrier frequency was fc = 8000 Hz, with various modulation frequencies. Figure 2 shows one example of a transposed tone (top row column 2).
Gaussian Envelope Tones
In this study, the band-limited, constant-amplitude Gaussian envelope (GE) tone pulse trains were generated as described in (Bernstein et al. 2018; Goupell et al. 2013). Briefly, they were generated in the time domain by applying a Gaussian-shaped envelope to a tonal carrier at the desired stimulation pulse rate (in pps). Here, GE tones were presented at different pulse rates (which correspond to the fm in SAM and transposed tones) and different levels, with the duration of the individual pulses manipulated to generate an equivalent rectangular bandwidth of 1.5 mm as defined by Greenwood (1990). Figure 2 (top row column 3) shows an example of a GE tone train with an equivalent rectangular bandwidth of 1691 Hz and an equivalent rectangular pulse duration of 0.6 ms.
Filtered Click Trains
Bandpass-filtered click trains, similar to (McKay and Carlyon 1999), make up another class of commonly-used stimuli for simulating CI pulsatile stimulation. When investigating temporal effects in NH listeners, it provides a good approximation of the electrical stimulation in CIs (Carlyon and Deeks 2013; Laback et al. 2007; Majdak and Laback 2009; Majdak et al. 2006; McKay and Carlyon 1999). Although the response of an auditory filter to bandpass-filtered click trains has a clearly reduced envelope sharpness when compared to electrical pulse trains, the resulting envelope is still sharper than for SAM or transposed tones. In this study, unmodulated bandpass-filtered click trains were created similar to (Hu et al. 2017): firstly, rectangular condensation pulses of fixed width (10 µs) were generated at different pulse rates; secondly, band-pass filtering was performed using a 2nd-order constant Q = 0.5 Butterworth filter. Figure 2 (top row column 4) shows an example of a bandpass-filtered click train. Depending on the context, their click rate is also referred to as either pulse rate or modulation frequency.
Electric Stimuli
One of the most commonly used stimuli for ITD sensitivity in both human CI users and in electrically stimulated animals is unmodulated low-rate pulse trains (e.g., Chung et al. 2016; Smith and Delgutte 2007). However, CI processors typically encode the sound envelope in each frequency band by amplitude-modulating (AM) high-rate pulse trains. Smith and Delgutte (2008) investigated the ITD sensitivity of IC neurons using SAM high-rate pulse trains (carrier rate of 1000 or 5000 pps). They found that many IC cells were sensitive to ITD in both the envelope (ITDENV) and the pulse carrier for appropriate modulation frequencies and carrier rates; ITDENV tuning generally improved with increasing modulation frequency up to the maximum tested (≤ 160 Hz) that elicited a sustained response in a neuron; ITD sensitivity to carrier pulses was present in about half the neurons for 1000-pps carriers and was nonexistent at 5000 pps. High-rate stimulation above the AN phase-locking limit has been proposed to improve a CI user’s speech perception (Rubinstein et al. 1999). It has been demonstrated that CI users can exploit ITDs in these envelopes, while being insensitive to the timing of the high-rate carrier pulses (Wilson and Dorman 2008).
In the current study, both unmodulated, low-rate pulse trains (mostly < 1000 pps) and SAM, high-rate pulse trains (1000 pps and 5000 pps) were systematically tested with the electrical model. The focus was on reproducing the stimulus conditions from Chung et al. (2016) and Smith et al. (2007; 2008), but, in the absence of experimental constraints, some additional conditions were simulated.
The unmodulated and the SAM high-rate electrical stimuli were constant-amplitude and amplitude-modulated biphasic pulse trains, respectively (cathodic/anodic, 100-µs phase duration; although the CI AN model is indifferent to polarity). The envelope of the SAM pulse-trains was \(s\left(t\right)= [1-\mathrm{cos}\left(2\pi \it{{f}_{m}t}\right)]/2\), with a fixed carrier rate (1000 pps or 5000 pps). In contrast to Smith and Delgutte (2008), static ITDs, instead of slowly changing ITDs, were applied in all simulations. The pulse rate (unmodulated pulses) or fm (SAM high-rate pulse trains for a fixed carrier rate) was varied parametrically. The upper limit of fm was up to 40% of the respective pulse rate, which is quite high, and means that the actual envelope is not well reconstructed by interpolating the pulses in these conditions. As in the acoustic simulation, an [− 4, 4] ms ITD range or an IPD within one period ([\(-\pi,\pi\)]) was tested at different stimulation levels compared to a reference threshold. The duration of each electrical stimulus was 600 ms, with a 10 ms sin2 gating.
Motivated by Smith and Delgutte (2007, 2008), a 40-pps unmodulated pulse train was used as the “standard” stimulus to obtain a reference threshold of unmodulated pulse trains, and a 40-Hz sinusoidally amplitude-modulated 1000-pps pulse train served as a reference for all SAM pulse trains. More specifically, a reference threshold level was defined as the current of the “standard” stimulus when ipsilateral-only stimulation evoked at least one spike/s (sp/s) in the EI-model neuron. For the 5000 pps SAM, the same threshold current was used as for the 1000 pps condition, as in Smith and Delgutte (2008). For the electrical stimulation, the presentation level is stated in dB re threshold (dB thr), which is 240 µA for all unmodulated, and 130 µA for all SAM pulse trains. The latter is lower due to the higher pulse rate. It should be noted that the unit s−1 was introduced in the results for simplicity when both pps and Hz are referred to in the same figure or context.
Data Analysis
AN Output
For faithfully coding ITD information, the peripheral stage needs to produce sufficient activity (quantified by spike rate) and phase locking. Phase locking refers to the synchronized firing of the simulated AN fibers to the pulse or to the envelope of the high-rate AM stimuli. The degree of phase locking can be quantified by the vector strength (Goldberg and Brown 1969). Each individual spike is represented as a unit vector with angle \({\varphi}_{k}\), corresponding to the spike time within the cycle. The vector strength vs is defined as
with N being the total number of spikes and k indicating the kth spike. The vector strength becomes 1 if all spikes occur at a single phase of the stimulus waveform. Both the peristimulus time histogram (PSTH) and the period histogram were calculated to characterize the AN outputs. The model outputs to electrical pulse trains and acoustic stimuli were compared to the corresponding physiological observations (e.g., Dynes and Delgutte 1992; Javel and Shepherd 2000; Javel and Viemeister 2000; Joris and Yin 1992; Litvak et al. 2001).
EI Output
For illustrating the effect of different factors on the EI-model outputs, the simulated spike rates over a range of values were calculated (e.g., rate-ITD or rate-IPD or rate-level functions). To characterize the extent of ITD or IPD sensitivity from the rate-ITD or rate-IPD curves, typical metrics, such as the signal-to-total variance ratio (Chung et al. 2016; Hancock et al. 2013, 2010) cannot be directly employed, because the EI model is fully deterministic, i.e., the across-trial variance is expected to be unrealistically low. If one would assume Poisson noise, the square roots of the spike rates would have a constant variance, so that the sensitivity index d′ for discriminating between two ITDs would be proportional to the difference of the square roots of the two spike rates. This assumption is not necessarily met, especially under electric stimulation (Javel and Viemeister 2000). Nevertheless, to obtain a useful measure for the rate-ITD function modulation depth, orienting on the Poisson assumption is arguably more realistic than using the almost absent simulated variance: A best-case discrimination metric was chosen, contrasting between best and worst ITD, i.e., \(\sqrt{{r}_{\mathrm{max}}}-\sqrt{{r}_{\mathrm{min}}}\), where rmax and rmin is the maximum and minimum spike rates, respectively. Furthermore, to quantify the similarity between an electric and an acoustic rate-IPD or rate ITD function, R2 values were obtained from various cross-correlations of the simulated spiking rates.
The model outputs in response to electrical pulse trains and acoustic stimuli were compared to the corresponding physiological observations. LSO neurons are sensitive to both ILD and the envelope ITD of AM tones (Joris and Yin 1995), and the acoustically stimulated EI-model outputs were compared to LSO data with corresponding stimuli (e.g., Joris 1996; Joris et al. 1998; Joris and Yin 1995). While there is no corresponding electrically stimulated LSO data as in the EI-model simulations, both the outputs of the electrical models of unmodulated low pulse-rate and modulated high pulse-rate stimuli were compared to IC recordings (e.g., Chung et al. 2016; Hancock et al. 2013; Smith and Delgutte 2007, 2008). However, it should be noted that neurons in the IC have ITD tuning that can resemble those of both the MSO and LSO, and can also exhibit a cross between the two types (McAlpine et al. 1998; Yin and Kuwada 1983).
RESULTS
AN Outputs
Acoustic Stimulation
To compare the acoustic model responses to Klug et al. (2020) and between different acoustic inputs, Fig. 2 shows the four types of acoustic stimuli with fc = 4000 Hz (row 1), the inner hair cell (IHC) receptor potential (row 2), the spike raster plots (row 3), the PSTH, and the period histograms (row 4 and small insets) of AN fibers.
In general, the results of SAM tones are similar to those shown by Klug et al. (2020, Fig. 1, stimulation level of 20 dB SPL). Both PSTHs and the period histograms of the AN outputs show good phase locking to the 128 Hz modulation frequency for all types of stimuli. However, the synchronization, or the vector strength (vs), of filtered clicks and GE tones are higher than the other two in this example, which was expected when comparing the AC/DC ratio.
where A(f) and A(0) are the FFT amplitude of the rectified input stimuli at non-zero and 0 Hz, respectively (related to the vs of the output). Both filtered clicks and GE tones showed a higher AC/DC ratio than the other two stimuli.
Electrical Stimulation.
Figure 3 shows the output of one example electrically stimulated AN model neuron to unmodulated and to SAM high-rate electric pulse trains for various stimulation rates at 3 dB thr. For comparison, the outputs of one acoustically stimulated AN model neuron for the filtered clicks at a level of 30 dB SPL are shown in the bottom row.
In general, the vector strength (Fig. 3(Y)) gradually declined with increasing rate. At 200 and 400 Hz modulation frequency the phase locking to the modulation is worse with 1000 pps compared to 5000 pps, because these envelope cycles are only 5 and 2.5 times the inter-pulse interval at 1000 pps, respectively. A single pulse offset in the response (as seen in the small inset of Fig. 3(J)) results in a considerable difference with respect to the modulation cycle phase and thus to the lower vector strength.
The overall pattern of AN responses to unmodulated pulse trains (Fig. 3(A–G)) was consistent with physiological observations (Javel and Shepherd 2000; Javel and Viemeister 2000). When comparing SAM high-rate pulse trains (Fig. 3(H–K) or (L–Q)) with unmodulated pulse trains (Fig. 3(A–G)) of a rate corresponding to the modulation frequency of SAM stimuli, Fig. 3(Y) reveals that phase locking to unmodulated pulses was slightly higher in most cases (blue circles vs. orange diamonds and yellow triangles).
As expected at such low levels, the firing rate (Fig. 3(X), blue solid line with circles) is lower than the stimulation rate, indicating that not every stimulation cycle triggers a spike (Fredelake and Hohmann, 2012). For the 1000 pps carrier SAM pulses (Fig. 3(H–K)), phase locking exists to both the carrier and the amplitude modulation, causing the complex PSTH and period histogram shapes in Fig. 3. Phase locking to different components of the stimuli is shown in Fig. 4 in more detail as a function of stimulation level.
The stimulus level had a large effect on the neurons’ firing rate, and sometimes on its synchronization. For example, Joris and Yin (1992) showed that the behavior of AN fibers changes with the level of an acoustic AM signal. In our simulations, AN rate-level functions (Fig. 4(A–D)) showed the typical sigmoidal shape. It started to flatten out at 10 dB thr for unmodulated, low-rate pulse trains (Fig. 4(A), e.g., ≤ 100 pps), and near 12–14 dB thr for the other stimuli. The continuous increase for the SAM pulse trains (Fig. 4(B, C)) resulted from the increased firing during the modulation trough, which also caused the gradual decline of fm synchrony with level at low modulation frequencies (Fig. 4(F, G), e.g., ≤ 100 Hz). At 12–14 dB thr, fm synchrony was best at intermediate, and not at the lowest modulation frequencies (Fig. 4(F, G), e.g., between 200 and 400 Hz). The reason is that, at low fm and high levels, the response rate exceeded the modulation frequency, i.e., multiple responses occurred within each modulation cycle. In line with (Dynes and Delgutte 1992; Miller et al. 2008), those simulated auditory nerve fibers, which have been allocated a low jitter, still showed some residual phase locking to electric stimulation up to 5000 pps (Fig. 4(J)), and strong phase locking to 1000-pps carrier pulses (Fig. 4(I)). Response synchrony to filtered clicks (Fig. 4(H)) was similarly level-independent as to unmodulated electric pulses (Fig. 4(E)), except near threshold, when spontaneous activity caused a lower synchrony, especially in the acoustic model (Fig. 4(H), < 30 dB SPL).
Binaural Neuron Simulations
Acoustic Stimulation
The average rate-ITD functions of the EI-model neuron at different levels for the four stimulus types are shown in Fig. 5(A–D). As in Klug et al. (2020, Fig. 3C, left hemisphere), the trough was not at zero ITD, because the inhibition lasted longer than the excitation. The minimum response was reached when the excitation was centered in the longer inhibition (Ashida et al. 2016; Klug et al. 2020). In general, the rate-ITD functions in Fig. 5(A–D) were sharper at higher levels for most stimuli; except for the SAM tones (Fig. 5(A)), the tuning curve became shallower with increasing levels above 30 dB SPL. The steps shown in the rate-ITD functions for the non-SAM stimuli at higher levels (Fig. 5(B–D)) were caused by the simplistic modeling of refractoriness at the EI stage. If the excitation was strong and preceded the inhibition by more than the refractory period (1.6 ms), a second response was generated within the same modulation cycle. This is why the plateau was at a spike rate that equals the modulation frequency. At the highest levels, a second plateau arose at an ITD of about 3.2 ms less than the worst ITD, at a rate twice the modulation frequency.
Figure 5(E–H) shows rate-ITD functions for different modulation frequencies (SAM and transposed tones) or different pulse rates (GE tones and filtered clicks) at a fixed level of 30 dB SPL. This is the level at which SAM tones evoked the highest peak spike rate in the simulated neuron (Fig. 5(A)) and does so in typical LSO neurons (e.g., FIG. 16B in Joris and Yin 1998). The rate dependence shown in Fig. 5(E–H) is also similar to experimental data (Joris and Yin 1998) and is primarily limited by the auditory-filter bandwidth
Electric Stimulation
The Effect of Input Level on the Rate-IPD Functions
Figure 6 shows the rate-IPD (Fig. 6(A–E)), R2-level (Fig. 6(F, G)), and \((\sqrt{r_{\text{max}}}-\sqrt{r_{\text{min}}})\)-level (Fig. 6(H, I)) functions of an electrically and an acoustically stimulated EI-model neurons at different levels. In order to characterize the similarity between two rate-IPD tuning curves, the R2 values were obtained, which is the square of the correlation coefficients between two rate-IPD tuning curves. For the rate-IPD tuning of unmodulated pulse trains at 40 pps, the effect of pulse level on the outputs of the EI-model neuron (Fig. 6(A)) was similar to experimental data from neurons with a trough- or step-type tuning curve (e.g., the neuron in the top-left panel of Figure 5 in Smith and Delgutte 2007). At intermediate levels (Fig. 6(A), e.g., ~ 2 dB thr), when the inhibition starts before and ends after the excitation, inhibition was able to stop almost all responses. At high levels (Fig. 6(A), e.g., > 10 dB thr), when all or almost all AN fibers entrain to the stimulation pulses, inhibition was not sufficient to stop the responses, because 20 excitatory minus 8 double-weighted inhibitory inputs resulted in a membrane potential of + 4, i.e., just enough to surpass the firing threshold of 3. At higher levels (Fig. 6(A), e.g., > 8 dB thr), some firing rates were higher than the pulse rate (40 pps) at larger IPD, because in some rare cases, one pulse triggered a second spike after the refractory period of 1.6 ms, i.e., triggered by the same effect as the plateaus in Fig. 5(B–D).
The differences between the rate-IPD functions of filtered clicks and SAM tones (Fig. 6(D, E), fm = 40 Hz) for different levels are more apparent than the differences between two stimulus types shown in Fig. 5(D, A, fm = 125 Hz). For 40 Hz, the shape of the rate-IPD tuning curve of filtered clicks (Fig. 6(D)) is more similar to the unmodulated electrical pulse trains (Fig. 6(A)) than to the acoustic SAM tones (Fig. 6(E)), except that tuning remains pronounced for filtered clicks at high levels. The extent of similarity between the rate-IPD functions of filtered clicks and the unmodulated pulses, and between the SAM tones and SAM high-rate pulses, were further quantified in Fig. 6(H, I), respectively. In all cases, an acoustic level of 20 dB SPL and an electric level of 4 dB thr resulted in R2 values > 0.75 and generally remain high (> 0.7) for higher levels. Only for unmodulated electric pulse trains, a higher level leads to a reduction in shape similarity, because the modulation trough at IPD = 0 disappears.
Effect of Modulation Frequency or Pulse Rate on IPD ENV Sensitivity
As noted in the sect. “INTRODUCTION”, a goal of this study was to investigate the rate limitation of ITD sensitivity with electrical stimulation. In the present model, it was expected to be limited by the length of the inhibitory window. As soon as the modulation or stimulation period is equal or shorter than this window, the inhibitory input is smeared out across several cycles and prohibits the normal emergence of a rate-ITD dependence in an EI neuron.
Until now, a fixed inhibitory window length (Winh) of 3.1 ms was used as in Klug et al. (2020). Figure 7 now shows the average rate-IPD curves for unmodulated (Fig. 7(A, G, M)), SAM high-rate pulse trains (Fig. 7(B, H, N; C, I, O)), filtered clicks (Fig. 7(D, J, P)), and SAM tones (Fig. 7(E, K, Q; F, L, R) for three different inhibitory window lengths (1st row, 2.1 ms; 2nd row 3.1 ms; 3rd row 4.1 ms) and the corresponding (\(\sqrt{{r}_{\mathrm{max}}}-\sqrt{{r}_{\mathrm{min}}}\))-frequency curves in Fig. 7(S–X). In general, the decreasing of Winh led to a higher fm or pulse-rate limit. The default Winh = 3.1ms corresponded to an upper rate limit of 323 s−1 as shown in Fig. 7(G–L). Up to this limit, IPD sensitivity does not critically depend on the spiking rate. Only at low stimulation levels was there a decline in the (\(\sqrt{{r}_{\mathrm{max}}}-\sqrt{{r}_{\mathrm{min}}}\)) value when lowering the pulse rate or the modulation frequency of electrical stimuli (Fig. 7(S–U)) and filtered clicks (Fig. 7(V)). The effect of Winh was relatively smaller for the 4-kHz carrier acoustic SAM tones (Fig. 7(F, L, R, X)), due to the additional above-mentioned auditory-filter bandwidth limitation. Additionally, for the electric SAM stimuli at 1000 pps, envelope ITD sensitivity was compromised by the reduced envelope sampling accuracy above 200 Hz (Fig. 7(B, H, N, T)), especially if a short inhibitory window did not limit the sensitivity.
So far, the complete IPD cycle was tested, which corresponds to an increasing ITD range with increasing pulse rate or modulation frequency. However, a fixed range of ITDs has commonly been tested experimentally (e.g., Chung et al. 2016; Joris and Yin 1995; Smith and Delgutte 2008). Figure 8 shows average rate-ITD functions and (\(\sqrt{{r}_{\mathrm{max}}}-\sqrt{{r}_{\mathrm{min}}}\))-functions for a commonly tested ITD range of ± 4 ms.
In general, the cyclic behavior of the SAM stimuli was better captured by an IPD metric (Fig. 7), but the more transient filtered clicks and unmodulated pulses revealed the influence of the model’s time constants better on an ITD metric (Fig. 8). For the unmodulated pulse trains, the model simulations were generally consistent with published responses from some IC neurons (e.g., Figure 2D in Chung et al. 2016). In addition, the lower sustained spiking rate in the electric response than in the acoustic response at high pulse rates (e.g., Fig. 7(A, G, M) vs. (D, J, P), IPD = 0, 500 pps) was in line with (Su et al. 2021). As expected, the EI model cannot reproduce peak-type rate-ITD functions, as also commonly observed in some studies (Smith and Delgutte 2007, 2008).
For the SAM high-rate pulse trains (Fig. 7(H, I) and Fig. 8(B, C)), the EI-model outputs were generally consistent with Smith and Delgutte (2008): (1) The shapes of the rate-IPD tuning curves were fairly stable for lower fm (e.g., below 100 Hz), although the peak firing rates varied substantially, consistent with Smith and Delgutte (2008, e.g., their FIG. 4A, from 40 to 160 Hz). Consequently, the rate-ITD functions get sharper with increasing modulation frequency. (2) The rate-IPD functions of 1000 pps SAM pulse trains were modulated by both the modulation frequency and carrier rate, especially for fm ≥ 200 Hz. The reason for such carrier-rate modulated oscillations can be understood when comparing the interaural pulse amplitude difference for each pulse pair, introduced by applying the ITDENV. This can also be an explanation for the restoration of sustained responses to high-rate pulse trains by low-frequency AM, as suggested in Smith and Delgutte (2008, e.g., their FIG. 12). (3) At low modulation frequencies (fm ≤ 100 Hz), the EI outputs are nearly identical for both carrier frequencies, in line with Smith and Delgutte (2008, e.g., their FIG. 4D and FIG. 5A,B). Regarding the upper fm limit, Smith and Delgutte (2008, e.g., their FIG. 5B) showed ITDENV sensitivity up to the highest tested fm of 160 Hz. Previous studies suggested that there is a modulation frequency limit near 200–300 Hz (Snyder et al. 1995, 2000), similar to the rate limit with constant-amplitude pulse trains. Although using a higher carrier rate increased the upper limit of fm that could be delivered in SAM high-rate pulse trains, due to the inhibitory window length 3.1 ms, the cutoff modulation frequency for all stimuli was close to 320 Hz.
For the acoustically stimulated EI-model outputs, the fm-dependence of the rate-ITD functions was very similar to those recorded in the LSO (e.g., Figure 11 in Joris and Yin 1995, ~ 200/300 Hz for 5/12.4 kHz CF). In addition to the limitation from the inhibitory window, ITD sensitivity is fm-limited by the width of the auditory filter (see also Fig. 7(W and X)).
GENERAL DISCUSSION
In this study, the response of an EI-model neuron with acoustic or electric stimulation was compared to LSO and IC recordings. In general, the EI-model is able to reproduce most characteristics of acoustically stimulated LSO neurons (Joris 1996; Joris and Yin 1998) and of step-type or trough-type IC neurons in the case of both electrical (e.g., Chung et al. 2016; Smith and Delgutte 2007, 2008) and acoustic stimulations (e.g., Dietz et al. 2016; Greenberg et al. 2017; Griffin et al. 2005). That said, the majority of reported rate-ITD functions to unmodulated electrical pulse trains do not match in shape those reported here. In part, this is simply due to ours showing a larger ITD range beyond the ± 2 ms that is typically applied experimentally. A clear mismatch remains for the narrowly tuned, peak-type responses, prominent in Smith and Delgutte (2007; 2008) and Vollmer (2018). Those response patterns can be expected to originate from fast excitatory-excitatory (EE) type interaction, as commonly associated with MSO processing, and can be simulated with existing models (Chung et al. 2015; Colburn et al. 2009). These models, in turn, cannot simulate the EI-type patterns.
The peak-type neurons are also ITD sensitive at high pulse rates, such as 1000 pps (e.g., Smith and Delgutte), as a consequence of the short excitatory postsynaptic potentials (EPSPs) of MSO inputs, and are markedly different to our simulations or to typical step-type or trough-type neurons (Chung et al., 2016). On the other hand, the simulated pulse-rate limitation of ITD sensitivity near 320 pps is markedly similar to the limit observed in step-type or trough-type IC neurons under electrical- (Chung et al. 2016) or acoustic stimulation (Greenberg et al. 2017; Griffin et al. 2005). This limit is also comparable to the ITD sensitivity limit of human bilateral CI users (e.g., Ihlefeld et al. 2015) and to the envelope ITD sensitivity limit of normal-hearing humans (Bernstein and Trahiotis 2014). As previously hypothesized (Dietz 2016; Kelvasa and Dietz 2015), all of this hints at a dominant role of EI interaction in binaural processing under electrical stimulation, in line with the larger negative binaural interaction component in bilaterally implanted CI users (Brown et al. 2019; Gordon et al. 2012; He et al. 2012; Hu and Dietz 2015). Future studies need to examine whether such a single pathway model, which can also simulate ILD sensitivity within the very same model neuron (Klug et al. 2020), is fully sufficient to model binaural perception of an average bilateral CI user. Currently, this appears to be possible. Individual rate-limit differences in the 250–500 s−1 range can be simulated with different inhibition time constants or, more biophysically, with different synaptic properties (Brughera et al. 2021; Dietz et al. 2016). Only for the occasional sensitivity well above 500 pps (e.g., van Hoesel et al. 2009) is an additional MSO model branch expected to be essential. However, no satisfactory answer can be given as to why EE processing may not be accessible to electrical stimulation in many bilaterally implanted humans. The idea is only loosely based on descriptions of MSO and LSO circuits, hinting at different degrees of robustness: the ITD tuning of fast, highly leaky, MSO neurons have already developed in the juvenile system by optimizing myelination patterns (Stange-Marten et al. 2017) and an accurately tailored expression of inhibitory inputs (Beiderbeck et al. 2018; Brand et al. 2002; Grothe and Pecka 2014). We speculate that any form of input changes may corrupt this type of ITD sensitivity. On the other hand, the simple EI mechanism can be expected to be robust against input variations (Joris and Yin 1995; Tsai et al. 2010) and is also the reason for CI users’ fair sensitivity to ILD (Hancock et al. 2010). While the EI-model output does critically depend on the number of input fibers, these differences can be compensated for, by adapting other model parameters such as the response threshold (Klug et al. 2020).
Two aspects that are not in line with physiology, or potentially subject to controversy, are the absence of a cochlear nucleus and some other processing stages, as well as the rectangular-shaped 3.1-ms inhibition window. This nominal length is much larger than commonly reported from LSO neurons (Sanes 1990). In part, this is due to the different definition. The equivalent rectangular duration is usually larger by a factor of 2–3 than time constants from an integration window function. For the present simplistic EI model, the difference between the inhibition time constant and the excitation time constant is also relevant and determines the duration of troughs in some of the rate-ITD functions to be just over 2 ms (Fig. 8(A)).
In any case, we do not want to make anatomical claims about where the EI interaction happens. The LSO is one possible location, but de novo EI interaction at the level of the IC is also an option. In the absence of physiological recordings to electric stimulation from the brainstem, this question cannot be answered. The rectangular window shape, however, is a simplification that can be problematic for the highly phase-locked electrical input. For a more detailed study of the course of the ITD-sensitivity decline with increasing electrical pulse rate, a more realistic window shape would be required.
In addition to the ITD rate limit, the effect of input level on the rate-ITD or rate-IPD functions was also very similar to experimental data of electrically stimulated IC neurons (e.g., Fig. 5, trough- or step-type, Smith and Delgutte 2007) or acoustic stimulated LSO neurons (e.g., FIG. 16B in Joris and Yin 1998). In general, there are increased response rates and sharpening ITD tuning curves with increasing level before saturation effects reduce tuning at high levels. Some discrepancy remains to the generally increasing behavioral ITD sensitivity with increasing level (e.g., Egger et al. 2016), but a similar discrepancy is known from acoustic hearing (e.g., Dietz et al. 2013). In order to better relate the results to psychophysical abilities of bilateral CI users, the model has to be combined with suitable back-ends for discrimination or lateralization tasks. As with acoustic stimulation (Klug et al. 2020), the model is expected to be generally able to account for both ILD- and ITD-based lateralizations with the same EI-model units. For example, an ongoing follow-up project is to connect the current EI-outputs with the most simplistic rate-difference decoding back-end used in (Klug et al., 2020) to predict different lateralization and localization data reported in previous bilateral CI experiments (e.g., Egger et al. 2016; Ihlefeld et al. 2015; Kan et al. 2016; Laback et al. 2015, 2004; Stakhovskaya and Goupell 2017). It may be prudent, or even necessary, to model a range of EI neurons across the tonotopic array. Such a population is expected to cause a more gradual change of lateralization with increasing ITD (e.g., Baumgärtel et al. 2017), despite the step-wise rate-ITD tuning functions observed both experimentally and in simulated neurons. It is also expected to account for the abovementioned discrepancy in level dependence.
The EI-model stage can also be embedded into model frameworks that include CI signal processing, or at least multi-electrode stimulation (e.g., Kelvasa and Dietz 2015; Stakhovskaya and Goupell 2017; Todd et al. 2016). Simulating the consequences of, e.g., coding strategy or interaural mismatches in insertion depth, neural health, or fitting parameters on localization is a possible application. Compared to the Hodgkin-Huxley-based MSO or LSO models of previous studies (e.g., Chung et al. 2015; Kelvasa and Dietz 2015), it offers the advantage of computational speed and more intuitive parametrization at the cost of physiological detail, such as the rectangular window shape.
References
Ashida G, Kretzberg J, Tollin DJ (2016) Roles for coincidence detection in coding amplitude-modulated sounds. PLoS Comput Biol 12:e1004997. https://doi.org/10.1371/journal.pcbi.1004997
Baumgärtel RM, Hu H, Kollmeier B, Dietz M (2017) Extent of lateralization at large interaural time differences in simulated electric hearing and bilateral cochlear implant users. J Acoust Soc Am 141:2338–2352. https://doi.org/10.1121/1.4979114
Beiderbeck B, Myoga MH, Müller NIC, Callan AR, Friauf E, Grothe B, Pecka M (2018) Precisely timed inhibition facilitates action potential firing for spatial coding in the auditory brainstem. Nat Commun 9:1771–1771. https://doi.org/10.1038/s41467-018-04210-y
Bernstein JGW, Stakhovskaya OA, Schuchman GI, Jensen KK, Goupell MJ (2018) Interaural time-difference discrimination as a measure of place of stimulation for cochlear-implant users with single-sided deafness Trends in Hearing 22:2331216518765514 https://doi.org/10.1177/2331216518765514
Bernstein LR, Trahiotis C (2002) Enhancing sensitivity to interaural delays at high frequencies by using “transposed stimuli.” J Acoust Soc Am 112:1026–1036. https://doi.org/10.1121/1.1497620
Bernstein LR, Trahiotis C (2003) Enhancing interaural-delay-based extents of laterality at high frequencies by using “transposed stimuli.” J Acoust Soc Am 113:3335–3347. https://doi.org/10.1121/1.1570431
Bernstein LR, Trahiotis C (2012) Lateralization produced by interaural temporal and intensitive disparities of high-frequency, raised-sine stimuli: Data and modeling. J Acoust Soc Am 131:409–415. https://doi.org/10.1121/1.3662056
Bernstein LR, Trahiotis C (2014) Sensitivity to envelope-based interaural delays at high frequencies: Center frequency affects the envelope rate-limitation. J Acoust Soc Am 135:808–816. https://doi.org/10.1121/1.4861251
Blanks DA, Buss E, Grose JH, Fitzpatrick DC, Hall JW 3rd (2008) Interaural time discrimination of envelopes carried on high-frequency tones as a function of level and interaural carrier mismatch. Ear Hear 29:674–683. https://doi.org/10.1097/AUD.0b013e3181775e03
Brand A, Behrend O, Marquardt T, McAlpine D, Grothe B (2002) Precise inhibition is essential for microsecond interaural time difference coding. Nature 417:543–547. https://doi.org/10.1038/417543a
Brown AD, Anbuhl KL, Gilmer JI, Tollin DJ (2019) Between-ear sound frequency disparity modulates a brain stem biomarker of binaural hearing. J Neurophysiol 122:1110–1122. https://doi.org/10.1152/jn.00057.2019
Bruce IC, Erfani Y, Zilany MSA (2018) A phenomenological model of the synapse between the inner hair cell and auditory nerve: Implications of limited neurotransmitter release sites. Hear Res 360:40–54. https://doi.org/10.1016/j.heares.2017.12.016
Brughera A, Dunai L, Hartmann WM (2013) Human interaural time difference thresholds for sine tones: The high-frequency limit. J Acoust Soc Am 133:2839–2855. https://doi.org/10.1121/1.4795778
Brughera A, Mikiel-Hunter J, Dietz M, McAlpine D (2021) Auditory brainstem models: adapting cochlear nuclei improve spatial encoding by the medial superior olive in reverberation. J Assoc Res Otolaryngol 22:289–318. https://doi.org/10.1007/s10162-021-00797-0
Carlyon RP, Deeks JM Relationships between auditory nerve activity and temporal pitch perception in cochlear implant users. In: Moore BCJ, Patterson RD, Winter IM, Carlyon RP, Gockel HE (eds) Basic Aspects of Hearing, New York, NY, 2013. Springer New York, pp 363–371
Chung Y, Delgutte B, Colburn HS (2015) Modeling binaural responses in the auditory brainstem to electric stimulation of the auditory nerve. J Assoc Res Otolaryngol 16:135–158. https://doi.org/10.1007/s10162-014-0492-6
Chung Y, Hancock KE, Delgutte B (2016) Neural coding of interaural time differences with bilateral cochlear implants in unanesthetized rabbits. J Neurosci 36:5520–5531. https://doi.org/10.1523/jneurosci.3795-15.2016
Colburn HS, Chung Y, Zhou Y, Brughera A (2009) Models of brainstem responses to bilateral electrical stimulation. J Assoc Res Otolaryngol 10:91–110. https://doi.org/10.1007/s10162-008-0141-z
Dietz M (2016) Models of the electrically stimulated binaural system: A review. Network: Computation in Neural Systems 27:186–211. https://doi.org/10.1080/0954898x.2016.1219411
Dietz M, Bernstein LR, Trahiotis C, Ewert SD, Hohmann V (2013) The effect of overall level on sensitivity to interaural differences of time and level at high frequencies. J Acoust Soc Am 134:494–502. https://doi.org/10.1121/1.4807827
Dietz M, Wang L, Greenberg D, McAlpine D (2016) Sensitivity to interaural time differences conveyed in the stimulus envelope: estimating inputs of binaural neurons through the temporal analysis of spike trains. J Assoc Res Otolaryngol 17:313–330. https://doi.org/10.1007/s10162-016-0573-9
Dynes SBC, Delgutte B (1992) Phase-locking of auditory-nerve discharges to sinusoidal electric stimulation of the cochlea. Hear Res 58:79–90. https://doi.org/10.1016/0378-5955(92)90011-B
Egger K, Majdak P, Laback B (2016) Channel interaction and current level affect across-electrode integration of interaural time differences in bilateral cochlear-implant listeners. J Assoc Res Otolaryngol 17:55–67. https://doi.org/10.1007/s10162-015-0542-8
Ehlers E, Kan A, Winn MB, Stoelb C, Litovsky RY (2016) Binaural hearing in children using gaussian enveloped and transposed tones. J Acoust Soc Am 139:1724–1724. https://doi.org/10.1121/1.4945588
Fredelake S, Hohmann V (2012) Factors affecting predicted speech intelligibility with cochlear implants in an auditory model for electrical stimulation. Hear Res 287:76–90. https://doi.org/10.1016/j.heares.2012.03.005
Gerstner W, Kistler WM (2002) Spiking neuron models: single neurons, populations, plasticity. Cambridge University Press, Cambridge
Goldberg JM, Brown PB (1969) Response of binaural neurons of dog superior olivary complex to dichotic tonal stimuli: some physiological mechanisms of sound localization. J Neurophysiol 32:613–636. https://doi.org/10.1152/jn.1969.32.4.613
Gordon KA, Salloum C, Toor GS, van Hoesel R, Papsin BC (2012) Binaural interactions develop in the auditory brainstem of children who are deaf: effects of place and level of bilateral electrical stimulation. J Neurosci 32:4212–4223
Goupell MJ, Stoelb C, Kan A, Litovsky RY (2013) Effect of mismatched place-of-stimulation on the salience of binaural cues in conditions that simulate bilateral cochlear-implant listening. J Acoust Soc Am 133:2272–2287. https://doi.org/10.1121/1.4792936
Greenwood DD (1990) A cochlear frequency‐position function for several species—29 years later. J Acoust Soc Am 87:2592–2605. https://doi.org/10.1121/1.399052
Greenberg D, Monaghan JJM, Dietz M, Marquardt T, McAlpine D (2017) Influence of envelope waveform on itd sensitivity of neurons in the auditory midbrain. J Neurophysiol 118:2358–2370. https://doi.org/10.1152/jn.01048.2015
Griffin SJ, Bernstein LR, Ingham NJ, McAlpine D (2005) Neural sensitivity to interaural envelope delays in the inferior colliculus of the guinea pig. J Neurophysiol 93:3463–3478. https://doi.org/10.1152/jn.00794.2004
Grothe B, Pecka M (2014) The natural history of sound localization in mammals – a story of neuronal inhibition.Fronts Neural Circuits 8 https://doi.org/10.3389/fncir.2014.00116
Hamacher V (2004) Signalverarbeitungsmodelle des elektrisch stimulierten gehörs. Ph.D. Thesis, RWTH Aachen
Hancock KE, Chung Y, Delgutte B (2013) Congenital and prolonged adult-onset deafness cause distinct degradations in neural itd coding with bilateral cochlear implants. J Assoc Res Otolaryngol 14:393–411. https://doi.org/10.1007/s10162-013-0380-5
Hancock KE, Noel V, Ryugo DK, Delgutte B (2010) Neural coding of interaural time differences with bilateral cochlear implants: Effects of congenital deafness. J Neurosci 30:14068–14079. https://doi.org/10.1523/jneurosci.3213-10.2010
Hartmann R, Topp G, Klinke R (1984) Discharge patterns of cat primary auditory fibers with electrical stimulation of the cochlea. Hear Res 13:47–62. https://doi.org/10.1016/0378-5955(84)90094-7
He S, Brown CJ, Abbas PJ (2012) Preliminary results of the relationship between the binaural interaction component of the electrically evoked auditory brainstem response and interaural pitch comparisons in bilateral cochlear implant recipients. Ear Hear 33:57–68. https://doi.org/10.1097/AUD.0b013e31822519ef
Hu H, Dietz M (2015) Comparison of interaural electrode pairing methods for bilateral cochlear implants. Trends Hear 19. https://doi.org/10.1177/2331216515617143
Hu H, Ewert SD, McAlpine D, Dietz M (2017) Differences in the temporal course of interaural time difference sensitivity between acoustic and electric hearing in amplitude modulated stimuli. J Acoust Soc Am 141:1862–1873. https://doi.org/10.1121/1.4977014
Hu H, Klug J, Dietz M (2021) Simulation of ITD-dependent single-neuron responses under electrical stimulation and with amplitude-modulated acoustic stimuli (version 1.0.2). Zenodo. https://doi.org/10.5281/zenodo.5571858
Ihlefeld A, Carlyon RP, Kan A, Churchill TH, Litovsky RY (2015) Limitations on monaural and binaural temporal processing in bilateral cochlear implant listeners. J Assoc Res Otolaryngol 16:641–652. https://doi.org/10.1007/s10162-015-0527-7
Javel E, Shepherd RK (2000) Electrical stimulation of the auditory nerve: III. Response initiation sites and temporal fine structure. Hear Res 140:45–76. https://doi.org/10.1016/S0378-5955(99)00186-0
Javel E, Viemeister NF (2000) Stochastic properties of cat auditory nerve responses to electric and acoustic stimuli and application to intensity discrimination. J Acoust Soc Am 107:908–921. https://doi.org/10.1121/1.428269
Joris PX (1996) Envelope coding in the lateral superior olive. Ii. Characteristic delays and comparison with responses in the medial superior olive. J Neurophysiol 76:2137–2156. https://doi.org/10.1152/jn.1996.76.4.2137
Joris PX, Smith PH, Yin TCT (1998) Coincidence detection in the auditory system: 50 years after jeffress. Neuron 21:1235–1238. https://doi.org/10.1016/s0896-6273(00)80643-1
Joris PX, Yin TC (1995) Envelope coding in the lateral superior olive. I. Sensitivity to interaural time differences. J Neurophysiol 73:1043–1062. https://doi.org/10.1152/jn.1995.73.3.1043
Joris PX, Yin TCT (1992) Responses to amplitude-modulated tones in the auditory nerve of the cat. J Acoust Soc Am 91:215–232. https://doi.org/10.1121/1.402757
Joris PX, Yin TCT (1998) Envelope coding in the lateral superior olive. Iii. Comparison with Afferent Pathways J Neurophysiol 79:253–269. https://doi.org/10.1152/jn.1998.79.1.253
Kan A, Jones HG, Litovsky RY (2016) Lateralization of interaural timing differences with multi-electrode stimulation in bilateral cochlear-implant users. J Acoust Soc Am 140:EL392–EL398. https://doi.org/10.1121/1.4967014
Kan A, Litovsky RY (2015) Binaural hearing with electrical stimulation. Hear Res 322:127–137. https://doi.org/10.1016/j.heares.2014.08.005
Kelvasa D, Dietz M (2015) Auditory model-based sound direction estimation with bilateral cochlear implants. Trends in Hearing 19:2331216515616378. https://doi.org/10.1177/2331216515616378
Klug J, Schmors L, Ashida G, Dietz M (2020) Neural rate difference model can account for lateralization of high-frequency stimuli. J Acoust Soc Am 148:678–691. https://doi.org/10.1121/10.0001602
Laback B, Egger K, Majdak P (2015) Perception and coding of interaural time differences with bilateral cochlear implants. Hear Res 322:138–150. https://doi.org/10.1016/j.heares.2014.10.004
Laback B, Majdak P, Baumgartner W-D (2007) Lateralization discrimination of interaural time delays in four-pulse sequences in electric and acoustic hearing. J Acoust Soc Am 121:2182–2191. https://doi.org/10.1121/1.2642280
Laback B, Pok S-M, Baumgartner W-D, Deutsch WA, Schmid K (2004) Sensitivity to interaural level and envelope time differences of two bilateral cochlear implant listeners using clinical sound processors. Ear Hear 25:488–500. https://doi.org/10.1097/01.aud.0000145124.85517.e8
Litvak L, Delgutte B, Eddington D (2001) Auditory nerve fiber responses to electric stimulation: modulated and unmodulated pulse trains. J Acoust Soc Am 110:368–379. https://doi.org/10.1121/1.1375140
Majdak P, Laback B (2009) Effects of center frequency and rate on the sensitivity to interaural delay in high-frequency click trains. J Acoust Soc Am 125:3903–3913. https://doi.org/10.1121/1.3120413
Majdak P, Laback B, Baumgartner W-D (2006) Effects of interaural time differences in fine structure and envelope on lateral discrimination in electric hearing. J Acoust Soc Am 120:2190–2201. https://doi.org/10.1121/1.2258390
McAlpine D, Jiang D, Shackleton TM, Palmer AR (1998) Convergent input from brainstem coincidence detectors onto delay-sensitive neurons in the inferior colliculus. J Neurosci 18:6026-39. https://doi.org/10.1523/jneurosci.18-15-06026.1998
McKay CM, Carlyon RP (1999) Dual temporal pitch percepts from acoustic and electric amplitude-modulated pulse trains. J Acoust Soc Am 105:347–357. https://doi.org/10.1121/1.424553
Miller CA, Abbas PJ, Robinson BK, Rubinstein JT, Matsuoka AJ (1999a) Electrically evoked single-fiber action potentials from cat: responses to monopolar, monophasic stimulation. Hear Res 130:197–218. https://doi.org/10.1016/S0378-5955(99)00012-X
Miller CA, Abbas PJ, Rubinstein JT (1999b) An empirically based model of the electrically evoked compound action potential. Hear Res 135:1–18. https://doi.org/10.1016/S0378-5955(99)00081-7
Miller CA, Hu N, Zhang F, Robinson BK, Abbas PJ (2008) Changes across time in the temporal responses of auditory nerve fibers stimulated by electric pulse trains. J Assoc Res Otolaryngol 9:122–137. https://doi.org/10.1007/s10162-007-0108-5
Monaghan JJM, Bleeck S, McAlpine D (2015) Sensitivity to envelope interaural time differences at high modulation rates. Trends in Hearing 19:2331216515619331. https://doi.org/10.1177/2331216515619331
Rosskothen-Kuhl N, Buck AN, Li K, Schnupp JWH (2021) Microsecond interaural time difference discrimination restored by cochlear implants after neonatal deafness. eLife 10:e59300. https://doi.org/10.7554/eLife.59300
Rubinstein JT, Wilson BS, Finley CC, Abbas PJ (1999) Pseudospontaneous activity: stochastic independence of auditory nerve fibers with electrical stimulation. Hear Res 127:108–118. https://doi.org/10.1016/S0378-5955(98)00185-3
Sanes DH (1990) An in vitro analysis of sound localization mechanisms in the gerbil lateral superior olive. J Neurosci 10:3494-506. https://doi.org/10.1523/jneurosci.10-11-03494.1990
Seeber BU, Fastl H (2008) Localization cues with bilateral cochlear implants. J Acoust Soc Am 123:1030–1042. https://doi.org/10.1121/1.2821965
Shepherd RK, Javel E (1997) Electrical stimulation of the auditory nerve. I. Correlation of physiological responses with cochlear status. Hear Res 108:112–144. https://doi.org/10.1016/S0378-5955(97)00046-4
Smith ZM, Delgutte B (2007) Sensitivity to interaural time differences in the inferior colliculus with bilateral cochlear implants. J Neurosci 27:6740–6750. https://doi.org/10.1523/jneurosci.0052-07.2007
Smith ZM, Delgutte B (2008) Sensitivity of inferior colliculus neurons to interaural time differences in the envelope versus the fine structure with bilateral cochlear implants. J Neurophysiol 99:2390–2407. https://doi.org/10.1152/jn.00751.2007
Snyder R, Leake P, Rebscher S, Beitel R (1995) Temporal resolution of neurons in cat inferior colliculus to intracochlear electrical stimulation: effects of neonatal deafening and chronic stimulation. J Neurophysiol 73:449–467. https://doi.org/10.1152/jn.1995.73.2.449
Snyder RL, Vollmer M, Moore CM, Rebscher SJ, Leake PA, Beitel RE (2000) Responses of inferior colliculus neurons to amplitude-modulated intracochlear electrical pulses in deaf cats. J Neurophysiol 84:166–183. https://doi.org/10.1152/jn.2000.84.1.166
Stakhovskaya OA, Goupell MJ (2017) Lateralization of interaural level differences with multiple electrode stimulation in bilateral cochlear-implant listeners. Ear Hear 38:e22–e38. https://doi.org/10.1097/aud.0000000000000360
Stange-Marten A et al (2017) Input timing for spatial processing is precisely tuned via constant synaptic delays and myelination patterns in the auditory brainstem. Proc Natl Acad Sci 14:E4851–E4858. https://doi.org/10.1073/pnas.1702290114
Su Y, Chung Y, Goodman DFM, Hancock KE, Delgutte B (2021) Rate and temporal coding of regular and irregular pulse trains in auditory midbrain of normal-hearing and cochlear-implanted rabbits. J Assoc Res Otolaryngol 22:319–347. https://doi.org/10.1007/s10162-021-00792-5
Todd AE, Goupell MJ, Litovsky RY (2016) Binaural release from masking with single- and multi-electrode stimulation in children with cochlear implants. J Acoust Soc Am 140:59–59. https://doi.org/10.1121/1.4954717
Tollin DJ (2003) The lateral superior olive: a functional role in sound source localization. Neuroscientist 9:127–143. https://doi.org/10.1177/1073858403252228
Tsai JJ, Koka K, Tollin DJ (2010) Varying overall sound intensity to the two ears impacts interaural level difference discrimination thresholds by single neurons in the lateral superior olive. J Neurophysiol 103:875–886. https://doi.org/10.1152/jn.00911.2009
van de Par S, Kohlrausch A (1997) A new approach to comparing binaural masking level differences at low and high frequencies. J Acoust Soc Am 101:1671–1680. https://doi.org/10.1121/1.418151
van Hoesel RJM (2007) Sensitivity to binaural timing in bilateral cochlear implant users. J Acoust Soc Am 121:2192–2206. https://doi.org/10.1121/1.2537300
van Hoesel RJM, Jones GL, Litovsky RY (2009) Interaural time-delay sensitivity in bilateral cochlear implant users: effects of pulse rate, modulation rate, and place of stimulation. J Assoc Res Otolaryngol 10:557–567. https://doi.org/10.1007/s10162-009-0175-x
Vollmer M (2018) Neural processing of acoustic and electric interaural time differences in normal-hearing gerbils. J Neurosci 38:6949–6966. https://doi.org/10.1523/jneurosci.3328-17.2018
Wilson BS, Dorman MF (2008) Cochlear implants: a remarkable past and a brilliant future. Hear Res 242:3–21. https://doi.org/10.1016/j.heares.2008.06.005
Yin TC, Kuwada S (1983) Binaural interaction in low-frequency neurons in inferior colliculus of the cat. Ii. Effects of changing rate and direction of interaural phase. J Neurophysiol 50:1000–1019. https://doi.org/10.1152/jn.1983.50.4.1000
Zilany MSA, Bruce IC, Carney LH (2014) Updated parameters and expanded simulation options for a model of the auditory periphery. J Acoust Soc Am 135:283–286. https://doi.org/10.1121/1.4837815
Zilany MSA, Bruce IC, Nelson PC, Carney LH (2009) A phenomenological model of the synapse between the inner hair cell and auditory nerve: long-term adaptation with power-law dynamics. J Acoust Soc Am 126:2390–2412. https://doi.org/10.1121/1.3238250
Acknowledgements
We thank Dr. Joerg Encke and Dr. Go Ashida for their helpful discussions. We also thank Dr. Paul B. Manis, Dr. Lina Reiss, and three anonymous reviewers for their helpful comments.
Funding
Open Access funding enabled and organized by Projekt DEAL. This work was supported by the European Research Council (ERC) under the European Union’s Horizon 2020 Research and Innovation Program grant agreement no. 716800 (ERC Starting Grant to Mathias Dietz).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interest
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Hu, H., Klug, J. & Dietz, M. Simulation of ITD-Dependent Single-Neuron Responses Under Electrical Stimulation and with Amplitude-Modulated Acoustic Stimuli. JARO 23, 535–550 (2022). https://doi.org/10.1007/s10162-021-00823-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10162-021-00823-1