Keywords

1 Introduction

Frequency-dependent cochlear signal-transduction nonlinearities manifest as suppressive interactions between acoustic-stimulus components (Sachs and Kiang 1968; de Boer and Nuttall 2002; Versteegh and van der Heijden 2012, 2013). The underlying mechanism is thought to be saturation of outer-hair-cell receptor currents (Geisler et al. 1990; Cooper 1996). Despite the importance of suppression for neural coding of complex sounds (e.g., Sachs and Young 1980), relatively little is known about suppression in listeners with cochlear hearing loss (Schmiedt et al. 1990; Miller et al. 1997; Hicks and Bacon 1999). Here we present preliminary data on firing-rate suppression, measured with tones, and with broadband noise, from ANFs in chinchillas (Chinchilla laniger), following noise-induced hearing loss. Quantitative descriptions of suppression in the hearing-impaired auditory periphery will help guide novel amplification strategies, particularly for complex listening situations (e.g., speech in noise), in which current hearing aids struggle to improve speech intelligibility.

2 Methods

2.1 Animal Model

Animal procedures were under anesthesia, approved by Purdue University’s IACUC, and followed NIH-issued guidance. Evoked-potential and DPOAE measures, and noise exposures, were with ketamine (40 mg/kg i.p.) and xylazine (4 mg/kg s.c.). Single-unit neurophysiology was done with sodium pentobarbital (boluses: 5–10 mg/hr. i.v.).

2.1.1 Noise Exposure & Hearing-Loss Characterization

Two groups of chinchillas are included: normal-hearing controls (NH), and hearing-impaired (HI) animals with a stable, permanent, sensorineural hearing loss. Prior to noise exposure, we recorded auditory brainstem responses (ABRs) and distortion-product otoacoustic emissions (DPOAEs) from the HI group, verifying “normal” baseline status.

ABRs were recorded in response to pure-tone bursts, at octave-spaced frequencies between 0.5 and 16 kHz. Threshold was determined according to statistical criteria (Henry et al. 2011). DPOAE stimulus primaries (f 1 and f 2) were presented with equal amplitude (75-dB SPL), and with a constant f 2/f 1 ratio of 1.2. f 2 varied between 0.5 and 12 kHz, in 2-semitone steps.

Animals were exposed to either a 500-Hz-centered octave-band noise at 116-dB SPL for 2 h, or a 2-kHz-centered 50-Hz-wide noise at 114-115-dB SPL for 4 h. They were allowed to recover for 3–4 weeks before an acute single-unit experiment. At surgery, ABR and DPOAE measures were repeated prior to any additional intervention (for NH animals, these were their only ABR and DPOAE measurements).

2.1.2 Single-Unit Neurophysiology

The auditory nerve was approached via a posterior-fossa craniotomy. ANFs were isolated using glass pipettes with 15–25 MΩ impedance. Spike times were recorded with 10-μs resolution. Stimulus presentation and data acquisition were controlled by MATLAB programs interfaced with hardware modules (TDT and National Instruments). For HI animals, ANF characteristic frequency (CF) was determined by the high-side-slope method of Liberman (1984).

2.2 Stimuli

2.2.1 Adaptive-Tracking: Suppression-Threshold Tuning

This two-tone suppression (2TS) adaptive-tracking technique is based on Delgutte (1990). Stimuli were sequences of 60-ms duration supra-threshold tones at CF, with or without a second tone at the suppressor frequency (FS; Fig. 1). For each FS frequency-level combination, 10 repetitions of the two-interval sequence were presented. The decision to increase or decrease sound level was made based on the mean of the 2nd through 10th of these two-interval comparisons. The algorithm sought the lowest sound level of FS which reduced the response to the CF tone by 20 spikes s‑ 1.

Fig. 1
figure 1

Schematized stimulus paradigm: adaptive tracking. (B) Black line, excitatory tuning curve; Red and blue dashed lines, high- and low-side suppression-threshold curves, respectively

2.2.2 Systems-Identification Approach

Using spike-triggered characterization, we probed suppressive influences on firing rate in response to broadband-noise stimulation (e.g., Lewis and van Dijk 2004; Schwartz et al. 2006). Our implementation is based on singular-value decomposition of the second-order Wiener kernel (h 2) in response to 16.5-kHz bandwidth, 15-dB SL noise (Lewis et al. 2002a, 2002b; Recio-Spinoso et al. 2005). We collected ~ 10–20 K spike times per fiber.

2.3 Analyses

2.3.1 Second-Order Wiener Kernels

For each fiber, h 2 was computed from the second-order cross-correlation between the noise-stimulus waveform x(t) and N spike times. The spike-triggered cross correlation was sampled at 50 kHz, and with maximum time lag τ of 10.2 ms (m = 512 points) for CFs > 3 kHz, or 20.4 ms (m = 1024 points) for CFs < 3 kHz. h 2(τ 1, τ 2) is calculated as

$$\frac{{{N}_{0}}}{{{A}^{2}}}\left[ {{R}_{2}}({{\tau }_{1}},{{\tau }_{2}})-{{\phi }_{xx}}({{\tau }_{2}}-{{\tau }_{1}}) \right]$$

where τ 1 and τ 2 are time lags, A the instantaneous noise power, N 0 the mean firing rate in spikes s‑1, and \({{R}_{2}}({{\tau }_{1}},{{\tau }_{2}})\) the second-order reverse-correlation function calculated as

$$\frac{1}{N}\sum\limits_{i=1}^{N}{x({{t}_{i}}-{{\tau }_{1}})x}({{t}_{i}}-{{\tau }_{2}})$$

with t i the i th spike time and \({{\phi }_{xx}}({{\tau }_{2}}-{{\tau }_{1}})\) the stimulus autocorrelation matrix. So computed, h 2 is an m-by-m matrix with units of spikes·s‑1·Pa‑2 (Recio-Spinoso et al. 2005). We used singular-value decomposition to parse h 2 into excitatory (h ) and suppressive (h ) sub-kernels (Lewis et al. 2002a, 2002b; Lewis and van Dijk 2004; Rust et al. 2005; Sneary and Lewis 2007).

2.3.2 Excitatory and Suppressive Sub-Kernels

Using the MATLAB function svd, h 2s were decomposed as

$${{h}_{2}}=\text{USV}$$

where U, S and V are m-by-m matrices. The columns of U and rows of V are the left and right singular vectors, respectively. S is a diagonal matrix, the nonzero values of which are the weights of the corresponding-rank vectors. The decomposition can be rephrased as

$${{h}_{2}}=\sum\limits_{j=1}^{m}{{{k}_{j}}{{u}_{j}}{{v}_{j}}}$$

where \({{u}_{j}}\) and \({{v}_{j}}\) are column vector elements of U and V, respectively, and \({{k}_{j}}\) is the signed weight calculated as

$${{k}_{j}}=sgn\left[ {{u}_{j}}(j){{v}_{j}}(j) \right]{{s}_{j}}$$

where sgn is the signum function, \({{u}_{j}}(j)\) is the j th element of the j th left singular vector, \({{v}_{j}}(j)\) is the j th element of the j th right singular vector, and s j is the j th element of the nonzero diagonal of S.

Positively and negatively weighted vectors are interpreted as excitatory and suppressive influences, respectively (Rust et al. 2005; Schwartz et al. 2006; Sneary and Lewis 2007). However, mechanical suppression, adaptation, and refractory effects may all contribute to what we term “suppressive” influences.

To determine statistical significance of each weighted vector, we re-computed h 2 from 20 different spike:train randomizations, conserving the first-order inter-spike-interval distribution. Based on this bootstrap distribution, weights were expressed as z-scores, and any vector with |z| > 3 and rank ≤ 20 was considered significant. We calculated a normalized excitatory-suppressive ratio as

$${{R}_{\varepsilon ,\sigma }}=\frac{\sum\nolimits_{j=1}^{{{N}_{\varepsilon }}}{\left| {{Z}_{j}}_{\varepsilon } \right|-\sum\nolimits_{j=1}^{{{N}_{\sigma }}}{\left| {{Z}_{j}}_{\sigma } \right|}}}{\sum\nolimits_{j=1}^{{{N}_{\varepsilon }}}{\left| {{Z}_{j}}_{\varepsilon } \right|+\sum\nolimits_{j=1}^{{{N}_{\sigma }}}{\left| {{Z}_{j}}_{\sigma } \right|}}}$$

for N ε significant excitatory vectors and N σ significant suppressive vectors. This normalized ratio varies from 1 (only excitation, no suppression), through 0 (equal excitation and suppression), to ‑1 (only suppression and no excitation: in practice \({{R}_{\varepsilon ,\sigma }}<0\) does not occur).

2.3.3 Spectro-Temporal Receptive Fields

Based on Lewis and van Dijk (2004), and Sneary and Lewis (2007), we estimated the spectro-temporal receptive field (STRF) from h 2. These STRFs indicate the timing of spectral components of the broadband-noise stimulus driving either increases or decreases in spike rate. Moreover, the STRF calculated from the whole h 2 kernel is the sum of excitatory and suppressive influences. Therefore, we also determined STRFs separately from h and h to assess the tuning of excitation and suppression (Sneary and Lewis 2007).

3 Results

3.1 Hearing-Loss Characterization

Noise exposure elevated ABR threshold, and reduced DPOAE magnitude, across the audiogram; indicating a mixed inner- and outer-hair-cell pathology (Fig. 2).

Fig. 2
figure 2

Audiometric characterization. a ABR thresholds. Thin lines, individual-animal data; symbols, within-group least-squares-mean values; shading, S.E.M; green area, noise-exposure band. b Lower plot, DPOAE magnitude; Upper plot, probability of observing a DPOAE above the noise floor; thick lines, within-group least-squares means; shading, S.E.M

3.2 Suppression Threshold

Two-tone-suppression tuning curves were obtained from 62 NH fibers, and 35 HI fibers (Fig. 3a and b). HI excitatory tuning curves show threshold elevation and broadened tuning (Fig. 3c). For NH fibers, high-side 2TS was always observed (Fig. 3a). However, in 6 of 35 HI fibers, we could not detect significant high-side 2TS (Fig. 3b). These fibers have very broadened excitatory tuning (Fig. 3c). In HI fibers with detectable high-side 2TS, the dB difference between excitatory threshold and suppressive threshold was not greater than observed in NH fibers (Fig. 3d). For many fibers high-side-2TS threshold was within 20 dB of on-CF excitatory threshold. Low-side 2TS-threshold estimates are surprisingly low (Fig. 3a and b). All fibers had low-side suppressive regions, regardless of hearing status, typically in the region of 0- to 20-dB SPL.

Fig. 3
figure 3

Two-tone-suppression threshold. a NH: gray, excitatory-threshold tuning curves; red lines (fits) and crosses (data), high-side suppression-threshold tuning curves; blue lines and crosses, low-side suppression-threshold. b, HI data. c Q 10dB. d Solid line, equality; dashed and dotted lines, 20- and 40-dB shifts in suppressive threshold, respectively. bd, Green triangles at 100-dB SPL, CFs of HI fibers with no high-sided suppression

3.3 Wiener-Kernel Estimates of Suppression

Figure 4 shows second-order Wiener-kernel and STRF analyses from the responses of a single medium-spontaneous-rate (< 18 spikes s‑1) ANF, with CF 4.2 kHz, recorded in a NH chinchilla. The h 2 kernel is characterized by a series of parallel diagonal lines representing the non-linear interactions driving the ANF’s response to noise (Fig. 4a; Recio-Spinoso et al. 2005). The STRF derived from h 2 shows suppressive regions flanking the main excitatory region along the frequency axis, with a high-to-low frequency glide (early-to-late) consistent with travelling-wave delay (Fig. 4b). Decomposing h 2 into its excitatory and suppressive sub-kernels, we observe a broad area of suppressive tuning which overlaps with the excitatory tuning in time and frequency (Fig. 4d and f). There is also an “on-CF” component to the suppression, occurring before the excitation.

Fig. 4
figure 4

Second-order Wiener-kernel, excitatory and suppressive sub-kernels, and their spectro-temporal receptive-field equivalents calculated from the responses of a single medium-spontaneous-rate ANF in a NH chinchilla: CF = 4.2 kHz, θ = 20-dB SPL, Q 10=3.6, SR = 6.0 spikes s‑1. 21,161 driven spike times are included in the analysis. Right-hand column, warm colors and solid lines indicate areas of excitation, cool colors and dashed lines areas of suppression, with z-score > 3 w.r.t. the bootstrap distribution

There is substantial variability in the relative contribution of suppression to ANF responses (Fig. 5a), similar to that observed with tone stimuli in the “fractional-response” metric (e.g., Miller et al. 1997). There is a group of fibers (both NH and HI) across the CF axis, which show no significant suppression (\({{R}_{\varepsilon ,\sigma }}=1\); Fig. 5a). These are mainly, but not exclusively, of the high-spontaneous-rate class (≥ 18 spikes s‑1). HI fibers in the region of greatest damage (~ 2–5 kHz) tend to have reduced suppression. Plotting \({{R}_{\varepsilon ,\sigma }}\) vs. CF-normalized 10-dB bandwidth, we find a significant correlation between broadened tuning and loss of suppression (Fig. 5b). Considering only CFs > 2 kHz in this linear regression increased the variance explained from 9.1 to 18.4 %.

Fig. 5
figure 5

Normalized excitatory-suppressive ratio vs. CF (a), and vs. CF-normalized 10-dB bandwidth (b). a, Solid lines, lowess fits to values of \(~{{R}_{\varepsilon ,\sigma }}\,<1\). b, Horizontal axis expresses 10-dB bandwidth in octaves relative to the 95th percentile of NH-chinchilla ANF data (Kale and Heinz 2010). Gray line and text, least-squares linear fit to values of \(~{{R}_{\varepsilon ,\sigma }}\,<1\)

4 Discussion

Using tones and broadband noise, we found significant changes in the pattern of suppression in the responses of ANFs following noise-induced hearing loss, likely reflecting outer-hair-cell disruption. Previous studies have examined the relationship between ANF 2TS and chronic low-level noise exposure coupled with ageing in the Mongolian gerbil (Schmiedt et al. 1990; Schmiedt and Schultz 1992), and following an acute intense noise exposure in the cat (Miller et al. 1997). Ageing in a noisy environment was associated with a loss of 2TS. Schmiedt and colleagues often found complete absence of high-side 2TS, with sparing of low-side 2TS, even in cochlear regions with up to 60 % outer-hair-cell loss; suggesting potentially different mechanisms for low- and high-side suppression. Our 2TS data are in broad agreement with these earlier findings: HI chinchillas had elevated high-side 2TS thresholds, but retained sensitivity to low-side suppressors. Miller et al. (1997) related a reduction in 2TS in ANFs to changes in the representation of voiced vowel sounds. Weakened compressive nonlinearities contributed to a reduction in “synchrony capture” by stimulus harmonics near vowel-formant peaks. These effects likely contribute to deficits in across-CF spatio-temporal coding of temporal-fine-structure information in speech for HI listeners (e.g., Heinz et al. 2010).

Although the 2TS approach provides important insights on cochlear nonlinearities, it is important to consider the effects of hearing impairment on cochlear nonlinearity in relation to broadband sounds. Our Wiener-kernel approach demonstrated reduced suppression in ANF responses from HI animals, which was correlated with a loss of frequency selectivity. These HI animals also exhibited reduced magnitude DPOAEs: additional evidence of reduced compressive nonlinearity. However, the analyses presented here (Fig. 5) only quantify the overall relative suppressive influence on ANF responses. Using the STRFs derived from h 2s, we aim to characterize the timing and frequency tuning of suppression in the HI auditory periphery in response to broadband sounds. Moreover, these same techniques can be exploited to address potential changes in the balance between excitation and inhibition (“central gain change”) in brainstem nuclei following damage to the periphery. Such approaches have previously proved informative across sensory modalities (e.g., Rust et al. 2005), and will likely yield results with tangible translational value.