Auditory Nerve Frequency Tuning Measured with Forward-Masked Compound Action Potentials

Verschooten, Eric; Robles, Luis; Kovačić, Damir; Joris, Philip X.

doi:10.1007/s10162-012-0346-z

Auditory Nerve Frequency Tuning Measured with Forward-Masked Compound Action Potentials

Research Article
Published: 05 September 2012

Volume 13, pages 799–817, (2012)
Cite this article

Download PDF

Journal of the Association for Research in Otolaryngology Aims and scope Submit manuscript

Auditory Nerve Frequency Tuning Measured with Forward-Masked Compound Action Potentials

Download PDF

Eric Verschooten¹,
Luis Robles^1,2,
Damir Kovačić¹ &
…
Philip X. Joris¹

1666 Accesses
16 Citations
Explore all metrics

Abstract

Frequency selectivity is a fundamental cochlear property. Recent studies using otoacoustic emissions and psychophysical forward masking suggest that frequency selectivity is sharper in human than in common laboratory species. This has been disputed based on reports using compound action potentials (CAPs), which reflect activity in the auditory nerve and can be measured in humans. Comparative data of CAPs, obtained with a variety of simultaneous masking protocols, have been interpreted to indicate similarity of frequency tuning across mammals and even birds. Unfortunately, there are several issues with the available CAP measurements which hamper a straightforward comparison across species. We investigate sharpness of CAP tuning in cat and chinchilla using a forward masking notched-noise paradigm—which is less confounded by cochlear nonlinearities than simultaneous masking paradigms and similar to what was used in the psychophysical study reporting sharper tuning in humans. Our parametric study, using different probe frequencies and notch widths, shows relationships consistent with those of auditory nerve fibers (ANFs). The sharpness of tuning, quantified by Q ₁₀ factors, is negatively correlated with probe level and increases with probe frequency, but the Q ₁₀ values are generally lower than the average trend for ANFs. Like the single fiber data, tuning for CAPs is sharper in cat than in chinchilla, but the two species are similar in the dependence of tuning on probe frequency and in the relationship between tuning in ANFs and CAP. Growth-of-maskability functions show slopes <1 indicating that with increasing probe level the probe is more susceptible to cochlear compression than the masker. The results support the use of forward-masked CAPs as an alternative measure to estimate ANF tuning and to compare frequency tuning across species.

Estimation of Cochlear Frequency Selectivity Using a Convolution Model of Forward-Masked Compound Action Potentials

Article 26 January 2024

François Deloche, Satyabrata Parida, … Michael G. Heinz

Estimating Cochlear Frequency Selectivity with Stimulus-frequency Otoacoustic Emissions in Chinchillas

Article 18 September 2014

Karolina K. Charaziak & Jonathan H. Siegel

Tuning of SFOAEs Evoked by Low-Frequency Tones Is Not Compatible with Localized Emission Generation

Article 27 March 2015

Karolina K. Charaziak & Jonathan H. Siegel

Introduction

Frequency tuning is a fundamental property of responses from the cochlea and higher level structures in the auditory system. Frequency tuning of the cochlea and auditory nerve fibers (ANFs) is most commonly studied by determining the stimulus intensity needed to achieve a criterion response (e.g., basilar membrane displacement or spike rate) over a range of pure tone frequencies. The resulting typically V-shaped curve is referred to as a frequency threshold curve, and from it the characteristic frequency (CF) and a measure for sharpness of tuning are obtained. By sharpness of tuning, we refer to the width of the V-shape relative to the CF, commonly expressed in terms of the so-called quality (Q) factor (see “Methods”).

In humans, sharpness of frequency tuning has been studied with a variety of behavioral and physiological methods but is unknown for single ANFs because their study requires invasive recordings. Despite the lack of single ANF measurements, it is generally thought that frequency tuning in humans is similar to that in commonly studied laboratory mammals (Ruggero and Temchin 2005). On the other hand, an increasing number of studies report sharper tuning in humans. An early study by Harrison et al. (1981) compared Q factors calculated from compound action potential (CAP) recordings in human patients to similar measurements obtained in guinea pig. The CAP is an evoked potential which reflects some properties of ANFs; it is the weighted summed extracellular response of synchronized ANFs at the onset of acoustic stimuli (Goldstein and Kiang 1958; Kiang et al. 1976; Antoli-Candela and Kiang 1978). Extrapolating from CAP measurements and from Q factors of single ANFs in guinea pigs, Harrison et al. (1981) concluded that frequency selectivity is higher in humans than in guinea pigs. More recently, based on behavioral (Oxenham and Shera 2003) and otoacoustic (Shera et al. 2010) measurements, it was concluded that frequency selectivity in humans is higher than previously thought, by a factor of more than two. Finally, cortical recordings in humans also reveal high frequency selectivity (Bitterman et al. 2008). It is controversial whether the high Q values reflect real species differences rather than differences in experimental paradigm (Ruggero and Temchin 2005; Eustaquio-Martin and Lopez-Poveda 2011). Comparison of tuning estimates from otoacoustic emissions and single ANF in macaque monkey supports the correspondence between the two methods and suggests that frequency tuning in Old World primates may be superior to that observed in more commonly studied species (Joris et al. 2011).

CAP recordings provide an electrophysiological means to compare tuning across species. In order to make predictions of the tuning at the single neuron level in humans, it is necessary to first assess the quantitative relationship between CAP and single unit measures in species where both can be measured or are available. To achieve this goal in the present study, we adopt a forward masking procedure (Oxenham and Shera 2003) to measure Q factors from CAP recordings in two species for which tuning in ANF is well characterized, but for which little data on CAP tuning are available (Ruggero and Temchin 2005).

Methods

Surgical preparation

The experiments were conducted using six adult cats and nine chinchillas of either sex. All procedures were approved by the KU Leuven Ethics Committee for Animal Experiments and were in accordance with the National Institutes of Health Guide for the Care and Use of Laboratory Animals. Animals were briefly examined for clear external and middle ears. Cats were anesthetized with a mixture of ketamine (20 mg/kg) and acepromazine (0.2 mg/kg) administered intramuscularly. A venous cannula was then placed to infuse lactated Ringer’s solution and sodium pentobarbital to maintain a deep state of anesthesia. A cannula was inserted into the trachea. Chinchillas were anesthetized with an intramuscular injection of ketamine (20 mg/kg), xylazine (3 mg/kg), and atropine (0.04 mg/kg). Supplementary doses of ketamine or intraperitoneal injection of pentobarbital were administered to maintain a surgical level of anesthesia.

The left pinna was surgically removed and the auditory bulla was exposed and opened. Through the opening in the bulla, a silver Teflon-insulated wire with ball electrode was inserted and placed at the round window. The electrode lead was glued to the bone and the bulla opening was closed again with ear impression compound (Microsonic). Two silver wire electrodes were placed in the skin: a reference electrode at the nape of the neck and a ground electrode near the contralateral bulla. During the measurements, the animals were kept warm with a homeothermic blanket (Harvard). The experiments were conducted in a double-walled soundproof and faradized room (Industrial Acoustics Company).

Stimulus generation

Stimuli were generated with custom software and a digital sound system (Tucker-Davis Technologies, system 2, sample rate 125 kHz/channel) consisting of a digital-to-analog converter (PD1), a digitally controlled attenuator (PA5), a headphone driver (HB7), and an electromagnetically shielded acoustic transducer (for chinchilla, Etymotic Research, ER1, 20 Hz–16 kHz; for cat a dynamic electro-acoustic transducer, Radio Shack, 20 Hz–50 kHz). The transducer was connected with plastic tubing to a custom earpiece coupler. In cat, the coupler was fit in the transversely cut ear canal; in chinchilla, the coupler was sealed with ear impression compound to the bony external auditory meatus. The acoustic system was calibrated in situ near the eardrum, through the custom coupler, with a calibrated probe microphone (Brüel & Kjær, type 4192, 1/2-in. condenser microphone and conditioning amplifier Nexus 2690).

Signal sampling

The cochlear responses were differentially measured (low noise differential preamplifier, Signal Recovery, Model 5115) between the signal electrode at the round window and the reference electrode in the nape of the neck. The ground of the differential preamplifier was connected to the electrode at the contralateral mastoid. The signal was filtered (100 Hz–10 kHz, cut-off slopes 12 dB/oct.) and amplified with another external amplifier (Stanford Research, SR560) to a total gain of × 10,000. All of the relevant signals including the filtered cochlear response, the stimulus waveform, and synchronization pulses at the start of every probe were visualized on an oscilloscope (LeCroy, WaveSurfer 24Xs), sampled and digitized with an ADC (TDT, RX8, ~100 kHz/channel, max. signal-to-noise ratio (SNR) 96 dB), and stored on disk for further analysis.

The amplitude of the CAP was measured between P1 (first positive peak) and N1 (first trough) as shown in the inset of Figure 4 (the remainder of that figure will be discussed later in the “Results” section). Whenever P1 was not clearly defined, the amplitude was measured between P2 and N1, and when P2 was not clearly visible, between the positive and negative maxima. Before the amplitude was measured, the CAP response was de-noised with a wavelet filter. At low probe frequencies (<2 kHz), an additional sharp notch FIR filter at twice the probe frequency was used to remove the neural phase-locked response. The measured responses contained significant background noise uncorrelated to the stimulus; at low stimulus intensities, the background noise exceeded the desired response signal. To increase the response SNR, we used signal averaging, typically 512 or 1,024 repetitions, depending on the background noise level.

Stimulus paradigm

Schematic representations of the notched-noise forward masking (NNFM) paradigm in the time and frequency domain are shown in Figure 1. The general principle is as follows: A notched-noise masker of variable level was followed by a 10-ms tonal probe at a fixed level (Fig. 1A). We varied the notch width of the masker and searched for the masker level required to suppress the CAP response to the fixed probe tone by a given percentage, from hereon called the masking criterion. (A full description of the protocol is given in the next section). To reduce spectral splatter due to the on- and off-switching of the stimuli, probe and masker were gated with 5 ms raised-cosine ramps. The 10-ms probe thus never reached steady state; it smoothly ramped up and down. The polarity of the probe tone alternated in successive presentations. The cochlear microphonic follows the polarity of the stimulus and is largely symmetrical, so that it is almost completely removed by averaging the responses to stimuli with alternating polarity (Dallos 1973). In contrast, the CAP—the signal we want to measure—is mainly from neural origin and is largely preserved after averaging.

Figure 1B depicts the idealized frequency spectrum of the stimulus. The notched-noise masker consisted of two Gaussian noise bands generated in software in the complex frequency domain and designed with infinitely steep edges (brick-wall shape). The real and imaginary parts of the spectral components were generated with a Gaussian random generator (randn) in MATLAB, spectrally calibrated and converted to the time domain with a discrete fast Fourier transformation (FFT function in MATLAB). For convenience, the amplitudes for the masker and probe signals were calculated before signal gating and before calibration. Note that (a) the probe levels with gating are 4.3 dB lower than those without gating and (b) the masker level is expressed as an overall level (total noise band), and not as a “masker spectrum level” (intensity level within a 1-Hz band). The conversion factor between the overall level and masker spectrum level is −10log₁₀(BW), where BW = f _p in this study.

Spectrally, the noise bands had equal bandwidth and straddled the probe frequency (f _p), so as to create a notch symmetrically spaced on a linear frequency axis. The masker notch width and level were the main experimental variables. The duration of the masker was set to exceed the settling time of the adaptation in the masker response. In initial pilot experiments, we used 300 ms, but after validation, we shortened the duration to 83.71 ms to speed up data collection. For a few test cases (not included in the results), we shortened it further to 33.71 ms, which resulted in Q values that were not significantly different compared to longer masker durations. Masker and probe were separated by a silent interval t _mp, so as to reduce overlap between the responses from the masker and the probe. Successive presentations of probe–masker combinations were separated with the interval t _s. The values of the intervals t _mp and t _s were adjusted in initial experiments and were fixed at 10 ms for the remaining experiments.

Experimental protocol

Masking tuning curves (MTCs) were constructed with a predetermined masking criterion by varying the notch width (NW) and masker level (L _m) while keeping the probe constant in level and frequency. Within a run, NW was held constant and L _m was varied in search for the probe response associated with the predetermined masking criterion, called the “criterion response.” The L _m values obtained at runs for different NW values resulted in one MTC.

The procedure to extract the MTC consisted of the steps schematically illustrated in Figure 2 (top panel). We obtained an input–output (I/O) curve in quiet, i.e., the probe level (L _p) was varied in level and the corresponding CAP amplitudes were measured without the masker. From these responses, a suitable probe level was determined. The choice of this level has to meet two conflicting demands. On the one hand, it should be low to obtain Q values as close as possible to CAP response threshold for better comparison with Q values from frequency threshold curves of single ANFs. On the other hand, it should be high enough to yield a reasonable SNR without requiring many measurement repetitions. For pragmatic reasons, we used the SNR of the CAP response (signal and noise in V _rms) as a figure of merit, and aimed at 18 dB SNR. To obtain a response with such SNR, a stimulus probe level was required of approximately 15 dB above the CAP threshold level, which is defined as the (averaged) probe level that produces a CAP response equal to the background noise (≡0 dB SNR). We refer to this probe level as the “fixed probe level” (① in Fig. 2) and to its response as the “fixed probe response” (② in Fig. 2). The fixed probe level is maintained throughout the different runs needed to collect data to construct a MTC. Note that the SNR value was chosen to have enough headroom to cover the smaller amplitudes of the masked responses, e.g., a masking criterion of 33 % reduces the SNR by approximately 3.5 dB.

Next (middle panel Fig. 2), we searched for the masker levels that suppressed the response amplitude at the fixed probe level with the masking criterion. In initial experiments, the masking criterion was chosen at 50 %, i.e., we determined L _m that reduced the response amplitude to the fixed probe by half. In later experiments, the masking criterion was refined to 33 % (see “Results”). In the first run (run 1), we presented a spectrally continuous masker (no-notch condition) and varied the masker level L _m in small steps to search for the level, e.g., L _m1, that resulted in the criterion response as estimated by simple linear regression. In the other subsequent runs, the masker levels (L _m2, L _m3, …, L _mn) were determined in the same way with the same probe and criterion conditions, but with increasing masker notch widths.

In the third and final step (bottom panel in Fig. 2), a plot of masker levels for different notch widths (L _m1, L _m2, …, L _mn) constitutes the MTC, from which the Q factor is obtained. To facilitate comparison across MTCs, we normalize both abscissa and ordinate. Masker levels at different notch widths are normalized by subtracting the masker level for the no-notch condition L _m1 and the notch widths are divided by the probe frequency (f _p). Thus, the first data point (NW₁, L _m1) with the no-notch condition is positioned at the origin of the normalized MTC. For clarity, the names of the normalized parameters are indicated with the letter “n” at the beginning of the corresponding label (e.g., nNW₂ = NW₂ / f _p and nL_m2 = L _m2 − L _m1). The MTC curve is estimated by second order polynomial regression. This is illustrated in Figure 3 where the data points from all runs, with different notch widths and masker levels, are plotted (open circles). For a better view, the axes are rotated relative to Figure 2 (bottom panel); the extra axis (x-axis) is the probe response. The dashed curves are fits, obtained with polynomial linear regression, through the responses of one run (i.e., one notch width). The intersections of these fits with the criterion response (here 2.6 μV) are indicated with crosses. The MTC is obtained as a fit through these crosses using polynomial linear regression (solid line in Fig. 3).

In this study, sharpness of tuning is quantified using two Q factors: the Q ₁₀ and the Q _ERB. Since sharpness of tuning obtained with the notched-noise paradigm is compared with that from single nerve fibers obtained with tones, the Q factors were extracted in two ways: first, directly from the iso-response data, and secondly, from a calculated auditory filter (Patterson 1976), which is the representation of an equivalent filter for tonal maskers.

Direct extraction method

For this method, the Q ₁₀ is defined as f _p/NW_{10 dB}, where f _p is the probe frequency in Hertz and NW_{10 dB} is the notch width at 10 dB above the nadir of the MTC. This notch width is graphically illustrated by the blue arrow in Figure 3. The Q _ERB is the ratio of the probe frequency to the MTC’s equivalent rectangular bandwidth (ERB). The ERB is an approximation of the bandwidth of the filter, modeled as a rectangular band-pass filter passing the same power. The Q _ERB is calculated as

$$ Q_{\text{ERB}} = \frac{{f{\text{p}}}}{\text{ERB}} = \frac{1}{{\mathop{\smallint }\nolimits_{{ - \infty }}^{{ + \infty }} {{\left( {\frac{1}{\text{nMTC}}} \right)}^2}{\text{dnNW}}}} $$

(1)

where Q _ERB is the dimensionless Q factor, f _p is the probe frequency in Hz, ERB is the equivalent rectangular bandwidth in Hz, nNW is the dimensionless notch width normalized to the probe frequency, and nMTC is the normalized MTC in decibel. Because the exact MTC is not known and needs to be estimated, we use a numerical integration technique to approximate the definite integral in Eq. 1. The Q _ERB is calculated with the trapezium rule for non-uniform intervals. The following equation was used

$$ {{Q}_{{{\text{ERB}}}}} \approx \frac{2}{{\sum\nolimits_{{i = 2}}^{N} {\left( {{\text{nN}}{{{\text{W}}}_{i}} - {\text{nN}}{{{\text{W}}}_{{i - 1}}}} \right)} \left( {\frac{1}{{{\text{n}}{{{\text{L}}}_{{{\text{m}}i}}}^{2}}} + \frac{1}{{{\text{n}}{{{\text{L}}}_{{{\text{m}}i - 1}}}^{2}}}} \right)}} $$

(2)

where nNW_i are the dimensionless normalized discrete notch widths, nL_mi are the normalized discrete masker levels in decibel, and N is the number of runs. Due to time constraints, only the most relevant part at the region of the nadir was measured and used in the calculation of the Q _ERB.

When enough data points are available, the Q _ERB is a more robust measure for the sharpness of tuning than the Q ₁₀. The Q _ERB is calculated from all available data points, whereas the Q ₁₀ is derived from the data points at 10 dB above the nadir of the MTC. This makes the Q _ERB less prone to local variations and more informative regarding the curvature of the slopes. Nevertheless, to limit the total time span needed to obtain a single Q value, we optimized our measurements more towards Q ₁₀ than to Q _ERB estimates. To achieve this, in some cases, a more direct method was used where NW_{10 dB}, the notch width at the corresponding masker level 10 dB above the nadir, was sought at the masking criterion; the Q ₁₀ is then calculated as the reciprocal of the normalized NW_{10 dB}. We refer to this method as the “fast method.”

Extraction from an auditory filter

The Q factors were calculated in a similar manner as for the direct extraction method, but from estimated tonal auditory filters commonly used in psychophysical experiments. The auditory filter shapes were obtained with the same fitting procedure as used in a recent psychophysical study (Oxenham and Shera 2003), but without correction for the middle ear transfer function and off-frequency listening. This fitting procedure is based on the power spectrum model of masking (Patterson 1976) assuming a rounded exponential (ROEX) filter shape (Patterson et al. 1982; Glasberg et al. 1984). Here, a symmetrical ROEX(p,w,t) function (Equation 3 in Oxenham and Shera 2003) with three independent parameters was used, with two equal filter sides. The equation for one of the symmetric sides of this ROEX filter is equal to

$$ W(g) = \left( {1 - w} \right)\left( {1 + pg} \right)\exp \left( { - pg} \right) + w\left( {1 + \frac{{pg}}{t}} \right)\exp \left( { - \frac{{pg}}{t}} \right) $$

(3)

where W(g) is the filter power weighting function, g is one half of the discrete normalized notch width (nNW_i/2), w is the weighting factor between the first and second term, p is a measure for the slope of the left term, and t is the ratio between the slopes of the left and right terms. From the power spectrum model, a fourth, dependent, parameter K (decibel) was extracted, which represents the masking “efficiency” corresponding to the predefined masking criteria, and is defined as the ratio of the powers of the probe and the amount of masker (spectral noise) passing the filter (W) necessary to comply with the masking criterion. For clarity, we indicate the sharpness of tuning obtained from these filters with the prefix ROEX.

In a few animals, we also studied the asymmetry of MTCs obtained with a single noise band positioned above or below the probe frequency. In those instances, the two MTCs obtained with single noise bands were merged and represented as a V-shaped MTC; we refer to these as single-sided MTCs (ss-MTC).

Results

In this study, we focus on the sharpness of tuning of CAP responses in cat and chinchilla. We calculate Q ₁₀ and Q _ERB factors from iso-response tuning curves derived from masked CAP responses recorded at the round window using a NNFM paradigm. The most complete datasets were obtained in cat. The probe frequencies varied from 0.5 to 14 kHz for cat and from 1 to 12 kHz for chinchilla. The sequence of probe frequencies tested was arbitrary. In the course of an experiment, repeated measurements were occasionally made as a control: in those cases, average Q factors are calculated and reported. We strove to use probe levels as low as practical. In some measurements, the probe level was systematically increased to explore the dependence of Q ₁₀ on level.

Fixed probe level

The first step was to determine a fixed probe level for the remainder of the procedure. Examples of typical CAP responses from cat to unmasked tone pips are depicted in Figure 4, for a probe frequency of 8 kHz. Traces for six different probe levels, increasing from bottom to top, are shown. The amplitudes are scaled: the table to the left gives the scaling factor, SNR and probe level for the corresponding responses. In this example, 22 dB SPL was chosen as the fixed probe level, based on the corresponding SNR of 17.2 dB, which is close to our predefined target value of 18 dB. Note that at high levels, P1 and N1 (as well as other components such as P2, N2) are clearly defined so that the CAP response can be simply defined as their amplitude difference. However, at low levels, a broad maximum rather than a distinct P1 component was often seen (lower traces in Fig. 4) and was used to quantify the response amplitude. In these cases, it is impossible to know whether this absolute maximum corresponds to P1, P2, or the summating potential or a mixture thereof. In practice, the exact choice of response metric is not critical because our paradigm uses iso-response conditions and the intrinsic shape of the CAP response does not change much with notch width and masker level.

Masking criterion

Earlier studies (Eggermont 1977; Harrison et al. 1981) used a variety of masking criteria. In our initial experiments, we used a masking criterion of 50 %, but subsequently settled on 33 % which we found to be optimal, as illustrated in Figure 5. This figure shows three masking intensity curves in cat for three different conditions (see caption Fig. 5). The normalized masking is plotted as a function of masker level. The curves show four distinct regions: region I: no masking; region II: steep linear growth of masking between ~10 and 60 % (upper and lower horizontal dashed dotted lines) on a semi-log scale; region III: transition region, with a slope significantly smaller than in region II; and region IV: saturation of masking, the region is only visible for the leftmost curve (red filled dots). Region II shows a linear relationship between level and masking in the three curves, which differ in frequency, notch width, and probe level. We therefore chose 33 % as the optimal level for our masking criterion (middle dashed dotted line) and used simple linear regression. There is a clear difference in slope between the two curves with higher probe levels (blue diamonds and green squares, slopes of 1.7 and 1.8 dB⁻¹, respectively) and the one with low probe level (red dots, 3.4 dB⁻¹). The masking intensity curves are similar to those for single fiber measurements (Harris and Dallos 1979); the red curve in our example with low probe level is almost identical with the same ascending slope to the normalized curve in their Figure 9.

Masking recovery

As a final check on the chosen parameter values in our NNFM paradigm, i.e., t _m, t _mp, and t _s, we verified the (fast) recovery from masking. Figure 6 illustrates the recovery of a 10-kHz, 10-ms, probe tone from a preceding no-notch noise masker with a duration of 83.71 ms (t _m); t _s was chosen to allow full (fast) recovery. The recovery function is measured in successive measurements where t _mp is increased from 0 to 30 ms in steps of approximately 5 ms while the masker to masker interval was held constant. The data points are well described by a recovery curve based on an exponential fast recovery model used for forward masking in single auditory nerve fibers (Harris and Dallos 1979). The exponential fast recovery model predicts that >99 % of the probe response would recover within the next probe to probe interval of 113.71 ms (t _p + t _s + t _m + t _mp), which validates the current chosen parameter values.

MTCs, Q ₁₀, and Q _ERB

Figure 7 presents three sets of MTCs obtained in three different cats for various probe frequencies. The MTCs are reconstructed by straight lines between the data points. All MTCs show a monotonic increase with notch width and can have a slightly concave or convex curvature. In the three animals, there is a consistent increase in slope with increasing probe frequency which is a good indication that the CAP responses are generated by single units with CF close to the probe frequency.

The notched-noise paradigm yields one-sided masking tuning curves and, in its standard configuration with symmetrical notch, does not allow separate examination of the part of the filter above and below the probe frequency. One of the assumptions made, validated below in “Single-sided noise maskers,” is that the MTC, at low probe level, is symmetric around the nadir, i.e., both noise bands produce the same amount of masking for a given notch width. As illustrated in Figure 7B, the intersection of these one-sided tuning curves with a horizontal line at a relative masker level of 10 dB provides an estimate of the Q ₁₀ for notched-noise maskers.

Figure 8 shows the two indicators of sharpness, Q ₁₀ (Fig. 8A) and Q _ERB (Fig. 8B), plotted on a linear frequency axis for different cats (indicated with different symbols, see legend). In both figures, the filled dots, connected with dashed lines, are the average values across animals. The mean trend line of the Q ₁₀ (Fig. 8A, dashed line) shows a fairly linear relation with probe frequency. For some probe frequencies, measurements are available for only a single animal and are thus statistically less reliable. For example, at 14 kHz, the mean trend line suggests a decreasing Q ₁₀ factor beyond 12 kHz, but this is not supported by the within-animal trend (experiment A0962, … < Q _{10, 10 kHz} < Q _{10, 12 kHz} < Q _{10, 14 kHz}). A trend predictor more resistant to outliers, missing and scattered data points is the robust LOESS trend line (MATLAB routine smooth, method RLOESS, span 0.7), indicated in red in both plots. The LOESS trend lines show a, monotonically increasing, smoother course than the mean trend lines. The Q _ERB trend line (Fig. 8B) shows a clear bend at 4 kHz with a much steeper slope at the lower frequency side.

In this study, Q factors are obtained with notched-noise maskers whereas Q factors from single fibers are obtained with tones. To make a more appropriate comparison between the results from different maskers/exciters, i.e., noise maskers versus tones, the MTCs from three cats were converted to auditory filters as commonly used in psychophysical experiments (see “Extraction from an auditory filter” in “Methods”). The filter parameters and Q factors obtained from the ROEX filters are given in Table 1. It is clear from this table that for most of the obtained ROEX filter functions the second term in the ROEX(p,w,t) is negligible (w < −100 dB). Note that the masking “efficiency,” K, in the table depends on the specific probe level and masking criterion and that direct comparison between K factors with different conditions is meaningless.

TABLE 1 Parameters, filter parameters, and results derived from the same data as in Figure 7

Full size table

The Q ₁₀ and Q _ERB factors from Table 1 are depicted as a function of frequency in Figure 9. The ROEX-Q ₁₀ trend line in Figure 9A shows the same course as the directly obtained Q ₁₀ trend line (replotted from Fig. 8A), but with slightly lower values. The relationship (ratio) between the ROEX-Q ₁₀ and the directly obtained Q ₁₀ trend lines is constant over the considered frequency range with a mean value of 0.89 (STD 0.09) and is depicted as the black line in Figure 9A. Since the ratio is small and because of the additional uncertainties in the ROEX-Q ₁₀ arising from the assumptions (linearity and ROEX filter shape) in the derivation of the auditory filter, we consider the difference between the two curves too small to warrant use of the ROEX filter method. Therefore, in the remainder of this study, we only consider the Q ₁₀ values directly obtained from the MTCs.

In panel B, the values for the ROEX-Q _ERB and for the directly obtained Q _ERB are shown for the same animals as in panel A. The trend line for the ROEX-Q _ERB is besides a proportionality factor identical to the trend line of the ROEX-Q ₁₀ in panel A. The proportionality factor is equal to 1.95 and is biased by the dominant term in the ROEX(p,w,t) function at the tip of the symmetric auditory filter, i.e., W(g) = (1 + p|g|)exp(−p|g|). This value is almost equal to the theoretical value of ~1.945, which is calculated from the following derived equation

$$ {Q_{\text{RATIO}}}\left( \alpha \right) = \frac{{{Q_{\text{ERB}}}}}{{{Q_{\alpha }}(\alpha )}} = - \frac{{{\Omega_{{ - 1}}}\left( { - {e^{{ - 1}}}{{10}^{{( - \frac{\alpha }{{10}})}}}} \right) + 1}}{2} $$

(4)

where α means the relative level (decibel) associated to the type of the Q factor, e.g., α = 10 for Q ₁₀; and where Ω ₋₁ represents the omega function, which is the inverse relation of the function f(w) = we ^w for the lower branch. The course of the directly obtained Q _ERB deviates significantly from the ROEX-Q _ERB; it is further discussed below in the part “Comparison with single auditory nerve fibers.”

Figure 10 presents Q ₁₀ factors as a function of probe frequency for nine chinchillas. The mean values have a similar increasing trend with probe frequency as in cat, but display much more variability. Despite the large variability, the trend (RLOESS) is very similar to the trend for cats. The trends for cat and chinchilla are further discussed below.

Dependence on parameter choices

Most studies examining frequency tuning are conducted at threshold or at low probe levels, at which frequency selectivity is highest (Robles and Ruggero 2001). Due to the nature of the recorded signal and paradigm, compromises had to be made regarding SNR and number of repetitions. Probe levels were never at threshold, but at least 15 dB above CAP threshold and varied across the measured MTCs. So far, in the comparison between MTCs, we made abstraction of the probe level by normalizing the results. In this section, we examine the implication of some of our experimental parameter choices.

Probe level

We examined the dependence of Q ₁₀ on probe level. Cochlear frequency selectivity decreases with level: this is well documented for iso-input measurements although it may increase for iso-response measurements (Eustaquio-Martin and Lopez-Poveda 2011). Figure 11 presents Q ₁₀ values as a function of probe level for probe frequencies ranging from 3 to 10 kHz, for three different cats. The trend lines for the most comprehensive data show a low-pass behavior with a cut-off level between approximately 40 and 50 dB SPL. Below the cut-off level, the Q ₁₀ is roughly independent of the probe level. Some trends, i.e., for 5 and 8 kHz might suggest a local increasing or decreasing trend with frequency, but these are not significant, given the large standard error of the estimate for regression Above the cut-off level, the Q ₁₀ declines monotonically with slopes of approximately −0.050, −0.067, and −0.046 dB/dB for 4, 8, and 10 kHz (curves with data points above 55 dB SPL). The probe levels that would typically be used (with the predefined SNR criterion of 18 dB) would be 45, 40, 35, 35, and 40 dB SPL for 3, 4, 5, 8, and 10 kHz, respectively. Thus, the Q ₁₀ values obtained with our 18 dB SNR criterion represent the highest values that can be measured with this paradigm at all probe frequencies.

We interpret the decline of Q ₁₀ beyond the cut-off level (Fig. 11) to reflect a change in operation mode (linear to compressive) of the probe. Mechanical measurements of basilar membrane motion to tones (Robles and Ruggero 2001) reveal an input–output (I/O) relationship consisting of three regimes (Fig. 13B): linear (1 dB/dB) regimes at the lowest and highest SPLs, connected by a compressive portion with shallower slope (taken at 0.2 dB/dB). At low probe levels, both probe and masker are in the linear regime and Q values are independent of SPL. With our masking criterion of 33 %, at higher probe levels, the probe can be in the compressive regime of the I/O curve while the masker is in the low SPL linear regime. In this situation, a change (reduction) in response due to a small increase in masker level is greater than a change (increase) in response to the same small increment in the probe level. The decline in Q ₁₀ associated with the compression of the probe is not the result of the corresponding additional increase in probe level, but is related to the associated deformation (widening) of the probe’s excitation pattern. As a global effect, to keep the amount of masking constant, the notch width of the masker has to be widened, so the Q value decreases.

Growth-of-maskability

The interplay between probe level and masker level is traditionally measured with growth-of-masking (GOM) functions, which describe the dependence of masked threshold on masker level. We examined how the masker level required to generate a criterion response depends on probe level. Following the psychophysical literature (Nelson et al. 2001), we call the resulting functions (Oxenham and Plack 1997) growth-of-maskability (GMB) functions. According to the reasoning in the previous paragraph, we expect GMB functions to have a slope <1 dB/dB if the probe level is in a compressive regime while the masker level is in a linear regime. GMB functions for several probe frequencies are plotted in Figure 12 using a no-notch masker (masker reference level, i.e., L _m1 in Fig. 2). Masker (reference) levels for probe levels above roughly 40 dB SPL, associated to the cut-off level in Figure 11, show little dependence on probe level. The slopes are smaller than 1 dB/dB (cf. dashed line of equality) and closer to ~0.2 dB/dB (lower dashed line). These small slopes are consistent with our interpretation that the masker is operating in the linear region while the probe is operating in the compressive region, causing the decline in Q ₁₀ with probe level (Fig. 11). At the lowest probe levels, the data points deviate significantly from the course of the other data points; the reason is unclear and is further briefly discussed below in the topic about the GMB model.

For comparison with psychophysical data, it is useful to illustrate our reasoning with a very simple model. Figure 13A shows GMB curves for different masking conditions. For example, let us assume that for a given masking condition the effective “output” (e.g., the vibration pattern of the basilar membrane) in response to the probe stimulus has to be 15 dB higher than the response to the masker. At low stimulus intensities, in the linear regime, this requires a masker level 15 dB below the probe level. At high stimulus intensities, with the probe in the compressive regime, the masker level required is much lower than −15 dB relative to the probe level (e.g., for the I/O function in Fig. 13B, this would be a difference of −47 dB for a probe level of 80 dB SPL). The −15-dB blue line in Figure 13A shows all combinations of probe and masker levels that will generate at 15 dB difference between probe and masker output. If, on the other hand, the masking conditions would be such that the response to the probe stimulus has to be 15 dB lower than the response to the masker, the pattern is inverted so that the predicted GMB curves are mirror symmetric relative to the diagonal of equality (red vs. blue curves in Fig. 13).

The masking conditions (paradigm parameters such as masking criterion, masker–probe interval, masker length, etc.) determine the relative levels required between masker and probe to obtain masking and therefore the expected trajectory of the GMB curves. The black line in Figure 13A (diagonal of equality) represents the relation when the levels of probe and masker are equal. This situation can be obtained with tonal on-frequency maskers; for an example, see listener CP for 6 kHz in Fig. 2 (left) in Oxenham and Plack (1997). For all other curves, the compressive nature of the cochlea introduces regions with slopes deviating from unity. For an idealized I/O curve, the values of these slopes correspond to the compression exponent (e.g., 0.2 or 5 dB/dB). GMB or GOM curves obtained from studies using two-interval forced choice adaptive procedures or using “linear” off-frequency maskers to estimate compression exponents (Relkin and Turner 1988; Oxenham and Plack 1997; Plack and Oxenham 1998) typically have higher masker levels than probe levels and look similar to the red curves in Figure 13A. In the present paradigm, the masker level needed to achieve the masking criterion of 33 % is significantly lower than the probe level and therefore the shape of the curves is reciprocal (blue curves) to GMBs reported in the literature.

The slope of the GMB above the cut-off level (~40 dB SPL) for the 10-kHz data (green dots in Fig. 13A) is well predicted by the model. Below the cut-off level, some of the masker levels deviate from the prediction of the model, i.e., the lowest data points are off-set and do not show a clear predicted slope equal to 1 dB/dB. This observation is also valid for most of the other GMB curves in Figure 12 having different probe frequencies. Note that these data points were measured at unusually low probe levels, below our quality criterion.

Single-sided noise maskers

All previous results were obtained with notched-noise maskers symmetrically positioned with respect to the probe frequency. The assumption here is that, at least at low probe levels, the slopes of the MTCs are symmetrical at both sides. To explore the validity of this assumption, we compared MTCs obtained using stimuli with single- and (symmetrical) double-sided noise maskers, hereafter named ss-MTCs and ds-MTCs, respectively. Figure 14 depicts four MTCs, with ss-MTCs and ds-MTCs at frequencies of 4, 5, and 8 kHz, obtained from two different cats. The abscissa shows the normalized spectral distance between the closest edge of the masker and the probe. The ss-MTCs are obtained by joining two independently measured MTCs obtained with a left and a right single-sided noise masker: negative relative masker–probe distances indicate a masker band below the probe frequency (f _m < f _P) while positive values indicate a masker band above the probe frequency (f _m > f _P). For the standard double-sided masker, we plot the MTC twice, mirror symmetrically. Note that in this case, the spectral distance equals half of the notch width. For visualization, left and right sides were fit by a trend line and vertically aligned at the origin. To minimize drift and variability between responses to single- and double-sided noise maskers, all measurements with the same notch widths were interleaved and recorded in one continuous measurement. The ds- and ss-MTCs do not show large differences. Particularly for single-sided maskers above the probe frequency, the trend line is very similar to that of the double-sided maskers. For single-sided maskers below the probe frequency, the data points are slightly elevated at the lowest probe frequencies (red curves in Fig. 14C and A, 4 and 5 kHz), but the slopes of the trend lines are similar.

Comparison with single auditory nerve fibers

The main goal of our study was to develop an electrophysiological method to estimate sharpness of tuning from mass cochlear potentials. In this section, we compare our estimates with those for single ANFs. The dots in Figure 15 show neural Q ₁₀ values for ANFs available from previous studies in our laboratory. As observed in many previous studies, not only in ANFs but also behaviorally and in the CNS (Oxenham and Shera 2003; Shera and Guinan 2003; Sayles and Winter 2010), Q ₁₀ increases approximately linearly with CF on a log-log scale. Interestingly, again in agreement with many previous ANF studies in different species (guinea pig [Evans 1972; Sayles and Winter 2010]; cat [Liberman 1978]; squirrel monkey [Rhode et al. 2010]; macaque monkey [Joris et al. 2011]), the distribution levels off or even bends over at its upper end, i.e., there is no further increase in Q ₁₀ values at the highest CFs. To our knowledge, the mechanism(s) behind this saturating relationship have not been addressed.

Figure 15A compares the Q ₁₀ CAP trend line (red, same as in Fig. 8A) with the trend of the ANFs (dashed black line). Note that the CAP trend line is located within the cluster of ANF data points. For low and middle probe frequencies, the CAP trend closely follows the lower boundary of the ANF data cluster, but at 10 kHz, the CAP trend intersects the ANF trend.

Many studies report Q _ERB rather than Q ₁₀ values, the rationale being that the former provides a more robust measure because it incorporates all measurement points rather than only the points at the lowest threshold (CF) and 10 dB above. However, as pointed out in the “Methods” section, this is an advantage only when sufficient data points are available around the nadir of the tuning curve. Figure 15B compares the CAP-Q _ERB trend line (red, same curve as in Fig. 8B) with that from the ANF data. As in the case for the Q ₁₀, the CAP-Q _ERB trend line is largely located within the cluster of ANF data points, but it is less congruent with the lower envelope of that distribution. This is in line with the earlier finding in Figure 9B, where the course of the CAP-Q _ERB clearly differed from that of the ROEX prediction. To understand this discrepancy, we studied the Q _ERB/Q ₁₀ relationships in cat for ANF and CAP. For ANFs, the Q _ERB/Q ₁₀ ratio is largely independent of species and CF ([Shera and Guinan 2003], footnote 6). Figure 16 shows the Q _ERB/Q ₁₀ ratios derived from ANFs (gray dots and dashed line) and CAPs (blue line, data from Fig. 15) in cat. To illustrate the ratio for different tip curvatures, associated cartoons are shown at the right side of the figure. It is clear that the Q _ERB/Q ₁₀ ratio of the CAPs has a much larger dependence on frequency than the trend of the ANFs, which is rather stable with a mean value of 1.82. Our interpretation is that due to a small number of data points near the nadir of the MTCs, the Q _ERB values for CAPs are unreliable. This is confirmed by the analysis of the MTCs from Figure 14 which have much finer spaced notch widths. The corresponding ratios of these double-sided and single-sided MTCs are included in Figure 15 as the red circles and green squares, respectively: these values straddle the lower boundary of the ANF distribution of Q _ERB values rather than the mean trend (red curve). The associated Q _ERB/Q ₁₀ ratios depicted as the red circles and green squares in Figure 16 also show more consistency with the ratios of ANFs. In summary, the CAP-Q ₁₀ values (Fig. 15A) are consistent between data obtained at high and at low resolution, while this is not the case for the Q _ERB values (Fig. 15B). This confirms our interpretation that Q _ERB factors are only reliable when enough data points are available around the nadir of the tuning curve. Because we do not have enough data with sufficient data points around the nadir, we restrict ourselves to Q ₁₀ factors in the remaining analysis.

As has been pointed out in the “Introduction,” human sharpness of frequency tuning is unknown for single ANFs. CAP recordings provide an electrophysiological means to compare tuning across species, including humans. Moreover, CAP/ANF conversion functions of representative animals may allow us to estimate neural sharpness of tuning in animals or humans where no measurements of single units are available. In order to estimate ANF tuning from CAP tuning in other species and humans, we first have to assess the quantitative relationship between CAPs and single unit responses. In Figure 17A, the CAP-Q ₁₀ of chinchilla (red line) and cat (blue line) are compared with the trends of ANFs (chinchilla: from (Ruggero and Temchin 2005); cat: own data from Fig. 15). As in cat (see earlier discussion in “Comparison with single auditory nerve fibers”), the CAP-Q ₁₀ trend of chinchilla runs below the ANF trend, intersects it at 7.3 kHz, and is slightly larger above this frequency.

The relationship between Q ₁₀ ratios of ANF and CAP for cat (blue) and chinchilla (red) are shown in Figure 17B. Although the ratios in chinchilla are generally lower, the resemblance between the curves of the two animals is remarkable. Two regions can be distinguished that are separated by a deflection point at ~5 kHz. The first region, below the deflection point, shows a modest decrease with frequency. The second region has a much steeper declining slope and includes the intersection of the trend lines (cat—10 kHz, chinchilla—7.3 kHz), indicated by the dashed line.

Discussion

Whether frequency tuning is sharper in humans than in commonly studied laboratory animals is a matter of controversy (Ruggero et al. 1997; Shera et al. 2002; Oxenham and Simonson 2006; Lopez-Poveda et al. 2007; Eustaquio-Martin and Lopez-Poveda 2011; Joris et al. 2011). The aim of the present study was to examine whether the combination of forward masking and CAP recording provides a suitable means to investigate the basic frequency tuning of the neural input to the central nervous system. CAP recordings can be performed in humans in a minimally invasive way (Harrison et al. 1981) and may therefore provide comparative data across species. But the CAP is a mass potential, giving indirect information regarding tuning in single nerve fibers: “tricks” in stimuli and analyses are required to estimate frequency tuning. Behavioral studies also face this problem, and Oxenham and Shera (2003) argued that forward masking paradigms hold the greatest promise in this regard. However, from an extensive literature survey, Ruggero and Temchin (2005) concluded that “forward-masking psychophysical tuning curves […] greatly overestimate the sharpness of cochlear tuning in experimental animals and, hence, also probably in humans”. To investigate the validity of forward-masked CAP responses as a probe for frequency tuning, we adopted the masking paradigm of Oxenham and Shera (2003) and recorded from two species for which abundant single nerve fiber data are available: cat and chinchilla.

We find that masking of CAP responses behaves as expected from cochlear and nerve physiology in a number of respects. The masking intensity curve to broadband noise at low probe level (Fig. 5, red curve) is very similar to that of tonal maskers in single fibers (Harris and Dallos 1979). Also, recovery from masking as described by these authors applies in similar form to our data (Fig. 6). Masking tuning curves show a monotonic increase with notch width, and they become narrower with increasing probe frequency (Fig. 7). Tuning sharpness, as quantified by Q ₁₀, decreases for probe levels above ~45 dB SPL (Fig. 11). The slope of the growth of maskability with probe level is as expected from cochlear compression (Figs. 12 and 13). Single-sided and double-sided maskers result in quite similar estimates for Q ₁₀, which validates the assumption of symmetry around the nadir of the tuning curve (Figs. 14 and 15). Finally and most importantly, CAP-Q ₁₀ values increase with probe frequency at a slope very similar to the trend in single ANFs (Figs. 15, 17, and 18), and the same basic relationship is found in the two species studied (cat and chinchilla), although the data in chinchilla showed more variability. We find that over most of the frequency range, the CAP-Q ₁₀ closely follows the lower boundary of the distribution of neural Q ₁₀ values (Fig. 18) and underestimates the trend rather than overestimating it as seen in some of the available masking studies (symbols in Fig. 18). Only at high frequencies does the CAP trend cross the trend for single fibers: in cat, this crossing point is near 9 kHz, and in chinchilla, it is near 7 kHz. Overall, the results show that the technique of notched-noise forward-masked CAP measurements provides a reliable estimate of the lower boundary of frequency tuning in the auditory nerve. Even if it would be the case that psychophysically the paradigm results in an overestimate of frequency tuning (Ruggero and Temchin 2005), this is not the case for the paradigm used in this study.

In the present paper, we explore the paradigm as an empirical tool to obtain Q measures from CAP recordings. To fully understand the relationship between single fiber and CAP responses, it is necessary to (1) characterize the population response of fibers to the stimuli used and (2) to characterize the “unit contribution” of each fiber at the round window. For example, the CAP response to click stimuli has been modeled convolving post-stimulus time histograms with spike-triggered averages of the round window potential for single fibers and summing across CF (Goldstein and Kiang 1958; Antoli-Candela and Kiang 1978; Wang 1979; Prijs 1986). Single fiber studies of forward masking are limited and restricted to probe frequencies at the CF of the fiber studied (Harris and Dallos 1979): population data are needed to allow prediction of forward-masked CAPs. In the absence of such data, auditory nerve models can be used to gain insight into the relationship between the notch-noise forward-masked CAP data and single fiber data. A full treatment of factors generating the patterns observed is deferred to a subsequent study and we here restrict ourselves to some general comments on potential mechanisms.

Synchronization across nerve fibers is essential to obtain a measurable CAP (Goldstein and Kiang 1958). Cochlear mechanisms cause a CF-dependent delay such that high CF fibers tend to be more synchronized to each other than low CF fibers. In Figure 17B, we show the ratio between neural and CAP-Q ₁₀ values as a function of frequency, for cat (blue line) and chinchilla (red line). The trend is similar for the two species and shows two segments with different slopes. We surmise that the general decrease in the ratio from low to high probe frequencies, as well as the accelerating trend in this decrease, reflects an increase in across-fiber synchronization with increasing probe frequency. However, it is difficult to estimate the interaction of probe frequency, notch bandwidth, and probe and masker level on response variables such as response amplitude and synchronization without further empirical data or use of a detailed nerve model.

A factor which has to be taken into account is the (a)symmetry of the excitation patterns. One of the assumptions made in the paradigm is symmetry in the excitation patterns around the probe frequency. Preliminary modeling suggests that the presence of asymmetry around the nadir of the MTC predicts lower Q values measured with (symmetrical) double-sided maskers. We made a limited number of measurements with single-sided maskers (Fig. 14), which have the ability to reveal asymmetry in MTCs, and indeed, the Q factors yielded somewhat higher values than those obtained with double-sided maskers. This is illustrated in Figure 15A where the Q ₁₀ values from the single-sided noise maskers (green squares) are consistently higher than the double-sided noise maskers (red circles).

Figure 18 shows Q values from several sources (compiled by Ruggero and Temchin 2005; for references, see legend). Behavioral studies used simultaneous masking (Fig. 18A: green squares, Fig. 18B: green line) or forward masking (orange diamonds); CAP studies also used simultaneous masking (blue circles) or forward masking (red trend line: this study). The trend lines for single fiber tuning are also shown. Surprisingly, nearly all values based on simultaneous masking are above our CAP trend line (red) and even above the trend line for the neural data (dashed). As pointed out by Ruggero and Temchin (2005), the behavioral data obtained with forward masking (chinchilla) yield extremely high Q values compared to either the single fiber nerve data or to our CAP data. Similar data and trends are shown for other species in Figs. 2 and 4 of Ruggero and Temchin (2005). The origin of the high Q values measured with behavioral forward masking in chinchilla (or in other species) is unclear. Ultimately, the behavioral data must reflect the interpretation by a central processor of activation patterns in the auditory nerve, and it is unclear which aspects of these patterns are critical (Relkin and Turner 1988; Relkin and Smith 1991). For example, “off-frequency listening” is mentioned by Ruggero and Temchin (2005) as a probable cause for the large difference between forward-masked behavioral and neural data. However, a possible peripheral factor leading to overestimation of sharpness is compression of the masker, also discussed by Eustaquio-Martin and Lopez-Poveda (2011). Note that compression can also occur in threshold tuning curves of auditory nerve fibers, especially for frequencies away from CF, which require higher signal levels to comply with the same response criterion. However, Q ₁₀ values for auditory nerve fibers are unlikely to be affected by compression, because the corresponding SPL values (at the tuning curve nadir and the range 10 dB above) are too low to be in the compressive range. The schematic input–output function in Figure 13B shows two linear regions connected by a compressive region where larger masking levels are required to obtain the same effective masking as in the linear regions—an outcome which is phenomenologically similar to that of a high Q system. Growth-of-maskability functions in detection studies commonly show a slope ≥1 (Relkin and Turner 1988; Relkin and Smith 1991; Oxenham and Plack 1997; Plack and Oxenham 1998), as illustrated by the red lines (Fig. 13A). In contrast, in our data (Fig. 13A, green circles), these functions had a slope <1, suggesting that the probe was rather in the compressive region and the masker in the lower linear region and vice versa in detection studies like most psychophysical studies. The difference may stem from the masking criterion. We surmise that the effective masking criterion, and therefore masking, is high in detection studies, while it is typically low and predefined in physiological studies (e.g., 33 % here and in Dallos and Cheatham [1976], Fig. 6). Variables that increase masking (e.g., shortening of the masker–probe interval; increase in masker duration) can influence the masking criterion and thereby affect the estimated Q value. Most of our measurements were at low probe and masker levels: to the extent that the probe levels reached the compressive region of the input–output function, the Q values obtained with forward-masked CAPs would be underestimates of nerve fiber tuning.

In psychophysical studies, it is common practice to convert MTCs obtained with notched-noise maskers to tonal equivalent auditory filters using a power spectrum model of masking. In contrast to psychophysical studies, Q ₁₀ values extracted from auditory filters (Fig. 9) were, although not importantly, smaller than those directly obtained from MTCs. This suggests that, for cochlear tuning, masking with notched-noise maskers is similar to masking with tonal maskers. The lack of off-frequency listening and central auditory processing could explain the difference with psychophysical studies.

References

Antoli-Candela EJ, Kiang NYS (1978) Unit activity underlying the N1 potential. In: Naunton R, Fernandez C (eds) Evoked electrical activity in the auditory nervous system. Academic, New York, pp 165–189
Google Scholar
Bitterman Y, Mukamel R, Malach R, Fried I, Nelken I (2008) Ultra-fine frequency tuning revealed in single neurons of human auditory cortex. Nature 451:197–201
Article PubMed CAS Google Scholar
Carlier E, Lenoir M, Pujol R (1979) Development of cochlear frequency selectivity tested by compound action potential tuning curves. Hear Res 1:197–201
Article Google Scholar
Dallos P (1973) The auditory periphery. Academic, New York
Google Scholar
Dallos P, Cheatham MA (1976) Compound action potential (AP) tuning curves. J Acoust Soc Am 59:591–597
Article PubMed CAS Google Scholar
Eggermont JJ (1977) Compound actionpotential tuning curves in normal and pathological human ears. J Acoust Soc Am 62:1247–1251
Article PubMed CAS Google Scholar
Eustaquio-Martin A, Lopez-Poveda EA (2011) Isoresponse versus isoinput estimates of cochlear filter tuning. J Assoc Res Otolaryngol 12:281–299
Article PubMed Google Scholar
Evans EF (1972) The frequency response and other properties of single fibres in the guinea-pig cochlear nerve. J Physiol 226:263–287
PubMed CAS Google Scholar
Glasberg BR, Moore BC, Patterson RD, Nimmo-Smith I (1984) Dynamic range and asymmetry of the auditory filter. J Acoust Soc Am 76:419–427
Article PubMed CAS Google Scholar
Goldstein MH, Kiang NYS (1958) Synchrony of neural activity in electric responses evoked by transient acoustic stimuli. J Acoust Soc Am 30:107–114
Article Google Scholar
Harris DM, Dallos P (1979) Forward masking of auditory nerve fiber responses. J Neurophysiol 42:1083–1107
PubMed CAS Google Scholar
Harrison RV, Aran JM, Erre JP (1981) AP tuning curves from normal and pathological human and guinea pig cochleas. J Acoust Soc Am 69:1374–1385
Article PubMed CAS Google Scholar
Joris PX, Bergevin C, Kalluri R, Mc Laughlin M, Michelet P, van der Heijden M, Shera CA (2011) Frequency selectivity in Old-World monkeys corroborates sharp cochlear tuning in humans. Proc Natl Acad Sci U S A 108:17516–17520
Article PubMed CAS Google Scholar
Kiang NYS, Moxon EC, Kahn AR (1976) The relationship of gross potentials recorded fron the cochlea to single unit activity in the auditory nerve. In: Ruben RJ, Elberling C, Salomon G (eds) Electrocochleography. University Park Press, Baltimore, pp 95–115
Google Scholar
Liberman MC (1978) Auditory-nerve response from cats raised in a low-noise chamber. J Acoust Soc Am 63:442–455
Article PubMed CAS Google Scholar
Lopez-Poveda EA, Barrios LF, Alves-Pinto A (2007) Psychophysical estimates of level-dependent best-frequency shifts in the apical region of the human basilar membrane. J Acoust Soc Am 121:3646–3654
Article PubMed Google Scholar
McGee T, Ryan A, Dallos P (1976) Psychophysical tuning curves of chinchillas. J Acoust Soc Am 60:1146–1150
Article PubMed CAS Google Scholar
Nelson DA, Schroder AC, Wojtczak M (2001) A new procedure for measuring peripheral compression in normal-hearing and hearing-impaired listeners. J Acoust Soc Am 110:2045–2064
Article PubMed CAS Google Scholar
Niemiec AJ, Yost WA, Shofner WP (1992) Behavioral measures of frequency selectivity in the chinchilla. J Acoust Soc Am 92:2636–2649
Article PubMed CAS Google Scholar
Oxenham AJ, Plack CJ (1997) A behavioral measure of basilar-membrane nonlinearity in listeners with normal and impaired hearing. J Acoust Soc Am 101:3666–3675
Article PubMed CAS Google Scholar
Oxenham AJ, Shera CA (2003) Estimates of human cochlear tuning at low levels using forward and simultaneous masking. J Assoc Res Otolaryngol 4:541–554
Article PubMed Google Scholar
Oxenham AJ, Simonson AM (2006) Level dependence of auditory filters in nonsimultaneous masking as a function of frequency. J Acoust Soc Am 119:444–453
Article PubMed Google Scholar
Patterson RD (1976) Auditory filter shapes derived with noise stimuli. J Acoust Soc Am 59:640–654
Article PubMed CAS Google Scholar
Patterson RD, Nimmo-Smith I, Weber DL, Milroy R (1982) The deterioration of hearing with age: frequency selectivity, the critical ratio, the audiogram, and speech threshold. J Acoust Soc Am 72:1788–1803
Article PubMed CAS Google Scholar
Pickles JO (1979) Psychophysical frequency resolution in the cat as determined by simultaneous masking and its relation to auditory-nerve resolution. J Acoust Soc Am 66:1725–1732
Article PubMed CAS Google Scholar
Plack CJ, Oxenham AJ (1998) Basilar-membrane nonlinearity and the growth of forward masking. J Acoust Soc Am 103:1598–1608
Article PubMed CAS Google Scholar
Prijs VF (1986) Single-unit response at the round window of the guinea pig. Hear Res 21:127–133
Article PubMed CAS Google Scholar
Relkin EM, Turner CW (1988) A reexamination of forward masking in the auditory nerve. J Acoust Soc Am 84:584–591
Article PubMed CAS Google Scholar
Relkin EM, Smith RL (1991) Forward masking of the compound action potential: thresholds for the detection of the N1 peak. Hear Res 53:131–140
Article PubMed CAS Google Scholar
Rhode WS, Roth GL, Recio-Spinoso A (2010) Response properties of cochlear nucleus neurons in monkeys. Hear Res 259:1–15
Article PubMed Google Scholar
Robles L, Ruggero MA (2001) Mechanics of the mammalian cochlea. PhysiolRev 81:1305–1352
CAS Google Scholar
Ruggero MA, Temchin AN (2005) Unexceptional sharpness of frequency tuning in the human cochlea. Proc Natl Acad Sci U S A 102:18614–18619
Article PubMed CAS Google Scholar
Ruggero MA, Rich NC, Recio A, Narayan SS, Robles L (1997) Basilar-membrane responses to tones at the base of the chincilla cochlea. J Acoust Soc Am 101:2151–2163
Article PubMed CAS Google Scholar
Salvi RJ, Giraudi DM, Henderson D, Hamernik RP (1982) Detection of sinusoidally amplitude modulated noise by the chinchilla. J Acoust Soc Am 71:424–429
Article PubMed CAS Google Scholar
Sayles M, Winter IM (2010) Equivalent-rectangular bandwidth of single units in the anaesthetized guinea-pig ventral cochlear nucleus. Hear Res 262:26–33
Article PubMed Google Scholar
Shera CA, Guinan JJ Jr (2003) Stimulus-frequency-emission group delay: a test of coherent reflection filtering and a window on cochlear tuning. J Acoust Soc Am 113:2762–2772
Article PubMed Google Scholar
Shera CA, Guinan JJ Jr, Oxenham AJ (2002) Revised estimates of human cochlear tuning from otoacoustic and behavioral measurements. Proc Natl Acad Sci U S A 99:3318–3323
Article PubMed CAS Google Scholar
Shera CA, Guinan JJ Jr, Oxenham AJ (2010) Otoacoustic estimation of cochlear tuning: validation in the chinchilla. J Assoc Res Otolaryngol 11:343–365
Article PubMed Google Scholar
Spagnoli SD, Saunders JC (1987) Threshold sensitivity and frequency selectivity measured with round window whole nerve action potentials in the awake, restrained chinchilla. Otolaryngol Head Neck Surg 96:99–105
PubMed CAS Google Scholar
Van Heusden E, Smoorenburg GF (1981) Eighth-nerve action potentials evoked by tone bursts in cats before and after inducement of an acute noise trauma. Hear Res 5:1–23
Article PubMed Google Scholar
Wang B (1979) The relation between the compound action potential and unit discharges of the auditory nerve. Massachusetts Institute of Technology, Cambridge, p 219
Google Scholar

Download references

Acknowledgments

This study was supported by Research Fund KU Leuven (OT/09/50 and BIL11/12 and senior fellowship to LR). DK was supported by a Marie-Curie IEF Fellowship (GA 221755); LR was partly funded by Fondecyt 1080227. We thank EA. Lopez-Poveda for valuable comments on a previous version of the manuscript and A. Eustaquio-Martin for providing us with the algorithm to fit the ROEX filter shapes to the experimental data.

Author information

Authors and Affiliations

Laboratory of Auditory Neurophysiology, KU Leuven Medical School, Campus Gasthuisberg O&N 2, Herestraat 49 bus 1021, 3000, Leuven, Belgium
Eric Verschooten, Luis Robles, Damir Kovačić & Philip X. Joris
Program of Physiology and Biophysics, Universidad de Chile, Independencia 1027, Santiago, Chile
Luis Robles

Authors

Eric Verschooten
View author publications
You can also search for this author in PubMed Google Scholar
Luis Robles
View author publications
You can also search for this author in PubMed Google Scholar
Damir Kovačić
View author publications
You can also search for this author in PubMed Google Scholar
Philip X. Joris
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Philip X. Joris.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Verschooten, E., Robles, L., Kovačić, D. et al. Auditory Nerve Frequency Tuning Measured with Forward-Masked Compound Action Potentials. JARO 13, 799–817 (2012). https://doi.org/10.1007/s10162-012-0346-z

Download citation

Received: 15 December 2011
Accepted: 31 July 2012
Published: 05 September 2012
Issue Date: December 2012
DOI: https://doi.org/10.1007/s10162-012-0346-z

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Auditory Nerve Frequency Tuning Measured with Forward-Masked Compound Action Potentials

Abstract

Similar content being viewed by others

Estimation of Cochlear Frequency Selectivity Using a Convolution Model of Forward-Masked Compound Action Potentials

Estimating Cochlear Frequency Selectivity with Stimulus-frequency Otoacoustic Emissions in Chinchillas

Tuning of SFOAEs Evoked by Low-Frequency Tones Is Not Compatible with Localized Emission Generation

Introduction