Loudness recruitment is an abnormal relationship between sound intensity and perceived loudness that may accompany sensorineural hearing loss (SNHL; Moore 2002). In this condition, the audibility threshold is elevated, but a given increase in intensity above threshold causes significantly greater than normal increase in loudness. The rate of loudness growth in the impaired ear is often so steep that, at high intensities (60–100 dB SPL, e.g., Stillman et al. 1993), the loudness in the impaired ear matches that in the normal one. Loudness recruitment has been primarily studied in humans (e.g., Miskolczy-Fodor 1960; Hellman and Zwislocki 1981; Moore et al. 1985; Zeng and Turner 1991) and has also been demonstrated in behavioral animals with SNHL (Pugh et al. 1979).

The physiological mechanisms of loudness recruitment are not well understood. In this study, we assume that overall sound intensity or loudness is encoded by the discharge rates of auditory nerve (AN) fibers (Sachs and Abbas 1974; Smith 1988; Palmer and Evans 1982; Winter et al. 1990). One hypothesis of the physiological origin of loudness recruitment is that the rate-level functions (RLFs) of AN fibers are steeper in SNHL, as was observed with ototoxic damage (e.g., Harrison 1981). This hypothesis is consistent with the observation that the velocity–intensity relationship of the basilar membrane becomes steeper after the loss of outer-hair cell (OHC) amplification (Ruggero and Rich 1991). However, with acoustic trauma (AT), OHC loss can be accompanied by inner hair cell damage (Liberman and Dodds 1984), which compromises cochlear transduction and lowers the firing rates of AN fibers (Liberman and Kiang 1984; Sewell 1984). As a result, the slopes of their RLFs become shallower on average (e.g., Salvi et al. 1983; Heinz and Young 2004). In cats following AT, the summed RLFs of AN fibers over either a narrow or wide range of best frequencies (BFs) surrounding the stimulus frequency do not show abnormally steep slopes, a result inconsistent with loudness recruitment (Heinz et al. 2005).

Adding to this paradoxical situation is the hyperactivity seen in central auditory neurons after SNHL, which is characterized by an abnormally rapid increase of response with stimulus intensity or by a heightened maximum response or both (e.g., Salvi et al. 2000; Langers et al. 2007). This central hyperactivity qualitatively resembles loudness recruitment and thus is a potential neural correlate. It has been suggested that central hyperactivity reflects a homeostatic plasticity, which aims at maintaining a relatively constant level of neural activity in the face of reduced AN inputs (Schaette and Kempter 2006). However, there are a number of unanswered questions, including at which stage(s) of the central pathway the plastic changes occur, in which populations of neurons, and how the changes interact with peripheral changes to produce enhanced central responses. The relationships between neuronal types and alterations in response properties after permanent SNHL have not been systematically mapped out in central auditory neurons. Most previous studies used evoked potential recordings (e.g., Saunders et al. 1972; Salvi et al. 1990), which did not allow identification of the neuronal sources of changes.

This paper describes the RLFs of single neurons in the ventral cochlear nucleus (VCN) of cats following AT. Neurons recorded in VCN can be classified into several types based on temporal discharge patterns in response to tones. These neuronal types differ in morphology and function (reviewed by Young and Oertel 2004). In this study, we analyze RLFs summed over various populations of VCN neurons and show that the responses of non-primary-like types of neurons, especially choppers, show changes in rate responses consistent with loudness recruitment, whereas primary-like neurons do not, except in cases where spread of excitation in the population is included.

Materials and methods

Acoustic trauma

The animal protocols in this study were approved by the Johns Hopkins University Animal Care and Use Committee (Protocol number CA05M257). Twenty-one adult male cats obtained from Liberty Labs were divided into an AT group (12 animals) and a normal-hearing control group (nine). Before sound exposure, the animal was checked to have clear outer and middle ears with no signs of infection. The animal was anesthetized with ketamine (35–45 mg/kg, i.m.) and xylazine (0.5–0.7 mg/kg, i.m.), supplemented with half-sized doses to maintain an areflexive state, and exposed to a narrowband noise (50 Hz bandwidth centered at 2 kHz, 111–112 dB SPL) for 4 h.

A recovery period between 33 and 121 days (median, 37 days) was allowed between the exposure and the neural recording. The length of the recovery was not an intended variable in this experiment, but three animals had recoveries over 40 days (60, 72, and 121 days) because of scheduling problems. There was a tendency for RLFs to be less non-monotonic in these animals, suggesting reduced inhibition, but it was statistically significant (χ 2 = 10.1, P = 0.04, 4 df) only for the primary-like neurons. There were no significant changes in RLF slopes. The data from these three animals were grouped together with other data in the analyses described below.

Surgical preparation and CAP audiograms

For neural recording, the animal was anesthetized with ketamine and xylazine as above. Atropine (0.03 mg kg−1 day−1, i.m.) and dexamethasone (1 mg/kg, i.m.) were given, and a trachea tube was inserted. Body temperature was maintained at 38.5°C, and hydration was maintained with an i.v. drip of lactated Ringer’s solution. Aspiration under visual control was used to make a complete section of the brain between the superior colliculus and the thalamus. Anesthesia was discontinued after the completion of the decerebration. After the experiment, the animal was euthanized by i.v. injection of sodium pentobarbital (∼800 mg to effect).

The bulla was vented through an ∼1-m length of PE-90 (or similar) tubing. Two types of speakers were used in the closed acoustic system: an electrostatic speaker (levels up to ∼100 dB SPL) and a dynamic speaker (up to ∼130 dB SPL). In situ calibration was performed with a calibrated probe microphone (B&K 4134) placed 2 mm from the tympanic membrane. The amplitude–frequency responses of both speakers varied less than ±5 dB in the frequency range between 0.2 and 20 kHz. An unexpected acoustic noise originating from the power amplifier driving the dynamic speaker (Crown D-75A, Crown Int. Inc., Elkhart, IL) was discovered after it had been used in experiments on six deafened animals. Measurements indicated that the acoustic noise had a nearly flat spectrum spanning 0.3 to 4 kHz and a spectrum level of −7.7 dB re 20 μPa/√Hz. All neurons recorded from the six animals had thresholds for broadband noise (BBN) at least 20 dB above this spectrum level, except for three neurons, whose BBN thresholds were 5–10 dB above the level of the speaker noise. These three neurons were discarded and not used in the analysis. The dynamic speaker was not used after the discovery of this noise.

Before the beginning of single-neuron recording in the VCN, a compound action potential (CAP) audiogram was constructed as a measure of AN thresholds. The CAP stimuli were 10-ms tone pips with 1-ms rise-and-fall times repeated at a rate of 10 Hz. Frequencies approximately logarithmically spaced between 0.2 and 20 kHz were presented. For frequencies up to 3 kHz, the amplitude of the CM was reduced by reversing the phase of the tone on alternating presentations and by using band-stop digital filtering (Cai 2007). Threshold was interpolated from measurements at three consecutive near-threshold levels (5 dB steps), as the level needed for a CAP amplitude of two standard deviations of the background electrode signal plus its mean. The CAP audiograms of noise-exposed animals showed substantial threshold elevations at frequencies below 6–16 kHz, as shown in Figure 1A. At higher frequencies, threshold shifts were smaller or non-existent. The edge frequency of the audiogram of an exposed animal was defined as the logarithmic center frequency of the right-most segment in the down-sloping portion of the audiogram (i.e., at the upper frequency edge of the threshold shift).

FIG. 1
figure 1

CAP audiograms in the normal and noise-exposed experiments. A The light lines show CAP audiograms from individual experiments, and the heavy lines are averages for the normal (dashed lines) and exposed (solid lines) animals. The vertical bar shows the frequency of the noise exposure. B CAP audiograms from individual experiments shifted to align their edge frequencies with the geometric mean value (11.3 kHz). Audiograms from normal animals are also aligned at 11.3 kHz.

Single-neuron recording and classification

For recording, a portion of the cerebellum was aspirated to expose the dorsal surface of the cochlear nucleus. Custom-made glass-coated platinum–iridium electrodes were used to record from VCN neurons. The electrodes were aligned in the parasaggital plane and lowered into the VCN directly or through part of the DCN under visual control. After a single neuron was isolated, its BF and threshold were determined, usually by constructing a tuning curve as the sound level required to produce a one spike increase in rate in response to a 50-ms tone burst (Kiang et al. 1970). To account for the downward shift of apparent BFs in neurons with substantial threshold shifts and broadened tuning, BFs were set at the low-frequency edge of the high-slope high-frequency portion of the tuning curves of those neurons (Liberman 1984). Tuning quality was estimated as usual by the 10-dB quality factor (Q10), defined as the BF divided by the width of the tuning curve 10 dB above threshold.

Responses to 50-ms BF-tone bursts (5-ms rise-fall ramps, repeated at 4/s) at 30 (and sometimes also 20) dB above threshold were acquired and a peri-stimulus time histogram (PSTH) was constructed. VCN neurons in both unexposed and exposed ears were classified into primary-like (Pri), primary-like-with-notch (PriN), chopper (Ch), onset (On), locker (Lckr), and unusual (Unsl) types, according to the shapes and quantitative measures of the PSTHs (Rhode and Smith 1986; Young et al. 1988); the classification was based on the decision tree given by Blackburn and Sachs (1989). Regularity of discharge and first-spike latency also differed consistently among the response types and were used as an aid to classification. The Pri and PriN neurons are sometimes grouped together into a single primary-like category, called PL below. Three subgroups of choppers were defined (Ch-S, Ch-T, and Ch-L), as in Bourk (1976) and Blackburn and Sachs (1989). Data from the subgroups are not discussed separately in this paper because they did not differ in the measures used; the results from these subclasses were combined in the average measures discussed below and reported as a single class of choppers.

Pri PSTHs are characterized by a relatively high firing rate near onset and a gradual and smooth rate decrease (adaptation). PriN neurons share these properties except for a brief cessation of firing (a notch) immediately following the onset peak. Chopper neurons exhibit multimodal PSTH shapes near the onset of the stimulus, which reflect the regular discharge (chopping) in these neurons. Onset neurons are characterized by a high instantaneous firing rate at short latencies after stimulus onset followed by little or no sustained firing (rate <100 spikes/s). Locker neurons are low-BF (<1 kHz) neurons whose PSTHs cannot be classified in the above categories because of strong phase-locking. Unusual-type neurons show PSTHs that cannot be described by any of the three above-mentioned features and/or cannot satisfy the contingencies in the Blackburn–Sachs decision tree to reach a response type. For example, a few neurons with primary-like PSTH shapes showed extraordinarily long first-spike latencies and were categorized as unusual. In some neurons, PSTH data were not obtained; these are listed as “unknown.”

Stimuli for rate analysis

After the completion of PSTH recording, the following stimuli were presented over a range of sound levels in 1-dB ascending steps: (1) tones at the BF of the neuron and at 1 and 2 kHz; (2) tones with frequencies spaced at 0.4-octave intervals around the BF (e.g., −6.0, −5.6, −5.2,…, 3.6 octaves); (3) BBN with an approximately flat spectrum between 0.2 and 50 kHz. The above stimuli had 200-ms durations and 10-ms rise/fall ramps and were spaced by 800-ms silent intervals. The range of sound levels extended from below the rate threshold to either the maximum level of the sound delivery system or the maximum level at which single-neuron isolation was unaffected by neurophonic responses (typically 80–100 dB SPL for frequencies between 0.5 and 6 kHz). Each level was repeated one to three times.

Data analysis

The counting window for calculating driven firing rates was between 10 ms post-stimulus onset to 10 ms post-stimulus offset, except for onset neurons, for which the counting window extended from stimulus onset to offset. The 10-ms delays were used to compensate for neural latencies, given a 10-ms stimulus rise time, plus the time delays in our acoustic system. Rates were computed as the number of spikes during the counting window divided by its duration; rates were averaged over all repetitions of each stimulus level and were smoothed (5-dB triangular window) before subsequent calculations. The threshold of a RLF was determined as the first of five consecutive sound levels at which the discharge rates rose above or fell below the 50% confidence interval for spontaneous rate (SR) based on a Poisson-counts assumption.

The slopes of the rising portion of RLFs were calculated two ways: (1) as the chord slope from threshold to the rate 20 dB above threshold and (2) using a spline fit to the RLF (Ertel and Fowlkes 1976). Both methods of slope estimation have drawbacks: the spline fit often works well but requires some hand tuning; the chord-slope often underestimates the actual RLF slope. Nevertheless, the spline and chord slopes were correlated (R 2 between 0.47 and 0.73 for different neuron types, P < 10−11 for all neuron types) and gave qualitatively similar results. The results of this paper do not depend on these slope estimates, which were done for comparison with previous work on the AN (Heinz and Young 2004); a problem with the slope estimates is that they are confounded by changes in maximum discharge rate and therefore difficult to interpret. The spline fits were used as a data reduction method based on which the RLFs could be classified into monotonic or non-monotonic. The following two features of the spline fit were used as the criteria for a non-monotonic shape: (1) firing rates below the 50% confidence interval of the spontaneous rate at levels above the threshold or (2) a decreasing slope located at levels above the threshold and spanning more than one eighth of the total level range (Cai 2007).

Pseudopopulation analysis

It is not possible to record large number of neurons in the cochlear nucleus with our methods, as can be done for example in constructing population responses in the AN (Pfeiffer and Kim 1975; Sachs and Young 1979). For this reason, we adopted the approach of taking as much data as possible from each neuron and studying fewer neurons. We were able to study an average of 10–12 VCN neurons per animal; in each neuron, data were taken at a number of different tone frequencies, as described above. To produce population estimates, we pooled neurons across multiple animals and used a pseudopopulation approach to study the RLFs. This method has been used previously in the CN (May et al. 1998). Frequency was expressed as the octave difference between BF and the tone frequency. Then, data were combined, ignoring the absolute BF and tone frequency and considering only the BF relative to the tone frequency. Figures 6, 7, 8, and 9 were constructed in this way, with the BF of the neuron expressed in octaves relative to the tone frequency. Each neuron usually contributed several RLFs to the average. For example, the data points at 1.2 octave on the abscissae of Figure 6 are averages of all the cases in which tone frequency was 1–1.4 octave below BF (e.g., a 2 kHz tone presented to a 4.6-kHz neuron or a 3.2-kHz tone presented to a 7.4-kHz neuron, etc.). Data were included only if both the BF and the stimulus frequency were below the CAP-audiogram edge frequency.

The pseudopopulation approach ignores the dependence of the properties of RLFs on BF. The assumption is supported by the fact that slopes of BF-tone RLFs did not vary significantly with BF in AN fibers in normal-hearing ears (Heinz and Young 2004). However, the thresholds of RLFs did vary systematically with BF. To correct for this effect, we defined an adjusted sound level scale dubbed dBP (“P” for “pseudopopulation”) according to the following equation:

$$L_{\text{P}} = L_{{\text{SPL}}} - \Theta _{{\text{animal}}} \left( {{\text{BF}}} \right) + \Theta _{{\text{avg}}} \left( {2{\text{ kHz}}} \right)$$

where L P and L SPL are, respectively, the adjusted level and the original SPL level; Θ animal (BF) is the CAP threshold of the animal at the BF of the neuron (in dB SPL); and Θ avg (2 kHz) is the 2-kHz threshold (also dB SPL) of the average CAP audiogram of the corresponding animal group (exposed or unexposed). This adjustment expresses sound level as dB relative to the CAP threshold in the same animal [L SPL − Θ animal (BF)], corrected to the average CAP threshold at 2 kHz. The subtraction of CAP threshold at BF eliminates systematic threshold variation with BF and AT and is analogous to expressing threshold in terms of SL, i.e., dB re threshold. Adding the average 2-kHz threshold in normal or exposed animals restores the average threshold shift and expresses L P in terms comparable to absolute sound level; this correction allows meaningful binaural balance calculations to be done, described below.

Weighted averages of the RLFs were calculated both within a PSTH type and then across different PSTH types. Because the relative numbers of different response types studied in this paper may not be representative of the relative sizes of the actual populations of VCN neurons, the averages were corrected using estimates of the relative fractions of different response types from previous studies, based on counts of anatomical cell types and counts of PSTH response types (Osen 1970; Bourk 1976; Cant and Morest 1984; Blackburn and Sachs 1989; Melcher 1993; Doucet and Ryugo 2006). The relative counts are as follows: Pri/PriN/ChS/ChT/ChL/On/Locker/Unusual = 1/0.21/0.39/0.59/0.097/0.12/0.53/0.36 (Cai 2007); that is, for example, there should be 21% as many PriN as Pri neurons and 12% as many onset neurons as Pri neurons. For the data shown in this paper, these weights were applied when averaging the Pri and PriN subpopulations to construct the overall primary-like (PL) group and to the ChS, ChT, and ChL subpopulations to construct the overall chopper group. All the categories were used in computing the estimates in Figure 6. The same sets of weights were used in the normal-hearing and exposed groups.

To estimate the uncertainty of average RLFs and rate-matching slopes, we performed bootstrapping on the datasets. For the pseudopopulation datasets, the bootstrapping procedures were based on repeated re-sampling of neurons in different hearing states and PSTH types. Five-hundred-time bootstrapping was done for each case.

Statistical comparisons in this paper are based on two-tailed Wilcoxon rank-sum tests on a significance level of 0.05, unless stated otherwise.


CAP audiograms and edge frequencies

Data were obtained from 12 noise-exposed and nine normal-hearing cats, whose CAP audiograms are shown in Figure 1A. CAP audiograms were used in this study in lieu of recordings from AN fibers to characterize the degree of damage to the cochlea. For both the individual-experiment and the population-average audiograms, threshold shifts were greatest within two octaves of 2 kHz (the shaded bar in Fig. 1A), near the frequency of the exposure noise. At lower and higher frequencies, threshold shifts generally decreased.

Although the extent and frequency span of threshold shifts varied between individual exposed animals, a sharply down-sloping portion at the high end of the threshold-elevated frequency region was invariably present, based on which the boundaries (edge frequencies) of the threshold-elevation could be defined, as described in “Materials and methods.” The edge frequencies of the exposed animals ranged from 4.2 to 17.3 kHz, with a geometric mean of 11.3 kHz, which was arbitrarily chosen as the edge frequency of all the normal-hearing animals. In Figure 1B, the CAP audiograms are aligned at their edge frequencies, with frequencies expressed on a virtual frequency axis as octaves re the edge. It can be seen that the audiograms align reasonably well at the edge frequencies and that the virtual frequency axis provides an effective way to distinguish frequency ranges with and without consistent threshold elevations. Subsequent analysis of RLFs will be based on neurons with BFs below the edge. The amount of threshold shift below the edge frequency varied over about a 20-dB range. To compensate for this variation, sound level was expressed as dB relative to threshold in various ways, described below.

Classification and quantification of PSTHs

The major PSTH types of VCN neurons in normal-hearing cats were found in noise-exposed animals as well. Most VCN neurons in noise-exposed animals could be readily classified into Pri, PriN, Chopper, Onset, Locker, and Unusual types according to the decision tree given by Blackburn and Sachs (1989). Table 1 summarizes the numbers of different types of VCN neurons recorded from normal and exposed animals, as well as the distributions of their BFs relative to the edge frequencies. No PSTH types seen in the normal animals were missing in exposed animals nor was there any evidence for well-defined abnormal PSTH categories in exposed ears. The “unusual” response type did not become more common, for example (χ 2 = 0.08, N.S.). The distribution of response types was similar in normal and exposed animals, except that there were relatively fewer onset neurons and more Pri neurons at frequencies below the edge in exposed animals (χ 2 = 12.2, P = 0.002, 2 df).

TABLE 1 Numbers of neurons in different PSTH categories, counted separately for neurons below and above the edge frequency and for normal and exposed ears

The qualitative similarity of PSTH patterns in normal and exposed animals begs the question of whether AT altered any quantitative aspects of the responses. A consistently observed difference in normal VCN is first-spike latency, which is ∼1 ms shorter in primary-like neurons than in chopper neurons (Young et al. 1988; Blackburn and Sachs 1989); this difference was also observed in this study after AT (P = 4 × 10−5). In addition, we compared the sustained discharge rate, the coefficient of variation of interspike intervals, the standard deviation of first-spike latencies, and the rate of change of first-spike latency with level. At the levels tested, none of these measures differed significantly between the normal and exposed groups (not shown), except that the maximum sustained discharge rate was significantly higher in chopper neurons; changes in discharge rate are considered in “Tone rate-level functions,” “Spread of excitation,” and “Rate-matching curves.”

The BF thresholds of neurons in normal and exposed ears are shown in Figure 2, in the form of threshold relative to the average normal CAP threshold at the neuron’s BF (Fig. 1A), plotted against frequency relative to the edge frequency. Neurons of different response types showed similar threshold shifts, as expected if the threshold shifts were of cochlear origin. The single-neuron thresholds in normal animals (unfilled symbols) were on average below the CAP threshold because a suprathreshold response is necessary to achieve the CAP threshold criterion. In individual animals, the single neuron thresholds varied with BF in essentially the same way as the CAP thresholds (Cai 2007).

FIG. 2
figure 2

Thresholds at BF of neurons of various response types plotted against BF relative to the edge of the CAP audiogram. The thresholds are presented as dB relative to the thresholds in the average CAP audiogram of the normal group (Fig. 1). Symbols identify neuron types; these are the same as in subsequent figures. Filled symbols are from exposed animals, and unfilled symbols are from normal animals.

One hypothesis for recruitment is a compression in the range of neural thresholds following cochlear damage (Zeng and Turner 1991; Moore et al. 1985); that is, when thresholds are elevated, the distribution of neural thresholds is compacted by the reduction in the possible range of threshold values, thus reducing the range of sound levels over which the summed rate of a population of auditory neurons can change, leading to recruitment. This hypothesis was not supported by AN data (Heinz et al. 2005); in the present data set, the range of thresholds was estimated as the standard deviation of neural thresholds relative to the CAP threshold at the neuron’s BF. Locker neurons were not included in this calculation because of their low BFs but do not change the result if included. The resulting standard deviations are 9.5 dB (N = 50) in normal animals and 11.4 dB (N = 71) in exposed animals. Thus, the hypothesis of threshold range compression is not supported by these cochlear nucleus data.

Sharpness of tuning

In normal-hearing ears, the tuning curves of VCN neurons were similar to those of previously described AN fibers in terms of shape, threshold, and tuning quality. Figure 3A shows the 10-dB quality factors (Q10s) of normal (unfilled symbols) and exposed (filled symbols) VCN neurons. Most neurons from normal ears fell into the 95% confidence interval of Q10s of normative AN populations from Miller et al. (1997) and Ji (2000) (dashed lines), but many neurons from exposed ears had Q10s below the normal range.

FIG. 3
figure 3

Q10 values of VCN neurons are similar to those of AN fibers in both normal and noise-exposed ears. A A summary of tuning-curve Q10s of neurons in the normal (unfilled symbols) and noise-exposed (filled symbols) populations. The gray bar indicates the frequency of the exposure noise. The upper and lower limits (5% and 95% quantiles) of Q10s of normal AN fibers from Miller et al. (1997) and Ji (2000) are overlaid for comparison (dashed lines). The solid line shows the moving average of Q10 versus BF for the control group. B Relative Q10 (normalized by the average Q10 at the same BF in the normal-hearing population) versus threshold relative to the average normal CAP audiogram (Fig. 1). The horizontal dashed line indicates the average normal Q10 values. The horizontal dash-dotted line (at 0.55) is the boundary between sharp and broad tuning used by Heinz and Young (2004), chosen to be the fifth percentile of Q10s in normal AN fibers. The ellipse shows the region occupied by AN fibers with moderate to severe loss from Figure 5 of Heinz and Young 2004 for comparison. The ellipse was shifted along the abscissa by 28 dB to account for the difference between the mean CAP threshold at 2 kHz (23.5 dB SPL) and the threshold reference used by Heinz and Young at 2 kHz (−4.5 dB SPL).

To show the relationship of Q10 and threshold, we defined relative Q10 as Q10 divided by the moving average of Q10s of normal VCN neurons at the same BF (the solid line in Fig. 3A). The relative Q10s are plotted against threshold shift (the threshold minus the average normal CAP threshold from Fig. 1A) in Figure 3B. In the exposed neurons, there was a general trend for neurons with greater threshold shifts to have lower Q10 values (P = 5 × 10−6, first-order linear regression). The tuning exhibited little dependence on PSTH type. For comparison, the ellipse shows the region occupied by the AN fibers with moderate to severe hearing loss in a similar plot in Heinz and Young (2004). The overlap in the AN and VCN data suggests that the cochlear damage is similar in the two sets of exposed animals. Given this consistency, we will compare post-trauma alterations in RLFs between our VCN neurons and the AN fibers of Heinz and Young (2004).

Tone rate-level functions

Tone rate-level functions were recorded in 1-dB steps at multiple frequencies including BF. Figure 4A shows the driven rates, meaning total rate minus SR, versus sound level for tones at BF in the Pri and PriN populations. Normal and exposed populations are indicated by the dashed and solid lines, respectively. These data are from neurons with BFs below the CAP edge frequency, so the neurons from exposed ears all had substantial threshold shifts. As can be seen in Figure 4A, PL neurons showed a mixture of RLF shapes. The majority showed a monotonic rate increase with level over the first 20–30 dB above threshold, followed by a decrease in slope or saturation. Of the normal PL neurons 45.5%, predominantly PriN, showed non-monotonic RLFs; there was no significant difference in the number of PL neurons with nonmonotonic RLFs in the exposed and unexposed animals (P = 0.4, Fisher’s exact test).

FIG. 4
figure 4

Primary-like neurons show a decrease in rate responses following AT. RLFs from Pri and PriN neurons are shown. Data in normal and exposed animals are identified by line style, defined in the caption. A BF-tone RLFs as driven rate (rate–SR). B Average firing rate versus level re threshold. The individual RLFs in A were aligned at their thresholds (vertical line) and averaged. The circles and error bars show mean ± 1 SE of rates at 30 dB re threshold.

Visual inspection of Figure 4A suggests that threshold elevation was accompanied by diminished driven firing rates in PL neurons. To investigate this question, Figure 4B shows average RLFs for the normal and exposed PL population; the RLFs were aligned on threshold before averaging. Both average RLFs show a sigmoid shape with a dynamic range of approximately 20 dB but different rates of saturation. The mean saturated rates at 30 dB above threshold are 168 and 102 spikes/s in the normal and exposed PL populations, shown by the circles and error bars in Figure 4B (SEMs; P = 0.08). The slopes of the BF RLFs of individual fibers were significantly different, slightly steeper for normal neurons (mean ± 1 SD is 7.9 ± 4.4 spikes/s for normal and 4.7 ± 2.7 spikes/s for exposed neurons, P = 0.03). Slopes at frequencies away from BF were not significantly different (P = 0.2, not shown).

The same measures applied to the chopper responses are shown in Figure 5. The chopper neurons showed a substantial increase in saturation driven rate in exposed animals. In the averaged rate functions (Fig. 5B), the saturation rates are 357 spikes/s in the exposed versus 234 spikes/s in unexposed animals (P = 0.02). In addition, the fraction of non-monotonic BF-tone RLFs was significantly smaller in the exposed animals (28.6%) than in the unexposed animals (83.3%; P = 0.007, Fisher’s exact test). The slopes of BF RLFs were significantly larger in exposed animals (10.1 ± 3.9 in normals versus 14.9 ± 4.8 spikes/s in exposed ears, P = 0.02), but the slopes at frequencies away from BF did not differ significantly (P = 0.1).

FIG. 5
figure 5

Chopper neurons show an increase in rate response after AT. RLFs for BF tones in chopper neurons arranged as in Figure 4.

The results from other PSTH types are not presented separately. However, they are generally consistent with the results shown in Figure 5 (Cai 2007) in that onset and unusual neurons showed BF-tone RLFs that were, on average, steeper than normal following sound exposure and saturated at higher rates. The low-BF Locker neurons showed a bimodal distributions of slopes, which presumably reflects a mixture of PL and non-PL neurons in this category.

Spread of excitation

Spread of excitation in the neural population is often considered to be an important parameter in the growth of loudness. Abnormally fast spread of excitation due to broadened tuning in impaired ears is one possible correlate of recruitment (Kiang et al. 1970; Evans 1975). Although spread of excitation in the AN array does not seem to be an important effect (Heinz et al. 2005), the properties of spread of excitation could be different in central populations, so estimates of the spread of excitation in VCN were derived here (Fig. 6). This analysis shows average normalized rate versus BF in octaves re the tone frequency. Rates were collected in 0.4 octave wide frequency bins, the approximate critical bandwidth in the cat (Pickles 1979). Because it is the activation of neurons that is important in spread of excitation, and not the overall discharge rate, the firing rates were normalized for each response type by the average driven rate in unexposed ears for a BF tone at 30 dB re threshold (e.g., the circles in Figs. 4B and 5B) and then combined across neuron types using the weights described in “Materials and methods.” This prevents response types with high discharge rates (e.g., choppers in exposed ears) from dominating the plot. Thus, Figure 6 shows the average level of activation across PSTH types and not the average or overall rate in the pseudopopulation.

FIG. 6
figure 6

Spread of excitation is larger following AT. A Equal-level rate contours for the tonal pseudopopulation in normal animals. The rates are average normalized driven firing rates of all VCN response types, computed in 0.4 octave frequency bins, as described in the text. The numbers show sound levels in dBP. The heavy gray line shows the 60 dB SPL contour for AN fibers from Heinz et al. (2005), Figure 4, normalized in the same way. This level was chosen for comparison because it shows a clear shift in the BF at which the peak rate occurs. B Same for the exposed preparations. The heavy gray lines show 80 and 90 dB contours for AN fibers in preparations with “mild” and “moderate” hearing loss, as defined in Heinz et al. (2005). Locker neurons are not included in this figure because their BFs are far from those of other neuron types.

As expected, the degree of activation spreads as sound intensity increases. Comparing the normal (Fig. 6A) and exposed (Fig. 6B) populations, the difference in threshold is evident. There is also a difference in spread of activation in that the whole population was activated at 60–70 dBP in the exposed population, about 20–30 dB above threshold, whereas activation did not spread across the whole population until 60 dBP (>50 dB re threshold) in the normal population, which implies that spread of excitation was two to three times faster in the exposed ear than in the unexposed one.

Comparison AN data at moderate to high levels are shown by the heavy gray lines (from Fig. 4 of Heinz et al. 2005). Note that the broadening of the active population is not as prominent in VCN as in the AN, especially in the exposed population (Fig. 6B), and the shift in the peak of the curves toward BFs above the tone frequency is smaller or non-existent in the VCN, consistent with iso-level rate plots in previous studies (Rhode and Smith 1986; Rhode and Greenberg 1994). In this study, a shift in the response peak did not occur up to 70 dBP in the normal population (Fig. 6A), whereas in the AN fibers, it was seen at levels of 50–60 dB SPL in both normal and impaired populations. These levels correspond to ∼50 dB above threshold in the normal AN fibers and 20–30 dB above threshold in fibers following AT. Thus, the peak of the rate plots is more stable in VCN than AN.

Rate-matching curves

The summed firing rate of a population of auditory neurons is usually considered to be the neural correlate of perceptual loudness, although how summed rates translate to loudness is not clear (Pickles 1983; Relkin and Doucet 1997; Heinz et al. 2005). To compare the behaviors of VCN population rates and loudness, we estimated population firing rates using the pseudopopulation method described in “Materials and methods.” Evidence has been offered that the population involved in encoding the level of a tone may not be the entire BF range, but only BFs near the tone frequency (Hellman 1978; Moore et al. 1985; reviewed by Smith 1988), so the population firing rate was estimated for several BF ranges and also separately for the VCN neuron types.

In psychophysical studies of loudness recruitment, a frequently used paradigm is loudness balance between the two ears. In this paradigm, the loudness in one ear with a near-normal threshold is adjusted (by varying sound level) to match the loudness in an impaired ear across a range of sound levels. The slope of the generated loudness matching curve quantifies the rapidity of loudness growth in the impaired ear relative to normal (Miskolczy-Fodor 1960; Stillman et al. 1993; Moore et al. 1985; Zeng and Turner 1991; Moore 2004). In this study, loudness balance was simulated by matching discharge rates in normal and exposed pseudopopulations. The calculations were done by interpolating in the two RLFs to find the sound levels that produce a series of rates between threshold and saturation (or the first turning point in either RLF).

Rate-matching curves for on-BF and all-BF encoding in three different populations are shown in Figure 7. In this figure, “on-BF” means that the BF was within ±0.2 octaves of the tone frequency and “all-BF” means that RLFs were included over all frequencies, as long as both tone frequency and BF were below the edge frequency. The left panel of each part of Figure 7 shows average RLFs for the normal (dashed lines) and noise-exposed (solid lines) populations. Bootstrapping was used to estimate the standard deviations of the rate estimates; ±1 SD confidence limits are shown as shaded regions for the on-BF pseudopopulations. The all-BF pseudopopulation results are shown without confidence limits, but the limits are similar to the ones shown for the on-BF data (the presence of shading also serves to separate the on-BF from the all-BF data). The rate-matching curves for on-BF and all-BF encoding are shown in the right panels, identified by line color, as in the legend. For comparison, the dashed line in the right panels shows a typical rate-matching curve based on the reported loudness-balance data in human observers (Miskolczy-Fodor 1960; Stillman et al. 1993; Moore et al. 1985; Zeng and Turner 1991; Moore 2004). This is a straight line that extends from a point defined by the thresholds in the normal and impaired ears to the point (100 dB, 100 dB), assuming that loudness is the same in normal and impaired ears at 100 dB. Linear regression was performed on the rate-matching curves to give their overall slopes, which are shown next to the lines. Note that 100 dB is a high level for the equal-loudness level; however, choosing a lower level would increase the slope of the dashed line, effectively increasing the slope that neural rate balance functions have to achieve to be considered consistent with recruitment. For the comparisons below, that change would make it easier to conclude that neural slopes are smaller than expected for recruitment and, in this sense, the choice of 100 dB is conservative.

FIG. 7
figure 7

Rate-balance functions from Ch, but not PL neurons resemble perceptual loudness-balance functions. In each section, the left plot shows the average rate for normal (dashed lines) and exposed (solid lines) pseudopopulations consisting of (1) neurons with BFs within ±0.2 octaves of the tone frequency (On-BF), shown with shaded bootstrap confidence limits (±1 SD), and (2) neurons of all BFs, shown as the average rate lines only. The circles on the on-BF functions show the thresholds (3 spikes/s). The right plots show the sound levels that give the same rates in the normal and exposed populations. Matches are shown for the on-BF and all-BF populations, identified by line shading, as in the legend in A. The numbers are the slopes of best-fitting lines for the rate matches over the range of sound levels shown. The dashed lines show predictions for typical recruitment functions in human observers (see text). A AN data from Heinz et al. (2005), reanalyzed with the pseudopopulation method. Rate-matching lines and slopes are shown for both pseudopopulation analysis (slopes, 0.54 and 1.46) and from the previous real-population analysis done by Heinz and colleagues (slopes, 0.56 and 1.20). B, C Average RLFs and the corresponding rate-matching curves for PL and chopper neurons.

Figure 7A shows a pseudopopulation response constructed from the AN data of Heinz et al. (2005). RLFs with tone frequencies and neuron BFs between 0.5 and 6 kHz (the most impaired BF region) from animals with moderate/severe SNHL were used. AN fibers were averaged within SR groups (low + medium, high) before combining those averages with weighting according to the expected numbers of fibers in each SR group from previous studies (0.4 and 0.6 respectively; Liberman and Dodds 1984). The resulting on-BF RLF has a smaller slope in impaired ears compared to normal, giving a rate-balance line with slope 0.54. The all-BF RLF for the impaired pseudopopulation has an accelerating slope that is smaller than normal at low levels but increases rapidly at high levels. The rate-matching function has a slope less than the recruitment line at sound levels within 20 dB of threshold, but the slope increases at higher levels, giving an overall slope of 1.46. For comparison, the rate-matching curves from the earlier real-population calculation of Heinz and colleagues are superimposed in the right-hand plot (slopes, 0.56 and 1.20). The two sets of rate-matching curves are essentially identical, suggesting that the pseudopopulation method gives accurate results for the rate-matching curves.

The behavior of the PL pseudopopulation (Fig. 7B) is similar to that of AN fibers, except that the standard deviations of the RLFs are larger. The rate matching slope (0.35) is again considerably smaller than the expected recruitment behavior in the on-BF population. The all-BF RLF accelerates considerably at high sound levels, giving a steep rate-matching function (slope, 3.37) beginning about 20 dB above threshold.

The all-BF rate matching function of the PL neurons, and to a lesser degree also the AN fibers, seems at first glance to be consistent with recruitment, in that steep slopes were obtained over most of the range of sound levels. However, in AN fibers, there is a low-slope segment near threshold, over the first 20 dB in the impaired ear. This low-slope portion must also be present in the PL neurons (Fig. 7B) because the on-BF rate-matching curve of the PL neurons has a shallow slope. But the low-slope portion in the all-BF curve is not plotted because the slow growth of the rate balance function over the first 10 dB above threshold was obscured by noise in the all-BF RLF. However, the rate-matching curve must begin at the point defining threshold in the two ears, 42 dB and 0 dB for the PL neurons (Moore 2004), meaning that the PL neurons’ rate-matching curve must include a low-slope portion similar to the on-BF rate matching curve.

The chopper neurons (Fig. 7C) showed summed RLFs, which were quite different from those of the AN fibers and PL neurons, in that the impaired population showed a steeper on-BF RLF than its normal counterpart. This steepened slope, together with the absence of rate saturation, caused the on-BF rate-matching curve to have a steep slope beginning at threshold (1.73), consistent with loudness growth in recruitment. The all-BF average rates are smaller than the on-BF ones but again have a steeper slope, giving a rate matching curve that also matches expected recruitment behavior (slope, 2.02).

Spread of excitation changes rate-matching for primary-like but not chopper neurons

On-BF and all-BF are the two extremes of the possible BF ranges of neurons participating in intensity encoding. Figure 8 shows how the slopes of the rate-match functions vary with the frequency window width of the analysis. The rate-matching computation was done for window widths from 0.4 to 6.8 octaves in 0.4-octave steps for the pseudopopulations analyzed in Figure 7. The BF window was always centered at the tone frequency. As in Figure 7, the slopes are often a compromise between shallow values near threshold and steep values at higher levels. Bootstrapping was performed for each pseudopopulation and each window width to generate estimates of uncertainty, given as ±1 SD error bars in Figure 8. The shaded region shows the range of loudness-balance slopes from several psychophysical studies (Miskolczy-Fodor 1960; Stillman et al. 1993; Moore et al. 1985; Zeng and Turner 1991; Moore 2004). Cases with threshold shifts between 30 and 50 dB were chosen, and the median (1.9) and interdecile range (1.2–2.8) of the slopes are plotted as the gray region. There is a tendency for the rate-matching slopes to increase from values less than the psychophysical data at narrow window widths to values consistent with or larger than the psychophysical values at wide window widths. The exception is the choppers that show recruitment-like slopes for all bandwidths. AN fibers have the lowest slopes across the range of bandwidths and barely reach the recruitment region for all-BF encoding.

FIG. 8
figure 8

The slopes of rate-matching functions change with the averaging bandwidth for AN fibers and PL neurons, but not Ch neurons. Slopes of rate-matching functions are shown for tonal pseudopopulations as a function of the range of BFs (relative to the tone frequency) averaged together. Results are given as mean ± 1 SD of 500 bootstrap repetitions of the calculation for the same three populations as in Figure 7, identified in the legends. The shaded band shows the range of slopes of loudness-matching curves in human subjects (publications cited in the text) with hearing losses comparable to those of the exposed cats. The AN fiber data are from a reanalysis using pseudopopulations of the data of Heinz et al. (2005).

Rate-matching curves for BBN show the same effects as for tones

Figure 9 shows the average RLFs and rate matching curves for responses to BBN. For noise, there is no meaningful definition of on-BF, so results are shown for the all-BF pseudopopulations only. In the left part of the figure, the average RLFs for primary-like and non-primary-like (Ch, On, and Unusual) VCN neurons are compared. The different PSTH types were weighted as in Figure 6 when averaging, but the rates were not normalized. The rate-matching curve of the PL neurons in the plot at right showed a shallow slope (0.60) similar to that of the AN fibers and PL neurons in Figure 7, whereas the non-PL neurons showed a steeper rate-matching slope (2.77), like the Ch neurons in Figure 7. This shows a clear recruitment-like effect in the non-primary-like neurons in VCN.

FIG. 9
figure 9

Rate-matching slopes for responses to BBN; these differ between PL and non-PL neurons as for the tone responses. The figure is drawn as in Figure 7, except that the recruitment prediction line is not shown because of uncertainty as to how recruitment should behave for BBN. Neurons are grouped into PL and non-PL pseudopopulations, and the plots show average RLFs (left) and rate-matching curves (right).


VCN response properties after acoustic trauma

Following AT, neurons in the VCN showed threshold shifts and broadened tuning over a wide range of frequencies centered on the exposure frequency (Figs. 2 and 3). The changes were qualitatively and quantitatively similar to those seen previously in AN fibers following similar exposures (Miller et al. 1997; Heinz and Young 2004). Moreover the VCN threshold shifts were consistent with the CAP thresholds in the same animals in that VCN neural and CAP thresholds varied similarly with frequency (Cai 2007). Beyond the threshold and tuning changes, the main effects of AT on VCN neurons were changes in the RLFs. It is important that there was no evidence for significantly altered PSTH properties in VCN neurons. The usual VCN response types were observed after AT, and their properties, such as regularity and latency, were not significantly altered. This allowed the analyses done in this study to be based on PSTH response types established in normal animals. It also makes it possible to assume that the relationships between response types and morphological cell types established in normal animals also hold in impaired animals.

The observation that all major PSTH types were still present after AT is consistent with previous intracellular studies, which showed that basic membrane properties of VCN neurons in animals with cochlear damage or ablation are similar to normal (Francis and Manis 2000; Wang and Manis 2005; Lu et al. 2007). Although changes in a number of parameters of postsynaptic ion channels have been reported in the medial nucleus of the trapezoid body after AT (Leao et al. 2004, 2005, 2006), in the CN it is clear that the important differences between so-called types I and II neurons (Oertel 1983) remain following AT. Type I neurons are stellate cells that show long time constants and good temporal integration; they respond to steady current injection with a maintained spike train. Type I neurons seem to correspond to choppers in vivo. Type II neurons are bushy cells, which have short time constants and fire only a few spikes at the onset of a current step. The difference is largely due to the presence of low-threshold potassium channels in type II membranes (Manis and Marx 1991; Rothman and Manis 2003). They correspond to PL neurons in vivo.

Rate functions of primary- and non-primary-like neurons after trauma

One interesting result of this study is the difference in the behavior of RLFs between PL and non-PL (especially chopper) neurons. In PL neurons, rates in response to BF tones were smaller following trauma (Fig. 4A, B); by contrast, chopper rates were substantially larger (Fig. 5A, B). Lonsbury-Martin and Martin (1981) previously showed that, in acutely deafened rhesus monkeys, a subpopulation of VCN neurons showed diminished rate-level responses, whereas another subpopulation showed steepened RLFs. However these neurons were not identified by response type.

PL responses are recorded from anatomical bushy cells, which are innervated by AN fibers through large secure synaptic endings (Ryugo 1992). At these synapses, postsynaptic potentials are large, and the time constants of the postsynaptic neurons are short, which prevents temporal summation of synaptic events (Oertel 1985; Raman and Trussell 1992; Isaacson and Walmsley 1996). Presumably, PL and AN RLFs were similar because the operating mode of these synapses is to produce a postsynaptic spike whenever there is a presynaptic spike, as demonstrated by their complex action potentials which contain both a pre- and a postsynaptic potential (Pfeiffer 1966; Bourk 1976; but see Kopp-Scheinpflug et al. 2002). Bushy-cell synapses are altered in morphology following cochlear damage (Ryugo et al. 1997; Redd et al. 2000), but the implications for synaptic transmission of these morphological changes are not clear. In fact, some evidence suggests that AN/VCN synapses are strengthened following cochlear degeneration (Oleskovich and Walmsley 2002). However, as long as the postsynaptic potentials produced in bushy cells by AN fibers are above threshold, bushy-cell RLFs should remain like those of AN fibers.

The discussion above applies strictly to spherical bushy cells. The nature of synaptic integration in globular bushy cells is not completely understood (e.g., Smith and Rhode 1987; Joris et al. 1994; Rothman and Young 1996; Young and Sachs 2008), but probably shares many features with spherical bushy cells because of similarities like irregularity of spike timing and strong phase locking. In this study, we did not see a difference in behavior between Pri and PriN neurons, suggesting that the effects of AT are the same in spherical and globular bushy cells. Overall, the present evidence suggests that bushy cells do not change their operating mode following AT, as shown in Figures 4 and 7B.

After AT, chopper neurons and some other non-primary-like VCN neurons exhibited enhanced rate responses (Fig. 6). Given the reduced rate responses of AN fibers, this observation suggests that post-trauma neuronal plasticity strengthened the effects of AN synapses on these neurons. The morphological correlate in VCN of non-primary-like PSTH types is the stellate or multipolar cell plus the octopus cell (Rhode et al. 1983; Smith and Rhode 1989), although it is unlikely that many octopus cells are included, based on PSTHs. Normally, AN synapses on stellate cells produce subthreshold postsynaptic responses (Oertel 1985; Ferragamo et al. 1998; Golding et al. 1995), which require temporal summation of several EPSPs to exceed threshold. Thus, the discharge rates of stellate cells are likely to be affected sensitively by changes in either the strength of AN synapses or the postsynaptic processing. AN synapses on stellate cells have been shown to undergo less severe morphological changes than bushy synapses in congenital deafness (Redd et al. 2002). Because the basic postsynaptic properties of these neurons seem not to change after cochlear damage (Francis and Manis 2000), they probably still rely on temporal summation for spiking after AT. The increased discharge rate could be produced by various effects including (1) an increase in postsynaptic resistance (Fancis and Manis 2000), (2) changes in postsynaptic excitability or the density of postsynaptic receptors, (3) a decrease in inhibitory inputs to the cells, (4) presynaptic changes that strengthen the synapse (Oleskevich and Walmsley 2002), or (5) the formation of new synaptic inputs from AN fibers or other VCN neurons, e.g., through axonal sprouting (Kim et al. 2004).

A variety of evidence from prior studies suggests that inhibitory synapses in auditory neurons are weaker following cochlear damage or ablation (Willott et al. 1997; Suneja et al. 1998; Milbrandt et al. 2000; Mossop et al. 2000; Vale and Sanes 2000; Vale et al. 2003, 2004). Moreover, inhibitory effects on spike trains have been shown to be weaker following AT in the inferior colliculus and dorsal cochlear nucleus (Salvi et al. 2000; Ma and Young 2006). Inhibitory responses were not directly studied in this work. However, the decreased prevalence of non-monotonic RLFs in choppers following AT is consistent with a loss of inhibitory effects on the cells.

VCN effects of changes in spatiotemporal patterns in the AN

The discussion above treats VCN responses as resulting from standard feedforward synaptic processing. However, in an important paper, Carney (1994) has proposed a model in which VCN neurons are sensitive through coincidence detection to the relative temporal patterns of AN spike times across BF. Because the timing of AN spikes varies with sound level, this monaural coincidence detection could be used to encode both intensity differences (Colburn et al. 2003) and loudness. Furthermore, it could offer a possible explanation for loudness recruitment (Carney 1994). In an earlier paper, Carney (1990) showed sensitivity to the relative timing of AN spikes across BF in VCN neurons, including PriN, Onset, and Chopper neurons. However, it is not clear how important this mechanism is for the results shown in this paper, for two reasons. First, for tone stimuli, the coincidence mechanism only applies to low frequencies (those with phase-locking), and most neurons in this paper had BFs above 3 kHz. Recruitment in SNHL patients is observed at high tone frequencies as well as low (e.g. Stillman et al. 1993). Moreover, Figure 9 shows that the Ch and PL slopes also change differently for BBN, for which coincidence works at all BFs, because of envelope fluctuations. Thus, there is no evidence in our data for a connection between the changes in RLFs and the availability of information in monaural coincidence detection. Second, Carney (1990) showed coincidence effects in both PriN and Ch neurons, so it is not clear why these two neuron types should behave differently after AT (as in Figs. 4 and 5).

The pseudopopulation method

In order to combine data across animals with varying degrees of acoustic trauma, the pseudopopulation method was developed. To prevent including responses from normal neurons, the pseudopopulation was limited to neurons with BFs below the edge or cutoff frequency of the cochlear damage as estimated by the CAP threshold shift; in addition, responses to particular tone frequencies were included only if the tone frequency was below the edge. To further compensate for the varying degrees of threshold shift within the damaged BF region, sound levels were expressed as dBP (Eq. 1), which essentially expresses levels as dB relative to the CAP threshold at the neuron’s BF. For tone frequencies near the BF of a neuron (i.e., in the On-BF calculations), this method is appropriate, controlling between-animal and across-frequency variability. However, it does combine data from neurons with different degrees of threshold shift, which is unavoidable with the data base available in this paper. Given the changes in BF RLFs shown in Figures 4 and 5, combining across neurons should cause the estimates of population RLFs to be a compromise between RLFs of varying severity, but should not affect the principal results reported in this study, especially the differences between primary-and non-primary-like neurons. For tone frequencies away from BF, there is the additional problem that the difference of thresholds between tuning-curve tips and tails varies with the degree of threshold shift (e.g., Liberman and Dodds 1984), which introduces an additional variable into the calculation. This problem mainly affects all-BF calculations; again, it is not clear that it should affect the results in this study in a qualitative way.

Implications for loudness recruitment

Generally loudness is assumed to be proportional to the summed activity of some group of auditory neurons (e.g., Moore and Glasberg 2004), but the nature of that summation and the relevant populations of neurons are unclear (Pickles 1983; Relkin and Doucet 1997; Heinz et al. 2005). Similar questions remain in the coding of somatosensory perceptual intensity (Johnson 1974; Muniak et al. 2007). Attempts to reproduce perceptual loudness functions by summing the activities of populations of AN fibers suffer from the difficulty that it is not clear exactly how the degree of match of the loudness and neural growth functions should be evaluated, given that they are both monotonically increasing functions of sound level.

In considering recruitment, these difficulties can be bypassed to some extent by assuming that changes in the loudness function should be mirrored by similar changes in neural response growth functions, steeper neural growth functions corresponding to steeper loudness growth, for example. Previous studies based on evoked potentials and multi-neuron recording showed recruitment-like overexcitability of central auditory neurons following cochlear damage in a variety of animal models (e.g., Saunders et al. 1972; Popelář et al. 1987; Salvi et al. 1990; Syka et al. 1994; Szczepaniak and Møller 1996; Wang et al. 2002). This behavior has also been reported in the human auditory system through functional imaging (Morita et al. 2003; Langers et al. 2007). These neural data match the expectation of recruitment qualitatively in that the neural responses of populations are stronger after cochlear damage. Neural hyperexcitability is commonly reported in cortex and inferior colliculus, but the results in cochlear nucleus have been mixed, with both positive (e.g., Saunders et al. 1972) and negative (e.g., Salvi et al. 1990; Qiu et al. 2000) results. The different directions of change reported in this paper in the RLFs of different types of VCN neurons provide one possible explanation for this disagreement. From our results, it is clear that evoked potential amplitudes in VCN would be sensitive to which neurons dominate the response for the particular stimuli and electrode placements used. Nonetheless, the current study unequivocally suggests that the VCN is the most peripheral stage of the auditory pathway at which overexcitability of neurons appears following AT (e.g., Fig. 5). Moreover, it points to the synaptic processing in the non-primary-like, especially the stellate, cells as the initial locus of central compensation for reduced rate responses in the AN.

A more quantitative comparison of perceptual and neural response growth can be obtained by comparing loudness balance functions to rate-equality functions, as was done in Figures 7, 8, and 9. Equating the perceptual loudness between a normal and a damaged ear should behave similarly to equating the summed discharge rates of corresponding groups of neurons as long as loudness is in some way a monotonic function of summed neural activity. This approach avoids the difficulties posed by nonlinearities in the loudness growth functions themselves (e.g., Allen 1997; Buus and Florentine 2002; Moore and Glasberg 2004).

The rate-balance functions in Figure 7 were compared to an assumed recruitment function (dashed lines) in which loudness grows linearly from a point at the threshold (zero loudness) in the two ears to loudness equality at 100 dB. Real loudness balance functions deviate from this assumed line in two ways: (1) Loudness matches can show a slope near 1 for the first 10 dB or so above threshold and then become steeper (Moore 2004), and (2) loudness may not be equal in the two ears at 100 dB (Moore 2004; Zeng and Turner 1991; Stillman et al. 1993). Equality can occur at lower sound levels or loudness in the damaged ear may not reach equality with loudness in the good ear, say in cases with conductive hearing loss. Neither of these deviations can lower the loudness-balance slopes to the values shown by the AN fibers and PL neurons.

Because of these uncertainties, we compared rate-equality and loudness-balance functions by computing their slopes over a range of 20–40 dB near threshold. The unity slope in the first 10 dB was not separately considered and was usually not apparent in the data. These results, summarized in Figure 8, show that chopper neurons (and other non-PL neurons, not shown) behave in a way similar to human observers regardless of summation bandwidth. Other populations matched the psychophysical data only for wideband summations that include most of the population of active neurons.

Interpretation of the rate-balance results depends on the psychophysical evidence suggesting that recruitment for tones depends primarily on activity in the auditory filters near the tone frequency and not on spread of excitation (Hellman 1978; Moore et al. 1985; Zeng and Turner 1991). Figure 8 makes it clear that spread of excitation has a larger effect in the VCN than in the AN, in that spread of excitation causes substantially steeper rate-balance slopes in VCN than in AN. This conclusion is most clearly shown by comparing the AN and PL slopes in Figure 8. These results suggest that recruitment, and perhaps by extension loudness, depends on non-PL neurons in VCN, particularly the chopper neurons.

An additional aspect of chopper responses following AT is their elevated discharge rates at high sound levels (Fig. 5). This behavior could provide an explanation for auditory hyperacusis (e.g., Sherlock and Formby 2005; Nelson and Chen 2004) in which intense sounds become too loud for comfortable listening. Of course, this hypothesis depends on the nature of the additional adjustments in rate responses at later stages of the auditory system. Presumably, there are mechanisms that regulate the strengths of synapses between cochlear nucleus and inferior colliculus neurons to keep the collicular neurons within their dynamic ranges. Depending on those rules, the increase in rate response of chopper neurons could be accompanied by an increase or a decrease in synaptic strength at the next stage (as in the peripheral tinnitus model of Shaette and Kempter 2006), producing varying degrees of recruitment and hyperacusis. These questions require further research.