Similarity of Traveling-Wave Delays in the Hearing Organs of Humans and Other Tetrapods
Transduction of sound in mammalian ears is mediated by basilar-membrane waves exhibiting delays that increase systematically with distance from the cochlear base. Most contemporary accounts of such “traveling-wave” delays in humans have ignored postmortem basilar-membrane measurements in favor of indirect in vivo estimates derived from brainstem-evoked responses, compound action potentials, and otoacoustic emissions. Here, we show that those indirect delay estimates are either flawed or inadequately calibrated. In particular, we argue against assertions based on indirect estimates that basilar-membrane delays are much longer in humans than in experimental animals. We also estimate in vivo basilar-membrane delays in humans by correcting postmortem measurements in humans according to the effects of death on basilar-membrane vibrations in other mammalian species. The estimated in vivo basilar-membrane delays in humans are similar to delays in the hearing organs of other tetrapods, including those in which basilar membranes do not sustain traveling waves or that lack basilar membranes altogether.
Keywordscochlea basilar membrane auditory otoacoustic emissions compound action potentials brainstem evoked responses
Transduction of sound in mammalian ears is mediated by basilar-membrane (BM) waves exhibiting delays that increase systematically with distance from the cochlear base. Fragmentary estimates of such traveling-wave delays exist for several species [see Table 2 in Robles and Ruggero (2001)]. Recently, a nearly complete map of delays, the first for any mammalian species, was drafted on the basis of correlated in vivo BM and auditory nerve measurements in chinchilla (Siegel et al. 2005). Siegel et al. also presented tentative maps for the cochleae of cats and guinea pigs (Siegel et al. 2005). A map of cochlear delays in living humans would be of immense interest but it is impossible to obtain using in vivo measurements such as recordings of BM vibrations or individual auditory nerve fibers: such recordings are highly invasive using current methods and will probably remain undoable in the foreseeable future.
Delays of BM waves were first measured in the temporal bones of human cadavers more than 60 years ago (von Békésy 1943), but, with rare exceptions, most contemporary accounts of BM delays in humans have ignored measurements in temporal bones in favor of indirect estimates based on brainstem-evoked responses (BSERs), compound action potentials (CAPs), and otoacoustic emissions (OAEs). Some of those estimates led to assertions that BM delays are much longer in humans than in common experimental animals (Neely et al. 1988; Shera et al. 2002). Here, we dispute those assertions, showing that all indirect estimates of BM delays in humans, including those used to justify the assertions, are flawed or are not yet adequately calibrated against BM measurements. We also show that estimates of in vivo BM delays in human cochleae, obtained by correcting postmortem BM data according to the effects of death on BM vibrations in experimental animals, are very similar to delays in the hearing organs of other mammals and in nonmammalian tetrapods, including those in which BMs do not sustain traveling waves or that lack BMs altogether.
MATERIALS AND METHODS
Estimates of cochlear delays in humans and experimental animals were drawn from the literature. Delay estimates not explicitly stated in texts or tables were computed from data extracted from figures by digitization (digiMatic, FEB Software, Chesterfield, VA, USA).
Measures of BM delays
In experimental animals, the signal-front delay remains unaltered postmortem at both basal [e.g., Fig. 1 of Ruggero (1994); Fig. 14 of Recio et al. (1998)] and apical (Cooper and Rhode 1996; Zinn et al. 2000) cochlear sites. In other words, the signal-front delay is a “passive” property of BM responses [see review by Ruggero (1994)]. Following von Békésy, we equate signal-front delay to BM travel time [see Fig. 11 of von Békésy (1949a) and p. 2293 of Recio and Rhode (2000)]. Hence, in vivo traveling-wave delay can be measured postmortem, as von Békésy did in the case of the human cochlea [Fig. 11 of von Békésy (1949a)].
The group delay at any particular frequency is the negative of the slope of the phase-vs.-frequency function at that frequency. For BM responses to low-level stimuli in mature healthy cochleae, the CF group delay coincides with the weighted-average group delay (the “center of mass”) of the impulse response. [Following Overstreet et al. (2002a, b), we distinguish between best frequency (BF) and characteristic frequency (CF). Best frequency is the stimulus frequency producing the largest response (without regard to stimulus level or cochlear health or development). Characteristic frequency is the BF of responses to low-level stimuli in mature and healthy cochleae.] For higher stimulus levels at the base of the cochlea, both the CF group delay and the weighted-average group delay become shorter, the latter at a greater rate than the former due to the increasing weight of low-frequency regions, which have low group delays [see Fig. 1 of Ruggero (1994)]. In BM responses, the CF group delay is partly an “active” property because its exact value in vivo depends on the health of the cochlea and on the stimulus level (Recio et al. 1998; Ruggero et al. 1997). Postmortem, the CF group delay does not vary as a function of stimulus level and is nearly the same as the CF group delay for very intense stimuli in vivo.
The phase delay, signal-front delay, and BF group delay are illustrated in Figure 1, which is based on von Békésy’s BM measurements in human temporal bones at the cochlear site with BF of 200 Hz. Figure 1A shows a delayed version of a minimum-phase impulse response [Figs. 7 and 9 of Flanagan and Bird (1962)] computed by Flanagan and Bird on the basis of von Békésy’s amplitude data for responses to tones (von Békésy 1947). The minimum-phase impulse response has been delayed by 1.48 ms, the travel time (i.e., the signal-front delay) measured by von Békésy using responses to clicks [Fig. 11 of von Békésy (1949a)]. After the signal-front delay, the BM “rings” with a period of 5 ms (=1/BF; Fig. 1B). Note that the near-BF group delay is the sum of the signal-front delay and a filter delay [or “resonance build-up time” (de Boer and Nuttall 1996)].
BM delays in experimental animals
The putative derivation of BM delays in humans from OAE group delays
Shera et al. (2002) used the group delays of stimulus-frequency OAEs (SFOAEs) to estimate the CF group delays of BM responses and concluded that BM delays are much longer in humans than in common experimental animals [e.g., three times longer than in cats or guinea pigs for the 3-kHz cochlear site according to Fig. 2 of Shera et al. (2002)]. They based that conclusion on the fact that SFOAE group delays are much longer in humans than in experimental species (Shera et al. 2002) and on the assumption that “the group delay of SFOAEs is equal to twice the group delay of the BM mechanical transfer function evaluated at the cochlear location with CF equal to the stimulus frequency” [page 3319 of Shera et al. (2002)]. Recently, Siegel et al. (2005) tested the assumption of Shera et al. (2002) by comparing SFOAE and BM group delays in chinchilla, cat, and guinea pig and found that, in those species, SFOAEs actually have group delays either similar to or lower than the near-CF BM group delays. That finding contradicts the aforementioned assumption [page 3319 of Shera et al. (2002)] for the species tested and suggests that the assumption is also invalid for estimating BM delays in other species, including humans.
Some investigations estimated BM delays in humans using the group delays of 2f1–f2 distortion-product OAEs (DPOAEs) stimulated by one tone with fixed frequency and another with variable frequency (Bowman et al. 1997; Ramotowski and Kimberley 1998; Schoonhoven et al. 2001). The estimates were based on the assumption that the DPOAE group delays for f2 or f1 sweeps amount to twice the CF group delay of the BM traveling wave. Such assumption has been shown to be invalid for chinchilla, guinea pig, and gerbil, in which DPOAE and BM group delays are actually similar (Gong et al. 2005; Narayan et al. 1998; Ren 2004; Ren et al. 2006; Ruggero 2004). Thus, it is likely that the assumption is also invalid for humans.
The putative derivation of BM delays in humans from BSER latencies
Neely et al. (1988) estimated BM delays in humans from the wave-V latencies of BSERs to tone-bursts. Latencies were assumed to consist of sums of a constant (i.e., independent of stimulus level and frequency) neural component and a BM component, which varied strongly with stimulus level and frequency. Neely et al., recognizing that their BM delay estimates [e.g., 9 ms for 1-kHz tones presented at 20 dB sound pressure level (SPL)] were “much longer than most previously reported values,” suggested that humans have “longer cochlear travel times than most experimental animals” because “human subjects have longer, more compliant cochleas” (Neely et al. 1988). We show here that the long delay estimates actually resulted from flawed methodology and interpretation.
From the foregoing, it is clear that at least one half of the measured BSER latencies either was a methodological artifact (see below) and/or originated at sites of the auditory pathway more central than the inner hair cells. Recordings from auditory nerve fibers and cochlear nucleus neurons suggest that those sites are the synapses between inner hair cells and auditory nerve terminals. In the 30–90 dB SPL range, with onset stimulus ramps of 1.4 ms, intensity-dependent first-spike latency changes of auditory nerve fibers can be as large as 90 ms [Fig. 4A of Heil and Irvine (1997)]. In the same level range, using 6 ms on ramps, Kitzes et al. (1978) documented first-spike latency changes of about 30 ms in responses of cochlear-nucleus neurons to CF tones (11.6 and 1.8 kHz) [Figs. 1 and 3 of Kitzes et al. (1978)].
The methodology of Neely et al. (1988) was flawed because the durations of the stimulus-onset ramps were not constant across stimulus frequencies. Brainstem-evoked responses to tone bursts consist of neural activity synchronized by short stimulus-onset ramps. Therefore, to validly compare the latencies of responses to different tone bursts, the onset ramps must have a constant duration. In contrast, Neely et al. (1988) used tone bursts with onset-ramp durations that decreased as a function of increasing frequency (e.g., 4 ms for 0.25 and 0.5 kHz, 2 ms for 1–2 kHz, 1.4 ms for 3 kHz, 1 ms for 4 kHz, and 0.5 ms for 8 kHz), which should artificially produce delays that increase as stimulus frequency decreases (Neely et al. 1988). According to the aforementioned study by Heil and Irvine (1997) on auditory nerve fibers, for 30-dB SPL tones, lengthening ramp duration from 1.7 to 4.2 ms can increase first spike latency by 1.5–3 ms [Figs. 2E and 3E of Heil and Irvine (1997)]. Between 0.5 and 3.5 kHz, Neely et al. (1988) found a BSER latency change of 5.9 ms for tone bursts presented at 30 dB SPL. The latency effects documented by Heil and Irvine suggest that between 1.5 and 3 ms (i.e., 25–50%) of the 5.9-ms latency change can be attributed solely to differences in ramp duration (and, therefore, were not due to cochlear travel time). To summarize, the large dependencies of latency on stimulus frequency and level reported by Neely et al. include large, probably dominant, components that did not reflect BM latency shifts but rather were due to the different durations of the onset ramps and to level-dependent variations in first-spike latency of auditory nerve fibers.
Estimation of cochlear delays in experimental animals using derived-band CAPs
Teas et al. (1962) devised a so-called derived-band method that permits extracting some CF-specific information from click-evoked CAPs (Teas et al. 1962): CAPs evoked by clicks presented with a high-pass noise are subtracted from CAPs evoked by clicks presented with a high-pass noise with higher-frequency cut-off. Such derived-band responses, presumably reflecting synchronized firings of auditory nerve fibers with CFs in the frequency range between the two cut-offs, have latencies that vary as a function of frequency in a manner reminiscent of BM delays (Teas et al. 1962).
Early comparisons of derived-band CAP latencies and responses to clicks of auditory nerve fibers in guinea pigs (Prijs and Eggermont 1981) and in a single cat (Evans and Elberling 1982) suggested that such latencies were CF-specific for CFs higher than 1–2 kHz but could not test their accuracy rigorously because reliable measures of BM delays were not available. We now take advantage of newly available maps of BM delay for cat and guinea pig to compare them with derived-band delay estimates.
Figure 2C permits comparing derived-band delay estimates for cat with BM CF group delays and signal-front delays, i.e., the latencies of auditory nerve fiber responses to intense rarefaction clicks, corrected for a neural/synaptic delay of 1 ms [see p. 389 of Ruggero and Rich (1987) and Figs. 2 and 7 of Siegel et al. (2005)]. For the cat cochlea, derived-band delays are shorter than CF BM group delays but similar to signal-front delays. For guinea pig (Fig. 2B), the derived-band delays also approximate the signal-front delays closely. These results, which are consistent with the strong skew toward short delays evident in responses to intense clicks at the BM (Recio et al. 1998) and (even more pronouncedly) in poststimulus histograms of auditory nerve fibers (Kiang et al. 1965; Lin and Guinan 2000), suggest that when implemented with intense stimuli the derived-band method estimates BM signal-front delays for CFs higher than 1–2 kHz.
Derived-band delay estimates for the human cochlea
Figure 3A shows BM delay estimates for humans based on derived-band CAPs measured with intense stimuli (open symbols) as well as BSERs measured with stimuli presented at several levels (filled symbols). The BM delay estimates were computed in the original publications by subtracting 0.8 ms from the latencies of the derived-band responses to clicks, on the assumption that synaptic and neural-conduction delays amount to 0.8 ms. Observations listed below suggest that such a procedure is incorrect.
(1) In experimental animals, CAPs evoked by intense clicks principally reflect the activity of basal spiral ganglion cells. Therefore, the CAP latency matches the average first-spike latency of responses of high-CF auditory nerve fibers [e.g., Fig. 16 of Kiang (1984) and Fig. 5.14 of Kiang et al. (1965)]. (2) That latency, about 1 ms, exceeds by about 1 ms the (very short) BM signal-front delay at the base of the cochlea [e.g., about 30 μs at the 10-kHz place in chinchilla (Recio et al. 1998; Recio and Rhode 2000)]. (3) As a corollary of 1 and 2, BM travel time (i.e., signal-front delay relative to stapes motion) to any given CF region can be accurately estimated by subtracting the latency of responses of high-CF fibers from the latency of responses of fibers with appropriate CF (Kiang et al. 1965; Ruggero and Rich 1987). (4) As another corollary of 1 and 2, BM travel time in experimental animals can be estimated equally well by subtracting the latency of derived-band CAPs for the highest CFs from derived-band estimates for lower CFs. (5) Presumably, the same procedure should apply equally well for estimating travel time from derived-band delays in humans.
Figure 3B shows the result of subtracting from the derived-band estimates of Figure 3A the minimum latency in each of the corresponding experimental series, which generally corresponds to the derived-band latencies for the highest CF. For high CFs, the delays expressed relative to the delays at the most basal cochlear sites are, by definition, near zero. This result is consistent with the high traveling wave velocities at the base of the human cochlea (e.g., about 20 m/s at the 10-kHz place), as estimated from derived-band CAPs [see Fig. 9 of Eggermont (1976)]. For lower CFs, the derived-band delays for humans grow larger.
The BSER derived-band estimates (Eggermont and Don 1980) of Figure 3B (filled symbols) appear to be the only ones that have addressed the intensity dependence of derived-band delays. For stimulation with intense clicks, derived-band delays measured with BSERs and CAPs are similar and very short (see also Schoonhoven et al. 2001). For lower click levels, delays grow systematically larger. The intensity dependence of derived-band BSER delays is qualitatively consistent with the large intensity dependence of the weighted-average group delay of responses to clicks of auditory nerve fibers [e.g., Figs. 5.9 and 5.10 of Kiang et al. (1965)], which exaggerates the intensity dependence of BM weighted-average group delays [see Fig. 1 of Ruggero (1994)]. Thus, derived-band delays may reflect BM CF group delays at low stimulus levels and signal-front delays at intense levels. However, the precise relationship between BM delays and derived-band estimates cannot be specified at present with any certainty. Given such uncertainty, it seems clear that before derived-band CAPs can be used to estimate BM delays in humans, derived-band CAPs must first be adequately calibrated against BM delays in experimental animals, particularly with regard to their intensity dependence.
BM delays in the cochleae of human cadavers
For comparison with the human data of Figure 4A, B shows phase-vs.-frequency curves for cochlear responses in squirrel monkey and chinchilla cochleae. The responses were recorded at sites with CFs comparable to those of Figure 4A. For squirrel monkey, phase-vs.-frequency curves are for postmortem BM responses at sites with BFs near 4.7 kHz (solid lines and triangles) and in vivo responses (corrected for synaptic and conduction delays) of an auditory nerve fiber with CF of 2.3 kHz to tones presented at 50 (open circles) and 90 dB SPL (dashed line). Figure 4B also shows phase curves for chinchilla in vivo BM responses to tones presented at 30 (open squares) and 80 dB SPL (short-dash line). The neural curves for squirrel monkey and the BM curves for chinchilla illustrate that, either in vivo or postmortem, CF group delays for responses to intense stimuli are shorter than in vivo responses to low-level stimuli. The responses for humans and squirrel monkey differ from those in chinchilla in having nearly straight-line phase-vs.-frequency curves. The difference between the curves for primates and rodents probably reflects both species dissimilarities and variation of the phase-vs.-frequency curves as a function of CF [see Pfeiffer and Molnar (1970) and p. 1319 of Robles and Ruggero (2001)].
The effects of death on BM group delays
Figure 6B summarizes the effects of death on the slopes of phase-vs.-frequency curves for most of the measurements represented in Figure 6A. [Squirrel monkey data are not included because they are available only for cochleae exhibiting signs of deterioration.] Such effects consist of the abolition of the intensity dependence of group delay, which is longest for in vivo response to low-level stimuli and shortest for intense stimulation either in vivo or postmortem [Fig. 4B and pp. 1311–1312 of Robles and Ruggero (2001)]. For reference, Figure 6B includes a dashed line indicating equality between postmortem CF delay and in vivo CF delay for low-level stimulation; i.e., the line indicates a lack of dependence of CF phases on intensity, as would be the case for (hypothetical) linear responses. The effects of death (or, in vivo, of stimulus intensity) are seen in Figure 6B as a divergence between the dashed line and the actual trend of in vivo CF delay as a function of postmortem CF delay (solid line).
Figure 7 also shows the trend (thin solid line) of in vivo CF group delays for low-level stimulation in chinchilla (Temchin et al. 2005). The in vivo CF group delays for humans and chinchillas are similar throughout most of the range of CFs. This similarity is of interest because the two species have comparable auditory thresholds and nearly identical hearing ranges [so that their cochlear maps are scaled versions of each other (Greenwood 1990)], but their BM lengths differ by a factor of nearly two (chinchillas, 18 mm; humans, 35 mm). The similarity of delays in spite of widely different BM lengths contradicts the conjecture that “the absolute magnitude of wave velocity in live human cochleae is roughly the same (as in others mammals)” [page 227 of Zwislocki (2002)] and its corollary, that because “human subjects have longer...cochleas” they must have “longer cochlear travel times than most experimental animals” [pp. 655–656 of Neely et al. (1988)]. [Caveat: Zwislocki’s conjecture was based on an analysis of phase delay, a measure that can differ substantially from CF group delay (see Fig. 1).]
Similarity of CF group delays among mammalian and nonmammalian tetrapods
Signal-front and CF BM group delays in human cochleae (Fig. 8) are also similar to delays in the extremely short hearing organs of nonmammalian tetrapods [see Fig. 6B of Ruggero and Temchin (2005)]. Such similarity is remarkable in the case of the bobtail lizard Tiliqua, whose BM does not support a traveling wave (Manley et al. 1988), and even more remarkable in the case of frogs (Hillery and Narins 1984; Hillery and Narins 1987), whose hearing organs lack BMs altogether. The implication is that systematic increases of signal-front and/or CF group delay as a function of decreasing CF do not necessarily reflect BM waves that travel from base to apex (Köppl 1997b; Manley 1990; Smolders and Klinke 1986).
Justification of the assumption that the effects of death on cochlear amplification and group delays in humans can be derived from the corresponding effects in experimental animals
The general assumption underlying the derivation of in vivo CF group delays from data in experimental animals is that cochlear processes in humans are similar to those in other mammalian species. Here we present evidence that the applicability to humans of quantitative relationships derived from experimental animals is not a mere assumption but rather a reasonable hypothesis supported by many facts. Specifically, we show below that (1) in vivo CFs and postmortem BFs are quantitatively related to each other in the same manner as they are in experimental species (Fig. 6A) and (2) in vivo CF group delays and postmortem CF group delays are quantitatively related to each other in the same manner as they are in experimental species (Fig. 6B).
Hypothesis 1 is supported by the phenomenon of “the half-octave shift”: temporary threshold shifts induced by exposure to intense tones in humans (Davis et al. 1950) and other animals (Cody and Johnstone 1980) cause the greatest hearing-threshold elevations at frequencies one-half octave higher than that of the intense tone. It is almost certain that the half-octave shift reflects a corresponding shift in BM BF following death (Sellick et al. 1982) or stimulation with intense stimuli (Ruggero et al. 1996).
Postmortem, the peak gains of BM motion (i.e., at BF) relative to incus or stapes motion are similar in humans and in experimental animals [humans, 10–25 dB, Figs. 2 of Gundersen et al. (1978) and 5A of Stenfelt et al. (2003); squirrel monkey, 7–15 dB, Fig. 9 of Rhode (1978); gerbil, 26 dB, Fig. 6 of Overstreet et al. (2002a); and chinchilla, 20 dB, Fig. 5 of Recio and Rhode (2000)].
The peak magnitude of stapes or incus velocity gain (re stimulus pressure) is similar (in the order of 0.3 mm s−1 Pa−1) in humans and in experimental species [humans, Fig. 3 of Aibara et al. (2001); squirrel monkey, Figs. 3 and 4 of Rhode (1971); gerbil, Fig. 2 of Overstreet and Ruggero (2002); chinchilla, Fig. 1 of Ruggero et al. (2007)].
“...Most mammals, including humans, have the lowest (auditory) thresholds within 10 dB of 0 dB SPL” [pp. 7–8 of Fay (1994)]. In other words, with few exceptions, “the lowest level of stimuli that can be detected within the BF range is approximately the same for all mammalian species” [pp. 24–25 of Long (1994)].
For high-CF sites, CF group delays are shortest postmortem or, in vivo, for intense stimuli; in vivo, CF group delays become systematically longer as stimulus levels are decreased or when cochlear health deteriorates [e.g., Fig. 2 of Ruggero (1994); reviewed on p. 1336 of Robles and Ruggero (2001)]. However, when CF group delay changes do occur postmortem, they are relatively small; e.g., 0.38 ms at the 10-kHz site of chinchilla (Ruggero et al. 1997) or 0.25 ms at the 18-kHz site of guinea pig (Nuttall and Dolan 1996), corresponding to in vivo re postmortem gains at CF (i.e., “amplification”) of 65 and 60 dB, respectively. The ratio of postmortem group delay change to the corresponding gain change indicates the rate at which CF group delay varies with amplification: 5.8 μs/dB for chinchilla and 4.2 μs/dB for guinea pig.
From a, b, and c, it follows that the strength of cochlear amplification (the amount by which in vivo sensitivity exceeds postmortem sensitivity at CF) must be approximately the same in humans and other species. According to d, postmortem and in vivo CF group delay are the same for low CF regions in experimental animals and, hence, also probably in humans. According to e, even if large differences exist between humans and other species in the strength of cochlear amplification, those differences would be accompanied by relatively minor differences in CF group delay. For example, if amplification at CF were to amount in humans to 100 dB (i.e., 40 dB or two orders of magnitude higher than in chinchilla or guinea pig), the equation illustrated in Figure 6B would underestimate by only 160–240 μs (i.e., 40 dB times 4.2 or 5.8 μs/dB) the CF group delays for CFs of 10–18 kHz.
Comparability of postmortem measurements of BM vibration in humans and in experimental animals
A matter of concern in extending observations in experimental animals to humans is whether the respective functional and structural properties are validly comparable. Postmortem observations in animal experiments (such as illustrated in Figs. 4–6) are typically carried out soon (within minutes or a few hours) after death, when cochlear structures may be presumed to remain largely intact. In contrast, BM vibration measurements in human temporal bones appear to have been carried out, at the earliest, several hours after death. Von Békésy stated that his vibration measurements were made in “fresh” temporal-bone preparations (von Békésy 1949b), but such explicit statements are lacking in von Békésy (1947) and von Békésy (1949a). The latter papers, however, refer to the use of “fresh” preparations in other contexts. Gundersen et al. (1978) gave no details regarding their temporal bone preparations, but another publication reporting on the same experiments states that measurements were carried in “temporal bones...prepared 8–24 h postmortem” (Kringlebotn et al. 1979). Stenfelt et al. (2003) made their BM measurements within 6 days after death in temporal bones that had been extracted 2 days after death and stored at 5°C in a 1:10,000 merthiolate solution in normal saline (Stenfelt et al. 2003).
The concern about the dissimilarity of cochlear conditions in human temporal bones and in experimental animals is somewhat allayed by the fact that most postmortem changes in BM responses in experimental animals, in particular the disappearance of nonlinear features (including the dependence of CF group delay on stimulus intensity; Fig. 6B), occur soon (within minutes) after death (Recio et al. 1998; Rhode 1973). Nevertheless, lingering changes in “passive” features were noted by Kohllöffel (1972) and Rhode (1973). Of special interest is that, after the initial postmortem effects (Fig. 5B), further changes (decreases) in near-CF group delays remained small for as long as 31 h in squirrel monkey [see Fig. 4 of Rhode (1973)]. Kohllöffel compared responses in guinea pig cochleae 4 and 7 days after death and found that there was “practically no effect of aging on the phase patterns” (Kohllöffel 1972).
Significance of the present findings for the concept of a traveling wave
In a 1954 paper, Wever, Lawrence, and von Békésy reconciled some of their views on the nature of the traveling wave. They stated that when the cochlea is stimulated with a tone, a BM “displacement wave seems to be moving up the cochlea. Actually...each element of the membrane is executing sinusoidal vibrations...different elements...executing these vibrations in different phases. This action can be referred to as that of a traveling wave, provided that...nothing is implied about the underlying causes. It is in this sense that Békésy used the term ‘traveling wave’...” [pp. 511–513 of Wever et al. (1954)]. More than half a century later, the nature of cochlear mechanical “traveling waves” remains a matter of argument, as demonstrated by a recent discussion in Portland, OR [see pp. 534–539 of Nuttall et al. (2006)]. Nevertheless, we agree with many others that physical traveling waves propagate in the mammalian cochlea, carrying energy from base to apex. Such traveling waves apparently result from interactions between fluid inertia and the structural properties of the organ of Corti and the BM, principally the latter’s elasticity. The evidence for the existence of traveling waves in mammalian cochleae is abundant [see reviews by Robles and Ruggero (2001) and Ruggero (1994)]. Numerous measurements of BM vibration [especially those of Russell and Nilsen (1997), Ren (2002), and Rhode and Recio (2000)], of fluid pressure in scala tympani near the BM (Olson 1998) and of responses of auditory nerve fibers (Pfeiffer and Kim 1975) have been complemented and interpreted by substantial theoretical and/or modeling work on hydrodynamics (de Boer 1984; Lighthill 1981; Mammano and Nobili 1993; Peterson and Bogert 1950; Taber and Steele 1981; Zwislocki 1948).
The situation is less clear in nonmammalian hearing organs. In particular, substantial signal-front delays that increase from high-CF to low-CF regions (Fig. 8) exist in the hearing organs of the bobtail lizard (Tiliqua), in which the BM does not sustain a traveling wave (Manley 1990), and frogs, which lack BMs altogether. Obviously, the concept of the traveling wave cannot be applied to those hearing organs in the same sense that it applies to mammals (Robles and Ruggero 2001; Ruggero 1994). Nevertheless, mechanical traveling waves may well be sustained by structures other than BMs, such as the tectorial membrane in the amphibian papilla of frogs (Hillery and Narins 1984) and the tympanic membrane of locusts (Windmill et al. 2005).
SUMMARY AND CONCLUSIONS
Assertions based on SFOAEs and BSERs that BM delays are much longer in humans than in experimental animals, as well as estimates of BM delays based on DPOAEs, are probably invalid because they depend on assumptions that are invalid for experimental animals. The validity of derived-band estimates of BM delays in humans is uncertain. Derived-band delay estimates should be calibrated against BM delays in experimental animals. Signal-front and BF BM group delays in the cochleae of human cadavers are similar to the corresponding postmortem delays in other mammals (Fig. 5). Measurements in experimental animals of the effects of death or, in vivo, of the intensity-dependence of BM responses (Fig. 6), permit the estimation of CF group delays for living humans (Fig. 7) on the basis of BM delays in cadavers (on the assumption that the relationship between postmortem and in vivo delays is similar across species). Signal-front and CF group delays in the cochleae of living humans appear to be similar to the corresponding delays in chinchillas, other mammals, and even nonmammalian species, including those in which a BM traveling wave does not exist (Fig. 8).
Hongxue Cai, Mandy Cerka, Yun-Hui Fan, Luis Robles, Jon Siegel, and Joe Zwislocki provided helpful comments on previous drafts of this article. The authors were supported by NIH Grant DC-00419 from the National Institute on Deafness and Other Communication Disorders.
- Aran JM, Cazals Y. Electrocochleography: animal studies. In: Naunton RF and Fernández C (eds) Evoked Electrical Activity in the Auditory Nervous System. New York, Academic Press, pp. 239–257, 1978.Google Scholar
- Brillouin L. Wave Propagation and Group Velocity. New York, Academic Press, 1960.Google Scholar
- Cooper NP, Rhode WS. Fast travelling waves, slow travelling waves and their interactions in experimental studies of apical cochlear mechanics. Aud. Neurosci. 2:289–299, 1996.Google Scholar
- Davis H, Morgan CT, Hawkins JE, Galambos R, Smith FW. Temporary deafness following exposure to loud tones and noise. Acta Otolaryngol. Suppl. LXXXVIII:4–57, 1950.Google Scholar
- de Boer E, Nuttall AL. Cochlear travel time and minimum phase. Assoc. Res. Otolaryngol. Mid-Winter Meet. Abstr. 19:57, 1996.Google Scholar
- Elberling C. Action potentials along the cochlear partition recorded from the ear canal in man. Scand. Audiol. 3:13–19, 1974.Google Scholar
- Evans EF. The frequency response and other properties of single fibres in the guinea-pig cochlear nerve. J. Physiol. (Lond.) 226:263–287, 1972.Google Scholar
- Fay RR. Comparative auditory research. In: Fay RR and Popper AN (eds) Comparative Hearing: Mammals. New York, Springer-Verlag, pp. 1–17, 1994.Google Scholar
- Goldstein JL, Baer T, Kiang NYS. A theoretical treatment of latency, group delay and tuning. Characteristics for auditory nerve responses to clicks and tones. In: Sachs MB (ed) The Physiology of the Auditory System. Baltimore, National Educational Consultants, pp. 133–141, 1971.Google Scholar
- Gong Q, Temchin AN, Siegel JH, Ruggero MA. Similarity of group delays of basilar-membrane vibrations and distortion-product otoacoustic emissions in chinchilla. Assoc. Res. Otolaryngol. Mid-Winter Meet. Abstr. 28:113, 2005.Google Scholar
- Kiang NYS. Peripheral neural processing of auditory information. In: Darian-Smith I (ed) The Nervous System. Volume III: Sensory Processes, Part 2. Bethesda, American Physiological Society, pp. 639–674, 1984.Google Scholar
- Kiang NYS, Watanabe T, Thomas EC, Clark LF. Discharge Patterns of Single Fibers in the Cat’s Auditory Nerve. Cambridge, M.I.T. Press, 1965.Google Scholar
- Kohllöffel LUE. A study of basilar membrane vibrations. II. The vibratory amplitude and phase pattern along the basilar membrane (postmortem). Acustica 27:66–81, 1972.Google Scholar
- Long GR. Psychoacoustics. In: Fay RR and Popper AN (eds) Comparative Hearing: Mammals. New York, Springer-Verlag, pp. 18–56, 1994.Google Scholar
- Manley GA. Peripheral Hearing Mechanisms in Reptiles and Birds. Berlin, Springer-Verlag, 1990.Google Scholar
- Narayan SS, Recio A, Ruggero MA. Cubic distortion products at the basilar membrane and in the ear canal of chinchillas. Assoc. Res. Otolaryngol. Mid-Winter Meet. Abstr. 21:181, 1998.Google Scholar
- Nuttall AL, Ren T, Gillespie PG, Grosh K, de Boer E. Auditory Mechanisms: Processes and Models. Singapore, World Scientific, 2006.Google Scholar
- Papoulis A. The Fourier Integral and its Applications. New York, McGraw-Hill, 1962.Google Scholar
- Rhode WS. An investigation of postmortem cochlear mechanics. In: Møller AR (ed) Basic Mechanisms in Hearing. New York, Academic, pp. 49–63, 1973.Google Scholar
- Rhode WS, Cooper NP. Nonlinear mechanics in the apical turn of the chinchilla cochlea in vivo. Aud. Neurosci. 3:101–121, 1996.Google Scholar
- Ruggero, MA, Narayan, SS, Temchin, AN, Recio, A. Mechanical bases of frequency tuning and neural excitation at the base of the cochlea: comparison of basilar-membrane vibrations and auditory-nerve-fiber responses in chinchilla. Proc. Natl. Acad. Sci. U. S. A. 97:11744–11750, 2000.PubMedCrossRefGoogle Scholar
- Ruggero MA, Rich NC, Recio A. Basilar membrane responses to clicks. In: Cazals Y, Demany L, and Horner K (eds) Auditory Physiology and Perception. London, Pergamon Press, pp. 85–91, 1992.Google Scholar
- Ruggero MA, Rich NC, Recio A. The effect of intense acoustic stimulation on basilar-membrane vibrations. Aud. Neurosci. 2:329–345, 1996.Google Scholar
- Ruggero MA, Temchin AN, Fan Y-H, Cai H. Boost of transmission at the pedicle of the incus in the chinchilla middle ear. In: Eiber A and Huber A (eds) Middle Ear Mechanics in Research and Otology. Singapore, World Scientific, pp. 160–163, 2007.Google Scholar
- von Békésy G. Uber die Rezonanzkurve und die Abklingzeit der verschiedenen Stellen der Schneckentrennwand. Akust. Zeits. 8:66–76, 1943.Google Scholar
- Zwislocki JJ. Theorie der Scheneckenmechanik: qualitative und quantitative analyse. Acta Otolaryngol. Suppl. 72:1–76, 1948.Google Scholar
- Zwislocki JJ. Auditory Sound Transmission—An Autobiographical Perspective. Mahwah, Lawrence Erlbaum Associates, 2002.Google Scholar