Skip to main content

Advertisement

Log in

An auditory localization model based on high-frequency spectral cues

  • Research Articles
  • Published:
Annals of Biomedical Engineering Aims and scope Submit manuscript

Abstract

We present in this paper a connectionist model that extracts interaural intensity differences (IID) from head-related transfer functions (HRTF) in the form of spectral cues to localize broadband high-frequency auditory stimuli, in both azimuth and elevation. A novel discriminative matching measure (DMM) is defined and optimized to characterize matching this IID spectrum. The optimal DMM approach and a novel backpropagation-based fuzzy model of localization are shown to be capable of localizing sources in azimuth, using only spectral IID cues. The fuzzy neural network model is extended to include localization in elevation. The use of training data with additive noise provides robustness to input errors. Outputs are modeled as two-dimensional Gaussians that act as membership functions for the fuzzy sets of sound locations. Error back-propagation is used to train the network to correlate input patterns and the desired output patterns. The fuzzy outputs are used to estimate the location of the source by detecting Gaussians using the max-energy paradigm. The proposed model shows that HRTF-based spectral IID patterns can provide sufficient information for extracting localization cues using a connectionist paradigm. Successful recognition in the presence of additive noise in the inputs indicates that the computational framework of this model is robust to errors made in estimating the IID patterns. The localization errors for such noisy patterns at various elevations and azimuths are compared and found to be within limits of localization blurs observed in humans.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Angell, J. C., and W. Fite. The monaural localization of sound.Psychol. Rev. 8:225–243, 1901.

    Article  Google Scholar 

  2. Ben-Arie, J., and K.R. Rao. Non-orthogonal signal representations by Gaussians and Gabor functions.IEEE Trans. Circuits Syst. 42:402–413, 1995.

    Google Scholar 

  3. Blauert, J. Modeling of interaural time and intensity difference discrimination. In:Psychophysical, Physiological, and Behavioral Studies in Hearing edited by G. vanden Brink and F. Bilsen. Delft, The Netherlands: Delft University Press, 1980, pp. 421–424.

    Google Scholar 

  4. Bodreau, J., and C. Tsuchitani: Binaural interaction in the superior olive S-segment.J. Neurophysiol 21:422–454, 1968.

    Google Scholar 

  5. Colburn, H. S. and N. I. Durlach. Models of binaural interaction. In:Handbook of Perception IV: Hearing, edited by E. C. Carterette and M. P. Friedman. New York: Academic Press, 1978, pp. 467–515.

    Google Scholar 

  6. Duda, R. O. Estimating Azimuth and Elevation from the Interaural Intensity Difference. Technical Report no. 4. San Jose, CA: Department of Electrical Engineering, San Jose State University, September 1993, 57 pp.

    Google Scholar 

  7. Guinan, J., Jr., B. Norris, and S. Guinan. Single auditory units in the superior olivary complex. II. Locations of unit categories and tonotopic organization.Int. J. Neurosci 4: 147–166, 1972.

    Article  Google Scholar 

  8. Haykin, S.Adaptive Filter Theory. Englewood Cliffs, NJ: Prentice-Hall, 1986, 590 pp.

    Google Scholar 

  9. Holmström, L., and P. Koistinen. Using additive noise in back-propagation training.IEEE Trans. Neural Networks 3:24–38, 1992.

    Article  Google Scholar 

  10. Kosko, B.Neural Networks and Fuzzy Systems: A Dynamical Systems Approach to Machine Intelligence. Englewood Cliffs, NJ: Prentice-Hall, 1992, 449 pp.

    Google Scholar 

  11. Lyon, R. F. Computational models of binaural localization and separation.Proc. ICASSP, 3:1148–1151, 1983.

    Google Scholar 

  12. Lyon, R. F. An analog electronic cochlea.IEEE Trans. Acoustics Speech Signal Processing 36:1119–1134, 1988.

    Article  Google Scholar 

  13. Makous, J. C., and J. C. Middlebrooks. Two-dimensional sound localization by human listeners.J. Acoust. Soc. Am. 87:2188–2199, 1990.

    Article  PubMed  CAS  Google Scholar 

  14. Middlebrooks, J. C., and D. M. Green. Sound localization by human listeners.Annu. Rev. Physiol. 42:135–59, 1991.

    CAS  Google Scholar 

  15. Musicant, A. D. and R. A. Butler. Influence of monaural spectral cues on binaural localization.J. Acoust. Soc. Am. 77:202–208, 1985.

    Article  PubMed  CAS  Google Scholar 

  16. Nandy, D., and J. Ben-Arie. A comparison of auditory localization models.Proceedings of the 12th ICPR, Jerusalem, Israel, October 1994, Vol. 3, pp. 123–127.

  17. Neti, C., E. D. Young, and M. H. Schneider. Neural network models of sound localization based on directional filtering by the pinna.J. Acoust. Soc. Am. 92:3140–3156, 1992.

    Article  PubMed  CAS  Google Scholar 

  18. Plenge, G. On the difference between localization and lateralization.J. Acoust. Soc. Am. 56:944–951, 1953.

    Article  Google Scholar 

  19. Rumelhart, D. E., G. E. Hinton, and R. J. Williams. Learning internal representations by error propagation. In:Parallel Distributed Processing: Explorations in the Microstructure of Cognition: Foundation vol. 1, edited by D. E. Rumelhart and J. L. McClelland. Cambridge: MIT Press, 1986, pp. 318–362.

    Google Scholar 

  20. Sachs, M. B., and P. J. Abbas. Rate versus level functions for the auditory-nerve fibers in cats: tone-burst stimuli.J. Acoust. Soc. Am. 56:1835–1847, 1974.

    Article  PubMed  CAS  Google Scholar 

  21. Shaw, E. A. G. Transformation of sound pressure level from the free field to the ear drum in the horizontal plane.J. Acoust. Soc. Am. 56:1848–1861, 1974.

    Article  PubMed  CAS  Google Scholar 

  22. Wenzel, E. M. Localization in virtual acoustic displaysPresence Teleoperators Virtual Environments 1:80–107, 1992.

    Google Scholar 

  23. Wightman, F. L., and D. J. Kistler. Headphone simulation of free-field listening. I. Stimulus synthesis.J. Acoust. Soc. Am. 85:858–867, 1989.

    Article  PubMed  CAS  Google Scholar 

  24. Yang, X., K. Wang, and S. A. Shamma. Auditory representations of acoustic signals.IEEE Trans. Information Theor. 38:824–839, 1992.

    Article  Google Scholar 

  25. Yost, W. A., and D. W. Nielsen.Fundamentals of Hearing: An Introduction. New York: CBS College Publishing, 1985, 269 pp.

    Google Scholar 

  26. Zwislocki, J., and R. Feldman. Just noticeable differences in dichotic phase.J. Acoust. Soc. Am. 28:860–864, 1956.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Nandy, D., Ben-Arie, J. An auditory localization model based on high-frequency spectral cues. Ann Biomed Eng 24, 621–638 (1996). https://doi.org/10.1007/BF02684176

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF02684176

Keywords

Navigation