Abstract
In this chapter, we describe the application of the proposed Japanese DRT to measuring the intelligibility of localized Japanese speech in a virtual three-dimensional environment. Speech is localized to a specified location by convolving it with the head-related transfer function, which is the transfer function of the path between the source and each of the listener’s ears. We have been attempting to use speech localization to place many of the potential speech sources at distant locations within the virtual space, so that the intelligibility of the localized speech is maintained while the “presence” of the other audio sources is preserved. The intended applications include multi-party audio conferencing and augmented audio reality applications that mix localized speech on top of the actual acoustic space.
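The localization step described above can be illustrated with a minimal sketch, assuming the head-related transfer functions are available in the time domain as a pair of head-related impulse responses (HRIRs), one per ear. The function name `localize` and the toy impulse responses below are hypothetical and chosen only for illustration; they are not the chapter's implementation, and a real system would use measured HRIRs for the desired source direction.

```python
import numpy as np

def localize(mono, hrir_left, hrir_right):
    """Render a mono signal binaurally by convolving it with one HRIR per ear.

    Returns an array of shape (n_samples, 2): column 0 is the left ear,
    column 1 is the right ear.
    """
    left = np.convolve(mono, hrir_left)
    right = np.convolve(mono, hrir_right)
    # The two HRIRs may differ in length, so zero-pad to a common length.
    n = max(len(left), len(right))
    out = np.zeros((n, 2))
    out[:len(left), 0] = left
    out[:len(right), 1] = right
    return out

# Toy HRIRs (an assumption, not measured data): the left ear receives the
# signal unchanged, the right ear receives it 20 samples later at half the
# amplitude, crudely mimicking a source located to the listener's left.
hrir_left = np.zeros(32)
hrir_left[0] = 1.0
hrir_right = np.zeros(32)
hrir_right[20] = 0.5

# Stand-in for a speech signal: 1000 samples of seeded noise.
mono = np.random.default_rng(0).standard_normal(1000)
binaural = localize(mono, hrir_left, hrir_right)
```

With measured HRIRs, several talkers could each be rendered this way at a different direction and the resulting two-channel signals summed for headphone playback.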
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Kondo, K. (2012). DRT Evaluation of Localized Speech Intelligibility in Virtual 3-D Acoustic Space. In: Subjective Quality Measurement of Speech. Signals and Communication Technology. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-27506-7_6
DOI: https://doi.org/10.1007/978-3-642-27506-7_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-27505-0
Online ISBN: 978-3-642-27506-7
eBook Packages: Engineering, Engineering (R0)