DRT Evaluation of Localized Speech Intelligibility in Virtual 3-D Acoustic Space

Kondo, Kazuhiro

doi:10.1007/978-3-642-27506-7_6

DRT Evaluation of Localized Speech Intelligibility in Virtual 3-D Acoustic Space

Kazuhiro Kondo²

Chapter
First Online: 01 January 2012

665 Accesses
1 Citations

Part of the book series: Signals and Communication Technology ((SCT))

Abstract

In this chapter, we will describe the application of the proposed Japanese DRT to the measurement of intelligibility of localized Japanese speech in virtual three dimensional environment. Speech is localized to the specified location by convolving with the head related transfer function, which is the transfer function of the path between the source and each of the listener’s ear. We have been attempting to use speech localization to place many of the potential speech sources at distant locations within the virtual space to maintain the localized speech intelligibility, while preserving the “presence” of the other audio sources. The intended applications include multi-party audio conferencing, and augmented audio reality applications that mix localized speech on top of actual acoustic space.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Breebaart, J., Faller, C.: Spatial Audio Processing. Wiley, West Sussex (2007)
Book Google Scholar
Breebaart, J., van de Par, S., Kohlrahsch, A., Schuijers, E.: Parametric coding of stereo audio. EURASIP J. Appl. Sig. Process. 2005(9), 1305–1322 (2005)
Article MATH Google Scholar
Bronkhorst, A.W., Plomp, R.: Effect of multiple speechlike maskers on binaural speech recognition in normal and impaired hearing. J. Acoust. Soc. Am. 92(6), 3132–3139 (1992)
Article Google Scholar
Brungart, D., Simpson, B.: Distance-based speech segregation in near-field virtual audio displays. In: Proceedings of International Conference on Auditory Display, pp. 169–174 (2001)
Google Scholar
Brungart, D., Simpson, B.: Optimizing the spatial configuration of a seven-talker speech display. In: Proceedings of International Conference on Auditory Display, pp. 188–191 (2003)
Google Scholar
Chiba, T., Kitashima, Y., Yano, N., Kondo, K., Nakagawa, K.: On the influence of localized position of interference noise on the intelligibility of read Japanese words in remote conference systems. In: Proceedings of 37th International Congress and Exposition on Noise Control Engineering (Internoise), in08_0294. Shanghai, China (2008)
Google Scholar
Fujimori, M., Kondo, K., Nakagawa, K.: On low frequency-pass characteristics of a one-out-of-two selection-based Japanese intelligibility test. In: Proceedings of 6th Technical Meeting of the Information Processing Society of the Japan Tohoku Chapter, vol. A2-2 (2005) (in Japanese)
Google Scholar
Gardner, B., Martin, K.: HRTF measurement of a KEMAR dummy-head microphone (1994). http://sound.media.mit.edu/resources/KEMAR.html
Hawley, M.L., Litovsky, R.Y., Colburn, H.S.: Speech intelligibility and localization in a multi-source environment. J. Acoust. Soc. Am. 105(6), 3436–3448 (1999)
Article Google Scholar
ISO/IEC JTC1/SC29: Coding of audio-visual objects—Part 3: Audio (2005)
Google Scholar
Johnston, J.D., Ferreira, A.J.: Sum-difference stereo transform coding. In: Proceedings of International Conference on Acoustics, Speech and Signal Processing, vol. 2, pp. 569–572. IEEE, San Francisco (1992)
Google Scholar
Kanada, Y.: Simulated virtual market place by using voiscape communication medium. In: Proceedings of 13th Annual ACM International Conference on Multimedia, pp. 794–795, Singapore (2005)
Google Scholar
Kilgore, R., Chignell, M.: Spatialized audioconferencing: What are the benefits? In: Proceedings of Conference of Centre for Advanced Studies Conference on Collaborative Research, pp. 111–120 (2003)
Google Scholar
Kilgore, R., Chignell, M.: The Vocal village: enhancing collaboration with spatialized audio-conferencing. In: Proceedings of World Conference on E-Learning in Corporate, Government, Healthcare and Higher Education (ELEARN) (2004)
Google Scholar
Kitashima, Y., Kondo, K., Nakagawa, K.: A speech communication environment using open source software library for active sound image control. J. Acoust. Soc. Am. 120(5), 3379 (2006)
Google Scholar
Kitashima, Y., Kondo, K., Terada, H., Chiba, T., Nakagawa, K.: Intelligibility of read Japanese words with competing noise in virtual acoustic space. Acoust. Sci. Tech. 29(1), 74–81 (2008)
Article Google Scholar
Kondo, K., Izumi, R., Fujimori, M., Kaga, R., Nakagawa, K.: On a two-to-one selection based Japanese intelligibility test. J. Acoust. Soc. Jpn. 63(4), 196–205 (2007) (in Japanese)
Google Scholar
Mesgarani, N., Grant, K.W., Duraiswami, R., Shamma, S.: Augmented intelligibility in simultaneous multi-talker environments. In: Proceedings of International Conference on Auditory Display, pp. 71–74 (2003)
Google Scholar
Rice University: Signal Processing Information Base (SPIB). http://spib.rice.edu/spib/select_noise.html
Sugita, K., Yokota, M.: Practical research on a large scale video conference system. In: Proceedings of DICOMO, pp. 595–600 (2007)
Google Scholar
Uchida, K., Nishino, T., Itou, K., Takeda, K., Itakura, F.: Evaluating the sound localization based on auditory masking. In: Technical Report of the IEICE, No. EA2003-121, pp. 15–20 (2005)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Electrical Engineering, Graduate School of Science and Engineering, Yamagata University, Jonan, Yonezawa 4-3-16, Yamagata, 992-8510, Japan
Kazuhiro Kondo

Authors

Kazuhiro Kondo
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Kazuhiro Kondo .

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Kondo, K. (2012). DRT Evaluation of Localized Speech Intelligibility in Virtual 3-D Acoustic Space. In: Subjective Quality Measurement of Speech. Signals and Communication Technology. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-27506-7_6

Download citation

DOI: https://doi.org/10.1007/978-3-642-27506-7_6
Published: 07 February 2012
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-27505-0
Online ISBN: 978-3-642-27506-7
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics