Detection of Dialogue in Movie Soundtrack for Speech Intelligibility Enhancement
A method for detecting dialogue in 5.1 movie soundtrack based on interchannel spectral disparity is presented. The front channel signals (left, right, center) are analyzed in the frequency domain. The selected partials in the center channel signal, which yield high disparity with left and right channels, are detected as dialogue. Subsequently, the dialogue frequency components are boosted to achieve increased dialogue intelligibility. The techniques for reduction of artifacts in the processed signal are also introduced. Smoothing in the time domain and in the frequency domain is applied to reduce unpleasant artifacts. The results of objective tests are provided, which prove that increased dialogue intelligibility is achieved with the aid of the proposed algorithm. The algorithm is particularly applicable in mobile devices while listening in mobile conditions.
Keywordsspeech intelligibility center channel extraction speech processing 5.1 downmix
Unable to display preview. Download preview PDF.
- 1.Technical Specification: ETSI TS 102 563 V1.2.1, European Telecommunication Standards Institute (2010)Google Scholar
- 2.Technical standard: ISO/IEC 14496-3:2009, Information technology – Coding of audio-visual objects – Part 3: Audio, International Standards Organization (2009)Google Scholar
- 3.Technical standard: ITU-R B S.775-3 - Multichannel stereophonic sound system with and without accompanying picture. International Telecommunication Union (2006)Google Scholar
- 4.Lopatka, K., Czyzewski, A.: Method and apparatus for speech clarity enhancement in multichannel multimedia signal, especially audio-visual signal. Polish patent application no. P.402373 (January 7, 2013)Google Scholar
- 9.Barry, D., Lawlor, R., Coyle, E.: Real-time sound source separation: Azimuth discrimination and resynthesis. In: 117th Audio Engineering Society Convention. AES, San Francisco (2004)Google Scholar
- 10.Lopatka, K., Kunka, B., Czyzewski, A.: Novel 5.1 downmix algorithm with improved dialogue intelligibility. In: 134th Audio Engineering Society Convention, May 4-7. AES, Rome (2013)Google Scholar
- 15.ITU-T Recommendation P.800, Methods for Subjective Determination of Transmission Quality, ITU (1996)Google Scholar
- 16.Opticom software homepage (2013), http://www.opticom.de
- 17.Stabinski, A.: Multimedia database for evaluation of downmix quality, Master thesis, Gdansk University of Technology (2013)Google Scholar
- 18.Digital Audio Compression Standard (AC-3, E-AC-3), ATSC (2010)Google Scholar