Advertisement

Study on Automated Audio Descriptions Overlapping Live Television Commentary

  • Manon Ichiki
  • Toshihiro Shimizu
  • Atsushi Imai
  • Tohru Takagi
  • Mamoru Iwabuchi
  • Kiyoshi Kurihara
  • Taro Miyazaki
  • Tadashi Kumano
  • Hiroyuki Kaneko
  • Shoei Sato
  • Nobumasa Seiyama
  • Yuko Yamanouchi
  • Hideki Sumiyoshi
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10896)

Abstract

We are conducting research on “automated audio description (AAD)” which automatically generates audio descriptions from real-time competition data for visually impaired people to enjoy live sports programs. However, there is a problem that AAD overlaps with the live television commentary voice, making it difficult to hear each other’s comment. In this paper, first, we show that the game situation is conveyed effectively when visually impaired persons listen to the AAD alone. Then we state the results of experiments on the following items to solve the overlap issue: (1) There is a difference in optimum volume level between live commentary and AAD, (2) The ease of listening differs depending on the difference in the characteristics of text-to-speech synthesizer for AAD, (3) Playing back AAD through a speaker placed differently from the TV speaker makes both voice sounds easier to listen to. We had clues to solve that depending on the presentation method of AAD, we can make AAD easy to listen to even when AAD overlaps the live television commentary.

Keywords

Audio description Text-to-speech synthesizer Visually impaired 

References

  1. 1.
    International Olympic Committee. Olympic Data Feed. http://odf.olympictech.org/. Accessed 23 Mar 2018
  2. 2.
    Data creative & technology initiative for sports communities. http://www.datastadium.co.jp/. Accessed 23 Mar 2018. (in Japanese)
  3. 3.
    Taro, M.: Automatic generation of audio description for olympics/paralympics programs. In: NAB (National association of broadcasters) Show Conference, Broadcast Engineering and Information Technology Conference N256 (2017)Google Scholar
  4. 4.
    Kiyoshi, K.: Automatic Generation of Audio Description for Sports Programs, IBC (International Broadcasting Convention) (2017)Google Scholar
  5. 5.
    Shoei, S.: Speech Guides Helpful for Understanding Sports Broadcast, Spring Meeting of the Acoustical Society of Japan, 1-4-3, pp. 1547–1548 (2016). (in Japanese)Google Scholar
  6. 6.
    Shoei, S.: Utterance Classification for Automatic Audio Description of Sports Broadcasts, ITE Technical report, 11C-4 (2017). (in Japanese)Google Scholar
  7. 7.
    Atsushi, I.: A study on new service for overlapped audio information on TV programs. In: Proceedings of IEICE General Conference, H-4-11 (2016). (in Japanese)Google Scholar

Copyright information

© Springer International Publishing AG, part of Springer Nature 2018

Authors and Affiliations

  • Manon Ichiki
    • 1
  • Toshihiro Shimizu
    • 1
  • Atsushi Imai
    • 1
  • Tohru Takagi
    • 2
  • Mamoru Iwabuchi
    • 3
  • Kiyoshi Kurihara
    • 1
  • Taro Miyazaki
    • 1
  • Tadashi Kumano
    • 1
  • Hiroyuki Kaneko
    • 1
    • 2
    • 3
  • Shoei Sato
    • 1
  • Nobumasa Seiyama
    • 1
  • Yuko Yamanouchi
    • 1
  • Hideki Sumiyoshi
    • 1
  1. 1.NHK (Nippon Hoso Kyokai; Japan Broadcasting Corp.) Science and Technical Research LaboratoriesTokyoJapan
  2. 2.NHK Engineering SystemsTokyoJapan
  3. 3.University of TokyoTokyoJapan

Personalised recommendations