Development of a Voice-Input Voice-Output Communication Aid (VIVOCA) for People with Severe Dysarthria
This paper describes an approach to the development of a voice-input voice-output communication aid (VIVOCA) for people with disordered or unintelligible speech, initially concentrating on people with moderate to severe dysarthria. The VIVOCA is intended to recognize and interpret an individual’s disordered speech and speak out an equivalent message in clear synthesized speech. User consultation suggests that such a device would be acceptable and would be useful in communication situations where speed and intelligibility are crucial. Speech recognition techniques will build on previously successful development of speech-based home control interfaces, and various methods for speech ‘translation’ are being evaluated.
KeywordsAssistive Technology Automatic Speech Recognition Large Vocabulary Translation Scheme Speech Technology
Unable to display preview. Download preview PDF.
- 1.Enderby, P., Emerson, L.: Does Speech and Language Therapy Work?, p. 84. Singular Publications (1995)Google Scholar
- 2.Hawley, M.S., Enderby, P., Green, P., Brownsell, S., Hatzis, A., Parker, M., Carmichael, J., Cunningham, S., O’Neill, P., Palmer, R.: STARDUST; Speech Training And Recognition for Dysarthric Users of Assistive Technology. In: Craddock, G.M., et al. (eds.) Assistive Technology – Shaping the Future, pp. 959–964. IOS Press, Amsterdam (2003)Google Scholar
- 3.Green, P.D., Carmichael, J., Hatzis, A., Enderby, P.M., Hawley, M., Parker, M.P.: Automatic Speech Recognition with Sparse Training Data for Dysarthric Speakers. In: Proc. European Conference on Speech Technology (Eurospeech), Geneva, pp. 1189–1192 (2003)Google Scholar
- 4.Holmes, J.N., Holmes, W.: Speech Synthesis and Recognition. Taylor & Francis, Abington (2001)Google Scholar
- 5.Hatzis, A., Green, P.D., Carmichael, J., Cunningham, S.P., Palmer, R., Parker, M.P., O’Neill, P.: An Integrated Toolkit Deploying Speech Technology for Computer Based Speech Training with Application to Dysarthric Speakers. In: Proc. European Conference on Speech Technology (Eurospeech), Geneva, pp. 2213–2216 (2003)Google Scholar
- 7.Hermansky, H., Morgan, N.: RASTA processing of speech. IEEE Trans. Speech & Audio Proc. 2, 587–589 (1994)Google Scholar
- 9.Cross, R.T., Baker, B.R., Klotz, L.V., Badman, A.L.: Semantic compaction in both static and dynamic environments: a new synthesis. In: CSUN conference, Los Angeles (March 1998), Available at http://www.dinf.ne.jp/doc/english/Us_Eu/conf/csun_98/csun98_064.htm