Abstract
Current Automatic Speech Recognition (ASR) systems designed to recognize dysarthric speech require an investment in training that involves considerable effort and must be repeated if speech patterns change. We present CanSpeak, a customizable speech recognition interface that does not require automatic training and uses a list of keywords customized for each user. We conducted a preliminary user study with four subjects with dysarthric speech. Customizing the keyword lists doubled the accuracy rate of the system for two of the subjects whose parents and caregivers participated in the customizing task. For the other two subjects only small improvements were observed.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Blaney, B., Wilson, J.: Acoustic Variability in Dysarthria and Computer Speech Recognition. Clinical Linguistics and Phonetics 14(4), 307–327 (2000)
Chen, F., Kostov, A.: Optimization of Dysarthric Speech Recognition. In: Proc. of IEEE/EMBS 1997, pp. 1436–1439. IEEE Press, New York (1997)
Clark, J.A., Roemer, R.B.: Voice Controller Wheelchair. Archives of Physical Medicine & Rehabilitation 58(4), 169–175 (1977)
Cohen, A., Graupe, D.: Speech Recognition and Control System for the Severely Disabled. Journal of Biomedical Engineering 2(2), 97–107 (1980)
Franco, H., Abrash, V., Precoda, K., Bratt, H., Rao, R., Butzberger, J., Rossier, R., Cesari, F.: The SRI EduSpeak System: Recognition and Pronunciation Scoring for Language Learning. In: Proc. of InSTL 2000, pp. 123–128 (2000)
Fried-Oken, M.: Voice Recognition Device as a Computer Interface for Motor and Speech Impaired People. Archives of Physical Medicine and Rehabilitation 34(3), 678–681 (1985)
Grammenos, D., Savidis, A., Georgalis, Y., Stephanidis, C.: Access Invaders: Developing a Universally Accessible Action Game. In: Miesenberger, K., Klaus, J., Zagler, W.L., Karshmer, A.I. (eds.) ICCHP 2006. LNCS, vol. 4061, pp. 388–395. Springer, Heidelberg (2006)
Green, P., Carmichael, J., Hatzis, A., Enderby, P., Hawley, M., Parker, M.: Automatic Speech Recognition with Sparse Training Data for Dysarthric Speakers. In: Proc. of EUROSPEECH 2003, pp. 1189–1192 (2003)
Hasegawa-Johnson, M., Gunderson, J., Perlman, A., Huang, T.: HMM-Based and SVM-Based Recognition of the Speech of Talkers with Spastic Dysarthria. In: Proc. of ICASSP 2006, pp. 1060–1063. IEEE Press, New York (2006)
Hawley, M., Enderby, P., Green, P., Brownsell, S., Hatzis, A., Parker, M., Carmichael, J., Cunningham, M., O’Neill, P., Palmer, R.: STARDUST:Speech Training and Recognition for Dysarthric Users of Assistive Technology. In: Craddock, G.M., McCormack, L.P., Reilly, R.B., Knops, H. (eds.) Assistive Technology – Shaping the Future, pp. 959–964. IOS Press, Amsterdam (2003)
Hawley, M., Enderby, P., Green, P., Cunningham, S., Palmer, R.: Development of a Voice-Input Voice-Output Communication Aid (VIVOCA) for People with Severe Dysarthria. In: Craddock, G.M., McCormack, L.P., Reilly, R.B., Knops, H. (eds.) Assistive Technology – Shaping the Future, pp. 882–885. IOS Press, Amsterdam (2003)
Kawai, S., Aida, H., Saito, T.: Designing Interface Toolkit with Dynamic Selectable Modality. In: Proc. of ASSETS 1996, pp. 72–79. ACM Press, New York (1996)
Kotler, A., Thomas-Stonell, N.: Effects of Speech Training on the Accuracy of Speech Recognition for an Individual with Speech Impairment. Journal of Augmentative and Alternative Communication 13(2), 71–80 (1997)
Manasse, N.J., Hux, K., Rankin-Erickson, J.L.: Speech Recognition Training for Enhancing Written Language Generation by a Traumatic Brain Injury Survivor. Brain Injury 14(11), 1015–1034 (2000)
Menéndez-Pidal, X., Polikoff, J.B., Peters, S.M., Leonzio, J.E., Bunnell, H.T.: The Nemours Database of Dysarthric Speech. In: Proc. of ICSLP 1996, pp. 1962–1965. IEEE Press, New York (1996)
Netsell, R., Abbs, J.: Acoustic Characteristics of Dysarthria Associated with Cerebellar Disease. Journal of Speech and Hearing Research 22, 627–648 (1997)
Patel, R., DiCicco, T.M.: Automatic Landmark Analysis of Dysartheric Speech. Journal of Medical Speech-Language Pathology 16(4), 221–224 (2008)
Raghavendra, P., Rosengren, E., Hunnicutt, S.: An Investigation of two Speech Recognition Systems with Dysarthric Speech as Input. In: Proc. of ISSAC 1994, pp. 479–481 (1994)
Rosen, K., Yampolsky, S.: Automatic Speech Recognition and a Review of its Functioning with Dysarthric Speech. Journal of Augmentative and Alternative Communication 16(1), 48–60 (2000)
Rosengren, E., Raghavendra, P., Hunnicut, S.: How Does Automatic Speech Recognition Handle Severely Dysarthric Speech? In: Porrero, P., Puig de la Bellacasa, R. (eds.) European Context for Assistive Technology, pp. 336–339. IOS Press, Amsterdam (1995)
Spalteholz, L., Lin, K.F., Livingston, N., Hamidi, F.: KeySurf: A Character Controlled Browser for People with Physical Disabilities.. In: Proc. of WWW 2008, pp. 31–39. ACM Press, New York (2008)
Walker, W., Lamere, P., Kwok, P., Raj, B., Singh, R., Gouvea, E., Wolf, P., Woelfel, J.: Sphinx-4: A Flexible Open Source Framework for Speech Recognition. Technical report, Sun Microsystems (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hamidi, F., Baljko, M., Livingston, N., Spalteholz, L. (2010). CanSpeak: A Customizable Speech Interface for People with Dysarthric Speech. In: Miesenberger, K., Klaus, J., Zagler, W., Karshmer, A. (eds) Computers Helping People with Special Needs. ICCHP 2010. Lecture Notes in Computer Science, vol 6179. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14097-6_97
Download citation
DOI: https://doi.org/10.1007/978-3-642-14097-6_97
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-14096-9
Online ISBN: 978-3-642-14097-6
eBook Packages: Computer ScienceComputer Science (R0)