CanSpeak: A Customizable Speech Interface for People with Dysarthric Speech

Hamidi, Foad; Baljko, Melanie; Livingston, Nigel; Spalteholz, Leo

doi:10.1007/978-3-642-14097-6_97

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 6179)

Included in the following conference series:

International Conference on Computers for Handicapped Persons

Abstract

Current Automatic Speech Recognition (ASR) systems designed to recognize dysarthric speech require an investment in training that involves considerable effort and must be repeated if speech patterns change. We present CanSpeak, a customizable speech recognition interface that does not require automatic training and uses a list of keywords customized for each user. We conducted a preliminary user study with four subjects with dysarthric speech. Customizing the keyword lists doubled the accuracy rate of the system for two of the subjects, whose parents and caregivers participated in the customizing task. For the other two subjects, only small improvements were observed.

Author information

Authors and Affiliations

Department of Computer Science and Engineering, York University, 4700 Keele St., Toronto, Ontario, Canada, M3J 1P3
Foad Hamidi & Melanie Baljko
CanAssist, University of Victoria, Victoria, BC, Canada, V8W 3P6
Nigel Livingston & Leo Spalteholz

Editor information

Editors and Affiliations

Institut Integriert Studieren, University of Linz, Altenbergerstraße 49, 4040, Linz, Austria
Klaus Miesenberger
Studienzentrum fuer Sehgeschaedigte, Universitaet Karlsruhe (TH), Germany
Joachim Klaus
fortec - Research Group on Rehabilitation Technology, Institute integrated study, Vienna Univ. of Technology, Favoritenstr. 11/029, A 1040, Vienna, Austria
Wolfgang Zagler
University of San Francisco The Universal Math Access Lab, 281 Masonic San Francisco, CA 94117, USA
Arthur Karshmer

About this paper

Cite this paper

Hamidi, F., Baljko, M., Livingston, N., Spalteholz, L. (2010). CanSpeak: A Customizable Speech Interface for People with Dysarthric Speech. In: Miesenberger, K., Klaus, J., Zagler, W., Karshmer, A. (eds) Computers Helping People with Special Needs. ICCHP 2010. Lecture Notes in Computer Science, vol 6179. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14097-6_97

DOI: https://doi.org/10.1007/978-3-642-14097-6_97
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-14096-9
Online ISBN: 978-3-642-14097-6
eBook Packages: Computer Science, Computer Science (R0)
