Low-Cost Portable Text Recognition and Speech Synthesis with Generic Laptop Computer, Digital Camera and Software

  • Lauri Lahti
  • Jaakko Kurhila
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4555)

Abstract

Blind persons or people with reduced eyesight could benefit from a portable system that can interpret textual information in the surrounding environment and speak directly to the user. The need for such a system was surveyed with a questionnaire, and a prototype system was built using generic, inexpensive components readily available. The system architecture is component-based so that every module can be replaced with another generic module. Even though the system makes partly incorrect recognition of text in a versatile environment, the evaluation of the system with five actual users suggested that the system can provide genuine additional value in coping with everyday issues outdoors.

Keywords

Text recognition speech synthesis independent initiative 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    C.A. Technologies, Inc. P2RD: Portable Print Reading Device (2004), http://www.catechnology.net/
  2. 2.
    Doermann, D., Liang, J., Li, H.: Progress in Camera-Based Document Image Analysis. In: ICDAR ’03: Proc. seventh international conference on Document analysis and recognition, p. 606. IEEE Computer Society Press, Washington (2003)CrossRefGoogle Scholar
  3. 3.
    Ezaki, N., Bulacu, M., Schomaker, L.: Text Detection from Natural Scene Images: Towards a System for Visually Impaired Persons. In: ICPR 2004: Proc. Pattern Recognition, 17th international conference on (ICPR 2004), vol. 2, pp. 683–686. IEEE Computer Society Press, Washington (2004)CrossRefGoogle Scholar
  4. 4.
    Gaudissart, V., Ferreira, S., Mancas-Thillou, C., Gosselin, B.: Sypole: A Mobile Assistant for the Blind. In: Proc. European Signal Processing Conference (EUSIPCO 2005), Antalya, Turkey (2005), http://tcts.fpms.ac.be/publications/papers/2005/eusipco05_vgsfcmbg.pdf
  5. 5.
    JSS Computing, Inc. QuickTextScan (2006), http://jsscomputing.com/quicktextscan/
  6. 6.
    Krishna, S., Little, G., Black, J., Panchanathan, S.: iCARE Interaction Assistant: A Wearable Face Recognition System for Individuals with Visual Impairments. In: Assets 2005: Proc. 7th international ACM SIGACCESS conference on Computers and accessibility, pp. 216–217. ACM Press, New York (2005)CrossRefGoogle Scholar
  7. 7.
    Krishna, S., Little, G., Black, J., Panchanathan, S.: A Wearable Face Recognition System for Individuals with Visual Impairments. In: Assets 2005: Proc. 7th international ACM SIGACCESS conference on Computers and accessibility, pp. 106–113. ACM Press, New York (2005)CrossRefGoogle Scholar
  8. 8.
    Lahti, L.: Computer-assisted acquisition of information for visually impaired (in Finnish). Master’s thesis, University of Helsinki, Faculty of Science, Department of Computer Science, Report C-2006-32 (2006), http://ethesis.helsinki.fi/julkaisut/mat/tieto/pg/lahti/
  9. 9.
    Mallett, C.: AutoHotkey: Open Source Mouse and Keyboard Macro Program (2006), http://www.autohotkey.com/
  10. 10.
    Meijer, P.: The vOICe: A Synthetic Vision for the Blind (2006), http://www.seeingwithsound.com/
  11. 11.
    Mihara, Y., Sugimoto, A., Shibayama, E., Takahashi, S.: An Interactive Braille-Recognition System for the Visually Impaired Based on a Portable Camera. In: Proc. CHI 2005: extended abstracts on Human factors in computing systems, pp. 1653–1656. ACM Press, New York (2005)CrossRefGoogle Scholar
  12. 12.
    Nagel, H.-H.: Steps toward a Cognitive Vision System. AI Magazine 25(2), 31–50 (2004)Google Scholar
  13. 13.
    Nullsoft Ltd. Winamp: Media Player for Windows (2006), http://www.winamp.com/
  14. 14.
    Paradise, J., Mynatt, E.D., Williams, C., Goldthwaite, J.: Designing a Cognitive Aid for the Home: A Case-Study Approach. In: Assets 2004: Proc. 6th international ACM SIGACCESS conference on Computers and accessibility, pp. 140–146. ACM Press, New York (2004)CrossRefGoogle Scholar
  15. 15.
    Ross, D.A., Lightman, A.: Talking Braille: A Wireless Ubiquitous Computing Network for Orientation and Wayfinding. In: Assets 2005: Proc. 7th international ACM SIGACCESS conference on Computers and accessibility, pp. 98–105. ACM Press, New York (2005)CrossRefGoogle Scholar
  16. 16.
    Timehouse Oy. Mikropuhe: Finnish Speech Synthesis Software (2003), http://www.mikropuhe.com/mikropuhe.asp
  17. 17.
    TopSoft Ltd. TopOCR: Optical Character Recognition Software (2005), http://www.topocr.com/
  18. 18.
    VoiceSignal Technologies, Inc. VSpeak: Speech Synthesis for Mobile Phones (2006), http://www.voicesignal.com/solutions/applications.php3
  19. 19.
    Yang, J., Gao, J., Zhang, Y., Waibel, A.: Towards Automatic Sign Translation. In: HLT 2001: Proc. 1st international conference on Human language technology research, pp. 1–6. Association for Computational Linguistics, Morristown, NJ, USA (2001)Google Scholar
  20. 20.
    Zandifar, A., Chahine, A.: A Video Based Interface to Textual Information for the Visually Impaired. In: ICMI 2002: Proc. 4th IEEE international conference on Multimodal interfaces, p. 325. IEEE Computer Society, Washington (2002)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2007

Authors and Affiliations

  • Lauri Lahti
    • 1
  • Jaakko Kurhila
    • 2
  1. 1.Department of Computer Science and Engineering, P.O. Box 5400, FIN-02015 Helsinki University of TechnologyFinland
  2. 2.Department of Computer Science, P.O. Box 68, FIN-00014 University of HelsinkiFinland

Personalised recommendations