Abstract
In this paper, we present a universal assistive technology with multimodal input and multimedia output interfaces. We describe its conceptual model and its software-hardware architecture, including levels and components. The architecture comprises five interconnected levels: computer hardware; system software; application software for digital signal processing; application software for human-computer interfaces; and software for assistive information technologies. The universal assistive technology offers several multimodal systems and interfaces to people with disabilities: an audio-visual Russian speech recognition (AVSR) system, a “Talking head” synthesis system (text-to-audiovisual speech), a “Signing avatar” synthesis system (visual sign language synthesis), the ICANDO multimodal system (hands-free PC control), and a control system for an assistive smart space.
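The five-level architecture and the component systems named above can be sketched as a minimal data model. This is an illustrative sketch only: the class names, attribute names, and modality labels are our assumptions for exposition, not the authors' actual software interfaces.

```python
from dataclasses import dataclass
from enum import Enum, auto

class Level(Enum):
    """The five interconnected architecture levels, bottom to top."""
    COMPUTER_HARDWARE = auto()
    SYSTEM_SOFTWARE = auto()
    SIGNAL_PROCESSING = auto()       # application software for digital signal processing
    HCI_APPLICATIONS = auto()        # application software for human-computer interfaces
    ASSISTIVE_TECHNOLOGIES = auto()  # software for assistive information technologies

@dataclass
class AssistiveSystem:
    """One multimodal system exposed at the top architecture level."""
    name: str
    inputs: list[str]   # input modalities
    outputs: list[str]  # output modalities / media

# The systems listed in the abstract, with plausible input/output modalities.
SYSTEMS = [
    AssistiveSystem("AVSR", ["audio", "video"], ["recognized text"]),
    AssistiveSystem("Talking head", ["text"], ["audio", "facial animation"]),
    AssistiveSystem("Signing avatar", ["text"], ["sign language animation"]),
    AssistiveSystem("ICANDO", ["speech", "head movements"], ["PC control commands"]),
    AssistiveSystem("Smart space control", ["speech", "gestures"], ["device commands"]),
]

assert len(Level) == 5   # five interconnected levels
assert len(SYSTEMS) == 5  # five multimodal systems/interfaces
```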
© 2014 Springer International Publishing Switzerland
Karpov, A., Ronzhin, A. (2014). A Universal Assistive Technology with Multimodal Input and Multimedia Output Interfaces. In: Stephanidis, C., Antona, M. (eds) Universal Access in Human-Computer Interaction. Design and Development Methods for Universal Access. UAHCI 2014. Lecture Notes in Computer Science, vol 8513. Springer, Cham. https://doi.org/10.1007/978-3-319-07437-5_35
Print ISBN: 978-3-319-07436-8
Online ISBN: 978-3-319-07437-5