Abstract
Traditional methods for fusing multiple sensing modalities fall into three classes: 1) data-level fusion, 2) feature-level fusion, and 3) decision-level fusion. This paper proposes a novel fusion and fission framework oriented toward decision-level fusion, and implements a WPS (Wearable Personal Station) and Voice-XML-based Multi-Modal Fusion Agent (hereinafter, MMFA) using audio and gesture modalities. Because the MMFA assigns a different weight and a feedback function to each individual recognizer according to the SNNR (Signal Plus Noise to Noise Ratio) and a fuzzy value, it can select the optimal instruction-processing interface for a given situation or noisy environment, and it allows more interactive communication in noisy environments. In addition, the MMFA provides a wider range of personalized information more effectively, and it does not need the complicated mathematical algorithms and computation costs associated with multidimensional features and pattern (data) size, because it uses a WPS and a distributed-computing-based database and SQL logic for synchronization and fusion between modalities.
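The weighting idea described in the abstract can be illustrated with a minimal sketch. The abstract does not give the MMFA's actual weighting rule, thresholds, or feedback logic, so everything below is an assumption made for illustration: the function names (`snnr_db`, `audio_weight`, `fuse`), the linear ramp between `low_db` and `high_db`, and the `min_score` cut-off are hypothetical. Only the decibel form of a signal-plus-noise to noise ratio, 10·log10((S+N)/N), is standard.

```python
import math
from typing import Optional, Tuple


def snnr_db(signal_plus_noise_power: float, noise_power: float) -> float:
    """Signal-plus-noise to noise ratio in decibels: 10*log10((S+N)/N)."""
    if signal_plus_noise_power <= 0 or noise_power <= 0:
        raise ValueError("powers must be positive")
    return 10.0 * math.log10(signal_plus_noise_power / noise_power)


def audio_weight(snnr: float, low_db: float = 5.0, high_db: float = 25.0) -> float:
    """Map an SNNR value to a weight in [0, 1] with a linear ramp.

    The thresholds are hypothetical, not taken from the paper.
    """
    if snnr <= low_db:
        return 0.0
    if snnr >= high_db:
        return 1.0
    return (snnr - low_db) / (high_db - low_db)


def fuse(
    speech_cmd: str,
    gesture_cmd: str,
    gesture_fuzzy: float,
    snnr: float,
    min_score: float = 0.2,
) -> Optional[Tuple[str, str]]:
    """Decision-level fusion: pick the recognizer with the higher score.

    Speech is scored by the SNNR-derived weight; gesture is scored by its
    fuzzy membership value scaled by the remaining weight (an assumed rule).
    Returns None when neither score reaches min_score, which a caller could
    treat as a feedback request to repeat the instruction.
    """
    w_audio = audio_weight(snnr)
    speech_score = w_audio
    gesture_score = gesture_fuzzy * (1.0 - w_audio)
    if max(speech_score, gesture_score) < min_score:
        return None
    if speech_score >= gesture_score:
        return ("speech", speech_cmd)
    return ("gesture", gesture_cmd)


if __name__ == "__main__":
    quiet = snnr_db(100.0, 1.0)   # clean audio, high SNNR
    noisy = snnr_db(2.0, 1.0)     # noise nearly as strong as the signal
    print(fuse("open mail", "close mail", 0.9, quiet))
    print(fuse("open mail", "close mail", 0.9, noisy))
```

In this sketch a high SNNR favours the speech recognizer, a low SNNR shifts the decision to the gesture recognizer, and a low score on both sides yields no decision at all, which is one simple way to model a feedback step.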
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kim, JH., Hong, KS. (2007). WPS and Voice-XML-Based Multi-Modal Fusion Agent Using SNNR and Fuzzy Value. In: Enokido, T., Barolli, L., Takizawa, M. (eds) Network-Based Information Systems. NBiS 2007. Lecture Notes in Computer Science, vol 4658. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74573-0_54
DOI: https://doi.org/10.1007/978-3-540-74573-0_54
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74572-3
Online ISBN: 978-3-540-74573-0
eBook Packages: Computer Science, Computer Science (R0)
