Skip to main content

WPS and Voice-XML-Based Multi-Modal Fusion Agent Using SNNR and Fuzzy Value

  • Conference paper
  • 756 Accesses

Part of the Lecture Notes in Computer Science book series (LNISA,volume 4658)

Abstract

The traditional fusion methods of multiple sensing modalities are summarized with 1) data-level fusion, 2) feature-level fusion and 3) decision-level fusion. This paper suggests the decision-level fusion-oriented novel fusion and fission framework, and it implements WPS (Wearable Personal Station) and Voice-XML-Based Multi-Modal Fusion Agent (hereinafter, MMFA) using audio-gesture modalities. Because the MMFA provides different weight and a feed-back function in individual recognizer, according to SNNR(Signal Plus Noise to Noise Ratio) and fuzzy value, it may select an optimal instruction processing interface under a given situation or noisy environment, and can allow more interactive communication functions in noisy environment. In addition, the MMFA provides a wider range of personalized information more effectively as well as it not need complicated mathematical algorithm and computation costs that are concerned with multidimensional features and patterns (data) size, according as it use a WPS and distributed computing-based database and SQL-logic, for synchronization and fusion between modalities.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (Canada)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (Canada)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (Canada)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Cenek, P., Melichar, M., Rajman, M.: A Framework for Rapid Multimodal Application Design. In: Matoušek, V., Mautner, P., Pavelka, T. (eds.) TSD 2005. LNCS (LNAI), vol. 3658, pp. 393–403. Springer, Heidelberg (2005)

    Google Scholar 

  2. Rajman, M., et al.: Assessing the usability of a dialogue management system designed in the framework of a rapid dialogue prototyping methodology. EAA 90, 1096–1111 (2004)

    Google Scholar 

  3. Bernsen, N.O.: Modality Theory: Supporting Multi-modal Interface Design. In: Proc. ERCIM (1993)

    Google Scholar 

  4. Bernsen, N.O.: A toolbox of output modalities. In: Representing output information in multimodal interfaces, Roskilde University (1995)

    Google Scholar 

  5. Martin, J.-C.: Towards intelligent cooperation between modalities. In: Proc. IJCAI 1997, Nagoya, Japan (1997)

    Google Scholar 

  6. Coutaz, J., Nigay, L., Salber, D., Blandford, A., May, J., Young, R.M.: Four Easy Pieces for Assessing the Usability of Multimodal Interaction: The CARE Properties. In: Proc.Interact 1995, pp. 115–120. Chapman & Hall, Sydney, Australia (1995)

    Google Scholar 

  7. Kim, S.-G.: Korean Standard Sign Language Tutor, 1st edn. Osung Publishing Company, Seoul (2000)

    Google Scholar 

  8. Kim, J.-H., et al.: An Implementation of KSSL Recognizer for HCI Based on Post Wearable PC and Wireless Networks KES 2006. In: Gabrys, B., Howlett, R.J., Jain, L.C. (eds.) KES 2006. LNCS (LNAI), vol. 4251, pp. 788–797. Springer, Heidelberg (2006)

    CrossRef  Google Scholar 

  9. Chen, C.H.: Fuzzy Logic and Neural Network Handbook. McGraw-Hill, New York (1992)

    Google Scholar 

  10. kandasamy, W.B.V.: Smaranda Fuzzy Algebra. American Research Press, Seattle (2003)

    Google Scholar 

  11. McGlashan, S., et al.: Voice Extensible Markup Language (VoiceXML) Version 2.0. W3C Recommendation (1992), http://www.w3.org

  12. Martin, W.H.: DeciBel -The New Name for the Transmission Unit. Bell System Technical Journal (January 1929)

    Google Scholar 

  13. NIOSH working group.: STRESS..AT WORK NIOSH, Publication No. 99-101,U.S. National Institutes of Occupational Health (2006)

    Google Scholar 

  14. Kim., J.-H., et al.: Hand Gesture Recognition System using Fuzzy Algorithm and RDBMS for Post PC. In: Wang, L., Jin, Y. (eds.) FSKD 2005. LNCS (LNAI), vol. 3614, pp. 170–175. Springer, Heidelberg (2005)

    Google Scholar 

  15. i.MX21 Processor Data-sheet: http://www.freescale.com

Download references

Author information

Authors and Affiliations

Authors

Editor information

Tomoya Enokido Leonard Barolli Makoto Takizawa

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Kim, JH., Hong, KS. (2007). WPS and Voice-XML-Based Multi-Modal Fusion Agent Using SNNR and Fuzzy Value. In: Enokido, T., Barolli, L., Takizawa, M. (eds) Network-Based Information Systems. NBiS 2007. Lecture Notes in Computer Science, vol 4658. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74573-0_54

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-74573-0_54

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-74572-3

  • Online ISBN: 978-3-540-74573-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics