Abstract
This paper presents a way to improve the current situation by introducing a newly developed computational tool that is capable of speech recognition and voice output and includes a keypad-free user interface, resembling a Smartphone message box, to enhance human-machine interaction experience. Furthermore, it relies on effectively offloading computation to a remote online service. The main ideas behind this paper are the possibility of transforming tangible daily tools into specially designed interface agents that enable voice communication with users, and the possibility of utilizing available online information databases and well as online services that rely on remote machines instead of utilizing local computation. An important novelty of the presented work is also the fact that it was designed on the basis of empirical data arising from an appropriate Wizard-of-Oz-like experiment that was performed with almost one hundred people, and thus high-quality recognition of commonly occurring natural language queries was achieved.
Keywords
- Human Computer Interface
- Spoken Dialogue System
- Language Model Design
- Online Services
This is a preview of subscription content, access via your institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Varile, G.B., Zampolli, A.: Survey of the State of the Art in Human Language Technology. Cambridge University Press, Cambridge (1997)
Myers, B.: A Brief History of Human-Computer Interaction Technology. Interact 5, 44–54 (1998)
Baecker, R.M., Grudin, J., Buxton, W.A.S., Greenberg, S.: Readings in Human-Computer Interaction. Morgan Kaufmann (1995)
Casali, S.P., Williges, B.H., Dryden, R.D.: Effects Recognition Accuracy and Vocabulary Size of a Speech Recognition System on Task Performance and User Acceptance. Journal of Human Factors 32, 183–196 (1990)
Hernandez-Abrego, G., Olorenshaw, L., Tato, R., Schaaf, T.: Dictionary Refinemets Based on Phonetic Consensus and Non-uniform Pronunciation Reduction. In: 8th International Conference on Spoken Language Processing, pp. 1697–1700. IEEE Press, New York (2004)
Schwartz, E.: Speech Recognition Grammar Specification. W3C Recommendation (2004)
Karat, C.M., Vergo, J., Nahamoo, D.: Conversational Interface Technologies. In: The Human-Computer Interaction Handbook, pp. 169–186. Lawrence Erlbaum Associates (2003)
Bernsen, N.O., Dybkjaer, H., Dybkjaer, L.: Designing Interactive Speech Systems – From First Idea to User Testing. Springer (1998)
Clarkson, P., Rosenfeld, R.: Statistical Language Modeling Using the CMU-Cambridge Toolkit. In: 5th European Conference on Speech Communication and Technology, pp. 2707–2710 (1997)
Katz, S.M.: Estimation of Probability from Sparse Data for the Language Model Component of a Speech Recognizer. IEEE Transactions on Acoustics, Speech, and Signal Processing 35, 400–401 (1987)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Liu, X., Liang, H., Dong, H., Mavridis, N. (2012). Development of a Novel Conversational Calculator Based on Remote Online Computation. In: Huang, T., Zeng, Z., Li, C., Leung, C.S. (eds) Neural Information Processing. ICONIP 2012. Lecture Notes in Computer Science, vol 7663. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34475-6_18
Download citation
DOI: https://doi.org/10.1007/978-3-642-34475-6_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-34474-9
Online ISBN: 978-3-642-34475-6
eBook Packages: Computer ScienceComputer Science (R0)
