Abstract
In recent years, domestic robots have become more functional, leading to their integration in peoples’ daily routines. However, most users are not experienced enough in human–robot interaction, necessitating simplified interfaces to bridge this gap. One approach involves using natural language as an intuitive form of communication. Insofar as using natural language doesn`t require any special skills, it makes robot control easier for non-experts. The first section of this paper includes an overview of voice-control work to-date, with references to state-of-the-art approaches. The second section proposes a hybrid architecture for a voice-based interface, combining machine learning techniques and rule-based methods. This approach reaches 95.4% and 98.8% accuracy on a small and larger model in the case of using clear speech and 88.7% and 90.3% for mumbled speech.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Berg, J., Lu, S.: Review of Interfaces for industrial human-robot interaction. Curr. Robot. Rep. 1, 27–34 (2020). https://doi.org/10.1007/s43154-020-00005-6
Tellex, S., Gopalan, N., Kress-Gazit, H.: Robots that use language. Ann. Rev. Control Robot. Autonom. Syst. 3, 25–55 (2020). https://doi.org/10.1146/annurev-control-101119-071628
Can Bingol, M., Aydogmus, O.: Performing predefined tasks using the human–robot interaction on speech recognition for an industrial robot. Eng. Appl. Artif. Intell. 95, id: 103903 (2020). https://doi.org/10.1016/j.engappai.2020.103903
Bakouri, M., Alsehaimi, M., Ismail, H.F., Alshareef, K., Ganoun, A., Alqahtani, A., Alharbi, Y.: Steering a robotic wheelchair based on voice recognition system using convolutional neural networks. Electronics 11(1), id: 168 (2022). https://doi.org/10.3390/electronics11010168
Sokolov, A., Savchenko, A.: Voice command recognition in intelligent systems using deep neural networks. In: IEEE 17th World Symposium on Applied Machine Intelligence and Informatics (SAMI), Herlany, pp. 113–116 (2019). https://doi.org/10.1109/SAMI.2019.8782755
Ni, P., Li, Y., Li, G., et al.: Natural language understanding approaches based on joint task of intent detection and slot filling for IoT voice interaction. Neural Comput. Appl. 32, 16149–16166 (2020). https://doi.org/10.1007/s00521-020-04805-x
Sun, R., Rao, L., Zhou, X.: A Joint model of natural language understanding for human-computer conversation in IoT. Wirel. Commun. Mob. Comput., id: 2074035 (2022). https://doi.org/10.1155/2022/2074035
Tada, Y., Hagiwara, Y., Tanaka, H., Taniguchi, T.: Robust understanding of robot-directed speech commands using sequence to sequence with noise injection. Front. Robot. AI 6, id: 144 (2020). https://doi.org/10.3389/frobt.2019.00144
Rubert-tiny: https://huggingface.co/cointegrated/rubert-tiny. Last accessed 10 June 2023
Rubert-base-cased. https://huggingface.co/DeepPavlov/rubert-base-cased. Last accessed 26 June 2023
Chomsky, N.: Three models for the description of language. IRE Trans. Inform. Theory 2(3), 113–124 (1956). https://doi.org/10.1109/TIT.1956.1056813
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Chepin, E., Gridnev, A., Erlou, M. (2024). Developing a Voice Control System for a Wheeled Robot. In: Samsonovich, A.V., Liu, T. (eds) Biologically Inspired Cognitive Architectures 2023. BICA 2023. Studies in Computational Intelligence, vol 1130. Springer, Cham. https://doi.org/10.1007/978-3-031-50381-8_24
Download citation
DOI: https://doi.org/10.1007/978-3-031-50381-8_24
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-50380-1
Online ISBN: 978-3-031-50381-8
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)