Skip to main content

Developing a Voice Control System for a Wheeled Robot

  • Conference paper
  • First Online:
Biologically Inspired Cognitive Architectures 2023 (BICA 2023)

Part of the book series: Studies in Computational Intelligence ((SCI,volume 1130))

Included in the following conference series:

  • 165 Accesses

Abstract

In recent years, domestic robots have become more functional, leading to their integration in peoples’ daily routines. However, most users are not experienced enough in human–robot interaction, necessitating simplified interfaces to bridge this gap. One approach involves using natural language as an intuitive form of communication. Insofar as using natural language doesn`t require any special skills, it makes robot control easier for non-experts. The first section of this paper includes an overview of voice-control work to-date, with references to state-of-the-art approaches. The second section proposes a hybrid architecture for a voice-based interface, combining machine learning techniques and rule-based methods. This approach reaches 95.4% and 98.8% accuracy on a small and larger model in the case of using clear speech and 88.7% and 90.3% for mumbled speech.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 219.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 279.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Berg, J., Lu, S.: Review of Interfaces for industrial human-robot interaction. Curr. Robot. Rep. 1, 27–34 (2020). https://doi.org/10.1007/s43154-020-00005-6

    Article  Google Scholar 

  2. Tellex, S., Gopalan, N., Kress-Gazit, H.: Robots that use language. Ann. Rev. Control Robot. Autonom. Syst. 3, 25–55 (2020). https://doi.org/10.1146/annurev-control-101119-071628

    Article  Google Scholar 

  3. Can Bingol, M., Aydogmus, O.: Performing predefined tasks using the human–robot interaction on speech recognition for an industrial robot. Eng. Appl. Artif. Intell. 95, id: 103903 (2020). https://doi.org/10.1016/j.engappai.2020.103903

  4. Bakouri, M., Alsehaimi, M., Ismail, H.F., Alshareef, K., Ganoun, A., Alqahtani, A., Alharbi, Y.: Steering a robotic wheelchair based on voice recognition system using convolutional neural networks. Electronics 11(1), id: 168 (2022). https://doi.org/10.3390/electronics11010168

  5. Sokolov, A., Savchenko, A.: Voice command recognition in intelligent systems using deep neural networks. In: IEEE 17th World Symposium on Applied Machine Intelligence and Informatics (SAMI), Herlany, pp. 113–116 (2019). https://doi.org/10.1109/SAMI.2019.8782755

  6. Ni, P., Li, Y., Li, G., et al.: Natural language understanding approaches based on joint task of intent detection and slot filling for IoT voice interaction. Neural Comput. Appl. 32, 16149–16166 (2020). https://doi.org/10.1007/s00521-020-04805-x

    Article  Google Scholar 

  7. Sun, R., Rao, L., Zhou, X.: A Joint model of natural language understanding for human-computer conversation in IoT. Wirel. Commun. Mob. Comput., id: 2074035 (2022). https://doi.org/10.1155/2022/2074035

  8. Tada, Y., Hagiwara, Y., Tanaka, H., Taniguchi, T.: Robust understanding of robot-directed speech commands using sequence to sequence with noise injection. Front. Robot. AI 6, id: 144 (2020). https://doi.org/10.3389/frobt.2019.00144

  9. Rubert-tiny: https://huggingface.co/cointegrated/rubert-tiny. Last accessed 10 June 2023

  10. Rubert-base-cased. https://huggingface.co/DeepPavlov/rubert-base-cased. Last accessed 26 June 2023

  11. Chomsky, N.: Three models for the description of language. IRE Trans. Inform. Theory 2(3), 113–124 (1956). https://doi.org/10.1109/TIT.1956.1056813

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Margarita Erlou .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Chepin, E., Gridnev, A., Erlou, M. (2024). Developing a Voice Control System for a Wheeled Robot. In: Samsonovich, A.V., Liu, T. (eds) Biologically Inspired Cognitive Architectures 2023. BICA 2023. Studies in Computational Intelligence, vol 1130. Springer, Cham. https://doi.org/10.1007/978-3-031-50381-8_24

Download citation

Publish with us

Policies and ethics