Voice Enabled Form Filling Using Hidden Markov Model

Sallagundla, Babu; Kethavath, Bharath Naik; Mitaigiri, Shaik Arshad Hussain; Kata, Siddartha; Merla, Kodandaram Sri Satya Sai

doi:10.1007/978-3-031-56700-1_18

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 2053))

Included in the following conference series:

International Advanced Computing Conference

89 Accesses

Abstract

Speech Recognition technology is widely used for voice-enabled form filling. The manual process of filling out forms by typing has become increasingly challenging and time-consuming. This issue is particularly evident in various locations such as job applications and internships. To address this problem, a solution is proposed as a system that automates the form-filling process using speech recognition technology. The ability to operate anything with voice command is a crucial factor in today’s environment. The proposed system is that it automatically fills out the forms. i.e., the system analyses the user’s unique voice, identifies the user’s speech, and then transcribes the speech into text. This paper proposes a machine-learning model that builds on Hidden Markov Model. The model will be trained and tested on this system and the proposed pre-processed methodology is Mel Frequency Cepstral Coefficients. The methodology was widely used in the prospect of recognition of voice automatically. The results demonstrate that this system effectively accurately transcribes user speech into text, simplifying the form-filling process significantly. By providing these results, we hope to demonstrate how this technology has the potential to revolutionize data entry and accessibility while also establishing a strong case for speech recognition as a convenient way to speed up form completion.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 79.99; Price excludes VAT (USA)

Softcover Book: USD 99.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Syiem, B., Dutta, S.K., Binong, J., Singh, L.J.: Comparison of Khasi speech representations with different spectral features and hidden Markov states. J. Electron. Sci. Technol. 19(2), 100079 (2021)
Article Google Scholar
Cui, X., Afify, M., Gao, Y., Zhou, B.: Stereo hidden Markov modeling for noise robust speech recognition. Comput. Speech Lang. 27(2), 407–419 (2013)
Article Google Scholar
Najkar, N., Razzazi, F., Sameti, H.: A novel approach to HMM-based speech recognition systems using particle swarm optimization. Math. Comput. Model. 52(11–12), 1910–1920 (2010)
Article Google Scholar
Siddiqi, M.H.: An improved Gaussian mixture hidden conditional random fields model for audio-based emotions classification. Egypt. Inform. J. 22(1), 45–51 (2021)
Article Google Scholar
Al-Anzi, F.S., AbuZeina, D.: Synopsis on Arabic speech recognition. Ain Shams Eng. J. 13(2), 101534 (2022)
Article Google Scholar
Gámiz, M.L., Limnios, N., del Carmen Segovia-García, M.: Hidden Markov models in reliability and maintenance. Eur. J. Oper. Res. 304(3), 1242–1255 (2023)
Article MathSciNet Google Scholar
Champion, C., Houghton, S.M.: Application of continuous state hidden Markov models to a classical problem in speech recognition. Comput. Speech Lang. 36, 347–364 (2016)
Article Google Scholar
Mouaz, B., Abderrahim, B.H., Abdelmajid, E.: Speech recognition of Moroccan dialect using hidden Markov models. Procedia Comput. Sci. 151, 985–991 (2019)
Article Google Scholar
Nedjah, N., Bonilla, A.D., de Macedo Mourelle, L.: Automatic speech recognition of Portuguese phonemes using neural networks ensemble. Expert Syst. Appl. 229, 120378 (2023)
Article Google Scholar
Lee, L.M., Jean, F.R.: Adaptation of hidden Markov models for recognizing speech of reduced frame rate. IEEE Trans. Cybern. 43(6), 2114–2121 (2013)
Article Google Scholar
Chen, Y., Zheng, H.: The application of HMM algorithm based music note feature recognition teaching in universities. Intell. Syst. Appl. 20, 200277 (2023)
Google Scholar
Mannepalli, K., Sastry, P.N., Suman, M.: MFCC-GMM based accent recognition system for Telugu speech signals. Int. J. Speech Technol. 19, 87–93 (2016)
Article Google Scholar
Chandrakala, S.: Investigation of DNN-HMM and lattice free maximum mutual information approaches for impaired speech recognition. IEEE Access 9, 168840–168849 (2021)
Article Google Scholar
Li, Q., Zhang, C., Woodland, P.C.: Combining hybrid DNN-HMM ASR systems with attention-based models using lattice rescoring. Speech Commun. 147, 12–21 (2023)
Article Google Scholar
Ma, Z., Zhang, J., Li, T., Yang, R., Wang, H.: A parameter transfer method for HMM-DNN heterogeneous model with the scarce mongolian data set. Procedia Comput. Sci. 187, 258–263 (2021)
Article Google Scholar
Das, T.K., Nahar, K.M.: A voice identification system using hidden Markov model. Indian J. Sci. Technol. 9(4), 1–6 (2016)
Article Google Scholar
Ranjan, A., Jegadeesan, K.: Hybrid ASR for resource-constrained robots: HMM-deep learning fusion. arXiv preprint arXiv:2309.07164 (2023)
Yadava, G.T., Nagaraja, B.G., Jayanna, H.S.: An end-to-end continuous Kannada ASR system under uncontrolled environment. Multimed. Tools Appl. 1–14 (2023)
Google Scholar
Trabelsi, A., Warichet, S., Aajaoun, Y., Soussilane, S.: Evaluation of the efficiency of state-of-the-art Speech Recognition engines. Procedia Comput. Sci. 207, 2242–2252 (2022)
Article Google Scholar
Jaradat, G.A., Alzubaidi, M.A., Otoom, M.: A novel human-vehicle interaction assistive device for Arab drivers using speech recognition. IEEE Access 10, 127514–127529 (2022)
Article Google Scholar
Speech recognition. Wikipedia (2023). https://en.wikipedia.org/wiki/Speech_recognition
Hidden Markov model. Wikipedia (2023). https://en.wikipedia.org/wiki/Hidden_Markov_model
Viterbi algorithm. Wikipedia (2023). https://en.wikipedia.org/wiki/Viterbi_algorithm
Brown, D.G., Golod, D.: Decoding HMMs using the k best paths: algorithms and applications. BMC Bioinform. 11(S1) (2010). https://doi.org/10.1186/1471-2105-11-s1-s28

Download references

Author information

Authors and Affiliations

Department of CSE, V R Siddhartha Engineering College, Vijayawada, India
Babu Sallagundla, Bharath Naik Kethavath, Shaik Arshad Hussain Mitaigiri, Siddartha Kata & Kodandaram Sri Satya Sai Merla

Authors

Babu Sallagundla
View author publications
You can also search for this author in PubMed Google Scholar
Bharath Naik Kethavath
View author publications
You can also search for this author in PubMed Google Scholar
Shaik Arshad Hussain Mitaigiri
View author publications
You can also search for this author in PubMed Google Scholar
Siddartha Kata
View author publications
You can also search for this author in PubMed Google Scholar
Kodandaram Sri Satya Sai Merla
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Bharath Naik Kethavath .

Editor information

Editors and Affiliations

SR University, Warangal, India
Deepak Garg
COPELABS, Lusófona University, Lisbon, Portugal
Joel J. P. C. Rodrigues
Bennett University, Greater Noida, India
Suneet Kumar Gupta
Swansea University, Wales, UK
Xiaochun Cheng
Lovely Professional University, Phagwara, India
Pushpender Sarao
SITCOE Engineering College, Ichalkaranji, India
Govind Singh Patel

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sallagundla, B., Kethavath, B.N., Mitaigiri, S.A.H., Kata, S., Merla, K.S.S.S. (2024). Voice Enabled Form Filling Using Hidden Markov Model. In: Garg, D., Rodrigues, J.J.P.C., Gupta, S.K., Cheng, X., Sarao, P., Patel, G.S. (eds) Advanced Computing. IACC 2023. Communications in Computer and Information Science, vol 2053. Springer, Cham. https://doi.org/10.1007/978-3-031-56700-1_18

Download citation

DOI: https://doi.org/10.1007/978-3-031-56700-1_18
Published: 26 March 2024
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-56699-8
Online ISBN: 978-3-031-56700-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics