Abstract
Speech Recognition technology is widely used for voice-enabled form filling. The manual process of filling out forms by typing has become increasingly challenging and time-consuming. This issue is particularly evident in various locations such as job applications and internships. To address this problem, a solution is proposed as a system that automates the form-filling process using speech recognition technology. The ability to operate anything with voice command is a crucial factor in today’s environment. The proposed system is that it automatically fills out the forms. i.e., the system analyses the user’s unique voice, identifies the user’s speech, and then transcribes the speech into text. This paper proposes a machine-learning model that builds on Hidden Markov Model. The model will be trained and tested on this system and the proposed pre-processed methodology is Mel Frequency Cepstral Coefficients. The methodology was widely used in the prospect of recognition of voice automatically. The results demonstrate that this system effectively accurately transcribes user speech into text, simplifying the form-filling process significantly. By providing these results, we hope to demonstrate how this technology has the potential to revolutionize data entry and accessibility while also establishing a strong case for speech recognition as a convenient way to speed up form completion.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Syiem, B., Dutta, S.K., Binong, J., Singh, L.J.: Comparison of Khasi speech representations with different spectral features and hidden Markov states. J. Electron. Sci. Technol. 19(2), 100079 (2021)
Cui, X., Afify, M., Gao, Y., Zhou, B.: Stereo hidden Markov modeling for noise robust speech recognition. Comput. Speech Lang. 27(2), 407–419 (2013)
Najkar, N., Razzazi, F., Sameti, H.: A novel approach to HMM-based speech recognition systems using particle swarm optimization. Math. Comput. Model. 52(11–12), 1910–1920 (2010)
Siddiqi, M.H.: An improved Gaussian mixture hidden conditional random fields model for audio-based emotions classification. Egypt. Inform. J. 22(1), 45–51 (2021)
Al-Anzi, F.S., AbuZeina, D.: Synopsis on Arabic speech recognition. Ain Shams Eng. J. 13(2), 101534 (2022)
Gámiz, M.L., Limnios, N., del Carmen Segovia-García, M.: Hidden Markov models in reliability and maintenance. Eur. J. Oper. Res. 304(3), 1242–1255 (2023)
Champion, C., Houghton, S.M.: Application of continuous state hidden Markov models to a classical problem in speech recognition. Comput. Speech Lang. 36, 347–364 (2016)
Mouaz, B., Abderrahim, B.H., Abdelmajid, E.: Speech recognition of Moroccan dialect using hidden Markov models. Procedia Comput. Sci. 151, 985–991 (2019)
Nedjah, N., Bonilla, A.D., de Macedo Mourelle, L.: Automatic speech recognition of Portuguese phonemes using neural networks ensemble. Expert Syst. Appl. 229, 120378 (2023)
Lee, L.M., Jean, F.R.: Adaptation of hidden Markov models for recognizing speech of reduced frame rate. IEEE Trans. Cybern. 43(6), 2114–2121 (2013)
Chen, Y., Zheng, H.: The application of HMM algorithm based music note feature recognition teaching in universities. Intell. Syst. Appl. 20, 200277 (2023)
Mannepalli, K., Sastry, P.N., Suman, M.: MFCC-GMM based accent recognition system for Telugu speech signals. Int. J. Speech Technol. 19, 87–93 (2016)
Chandrakala, S.: Investigation of DNN-HMM and lattice free maximum mutual information approaches for impaired speech recognition. IEEE Access 9, 168840–168849 (2021)
Li, Q., Zhang, C., Woodland, P.C.: Combining hybrid DNN-HMM ASR systems with attention-based models using lattice rescoring. Speech Commun. 147, 12–21 (2023)
Ma, Z., Zhang, J., Li, T., Yang, R., Wang, H.: A parameter transfer method for HMM-DNN heterogeneous model with the scarce mongolian data set. Procedia Comput. Sci. 187, 258–263 (2021)
Das, T.K., Nahar, K.M.: A voice identification system using hidden Markov model. Indian J. Sci. Technol. 9(4), 1–6 (2016)
Ranjan, A., Jegadeesan, K.: Hybrid ASR for resource-constrained robots: HMM-deep learning fusion. arXiv preprint arXiv:2309.07164 (2023)
Yadava, G.T., Nagaraja, B.G., Jayanna, H.S.: An end-to-end continuous Kannada ASR system under uncontrolled environment. Multimed. Tools Appl. 1–14 (2023)
Trabelsi, A., Warichet, S., Aajaoun, Y., Soussilane, S.: Evaluation of the efficiency of state-of-the-art Speech Recognition engines. Procedia Comput. Sci. 207, 2242–2252 (2022)
Jaradat, G.A., Alzubaidi, M.A., Otoom, M.: A novel human-vehicle interaction assistive device for Arab drivers using speech recognition. IEEE Access 10, 127514–127529 (2022)
Speech recognition. Wikipedia (2023). https://en.wikipedia.org/wiki/Speech_recognition
Hidden Markov model. Wikipedia (2023). https://en.wikipedia.org/wiki/Hidden_Markov_model
Viterbi algorithm. Wikipedia (2023). https://en.wikipedia.org/wiki/Viterbi_algorithm
Brown, D.G., Golod, D.: Decoding HMMs using the k best paths: algorithms and applications. BMC Bioinform. 11(S1) (2010). https://doi.org/10.1186/1471-2105-11-s1-s28
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Sallagundla, B., Kethavath, B.N., Mitaigiri, S.A.H., Kata, S., Merla, K.S.S.S. (2024). Voice Enabled Form Filling Using Hidden Markov Model. In: Garg, D., Rodrigues, J.J.P.C., Gupta, S.K., Cheng, X., Sarao, P., Patel, G.S. (eds) Advanced Computing. IACC 2023. Communications in Computer and Information Science, vol 2053. Springer, Cham. https://doi.org/10.1007/978-3-031-56700-1_18
Download citation
DOI: https://doi.org/10.1007/978-3-031-56700-1_18
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-56699-8
Online ISBN: 978-3-031-56700-1
eBook Packages: Computer ScienceComputer Science (R0)