Skip to main content

Voice Enabled Form Filling Using Hidden Markov Model

  • Conference paper
  • First Online:
Advanced Computing (IACC 2023)

Abstract

Speech Recognition technology is widely used for voice-enabled form filling. The manual process of filling out forms by typing has become increasingly challenging and time-consuming. This issue is particularly evident in various locations such as job applications and internships. To address this problem, a solution is proposed as a system that automates the form-filling process using speech recognition technology. The ability to operate anything with voice command is a crucial factor in today’s environment. The proposed system is that it automatically fills out the forms. i.e., the system analyses the user’s unique voice, identifies the user’s speech, and then transcribes the speech into text. This paper proposes a machine-learning model that builds on Hidden Markov Model. The model will be trained and tested on this system and the proposed pre-processed methodology is Mel Frequency Cepstral Coefficients. The methodology was widely used in the prospect of recognition of voice automatically. The results demonstrate that this system effectively accurately transcribes user speech into text, simplifying the form-filling process significantly. By providing these results, we hope to demonstrate how this technology has the potential to revolutionize data entry and accessibility while also establishing a strong case for speech recognition as a convenient way to speed up form completion.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 79.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 99.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Syiem, B., Dutta, S.K., Binong, J., Singh, L.J.: Comparison of Khasi speech representations with different spectral features and hidden Markov states. J. Electron. Sci. Technol. 19(2), 100079 (2021)

    Article  Google Scholar 

  2. Cui, X., Afify, M., Gao, Y., Zhou, B.: Stereo hidden Markov modeling for noise robust speech recognition. Comput. Speech Lang. 27(2), 407–419 (2013)

    Article  Google Scholar 

  3. Najkar, N., Razzazi, F., Sameti, H.: A novel approach to HMM-based speech recognition systems using particle swarm optimization. Math. Comput. Model. 52(11–12), 1910–1920 (2010)

    Article  Google Scholar 

  4. Siddiqi, M.H.: An improved Gaussian mixture hidden conditional random fields model for audio-based emotions classification. Egypt. Inform. J. 22(1), 45–51 (2021)

    Article  Google Scholar 

  5. Al-Anzi, F.S., AbuZeina, D.: Synopsis on Arabic speech recognition. Ain Shams Eng. J. 13(2), 101534 (2022)

    Article  Google Scholar 

  6. Gámiz, M.L., Limnios, N., del Carmen Segovia-García, M.: Hidden Markov models in reliability and maintenance. Eur. J. Oper. Res. 304(3), 1242–1255 (2023)

    Article  MathSciNet  Google Scholar 

  7. Champion, C., Houghton, S.M.: Application of continuous state hidden Markov models to a classical problem in speech recognition. Comput. Speech Lang. 36, 347–364 (2016)

    Article  Google Scholar 

  8. Mouaz, B., Abderrahim, B.H., Abdelmajid, E.: Speech recognition of Moroccan dialect using hidden Markov models. Procedia Comput. Sci. 151, 985–991 (2019)

    Article  Google Scholar 

  9. Nedjah, N., Bonilla, A.D., de Macedo Mourelle, L.: Automatic speech recognition of Portuguese phonemes using neural networks ensemble. Expert Syst. Appl. 229, 120378 (2023)

    Article  Google Scholar 

  10. Lee, L.M., Jean, F.R.: Adaptation of hidden Markov models for recognizing speech of reduced frame rate. IEEE Trans. Cybern. 43(6), 2114–2121 (2013)

    Article  Google Scholar 

  11. Chen, Y., Zheng, H.: The application of HMM algorithm based music note feature recognition teaching in universities. Intell. Syst. Appl. 20, 200277 (2023)

    Google Scholar 

  12. Mannepalli, K., Sastry, P.N., Suman, M.: MFCC-GMM based accent recognition system for Telugu speech signals. Int. J. Speech Technol. 19, 87–93 (2016)

    Article  Google Scholar 

  13. Chandrakala, S.: Investigation of DNN-HMM and lattice free maximum mutual information approaches for impaired speech recognition. IEEE Access 9, 168840–168849 (2021)

    Article  Google Scholar 

  14. Li, Q., Zhang, C., Woodland, P.C.: Combining hybrid DNN-HMM ASR systems with attention-based models using lattice rescoring. Speech Commun. 147, 12–21 (2023)

    Article  Google Scholar 

  15. Ma, Z., Zhang, J., Li, T., Yang, R., Wang, H.: A parameter transfer method for HMM-DNN heterogeneous model with the scarce mongolian data set. Procedia Comput. Sci. 187, 258–263 (2021)

    Article  Google Scholar 

  16. Das, T.K., Nahar, K.M.: A voice identification system using hidden Markov model. Indian J. Sci. Technol. 9(4), 1–6 (2016)

    Article  Google Scholar 

  17. Ranjan, A., Jegadeesan, K.: Hybrid ASR for resource-constrained robots: HMM-deep learning fusion. arXiv preprint arXiv:2309.07164 (2023)

  18. Yadava, G.T., Nagaraja, B.G., Jayanna, H.S.: An end-to-end continuous Kannada ASR system under uncontrolled environment. Multimed. Tools Appl. 1–14 (2023)

    Google Scholar 

  19. Trabelsi, A., Warichet, S., Aajaoun, Y., Soussilane, S.: Evaluation of the efficiency of state-of-the-art Speech Recognition engines. Procedia Comput. Sci. 207, 2242–2252 (2022)

    Article  Google Scholar 

  20. Jaradat, G.A., Alzubaidi, M.A., Otoom, M.: A novel human-vehicle interaction assistive device for Arab drivers using speech recognition. IEEE Access 10, 127514–127529 (2022)

    Article  Google Scholar 

  21. Speech recognition. Wikipedia (2023). https://en.wikipedia.org/wiki/Speech_recognition

  22. Hidden Markov model. Wikipedia (2023). https://en.wikipedia.org/wiki/Hidden_Markov_model

  23. Viterbi algorithm. Wikipedia (2023). https://en.wikipedia.org/wiki/Viterbi_algorithm

  24. Brown, D.G., Golod, D.: Decoding HMMs using the k best paths: algorithms and applications. BMC Bioinform. 11(S1) (2010). https://doi.org/10.1186/1471-2105-11-s1-s28

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Bharath Naik Kethavath .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Sallagundla, B., Kethavath, B.N., Mitaigiri, S.A.H., Kata, S., Merla, K.S.S.S. (2024). Voice Enabled Form Filling Using Hidden Markov Model. In: Garg, D., Rodrigues, J.J.P.C., Gupta, S.K., Cheng, X., Sarao, P., Patel, G.S. (eds) Advanced Computing. IACC 2023. Communications in Computer and Information Science, vol 2053. Springer, Cham. https://doi.org/10.1007/978-3-031-56700-1_18

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-56700-1_18

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-56699-8

  • Online ISBN: 978-3-031-56700-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics