Automatic Speech Recognition of Continuous Speech Signal of Gujarati Language Using Machine Learning

Pandit, Purnima; Makwana, Priyank; Bhatt, Shardav

doi:10.1007/978-981-15-9953-8_13

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 1287)

Abstract

In this work, we perform automatic recognition of continuous speech spoken in the Gujarati language using a machine learning (ML) technique. We first extract the individual words from the continuous speech signal of a sentence using the short-term autocorrelation (STAC) method. Because the signal for each word is large, we reduce its dimension with a feature extraction algorithm, mel-frequency discrete wavelet coefficients (MFDWC). These features are then used to train an ML algorithm for speech recognition.
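
The word-extraction step can be illustrated with a minimal sketch. The abstract does not give the details of the STAC procedure, so everything below is an assumption made for illustration: a frame is treated as speech when the peak of its normalized short-term autocorrelation (over lags in a typical pitch range) exceeds a threshold, and consecutive speech frames are grouped into one word. The function names, the 30 ms frame, the 10 ms hop, the 0.5 threshold and the synthetic test signal are all hypothetical choices, not values taken from the chapter.

```python
import numpy as np

def stac_scores(signal, frame_len, hop, min_lag, max_lag):
    """Peak of the normalized short-term autocorrelation for each frame."""
    n_frames = 1 + (len(signal) - frame_len) // hop
    scores = np.zeros(n_frames)
    for i in range(n_frames):
        frame = signal[i * hop : i * hop + frame_len]
        frame = frame - frame.mean()
        energy = np.dot(frame, frame)  # autocorrelation at lag 0
        if energy < 1e-8:
            continue  # effectively silent frame, score stays 0
        ac = [np.dot(frame[:-lag], frame[lag:])
              for lag in range(min_lag, max_lag + 1)]
        scores[i] = max(ac) / energy
    return scores

def extract_words(signal, fs, frame_ms=30, hop_ms=10,
                  threshold=0.5, min_gap_ms=100):
    """Return (start, end) sample indices of detected word segments."""
    frame_len = fs * frame_ms // 1000
    hop = fs * hop_ms // 1000
    # Assumed pitch range of 80-400 Hz sets the lag search window.
    scores = stac_scores(signal, frame_len, hop, fs // 400, fs // 80)
    active = scores > threshold

    # Group runs of consecutive active frames into segments.
    segments, start = [], None
    for i, is_active in enumerate(active):
        if is_active and start is None:
            start = i
        elif not is_active and start is not None:
            segments.append([start * hop, (i - 1) * hop + frame_len])
            start = None
    if start is not None:
        segments.append([start * hop, len(signal)])

    # Merge segments separated by a pause shorter than min_gap_ms.
    merged = []
    for seg in segments:
        if merged and seg[0] - merged[-1][1] < fs * min_gap_ms / 1000:
            merged[-1][1] = seg[1]
        else:
            merged.append(seg)
    return [(s, e) for s, e in merged]

# Synthetic stand-in for a sentence: two 200 Hz tone bursts ("words")
# separated by low-level noise ("pauses"). Not real speech data.
fs = 8000
rng = np.random.default_rng(0)
pause = lambda: 0.01 * rng.standard_normal(1600)
tone = 0.5 * np.sin(2 * np.pi * 200 * np.arange(2400) / fs)
signal = np.concatenate([pause(), tone, pause(), tone, pause()])

words = extract_words(signal, fs)
print(len(words), words)
```

On real recordings the threshold and minimum pause length would need tuning, and unvoiced consonants at word boundaries may be clipped by a purely autocorrelation-based detector. Each extracted segment would then be passed to the feature extraction stage.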

Author information

Authors and Affiliations

Department of Applied Mathematics, Faculty of Technology and Engineering, The Maharaja Sayajirao University of Baroda, Vadodara, 390001, Gujarat, India
Purnima Pandit
Applied Science and Humanities Department, Parul Institute of Engineering and Technology (Diploma Studies), Parul University, Limda–Waghodia, Vadodara, 391760, Gujarat, India
Priyank Makwana
School of Engineering and Technology, Navrachana University, Vasna-Bhayli Road, Vadodara, 391410, Gujarat, India
Shardav Bhatt

Corresponding author

Correspondence to Priyank Makwana.

Editor information

Editors and Affiliations

Department of Mathematics, School of Technology, Pandit Deendayal Petroleum University, Gandhinagar, India
Manoj Sahni
University of Technology Sydney, Sydney, NSW, Australia
José M. Merigó
Department of Mathematics, School of Technology, Pandit Deendayal Petroleum University, Gandhinagar, India
Brajesh Kumar Jha
Department of Management Control and Information Systems, University of Chile, Santiago, Chile
Rajkumar Verma

About this paper

Cite this paper

Pandit, P., Makwana, P., Bhatt, S. (2021). Automatic Speech Recognition of Continuous Speech Signal of Gujarati Language Using Machine Learning. In: Sahni, M., Merigó, J.M., Jha, B.K., Verma, R. (eds) Mathematical Modeling, Computational Intelligence Techniques and Renewable Energy. Advances in Intelligent Systems and Computing, vol 1287. Springer, Singapore. https://doi.org/10.1007/978-981-15-9953-8_13

DOI: https://doi.org/10.1007/978-981-15-9953-8_13
Published: 28 February 2021
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-9952-1
Online ISBN: 978-981-15-9953-8
eBook Packages: Intelligent Technologies and Robotics (R0)
