Abstract
In this paper, the effects on speech recognition performance by the speech coders are presented. We evaluate our Amazigh speech recognition system through wireless network based on a configurable platform that was created by combining both automatic speech recognition and IVR technologies. Different parameters are used such as VoIP audio codecs, hidden Markov models (HMMs) and Gaussian mixture models (GMMs). The system is trained and tested on ten first digits by collecting data from 24 speakers native of Tarifit. On the other hand, the VoIP codecs used in this work are G.711, GSM and Speex depending on the SIP protocol. Our results show that the best performance is 84.14% achieved by using the GSM audio codec.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Yu, D., Deng, L.: Automatic Speech Recognition. Springer London limited (2016)
Edan, N.M., Al-Sherbaz, A., Turner, S., Ajit, S.: Performance evaluation of QoS using SIP & IAX2 VVoIP protocols with CODECS. In: SAI Computing Conference (SAI), pp. 631–636. IEEE (2016)
Ansari, S., Gutta, R.: Evaluate performance of voice over LTE networks using voice codecs. Int. J. Sci. Eng. Technol. Res. 5(5) (2016)
Das, T.K., Nahar, K.M.: A voice identification system using hidden Markov model. Indian J. Sci. Technol. 9(4) (2016)
Satori, H., Elhaoussi, F.: Investigation Amazigh speech recognition using CMU tools. Int. J. Speech Technol. 17(3), 235–243 (2014)
Ahmad, J., Fiaz, M., Kwon, S.I., Sodanil, M., Vo, B., Baik, S.W.: Gender identification using MFCC for telephone applications—a comparative study (2016). arXiv: 1601.01577
Bhat, C., Mithun, B., Saxena, V., Kulkarni, V.Y., Kopparapu, S.K.: Deploying usable speech enabled IVR systems for mass use. In: International Conference on Human Computer Interactions (ICHCI), pp. 1–5 (2013)
Suciu, G., Vulpe, A., Arseni, S.C., Stancu, A., Butca, C., Suciu, V.: Monitoring a cloud-based speech processing system. In: 7th International Conference on Electronics, Computers and Artificial Intelligence (ECAI), pp. Y-23. IEEE (2015)
Lee, K. M., and Lai, J.: Speech versus touch: A comparitive study of the use of speech and dtmf keypad for navigation. Int. J. Hum. Comput. Interact. 19(3), 343–360 (2005)
Hamidi, M., Satori, H., Zealouk, O., Satori, K., Laaidi, N.: Interactive voice response server voice network administration using hidden Markov model speech recognition system. In: Second World (2018)
Hamidi, M., Satori, H., Zealouk, O., Satori, K., Laaidi, N.: Interactive administration service based on HMM speech recognition system. Int. J. Comput. Aided Eng. Technol. 16(2), 266–282 (2022)
Varshney, U., Snow, A., McGivern, M., Howard, C.: Voice over IP. Commun. ACM 45(1), 89–96 (2002)
Karapantazis, S., Pavlidou, F.N.: VoIP: a comprehensive survey on a promising technology. Comput. Netw. 53(12), 2050–2090 (2009)
Huang, X., Acero, A., Hon, H.W., Foreword By-Reddy, R.: Spoken Language Processing: A Guide to Theory, Algorithm, and System Development. Prentice Hall PTR (2001)
Outahajala, M., Zenkouar, L., Rosso, P.: Building an annotated corpus for Amazighe. In: Will Appear In Proceedings of 4th International Conference on Amazigh and ICT (2011)
Boukous, A.: Phonologie de L’amazighe. Institut Royal de la Culture Amazighe, Rabat (2009)
Satori, H., Zealouk, O., Satori, K., ElHaoussi, F.: Voice comparison between smokers and non-smokers using HMM speech recognition system. Int. J. Speech Technol. 20(4), 771–777 (2017)
Zealouk, O., Satori, H., Hamidi, M., Laaidi, N., Satori, K.: Vocal parameters analysis of smoker using Amazigh language. Int. J. Speech Technol. 21(1), 85–91 (2018)
Zealouk, O., Satori, H., Hamidi, M., Laaidi, N., Salek, A., Satori, K.: Analysis of COVID-19 resulting cough using formants and automatic speech recognition system. J. Voice (2021)
Hamidi, M., Satori, H., Zealouk, O., Laaidi, N.: Estimation of ASR parameterization for interactive system. Int. J. Nat. Comput. Res. (IJNCR) 10(1), 28–40 (2021)
Hamidi, M., Satori, H., Zealouk, O., Satori, K.: Speech coding effect on Amazigh alphabet speech recognition performance. J. Adv. Res. Dyn. Control Syst. 11(2), 1392–1400 (2019)
Lounnas, K., Abbas, M., Lichouri, M., Hamidi, M., Satori, H., Teffahi, H.: Enhancement of spoken digits recognition for under-resourced languages: case of Algerian and Moroccan dialects. Int. J. Speech Technol. 1–13 (2022)
Rabiner, L.R.: A tutorial on hidden Markov models and selected applications in speech recognition. Proc. IEEE 77(2), 257–286 (1989)
Muda, L., Begam, M., Elamvazuthi, I.: Voice recognition algorithms using mel frequency cepstral coefficient (MFCC) and dynamic time warping (DTW) techniques (2010). arXiv:1003.4083
Falk, T.H., Chan, W.Y.: Modulation spectral features for robust far-field speaker identification. IEEE Trans. Audio, Speech Lang. Process. 18(1), 90–100 (2010)
Wavesurfer. https://sourceforge.net/projects/wavesurfer/. Accessed July 2015
El Amrani, M.Y., Rahman, M.M.H., Wahiddin, M.R., Shah, A.: Building CMU Sphinx language model for the Holy Quran using simplifed Arabic phonemes. Egypt. Inf. J. 17, 305–314 (2016)
Abushariah, M.A.M., Ainon, R.N., Zainuddin, R., Alqudah, A.A.M., Elshafei, M.A., Khalifa, O.O.: Modern standard Arabic speech corpus for implementing and evaluating automatic continuous speech recognition systems. J. Franklin Inst. 349, 2215–2242 (2011)
Hyassat, H., Abu-Zitar, R.: Arabic speech recognition using SPHINX engine. Int. J. Speech Technol. 9, 133 (2006). https://doi.org/10.1007/s10772-008-9009-1
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Hamidi, M., Zealouk, O., Satori, H. (2023). Automatic Speech Recognition Analysis Over Wireless Networks. In: Bhateja, V., Yang, XS., Chun-Wei Lin, J., Das, R. (eds) Intelligent Data Engineering and Analytics. FICTA 2022. Smart Innovation, Systems and Technologies, vol 327. Springer, Singapore. https://doi.org/10.1007/978-981-19-7524-0_44
Download citation
DOI: https://doi.org/10.1007/978-981-19-7524-0_44
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-7523-3
Online ISBN: 978-981-19-7524-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)