Abstract
Despite many years of research into speech recognition systems, there are limited research publications available covering Arabic speech recognition. Although statistical techniques have been the most applied techniques for such classification problems, Neural Networks have also recorded successful results in speech recognition. In this research three different biologically inspired Continuous Arabic Speech Recognition neural network system structures are presented. An Arabic phoneme database (APD) of six male speakers was constructed manually from the King Abdulaziz Arabic Phonetics Database (KAPD). The Mel-Frequency Cepstrum Coefficients (MFCCs) algorithm was used to extract the phoneme features from the speech signals of this database. The normalized dataset was used to train and test three different architectures of Multilayer Perceptron (MLP) neural network identification systems.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Yuk D.: Robust speech recognition using neural networks and hidden markov modelsadaptations using non-linear transformations [dissertation]. New Jersey: The State University of New Jersey; 1999.
Tursun, N., Silamu, W. In: Large vocabulary continuous speech recognition in uyghur: Data preparation and experimental results. Network computing and information security (NCIS), international conference; China. ; 2011. p. 197-200. N. Hmad and T. Allen
Al-manie, M.A., Alkanhal, M.I., Al-ghamdi, M.M. In: Automatic speech segmentation using the arabic phonetic database. Proceedings of the 10th WSEAS international conference on AUTOMATION & INFORMATION; ; 2006. p. 76-9.
Renals, S.: Radial Basis Function Network For Speech Pattern Classification. [Internet]. 1989;25(7):437; 439. Available from: internal-pdf://18-0638587909/18.pdf.
Maheswari, N.U., Kabilan, A.P., Venkatesh R.: Speaker independent phoneme recognition using neural networks. Journal of Theoretical and Applied Information Technology (JATIT). 2009:230-5.
Chen, W., Chen, S., Lin, C.: A speech recognition method based on the sequential multi-layer perceptrons. Neural Networks. 1996;9(4):655-69.
Alghmadi, M.M.: KACST arabic phonetics database. Congress of Phonetics Sci. 2003;15:3109-12.
Mosa, G.S., Ali, A.A.: Arabic phoneme recognition using hierarchical neural fuzzy petri net and LPC feature extraction. Signal Processing: An International Journal (SPIJ). 2009;3(5):161-71.
Anwar, M.J., Awais, M.M., Masud, S., Shamail, S.: Automatic arabic speech segmentation system. International Journal of Information Technology. 2006;12(6):102-11.
Waheed, K., Weaver, K., Salam, F.M.: A robust algorithm for detecting speech segments using an entropic contrast. In proc. of the IEEE Midwest Symposium on Circuits and Systems. Lida Ray Technologies Inc. 2002;45.
Hong, L., Yanmin, Q., Jia, L.: English speech recognition system on chip. Tsinghua Science & Technology. 2011;16(1):95-9.
Reynolds, T.J., Antoniou, C.A.: Experiments in speech recognition using a modular MLP architecture for acoustic modelling. Information Sciences. 2003;156(1-2):39-54.
Jou, S., Schultz, T., Walliczek, M., Kraft, F., Waibel, A.: Towards continuous speech recognition using surface electromyography. Interspeech. 2006:573-6.
Kirchhoff, K., Vergyri, D.: Cross-dialectal data sharing for acoustic modeling in arabic speech recognition. Speech Communication. 2005;46(1):37-51.
Szczurowska, I., Kuniszyk-Jókowiak, W., Smoka, E.: The application of kohonen and multilayer perceptron networks in the speech nonfluecy analysis. Archives of Acoustics. 2006;31(4):205-10.
Nakamura, M., Tsuda, K., Aoe, J.: A new approach to phoneme recognition by phoneme filter neural networks. Information Sciences. 1996;90(1-4):109-19.
Koizumi, T., Mori, M., Taniguchi, S., Maruya, M. In: Recurrent neural networks for phoneme recognition. Department of information science, fourth international conference; 3- 6 Oct 1996; Fukui University, Fukui, Japan. Spoken Language, ICSLP 96; 1996. p. 326-9.
Skowronski, M.D., Harris, J.G.: Automatic speech recognition using a predictive echo state network classifier. Neural Networks. 2007;20(3):414-23.
Ismail, S., Bin Ahmad, A.M. In: Recurrent neural network with backpropagation through time algorithm for arabic recognition. ; 2004.
Bengio, Y.: Neural Networks For Speech And Sequence Recognition. first edition ed. International Thomson Computer Press; 1996.
Sweeney, L., Thompson, P.: Speech perception using real-time phoneme detection: The BeBe system. Laboratory for Computer Science, Massachusetts Institute of Technology, Cambridge, MA 02139, USA; 1997.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag London
About this paper
Cite this paper
Hmad, N., Allen, T. (2012). Biologically inspired Continuous Arabic Speech Recognition. In: Bramer, M., Petridis, M. (eds) Research and Development in Intelligent Systems XXIX. SGAI 2012. Springer, London. https://doi.org/10.1007/978-1-4471-4739-8_20
Download citation
DOI: https://doi.org/10.1007/978-1-4471-4739-8_20
Published:
Publisher Name: Springer, London
Print ISBN: 978-1-4471-4738-1
Online ISBN: 978-1-4471-4739-8
eBook Packages: Computer ScienceComputer Science (R0)