Biologically inspired Continuous Arabic Speech Recognition

Hmad, N.; Allen, T.

doi:10.1007/978-1-4471-4739-8_20

N. Hmad³ &
T. Allen³

Included in the following conference series:

International Conference on Innovative Techniques and Applications of Artificial Intelligence

899 Accesses
3 Citations

Abstract

Despite many years of research into speech recognition systems, there are limited research publications available covering Arabic speech recognition. Although statistical techniques have been the most applied techniques for such classification problems, Neural Networks have also recorded successful results in speech recognition. In this research three different biologically inspired Continuous Arabic Speech Recognition neural network system structures are presented. An Arabic phoneme database (APD) of six male speakers was constructed manually from the King Abdulaziz Arabic Phonetics Database (KAPD). The Mel-Frequency Cepstrum Coefficients (MFCCs) algorithm was used to extract the phoneme features from the speech signals of this database. The normalized dataset was used to train and test three different architectures of Multilayer Perceptron (MLP) neural network identification systems.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Yuk D.: Robust speech recognition using neural networks and hidden markov modelsadaptations using non-linear transformations [dissertation]. New Jersey: The State University of New Jersey; 1999.
Google Scholar
Tursun, N., Silamu, W. In: Large vocabulary continuous speech recognition in uyghur: Data preparation and experimental results. Network computing and information security (NCIS), international conference; China. ; 2011. p. 197-200. N. Hmad and T. Allen
Google Scholar
Al-manie, M.A., Alkanhal, M.I., Al-ghamdi, M.M. In: Automatic speech segmentation using the arabic phonetic database. Proceedings of the 10th WSEAS international conference on AUTOMATION & INFORMATION; ; 2006. p. 76-9.
Google Scholar
Renals, S.: Radial Basis Function Network For Speech Pattern Classification. [Internet]. 1989;25(7):437; 439. Available from: internal-pdf://18-0638587909/18.pdf.
Google Scholar
Maheswari, N.U., Kabilan, A.P., Venkatesh R.: Speaker independent phoneme recognition using neural networks. Journal of Theoretical and Applied Information Technology (JATIT). 2009:230-5.
Google Scholar
Chen, W., Chen, S., Lin, C.: A speech recognition method based on the sequential multi-layer perceptrons. Neural Networks. 1996;9(4):655-69.
Article Google Scholar
Alghmadi, M.M.: KACST arabic phonetics database. Congress of Phonetics Sci. 2003;15:3109-12.
Google Scholar
Mosa, G.S., Ali, A.A.: Arabic phoneme recognition using hierarchical neural fuzzy petri net and LPC feature extraction. Signal Processing: An International Journal (SPIJ). 2009;3(5):161-71.
Google Scholar
Anwar, M.J., Awais, M.M., Masud, S., Shamail, S.: Automatic arabic speech segmentation system. International Journal of Information Technology. 2006;12(6):102-11.
Google Scholar
Waheed, K., Weaver, K., Salam, F.M.: A robust algorithm for detecting speech segments using an entropic contrast. In proc. of the IEEE Midwest Symposium on Circuits and Systems. Lida Ray Technologies Inc. 2002;45.
Google Scholar
Hong, L., Yanmin, Q., Jia, L.: English speech recognition system on chip. Tsinghua Science & Technology. 2011;16(1):95-9.
Article Google Scholar
Reynolds, T.J., Antoniou, C.A.: Experiments in speech recognition using a modular MLP architecture for acoustic modelling. Information Sciences. 2003;156(1-2):39-54.
Article Google Scholar
Jou, S., Schultz, T., Walliczek, M., Kraft, F., Waibel, A.: Towards continuous speech recognition using surface electromyography. Interspeech. 2006:573-6.
Google Scholar
Kirchhoff, K., Vergyri, D.: Cross-dialectal data sharing for acoustic modeling in arabic speech recognition. Speech Communication. 2005;46(1):37-51.
Article Google Scholar
Szczurowska, I., Kuniszyk-Jókowiak, W., Smoka, E.: The application of kohonen and multilayer perceptron networks in the speech nonfluecy analysis. Archives of Acoustics. 2006;31(4):205-10.
Google Scholar
Nakamura, M., Tsuda, K., Aoe, J.: A new approach to phoneme recognition by phoneme filter neural networks. Information Sciences. 1996;90(1-4):109-19.
Article Google Scholar
Koizumi, T., Mori, M., Taniguchi, S., Maruya, M. In: Recurrent neural networks for phoneme recognition. Department of information science, fourth international conference; 3- 6 Oct 1996; Fukui University, Fukui, Japan. Spoken Language, ICSLP 96; 1996. p. 326-9.
Google Scholar
Skowronski, M.D., Harris, J.G.: Automatic speech recognition using a predictive echo state network classifier. Neural Networks. 2007;20(3):414-23.
Article MATH Google Scholar
Ismail, S., Bin Ahmad, A.M. In: Recurrent neural network with backpropagation through time algorithm for arabic recognition. ; 2004.
Google Scholar
Bengio, Y.: Neural Networks For Speech And Sequence Recognition. first edition ed. International Thomson Computer Press; 1996.
Google Scholar
Sweeney, L., Thompson, P.: Speech perception using real-time phoneme detection: The BeBe system. Laboratory for Computer Science, Massachusetts Institute of Technology, Cambridge, MA 02139, USA; 1997.
Google Scholar

Download references

Author information

Authors and Affiliations

Nottingham Trent University, NG1 4BU, Nottingham, UK
N. Hmad & T. Allen

Authors

N. Hmad
View author publications
You can also search for this author in PubMed Google Scholar
T. Allen
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to N. Hmad .

Editor information

Editors and Affiliations

School of Computing, University of Portsmouth, Whitepost Lane The Lilacs, Portsmouth, PO1 3AH, Hampshire, United Kingdom
Max Bramer
School of Computing, Engineering & Mathe, University of Brighton, Lewes Road, Brighton, BN2 4GJ, West Sussex, United Kingdom
Miltos Petridis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Hmad, N., Allen, T. (2012). Biologically inspired Continuous Arabic Speech Recognition. In: Bramer, M., Petridis, M. (eds) Research and Development in Intelligent Systems XXIX. SGAI 2012. Springer, London. https://doi.org/10.1007/978-1-4471-4739-8_20

Download citation

DOI: https://doi.org/10.1007/978-1-4471-4739-8_20
Published: 09 October 2012
Publisher Name: Springer, London
Print ISBN: 978-1-4471-4738-1
Online ISBN: 978-1-4471-4739-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics