Arabic Phonemes Recognition Using Convolutional Neural Network

Mazlin, Irwan; Nasruddin, Zan Azma; Adnan, Wan Adilah Wan; Razak, Fariza Hanis Abdul

doi:10.1007/978-981-15-0399-3_21

Irwan Mazlin¹¹,
Zan Azma Nasruddin¹¹,
Wan Adilah Wan Adnan¹¹ &
…
Fariza Hanis Abdul Razak¹¹

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1100))

Included in the following conference series:

International Conference on Soft Computing in Data Science

692 Accesses
4 Citations

Abstract

This paper focuses on a machine learning that learn the correct pronunciation Arabic phonemes. In this study, the researchers develop using convolutional neural network as feature extraction in order to enhance the performance of the model and Multi layer perceptron as the classifier to classify classes. Different parameters of CNN model are used in order to investigate the best parameter for the recognition purpose. The dataset have been recorded from experts using smartphone which consist of 880 recorded audios to train the model (210 for each class). The researchers have experimented the models to measure the accuracy and the cross entropy in the training process.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Wahidah, A., Suriazalmi, M., Niza, M.: Makhraj recognition using speech processing. In: 7th International Conference on Computing and Convergence Technology (ICCCT) 2012, pp. 689–693 (2012)
Google Scholar
Leaman, O.: The Qur’an: An Encyclopedia. Routledge, Abingdon (2006). T. Qur, T. Qur, A. Ency-
Book Google Scholar
Arshad, N.W., Aziz, S.N.A., Hamid, R., Karim, R.A., Naim, F., Zakaria, N.F.: Speech processing for makhraj recognition. In: International Conference on Electrical, Control and Computer Engineering 2011, pp. 323–327 (2011)
Google Scholar
Sainath, T.N., Parada, C.: Convolutional neural networks for small-footprint keyword spotting. In: 16th Annual Conference of the International Speech Communication Association. INTERSPEECH 2015, pp. 1478–1482 (2015)
Google Scholar
Wahidah, A., et al.: Makhraj recognition for Al-Quran recitation using MFCC. Int. J. Intell. Inf. Process. 4(2), 45–53 (2013)
Google Scholar
Abdel-hamid, O., Jiang, H., Penn, G.: Applying convolutional neural networks concepts to hybrid NN-HMM model for speech recognition. In: Department of Computer Science and Engineering, York University, Toronto, Canada, pp. 4277–4280 (2012)
Google Scholar
Khahriri, F.A, Ibrahim, Z., Rashidin, R., Ismail, N., Ahmad, A.: Malay dialect translator for android. In: Language Invention, Innovation and Design Exposition (LIID2017), UiTM Shah Alam (2017)
Google Scholar
Mazlin, I., Nasruddin, Z.A., Hamzah, P., Abdul Aziz, M.: Musafir application development using mobile application development life cycle. In: 3rd International Conference on Innovation in Computer Science and Engineering 2019 (iCiCSE) (2019)
Google Scholar
Tóth, L.: Combining time-and frequency-domain convolution in convolutional neural network-based phone recognition. In: 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 190–194. IEEE (2015)
Google Scholar
Chen, G., Parada, C., Heigold, G.: Small-footprint keyword spotting using deep neural networks. In: Acoustics, Speech and Signal Processing, no. i, pp. 1–5, 2014
Google Scholar
Chao, Q., Xiao-Guang, G., Da-Qing, C.: On distributed deep network for processing large-scale sets of complex data. In: 2016 8th International Conference on Intelligent Human-Machine Systems and Cybernetics, pp. 395–399 (2016)
Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of Computer and Mathematical Sciences, Universiti Teknologi MARA, 40450, Shah Alam, Selangor Darul Ehsan, Malaysia
Irwan Mazlin, Zan Azma Nasruddin, Wan Adilah Wan Adnan & Fariza Hanis Abdul Razak

Authors

Irwan Mazlin
View author publications
You can also search for this author in PubMed Google Scholar
Zan Azma Nasruddin
View author publications
You can also search for this author in PubMed Google Scholar
Wan Adilah Wan Adnan
View author publications
You can also search for this author in PubMed Google Scholar
Fariza Hanis Abdul Razak
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Zan Azma Nasruddin .

Editor information

Editors and Affiliations

University of Tennessee, Knoxville, TN, USA
Michael W. Berry
Universiti Teknologi MARA, Shah Alam, Selangor, Malaysia
Bee Wah Yap
Universiti Teknologi MARA, Shah Alam, Selangor, Malaysia
Azlinah Mohamed
Kyushu Institute of Technology, Fukuoka, Japan
Mario Köppen

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Mazlin, I., Nasruddin, Z.A., Adnan, W.A.W., Razak, F.H.A. (2019). Arabic Phonemes Recognition Using Convolutional Neural Network. In: Berry, M., Yap, B., Mohamed, A., Köppen, M. (eds) Soft Computing in Data Science. SCDS 2019. Communications in Computer and Information Science, vol 1100. Springer, Singapore. https://doi.org/10.1007/978-981-15-0399-3_21

Download citation

DOI: https://doi.org/10.1007/978-981-15-0399-3_21
Published: 24 September 2019
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-0398-6
Online ISBN: 978-981-15-0399-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics