Abstract
This paper focuses on a machine learning that learn the correct pronunciation Arabic phonemes. In this study, the researchers develop using convolutional neural network as feature extraction in order to enhance the performance of the model and Multi layer perceptron as the classifier to classify classes. Different parameters of CNN model are used in order to investigate the best parameter for the recognition purpose. The dataset have been recorded from experts using smartphone which consist of 880 recorded audios to train the model (210 for each class). The researchers have experimented the models to measure the accuracy and the cross entropy in the training process.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Wahidah, A., Suriazalmi, M., Niza, M.: Makhraj recognition using speech processing. In: 7th International Conference on Computing and Convergence Technology (ICCCT) 2012, pp. 689–693 (2012)
Leaman, O.: The Qur’an: An Encyclopedia. Routledge, Abingdon (2006). T. Qur, T. Qur, A. Ency-
Arshad, N.W., Aziz, S.N.A., Hamid, R., Karim, R.A., Naim, F., Zakaria, N.F.: Speech processing for makhraj recognition. In: International Conference on Electrical, Control and Computer Engineering 2011, pp. 323–327 (2011)
Sainath, T.N., Parada, C.: Convolutional neural networks for small-footprint keyword spotting. In: 16th Annual Conference of the International Speech Communication Association. INTERSPEECH 2015, pp. 1478–1482 (2015)
Wahidah, A., et al.: Makhraj recognition for Al-Quran recitation using MFCC. Int. J. Intell. Inf. Process. 4(2), 45–53 (2013)
Abdel-hamid, O., Jiang, H., Penn, G.: Applying convolutional neural networks concepts to hybrid NN-HMM model for speech recognition. In: Department of Computer Science and Engineering, York University, Toronto, Canada, pp. 4277–4280 (2012)
Khahriri, F.A, Ibrahim, Z., Rashidin, R., Ismail, N., Ahmad, A.: Malay dialect translator for android. In: Language Invention, Innovation and Design Exposition (LIID2017), UiTM Shah Alam (2017)
Mazlin, I., Nasruddin, Z.A., Hamzah, P., Abdul Aziz, M.: Musafir application development using mobile application development life cycle. In: 3rd International Conference on Innovation in Computer Science and Engineering 2019 (iCiCSE) (2019)
Tóth, L.: Combining time-and frequency-domain convolution in convolutional neural network-based phone recognition. In: 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 190–194. IEEE (2015)
Chen, G., Parada, C., Heigold, G.: Small-footprint keyword spotting using deep neural networks. In: Acoustics, Speech and Signal Processing, no. i, pp. 1–5, 2014
Chao, Q., Xiao-Guang, G., Da-Qing, C.: On distributed deep network for processing large-scale sets of complex data. In: 2016 8th International Conference on Intelligent Human-Machine Systems and Cybernetics, pp. 395–399 (2016)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Mazlin, I., Nasruddin, Z.A., Adnan, W.A.W., Razak, F.H.A. (2019). Arabic Phonemes Recognition Using Convolutional Neural Network. In: Berry, M., Yap, B., Mohamed, A., Köppen, M. (eds) Soft Computing in Data Science. SCDS 2019. Communications in Computer and Information Science, vol 1100. Springer, Singapore. https://doi.org/10.1007/978-981-15-0399-3_21
Download citation
DOI: https://doi.org/10.1007/978-981-15-0399-3_21
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-0398-6
Online ISBN: 978-981-15-0399-3
eBook Packages: Computer ScienceComputer Science (R0)