Skip to main content

Arabic Speech Processing: State of the Art and Future Outlook

  • Conference paper
  • First Online:
Intelligent Decision Technologies

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 309))

  • 392 Accesses

Abstract

The aim of this article is to study the Arabic speech processing applications which have introduced the voice communication as a solution for specific situations (non-native speakers, speakers with voice disabilities, learners of Arabic vocabulary, speech recognition or speech synthesis). We present the principal applications of processing the Arabic spoken language accentuating the most challenges preventing obtaining better results. The current paper gives in detail the followed approaches and the applied techniques in the automatic processing applications of spoken Arabic, so it can be a reference study for researchers and developers who deal with this topic.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 299.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 379.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 379.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Srivastava, V., Singh, M.: Challenges and considerations with Code-Mixed NLP for Multilingual Societies. arXiv preprint arXiv:2106.07823 (2021)

  2. Ruder, S., Constant, N., Botha, J., Siddhant, A., Firat, O., Fu, J., Johnson, M.: XTREME-R: towards more challenging and nuanced multilingual evaluation. arXiv preprint arXiv:2104.07412 (2021)

  3. Li, X., Gong, H.: Demystify optimization challenges in multilingual transformers. arXiv preprint arXiv:2104.07639 (2021)

  4. Darwish, K., Habash, N., Abbas, M., Al-Khalifa, H., Al-Natsheh, H.T., Bouamor, H., Mubarak, H.: A panoramic survey of natural language processing in the Arab world. Commun. ACM 64(4), 72–81 (2021)

    Article  Google Scholar 

  5. Dressler, W.U., Mattiello, E., Ritt-Benmimoun, V.: Typological impact of morphological richness and priority of pragmatics over semantics in Italian, Arabic, German, and English diminutives.

    Google Scholar 

  6. Elfaik, H.: Combining context-aware embeddings and an attentional deep learning model for Arabic affect analysis on twitter. IEEE Access 9, 111214–111230 (2021)

    Article  Google Scholar 

  7. Kawar, K.: Morphology and syntax in Arabic-speaking adolescents who are deaf and hard of hearing. J. Speech Lang. Hear. Res. 1–16 (2021)

    Google Scholar 

  8. Abd, D.H., Khan, W., Thamer, K.A., Hussain, A.J.: Arabic light stemmer based on ISRI stemmer. In: International Conference on Intelligent Computing, pp. 32–45 (2021)

    Google Scholar 

  9. Arian, A., Rahimi Khoigani, M.: Investigating quranic ambiguity translation strategies in Persian and Chinese: lexical and grammatical ambiguity in focus. Linguist. Res. Holy Quran 10(1), 61–78 (2021)

    Google Scholar 

  10. Ezzini, S., Abualhaija, S., Arora, C., Sabetzadeh, M., Briand, L.C.: MAANA: an automated tool for DoMAin-specific HANdling of ambiguity. In: IEEE/ACM 43rd International Conference on Software Engineering, pp. 188–189 (2021)

    Google Scholar 

  11. Habash, N.: 13 Arabic dialect processing. In: Similar Languages, Varieties, and Dialects: A Computational Perspective, 279 (2021)

    Google Scholar 

  12. Ullah, A., Kui, Z., Ullah, S., Pinglu, C., Khan, S.: Sustainable utilization of financial and institutional resources in reducing income inequality and poverty. Sustainability 13(3), 1038 (2021)

    Article  Google Scholar 

  13. Guellil, I., Saâdane, H., Azouaou, F., Gueni, B., Nouvel, D.: Arabic natural language processing: an overview. J. King Saud University-Comput. Inf. Sci. 33(5), 497–507 (2021)

    Google Scholar 

  14. Guellil, I., Adeel, A., Azouaou, F., Benali, F., Hachani, A.E., Dashtipour, K., Hussain, A.: A semi-supervised approach for sentiment analysis of arab (ic/izi) messages: application to the Algerian dialect. SN Comput. Sci. 2(2), 1–18 (2021)

    Article  Google Scholar 

  15. Talafha, B., Abuammar, A., Al-Ayyoub, M.: ATAR: Attention-based LSTM for Arabizi transliteration. Int. J. Electr. Comput. Eng. (IJECE) 11(3), 2327–2334 (2021)

    Article  Google Scholar 

  16. Eryani, F., Habash, N.: Automatic romanization of arabic bibliographic records. In: 6th Arabic Natural Language Processing Workshop, pp. 213–218 (2021)

    Google Scholar 

  17. Ouisaadane, A., Safi, S.: A comparative study for Arabic speech recognition system in noisy environments. Int. J. Speech Technol. 1–10 (2021)

    Google Scholar 

  18. Al-Anzi, F.S., AbuZeina, D.: Synopsis on Arabic speech recognition. Ain Shams Eng. J. (2021)

    Google Scholar 

  19. Mittal, V., Sharma, R.K.: Deep Learning Approach for Voice Pathology Detection and Classification. Int. J. Healthcare Inf. Syst. Inform. (IJHISI) 16(4), 1–30 (2021)

    Google Scholar 

  20. Harder, B.: Speech language pathology, occupational therapy, and physical therapy student perspectives of an interprofessional education simulation (2021)

    Google Scholar 

  21. Yusof, N., Baharudin, H., Hamzah, M.I., Malek, N.I.A.: Fuzzy Delphi method application in the development of I-Aqran module for Arabic vocabulary consolidation. Ijaz Arabi J. Arabic Learn. 4(2) (2021)

    Google Scholar 

  22. Ali, Z., Saleh, M., Al-Maadeed, S., Abou Elsaud, S., Khalifa, B., AlJa’am, J.M., Massaro, D.: Understand my world: an interactive app for children learning Arabic vocabulary. In: IEEE Global Engineering Education Conference, pp. 1143–1148 (2021)

    Google Scholar 

  23. Farghaly, A., Shaalan, K.: Arabic natural language processing: challenges and solutions. ACM Trans. Asian Lang. Inf. Process. 8(4), 1–22 (2009)

    Article  Google Scholar 

  24. Habash, N.Y.: Introduction to Arabic natural language processing, vol. 3. Morgan & Claypool Publishers (2010)

    Google Scholar 

  25. Alsayadi, H.A., Abdelhamid, A.A., Hegazy, I., Fayed, Z.T.: Arabic speech recognition using end-to-end deep learning. IET Signal Process. (2021)

    Google Scholar 

  26. Zhang, J., Wang, B., Zhang, C., Xiao, Y., Wang, M.Y.: An EEG/EMG/EOG-based multimodal human-machine interface to real-time control of a soft robot hand. Front. Neurorobot. 13, 7 (2019)

    Article  Google Scholar 

  27. Friedrich, M., Peinecke, N., Geister, D.: Human machine interface aspects of the ground control station for unmanned air transport. In: Automated Low-Altitude Air Delivery, pp. 289–301 (2022)

    Google Scholar 

  28. Vacher, M., Lecouteux, B., Portet, F.: Recognition of voice commands by multisource ASR and noise cancellation in a smart home environment. In: 20th European Signal Processing Conference (EUSIPCO), pp. 1663–1667 (2012)

    Google Scholar 

  29. McLaughlin, N., Ming, J., Crookes, D.: Speaker recognition in noisy conditions with limited training data. In: 19th European Signal Processing Conference, pp. 1294–1298 (2011)

    Google Scholar 

  30. Biagetti, G., Crippa, P., Falaschetti, L., Orcioni, S., Turchetti, C.: Speaker identification in noisy conditions using short sequences of speech frames. In: International Conference on Intelligent Decision Technologies, pp. 43–52 (2017)

    Google Scholar 

  31. Ming, J., Hazen, T.J., Glass, J.R., Reynolds, D.A.: Robust speaker recognition in noisy conditions. IEEE Trans. Audio Speech Lang. Process. 15(5), 1711–1723 (2007)

    Article  Google Scholar 

  32. Biagetti, G., Crippa, P., Curzi, A., Orcioni, S., Turchetti, C.: Speaker identification with short sequences of speech frames. ICPRAM (2), pp. 178–185 2015

    Google Scholar 

  33. Deshpande, M.S., Holambe, R.S.: Speaker identification based on robust AM-FM features. In: 2nd International Conference on Emerging Trends in Engineering & Technology, pp. 880–884 (2009)

    Google Scholar 

  34. Ali, A.H., Magdy, M., Alfawzy, M., Ghaly, M., Abbas, H.: Arabic speech synthesis using deep neural networks. In: International Conference on Communications, Signal Processing, and their Applications (ICCSPA), pp. 1–6. IEEE (2021)

    Google Scholar 

  35. Mutawa, A.M.: Machine learning for Arabic text to speech synthesis: a Tacotron approach (2021)

    Google Scholar 

  36. Bettayeb, N., Guerti, M.: Speech synthesis system for the holy quran recitation. Int. Arab J. Inf. Technol. 18(1), 8–15 (2021)

    Google Scholar 

  37. El-Dakhs, D.A.S., Ahmed, M.M.: A variational pragmatic analysis of the speech act of complaint focusing on Alexandrian and Najdi Arabic. J. Pragmat. 181, 120–138 (2021)

    Article  Google Scholar 

  38. Shaalan, K., Talhami, H.: Error analysis and handling in Arabic icall systems. In: Artificial Intelligence and Applications (2006). Citeseer, pp. 109–114

    Google Scholar 

  39. Shaalan, K.F.: An intelligent computer assisted language learning system for Arabic learners. Comput. Assist. Lang. Learn. 18(1–2), 81–109 (2005)

    Article  Google Scholar 

  40. Meftouh, K., Harrat, S., Jamoussi, S., Abbas, M., Smaili, K.: Machine translation experiments on padic: a parallel Arabic dialect corpus. In: Pacific Asia Conference on Language, Information and Computation (2015)

    Google Scholar 

  41. Terbeh, N., Zrigui, M.: Vers la correction automatique de la Parole Arabe. Citala 2014 (2014)

    Google Scholar 

  42. Maraoui, M., Terbeh, N., Zrigui, M.: Arabic discourse analysis based on acoustic, prosodic and phonetic modeling: elocution evaluation, speech classification and pathological speech correction. Int. J. Speech Technol. 1071–1090 (2018)

    Google Scholar 

  43. Terbeh, N., Zrigui, M.: Vocal pathologies detection and mispronounced phonemes identification: case of Arabic continuous speech. In: 10th International Conference on Language Resources and Evaluation (LREC’16), pp. 2108–2113 (2016)

    Google Scholar 

  44. Terbeh, N., Zrigui, M.: Identification of pronunciation defects in spoken Arabic language. In: International Conference of the Pacific Association for Computational Linguistics, pp. 355–365 (2017)

    Google Scholar 

  45. Terbeh, N., Zrigui, M.: A novel approach to identify factor posing pronunciation disorders. In: International Conference on Computational Collective Intelligence, pp. 153–162 (2016)

    Google Scholar 

  46. Terbeh, N., Trigui, A., Maraoui, M., Zrigui, M.: Arabic speech analysis to identify factors posing pronunciation disorders and to assist learners with vocal disabilities. In: 2016 International Conference on Engineering & MIS (ICEMIS), pp. 1–8 (2016)

    Google Scholar 

  47. Terbeh, N., Trigui, A., Maraoui, M., Zrigui, M.: Correction of pathological speeches and assistance to learners with vocal disabilities. Multimedia Tools Appl. 77(14), 17779–17802 (2018)

    Article  Google Scholar 

  48. Terbeh, N., Labidi, M., Zrigui, M.: Automatic speech correction: A step to speech recognition for people with disabilities. In: Fourth International Conference on Information and Communication Technology and Accessibility (ICTA), pp. 1–6 (2013)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Naim Terbeh .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Terbeh, N., Teyeb, R., Zrigui, M. (2022). Arabic Speech Processing: State of the Art and Future Outlook. In: Czarnowski, I., Howlett, R.J., Jain, L.C. (eds) Intelligent Decision Technologies. Smart Innovation, Systems and Technologies, vol 309. Springer, Singapore. https://doi.org/10.1007/978-981-19-3444-5_5

Download citation

Publish with us

Policies and ethics