Skip to main content

Arabic Speech Recognition Systems

  • Chapter
  • First Online:
Cross-Word Modeling for Arabic Speech Recognition

Part of the book series: SpringerBriefs in Electrical and Computer Engineering ((BRIEFSSPEECHTECH))

Abstract

This chapter presents a brief overview of the evolution of Arabic speech recognition systems. It provides a literature survey of Arabic speech recognition systems and discusses some of the challenges of Arabic from the speech recognition point of view.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  • Abdou SM, Hamid SE, Rashwan M, Samir A, Abd-Elhamid O, Shahin M, Naz W (2006) Computer aided pronunciation learning system using speech recognition techniques, NTERSPEECH 2006, ICSLP, pp 249–252

    Google Scholar 

  • Abushariah MAM, Ainon RN et al (2010) Natural speaker-independent Arabic speech recognition system based on Hidden Markov Models using Sphinx tools. 2010 international conference on computer and communication engineering (ICCCE)

    Google Scholar 

  • Afify M, Nguyen L, Xiang B, Abdou S, Makhoul J. Recent progress in Arabic broadcast news transcription at BBN. In: Proceedings of INTERSPEECH. 2005, pp 1637–1640

    Google Scholar 

  • Algamdi M (2003) KACST Arabic phonetics database. The fifteenth international congress of phonetics science, Barcelona, pp 3109–3112

    Google Scholar 

  • Alghamdi M (2000) Arabic phonetics. Attaoobah, Riyadh

    Google Scholar 

  • Alghamdi M, Elshafei M, Almuhtasib H (2002) Speech units for Arabic text-to-speech. The fourth workshop on computer and information sciences, pp 199–212

    Google Scholar 

  • Alghamdi M, Elshafei M, Almuhtasib H (2009) Arabic broadcast news transcription system. Int J Speech Tech 10:183–195

    Article  Google Scholar 

  • Al-Ghamdi M, Elshafei M, Al-Muhtaseb H (2003) An experimental Arabic text-to-speech system. Final report, King Abudaziz City of Science and Technology

    Google Scholar 

  • Alimi AM, Ben Jemaa M (2002) Beta fuzzy neural network application in recognition of spoken isolated Arabic words. Int J Contr Intell Syst 30(2), Special issue on speech processing techniques and applications

    Google Scholar 

  • Alotaibi YA (2004) Spoken Arabic digits recognizer using recurrent neural networks. In: Proceedings of the fourth IEEE international symposium on signal processing and information technology, pp 195–199

    Google Scholar 

  • Al-Otaibi F (2001) speaker-dependant continuous Arabic speech recognition. M.Sc. thesis, King Saud University

    Google Scholar 

  • Alotaibi Y, Selouani S, O’Shaughnessy D (2008) Experiments on automatic recognition of nonnative Arabic speech. EURASIP J Audio Speech Music Process: 9 pages. doi:10.1155/2008/679831, Article ID 679831

  • Azmi M, Tolba H, Mahdy S, Fashal M (2008) Syllable-based automatic Arabic speech recognition in noisy-telephone channel. In: WSEAS transactions on signal processing proceedings, World Scientific and Engineering Academy and Society (WSEAS), vol 4, issue 4, pp 211–220

    Google Scholar 

  • Bahi H, Sellami M (2001) Combination of vector quantization and hidden Markov models for Arabic speech recognition. ACS/IEEE international conference on computer systems and applications, 2001

    Google Scholar 

  • Bahi H, Sellami M (2003) A hybrid approach for Arabic speech recognition. ACS/IEEE international conference on computer systems and applications, 14–18 July 2003

    Google Scholar 

  • Billa J, Noamany M et al (2002) Audio indexing of Arabic broadcast news. 2002 IEEE international conference on acoustics, speech, and signal processing (ICASSP)

    Google Scholar 

  • Bourouba H, Djemili R et al (2006) New hybrid system (supervised classifier/HMM) for isolated Arabic speech recognition. 2nd Information and Communication Technologies, 2006. ICTTA’06

    Google Scholar 

  • Choi F, Tsakalidis S et al (2008) Recent improvements in BBN’s English/Iraqi speech-to-speech translation system. IEEE Spoken language technology workshop, 2008. SLT 2008

    Google Scholar 

  • Choueiter G, Povey D et al (2006) Morpheme-based language modeling for Arabic LVCSR. 2006 IEEE international conference on acoustics, speech and signal processing. ICASSP 2006 proceedings

    Google Scholar 

  • Elmahdy M, Gruhn R et al (2009) Modern standard Arabic based multilingual approach for dialectal Arabic speech recognition. In: Eighth international symposium on natural language processing, 2009. SNLP’09

    Google Scholar 

  • Elmisery FA, Khalil AH et al (2003) A FPGA-based HMM for a discrete Arabic speech recognition system. In: Proceedings of the 15th international conference on microelectronics, 2003. ICM 2003

    Google Scholar 

  • El-Ramly SH, Abdel-Kader NS, El-Adawi R (2002) Neural networks used for speech recognition. In: Proceedings of the nineteenth national radio science conference (NRSC 2002), March 2002, pp 200–207

    Google Scholar 

  • Elshafei MA (1991) Toward an Arabic text-to-speech system. Arab J Sci Eng 16(4B):565–583

    MathSciNet  Google Scholar 

  • Elshafei M, Almuhtasib H, Alghamdi M (2002) Techniques for high quality text-to-speech. Inform Sci 140(3–4):255–267

    Article  MATH  Google Scholar 

  • Elshafei M, Al-Muhtaseb H, Alghamdi M (2006) Statistical methods for automatic diacritization of Arabic text. In: Proceedings of 18th national computer conference NCC’18, Riyadh, March 26–29, 2006

    Google Scholar 

  • Elshafei M, Ali M, Al-Muhtaseb H, Al-Ghamdi M (2007) Automatic segmentation of Arabic speech. Workshop on information technology and Islamic sciences, Imam Mohammad Ben Saud University, Riyadh, March 2007

    Google Scholar 

  • Emami A, Mangu L (2007) Empirical study of neural network language models for Arabic speech recognition. IEEE workshop on automatic speech recognition and understanding, 2007. ASRU

    Google Scholar 

  • Essa EM, Tolba AS et al (2008) A comparison of combined classifier architectures for Arabic speech recognition. International conference on computer engineering and systems, 2008. ICCES 2008

    Google Scholar 

  • Farghaly A, Shaalan K (2009) Arabic natural language processing: challenges and solutions. ACM Trans Asian Lang Inform Process 8(4):1–22

    Article  Google Scholar 

  • Gales MJF, Diehl F et al (2007) Development of a phonetic system for large vocabulary Arabic speech recognition. IEEE workshop on automatic speech recognition and understanding, 2007. ASRU

    Google Scholar 

  • Hyassat H, Abu Zitar R (2008) Arabic speech recognition using SPHINX engine. Int J Speech Tech 9(3–4):133–150

    Google Scholar 

  • Imai T, Ando A et al (1995) A new method for automatic generation of speaker-dependent phonological rules. 1995 international conference on acoustics, speech, and signal processing, 1995. ICASSP-95

    Google Scholar 

  • Khasawneh M, Assaleh K et al (2004) The application of polynomial discriminant function classifiers to isolated Arabic speech recognition. In: Proceedings of the IEEE international joint conference on neural networks, 2004

    Google Scholar 

  • Kirchhofl K, Bilmes J, Das S, Duta N, Egan M, Ji G, He F, Henderson J, Liu D, Noamany M, Schoner P, Schwartz R, Vergyri D (2003) Novel approaches to Arabic speech recognition: report from the 2002 John-Hopkins summer workshop, ICASSP 2003, pp I344–I347

    Google Scholar 

  • Kuo HJ, Mangu L et al (2010) Morphological and syntactic features for Arabic speech recognition. 2010 IEEE international conference on acoustics speech and signal processing (ICASSP)

    Google Scholar 

  • Lamel L, Messaoudi A et al (2009) Automatic speech-to-text transcription in Arabic. ACM Trans Asian Lang Inform Process 8(4):1–18

    Article  Google Scholar 

  • Messaoudi A, Gauvain JL et al (2006) Arabic broadcast news transcription using a one million word vocalized vocabulary. 2006 IEEE international conference on acoustics, speech and signal processing, 2006. ICASSP 2006 proceedings

    Google Scholar 

  • Mokhtar MA, El-Abddin AZ (1996) A model for the acoustic phonetic structure of Arabic language using a single ergodic hidden Markov model. In: Proceedings of the fourth international conference on spoken language, 1996. ICSLP 96

    Google Scholar 

  • Muhammad G, AlMalki K et al (2011) Automatic Arabic digit speech recognition and formant analysis for voicing disordered people. 2011 IEEE symposium on computers and informatics (ISCI)

    Google Scholar 

  • Nofal M, Abdel Reheem E et al (2004) The development of acoustic models for command and control Arabic speech recognition system. 2004 international conference on electrical, electronic and computer engineering, 2004. ICEEC’04

    Google Scholar 

  • Park J, Diehl F et al (2009) Training and adapting MLP features for Arabic speech recognition. IEEE international conference on acoustics, speech and signal processing, 2009. ICASSP 2009

    Google Scholar 

  • Rambow O et al (2006) Parsing Arabic dialects, final report version 1, Johns Hopkins summer workshop 2005

    Google Scholar 

  • Sagheer A, Tsuruta N et al (2005) Hyper column model vs. fast DCT for feature extraction in visual Arabic speech recognition. In: Proceedings of the fifth IEEE international symposium on signal processing and information technology, 2005

    Google Scholar 

  • Saon G, Soltau H et al (2010) The IBM 2008 GALE Arabic speech transcription system. 2010 IEEE international conference on acoustics speech and signal processing (ICASSP)

    Google Scholar 

  • Satori H, Harti M, Chenfour N (2007) Introduction to Arabic speech recognition using CMU Sphinx system. Information and communication technologies international symposium proceeding ICTIS07, 2007

    Google Scholar 

  • Selouani S-A, Alotaibi YA (2011) Adaptation of foreign accented speakers in native Arabic ASR systems. Appl Comput Informat 9(1):1–10

    Article  Google Scholar 

  • Shoaib M, Rasheed F, Akhtar J, Awais M, Masud S, Shamail S (2003) A novel approach to increase the robustness of speaker independent Arabic speech recognition. 7th international multi topic conference, 2003. INMIC 2003. 8–9 Dec 2003, pp 371–376

    Google Scholar 

  • Soltau H, Saon G et al (2007) The IBM 2006 Gale Arabic ASR system. IEEE international conference on acoustics, speech and signal processing, 2007. ICASSP 2007

    Google Scholar 

  • Taha M, Helmy T et al (2007) Multi-agent based Arabic speech recognition. 2007 IEEE/WIC/ACM international conferences on web intelligence and intelligent agent technology workshops

    Google Scholar 

  • Vergyri D, Kirchhoff K, Duh K, Stolcke A (2004) Morphology-based language modeling for Arabic speech recognition. International conference on speech and language processing. Jeju Island, pp 1252–1255

    Google Scholar 

  • Xiang B, Nguyen K, Nguyen L, Schwartz R, Makhoul J (2006) Morphological ecomposition for Arabic broadcast news transcription. In: Proceedings of ICASSP, vol I. Toulouse, pp 1089–1092

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Dia AbuZeina .

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Dia AbuZeina

About this chapter

Cite this chapter

AbuZeina, D., Elshafei, M. (2012). Arabic Speech Recognition Systems. In: Cross-Word Modeling for Arabic Speech Recognition. SpringerBriefs in Electrical and Computer Engineering(). Springer, Boston, MA. https://doi.org/10.1007/978-1-4614-1213-7_2

Download citation

  • DOI: https://doi.org/10.1007/978-1-4614-1213-7_2

  • Published:

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-1-4614-1212-0

  • Online ISBN: 978-1-4614-1213-7

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics