Skip to main content

Toward an Automatic Fongbe Speech Recognition System: Hierarchical Mixtures of Algorithms for Phoneme Recognition

  • Chapter
  • First Online:
Informatics in Control, Automation and Robotics

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 430))

  • 947 Accesses

Abstract

In this paper, we have demonstrated the efficacy of an automatic continuous speech recognition system by mixing fuzzy and neuronal approaches and an acoustic analysis of the sounds of an under-resourced language. The system we propose integrates the modules such as extraction module, segmentation and phoneme recognition modules and whose the core is based on the phoneme detection in continuous speech. This work offers a complete recipe of algorithms to perform hierarchically the following tasks: speech segmentation - phoneme classification - phoneme recognition. The segmentation task provides as output phoneme segment which are subsequently classified according to their nature (consonant or vowel voiced or unvoiced etc.). The segmentation and classification are based exclusively on a fuzzy approach while the phoneme recognition task exploits the acoustic features such as the formants for vowels and the pitch and intensity for consonants. Experiments were per- formed on Fongbe language (an African tonal language spoken especially in Benin, Togo and Nigeria) and results of phoneme error rate are reported.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

eBook
USD 16.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Palaz, D., Collobert, R., Magimai.-Doss, M.: End-to-end phoneme sequence recognition using convolutional neural networks. Idiap-RR (2013)

    Google Scholar 

  2. Yousafzai, J., Cvetkovic, Z., Sollich, P.: Tuning support vector machines for robust phoneme classification with acoustic waveforms. In: 10th Annual conference of the International Speech communication association, pp. 2359–2362, England, ISCA-INST SPEECH COMMUNICATION ASSOC (2009)

    Google Scholar 

  3. chwarz, P., Matejka, P., Cernocky, J.: Hierarchical structures of neural networks for phoneme recognition. IEEE International Conference on Acoustics Speech and Signal Processing Proceedings (2006)

    Google Scholar 

  4. Young, S.: Hmms and related speech recognition technologies. Springer Handbook of Speech Processing. Springer, Berlin (2008)

    Book  Google Scholar 

  5. Marani, S., Raviram, P., Wahidabanu, R.: Implementation of hmm and radial basis function for speech recognition. In: International Conference on Intelligent Agent and Multi-Agent Systems, pp. 1–4. Chennai (2009)

    Google Scholar 

  6. Solera-Urena, R., Martin-Iglesias, D., Gallardo-Antolin, A., Pelaez-Moreno, C., Diaz-de Maria, F.: Robust asr using support vector machines. Speech Commun. 49(4), 253–267 (2007)

    Article  Google Scholar 

  7. Trentin, E., Gori, M.: A survey of hybrid ann/hmm models for automatic speech recognition. Neurocomputing 37(1), 91–126 (2007)

    MATH  Google Scholar 

  8. Anapathy, S., Thomas, S., Hermansky, H.: Modulation frequency features for phoneme recognition in noisy speech. J. Acoust. Soc. Am. 125(1), EL8–EL11 (2009)

    Article  Google Scholar 

  9. Laleye, F.A.A., Ezin, E.C., Motamed, C.: Adaptive decision-level fusion for fongbe phoneme classification. In: Proceedings of the 12th International Conference on Informatics in Control, Automation and Robotics, Vol. 1, pp. 15–24. Colmar, Alsace, France, 21–23 July 2015

    Google Scholar 

  10. Laleye, F.A.A., Ezin, E.C., Motamed, C.: An algorithm based on fuzzy logic for text-independent fongbe speech segmentation. In: 11th International Conference on Signal-Image Technology & Internet-Based Systems, SITIS 2015, pp. 1–6. Bangkok, Thailand, 23-27 Nov 2015

    Google Scholar 

  11. Lefebvre, C., Brousseau., A.: A grammar of fonge, de gruyter mouton, p. 608

    Google Scholar 

  12. Laleye F., Ezin E., Motamed C.: Automatic fongbe phoneme recognition from spoken speech signal. In: Proceedings of the 13th International Conference on Informatics in Control, Automation and Robotics, pp. 102–109 (2016)

    Google Scholar 

  13. Huang, X., Acero, A., Hon, H-W.: Spoken Language Processing, A Guide to Theory, Algorithm and System Development. In Prentice Hall (2001)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Fréjus A. A. Laleye .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Laleye, F.A.A., Ezin, E.C., Motamed, C. (2018). Toward an Automatic Fongbe Speech Recognition System: Hierarchical Mixtures of Algorithms for Phoneme Recognition. In: Madani, K., Peaucelle, D., Gusikhin, O. (eds) Informatics in Control, Automation and Robotics . Lecture Notes in Electrical Engineering, vol 430. Springer, Cham. https://doi.org/10.1007/978-3-319-55011-4_7

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-55011-4_7

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-55010-7

  • Online ISBN: 978-3-319-55011-4

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics