Learning phonetic rules in a speech recognition system

  • Zoltán Alexin
  • János Csirik
  • Tibor Gyimóthy
  • Mark Jelasity
  • László Tóth
Part II Papers
Part of the Lecture Notes in Computer Science book series (LNCS, volume 1297)

Abstract

Current speech recognition systems can be categorized into two broad classes; the knowledge-based approach and the stochastic one. In this paper we present a rule-based method for the recognition of Hungarian vowels. A spectrogram model was used as a front-end module and some acoustic features were extracted (e.g. locations, intensities and shapes of local maxima) from spectrograms by using a genetic algorithm method. On the basis of these features we developed a rule set for the recognition of isolated Hungarian vowels. These rules represented by Prolog clauses were refined by the IMPUT Inductive Logic Programming method.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Alexin, Z., Gyimóthy, T., Boström, H.: Integrating Algorithmic Debugging and Unfolding Transformation in an Interactive Learner in Proceedings of the 12th European Conference on Artificial Intelligence ECAI-96 ed. Wolfgang Wahlster, Budapest, Hungary (1996) 403–407 John Wiley & Son's Ltd. (1996)Google Scholar
  2. 2.
    Bolla, Kálmán: Magyar fonetikai atlasz, A szegmentálás hangszerkezet elemei Nemzeti Tankönyvkiadó Rt Budapest (1995) in HungarianGoogle Scholar
  3. 3.
    Cooke, M. (eds.): Visual Representations of Speech Signals, Wiley, 1993Google Scholar
  4. 4.
    Fohr, D., Haton, J., Laprie, Y.: Knowledge-Based Techniques in Acoustic-Phonetic Decoding of Speech: Interest and Limitations, International Journal on Pattern Recognition and Artificial Intelligence Vol. 8, No. 1, (1994) 133–153Google Scholar
  5. 5.
    Huebener, K., Carson-Berndsen, J.: Phoneme Recognition Using Acoustic Events, Verbmobil Technical Report No. 15, June 1994Google Scholar
  6. 6.
    Jelasity, M., Dombi, J.: GAS, an Approach on Modeling Species in Genetic Algorithms. Proceedings of the EA'95 (1995)Google Scholar
  7. 7.
    Kókai, G., Alexin, Z., Kocsis, F.: The IDT System and its Application for Learning Prolog Programs. Proc. of the Sixth International Conference on Artificial Intelligence and Information Control Systems of Robots (AIICSR-94) Smolenice Castle Slovakia September 12–16. (1994) 315–320Google Scholar
  8. 8.
    Lamel, L. F.: A knowledge-based system for stop consonant identification based on speech spectrogram reading, Computer Speech and Language (1993) 2, 169–191Google Scholar
  9. 9.
    Lavrač, N., Džeroski, S.: Inductive Logic Programming: Techniques and Applications Ellis Horwood, (1994)Google Scholar
  10. 10.
    Mammachandran, R. P., Mammone, R. J. (eds.): Modern Methods of Speech Processing, Kluwer Academic, 1995Google Scholar
  11. 11.
    Mercier, G. et al.: Recognition of speaker-dependent continuous speech with KEAL, IEE Proceedings Vol. 136, Pt.1, No.2, April (1989)Google Scholar
  12. 12.
    Muggleton, S., De Raedt, L.: Inductive Logic Programming: Theory and methods, Journal of Logic Programming 19 (20) (1994) 629–679Google Scholar
  13. 13.
    Rabiner, L. R., Schafer, R. W.: Digital Processing of Speech Signals Prentice Hall (1978)Google Scholar
  14. 14.
    Walbel, A., Lee, K. (eds.): Readings in Speech Recognition, Morgan Kaufmann (1990)Google Scholar
  15. 15.
    Zue, V. W.: The Use of Speech Knowledge in Automatic Speech Recognition, Proc. IEEE, Vol. 73, No. 11, (1985) 1602–1615Google Scholar

Copyright information

© Springer-Verlag 1997

Authors and Affiliations

  • Zoltán Alexin
    • 1
  • János Csirik
    • 2
  • Tibor Gyimóthy
    • 3
  • Mark Jelasity
    • 3
  • László Tóth
    • 3
  1. 1.Department of Applied InformaticsJózsef Attila UniversitySzegedHungary
  2. 2.Department of Computer ScienceJózsef Attila UniversitySzegedHungary
  3. 3.Research Group on Artificial IntelligenceHungarian Academy of SciencesSzegedHungary

Personalised recommendations