Classification of a Sequence of Objects with the Fuzzy Decoding Method

  • Conference paper
Rough Sets and Current Trends in Computing (RSCTC 2014)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 8536)

Abstract

We consider the problem of recognizing a sequence of objects (e.g., video-based image recognition or phoneme recognition). We generalize the fuzzy phonetic decoding method by assuming that the distribution of the classified object is of exponential type. In a preliminary phase, each model object is associated with a fuzzy set of model classes whose grades of membership are the confusion probabilities estimated from the Kullback-Leibler divergence between model distributions. Recognition then proceeds in three steps. First, each object (e.g., frame) in the classified sequence is put in correspondence with a fuzzy set whose grades are the posterior probabilities. Next, this fuzzy set is intersected with the fuzzy set corresponding to the object's nearest neighbor. Finally, the arithmetic mean of these fuzzy intersections is taken as the decision for the whole sequence. In this paper we propose not to restrict the method to the Kullback-Leibler discrimination, but to estimate the grades of membership of both model and query objects from an arbitrary distance with an appropriate scale factor. Experimental results are presented for the recognition of isolated Russian vowel phonemes and words with state-of-the-art similarity measures. They show that a correct choice of the scale parameter can significantly increase recognition accuracy.
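The steps in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and several details are assumptions made only for illustration: the grades of membership are computed as a normalized exponential of the scaled distance, the fuzzy intersection is Zadeh's element-wise minimum, the nearest neighbor is searched among the model objects, and the final label is the class with the largest mean grade. The names `membership_grades`, `classify_sequence`, `distance`, and `scale` are hypothetical.

```python
import numpy as np


def membership_grades(distances, scale):
    """Turn distances into grades that sum to 1 along the last axis.

    Assumption: grades are proportional to exp(-scale * distance).
    Subtracting the row minimum only improves numerical stability.
    """
    d = np.asarray(distances, dtype=float)
    w = np.exp(-scale * (d - d.min(axis=-1, keepdims=True)))
    return w / w.sum(axis=-1, keepdims=True)


def classify_sequence(frames, models, distance, scale):
    """Classify a sequence of frames against one model object per class.

    Returns the index of the winning class and the mean grades per class.
    """
    models = list(models)

    # Preliminary phase: a fuzzy set of model classes for every model,
    # built from the pairwise distances between models.
    model_dist = np.array([[distance(a, b) for b in models] for a in models])
    model_sets = membership_grades(model_dist, scale)

    intersections = []
    for x in frames:
        d = np.array([distance(x, m) for m in models])
        query_set = membership_grades(d, scale)   # fuzzy set of the frame
        nearest = int(np.argmin(d))               # nearest-neighbor model
        # Assumption: fuzzy intersection as the element-wise minimum.
        intersections.append(np.minimum(query_set, model_sets[nearest]))

    # Arithmetic mean of the intersections over the whole sequence.
    mean_grades = np.mean(intersections, axis=0)
    return int(np.argmax(mean_grades)), mean_grades
```

Any dissimilarity can be passed as `distance`, for example a squared Euclidean distance between feature vectors or a Kullback-Leibler divergence between estimated distributions. In this sketch `scale` controls how sharply the grades concentrate on the closest class, which is one way to read the abstract's remark that the scale parameter affects accuracy.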

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Savchenko, A.V., Savchenko, L.V. (2014). Classification of a Sequence of Objects with the Fuzzy Decoding Method. In: Cornelis, C., Kryszkiewicz, M., Ślęzak, D., Ruiz, E.M., Bello, R., Shang, L. (eds) Rough Sets and Current Trends in Computing. RSCTC 2014. Lecture Notes in Computer Science, vol 8536. Springer, Cham. https://doi.org/10.1007/978-3-319-08644-6_32

  • DOI: https://doi.org/10.1007/978-3-319-08644-6_32

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-08643-9

  • Online ISBN: 978-3-319-08644-6

  • eBook Packages: Computer Science, Computer Science (R0)