Classification of a Sequence of Objects with the Fuzzy Decoding Method

Savchenko, Andrey V.; Savchenko, Lyudmila V.

doi:10.1007/978-3-319-08644-6_32

Andrey V. Savchenko²⁵ &
Lyudmila V. Savchenko²⁶

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8536))

Included in the following conference series:

International Conference on Rough Sets and Current Trends in Computing

885 Accesses
1 Citations

Abstract

The problem of recognition of a sequence of objects (e.g., video-based image recognition, phoneme recognition) is explored. The generalization of the fuzzy phonetic decoding method is proposed by assuming the distribution of the classified object to be of exponential type. Its preliminary phase includes association of each model object with the fuzzy set of model classes with grades of membership defined as the confusion probabilities estimated with the Kullback-Leibler divergence between model distributions. At first, each object (e.g., frame) in a classified sequence is put in correspondence with the fuzzy set which grades are defined as the posterior probabilities. Next, this fuzzy set is intersected with the fuzzy set corresponding to the nearest neighbor. Finally, the arithmetic mean of these fuzzy intersections is assigned to the decision for the whole sequence. In this paper we propose not to limit the method’s usage with the Kullback-Leibler discrimination and to estimate the grades of membership of models and query objects based on an arbitrary distance with appropriate scale factor. The experimental results in the problem of isolated Russian vowel phonemes and words recognition for state-of-the-art measures of similarity are presented. It is shown that the correct choice of the scale parameter can significantly increase the recognition accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Savchenko, A.V.: Probabilistic neural network with homogeneity testing in recognition of discrete patterns set. Neural Networks 46, 227–241 (2013)
Article MATH Google Scholar
Theodoridis, S., Koutroumbas, K.: Pattern Recognition, 4th edn. Elsevier Inc. (2009)
Google Scholar
Benesty, J., Sondh, M., Huang, Y. (eds.): Springer Handbook of Speech Recognition. Springer (2008)
Google Scholar
Savchenko, L.V., Savchenko, A.V.: Fuzzy Phonetic Decoding Method in a Phoneme Recognition Problem. In: Drugman, T., Dutoit, T. (eds.) NOLISP 2013. LNCS, vol. 7911, pp. 176–183. Springer, Heidelberg (2013)
Chapter Google Scholar
Wang, H., Wang, Y., Cao, Y.: Video-based face recognition: a survey. World Academy of Science. Engineering and Technologies 60, 293–302 (2009)
Google Scholar
Zadeh, L.A.: Fuzzy Sets. Information Control 8, 338–353 (1965)
Article MathSciNet MATH Google Scholar
Sarkar, M.: Fuzzy-rough nearest neighbor algorithms in classification. Fuzzy Sets and Systems 158(19), 2134–2152 (2007)
Article MathSciNet MATH Google Scholar
Kullback, S.: Information Theory and Statistics. Dover Pub. (1997)
Google Scholar
Anusuya, M.A., Katti, S.K.: Speech recognition by Machine: A Review. International Journal of Computer Science and Information Security 6(3), 181–205 (2009)
Google Scholar
Kipyatkova, I.S., Karpov, A.A.: An Analytical Survey of Large Vocabulary Russian Speech Recognition Systems. SPIIRAS Proceedings 12, 7–20 (2010)
Google Scholar
Keener, R.W.: Theoretical Statistics: Topics for a Core Course. Springer, New York (2010)
Google Scholar
Reddy, D.R.: Speech recognition by machine: a review. Proceedings of the IEEE 64(4), 501–531 (1976)
Article Google Scholar
Hill, J.E.: The minimum of n independent normal distributions, http://www.untruth.org/~josh/math/normal-min.pdf
Savchenko, A.V.: Adaptive Video Image image Recognition recognition System Using using a Committee committee Machinemachine. Optical Memory and Neural Networks (Information Optics) 21(4), 219–226 (2012)
Article Google Scholar
Specht, D.F.: Probabilistic neural networks. Neural Networks 3(1), 109–118 (1990)
Article Google Scholar
Itakura, F., Saito, S.: An analysis–synthesis telephony based on the maximum likelihood method. In: Proc. of International Congress on Acoustics c-5-5, vol. 5, pp. 17–20 (1968)
Google Scholar
Basseville, M.: Distance measures for signal processing and pattern recognition. Signal Processing 18, 349–369 (1989)
Article MathSciNet Google Scholar
Mérialdo, B.: Multilevel Decoding for Very-Large-Size-Dictionary Speech Recognition. IBM Journal of Research and Development 32(2), 227–237 (1988)
Article Google Scholar
Sirigos, J., Fakotakis, N., Kokkinakis, G.: A hybrid syllable recognition system based on vowel spotting. Speech Communication 38, 427–440 (2002)
Article MATH Google Scholar
Savchenko, A.V.: Phonetic words decoding software in the problem of Russian speech recognition. Automation and Remote Control 74(7), 1225–1232 (2013)
Article Google Scholar
Savchenko, A.V.: Phonetic encoding method in the isolated words recognition problem. Journal of Communications Technology and Electronics 59(4), 310–315 (2014)
Article Google Scholar
CMU Sphinx, http://cmusphinx.sourceforge.net/

Download references

Author information

Authors and Affiliations

National Research University Higher School of Economics, Nizhny Novgorod, Russia
Andrey V. Savchenko
Nizhny Novgorod State Linguistic University, Russia
Lyudmila V. Savchenko

Authors

Andrey V. Savchenko
View author publications
You can also search for this author in PubMed Google Scholar
Lyudmila V. Savchenko
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science and Artificial Intelligence, University of Granada, Calle del Periodista Daniel Saucedo Aranda s/n, 18071, Granada, Spain
Chris Cornelis
Institute of Computer Science, Warsaw University of Technology, Nowowiejska 15/19, 00-665, Warsaw, Poland
Marzena Kryszkiewicz
University of Warsaw, Poland
Dominik Ślȩzak
Polytechnic University of Madrid, Spain
Ernestina Menasalvas Ruiz
Deparment of Computer Sciences, Universidad Central Marta Abreu de las Villas, Santa Clara, Villa Clara, Cuba
Rafael Bello
Department of Computer Science and Technology, Nanjing University, 210023, Nanjing, China
Lin Shang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Savchenko, A.V., Savchenko, L.V. (2014). Classification of a Sequence of Objects with the Fuzzy Decoding Method. In: Cornelis, C., Kryszkiewicz, M., Ślȩzak, D., Ruiz, E.M., Bello, R., Shang, L. (eds) Rough Sets and Current Trends in Computing. RSCTC 2014. Lecture Notes in Computer Science(), vol 8536. Springer, Cham. https://doi.org/10.1007/978-3-319-08644-6_32

Download citation

DOI: https://doi.org/10.1007/978-3-319-08644-6_32
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-08643-9
Online ISBN: 978-3-319-08644-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics