Information-theoretic analysis of efficiency of the phonetic encoding–decoding method in automatic speech recognition

Theory and Methods of Signal Processing

DOI: 10.1134/S1064226916040112

Cite this article as:
Savchenko, V.V. & Savchenko, A.V. J. Commun. Technol. Electron. (2016) 61: 430. doi:10.1134/S1064226916040112
  • 45 Downloads

Abstract

A words phonetic decoding method in automatic speech recognition is considered. The properties of Kullback–Leibler divergence are used to synthesize the estimation of the distribution of divergence between minimum speech units (e.g., single phonemes) inside a single class. It is demonstrated that the minimum variance of the intraphonemic divergence is reached when the phonetic database is tuned to the voice of a single speaker. The estimations are proven by experimental results on the recognition of vowel sounds and isolated words of Russian language.

Copyright information

© Pleiades Publishing, Inc. 2016

Authors and Affiliations

  1. 1.Nizhny Novgorod State Linguistic UniversityNizhny NovgorodRussia
  2. 2.National Research University Higher School of EconomicsNizhny NovgorodRussia

Personalised recommendations