Modèle probabiliste d’un langage en reconnaissance de la parole

Derouault, Anne-Marie; Jelinek, Fred

doi:10.1007/BF02997937


Probabilist language model in speech recognition

Published: March 1984

Volume 39, pages 143–151 (1984)

Annales des Télécommunications


Summary (translated from the French)

In speech recognition tasks, particularly those that use a large vocabulary, a language model is needed to help select candidate words for the continuation of the sentence. The idea of building a probabilistic model suited to this task follows naturally from the fact that the sequence of words in a sentence is subject to grammatical and semantic constraints. The principle is to estimate the conditional probability that a word occurs, given the beginning of the sentence. After posing the problem of a model suited to a large vocabulary and a natural task, the authors present a theoretical tool (the Markov source of information theory) that is useful for formalising it, together with automatic methods for estimating the required parameters. They then describe an experiment with a particular model for English and report its results.
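
The principle stated above can be written as the standard chain-rule factorisation of a sentence probability. The notation is an assumption made here for illustration and is not taken from the paper: w_i is the i-th word, n the sentence length, and k the number of past words retained. The second line is the usual finite-memory approximation associated with a Markov source; this preview does not say which order k the authors use.

```latex
P(w_1, \dots, w_n) = \prod_{i=1}^{n} P(w_i \mid w_1, \dots, w_{i-1})

P(w_i \mid w_1, \dots, w_{i-1}) \approx P(w_i \mid w_{i-k}, \dots, w_{i-1})
```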

Abstract

In speech recognition with a large vocabulary, a language model is used to guide the choice of word candidates along the sentence being decoded. The succession of words is subject to syntactic and semantic constraints. A probabilistic language model should estimate the conditional probability that any word is uttered, given the past sequence. In this paper, the authors introduce the language modelling problem for natural tasks. They give an information-theoretic tool for its formalisation: the notion of a Markov source. Automatic training and parameter estimation are presented. An experiment with a particular model for natural English is then reported.
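
To make the idea of estimating a word's conditional probability from text concrete, here is a minimal Python sketch. It is not the authors' model: it assumes a first-order (bigram) history, a hypothetical `<s>` start marker, whitespace tokenisation, and a fixed interpolation weight `lam` that mixes bigram and unigram relative frequencies, which is one common way to cope with sparse counts. All function and variable names are invented for illustration.

```python
from collections import Counter

START = "<s>"  # hypothetical sentence-start marker (an assumption)


def train(sentences):
    """Collect the counts needed for relative-frequency estimates."""
    word_counts = Counter()     # how often each word is predicted
    context_counts = Counter()  # how often each word serves as history
    pair_counts = Counter()     # (history, word) occurrences
    for sentence in sentences:
        words = sentence.split()
        history = [START] + words[:-1]
        word_counts.update(words)
        context_counts.update(history)
        pair_counts.update(zip(history, words))
    return word_counts, context_counts, pair_counts


def cond_prob(word, prev, model, lam=0.7):
    """Interpolated estimate of P(word | prev); lam is an assumed weight."""
    word_counts, context_counts, pair_counts = model
    p_unigram = word_counts[word] / sum(word_counts.values())
    if context_counts[prev] == 0:
        return p_unigram  # unseen history: fall back to the unigram estimate
    p_bigram = pair_counts[(prev, word)] / context_counts[prev]
    return lam * p_bigram + (1 - lam) * p_unigram


def sentence_prob(sentence, model, lam=0.7):
    """Product of conditional word probabilities (no end-of-sentence term)."""
    prob, prev = 1.0, START
    for word in sentence.split():
        prob *= cond_prob(word, prev, model, lam)
        prev = word
    return prob


corpus = ["the cat sat", "the dog sat", "a cat ran"]  # toy data, not from the paper
model = train(corpus)
```

With this toy corpus, `sentence_prob("the cat sat", model)` is larger than `sentence_prob("sat cat the", model)`, which is the behaviour a recogniser relies on when ranking competing word sequences. In practice the weight `lam` would be estimated from held-out data rather than fixed by hand.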




Author information

Authors and Affiliations

Centre Scientifique IBM France, 36, av. R. Poincaré, 75116, Paris, France
Anne-Marie Derouault
IBM TJ Watson Research Center, Yorktown Heights, 10598, NY, USA
Fred Jelinek


About this article

Cite this article

Derouault, AM., Jelinek, F. Modèle probabiliste d’un langage en reconnaissance de la parole. Ann. Télécommun. 39, 143–151 (1984). https://doi.org/10.1007/BF02997937


Received: 20 June 1983
Accepted: 08 February 1984
Issue Date: March 1984
DOI: https://doi.org/10.1007/BF02997937
