Transducer-learning experiments on language understanding

Picó, David; Vidal, Enrique

doi:10.1007/BFb0054071

David Picó¹ &
Enrique Vidal¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1433))

Included in the following conference series:

International Colloquium on Grammatical Inference

104 Accesses

Abstract

The interest in using Finite-State Models in a large variety of applications is recently growing as more powerful techniques for learning them from examples have been developed. Language Understanding can be approached this way as a problem of language translation in which the target language is a formal language rather than a natural one. Finite-state transducers are used to model the translation process, and are automatically learned from training data consisting of pairs of natural-language/formal-language sentences. The need for training data is dramatically reduced by performing a two-level learning process based on lexical/phrase categorization. Successful experiments are presented on a task consisting in the “understanding” of Spanish natural-language sentences describing dates and times, where the target formal language is the one used in the popular Unix command “at”.

Work partially supported by EuTrans, European Union ESPRIT LTR Project 30268, and the Spanish CICYT under grant TIC-0745-CO2.

Author supported by a FPI grant from the Conselleria d'Educació i Ciència of Valencian Government.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

J.C Amengual, J.B. Benedí, F. Casacuberta, A. Castaño, A. Castellanos, D. Llorens, A. Marzal, F. Prat, E. Vidal and J.M. Vilar: “Using Categories in the Eutrans System”. ACL-ELSNET Workshop on Spoken Language Translation, Madrid, Spain, pp. 44–52. (1997)
Google Scholar
J.C. Amengual, E. Vidal. Two Different Approaches for Cost-efficient Viterbi Parsing with Error Correction. Proc. of the SSPR'96, IAPR International Workshop on Structural and Syntactical Pattern Recognition, August 20–23, 1996, Leipzig. To be published in the Proceedings.
Google Scholar
J.C. Amengual, E. Vidal and J.M. Benedí. “Simplifying Language through Error-Correcting Decoding”. Proceedings of the ICSLP96 (IV International Conference on Spoken Language Processing). To be published. October, 1996.
Google Scholar
J. Berstel. Transductions and Context-Free Languages. Teubner, Stuttgart. 1979.
Google Scholar
P.F. Brown et al.. “A Statistical Approach to Machine Translation”. Computational Linguistics, Vol. 16, No.2, pp.79–85, 1990.
Google Scholar
A. Castellanos, E. Vidal, J. Oncina. “Language Understanding and Subsequential Transducer Learning”. 1st International Colloquium on Grammatical Inference,Colchester, England. proc., pp. 11/1–11/10. April, 1993.
Google Scholar
J.G. Bauer, H. Stahl, J. Mller: “A One-pass Search Algorithm for Understanding Natural Spoken Time Utterances by Stochastic Models”. Proc. of the EUROSPEECH'95, Madrid, Spain, vol.I, pp. 567–570. (1995)
Google Scholar
F. Casacuberta: “Maximum Mutual Information and Conditional Maximum Likelihood Estimations on Stochastic Regular Syntax-Directed TranslationSchemes”, Lecture notes in Artificial Intelligence, vol.1147, pp. 282–291, Springer-Verlag. (1996)
Google Scholar
F. Casacuberta: “Growth Transformations for Probabilistic Functions on Stochastic Grammars”, International Journal of Pattern Recognition and Artificial Intelligence, vol. 10, n. 3, pp. 183–201, Word Scientific Publishing Company. (1996)
Google Scholar
C.T. Hemphill, J.J. Godfrey, G.R. Doddington. “The ATIS Spoken Language Systems, pilot Corpus”. Proc. of 3rd DARPA Workshop on Speech and Natural Language, pp. 102–108, Hidden Valley (PA), June 1990.
Google Scholar
F. Jelinek: “Language Modeling for Speech Recognition”. In [13] (1996).
Google Scholar
V. Jimenez, A. Castellanos, E. Vidal. “Some results with a trainable speech translation and understanding system”. In Proceedings of the ICASSP-95, Detroit, MI (USA), 1995
Google Scholar
A. Kornai (ed.); Proceedings of the ECAI'96 Workshop: Extended Finite State Models of Language. Budapest, 1996.
Google Scholar
A. Lavie, A. Waibel, L. Levin, M. Finke, D. Gates, M. Gavaldà, T. Zeppenfeld and P. Zhan: “JANUS-III: Speech-to-speech Translation in Multiple Languages”, Proc. of the ICASSP'97, Munich, Germany, vol.I, pp. 99–102. (1997)
Google Scholar
E. Maier and S. McGlashan: “Semantic and Dialogue Processing in the VERBMOBIL Spoken Dialogue Translation System”, In Proceedings in Artificial Intelli-gence: CRIM/FORWISS Workshop on Progress and Prospects of Speech Research and Technology, H. Niemann, R. de Mori and G. Hanrieder (eds.), Infix, pp. 270–273. (1994)
Google Scholar
J. Oncina. “Aprendizaje de Lenguages RegulÄres y Funciones Subsecuenciales”. Ph.D. diss., Universidad Politecnica de Valencia, 1991.
Google Scholar
J. Oncina, P. Garcia, E. Vidal. “Learning Subsequential Transducers for Pattern Recognition Interpretation Tasks”. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol.15, No.5, pp.448–458. May, 1993.
Article Google Scholar
J. Oncina, A. Castellanos, E. Vidal, V. Jimenez. “Corpus-Based Machine Translation through Subsequential Transducers”. Third Int. Conf. on the Cognitive Science of Natural Language Processing, proc., Dublin, 1994
Google Scholar
J. Oncina, M.A. Var. “Using domain information during the learning of a subsequential transducer”. In Laurent Miclet and Colin de la Higuera, editors, Gram-matical Inference: Learning Syntax from Sentences, Lecture Notes in Computer Science, vol. 1147, pp. 301–312. Springer-Verlag. 1996
Google Scholar
R. Pieraccini, E. Levin. “Stochastic Representation of Semantic Structure for Speech Understanding”. EUROSPEECH'91, Proc., Vol. 2, pp.383–386. Genoa Sept, 1991.
Google Scholar
R. Pieraccini, E. Levin, E. Vidal. “Learning How To Understand Language”. EUROSPEECH'93, proc., Vol.2, pp. 1407–1412. Berlin, Sept, 1993.
Google Scholar
N. Prieto, E. Vidal. “Learning Language Models through the ECGI method”. Speech Communication, No.11, pp.299–309. 1992.
Article Google Scholar
K. Seymore, R. Rosenfeld. “Scalable Backoff Language Models”. ICSLP-96, proc. pp.232–235. Philadelfia, 1996.
Google Scholar
E. Vidal: “Language Learning, Understanding and Translation”, In Proc. in Art. Intell.: CRIM/FORWISS Workshop on Progress and Prospects of Speech Research and Technology, H. Niemann, R. de Mori and G. Hanrieder (eds.), pp. 131–140. Infix, (1994).
Google Scholar
E. Vidal: “Finite-State Speech-to-speech Translation”, Proc. of the ICASSP'97,Munich, Germany, vol.1, pp. 111–122. (1997)
Google Scholar
E. Vidal, F. Casacuberta, P. Garcia. “Grammatical Inference and Automatic Speech Recognition”. In Speech Recognition and Coding. New Advances and Trends, J.Rubio and J.M.Lopez, Eds. Springer Verlag, 1994.
Google Scholar
E. Vidal, D. Llorens. “Using knowledge to improve N-Gram Language Modeling through the MGGI methodology”. In Grammatical Inference: Learning Syntax from Sentences, L.Miclet, C.De La Higuera, Eds. LNAI (1147), Springer-Verlag, 1996.
Google Scholar
J.M. Vilar, A. Marzal, E. Vidal: “Learning Language Translation in Limited Domains using Finite-State Models: some Extensions and Improvements”. Proceedings of the EUROSPEECH-95, Madrid, Spain, pp. 1231–1234. (1995)
Google Scholar
J.M. Vilar, E. Vidal and J.C. Amengual: “Learning Extended Finite State Models for Language Translation”. Proceedings of the ECAI96 (12th European Conference on Artificial Intelligence). August (1996).
Google Scholar
Linux system documentacion, at directory “/usr/doc/at” (Debian distribution). Also, see “man at” on a Unix system.
Google Scholar

Download references

Author information

Authors and Affiliations

Institut TecnolÒgic d'Informàtica, Universitat Politècnica de València, València, Spain
David Picó & Enrique Vidal

Authors

David Picó
View author publications
You can also search for this author in PubMed Google Scholar
Enrique Vidal
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Vasant Honavar Giora Slutzki

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Picó, D., Vidal, E. (1998). Transducer-learning experiments on language understanding. In: Honavar, V., Slutzki, G. (eds) Grammatical Inference. ICGI 1998. Lecture Notes in Computer Science, vol 1433. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0054071

Download citation

DOI: https://doi.org/10.1007/BFb0054071
Published: 23 May 2006
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-64776-8
Online ISBN: 978-3-540-68707-8
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics