EM Learning for Symbolic-Statistical Models in Statistical Abduction

Sato, Taisuke

doi:10.1007/3-540-45884-0_11

Chapter
First Online: 01 January 2002

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 2281)

Abstract

We first review a logical-statistical framework called statistical abduction and identify its three computational tasks, one of which is learning parameters from observations by maximum likelihood (ML) estimation. Traditionally, the EM algorithm has been used for ML estimation in the presence of missing values. We report that the graphical EM algorithm, a new EM algorithm developed for statistical abduction, achieves the same time complexity as the specialized EM algorithms developed in individual disciplines, such as the Inside-Outside algorithm for PCFGs (probabilistic context-free grammars). Furthermore, learning experiments using two corpora revealed that it can outperform the Inside-Outside algorithm by orders of magnitude. We then look specifically at a family of PCFG extensions that incorporate context sensitivity. Experiments show that these extensions are learnable by the graphical EM algorithm in at most twice the time required for plain PCFGs, even though they have higher time complexity.
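
To make the idea of ML estimation with missing values concrete, the following is a minimal sketch of a generic EM loop. It is not the graphical EM algorithm or the Inside-Outside algorithm discussed in the chapter. The model (a mixture of two biased coins whose identity is hidden in each session), the data, and all names such as `em_step` and `run_em` are assumptions chosen purely for illustration.

```python
import math

def _joint(h, n, pi, pa, pb):
    """Joint weights of (coin A, h heads) and (coin B, h heads) in n tosses.
    The binomial coefficient is omitted because it does not depend on the parameters."""
    la = pi * pa**h * (1 - pa)**(n - h)
    lb = (1 - pi) * pb**h * (1 - pb)**(n - h)
    return la, lb

def log_likelihood(data, n, pi, pa, pb):
    """Log-likelihood of the observed head counts, up to an additive constant."""
    return sum(math.log(sum(_joint(h, n, pi, pa, pb))) for h in data)

def em_step(data, n, pi, pa, pb):
    """One EM iteration for the hypothetical two-coin mixture."""
    # E-step: posterior probability that each session used coin A.
    resp = []
    for h in data:
        la, lb = _joint(h, n, pi, pa, pb)
        resp.append(la / (la + lb))
    # M-step: re-estimate parameters from the expected counts.
    wa = sum(resp)
    wb = len(data) - wa
    new_pi = wa / len(data)
    new_pa = sum(r * h for r, h in zip(resp, data)) / (wa * n)
    new_pb = sum((1 - r) * h for r, h in zip(resp, data)) / (wb * n)
    return new_pi, new_pa, new_pb

def run_em(data, n, init=(0.5, 0.6, 0.5), iters=20):
    """Run EM for a fixed number of iterations and record the log-likelihood."""
    params = init
    history = [log_likelihood(data, n, *params)]
    for _ in range(iters):
        params = em_step(data, n, *params)
        history.append(log_likelihood(data, n, *params))
    return params, history

# Illustrative data: heads observed in five sessions of 10 tosses each.
DATA = [5, 9, 8, 4, 7]
PARAMS, HISTORY = run_em(DATA, 10)
```

The recorded log-likelihood never decreases from one iteration to the next, which is the general property of EM that makes it usable for ML estimation when part of the data (here, which coin was tossed) is unobserved.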

Author information

Authors and Affiliations

Tokyo Institute of Technology, 2-12-1 Ôokayama Meguro-ku, 152-8552, Tokyo, Japan
Taisuke Sato

Editor information

Editors and Affiliations

Department of Informatics, Kyushu University, 6-10-1 Hakozaki, Higashi-ku, 812-8581, Fukuoka, Japan
Setsuo Arikawa & Ayumi Shinohara

About this chapter

Cite this chapter

Sato, T. (2002). EM Learning for Symbolic-Statistical Models in Statistical Abduction. In: Arikawa, S., Shinohara, A. (eds) Progress in Discovery Science. Lecture Notes in Computer Science, vol 2281. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45884-0_11

DOI: https://doi.org/10.1007/3-540-45884-0_11
Published: 14 March 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43338-5
Online ISBN: 978-3-540-45884-5
eBook Packages: Springer Book Archive
