A Model for Part-of-Speech Prediction

Franz, Alexander

doi:10.1007/978-1-4612-2404-4_40

Alexander Franz³

Part of the book series: Lecture Notes in Statistics ((LNS,volume 112))

860 Accesses

Abstract

Robust natural language analysis systems must be able to handle words that are not in the lexicon. This paper describes a statistical model that predicts the most likely Parts-of-Speech for previously unseen words. The method uses a loglinear model to combine a number of orthographic and morphological features, and returns a probability distribution over the open word classes. The model is combined with a stochastic Part-of-Speech tagger to provide a model of context. Empirical evaluation shows that this results in significant gains in Part-of-Speech prediction accuracy over simpler methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Agresti, A. (1990). Categorical Data Analysis. John Wiley & Sons, New York.
MATH Google Scholar
Bishop, Y. M., Fienberg, S. E., and Holland, P. W. (1975). Discrete Multivariate Analysis: Theory and Practice. MIT Press, Cambridge, MA.
Google Scholar
Charniak, E., Hendrickson, C., Jacobson, N., and Perkowitz, M. (1993). Equations for part-of-speech tagging. In AAAI-93, pages 784–789.
Google Scholar
de Marcken, C. G. (1990). Parsing the LOB corpus. In Proceedings of ACL-90, pages 243–251.
Google Scholar
Deming, W. E. and Stephan, F. F. (1940). On a least squares adjustment of a sampled frequency table when the expected marginal totals are known. Ann. Math. Statis, (11):427–444.
Google Scholar
Duda, R. O. and Hart, P. E. (1973). Pattern Classification and Scene Analysis. John Wiley & Sons, New York.
MATH Google Scholar
Fienberg, S. E. (1980). The Analysis of Cross-Classified Categorical Data. The MIT Press, Cambridge, MA, second edition edition.
MATH Google Scholar
Jelinek, F., Mercer, R. L., Bahl, L. R., and J, K. B. (1977). Perplexity — a measure of difficulty of speech recognition tasks. In 94th Meeting of the Acoustical Society of America, Miami Beach, FL.
Google Scholar
Marcus, M. P., Santorini, B., and Marcinkiewicz, M. A. (1993). Building a large annotated corpus of English: The Penn Treebank. Computational Linguistics, 19(2):313–330.
Google Scholar
Weischedel, R., Meteer, M., Schwartz, R., Ramshaw, L., and Palmucci, J. (1993). Coping with ambiguity and unknown words through probabilistic models. Computational Linguistics, 19(2):359–382.
Google Scholar

Download references

Author information

Authors and Affiliations

Center for Machine Translation, Carnegie Mellon University, Pittsburgh, PA, 15213, USA
Alexander Franz

Authors

Alexander Franz
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, Vanderbilt University, Box 1679, Station B, Nashville, Tennessee, 37235, USA
Doug Fisher
Department of Economics Institute of Statistics and Econometrics, Free University of Berlin, 14185, Berlin, Garystre 21, Germany
Hans-J. Lenz

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Franz, A. (1996). A Model for Part-of-Speech Prediction. In: Fisher, D., Lenz, HJ. (eds) Learning from Data. Lecture Notes in Statistics, vol 112. Springer, New York, NY. https://doi.org/10.1007/978-1-4612-2404-4_40

Download citation

DOI: https://doi.org/10.1007/978-1-4612-2404-4_40
Publisher Name: Springer, New York, NY
Print ISBN: 978-0-387-94736-5
Online ISBN: 978-1-4612-2404-4
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics