Supervised HDP Using Prior Knowledge

Xie, Boyi; Passonneau, Rebecca J.

doi:10.1007/978-3-642-31178-9_21

Boyi Xie¹⁹ &
Rebecca J. Passonneau¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7337))

Included in the following conference series:

International Conference on Application of Natural Language to Information Systems

2308 Accesses
4 Citations

Abstract

End users can find topic model results difficult to interpret and evaluate. To address user needs, we present a semi-supervised hierarchical Dirichlet process for topic modeling that incorporates user-defined prior knowledge. Applied to a large electronic dataset, the generated topics are more fine-grained, more distinct, and align better with users’ assignments of topics to documents.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

AlSumait, L., Barbará, D., Gentle, J., Domeniconi, C.: Topic Significance Ranking of LDA Generative Models. In: Buntine, W., Grobelnik, M., Mladenić, D., Shawe-Taylor, J. (eds.) ECML PKDD 2009, Part I. LNCS, vol. 5781, pp. 67–82. Springer, Heidelberg (2009)
Chapter Google Scholar
Andrzejewski, D., Zhu, X.: Latent dirichlet allocation with topic-in-set knowledge. In: Proceedings of the NAACL HLT 2009 Workshop on Semi-Supervised Learning for Natural Language Processing, pp. 43–48 (2009)
Google Scholar
Blei, D.M., McAuliffe, J.D.: Supervised topic models. In: Advances in Neural Information Processing Systems, NIPS (2007)
Google Scholar
Blei, D.M., Ng, A., Jordan, M.: Latent dirichlet allocation. JMLR 3, 993–1022 (2003)
MATH Google Scholar
Escobar, M.D., West, M.: Bayesian density estimation and inference using mixtures. Journal of the American Statistical Association 90, 577–588 (1995)
MathSciNet MATH Google Scholar
Jaccard, P.: Nouvelles recherches sur la distribution florale. Bulletin de la Société Vaudoise des Sciences Naturelles 44, 223–270 (1908)
Google Scholar
Mimno, D., Wallach, H., Talley, E., Leenders, M., McCallum, A.: Optimizing semantic coherence in topic models. In: Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, pp. 262–272 (July 2011)
Google Scholar
Perotte, A., Bartlett, N., Elhadad, N., Wood, F.: Hierarchically supervised latent dirichlet allocation. In: Advances in Neural Information Processing Systems, NIPS (2011)
Google Scholar
Ramage, D., Hall, D., Nallapati, R., Manning, C.D.: Labeled lda: a supervised topic model for credit attribution in multi-labeled corpora. In: Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, pp. 248–256 (2009)
Google Scholar
Teh, Y.W., Jordan, M.I., Beal, M.J., Blei, D.M.: Hierarchical dirichlet processes. Journal of the American Statistical Association 101(476), 1566–1581 (2006)
Article MathSciNet MATH Google Scholar

Download references

Author information

Authors and Affiliations

Center for Computational Learning Systems, Columbia University, 475 Riverside Drive MC 7717, 10115, New York, USA
Boyi Xie & Rebecca J. Passonneau

Authors

Boyi Xie
View author publications
You can also search for this author in PubMed Google Scholar
Rebecca J. Passonneau
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Information Science Department, University of Groningen, Oude Kijk in ’t Jatstraat 26, 9712 EK, Groningen, The Netherlands
Gosse Bouma
Faculty of Economics and Business, University of Groningen, Nettelbosje 2, 9747 AE, Groningen, The Netherlands
Ashwin Ittoo & Hans Wortmann &
CNAM-Laboratoire Cédric, 292 rue St. Martin, 75141, Paris Cedex 03, France
Elisabeth Métais

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Xie, B., Passonneau, R.J. (2012). Supervised HDP Using Prior Knowledge. In: Bouma, G., Ittoo, A., Métais, E., Wortmann, H. (eds) Natural Language Processing and Information Systems. NLDB 2012. Lecture Notes in Computer Science, vol 7337. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31178-9_21

Download citation

DOI: https://doi.org/10.1007/978-3-642-31178-9_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-31177-2
Online ISBN: 978-3-642-31178-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics