Skip to main content

Combining User Interested Topic and Document Topic for Personalized Information Retrieval

  • Conference paper
Big Data Analytics (BDA 2014)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8883))

Included in the following conference series:

Abstract

Personalization aims to improve user’s searching experience by tailoring search results according to individual user’s interests. Typically, search engines employ two-level ranking strategy. Firstly, initial list of documents is prepared using a low-quality ranking function that is less computationally expensive. Secondly, initial list is re-ranked by machine learning algorithms which involve expensive computation. The proposed approach explores the second level of ranking strategy which exploits user information. In this approach, queries and search-result clicks are used to model the user interest profiles probabilistically. The user’s history provides the prior probability that a user searches for a topic which is independent of user query. The document topical features are combined with user specific information to determine whether a document satisfies user’s information need or not. The probability of relevance of each retrieved document for a query is computed by integrating user topic model and document topic model. Thus, documents are re-ranked according to the personalized score computed for each document. The proposed approach has been implemented and evaluated using real dataset similar to AOL search log dataset for personalization. Empirical results along with the theoretical foundations of the model confirm that the proposed approach shows promising results.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Berger, A., Lafferty, J.: Information retrieval as statistical translation. In Proc. SIGIR, pp. 222–229. ACM (1999)

    Google Scholar 

  2. Sieg, A., Mobasher, B., Burke, R.: Web search personalization with ontological user profiles. In: Proc. CIKM, pp. 525–534. ACM (2007)

    Google Scholar 

  3. Lin, C., Xue, G.-R., Zeng, H.-J., Yu, Y.: Using Probabilistic Latent Semantic Analysis for Personalized Web Search. In: Zhang, Y., Tanaka, K., Yu, J.X., Wang, S., Li, M. (eds.) APWeb 2005. LNCS, vol. 3399, pp. 707–717. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  4. Manning, C.D., Raghavan, P., Schutze, H.: Introduction to Information Retrieval. Book published by Cambridge University Press (2008)

    Google Scholar 

  5. Carpineto, C., Romano, G.: ODP239 dataset (2009), http://credo.fub.it/odp239/

  6. Blei, D.M., Lafferty, J.D.: Topic Models. Technical Report, Princeton University (2009)

    Google Scholar 

  7. Sontag, D., Collins-Thompson, K., Bennett, P.N., White, R.W., Dumais, S., Billerbeck, B.: Probabilistic Models for Personalizing Web Search. In: Proc. WSDM, pp. 433–442 (2012)

    Google Scholar 

  8. Agichtein, E., Brill, E., Dumais, S.: Improving Web Search Ranking by ncorporating user behavior information. In: Proc. SIGIR, pp. 19–26. ACM (2006)

    Google Scholar 

  9. Qiu, F., Cho, J.: Automatic identification of user interest for personalized search. In: Proc. 15th Intl. Conf. on World Wide Web, pp. 727–736. ACM (2006)

    Google Scholar 

  10. Radlinski, F., Dumais, S.: Improving Personalized Web Search using result diversification. In: Proc. SIGIR, pp. 691–692. ACM (2006)

    Google Scholar 

  11. Bordogna, G., Campi, A., Psaila, G., Ronchi, S.: Disambiguated Query Suggestions and Personalized Content-Similarity and Novelty Ranking of Clustered Results to Optimize Web Searches. Elsevier - Information Processing and Management (48), 1067–1077 (2012)

    Google Scholar 

  12. Pass, G., Chowdhury, A., Torgeson, C.: A Picture of Search. In: Proc. 1st Intl. Conf. on Scalable Information Systems (2006)

    Google Scholar 

  13. Xue, G.-R., Zeng, H.-J., Chen, Z., Yu, Y., Ma, W.-Y., Xi, W., Fan, W.: Optimizing web search using web click-through data. In: Proc. CIKM, pp. 118–126. ACM (2004)

    Google Scholar 

  14. Rocchio, J.J.: Relevance feedback in information retrieval. In: Proc. The smart retrieval system - Experiments in Automatic Document Processing, pp. 313–323 (1971)

    Google Scholar 

  15. Teevan, J., Morris, M.R., Bush, S.: Discovering and using groups to improve personalized search. In: Proc. WSDM, pp. 15–24. ACM (2009)

    Google Scholar 

  16. Teevan, J., Dumais, S.T., Liebling, D.J.: To Personalize or Not to Personalize: Modeling Queries with Variation in User Intent. In: Proc. SIGIR, pp. 163–170. ACM (2008)

    Google Scholar 

  17. Teevan, J., Dumais, S.T., Horvitz, E.: Beyond the Commons: Investigating the Value of Personalizing Web Search. In: Proc. Workshop New Technologies for Personalized Information Access (PIA), pp. 84–92 (2005)

    Google Scholar 

  18. Kalervo, J., Jaana, K.: Cumulated Gain-Based Evaluation of IR Techniques. ACM Transactions on Information Systems (2002)

    Google Scholar 

  19. Ponte, J.M., Croft, W.B.: A language modeling approach to information retrieval. In: Proc. SIGIR, pp. 275–281. ACM (1998)

    Google Scholar 

  20. Hu, J., Chan, P.K.: Personalized Web Search by Using Learned User profiles in re-ranking. In: Proc. SIGKDD, pp. 1–14. ACM (2008)

    Google Scholar 

  21. Gao, J., He, X., Nie, J.-Y.: Clickthrough-Based Translation Models for Web Search: from Word Models to Phrase Models. In: Proc. CIKM, pp. 1139–1148. ACM (2010)

    Google Scholar 

  22. Sugiyama, K., Hatano, K., Yoshikawa, M.: Adaptive Web Search Based on User Profile Constructed without any Effort from Users. In: Proc. 13th Intl. Conf. on World Wide Web, pp. 675–684. ACM (2004)

    Google Scholar 

  23. Daoud, M., Tamine-Lechani, L., Boughanem, M.: Learning user interests for a session-based personalized Search. In: Proc. 2nd Intl. Symposium on Information Interaction in Context, pp. 57–64. ACM (2008)

    Google Scholar 

  24. Matthijs, N., Radlinski, F.: Personalizing Web Search using Long Term Browsing History. In: Proc. WSDM, pp. 25–34. ACM (2011)

    Google Scholar 

  25. Agrawal, R., Gollapudi, S.: Diversifying Search Results. In: Proc. WSDM, pp. 5–14. ACM (2009)

    Google Scholar 

  26. Krestel, R., Fankhauser, P.: Reranking web search results for diversity. Springer Information Retrieval (15), 458–477 (2012)

    Google Scholar 

  27. Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval. Addison Wesley (1999)

    Google Scholar 

  28. White, R.W., Chu, W., Hassan, A., He, X., Song, Y., Wang, H.: Enhancing personalized search by mining and modeling task behavior. In: Proc. WWW, pp. 1411–1420. ACM (2013)

    Google Scholar 

  29. Anand, S.S., Mobasher, B.: Intelligent techniques for web personalization. In: Mobasher, B., Anand, S.S. (eds.) ITWP 2003. LNCS (LNAI), vol. 3169, pp. 1–36. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  30. Stamou, S., Ntoulas, A.: Search Personalization through Query and Page Topical Analysis. Proc. User Model User-adapt Interact 19(1-2), 5–33 (2009)

    Article  Google Scholar 

  31. Robertson, S.E., Walker, S., Hancock-Beaulieu, M., Gull, A., Lau, M.: Okapi at TREC. In: Proc. Text Retrieval Conference, pp. 21–30 (1992)

    Google Scholar 

  32. Hofmann, T.: Probabilistic Latent Semantic Indexing. In: Proc. SIGIR, pp. 50–57. ACM (1999)

    Google Scholar 

  33. V.: K, M. Simon. Collaborative Filtering for Sharing the Concept Based User Profiles. In: Proc. of 3rd IEEE International Conference on Electronics Computer Technology (ICECT), vol. 4, pp. 187–191 (2011)

    Google Scholar 

  34. Veningston, K., Shanmugalakshmi, R.: Enhancing personalized web search re-ranking algorithm by incorporating user profile. In: Proc. of 3rd IEEE International Conference on Computing Communication and Networking Technologies (ICCCNT), pp. 1–6 (2012)

    Google Scholar 

  35. Wei, X., Bruce, C.W.: LDA-based document models for ad-hoc retrieval. In: Proc. SIGIR, pp. 178–185. ACM (2006)

    Google Scholar 

  36. Ng, W., Deng, L., Lee, D.L.: Mining User Preference Using Spy Voting for Search Engine Personalization. ACM Trans. Internet Technology 7(4), article 19 (2007)

    Google Scholar 

  37. Shen, X., Tan, B., Zhai, C.: Implicit User Modeling for Personalized Search. In: Proc. CIKM, pp. 824–831. ACM (2005)

    Google Scholar 

  38. Dou, Z., Song, R., Wen, J.-R., Yuan, X.: Evaluating the Effectiveness of Personalized Web Search. IEEE Trans. Knowledge and Data Engineering 21(8), 1178–1190 (2009)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Veningston, K., Shanmugalakshmi, R. (2014). Combining User Interested Topic and Document Topic for Personalized Information Retrieval. In: Srinivasa, S., Mehta, S. (eds) Big Data Analytics. BDA 2014. Lecture Notes in Computer Science, vol 8883. Springer, Cham. https://doi.org/10.1007/978-3-319-13820-6_6

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-13820-6_6

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-13819-0

  • Online ISBN: 978-3-319-13820-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics