Skip to main content

Optimizing Personalized Retrieval System Based on Web Ranking

  • Conference paper
Computer Science – Theory and Applications (CSR 2006)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 3967))

Included in the following conference series:

  • 980 Accesses

Abstract

This paper drew up a personalized recommender system model combined the text categorization with the pagerank. The document or the page was considered in two sides: the content of the document and the domain it belonged to. The features were extracted in order to form the feature vector, which would be used in computing the difference between the documents or keywords with the user’s interests and the given domain. It set up the structure of four block levels in information management of a website. The link information was downloaded in the domain block level, which is the top level of the structure. In the host block level, the links were divided into two parts, the inter-link and the intra-link. All links were setup with different weights. The stationary eigenvector of the link matrix was calculated. The final order of documents was determined by the vector distance and the eigenvector of the link matrix.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Wang, J.-C., et al.: State of the Art of Information Retrieval on the Web. Journal of Computer Research & Development 38(2), 187–193 (2001)

    Google Scholar 

  2. Kempf, J.: Evolving the Internet Addressing Architecture. In: Proceedings of the 2004 International Symposium on Applications and the Internet (SAINT 2004) (2004)

    Google Scholar 

  3. Rafiei, D., Mendelzon, A.: What is this page known for Computing web page reputations. In: 9th International World Wide Web Conference, Amsterdam, Netherlands (May 2000)

    Google Scholar 

  4. Shaw, N.G., et al.: A comprehensive agent-based architecture for intelligent information retrieval in a distributed heterogeneous environment. Decision Support Systems 32, 401–415 (2002)

    Article  Google Scholar 

  5. Lempel, R., Moran, S.: The stochastic approach for link-structure analysis (SALSA) and the TKC effect. In: 9th International World Wide Web Conference, Amsterdam, Netherlands (May 2000)

    Google Scholar 

  6. Page, L., Brin, S., Motwani, R., Winograd, T.: The PageRank Citation Ranking: Bringing Order to the Web. Stanford Digital Libraries Working Paper (1998)

    Google Scholar 

  7. Haveliwala, T.: Efficient computation of PageRank. Technical report, Computer Science Department, Stanford University (1999)

    Google Scholar 

  8. Xing, W., Ghorbani, A.: Weighted PageRank Algorithm. In: Second Annual Conference on Communication Networks and Services Research (CNSR 2004), pp. 305–314 (2004)

    Google Scholar 

  9. Soucy, P., Mineau, G.W.: Beyond TFIDF Weighting for Text Categorization in the Vector Space Model. In: Proceeding of IJCAI-2005, pp. 1136–1141 (2005)

    Google Scholar 

  10. Guo, G., Wang, H., Bell, D.A., Bi, Y., Greer, K.: An kNN Model-Based Approach and Its Application in Text Categorization. In: Gelbukh, A. (ed.) CICLing 2004. LNCS, vol. 2945, pp. 559–570. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  11. Steinberger, R., Pouliquen, B., Hagman, J.: Cross-Lingual Document Similarity Calculation Using the Multilingual Thesaurus EUROVOC. In: Gelbukh, A. (ed.) CICLing 2002. LNCS, vol. 2276, pp. 415–424. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  12. Jing, L.-P., Huang, H.-K.: Improved feature selection approach tfidf in text mining - Machine. In: Proceedings of the First International Conference of Machine learning and Cybernetics, 2002, pp. 944–946 (2002)

    Google Scholar 

  13. Bianchini, M., Gori, M., Scarselli, F.: Inside Pagerank. ACM Transactions on Internet Technology 5(1), 92–128 (2005)

    Article  Google Scholar 

  14. Jiang, X.-M., Xue, G.-R., Song, W.-G., Zeng, H.-J., Chen, Z., Ma, W.-Y.: Exploiting PageRank at Different Block Level. In: Zhou, X., Su, S., Papazoglou, M.P., Orlowska, M.E., Jeffery, K. (eds.) WISE 2004. LNCS, vol. 3306, pp. 241–252. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  15. Arslan, B., Ricci, F., Mirzadeh, N., Venturini, A.: A dynamic approach to feature weighting. In: Proceedings of Data Mining 2002 Conference (2002)

    Google Scholar 

  16. Joachims, T.: A Probabilistic Analysis of the Rocchio Algorithm with TFIDF for Text Categorization. In: Proceedings of the Fourteenth International Conference on Machine Learning table of contents (ICML), pp. 143–151 (1997)

    Google Scholar 

  17. http://www.searchtools.com/robots/robot-code.html

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Wang, Hm., Guo, Y., Feng, Bq. (2006). Optimizing Personalized Retrieval System Based on Web Ranking. In: Grigoriev, D., Harrison, J., Hirsch, E.A. (eds) Computer Science – Theory and Applications. CSR 2006. Lecture Notes in Computer Science, vol 3967. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11753728_63

Download citation

  • DOI: https://doi.org/10.1007/11753728_63

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-34166-6

  • Online ISBN: 978-3-540-34168-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics