Advertisement

A Novel Ranking Technique Based on Page Queries

  • Gwangbum PyunEmail author
  • Unil Yun
Part of the Lecture Notes in Electrical Engineering book series (LNEE, volume 274)

Abstract

Keyword-based information retrieval finds webpages with queries composed of keywords to provide users with needed information. However, since the keywords are only a part of the necessary information, it may be hard to search intended results from the keyword-based methods. Furthermore, users should make efforts to select proper keywords many times in general because they cannot know which keyword is effective in obtaining meaningful information they really want. In this paper, we propose a novel algorithm, called PQ_Rank, which can find intended webpages more exactly than the existing keyword-based ones. To rank webpages more effectively, it considers not only keywords but also all of the words included in webpages, named page queries. Experimental results show that PQ_Rank outperforms PageRank, a famous algorithm used by Google, in terms of MAP, average recall, and NDCG.

Keywords

Information retrieval Page query Grouping webpages Ranking technique 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Derhami, V., Khodadadian, E., Ghasemzadeh, M., Bidoki, A.M.: Applying reinforcement learning for web pages ranking algorithms. Applied Soft Computing 13(4), 1686–1692 (2013)CrossRefGoogle Scholar
  2. 2.
    Ermelinda, O., Massimo, R.: Towards a Spatial Instance Learning Method for Deep Web Pages. In: Industrial Conference on Data Mining, pp. 270–285 (December 2011)Google Scholar
  3. 3.
    Geng, B., Yang, L., Xu, C., Hua, X.S.: Ranking Model Adaptation for Domain-Specific Search. IEEE Transactions on Knowledge and Data Engineering 24(4), 745–758 (2012)CrossRefGoogle Scholar
  4. 4.
    Ishii, H., Tempo, R., Bai, E.: A Web Aggregation Approach for Distributed Randomized PageRank Algorithms. IEEE Transactions on Automatic Control 57(11), 2703–2717 (2012)MathSciNetCrossRefGoogle Scholar
  5. 5.
    Metzler, D.: Generalized Inverse Document Frequency. In: Conference on Information and Knowledge Management, pp. 399–408 (October 2008)Google Scholar
  6. 6.
    Pyun, G., Yun, U.: Ranking Techniques for Finding Correlated Webpages. In: International Conference on IT Convergence and Security, pp. 1085–1095 (December 2012)Google Scholar
  7. 7.
    Telang, A., Li, C., Chakravarthy, S.: One Size Does Not Fit All: Toward User- and Query-Dependent Ranking for Web Databases. IEEE Transactions on Knowledge and Data Engineering 24(9), 1671–1685 (2012)CrossRefGoogle Scholar
  8. 8.
    CLucene Project web page, http://clucene.sourceforge.net/

Copyright information

© Springer-Verlag Berlin Heidelberg 2014

Authors and Affiliations

  1. 1.Department of Computer EngineeringSejong UniversitySeoulRepublic of Korea

Personalised recommendations