Advertisement

An Efficient Hybrid Usage-Based Ranking Algorithm for Arabic Search Engines

  • Safaa I. HajeerEmail author
  • Rasha M. Ismail
  • Nagwa L. Badr
  • M. F. Tolba
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9155)

Abstract

There are billions of web pages available on the Internet. Search Engines always have a challenge to find the best ranked list to the user’s query from those huge numbers of pages. A lot of search results that correspond to a user’s query are not relevant to the user’s needs. Most of the page ranking algorithms use Link-based ranking (web structure) or Content-based ranking to calculate the relevancy of the information to the user’s need, but those ranking algorithms might be not enough to provide a good ranked list for the Arabic search. So, in this paper we proposed an efficient Arabic information retrieval system using a new hybrid usage-based ranking algorithm called EHURA. The objective of this algorithm is to overcome the drawbacks of the ranking algorithms and improve the efficiency of web searching. EHURA was applied to 242 Arabic Corpus to measure its performance. The result shows our proposed EHURA algorithm improves the precision over the Content-Based ranking algorithm representation, as well as the recall is affected too in this improvement.

Keywords

Information retrieval (IR) Tokenization Usage-based ranking Content-based ranking Link-based ranking Pagerank Weighted pagerank Implicit judgment and explicit judgment 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Internet World Stats: Usage and Population Statistics (2015). http://www.internetworldstats.com/stats.htm
  2. 2.
    Dilekh, T., Behloul, A.: Implementation of New Hybrid Method for Stemming of Arabic Text. International Journal of Computer Applications 46(8), 14–19 (2012)Google Scholar
  3. 3.
    Dilekh T., Behloul A.: Implementation of a New Hybrid Method for Stemming of Arabic Text. International Journal of Computer Applications 46(8) (2012)Google Scholar
  4. 4.
    Jiang, X.-M., Song, W.-G., Zeng, H.-J.: Applying associative relationship on the clickthrough data to improve web search. In: Losada, D.E., Fernández-Luna, J.M. (eds.) ECIR 2005. LNCS, vol. 3408, pp. 475–486. Springer, Heidelberg (2005)CrossRefGoogle Scholar
  5. 5.
    Ding, C., Chi, C.-H., Luo, T.: An improved usage-based ranking. In: Meng, X., Su, J., Wang, Y. (eds.) WAIM 2002. LNCS, vol. 2419, pp. 346–353. Springer, Heidelberg (2002)CrossRefGoogle Scholar
  6. 6.
    Rodríguez-Mulà, G., Garcia-Molina, H., Paepcke, A.: Collaborative Value Filtering on the Web. Computer Networks 30(8), 736–738 (1998)Google Scholar
  7. 7.
    Kritikopoulos A., Sideri M., Varlamis I.: Success index: measuring the efficiency of search engines using implicit user feedback. In: Proceedings of the 11th Pan-Hellenic Conference on Informatics. Special Session on Web Search and Mining (2007)Google Scholar
  8. 8.
    Liu, Y., Liu, T., Gao, B., Ma, Z., Li, H.: A framework to compute page importance based on user behaviors. Information Retrieval 13, 22–45 (2010)CrossRefGoogle Scholar
  9. 9.
    Jain, R., Purohit Dr., G.N.: Page Ranking Algorithms for Web Mining. International Journal of Computer Applications 13(5), 22–25 (2011)CrossRefGoogle Scholar
  10. 10.
    Rekha, C., Usharani, J., Iyakutti, K.: Improving the Information Retrieval System through Effective Evaluation of Web Page in Client Side Analysis. International Journal of Computer Applications 15(6), 35–39 (2011)CrossRefGoogle Scholar
  11. 11.
    Mukherjee I., Bhattacharya V., Banerjee S., Gupta P., Mahanti, P.: Efficient Web Information Retrieval based on Usage Mining. IEEE (2012)Google Scholar
  12. 12.
    Tuteja, S.: Enhancement in Weighted PageRank Algorithm Using VOL. Journal of Computer Engineering 14(5), 135–141 (2013)Google Scholar
  13. 13.
    Arafat, S., Saad, S.: An Affix removal stemming algorithm for Arabic Language. International Journal of Inelligent Computing and Information Science 8(2), 141–153 (2008)Google Scholar
  14. 14.
    Hajeer S.: Comparison on the Effectiveness of Different Statistical Similarity Measures. International Journal of Computer Applications 53(8) (2012)Google Scholar
  15. 15.
    Hajeer, S.: Vector Space Model: Comparison between Euclidean Distance & Cosine Measure On Arabic Documents. International Journal Engineering Research and Applications 2(4), 2085–2090 (2012)Google Scholar

Copyright information

© Springer International Publishing Switzerland 2015

Authors and Affiliations

  • Safaa I. Hajeer
    • 1
    Email author
  • Rasha M. Ismail
    • 1
  • Nagwa L. Badr
    • 1
  • M. F. Tolba
    • 1
  1. 1.Faculty of Computer & Information SciencesAin Shams UniversityCairoEgypt

Personalised recommendations