An Effective Term-Ranking Function for Query Expansion Based on Information Foraging Assessment

  • Ilyes Khennak
  • Habiba Drias
  • Hadia Mosteghanemi
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8891)

Abstract

With the exponential growth of information on the Internet and the significant increase in the number of pages published each day have led to the emergence of new words in the Internet. Owning to the difficulty of achieving the meaning of these new terms, it becomes important to give more weight to subjects and sites where these new words appear, or rather, to give value to the words that occur frequently with them. For this reason, in this work, we propose an effective term-ranking function for query expansion based on the co-occurrence and proximity of words for retrieval effectiveness enhancement. A novel efficiency/effectiveness measure based on the principle of optimal information forager is also proposed in order to evaluate the quality of the obtained results. Our experiments were conducted using the OHSUMED test collection and show significant performance improvement over the state-of-the-art.

Keywords

Information retrieval information foraging theory query expansion term proximity term co-occurrence 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Cambazoglu, B.B., Aykanat, C.: Performance of query processing implementations in ranking-based text retrieval systems using inverted indices. Information Processing & Management 42(4), 875–898 (2006)CrossRefGoogle Scholar
  2. 2.
    Carpineto, C., Romano, G.: A survey of automatic query expansion in information retrieval. ACM Computing Surveys 44(1), 1–50 (2012)CrossRefGoogle Scholar
  3. 3.
    Chen, Q., Li, M., Zhou, M.: Improving query spelling correction using web search results. In: EMNLP-CoNLL 2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp. 181–189. ACL, Stroudsburg (2007)Google Scholar
  4. 4.
    Dix, A., Howes, A., Payne, S.: Post-web cognition: evolving knowledge strategies for global information environments. International journal of Web engineering and technology 1(1), 112–126 (2003)CrossRefGoogle Scholar
  5. 5.
    Dominich, S.: The modern algebra of information retrieval. Springer, Heidelberg (2008)MATHGoogle Scholar
  6. 6.
    Eisenstein, J., O’Connor, B., Smith, N.A., Xing, E.P.: Mapping the geographical diffusion of new words. In: NIPS 2012: Workshop on Social Network and Social Media Analysis: Methods, Models and Applications (2012)Google Scholar
  7. 7.
    Frøkjær, E., Hertzum, M., Hornbæk, K.: Measuring usability: Are effectiveness, efficiency, and satisfaction really correlated? In: CHI 2000: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp. 345–352. ACM, New York (2000)Google Scholar
  8. 8.
    Pirolli, P., Card, S.: Information foraging. Psychological Review 106(4), 643–675 (1999)CrossRefGoogle Scholar
  9. 9.
    Ramos, C., Augusto, J.C., Shapiro, D.: Ambient intelligence the next step for artificial intelligence. IEEE Intelligent Systems 23(2), 15–18 (2008)CrossRefGoogle Scholar
  10. 10.
    Robertson, S.E., Jones, K.S.: Relevance weighting of search terms. Journal of the American Society for Information science 27(3), 129–146 (1976)CrossRefGoogle Scholar
  11. 11.
    Robertson, S., Zaragoza, H.: The probabilistic relevance framework: BM25 and beyond. Foundations and Trends in Information Retrieval 3(4), 333–389 (2009)CrossRefGoogle Scholar

Copyright information

© Springer International Publishing Switzerland 2014

Authors and Affiliations

  • Ilyes Khennak
    • 1
  • Habiba Drias
    • 1
  • Hadia Mosteghanemi
    • 1
  1. 1.Laboratory for Research in Artificial IntelligenceUSTHBAlgiersAlgeria

Personalised recommendations