Evaluating Topic Difficulties from the Viewpoint of Query Term Expansion

  • Masaharu Yoshioka
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4182)

Abstract

Query term expansion is an important technique for achieving higher retrieval performance. However, since many factors affects the quality of this technique, it is difficult to evaluate this technique in isolation. Feature quantities that characterize the quality of the initial query are defined in this study for evaluating topic difficulties from the viewpoint of query term expansion. I also briefly review the result of the NTCIR-5 query term expansion subtask that uses these quantities for evaluating the effectiveness of the query term expansion techniques. I also describe detailed analysis results on the effect of query term expansion based on topic-by-topic analysis.

Keywords

Relevant Document Relevance Feedback Retrieval Performance Query Term Query Expansion 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Baeza-Yates, R., Ribeiro-Neto, B.: 5 Query Operations. In: Modern Information Retrieval, pp. 19–71. Addison-Wesley, Reading (1999)Google Scholar
  2. 2.
    Cronen-Townsend, S., Zhou, Y., Croft, W.B.: Predicting query performance. In: Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 299–306 (2002)Google Scholar
  3. 3.
    Harman, D., Buckley, C.: SIGIR 2004 workshop: RIA and where can IR go from here? SIGIR Forum 38, 45–49 (2004)CrossRefGoogle Scholar
  4. 4.
    Buckley, C.: Why current IR engines fail. In: SIGIR 2004: Proceedings of the 27th annual international conference on Research and development in information retrieval, pp. 584–585. ACM Press, New York (2004)CrossRefGoogle Scholar
  5. 5.
    Warren, R.H., Liu, T.: A review of relevance feedback experiments at the 2003 Reliable Information Access (RIA) workshop. In: SIGIR 2004: Proceedings of the 27th annual international conference on Research and development in information retrieval, pp. 570–571. ACM Press, New York (2004)CrossRefGoogle Scholar
  6. 6.
    Yoshioka, M.: Overview of the NTCIR-5 web query term expansion subtask. In: Proceedings of the Fifth NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, Question Answering and Cross-Lingual Information Access, pp. 443–454 (2005), http://research.nii.ac.jp/ntcir/workshop/OnlineProceedings5/data/WEB/NTCIR5-OV-WEB-YoshiokaM.pdf
  7. 7.
    Buckley, C., Harman, D.: Reliable information access final workshop report. Technical report, Northeast Regional Research Center, MITRE Data/RIA 2003/ria final.pdf (2004), http://nrrc.mitre.org/NRRC/Docs
  8. 8.
    Voorhees, E.: Query expansion using lexical-semantic relations. In: Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 61–69 (1994)Google Scholar
  9. 9.
    Yoshioka, M., Haraguchi, M.: On a combination of probabilistic and Boolean IR models for WWW document retrieval. ACM Transactions on Asian Language Information Processing (TALIP) 4, 340–356 (2005)CrossRefGoogle Scholar
  10. 10.
    Yoshioka, M., Haraguchi, M.: An appropriate Boolean query reformulation interface for information retrieval based on adaptive generalization. In: International Workshop on Challenges in Web Information Retrieval and Integration, pp. 145–150 (2005)Google Scholar
  11. 11.
    Eguchi, K., Oyama, K., Aizawa, A., Ishikawa, H.: Overview of the informational retrieval task at NTCIR-4 web. In: Proceedings of the Fourth NTCIR Workshop on Research in Information Access Technologies Information Retrieval, Question Answering and Summarization (2004), http://research.nii.ac.jp/ntcir/workshop/OnlineProceedings4/WEB/NTCIR4-OV-WEB-A-EguchiK.pdf
  12. 12.
    Tanioka, H., Yamamoto, K., Nakagawa, T.: A distributed retrieval system for NTCIR-5 web task. In: Proceedings of the Fifth NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, Question Answering and Cross-Lingual Information Access, pp. 472–480 (2005), http://research.nii.ac.jp/ntcir/workshop/OnlineProceedings5/data/WEB/NTCIR5-WEB-TaniokaH.pdf
  13. 13.
    Toda, H., Kataoka, R.: Search result clustering method at NTCIR-5 web query expansion subtask. In: Proceedings of the Fifth NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, Question Answering and Cross-Lingual Information Access, pp. 481–485 (2005), http://research.nii.ac.jp/ntcir/workshop/OnlineProceedings5/data/WEB/NTCIR5-WEB-TodaH.pdf
  14. 14.
    Masada, T., Kanazawa, T., Takasu, A., Adachi, J.: Improving web search by query expansion with a small number of terms. In: Proceedings of the Fifth NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, Question Answering and Cross-Lingual Information Access, pp. 486–493 (2005), http://research.nii.ac.jp/ntcir/workshop/OnlineProceedings5/data/WEB/NTCIR5-WEB-MasadaT.pdf
  15. 15.
    Eguchi, K.: NTCIR-5 query expansion experiments using term dependence models. In: Proceedings of the Fifth NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, Question Answering and Cross-Lingual Information Access, pp. 494–501 (2005), http://research.nii.ac.jp/ntcir/workshop/OnlineProceedings5/data/WEB/NTCIR5-WEB-EguchiK.pdf

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Masaharu Yoshioka
    • 1
    • 2
  1. 1.Graduate School of Information Science and TechnologyHokkaido UniversitySapporoJapan
  2. 2.National Institute of InformaticsTokyoJapan

Personalised recommendations