Cross-Language Information Retrieval Using Meta-language Index Construction and Structural Queries

  • Amir Hossein Jadidinejad
  • Fariborz Mahmoudi
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6241)

Abstract

Structural query languages (SQLs) allow expert users to represent their information needs richly, but their complexity makes them impractical for Web search engines. An ideal solution is to detect the concepts in a user’s unstructured information need automatically and to generate a richly structured, multilingual equivalent query. We use Wikipedia as a concept repository, together with state-of-the-art algorithms for extracting Wikipedia concepts from the user’s information need. We call this process “Query Wikification”. Our experiments on the TEL corpus at CLEF 2009 achieve improvements of 23% in Mean Average Precision and 17% in recall, respectively, over the baseline. Our approach is unique in that it improves both precision and recall, two measures for which improving one often hurts the other.
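As a rough illustration of the idea, the sketch below turns an unstructured query into a structured, multilingual one. It is not the authors' implementation: the concept table `CONCEPTS`, the greedy longest-match segmentation, and all function names are assumptions made for this example, and a real system would draw concepts and their translations from Wikipedia. The output uses Indri-style query operators (`#combine`, `#syn`, `#1`) as one possible structural query syntax.

```python
from typing import Dict, List

# Hypothetical toy concept table: surface phrase -> titles per language.
# In a real system these would come from Wikipedia, not a hard-coded dict.
CONCEPTS: Dict[str, Dict[str, str]] = {
    "search engine": {
        "en": "search engine",
        "de": "Suchmaschine",
        "fr": "moteur de recherche",
    },
    "digital library": {
        "en": "digital library",
        "de": "digitale Bibliothek",
        "fr": "bibliothèque numérique",
    },
}


def segment_query(query: str, max_len: int = 3) -> List[str]:
    """Split a query into units, greedily preferring the longest known concept."""
    tokens = query.lower().split()
    units: List[str] = []
    i = 0
    while i < len(tokens):
        for n in range(min(max_len, len(tokens) - i), 0, -1):
            phrase = " ".join(tokens[i:i + n])
            if phrase in CONCEPTS:
                units.append(phrase)
                i += n
                break
        else:
            # No concept starts here: keep the single token as a plain term.
            units.append(tokens[i])
            i += 1
    return units


def exact_phrase(text: str) -> str:
    """Wrap a multi-word title in an ordered-window operator; leave single words bare."""
    words = text.lower().split()
    return words[0] if len(words) == 1 else "#1(" + " ".join(words) + ")"


def build_structured_query(query: str) -> str:
    """Group each concept's multilingual titles as synonyms and combine all units."""
    parts: List[str] = []
    for unit in segment_query(query):
        if unit in CONCEPTS:
            variants = [exact_phrase(title) for title in CONCEPTS[unit].values()]
            parts.append("#syn(" + " ".join(variants) + ")")
        else:
            parts.append(unit)
    return "#combine(" + " ".join(parts) + ")"


if __name__ == "__main__":
    print(build_structured_query("modern search engine"))
```

Grouping the translated titles under a synonym operator lets a single query match documents in any of the listed languages, while the ordered-window operator keeps multi-word concepts from being scored as unrelated terms.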

Keywords

  • Retrieval Model
  • Query Term
  • Structural Query Language
  • Structure Query
  • Persian Language
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Amir Hossein Jadidinejad
    • 1
  • Fariborz Mahmoudi
    • 1
  1. Electrical and Computer Engineering Department, Islamic Azad University, Qazvin, Iran
