Skip to main content

Combining Global and Local Semantic Contexts for Improving Biomedical Information Retrieval

  • Conference paper
Advances in Information Retrieval (ECIR 2011)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6611))

Included in the following conference series:

Abstract

In the context of biomedical information retrieval (IR), this paper explores the relationship between the document’s global context and the query’s local context in an attempt to overcome the term mismatch problem between the user query and documents in the collection. Most solutions to this problem have been focused on expanding the query by discovering its context, either global or local. In a global strategy, all documents in the collection are used to examine word occurrences and relationships in the corpus as a whole, and use this information to expand the original query. In a local strategy, the top-ranked documents retrieved for a given query are examined to determine terms for query expansion. We propose to combine the document’s global context and the query’s local context in an attempt to increase the term overlap between the user query and documents in the collection via document expansion (DE) and query expansion (QE). The DE technique is based on a statistical method (IR-based) to extract the most appropriate concepts (global context) from each document. The QE technique is based on a blind feedback approach using the top-ranked documents (local context) obtained in the first retrieval stage. A comparative experiment on the TREC 2004 Genomics collection demonstrates that the combination of the document’s global context and the query’s local context shows a significant improvement over the baseline. The MAP is significantly raised from 0.4097 to 0.4532 with a significant improvement rate of +10.62% over the baseline. The IR performance of the combined method in terms of MAP is also superior to official runs participated in TREC 2004 Genomics and is comparable to the performance of the best run (0.4075).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Stokes, N., Li, Y., Cavedon, L., Zobel, J.: Exploring criteria for successful query expansion in the genomic domain. Information Retrieval 12(1), 17–50 (2009)

    Article  Google Scholar 

  2. Voorhees, E.M.: Query expansion using lexical semantic relations. In: SIGIR 1994 Conference on Research and Development in Information Retrieval, pp. 61–69 (1994)

    Google Scholar 

  3. Le, D.T.H., Chevallet, J.P., Dong, T.B.T.: Thesaurus-based query and document expansion in conceptual indexing with umls. In: RIVF 2007, pp. 242–246 (2007)

    Google Scholar 

  4. Zhou, W., Yu, C.T., et al.: Knowledge-intensive conceptual retrieval and passage extraction of biomedical literature. In: SIGIR, pp. 655–662 (2007)

    Google Scholar 

  5. Lu, Z., Kim, W., Wilbur, W.J.: Evaluation of query expansion using mesh in pubmed. Information Retrieval 12(1), 69–80 (2009)

    Article  Google Scholar 

  6. Gobeill, J., Ruch, P., Zhou, X.: Query and document expansion with medical subject headings terms at medical imageclef 2008. In: Peters, C., Deselaers, T., Ferro, N., Gonzalo, J., Jones, G.J.F., Kurimo, M., Mandl, T., Peñas, A., Petras, V. (eds.) CLEF 2008. LNCS, vol. 5706, pp. 736–743. Springer, Heidelberg (2009)

    Chapter  Google Scholar 

  7. Billerbeck, B., Zobel, J.: Document expansion versus query expansion for ad-hoc retrieval. In: The 10th Australasian Document Comput. Symp., pp. 34–41 (2005)

    Google Scholar 

  8. Tao, T., Wang, X., et al.: Language model information retrieval with document expansion. In: Association for Computational Linguistics, pp. 407–414 (2006)

    Google Scholar 

  9. Sparck Jones, K.: Automatic Keyword Classification for Information Retrieval. Butterworths, London (1971)

    Google Scholar 

  10. Rocchio, J.: Relevance Feedback in Information Retrieval, pp. 313–323 (1971)

    Google Scholar 

  11. Xu, J., Croft, W.B.: Query expansion using local and global document analysis. In: Conference on Research and Development in Information Retrieval, pp. 4–11 (1996)

    Google Scholar 

  12. Amati, G.: Probabilistic models for Information Retrieval based on Divergence from Randomness. PhD thesis, University of Glasgow (2003)

    Google Scholar 

  13. Abdou, S., Savoy, J.: Searching in medline: Query expansion and manual indexing evaluation. Information Processing Management 44(2), 781–789 (2008)

    Article  Google Scholar 

  14. Robertson, S.E., Walker, S.: Okapi/keenbow at trec-8. TREC 8, 151–162 (1999)

    Google Scholar 

  15. Robertson, S.E., Walker, S., Hancock-Beaulieu, M.: Okapi at trec-7: Automatic ad hoc, filtering, vlc and interactive. In: TREC-7 Proceedings, pp. 199–210 (1998)

    Google Scholar 

  16. Hersh, W., Bhuptiraju, R.: Trec 2004 genomics track overview. In: The Thirteenth Text Retrieval Conference, TREC 2004 (2004)

    Google Scholar 

  17. Ounis, I., Lioma, T.: Research directions in terrier. In: Baeza-Yates, R., et al. (eds.) Novatica Special Issue on Web Information Access (2007) (invited paper)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Dinh, D., Tamine, L. (2011). Combining Global and Local Semantic Contexts for Improving Biomedical Information Retrieval. In: Clough, P., et al. Advances in Information Retrieval. ECIR 2011. Lecture Notes in Computer Science, vol 6611. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20161-5_38

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-20161-5_38

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-20160-8

  • Online ISBN: 978-3-642-20161-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics