Skip to main content

Query Expansion Using Medical Subject Headings Terms in the Biomedical Documents

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8397))

Abstract

MEDLINE database is most resourceful of biomedical literatures. Lay users may get difficulty to formulate a query. Query expansion technique reformulates user query by adding more significant and related terms to original terms to retrieve more relevant results. Finding related terms are explored form external resources, collection and query context. Since each MEDLINE document is manually assigned with controlled vocabularies which is called MeSH (Medical Subject Headings). These controlled vocabularies may be beneficial for query expansion. This paper proposes pseudo-relevance feedback by using MeSH terms in documents for query expansion. Additionally, re-weighting scheme called RABAM-PRF (Rank-Based MeSH Pseudo-Relevance Feedback) for filtering misleading terms is studied. In experiment, we use Lucene to retrieve the OHSUMED collection as baseline. The proposed method improves retrieval performance in MAP, P@10, and B-pref. Furthermore, the experiment showed that not all MeSH terms should be included to the query.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Fontaine, J.-F., Barbosa-Silva, A., Schaefer, M., Huska, M.R., Muro, E.M., Andrade-Navarro, M.A.: MedlineRanker: Flexible ranking of biomedical literature. Nucleic Acids Res. 37, W141-W146 (2009)

    Google Scholar 

  2. Yoo, S., Choi, J.: On the query reformulation technique for effective MEDLINE document retrieval. J. Biomed. Inform. 43, 686–693 (2010)

    Article  Google Scholar 

  3. Jalali, V., Matash Borujerdi, M.: Information retrieval with concept-based pseudo-relevance feedback in MEDLINE. J. Knowl. Inf. Syst. 29, 237–248 (2011)

    Article  Google Scholar 

  4. Rocchio, J.J.: Relevance feedback in information retrieval. In: Salton, G. (ed.) The SMART Retrieval System: Experiments in Automatic Document Processing, pp. 313–323. Prentice-Hall, Englewood Cliffs (1971)

    Google Scholar 

  5. William Hersh, S.P., Donohoe, L.: Assessing thesaurus-based query expansion using the UMLS Metathesaurus. AMIA, 344–348 (2000)

    Google Scholar 

  6. Baeza-Yates, R., Ribeiro-Neto, B.: Modern information retrieval. ACM press, New York (1999)

    Google Scholar 

  7. Xu, X., Zhu, W., Zhang, X., Hu, X., Song, I.-Y.: A comparison of local analysis, global analysis and ontology-based query expansion strategies for bio-medical literature search. In: IEEE International Conference on Systems, Man and Cybernetics, SMC 2006, pp. 3441–3446. IEEE (2006)

    Google Scholar 

  8. Xu, X., Zhang, X., Hu, X.: Using Two-Stage Concept-Based Singular Value Decomposition Technique as a Query Expansion Strategy. In: 21st International Conference on Advanced Information Networking and Applications Workshops, AINAW 2007, pp. 295–300 (2007)

    Google Scholar 

  9. Abdou, S., Savoy, J.: Searching in Medline: Query expansion and manual indexing evaluation. J. Infoproman. 44, 781–789 (2008)

    Google Scholar 

  10. Deerwester, S., Dumais, S.T., Furnas, G.W., Landauer, T.K., Harshman, R.: Indexing by Latent Semantic Analysis. JASIST 41, 391–407 (1990)

    Article  Google Scholar 

  11. Witten, I.H., Frank, E., Hall, M.A.: Data Mining: Practical Machine Learning Tools and Techniques: Practical Machine Learning Tools and Techniques. Elsevier (2011)

    Google Scholar 

  12. Xu, X., Hu, X.: Cluster-based query expansion using language modeling in the biomedical domain. In: 2010 IEEE International Conference on International Conference on Bioinformatics and Biomedicine Workshops (BIBMW), pp. 185–188. IEEE (2010)

    Google Scholar 

  13. Zhu, W., Xu, X., Hu, X., Song, I.-Y., Allen, R.B.: Using UMLS-based Re-Weighting Terms as a Query Expansion Strategy. In: IEEE International Conference on Granular Computing, pp. 217–222. IEEE (2006)

    Google Scholar 

  14. Benjamin King, L.W., Provalor, I., Zhou, J.: Cengage Learning at TREC 2011 Medical Track. In: The 20th Text REtrieval Conference (TREC). National Institute for Standards and Technology (2011)

    Google Scholar 

  15. Jalali, V., Borujerdi, M.R.M.: The Effect of Using Domain Specific Ontologies in Query Expansion in Medical Field. In: IEEE Innovations in Information Technology (2008)

    Google Scholar 

  16. Jalali, V., Borujerdi, M.R.M.: A Hybrid Information Retrieval System for Medical Field Using MeSH Ontology. In: Prasad, S.K., Routray, S., Khurana, R., Sahni, S. (eds.) ICISTM 2009. CCIS, vol. 31, pp. 31–40. Springer, Heidelberg (2009)

    Chapter  Google Scholar 

  17. Unified Medical Language Systems, http://www.nlm.nih.gov/research/umls

  18. Hersh, W., Buckley, C., Leone, T.J., Hickam, D.: OHSUMED: An Interactive Retrieval Evaluation and New Large Test Collection for Research. In: Croft, B., Rijsbergen, C.J. (eds.) SIGIR 1994, pp. 192–201. Springer, London (1994)

    Google Scholar 

  19. Text REtrieval Conference. The trec eval Evaluation Package (2004), http://trec.nist.gov/trec_eval/

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Thesprasith, O., Jaruskulchai, C. (2014). Query Expansion Using Medical Subject Headings Terms in the Biomedical Documents. In: Nguyen, N.T., Attachoo, B., Trawiński, B., Somboonviwat, K. (eds) Intelligent Information and Database Systems. ACIIDS 2014. Lecture Notes in Computer Science(), vol 8397. Springer, Cham. https://doi.org/10.1007/978-3-319-05476-6_10

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-05476-6_10

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-05475-9

  • Online ISBN: 978-3-319-05476-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics