Skip to main content
Log in

Bat-Inspired Algorithm Based Query Expansion for Medical Web Information Retrieval

  • Systems-Level Quality Improvement
  • Published:
Journal of Medical Systems Aims and scope Submit manuscript

Abstract

With the increasing amount of medical data available on the Web, looking for health information has become one of the most widely searched topics on the Internet. Patients and people of several backgrounds are now using Web search engines to acquire medical information, including information about a specific disease, medical treatment or professional advice. Nonetheless, due to a lack of medical knowledge, many laypeople have difficulties in forming appropriate queries to articulate their inquiries, which deem their search queries to be imprecise due the use of unclear keywords. The use of these ambiguous and vague queries to describe the patients’ needs has resulted in a failure of Web search engines to retrieve accurate and relevant information. One of the most natural and promising method to overcome this drawback is Query Expansion. In this paper, an original approach based on Bat Algorithm is proposed to improve the retrieval effectiveness of query expansion in medical field. In contrast to the existing literature, the proposed approach uses Bat Algorithm to find the best expanded query among a set of expanded query candidates, while maintaining low computational complexity. Moreover, this new approach allows the determination of the length of the expanded query empirically. Numerical results on MEDLINE, the on-line medical information database, show that the proposed approach is more effective and efficient compared to the baseline.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Notes

  1. https://www-01.ibm.com/software/data/bigdata/what-is-big-data.html

  2. http://www.pewinternet.org/2009/06/11/61-of-american-adults-look-online-for-health-information/

  3. rand is a function that generates a random real number in the [0,1] interval

References

  1. Alihodzic, A., and Tuba, M. Improved Bat Algorithm Applied to Multilevel Image Thresholding. The Scientific World Journal (2014)

  2. Attardi, G., Atzori, L., Simi, M.: Index expansion for machine reading and question answering. In: CLEF 2012 Evaluation Labs and Workshop, Online Working Notes (2012)

  3. Bernardini, A., Carpineto, C., D’Amico, M.: Full-subtopic retrieval with keyphrase-based search results clustering. In: Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology, pp. 206–213. IEEE (2009)

  4. Bindal, A.K., and Sanyal, S.: Query optimization in context of pseudo relevant documents. In: 3rd Italian Information Retrieval Workshop (2012)

  5. de Boer, M., Schutte, K., Kraaij, W., Knowledge based query expansion in complex multimedia event detection. Multimedia Tools and Applications,1–19, 2015.

  6. Cao, G., Nie, J.Y., Gao, J., Robertson, S.: Selecting good expansion terms for pseudo-relevance feedback. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 243–250. ACM (2008)

  7. Carpineto, C., and Romano, G. Concept Data Analysis: Theory and Applications. Wiley (2004)

  8. Carpineto, C., and Romano, G., A survey of automatic query expansion in information retrieval. ACM Comput. Surveys 44(1):1–50, 2012.

    Article  Google Scholar 

  9. Chandrasekar, C., An optimized approach of modified bat algorithm to record deduplication. Int. J. Comput. Appl. 62(1), 2013.

  10. Crestani, F., Application of spreading activation techniques in information retrieval. Artif. Intell. Rev. 11(6): 453–482, 1997.

    Article  Google Scholar 

  11. Curé, O.C., Maurer, H., Shah, N.H., Le Pendu, P., A formal concept analysis and semantic query expansion cooperation to refine health outcomes of interest. BMC Med. Inf. Decis. Making 15(Suppl 1):S8, 2015.

    Article  Google Scholar 

  12. Dao, T.K., Pan, T.S., Pan, J.S., Parallel bat algorithm for optimizing makespan in job shop scheduling problems. J. Intell. Manuf.,1–12, 2015.

  13. Díaz-Galiano, M.C., Martín-Valdivia, M.T., Ureña-López, L., Query expansion with a medical ontology to improve a multimodal information retrieval system. Comput. Biol. Med. 39(4):396–403, 2009.

    Article  PubMed  Google Scholar 

  14. Durao, F., Bayyapu, K., Xu, G., Dolog, P., Lage, R., Expanding user’s query with tag-neighbors for effective medical information retrieval. Multimed. Tools Appl. 71(2):905–929 , 2014.

    Article  Google Scholar 

  15. Gao, K., Zhang, Y., Zhang, D., Lin, S., Accurate off-line query expansion for large-scale mobile visual search. Signal Process. 93(8):2305–2315, 2013.

    Article  Google Scholar 

  16. Hersh, W., Buckley, C., Leone, T., Hickam, D. Ohsumed: An interactive retrieval evaluation and new large test collection for research. In: Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 192–201. Springer (1994)

  17. Jaddi, N.S., Abdullah, S., Hamdan, A.R., Optimization of neural network model using modified bat-inspired algorithm. Appl. Soft Comput. 37:71–86, 2015.

    Article  Google Scholar 

  18. Jain, H., Thao, C., Zhao, H., Enhancing electronic medical record retrieval through semantic query expansion. Inf. Syst. e-Business Manag. 10(2):165–181, 2012.

    Article  Google Scholar 

  19. Jalali, V., and Borujerdi, M.R.M., Information retrieval with concept-based pseudo-relevance feedback in medline. Knowledge Inf. Syst. 29(1):237–248, 2011.

    Article  Google Scholar 

  20. Jouglet, A., and Carlier, J., Dominance rules in combinatorial optimization problems. Eur. J. Oper. Res. 212(3):433–444 , 2011.

    Article  Google Scholar 

  21. Kennedy, J. Particle swarm optimization. In: Encyclopedia of Machine Learning, pp. 760–766. Springer (2011)

  22. Kennedy, J., Kennedy, J.F., Eberhart, R.C., Shi, Y. Swarm Intelligence. Morgan Kaufmann (2001)

  23. Khennak, I., and Drias, H. Bat algorithm for efficient query expansion: Application to medline. In: Proceedings of the 4th World Conference on Information Systems and Technologies, pp. 113–122. Springer (2016)

  24. Komarasamy, G., and Wahi, A., An optimized k-means clustering technique using bat algorithm. Eur. J. Sci. Res. 84(2):26–273, 2012.

    Google Scholar 

  25. Lee, A., and Chau, M.: The impact of query suggestion in e-commerce websites. In: E-Life: Web-Enabled Convergence of Commerce, Work, and Social Life 10th Workshop on E-Business, WEB 2011, pp. 248–254 (2011)

  26. Lee, K.S., and Croft, W.B., A deterministic resampling method using overlapping document clusters for pseudo-relevance feedback. Inf. Process. Manag. 49(4):792–806, 2013.

    Article  Google Scholar 

  27. Leturia, I., Gurrutxaga, A., Areta, N., Alegria, I., Ezeiza, A., Morphological query expansion and language-filtering words for improving basque web retrieval. Lang. Resour. Eval. 47(2):425–448, 2013.

    Article  Google Scholar 

  28. Lu, Z., Kim, W., Wilbur, W.J., Evaluation of query expansion using mesh in pubmed. Inf. Retriev. 12 (1):69–80, 2009.

    Article  Google Scholar 

  29. Melucci, M., A basis for information retrieval in context. ACM Transactions on Information Systems 26(3): 14:1–14:41, 2008.

    Article  Google Scholar 

  30. Miao, J., Huang, J.X., Ye, Z.: Proximity-based rocchio’s model for pseudo relevance. In: Proceedings of the 35th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 535–544. ACM (2012)

  31. Pérez, J., Valdez, F., Castillo, O.: A new bat algorithm with fuzzy logic for dynamical parameter adaptation and its applicability to fuzzy control design. In: Fuzzy Logic Augmentation of Nature-Inspired Optimization Metaheuristics, pp. 65–79. Springer (2015)

  32. Robertson, S., and Zaragoza, H.: The Probabilistic Relevance Framework: BM25 and Beyond. Now Publishers Inc (2009)

  33. Robertson, S.E., and Jones, K.S., Relevance weighting of search terms. J. Amer. Soc. Inf. Sci. 27(3):129–146, 1976.

    Article  Google Scholar 

  34. Robertson, S.E., Walker, S., Jones, S., Hancock-Beaulieu, M.M., Gatford, M., et al., Okapi at trec-3. NIST Spec. Publ. SP 109:109, 1995.

    Google Scholar 

  35. Rocchio, J.J., Relevance feedback in information retrieval. SMART Retriev. Syst. Exper. Autom. Doc. Process., 313–323, 1971.

  36. Sahlgren, M.: An introduction to random indexing. In: Methods and Applications of Semantic Indexing Workshop at the 7th International Conference on Terminology and Knowledge Engineering, TKE (2005)

  37. Véronis, J., Hyperlex: Lexical cartography for information retrieval. Comput. Speech Lang. 18(3):223–252, 2004.

    Article  Google Scholar 

  38. Wong, S.K., Ziarko, W., Raghavan, V.V., Wong, P.C., On modeling of information retrieval concepts in vector spaces. ACM Trans. Data. Syst. 12(2):299–321, 1987.

    Article  Google Scholar 

  39. Wu, I.C., Chen, G.W., Hsu, J.L., Lin, C.Y., An entropy-based query expansion approach for learning researchers’ dynamic information needs. Knowledge-Based Syst. 52:133–146, 2013.

    Article  Google Scholar 

  40. Yang, N.C., and Le, M.D., Optimal design of passive power filters based on multi-objective bat algorithm and pareto front. Appl. Soft Comput. 35:257–266, 2015.

    Article  Google Scholar 

  41. Yang, X.S.: Nature-Inspired Metaheuristic Algorithms. Luniver Press (2008)

  42. Yang, X.S.: Nature-Inspired Metaheuristic Algorithms: Second Edition. Luniver Press (2010)

  43. Yang, X.S.: A new metaheuristic bat-inspired algorithm. In: Nature Inspired Cooperative Strategies for Optimization, pp. 65–74 (2010)

  44. Yang, X.S.: Nature-Inspired Optimization Algorithms. Elsevier (2014)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ilyes Khennak.

Additional information

This article is part of the Topical Collection on Systems-Level Quality Improvement

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Khennak, I., Drias, H. Bat-Inspired Algorithm Based Query Expansion for Medical Web Information Retrieval. J Med Syst 41, 34 (2017). https://doi.org/10.1007/s10916-016-0668-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s10916-016-0668-1

Keywords

Navigation