Skip to main content

Information Retrieval in Bioinformatics: State of the Art and Challenges

  • Chapter
  • First Online:
Information Retrieval in Bioinformatics
  • 280 Accesses

Abstract

Information Retrieval (IR) is a process for information processing with the aim of concerned among dealing with documents containing free text in order to acquire them rapidly using keywords process in a user’s search string. Designing effective tools to collect and utilize electronic resources has become a key challenge as the large amount of internet resource grows. The semantic-web era based on domain ontologies has provided some benefits. Life sciences, health care, and biomedicine are gradually becoming data intensives fields of study. We countenance not only increased amount and variety of well difficult, multi-dimensional, and frequently weakly structured and noisy data in bioinformatics and computational biology, although a growing requirement for integrative investigation and modeling. Following a review of the major topics in information retrieval, we give an overview of the major works in literature- study retrieval and mining in bioinformatics. While stating that IR methodologies are valuable in bioinformatics jobs, we outline certain problems that must be overcome in order to demonstrate their efficacy. Information retrieval using biomedical data mining for text, images, and visual content is addressed in Biomedical Data Mining for Information Retrieval. There are many possibilities and challenges in biomedical and health informatics, which lies at the intersection of information science, computer science, and health care. Biomedical and health data is abundant, easily accessible, and can be analyzed in a wide range of ways. By analyzing biomedical and healthcare data, such as patient records, electronic health records (EHRs), and lifestyle data, healthcare informatics will be able to provide high-quality, efficient health care, better treatment, and better quality of life.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 149.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 199.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  • Abdou, S., & Savoy, J. (2008). Searching in Medline: Query expansion and manual indexing evaluation. Information Processing & Management, 44(2), 781–789.

    Google Scholar 

  • Alipanah, N., Parveen, P., Khan, L., & Thuraisingham, B. (2011, July). Ontology-driven query expansion using map/reduce framework to facilitate federated queries. In 2011 IEEE International Conference on Web Services (pp. 712–713). IEEE.

    Google Scholar 

  • Blynova, N. (2019). Latent semantic indexing (LSI) and its impact on copywriting. Communications and Communicative Technologies, (19), 4–12.

    Google Scholar 

  • Bordawekar, R., & Shmueli, O. (2017, May). Using word embedding to enable semantic queries in relational databases. In Proceedings of the 1st Workshop on Data Management for End-to-End Machine Learning (pp. 1–4).

    Google Scholar 

  • Buscher, G., Dengel, A., Biedert, R., & Elst, L. V. (2012). Attentive documents: Eye tracking as implicit feedback for information retrieval and beyond. ACM Transactions on Interactive Intelligent Systems (TiiS), 1(2), 1–30.

    Google Scholar 

  • Dadheech, P., Goyal, D., Srivastava, S., & Choudhary, C. M. (2018). An efficient approach for big data processing using spatial Boolean queries. Journal of Statistics and Management Systems, 21(4), 583–591.

    Google Scholar 

  • Dang, V., Bendersky, M., & Croft, W. B. (2013, March). Two-stage learning to rank for information retrieval. In European Conference on Information Retrieval (pp. 423–434). Springer.

    Google Scholar 

  • Dey, A., Jenamani, M., & Thakkar, J. J. (2017, December). Lexical TF-IDF: An n-gram feature space for cross-domain classification of sentiment reviews. In International Conference on Pattern Recognition and Machine Intelligence (pp. 380–386). Springer.

    Google Scholar 

  • Dogan, R. I., Chatr-aryamontri, A., Kim, S., Wei, C. H., Peng, Y., Comeau, D. C., & Lu, Z. (2017, August). BioCreative VI precision medicine track: Creating a training corpus for mining protein–protein interactions affected by mutations. In BioNLP 2017 (pp. 171–175).

    Google Scholar 

  • Drost, H. G., & Paszkowski, J. (2017). Biomartr: Genomic data retrieval with R. Bioinformatics, 33(8), 1216–1217.

    Google Scholar 

  • Du, L., Li, K., Liu, Q., Wu, Z., & Zhang, S. (2020). Dynamic multi-client searchable symmetric encryption with support for Boolean queries. Information Sciences, 506, 234–257.

    Google Scholar 

  • Hersh, W. (2020). Information retrieval: A biomedical and health perspective. Health Informatics. https://doi.org/10.1007/978-3-030-47686-1

    Article  Google Scholar 

  • Jang, H., Jeong, Y., & Yoon, B. (2021). TechWord: Development of a technology lexical database for structuring textual technology information based on natural language processing. Expert Systems with Applications, 164, 114042.

    Google Scholar 

  • Krallinger, M., Rabal, O., Lourenco, A., Oyarzabal, J., & Valencia, A. (2017). Information retrieval and text mining technologies for chemistry. Chemical Reviews, 117(12), 7673–7761.

    Google Scholar 

  • Matos, S., Arrais, J. P., Maia-Rodrigues, J., & Oliveira, J. L. (2010). Concept-based query expansion for retrieving gene related publications from MEDLINE. BMC Bioinformatics, 11(1), 1–9.

    Google Scholar 

  • Nadkarni, P. M. (2002). An introduction to information retrieval: Applications in genomics. The Pharmacogenomics Journal, 2(2), 96–102.

    Google Scholar 

  • Pérez-Agüera, J. R., Arroyo, J., Greenberg, J., Iglesias, J. P., & Fresno, V. (2010, April). Using BM25F for semantic search. In Proceedings of the 3rd International Semantic Search Workshop (pp. 1–8).

    Google Scholar 

  • Rimal, Y., Gochhait, S., & Bisht, A. (2021). Data interpretation and visualization of COVID-19 cases using R programming. Informatics in Medicine Unlocked, 26 (6), 100705. Elsevier, ISSN: 0146-4116.

    Google Scholar 

  • Rivas, A. R., Iglesias, E. L., & Borrajo, L. (2014). Study of query expansion techniques and their application in the biomedical information retrieval. The Scientific World Journal, (1), 1–10.

    Google Scholar 

  • Tellez, E. S., Moctezuma, D., Miranda-Jiménez, S., & Graff, M. (2018). An automated text categorization framework based on hyperparameter optimization. Knowledge-Based Systems, 149, 110–123.

    Google Scholar 

  • Wang, Y., Wang, M., & Fujita, H. (2020). Word sense disambiguation: A comprehensive knowledge exploitation framework. Knowledge-Based Systems, 190, 105030.

    Google Scholar 

  • Xu, X., Zhu, W., Zhang, X., Hu, X., & Song, I. Y. (2006, October). A comparison of local analysis, global analysis and ontology-based query expansion strategies for bio-medical literature search. In 2006 IEEE International Conference on Systems, Man and Cybernetics (Vol. 4, pp. 3441–3446). IEEE.

    Google Scholar 

  • Young, N. E., Anderson, R. S., Chignell, S. M., Vorster, A. G., Lawrence, R., & Evangelista, P. H. (2017). A survival guide to Landsat preprocessing. Ecology, 98(4), 920–932.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sunita .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Sunita, Sharma, S., Rana, V., Kumar, V. (2022). Information Retrieval in Bioinformatics: State of the Art and Challenges. In: Dutta, S., Gochhait, S. (eds) Information Retrieval in Bioinformatics. Palgrave Macmillan, Singapore. https://doi.org/10.1007/978-981-19-6506-7_6

Download citation

Publish with us

Policies and ethics