Abstract
Information Retrieval (IR) is a process for information processing with the aim of concerned among dealing with documents containing free text in order to acquire them rapidly using keywords process in a user’s search string. Designing effective tools to collect and utilize electronic resources has become a key challenge as the large amount of internet resource grows. The semantic-web era based on domain ontologies has provided some benefits. Life sciences, health care, and biomedicine are gradually becoming data intensives fields of study. We countenance not only increased amount and variety of well difficult, multi-dimensional, and frequently weakly structured and noisy data in bioinformatics and computational biology, although a growing requirement for integrative investigation and modeling. Following a review of the major topics in information retrieval, we give an overview of the major works in literature- study retrieval and mining in bioinformatics. While stating that IR methodologies are valuable in bioinformatics jobs, we outline certain problems that must be overcome in order to demonstrate their efficacy. Information retrieval using biomedical data mining for text, images, and visual content is addressed in Biomedical Data Mining for Information Retrieval. There are many possibilities and challenges in biomedical and health informatics, which lies at the intersection of information science, computer science, and health care. Biomedical and health data is abundant, easily accessible, and can be analyzed in a wide range of ways. By analyzing biomedical and healthcare data, such as patient records, electronic health records (EHRs), and lifestyle data, healthcare informatics will be able to provide high-quality, efficient health care, better treatment, and better quality of life.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Abdou, S., & Savoy, J. (2008). Searching in Medline: Query expansion and manual indexing evaluation. Information Processing & Management, 44(2), 781–789.
Alipanah, N., Parveen, P., Khan, L., & Thuraisingham, B. (2011, July). Ontology-driven query expansion using map/reduce framework to facilitate federated queries. In 2011 IEEE International Conference on Web Services (pp. 712–713). IEEE.
Blynova, N. (2019). Latent semantic indexing (LSI) and its impact on copywriting. Communications and Communicative Technologies, (19), 4–12.
Bordawekar, R., & Shmueli, O. (2017, May). Using word embedding to enable semantic queries in relational databases. In Proceedings of the 1st Workshop on Data Management for End-to-End Machine Learning (pp. 1–4).
Buscher, G., Dengel, A., Biedert, R., & Elst, L. V. (2012). Attentive documents: Eye tracking as implicit feedback for information retrieval and beyond. ACM Transactions on Interactive Intelligent Systems (TiiS), 1(2), 1–30.
Dadheech, P., Goyal, D., Srivastava, S., & Choudhary, C. M. (2018). An efficient approach for big data processing using spatial Boolean queries. Journal of Statistics and Management Systems, 21(4), 583–591.
Dang, V., Bendersky, M., & Croft, W. B. (2013, March). Two-stage learning to rank for information retrieval. In European Conference on Information Retrieval (pp. 423–434). Springer.
Dey, A., Jenamani, M., & Thakkar, J. J. (2017, December). Lexical TF-IDF: An n-gram feature space for cross-domain classification of sentiment reviews. In International Conference on Pattern Recognition and Machine Intelligence (pp. 380–386). Springer.
Dogan, R. I., Chatr-aryamontri, A., Kim, S., Wei, C. H., Peng, Y., Comeau, D. C., & Lu, Z. (2017, August). BioCreative VI precision medicine track: Creating a training corpus for mining protein–protein interactions affected by mutations. In BioNLP 2017 (pp. 171–175).
Drost, H. G., & Paszkowski, J. (2017). Biomartr: Genomic data retrieval with R. Bioinformatics, 33(8), 1216–1217.
Du, L., Li, K., Liu, Q., Wu, Z., & Zhang, S. (2020). Dynamic multi-client searchable symmetric encryption with support for Boolean queries. Information Sciences, 506, 234–257.
Hersh, W. (2020). Information retrieval: A biomedical and health perspective. Health Informatics. https://doi.org/10.1007/978-3-030-47686-1
Jang, H., Jeong, Y., & Yoon, B. (2021). TechWord: Development of a technology lexical database for structuring textual technology information based on natural language processing. Expert Systems with Applications, 164, 114042.
Krallinger, M., Rabal, O., Lourenco, A., Oyarzabal, J., & Valencia, A. (2017). Information retrieval and text mining technologies for chemistry. Chemical Reviews, 117(12), 7673–7761.
Matos, S., Arrais, J. P., Maia-Rodrigues, J., & Oliveira, J. L. (2010). Concept-based query expansion for retrieving gene related publications from MEDLINE. BMC Bioinformatics, 11(1), 1–9.
Nadkarni, P. M. (2002). An introduction to information retrieval: Applications in genomics. The Pharmacogenomics Journal, 2(2), 96–102.
Pérez-Agüera, J. R., Arroyo, J., Greenberg, J., Iglesias, J. P., & Fresno, V. (2010, April). Using BM25F for semantic search. In Proceedings of the 3rd International Semantic Search Workshop (pp. 1–8).
Rimal, Y., Gochhait, S., & Bisht, A. (2021). Data interpretation and visualization of COVID-19 cases using R programming. Informatics in Medicine Unlocked, 26 (6), 100705. Elsevier, ISSN: 0146-4116.
Rivas, A. R., Iglesias, E. L., & Borrajo, L. (2014). Study of query expansion techniques and their application in the biomedical information retrieval. The Scientific World Journal, (1), 1–10.
Tellez, E. S., Moctezuma, D., Miranda-Jiménez, S., & Graff, M. (2018). An automated text categorization framework based on hyperparameter optimization. Knowledge-Based Systems, 149, 110–123.
Wang, Y., Wang, M., & Fujita, H. (2020). Word sense disambiguation: A comprehensive knowledge exploitation framework. Knowledge-Based Systems, 190, 105030.
Xu, X., Zhu, W., Zhang, X., Hu, X., & Song, I. Y. (2006, October). A comparison of local analysis, global analysis and ontology-based query expansion strategies for bio-medical literature search. In 2006 IEEE International Conference on Systems, Man and Cybernetics (Vol. 4, pp. 3441–3446). IEEE.
Young, N. E., Anderson, R. S., Chignell, S. M., Vorster, A. G., Lawrence, R., & Evangelista, P. H. (2017). A survival guide to Landsat preprocessing. Ecology, 98(4), 920–932.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this chapter
Cite this chapter
Sunita, Sharma, S., Rana, V., Kumar, V. (2022). Information Retrieval in Bioinformatics: State of the Art and Challenges. In: Dutta, S., Gochhait, S. (eds) Information Retrieval in Bioinformatics. Palgrave Macmillan, Singapore. https://doi.org/10.1007/978-981-19-6506-7_6
Download citation
DOI: https://doi.org/10.1007/978-981-19-6506-7_6
Published:
Publisher Name: Palgrave Macmillan, Singapore
Print ISBN: 978-981-19-6505-0
Online ISBN: 978-981-19-6506-7
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)