Abstract
Large language models (LLMs) such as ChatGPT have recently attracted significant attention due to their impressive performance on many real‐world tasks. These models have also demonstrated the potential in facilitating various biomedical tasks. However, little is known of their potential in biomedical information retrieval, especially identifying drug-disease associations. This study aims to explore the potential of ChatGPT, a popular LLM, in discerning drug-disease associations. We collected 2694 true drug-disease associations and 5662 false drug-disease pairs. Our approach involved creating various prompts to instruct ChatGPT in identifying these associations. Under varying prompt designs, ChatGPT’s capability to identify drug-disease associations with an accuracy of 74.6–83.5% and 96.2–97.6% for the true and false pairs, respectively. This study shows that ChatGPT has the potential in identifying drug-disease associations and may serve as a helpful tool in searching pharmacy-related information. However, the accuracy of its insights warrants comprehensive examination before its implementation in medical practice.
References
Liu, J., C. Wang, and S. Liu. Utility of ChatGPT in clinical practice. J. Med. Internet Res.25:e48568, 2023. https://doi.org/10.2196/48568.
Takagi, S., T. Watari, A. Erabi, and K. Sakaguchi. Performance of GPT-3.5 and GPT-4 on the Japanese medical licensing examination: comparison study. JMIR Med. Educ.9:e48002, 2023. https://doi.org/10.2196/48002.
Zhao, W. X., et al. A survey of large language models. 2023. https://doi.org/10.48550/ARXIV.2303.18223
Eggmann, F., R. Weiger, N. U. Zitzmann, and M. B. Blatz. Implications of large language models such as ChatGPT for dental medicine. J. Esthet. Restor. Dent. 2023. https://doi.org/10.1111/jerd.13046.
Thirunavukarasu, A. J., D. S. J. Ting, K. Elangovan, L. Gutierrez, T. F. Tan, and D. S. W. Ting. Large language models in medicine. Nat. Med. 29(8):1930–1940, 2023. https://doi.org/10.1038/s41591-023-02448-8.
OpenAI. Models. 2023. https://platform.openai.com/docs/models/gpt-3-5
Gilson, A., et al. How does ChatGPT perform on the united states medical licensing examination? The implications of large language models for medical education and knowledge assessment. JMIR Med. Educ.9:e45312, 2023. https://doi.org/10.2196/45312.
Walker, H. L., et al. Reliability of medical information provided by ChatGPT: assessment against clinical guidelines and patient information quality instrument. J. Med. Internet Res.25:e47479, 2023. https://doi.org/10.2196/47479.
Gu, Y., et al. Distilling large language models for biomedical knowledge extraction: a case study on adverse drug events. 2023. https://doi.org/10.48550/ARXIV.2307.06439.
Jahan, I., M. T. R. Laskar, C. Peng, and J. Huang. Evaluation of ChatGPT on biomedical tasks: a zero-shot comparison with fine-tuned generative transformers. 2023. https://doi.org/10.48550/ARXIV.2306.04504.
Juhi, A., N. Pipil, S. Santra, S. Mondal, J. K. Behera, and H. Mondal. The capability of ChatGPT in predicting and explaining common drug-drug interactions. Cureus. 2023. https://doi.org/10.7759/cureus.36272.
Chen, S., et al. Use of artificial intelligence Chatbots for cancer treatment information. JAMA Oncol. 2023. https://doi.org/10.1001/jamaoncol.2023.2954.
Eysenbach, G. The role of ChatGPT, generative language models, and artificial intelligence in medical education: a conversation with ChatGPT and a call for papers. JMIR Med. Educ.9:e46885, 2023. https://doi.org/10.2196/46885.
Miao, H., and H. Ahn. Impact of ChatGPT on interdisciplinary nursing education and research. Asian Pac. Isl. Nurs. J.7:e48136, 2023. https://doi.org/10.2196/48136.
Wang, Q., Z. Gao, and R. Xu. Exploring the in-context learning ability of large language model for biomedical concept linking. 2023. https://doi.org/10.48550/ARXIV.2307.01137.
Dave, T., S. A. Athaluri, and S. Singh. ChatGPT in medicine: an overview of its applications, advantages, limitations, future prospects, and ethical considerations. Front. Artif. Intell. 6:1169595, 2023. https://doi.org/10.3389/frai.2023.1169595.
Drees, J. Google receives more than 1 billion health questions every day. Mar. 11, 2019. https://www.beckershospitalreview.com/healthcare-information-technology/google-receives-more-than-1-billion-health-questions-every-day.html. Accessed 23 Aug 2023.
Ayoub, N. F., Y.-J. Lee, D. Grimm, and K. Balakrishnan. Comparison between ChatGPT and google search as sources of postoperative patient instructions. JAMA Otolaryngol. Head Neck Surg. 149(6):556, 2023. https://doi.org/10.1001/jamaoto.2023.0704.
Xu, R., Y. Feng, and H. Chen. ChatGPT vs. Google: a comparative study of search performance and user experience. 2023. https://doi.org/10.48550/ARXIV.2307.01135.
Dudley, J. T., T. Deshpande, and A. J. Butte. Exploiting drug-disease relationships for computational drug repositioning. Brief. Bioinform. 12(4):303–311, 2011. https://doi.org/10.1093/bib/bbr013.
Avram, S., et al. DrugCentral 2021 supports drug discovery and repositioning. Nucleic Acids Res. 49(D1):D1160–D1169, 2021. https://doi.org/10.1093/nar/gkaa997.
Wang, Q., and R. Xu. Drug repositioning for prostate cancer: using a data-driven approach to gain new insights. AMIA Annu. Symp. Proc. 2017:1724–1733, 2017.
Wang, Q., and R. Xu. Disease comorbidity-guided drug repositioning: a case study in schizophrenia. AMIA Annu. Symp. Proc. 2018:1300–1309, 2018.
White, J., et al. A prompt pattern catalog to enhance prompt engineering with ChatGPT. 2023. https://doi.org/10.48550/ARXIV.2302.11382.
Li, L., L. Fan, S. Atreja, and L. Hemphill. ‘HOT’ ChatGPT: the promise of ChatGPT in detecting and discriminating hateful, offensive, and toxic comments on social media. 2023. https://doi.org/10.48550/ARXIV.2304.10619.
Saravia, E. Prompt engineering guide. https://github.com/dair-ai/Prompt-Engineering-Guide. Accessed 5 Aug 2023
Wei, J., et al. Finetuned language models are zero-shot learners. 2021. https://doi.org/10.48550/ARXIV.2109.01652.
Xian, Y., B. Schiele, and Z. Akata. Zero-shot learning—the good, the bad and the ugly. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI: IEEE, 2017, pp. 3077–3086. https://doi.org/10.1109/CVPR.2017.328.
Brown, T. B., et al. Language models are few-shot learners. 2020. https://doi.org/10.48550/ARXIV.2005.14165.
Wang, Y., Q. Yao, J. T. Kwok, and L. M. Ni. Generalizing from a few examples: a survey on few-shot learning. ACM Comput. Surv. 53(3):1–34, 2021. https://doi.org/10.1145/3386252.
Google AI. Google AI PaLM 2. Google AI 2023. https://ai.google/discover/palm2/ Accessed 6 Aug. 2023.
Meta, A.I. Introducing LLaMA: a foundational, 65-billion-parameter large language model. https://ai.facebook.com/blog/large-language-model-llama-meta-ai/. Accessed 24 Feb 2023
Chen, L., M. Zaharia, and J. Zou. How is ChatGPT’s behavior changing over time? 2023. https://doi.org/10.48550/ARXIV.2307.09009.
Natalie. What is ChatGPT?|OpenAI Help Center. 2023. https://help.openai.com/en/articles/6783457-what-is-chatgpt. Accessed 6 Aug 2023.
Zuccon, G., and B. Koopman. Dr ChatGPT, tell me what I want to hear: how prompt knowledge impacts health answer correctness. 2023. https://doi.org/10.48550/ARXIV.2302.13793.
Xu, R., and Q. Wang. Large-scale extraction of accurate drug-disease treatment pairs from biomedical literature for drug repurposing. BMC Bioinform. 14(1):1–11, 2013.
Han, R., T. Peng, C. Yang, B. Wang, L. Liu, X. Wan. Is information extraction solved by ChatGPT? An analysis of performance, evaluation criteria, robustness and errors. 2023. https://doi.org/10.48550/arXiv.2305.14450.
Brinkmann, A., R. Shraga, R. C. Der, C. Bizer. Product information extraction using ChatGPT. 2023. https://doi.org/10.48550/arXiv.2306.14921.
Li, B., G. Fang, Y. Yang, Q. Wang, W. Ye, W. Zhao, S. Zhang. Evaluating ChatGPT's information extraction capabilities: an assessment of performance, explainability, calibration, and faithfulness. 2023. https://doi.org/10.48550/arXiv.2304.11633.
Funding
Funding was provided by National Institute on Alcohol Abuse and Alcoholism (AA029831), National Institute on Aging (AG057557, AG061388, AG062272, AG07664), National Eye Institute (EY029297), and National Institute on Drug Abuse (UG1DA049435, CTN-0114).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no known competing interests or personal relationships that could have appeared to influence the work reported in this paper.
Additional information
Associate Editor Joel Stitzel oversaw the review of this article.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Gao, Z., Li, L., Ma, S. et al. Examining the Potential of ChatGPT on Biomedical Information Retrieval: Fact-Checking Drug-Disease Associations. Ann Biomed Eng (2023). https://doi.org/10.1007/s10439-023-03385-w
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s10439-023-03385-w