Examining the Potential of ChatGPT on Biomedical Information Retrieval: Fact-Checking Drug-Disease Associations

Gao, Zhenxiang; Li, Lingyao; Ma, Siyuan; Wang, Qinyong; Hemphill, Libby; Xu, Rong

doi:10.1007/s10439-023-03385-w

Examining the Potential of ChatGPT on Biomedical Information Retrieval: Fact-Checking Drug-Disease Associations

Letter to the Editor
Published: 19 October 2023

(2023)
Cite this article

Annals of Biomedical Engineering Aims and scope Submit manuscript

Zhenxiang Gao ORCID: orcid.org/0000-0002-2223-5092¹^na1,
Lingyao Li²^na1,
Siyuan Ma³^na1,
Qinyong Wang¹,
Libby Hemphill² &
…
Rong Xu¹

857 Accesses
1 Altmetric
Explore all metrics

Abstract

Large language models (LLMs) such as ChatGPT have recently attracted significant attention due to their impressive performance on many real‐world tasks. These models have also demonstrated the potential in facilitating various biomedical tasks. However, little is known of their potential in biomedical information retrieval, especially identifying drug-disease associations. This study aims to explore the potential of ChatGPT, a popular LLM, in discerning drug-disease associations. We collected 2694 true drug-disease associations and 5662 false drug-disease pairs. Our approach involved creating various prompts to instruct ChatGPT in identifying these associations. Under varying prompt designs, ChatGPT’s capability to identify drug-disease associations with an accuracy of 74.6–83.5% and 96.2–97.6% for the true and false pairs, respectively. This study shows that ChatGPT has the potential in identifying drug-disease associations and may serve as a helpful tool in searching pharmacy-related information. However, the accuracy of its insights warrants comprehensive examination before its implementation in medical practice.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Liu, J., C. Wang, and S. Liu. Utility of ChatGPT in clinical practice. J. Med. Internet Res.25:e48568, 2023. https://doi.org/10.2196/48568.
Article PubMed PubMed Central Google Scholar
Takagi, S., T. Watari, A. Erabi, and K. Sakaguchi. Performance of GPT-3.5 and GPT-4 on the Japanese medical licensing examination: comparison study. JMIR Med. Educ.9:e48002, 2023. https://doi.org/10.2196/48002.
Article PubMed PubMed Central Google Scholar
Zhao, W. X., et al. A survey of large language models. 2023. https://doi.org/10.48550/ARXIV.2303.18223
Eggmann, F., R. Weiger, N. U. Zitzmann, and M. B. Blatz. Implications of large language models such as ChatGPT for dental medicine. J. Esthet. Restor. Dent. 2023. https://doi.org/10.1111/jerd.13046.
Article PubMed Google Scholar
Thirunavukarasu, A. J., D. S. J. Ting, K. Elangovan, L. Gutierrez, T. F. Tan, and D. S. W. Ting. Large language models in medicine. Nat. Med. 29(8):1930–1940, 2023. https://doi.org/10.1038/s41591-023-02448-8.
Article CAS PubMed Google Scholar
OpenAI. Models. 2023. https://platform.openai.com/docs/models/gpt-3-5
Gilson, A., et al. How does ChatGPT perform on the united states medical licensing examination? The implications of large language models for medical education and knowledge assessment. JMIR Med. Educ.9:e45312, 2023. https://doi.org/10.2196/45312.
Article PubMed PubMed Central Google Scholar
Walker, H. L., et al. Reliability of medical information provided by ChatGPT: assessment against clinical guidelines and patient information quality instrument. J. Med. Internet Res.25:e47479, 2023. https://doi.org/10.2196/47479.
Article PubMed PubMed Central Google Scholar
Gu, Y., et al. Distilling large language models for biomedical knowledge extraction: a case study on adverse drug events. 2023. https://doi.org/10.48550/ARXIV.2307.06439.
Jahan, I., M. T. R. Laskar, C. Peng, and J. Huang. Evaluation of ChatGPT on biomedical tasks: a zero-shot comparison with fine-tuned generative transformers. 2023. https://doi.org/10.48550/ARXIV.2306.04504.
Juhi, A., N. Pipil, S. Santra, S. Mondal, J. K. Behera, and H. Mondal. The capability of ChatGPT in predicting and explaining common drug-drug interactions. Cureus. 2023. https://doi.org/10.7759/cureus.36272.
Article PubMed PubMed Central Google Scholar
Chen, S., et al. Use of artificial intelligence Chatbots for cancer treatment information. JAMA Oncol. 2023. https://doi.org/10.1001/jamaoncol.2023.2954.
Article PubMed PubMed Central Google Scholar
Eysenbach, G. The role of ChatGPT, generative language models, and artificial intelligence in medical education: a conversation with ChatGPT and a call for papers. JMIR Med. Educ.9:e46885, 2023. https://doi.org/10.2196/46885.
Article PubMed PubMed Central Google Scholar
Miao, H., and H. Ahn. Impact of ChatGPT on interdisciplinary nursing education and research. Asian Pac. Isl. Nurs. J.7:e48136, 2023. https://doi.org/10.2196/48136.
Article PubMed PubMed Central Google Scholar
Wang, Q., Z. Gao, and R. Xu. Exploring the in-context learning ability of large language model for biomedical concept linking. 2023. https://doi.org/10.48550/ARXIV.2307.01137.
Dave, T., S. A. Athaluri, and S. Singh. ChatGPT in medicine: an overview of its applications, advantages, limitations, future prospects, and ethical considerations. Front. Artif. Intell. 6:1169595, 2023. https://doi.org/10.3389/frai.2023.1169595.
Article PubMed PubMed Central Google Scholar
Drees, J. Google receives more than 1 billion health questions every day. Mar. 11, 2019. https://www.beckershospitalreview.com/healthcare-information-technology/google-receives-more-than-1-billion-health-questions-every-day.html. Accessed 23 Aug 2023.
Ayoub, N. F., Y.-J. Lee, D. Grimm, and K. Balakrishnan. Comparison between ChatGPT and google search as sources of postoperative patient instructions. JAMA Otolaryngol. Head Neck Surg. 149(6):556, 2023. https://doi.org/10.1001/jamaoto.2023.0704.
Article PubMed Google Scholar
Xu, R., Y. Feng, and H. Chen. ChatGPT vs. Google: a comparative study of search performance and user experience. 2023. https://doi.org/10.48550/ARXIV.2307.01135.
Dudley, J. T., T. Deshpande, and A. J. Butte. Exploiting drug-disease relationships for computational drug repositioning. Brief. Bioinform. 12(4):303–311, 2011. https://doi.org/10.1093/bib/bbr013.
Article CAS PubMed PubMed Central Google Scholar
Avram, S., et al. DrugCentral 2021 supports drug discovery and repositioning. Nucleic Acids Res. 49(D1):D1160–D1169, 2021. https://doi.org/10.1093/nar/gkaa997.
Article CAS PubMed Google Scholar
Wang, Q., and R. Xu. Drug repositioning for prostate cancer: using a data-driven approach to gain new insights. AMIA Annu. Symp. Proc. 2017:1724–1733, 2017.
PubMed Google Scholar
Wang, Q., and R. Xu. Disease comorbidity-guided drug repositioning: a case study in schizophrenia. AMIA Annu. Symp. Proc. 2018:1300–1309, 2018.
PubMed PubMed Central Google Scholar
White, J., et al. A prompt pattern catalog to enhance prompt engineering with ChatGPT. 2023. https://doi.org/10.48550/ARXIV.2302.11382.
Li, L., L. Fan, S. Atreja, and L. Hemphill. ‘HOT’ ChatGPT: the promise of ChatGPT in detecting and discriminating hateful, offensive, and toxic comments on social media. 2023. https://doi.org/10.48550/ARXIV.2304.10619.
Saravia, E. Prompt engineering guide. https://github.com/dair-ai/Prompt-Engineering-Guide. Accessed 5 Aug 2023
Wei, J., et al. Finetuned language models are zero-shot learners. 2021. https://doi.org/10.48550/ARXIV.2109.01652.
Xian, Y., B. Schiele, and Z. Akata. Zero-shot learning—the good, the bad and the ugly. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI: IEEE, 2017, pp. 3077–3086. https://doi.org/10.1109/CVPR.2017.328.
Brown, T. B., et al. Language models are few-shot learners. 2020. https://doi.org/10.48550/ARXIV.2005.14165.
Wang, Y., Q. Yao, J. T. Kwok, and L. M. Ni. Generalizing from a few examples: a survey on few-shot learning. ACM Comput. Surv. 53(3):1–34, 2021. https://doi.org/10.1145/3386252.
Article Google Scholar
Google AI. Google AI PaLM 2. Google AI 2023. https://ai.google/discover/palm2/ Accessed 6 Aug. 2023.
Meta, A.I. Introducing LLaMA: a foundational, 65-billion-parameter large language model. https://ai.facebook.com/blog/large-language-model-llama-meta-ai/. Accessed 24 Feb 2023
Chen, L., M. Zaharia, and J. Zou. How is ChatGPT’s behavior changing over time? 2023. https://doi.org/10.48550/ARXIV.2307.09009.
Natalie. What is ChatGPT?|OpenAI Help Center. 2023. https://help.openai.com/en/articles/6783457-what-is-chatgpt. Accessed 6 Aug 2023.
Zuccon, G., and B. Koopman. Dr ChatGPT, tell me what I want to hear: how prompt knowledge impacts health answer correctness. 2023. https://doi.org/10.48550/ARXIV.2302.13793.
Xu, R., and Q. Wang. Large-scale extraction of accurate drug-disease treatment pairs from biomedical literature for drug repurposing. BMC Bioinform. 14(1):1–11, 2013.
Article Google Scholar
Han, R., T. Peng, C. Yang, B. Wang, L. Liu, X. Wan. Is information extraction solved by ChatGPT? An analysis of performance, evaluation criteria, robustness and errors. 2023. https://doi.org/10.48550/arXiv.2305.14450.
Brinkmann, A., R. Shraga, R. C. Der, C. Bizer. Product information extraction using ChatGPT. 2023. https://doi.org/10.48550/arXiv.2306.14921.
Li, B., G. Fang, Y. Yang, Q. Wang, W. Ye, W. Zhao, S. Zhang. Evaluating ChatGPT's information extraction capabilities: an assessment of performance, explainability, calibration, and faithfulness. 2023. https://doi.org/10.48550/arXiv.2304.11633.

Download references

Funding

Funding was provided by National Institute on Alcohol Abuse and Alcoholism (AA029831), National Institute on Aging (AG057557, AG061388, AG062272, AG07664), National Eye Institute (EY029297), and National Institute on Drug Abuse (UG1DA049435, CTN-0114).

Author information

Zhenxiang Gao, Lingyao Li, and Siyuan Ma have contributed equally to this paper.

Authors and Affiliations

Center for Artificial Intelligence in Drug Discovery, School of Medicine, Case Western Reserve University, Cleveland, OH, USA
Zhenxiang Gao, Qinyong Wang & Rong Xu
School of Information, University of Michigan, Ann Arbor, MI, USA
Lingyao Li & Libby Hemphill
Vanderbilt University Medical Center, Nashville, TN, USA
Siyuan Ma

Authors

Zhenxiang Gao
View author publications
You can also search for this author in PubMed Google Scholar
Lingyao Li
View author publications
You can also search for this author in PubMed Google Scholar
Siyuan Ma
View author publications
You can also search for this author in PubMed Google Scholar
Qinyong Wang
View author publications
You can also search for this author in PubMed Google Scholar
Libby Hemphill
View author publications
You can also search for this author in PubMed Google Scholar
Rong Xu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Rong Xu.

Ethics declarations

Conflict of interest

The authors declare that they have no known competing interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Associate Editor Joel Stitzel oversaw the review of this article.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (PDF 269 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Gao, Z., Li, L., Ma, S. et al. Examining the Potential of ChatGPT on Biomedical Information Retrieval: Fact-Checking Drug-Disease Associations. Ann Biomed Eng (2023). https://doi.org/10.1007/s10439-023-03385-w

Download citation

Received: 13 September 2023
Accepted: 09 October 2023
Published: 19 October 2023
DOI: https://doi.org/10.1007/s10439-023-03385-w

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Examining the Potential of ChatGPT on Biomedical Information Retrieval: Fact-Checking Drug-Disease Associations

Abstract

Access this article

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Supplementary Information

Supplementary file1 (PDF 269 kb)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation