Validity of ChatGPT-generated musculoskeletal images

Ajmera, P.; Nischal, N.; Ariyaratne, S.; Botchu, B.; Bhamidipaty, K. D. P.; Iyengar, K. P.; Ajmera, S. R.; Jenko, N.; Botchu, R.

doi:10.1007/s00256-024-04638-y

Validity of ChatGPT-generated musculoskeletal images

Scientific Article
Published: 04 March 2024

(2024)
Cite this article

Skeletal Radiology Aims and scope Submit manuscript

P. Ajmera¹,
N. Nischal²,
S. Ariyaratne³,
B. Botchu⁴,
K. D. P. Bhamidipaty⁵,
K. P. Iyengar⁶,
S. R. Ajmera¹,
N. Jenko³ &
…
R. Botchu³

423 Accesses
2 Citations
2 Altmetric
Explore all metrics

A Letter to the Editor to this article was published on 20 March 2024

Abstract

Objective

In the evolving landscape of medical research and radiology, effective communication of intricate ideas is imperative, with visualizations playing a crucial role. This study explores the transformative potential of ChatGPT4, a powerful Large Language Model (LLM), in automating the creation of schematics and figures for radiology research papers, specifically focusing on its implications for musculoskeletal studies.

Materials and methods

Deploying ChatGPT4, the study aimed to assess the model’s ability to generate anatomical images of six large joints—shoulder, elbow, wrist, hip, knee, and ankle. Four variations of a text prompt were utilized, to generate a coronal illustration with annotations for each joint. Evaluation parameters included anatomical correctness, correctness of annotations, aesthetic nature of illustrations, usability of figures in research papers, and cost-effectiveness. Four panellists performed the assessment using a 5-point Likert Scale.

Results

Overall analysis of the 24 illustrations encompassing the six joints of interest (4 of each) revealed significant limitations in ChatGPT4’s performance. The anatomical design ranged from poor to good, all of the illustrations received a below-average rating for annotation, with the majority assessed as poor. All of them ranked below average for usability in research papers. There was good agreement between raters across all domains (ICC = 0.61).

Conclusion

While LLMs like ChatGPT4 present promising prospects for rapid figure generation, their current capabilities fall short of meeting the rigorous standards demanded by musculoskeletal radiology research. Future developments should focus on iterative refinement processes to enhance the realism of LLM-generated musculoskeletal schematics.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Translating musculoskeletal radiology reports into patient-friendly summaries using ChatGPT-4

Article 25 January 2024

Simplifying radiologic reports with natural language processing: a novel approach using ChatGPT in enhancing patient understanding of MRI results

Article 11 November 2023

ChatGPT makes medicine easy to swallow: an exploratory case study on simplified radiology reports

Article Open access 05 October 2023

Data availability

The images were generated using the open access software “Chat GPT-4v”, with the specific prompts we have already provided in the article. Users can generate similar images using the prompts we have provided.

References

Divecha C, Tullu M, Karande S. Utilizing tables, figures, charts and graphs to enhance the readability of a research paper. Medknow; 2023. p. 125–31.
Google Scholar
Singhal K, Azizi S, Tu T, Mahdavi SS, Wei J, Chung HW, et al. Large language models encode clinical knowledge. Nature. 2023;620(7972):172–80.
Article ADS CAS PubMed PubMed Central Google Scholar
Clusmann J, Kolbinger FR, Muti HS, Carrero ZI, Eckardt JN, Laleh NG, et al. The future landscape of large language models in medicine. Commun Med (Lond). 2023;3(1):141.
Article PubMed Google Scholar
Naveed H, Khan AU, Qiu S, Saqib M, Anwar S, Usman M, et al. A comprehensive overview of large language models. arXiv preprint arXiv:230706435.2023.
Ariyaratne S, Iyengar KP, Botchu R. Will collaborative publishing with ChatGPT drive academic writing in the future? Br J Surg. 2023;110(9):1213–4.
Article PubMed Google Scholar
Botchu R, Iyengar KP. Will ChatGPT drive radiology in the future? Indian J Radiol Imaging. 2023;33(4):436–7.
Sullivan GM, Artino AR Jr. Analyzing and interpreting data from Likert-type scales. J Grad Med Educ. 2013;5(4):541–2.
Article PubMed PubMed Central Google Scholar
Corl FM, Garland MR, Fishman EK. Role of computer technology in medical illustration. Am J Roentgenol. 2000;175(6):1519–24.
Article CAS Google Scholar
Stokel-Walker C, Van Noorden R. What ChatGPT and generative AI mean for science. Nature. 2023;614(7947):214–6.
Article ADS CAS PubMed Google Scholar
Sallam M. ChatGPT utility in healthcare education, research, and practice: systematic review on the promising perspectives and valid concerns. Healthcare 2023;11(6):887. MDPI.
Liebrenz M, Schleifer R, Buadze A, Bhugra D, Smith A. Generating scholarly content with ChatGPT: ethical challenges for medical publishing. Lancet Digital Health. 2023;5(3):e105–6.
Article CAS PubMed Google Scholar
Li H, Moon JT, Purkayastha S, Celi LA, Trivedi H, Gichoya JW. Ethics of large language models in medicine and medical research. Lancet Digital Health. 2023;5(6):e333–5.
Article CAS PubMed Google Scholar
https://www.theverge.com/2023/11/6/23948386/chatgpt-active-user-count-openai-developer-conference (accessed: 26th December 2023, at 6:00 PM).

Download references

Author information

Authors and Affiliations

Department of Radiology, Mayo Clinic, Rochester, MN, USA
P. Ajmera & S. R. Ajmera
Department of Radiology, Holy Family Hospital, New Delhi, India
N. Nischal
Department of Musculoskeletal Radiology, Royal Orthopedic Hospital, Bristol Road South, Northfield, Birmingham, UK
S. Ariyaratne, N. Jenko & R. Botchu
Solihull School, Solihull, UK
B. Botchu
Department of Radiology, GIMSR, Vizag, India
K. D. P. Bhamidipaty
Department of Orthopedics, Southport and Ormskirk Hospital, Mersey and West Lancashire NHS Trust, Southport, UK
K. P. Iyengar

Authors

P. Ajmera
View author publications
You can also search for this author in PubMed Google Scholar
N. Nischal
View author publications
You can also search for this author in PubMed Google Scholar
S. Ariyaratne
View author publications
You can also search for this author in PubMed Google Scholar
B. Botchu
View author publications
You can also search for this author in PubMed Google Scholar
K. D. P. Bhamidipaty
View author publications
You can also search for this author in PubMed Google Scholar
K. P. Iyengar
View author publications
You can also search for this author in PubMed Google Scholar
S. R. Ajmera
View author publications
You can also search for this author in PubMed Google Scholar
N. Jenko
View author publications
You can also search for this author in PubMed Google Scholar
R. Botchu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to R. Botchu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Ajmera, P., Nischal, N., Ariyaratne, S. et al. Validity of ChatGPT-generated musculoskeletal images. Skeletal Radiol (2024). https://doi.org/10.1007/s00256-024-04638-y

Download citation

Received: 23 January 2024
Revised: 26 February 2024
Accepted: 27 February 2024
Published: 04 March 2024
DOI: https://doi.org/10.1007/s00256-024-04638-y

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Validity of ChatGPT-generated musculoskeletal images