Dear Editor,
I am writing in response to the article by Kollitsch et al. [1] titled "How does artificial intelligence master urological board examinations?". The authors performed a comparative analysis of several large language models (LLMs) to assess their accuracy and reliability in answering urological knowledge-based questions, and revealed a correlation between question complexity and LLM performance, underscoring the importance of further research in specific subdomains of urology.
In addition to the evaluation presented in this paper, we would like to offer further insights into its implications and future directions. First, the study found that ChatGPT-4 and Bing AI consistently outperformed ChatGPT-3.5 in terms of RoCA scores, yet the reliability of responses varied across multiple rounds. This suggests that the reliability of LLM-generated responses requires careful assessment in the context of medical education and knowledge acquisition. Second, the study uncovered a consistent trend across all three LLMs: test accuracy decreased as question complexity increased. This underscores the importance of training LLMs on a wide range of medical literature and resources to improve their performance on complex questions. Third, the findings suggest that the adaptive learning capacity of LLMs may be limited; continuous updates, ongoing training, and active maintenance are therefore needed to ensure their reliability and effectiveness in acquiring medical knowledge. Finally, the study raises significant concerns about the quality and consistency of LLM-generated responses, underscoring the need for further research to comprehensively evaluate both the reliability and the reasoning quality of these responses, especially in medical education and knowledge assessment.
In conclusion, the authors’ research enhances our understanding of the potential of LLMs in medical education and clinical practice, highlighting the need for further research to assess the performance of LLMs in specific subdomains within urology and other medical disciplines.
Availability of data and materials
Not applicable.
Reference
1. Kollitsch L, Eredics K, Marszalek M et al (2024) How does artificial intelligence master urological board examinations? A comparative analysis of different Large Language Models' accuracy and reliability in the 2022 In-Service Assessment of the European Board of Urology. World J Urol 42(1):20
Funding
This study received no funding.
Author information
Contributions
JJW and XY contributed to the writing of this letter.
Ethics declarations
Conflict of interest
The authors declare that they have no competing interests.
Ethical approval
Not applicable.
Consent to participate
Not applicable.
Consent for publication
Not applicable.
Wang, J., Yun, X. Letter to the editor, “How does artificial intelligence master urological board examinations?”. World J Urol 42, 104 (2024). https://doi.org/10.1007/s00345-024-04844-2