Document Type
Article
Publication Date
10-15-2025
Abstract
OBJECTIVES: Antimicrobial resistance is a critical public health threat. Large language models (LLMs) show great capability for providing health information. This study evaluates the effectiveness of LLMs in providing information on antibiotic use and infection management.
METHODS: Using a mixed-method approach, responses to healthcare expert-designed scenarios from ChatGPT 3.5, ChatGPT 4.0, Claude 2.0 and Gemini 1.0, in both Italian and English, were analysed. Computational text analysis assessed readability, lexical diversity and sentiment, while content quality was assessed by three experts via DISCERN tool.
RESULTS: 16 scenarios were developed. A total of 101 outputs and 5454 Likert-scale (1-5) scores were obtained for the analysis. A general positive performance gradient was found from ChatGPT 3.5 and 4.0 to Claude to Gemini. Gemini, although producing only five outputs before self-inhibition, consistently outperformed the other models across almost all metrics, producing more detailed, accessible, varied content and a positive overtone. ChatGPT 4.0 demonstrated the highest lexical diversity. A difference in performance by language was observed. All models showed a median score of 1 (IQR=2) regarding the domain addressing antimicrobial resistance.
DISCUSSION: The study highlights a positive performance gradient towards Gemini, which showed superior content quality, accessibility and contextual awareness, although acknowledging its smaller dataset. Generating appropriate content to address antimicrobial resistance proved challenging.
CONCLUSIONS: LLMs offer great promise to provide appropriate medical information. However, they should play a supporting role rather than representing a replacement option for medical professionals, confirming the need for expert oversight and improved artificial intelligence design.
Recommended Citation
Di Pumpo, Marcello; Gualano, Maria Rosaria; Buonsenso, Danilo; Raffaelli, Francesca; Donà, Daniele; Maio, Vittorio; Laurenti, Patrizia; Ricciardi, Walter; and Villani, Leonardo, "Large Language Models as Information Providers for Appropriate Antimicrobial Use: Computational Text Analysis and Expert-Rated Comparison of ChatGPT, Claude and Gemini" (2025). College of Population Health Faculty Papers. Paper 229.
https://jdc.jefferson.edu/healthpolicyfaculty/229
Creative Commons License

This work is licensed under a Creative Commons Attribution-Noncommercial 4.0 License
online supplemental file 2.docx (13 kB)
online supplemental file 3.docx (14 kB)
Language
English
Included in
Artificial Intelligence and Robotics Commons, Biomedical Informatics Commons, Chemical Actions and Uses Commons, Chemical and Pharmacologic Phenomena Commons, Public Health Commons


Comments
This article is the author’s final published version in BMJ Health and Care Informatics, Volume 32, Issue 1, 2025, Article number e101632.
The published version is available at https://doi.org/10.1136/bmjhci-2025-101632. Copyright © Author(s) (or their employer(s)) 2025.