Repository logo
  • English
  • Deutsch
  • Français
Log In
New user? Click here to register.Have you forgotten your password?
  1. Home
  2. CRIS
  3. Publication
  4. Large Language Model-Based Patient Simulation to Foster Communication Skills in Health Care Professionals: User-Centered Development and Usability Study
 

Large Language Model-Based Patient Simulation to Foster Communication Skills in Health Care Professionals: User-Centered Development and Usability Study

URI
https://arbor.bfh.ch/handle/arbor/46106
Version
Published
Identifiers
10.2196/81271
Date Issued
2025
Author(s)
Elhilali, Ahmed
Ngo, Andy Suy-Huor
Reichenpfader, Daniel  
Denecke, Kerstin  
Type
Article
Language
English
Subjects

chatbot

large language model

medical education

patient simulation

vignette

Abstract
Background: Case-based learning using standardized patients is a key method for teaching communication skills in medicine. Besides logistical and financial hurdles, standardized patients portrayed by actors cannot cover the complete diversity of sociodemographic factors of patients. Large language models (LLMs) show promise for creating scalable patient simulations and could probably cover a broader diversity of factors. They could also be integrated into the continuous training of future health care professionals' communication and interaction skills. Objective: This study aimed to introduce the system architecture of a digital tool that leverages LLMs to simulate patient conversations for medical education, focusing specifically on medical history taking. Through an explorative analysis, we aimed to assess the tool's usability and examine differences between LLMs in simulating patient encounters. Methods: We followed a user-centered design process, gathering initial requirements from 2 medical students. We then developed a fully functional web prototype using a Python Flask backend and a PostgreSQL database, integrating 5 LLMs from OpenAI, Anthropic, and xAI. The system includes an artificial intelligence-assisted case vignette generator and a dynamic patient simulator. For the explorative analysis of the prototype, we conducted a task-based usability test with 5 medical students, measuring their experience using the System Usability Scale (SUS) questionnaire and qualitative questions. We then conducted an explorative analysis in which 4 practicing physicians evaluated the simulation quality of 3 models (Grok 3, GPT-4, and Claude 3 Opus) across 7 criteria on a 5-point Likert scale. Results: Usability testing yielded a mean SUS score of 91.5 (SD 8.40), indicating high perceived usability in a small formative sample. Students praised the system's simplicity and intuitive design but noted the absence of a formal conclusion and performance feedback, expressing a desire for a "didactic loop" to maximize learning. The models showed limitations in simulating uncertainties and memory lapses, responding to follow-up questions, and producing natural conversational flow. They perform well in simulating a coherent symptom profile, in using patient-like language, and in describing a realistic timeline and symptom progression. The differences among the models were not statistically significant. Ratings showed limited discriminative reliability (Kendall W=0-0.19, ie, very low) and a ceiling effect, with most scores clustered at 4-5, constraining interpretation; all group differences should therefore be viewed as exploratory. Conclusions: We successfully developed a highly usable patient simulation tool that serves as a foundation for further development. Our results show that while the tool could be effective for communication training, its full potential will only be realized by integrating an automated feedback mechanism to create a complete didactic loop, as requested by the test users. Future work should assess in more depth the differences among the models in simulating psychosocial patient characteristics.
DOI
https://doi.org/10.24451/arbor.12534
Publisher DOI
10.2196/81271
Journal or Serie
JMIR medical education
Journal or Serie
JMIR Medical Education
ISSN
2369-3762
Organization
Technik und Informatik  
Institute for Patient-centered Digital Health  
AI for Health  
Volume
11
Publisher
JMIR Publications
Submitter
Denecke, Kerstin
Citation apa
Elhilali, A., Ngo, A. S.-H., Reichenpfader, D., & Denecke, K. (2025). Large Language Model-Based Patient Simulation to Foster Communication Skills in Health Care Professionals: User-Centered Development and Usability Study. In JMIR Medical Education (Vol. 11). JMIR Publications. https://doi.org/10.24451/arbor.12534
File(s)
Loading...
Thumbnail Image
Download

open access

Name

mededu-2025-1-e81271.pdf

License
Attribution 4.0 International
Version
published
Size

2.36 MB

Format

Adobe PDF

Checksum (MD5)

b1b177a16bbe814e98bc603444e68ec2

About ARBOR

Built with DSpace-CRIS software - System hosted and mantained by 4Science

  • Cookie settings
  • Privacy policy
  • End User Agreement
  • Send Feedback
  • Our institution