Pediatric surgical trainees and artificial intelligence: a comparative analysis of DeepSeek, Copilot, Google Bard and pediatric surgeons’ performance on the European Pediatric Surgical In-Training Examinations (EPSITE)

Large language models (LLMs) have advanced rapidly, but their utility in pediatric surgery remains uncertain. This study assessed the performance of three AI models—DeepSeek, Microsoft Copilot (GPT-4) and Google Bard—on the European Pediatric Surgery In-Training Examination (EPSITE).

Saved in:
Bibliographic Details
Main Authors: Gnatzy, Richard (Author) , Lacher, Martin (Author) , Cascio, Salvatore (Author) , Münsterer, Oliver (Author) , Wagner, Richard (Author) , Aubert, Ophelia (Author)
Format: Article (Journal)
Language:English
Published: December 2025
In: Pediatric surgery international
Year: 2025, Volume: 41, Issue: 1, Pages: 1-9
ISSN:1437-9813
DOI:10.1007/s00383-025-06104-9
Online Access:Verlag, kostenfrei, Volltext: https://doi.org/10.1007/s00383-025-06104-9
Get full text
Author Notes:Richard Gnatzy, Martin Lacher, Salvatore Cascio, Oliver Münsterer, Richard Wagner, Ophelia Aubert
Description
Summary:Large language models (LLMs) have advanced rapidly, but their utility in pediatric surgery remains uncertain. This study assessed the performance of three AI models—DeepSeek, Microsoft Copilot (GPT-4) and Google Bard—on the European Pediatric Surgery In-Training Examination (EPSITE).
Item Description:Online veröffentlicht: 8. August 2025
Gesehen am 06.11.2025
Physical Description:Online Resource
ISSN:1437-9813
DOI:10.1007/s00383-025-06104-9