Endoscopy 2025; 57(S 02): S514-S515
DOI: 10.1055/s-0045-1806335
Abstracts | ESGE Days 2025
ePosters

Evaluating the Potential of Generative AI in Digestive Endoscopy: ChatGPT-4 for Advanced Image Analysis in Gastroenterology

M Mascarenhas
1   Department of Gastroenterology, São João University Hospital Center, Porto, Portugal
,
T Ribeiro
1   Department of Gastroenterology, São João University Hospital Center, Porto, Portugal
,
B Agudo
2   Puerta de Hierro Majadahonda University Hospital, Madrid, Spain
,
J Afonso
1   Department of Gastroenterology, São João University Hospital Center, Porto, Portugal
,
F Mendes
1   Department of Gastroenterology, São João University Hospital Center, Porto, Portugal
,
M Martins
1   Department of Gastroenterology, São João University Hospital Center, Porto, Portugal
,
P Marílio Cardoso
1   Department of Gastroenterology, São João University Hospital Center, Porto, Portugal
,
J Mota
1   Department of Gastroenterology, São João University Hospital Center, Porto, Portugal
,
M J Almeida
1   Department of Gastroenterology, São João University Hospital Center, Porto, Portugal
,
A Martins Pinto da Costa
2   Puerta de Hierro Majadahonda University Hospital, Madrid, Spain
,
M González-Haba Ruiz
2   Puerta de Hierro Majadahonda University Hospital, Madrid, Spain
,
J Widmer
3   NYU Langone Hospital Long Island, Mineola, United States of America
,
E G Hourneaux de Moura
4   Hospital das Clínicas of the University of São Paulo, São Paulo, Brazil
,
A Javed
5   Royal Liverpool University Hospital, Liverpool, United Kingdom
,
T Manzione
6   Instituto de Infectologia Emílio Ribas, Sao Paulo, Brazil
,
N Sidney
7   Instituto de Infectologia Emílio Ribas, São Paulo, Brazil
,
L Barroso
8   Wake Forest University School of Medicine, Winston-Salem, United States of America
,
V D Parades
9   Hospital Paris Saint-Joseph, Paris, France
,
J Ferreira
10   Faculty of Engineering – University of Porto, Porto, Portugal
,
M Guilherme
1   Department of Gastroenterology, São João University Hospital Center, Porto, Portugal
› Institutsangaben
 

Aims In recent years, there has been exponential growth in the application of artificial intelligence in healthcare, with development of several tools based on large language models (LLMs). Recently, there is a growing interest in the clinical implementation of this tools, with recent versions incorporating imaging analysis capability. Nevertheless, their potential utility in Gastroenterology remains largely unexplored. This study aimed to assess ChatGPT-4’s performance in interpreting images from different Gastroenterology procedures.

Methods A total of 740 images from five procedures – capsule endoscopy (CE), device-assisted enteroscopy (DAE), endoscopic ultrasound (EUS), digital single-operator cholangioscopy (D-SOC) and high-resolution anoscopy (HRA)–were included and analyzed by ChatGPT-4 using a personalized predefined prompt. ChatGPT-4 predictions were compared to gold standard diagnoses. Statistical analyses included accuracy, sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and area under the curve (AUC) [1] [2] [3].

Results For CE, ChatGPT-4 demonstrated accuracies ranging from 50.0% to 90.0%, with AUCs between 0.50 and 0.90. For DAE, the model achieved an accuracy of 67.0% (AUC 0.67). For EUS, the system showed AUCs of 0.488 and 0.550 for the detection of pancreatic cystic and solid lesions, respectively. The LLM differentiated benign from malignant biliary strictures with an AUC of 0.550. For HRA, ChatGPT-4 overal accuracy ranged between 47.5% and 67.5%.

Conclusions This multicentric study highlights that while ChatGPT4 demonstrates notable capabilities in text analysis, its performance in image analysis remains suboptimal, underscoring the need for significant improvements before clinical adoption.



Publikationsverlauf

Artikel online veröffentlicht:
27. März 2025

© 2025. European Society of Gastrointestinal Endoscopy. All rights reserved.

Georg Thieme Verlag KG
Oswald-Hesse-Straße 50, 70469 Stuttgart, Germany

 
  • References

  • 1 Koga S., Du W.. From text to image: challenges in integrating vision into ChatGPT for medical image interpretation. Neural Regen Res 2025; 20: 487-488
  • 2 Sonoda Y., Kurokawa R., Nakamura Y., Kanzawa J., Kurokawa M., Ohizumi Y., Gonoi W., Abe O.. Diagnostic perfor-mances of GPT-4o, Claude 3 Opus, and Gemini 1.5 Pro in "Diagnosis Please" cases Jpn J Radiol. 2024; doi:10.1007/s11604-024-01619-y.
  • 3 Lahat A., Shachar E., Avidan B., Shatz Z., Glicksberg B.S., Klang E.. Evaluating the use of large language model in identi-fying top research questions in gastroenterology. Sci Rep 2023; 13: 4164