Subscribe to RSS
DOI: 10.1055/a-2780-0664
Generative artificial intelligence for patient education material on gastric cancer prevention
Authors

Abstract
Background This study assessed the effectiveness of large language models (LLMs) in generating lay summaries for patient education on the management of precancerous lesions and early neoplasia in the stomach.
Methods In this pilot study, we used a two-period, crossover, blinded design to compare a ChatGPT-4o summary versus a Digestive Cancers Europe (DiCE) summary. Two panels rated the materials: expert physicians and DiCE Patient Advisory Committee members. Experts scored accuracy, completeness, comprehensibility, and satisfaction (across five sections); patients rated overall completeness, comprehensibility, and satisfaction. Paired comparisons used mixed-effects estimates. Readability was assessed with Flesch–Kincaid grade level (FKGL) and SMOG index.
Results Median expert ratings were similar between materials across metrics. For the overall summary, median (range; IQR) scores were: accuracy 5 (4–6; 1) for ChatGPT-4o vs. 5 (3–6; 1) for DiCE (P = 0.10); completeness 4 (3–5; 1) vs. 4 (2–5; 1; P = 0.27); comprehensibility 4 (3–5; 1) vs. 4 (2–5; 1; P = 0.33); and satisfaction 4 (2–5; 1) vs. 3 (1–5; 2; P = 0.53). Patient ratings mirrored experts, with very similar results. Readability failed to meet guideline recommendations for both summaries on both FKGL and SMOG scores.
Conclusion ChatGPT-4o produced patient materials comparable to DiCE, but both require readability optimization; a human-in-the-loop workflow and future tests across prompts and models are warranted.
‡ joint first authors.
* joint senior authors.
Publication History
Received: 14 March 2025
Accepted after revision: 29 December 2025
Article published online:
13 February 2026
© 2026. Thieme. All rights reserved.
Georg Thieme Verlag KG
Oswald-Hesse-Straße 50, 70469 Stuttgart, Germany
-
References
- 1 Baskar S, Schoeneich R, Baskar A. et al. Leveraging patient education to amplify colorectal cancer screening in the United States: strategies and implications. J Cancer Educ 2025; 40: 321-328
- 2 Zafar N, Wolf AB, Kepniss JL. et al. Effectiveness of community education for breast cancer screening. J Breast Imaging 2024; 6: 166-174
- 3 Aydin S, Karabacak M, Vlachos V. et al. Large language models in patient education: a scoping review of applications in medicine. Front Med (Lausanne) 2024; 11: 1477898
- 4 Chang PW, Amini MM, Davis RO. et al. ChatGPT4 outperforms endoscopists for determination of postcolonoscopy rescreening and surveillance recommendations. Clin Gastroenterol Hepatol 2024; 22: 1917-1925.e17
- 5 Pugliese N, Wai-Sun Wong V, Schattenberg JM. et al. Accuracy, reliability, and comprehensibility of ChatGPT-generated medical responses for patients with nonalcoholic fatty liver disease. Clin Gastroenterol Hepatol 2024; 22: 886-889.e5
- 6 Lee T-C, Staller K, Botoman V. et al. ChatGPT answers common patient questions about colonoscopy. Gastroenterology 2023; 165: 509-511.e7
- 7 Thrift AP, El-Serag HB. Burden of gastric cancer. Clin Gastroenterol Hepatol 2020; 18: 534-542
- 8 Morgan E, Arnold M, Camargo MC. et al. The current and future incidence and mortality of gastric cancer in 185 countries, 2020–40: A population-based modelling study. EClinicalMedicine 2022; 47: 101404
- 9 Leja M. Where are we with gastric cancer screening in Europe in 2024?. Gut 2024; 73: 2074-2082
- 10 Kang H-T. Current status of the National Health screening programs in South Korea. Korean J Fam Med 2022; 43: 168-173
- 11 Kim GH, Liang PS, Bang SJ. et al. Screening and surveillance for gastric cancer in the United States: Is it needed?. Gastrointest Endosc 2016; 84: 18-28
- 12 Dinis-Ribeiro M, Libânio D, Uchima H. et al. Management of epithelial precancerous conditions and early neoplasia of the stomach (MAPS III): European Society of Gastrointestinal Endoscopy (ESGE), European Helicobacter and Microbiota Study Group (EHMSG) and European Society of Pathology (ESP) Guideline update 2025. Endoscopy 2025; 57: 504-554
- 13 Digestive Cancers Europe (DiCE). Accessed: 14 January 2026. https://digestivecancers.eu/
- 14 ChatGPT version 4o. Accessed: 14 January 2026. https://chatgpt.com
- 15 Maida M, Ramai D, Mori Y. et al. The role of generative language systems in increasing patient awareness of colon cancer screening. Endoscopy 2025; 57: 262-268
- 16 Weiss BD. American Medical Association Foundation and American Medical Association. Health literacy and patient safety: Help patients understand. Manual for clinicians. 2nd Edition; 2007. Accessed: 10 February 2026 https://med.fsu.edu/sites/default/files/userFiles/file/ahec_health_clinicians_manual.pdf
- 17 Pham DK, Vo BQ. Towards reliable medical question answering: techniques and challenges in mitigating hallucinations in language models. Accessed: 14 January 2025 https://arxiv.org/html/2408.13808v1
