Appl Clin Inform
DOI: 10.1055/a-2681-5008
Brief Scientific Communication

Generative AI Summaries to Facilitate ED Handoff

Nicholas Genes
1   Ronald O. Perelman Department of Emergency Medicine, NYU Grossman School of Medicine, New York, United States (Ringgold ID: RIN12296)
,
Gregory Simon
1   Ronald O. Perelman Department of Emergency Medicine, NYU Grossman School of Medicine, New York, United States (Ringgold ID: RIN12296)
,
Christian Koziatek
1   Ronald O. Perelman Department of Emergency Medicine, NYU Grossman School of Medicine, New York, United States (Ringgold ID: RIN12296)
,
Jung G. Kim
1   Ronald O. Perelman Department of Emergency Medicine, NYU Grossman School of Medicine, New York, United States (Ringgold ID: RIN12296)
,
Kar-mun Woo
1   Ronald O. Perelman Department of Emergency Medicine, NYU Grossman School of Medicine, New York, United States (Ringgold ID: RIN12296)
,
Cassidy Dahn
1   Ronald O. Perelman Department of Emergency Medicine, NYU Grossman School of Medicine, New York, United States (Ringgold ID: RIN12296)
,
Leland Chan
1   Ronald O. Perelman Department of Emergency Medicine, NYU Grossman School of Medicine, New York, United States (Ringgold ID: RIN12296)
,
Batia Wiesenfeld
2   Kaufman Management Center, New York University Leonard N Stern School of Business, New York, United States (Ringgold ID: RIN5893)
› Author Affiliations
Preview

Background Emergency Department (ED) handoff to inpatient teams is a potential source of error. Generative Artificial Intelligence (AI) has shown promise in succinctly summarizing large quantities of clinical data and may help improve Emergency Department (ED) handoff. Objectives Our objectives were to: 1) evaluate the accuracy, clinical utility, and safety of AI-generated ED-to-inpatient handoff summaries; 2) identify patient and visit characteristics influencing summary effectiveness; and 3) characterize potential error patterns to inform implementation strategies. Methods This exploratory study evaluated AI-generated handoff summaries at an urban academic ED (February–April 2024). A HIPAA-compliant GPT-4 model generated summaries aligned with the IPASS framework; ED providers assessed summary accuracy, usefulness, and safety through on-shift surveys. Results Among 50 cases, median quality and usefulness scores were 4/5 (SE = 0.13). Safety concerns arose in 6% of cases, with issues including data omissions and mischaracterizations. Consultation status significantly affected usefulness scores (p < 0.05). Omissions of relevant medications, laboratory results, and other essential detailss were noted (n=6), and EM clinicians disagreed with some AI characterizations of patient stability, vitals and workup (n=8). The most common response was positive impressions of the technology incorporated into the handoff process (n=11). Conclusions This exploratory provider-in-the-loop model demonstrated clinical acceptability and highlighted areas for refinement. Future studies should incorporate recipient perspectives and examine clinical outcomes to scale and optimize AI implementation.



Publication History

Received: 01 May 2025

Accepted after revision: 11 August 2025

Accepted Manuscript online:
12 August 2025

© . Thieme. All rights reserved.

Georg Thieme Verlag KG
Oswald-Hesse-Straße 50, 70469 Stuttgart, Germany