Thromb Haemost
DOI: 10.1055/a-2796-1975
Original Article: Stroke, Systemic or Venous Thromboembolism

Optimizing the Accuracy of Natural Language Processing Tools for Pulmonary Embolism Detection Through Integration with Claims Data: The PE-EHR+ Study

Authors

  • Sina Rashedi

    1   Thrombosis Research Group, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States
  • Syed Bukhari

    2   Division of Cardiology, Department of Medicine, Johns Hopkins University, Baltimore, Maryland, United States
  • Darsiya Krishnathasan

    1   Thrombosis Research Group, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States
  • Candrika D. Khairani

    1   Thrombosis Research Group, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States
  • Antoine Bejjani

    1   Thrombosis Research Group, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States
    3   Department of Internal Medicine, University of Pittsburgh Medical Center, Pittsburgh, Pennsylvania, United States
  • Mariana B. Pfeferman

    1   Thrombosis Research Group, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States
  • Julia Malejczyk

    1   Thrombosis Research Group, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States
  • Mehrdad Zarghami

    4   Jamaica Hospital Medical Center, Queens, New York, United States
  • Eric A. Secemsky

    5   Division of General Internal Medicine and Primary Care, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States
    6   Richard A. and Susan F. Smith Center for Outcomes Research in Cardiology, Department of Medicine, Beth Israel Deaconess Medical Center, Boston, Massachusetts, United States
    7   Harvard Medical School, Boston, Massachusetts, United States
  • Farbod N. Rahaghi

    8   Division of Pulmonary and Critical Care, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States
  • Mohamad A. Hussain

    9   Division of Vascular and Endovascular Surgery and the Center for Surgery and Public Health, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States
  • Hamid Mojibian

    10   Department of Radiology and Biomedical Imaging, Yale University, New Haven, Connecticut, United States
  • Samuel Z. Goldhaber

    1   Thrombosis Research Group, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States
    11   Division of Cardiovascular Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States
  • David Jiménez

    12   Respiratory Department, Hospital Ramón y Cajal and Instituto Ramón y Cajal de Investigación Sanitaria (IRYCIS), Madrid, Spain
    13   Medicine Department, Universidad de Alcalá, Campus Científico-Tecnológico, Crta. de Madrid-Barcelona, Madrid, Spain
    14   CIBER Enfermedades Respiratorias (CIBERES), Instituto de Salud Carlos III, Madrid, Spain
  • Manuel Monreal

    15   Cátedra de Enfermedad Tromboembólica, Universidad Católica San Antonio de Murcia, Spain
  • Richard Yang

    16   YNHH/Yale Center for Outcomes Research and Evaluation (CORE), New Haven, Connecticut, United States
  • Li Zhou

    16   YNHH/Yale Center for Outcomes Research and Evaluation (CORE), New Haven, Connecticut, United States
  • Gregory Piazza

    1   Thrombosis Research Group, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States
    11   Division of Cardiovascular Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States
  • Harlan M. Krumholz

    16   YNHH/Yale Center for Outcomes Research and Evaluation (CORE), New Haven, Connecticut, United States
    17   Section of Cardiovascular Medicine, Department of Internal Medicine, Yale School of Medicine, New Haven, Connecticut, United States
    18   Department of Health Policy and Management, Yale School of Public Health, New Haven, Connecticut, United States
  • Liqin Wang

    19   Division of General Internal Medicine and Primary Care, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States
  • Behnood Bikdeli

    1   Thrombosis Research Group, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States
    11   Division of Cardiovascular Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States
    18   Department of Health Policy and Management, Yale School of Public Health, New Haven, Connecticut, United States

Funding Information Dr. Bikdeli was supported by a Career Development Award from the American Heart Association and Vascular InterVentional Advances Physicians (#938814) for the PE-EHR+ study.


Graphical Abstract

Abstract

Background

Rule-based natural language processing (NLP) tools can identify pulmonary embolism (PE) via radiology reports. However, their external validity remains uncertain.

Methods

In this cross-sectional study, 1,712 hospitalized patients (with and without PE) at Mass General Brigham (MGB) hospitals (2016–2021) were analyzed. Two previously published NLP algorithms were applied to radiology reports to identify PE. Chart review by two physicians was the reference standard. We tested three approaches: (A) NLP applied to all patients; (B) NLP limited to radiology reports of patients with principal or secondary International Classification of Diseases 10th revision (ICD-10) PE discharge codes; and (C) NLP applied to patients with PE discharge codes or a Present-on-Admission (POA) indicator (“Y”) for PE. All others were assumed PE-negative in Approaches B and C to minimize NLP false positives. Weighted estimates were derived from the MGB hospitalized cohort (n = 381,642) to calculate F1 scores (as the harmonic mean of sensitivity and positive predictive value [PPV]).

Results

In Approach A, both NLP tools showed high sensitivity (82.5%, 93.0%) and specificity (98.9%, 98.7%) but low PPV (60.3%, 59.6%). Approach B improved PPV (95.2%, 94.9%) but reduced sensitivity (74.1%, 76.2%), while Approach C preserved both high sensitivity (82.5%, 93.0%) and PPV (95.6%, 95.8%). Approach C demonstrated the best performance, yielding significantly higher F1 scores for both NLP tools (88.6%, 94.4%) compared with Approach A (69.7%, 72.6%) and Approach B (83.3%, 84.5%) (P < 0.001).

Conclusion

The accuracy of PE detection improves when rule-based NLP algorithms are operationalized using administrative claims data in addition to radiology reports.

Data Availability Statement

The data for this study are available from the corresponding author upon reasonable request.




Publication History

Received: 20 August 2025

Accepted: 23 January 2026

Accepted Manuscript online:
28 January 2026

Article published online:
09 February 2026

© 2026. Thieme. All rights reserved.

Georg Thieme Verlag KG
Oswald-Hesse-Straße 50, 70469 Stuttgart, Germany