Open Access
CC BY-NC-ND 4.0 · Endosc Int Open 2020; 08(10): E1448-E1454
DOI: 10.1055/a-1229-3927
Original article

Training a computer-aided polyp detection system to detect sessile serrated adenomas using public domain colonoscopy videos

Taibo Li
1   Johns Hopkins School of Medicine – MD-PhD Program, Baltimore, Maryland, United States
2   MIT – Department of Electrical Engineering and Computer Science, Cambridge, Massachusetts, United States
,
Jeremy R. Glissen Brown
3   Center for Advanced Endoscopy, Division of Gastroenterology, Beth Israel Deaconess, Medical Center and Harvard Medical School, Boston, Massachusetts 02130
,
Kelovoulos Tsourides
4   MIT – Department of Brain and Cognitive Sciences, Cambridge, Massachusetts, United States
,
Nadim Mahmud
5   Hospital of the University of Pennsylvania – Division of Gastroenterology, Boston, Massachusetts, United States
,
Jonah M. Cohen
3   Center for Advanced Endoscopy, Division of Gastroenterology, Beth Israel Deaconess, Medical Center and Harvard Medical School, Boston, Massachusetts 02130
,
Tyler M. Berzin
3   Center for Advanced Endoscopy, Division of Gastroenterology, Beth Israel Deaconess, Medical Center and Harvard Medical School, Boston, Massachusetts 02130
› Institutsangaben
Preview

Abstract

Background Colorectal cancer (CRC) is a major public health burden worldwide, and colonoscopy is the most commonly used CRC screening tool. Still, there is variability in adenoma detection rate (ADR) among endoscopists. Recent studies have reported improved ADR using deep learning models trained on videos curated largely from private in-house datasets. Few have focused on the detection of sessile serrated adenomas (SSAs), which are the most challenging target clinically.

Methods We identified 23 colonoscopy videos available in the public domain and for which pathology data were provided, totaling 390 minutes of footage. Expert endoscopists annotated segments of video with adenomatous polyps, from which we captured 509 polyp-positive and 6,875 polyp-free frames. Via data augmentation, we generated 15,270 adenomatous polyp-positive images, of which 2,310 were SSAs, and 20,625 polyp-negative images. We used the CNN AlexNet and fine-tuned its parameters using 90 % of the images, before testing its performance on the remaining 10 % of images unseen by the model.

Results We trained the model on 32,305 images and tested performance on 3,590 images with the same proportion of SSA, non-SSA polyp-positive, and polyp-negative images. The overall accuracy of the model was 0.86, with a sensitivity of 0.73 and a specificity of 0.96. Positive predictive value was 0.93 and negative predictive value was 0.96. The area under the curve was 0.94. SSAs were detected in 93 % of SSA-positive images.

Conclusions Using a relatively small set of publicly-available colonoscopy data, we obtained sizable training and validation sets of endoscopic images using data augmentation, and achieved an excellent performance in adenomatous polyp detection.



Publikationsverlauf

Artikel online veröffentlicht:
07. Oktober 2020

© 2020. The Author(s). This is an open access article published by Thieme under the terms of the Creative Commons Attribution-NonDerivative-NonCommercial License, permitting copying and reproduction so long as the original work is given appropriate credit. Contents may not be used for commecial purposes, or adapted, remixed, transformed or built upon. (https://creativecommons.org/licenses/by-nc-nd/4.0/)

Georg Thieme Verlag KG
Rüdigerstraße 14, 70469 Stuttgart, Germany