Endoscopy 2023; 55(04): 320-323
DOI: 10.1055/a-2038-7078
Editorial

Artificial intelligence and the push for small adenomas: all we need?

Referring to Ahmad A et al. p. 313–319
Katharina Zimmermann-Fraedrich
1   Department of Interdisciplinary Endoscopy University Hospital Hamburg-Eppendorf, Hamburg, Germany
,
1   Department of Interdisciplinary Endoscopy University Hospital Hamburg-Eppendorf, Hamburg, Germany
› Author Affiliations

Artificial intelligence (AI) has been enthusiastically welcomed by the endoscopic community, although many of the potential applications have remained in the testing environment using images and videos, and have not been tested in the clinical setting. Some computer-aided polyp detection (CADe) systems have entered the field of clinical application, however, and have been studied extensively in randomized trials. At the time of writing, 19 such trials were available as full publications [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] [13] [14] [15] [16] [17] [18] [19]. They all show a more or less significant increase in detection of small adenomas. The same results are summarized over and over again in no less than 16 meta-analyses [20] [21] [22] [23] [24] [25] [26] [27] [28] [29] [30] [31] [32] [33] [34] [35], plus a protocol for a further one [36], as well as a meta-analysis of some of these meta-analyses [37]!

“The question relates to the ultimate clinical relevance of increasing ADR from high to even higher levels …, we still do not know whether we have a cutoff ADR level, above which there is no further improvement in cancer prevention.”

In this issue of Endoscopy, Ahmad et al. report on their use of CADe within a bowel cancer screening program [38]. The study is distinct from most other randomized trials in several respects. The patient population was homogeneous, including only colonoscopies from a fecal immunochemical test (FIT)-based screening program. Furthermore, a cap device, mostly Endocuff Vision (Olympus, Tokyo, Japan), was used at the discretion of the endoscopist. This device has been shown to increase polyp yield in a recent meta-analysis [39], and also recently in a FIT-based randomized screening colonoscopy study published in Endoscopy [40]. Endocuff was used in about 70 % of colonoscopies in the Ahmad et al. study, and it is a shame that its use was not standardized. Finally, polyp detection rate (PDR) and not adenoma detection rate (ADR) was used as the main outcome parameter, in contrast to almost all other studies.

The results show that PDR was slightly higher in the CADe group (85.7 %) compared with the control group (79.7 %; P = 0.05). ADRs were 71.4 % and 65.0 %, respectively. So, almost all patients had a polyp and more than two-thirds had an adenoma. No difference in ADR was found, but the level was already high in the control group. In other FIT-based screening programs, ADR was as high as in the control group of the current study in the Netherlands [41] and lower in studies from other countries, namely 55 % in Spain [42], 51 % in Sweden, and 41 % in Italy [40]. In primary screening colonoscopy, ADR is naturally lower, as patients have not been selected by a stool test. In the recently published NordICC trial, ADR was 31 % [43], which is similar to the ADR found in the German screening colonoscopy setting [44]. A recent review examined ADR differences in the control groups of randomized trials and found generally a very wide variation. The overall mean value of 37.5 % was however increased by a factor of 1.6 within FIT-based screening programs [45].

The sample size calculation in the present study seems a bit strange. The study was powered “to detect a 10 % increase in polyp detection, from a detection rate of 20 % in the control group up to 30 % in the CADe group”. In reality, the PDR was almost 80 % in the control group. In addition, higher ADRs and PDRs have been published in FIT-based screening colonoscopy programs, as outlined above. So, in essence, the study could have been underpowered, and with larger case numbers, an ADR increase from 65 % to 71 % could have been significant (if relevant is another question). Interestingly, numerical differences in adenomas were in the group 1–5 mm (from 765 to 846) but also in the group > 10 mm (from 94 to 120 polyps), presenting some contradiction to most previous studies, which show that small polyps are found more frequently by AI, but this may be coincidental.

So, what can we conclude about clinical relevance of AI increasing ADR? The first question is, do we see an upper limit of ADR that cannot be substantially improved upon by AI or any other means? Furthermore, may this study by Ahmad et al. have reached the possible ADR ceiling? Looking at the randomized trials cited above, the relative increase in ADR was dependent on the basic ADR level in the control group in most of the papers; relative ADR increases were around 50 % in low ADR papers [15] [18], about 30 % in trials with ADR levels between 20 %–30 % [11] [17], and 20 %–25 % in studies with basic ADR levels of 30 %–45 % [8] [10] [14], such as in FIT-based screening programs [2]. This is somewhat logical: there is less room for improvement in high ADR performers. We are confident that this will be the topic of another meta-analysis soon. Tandem studies in AI showed an even greater effect on the adenoma miss rate, but it is well known that tandem trials yield better results than comparative randomized studies [46].

The second question relates to the ultimate clinical relevance of increasing ADR from high to even higher levels. We know that ADR correlates with interval cancer rate (i. e. higher ADR prevents more colorectal cancer). However, we still do not know whether we have a cutoff ADR level, above which there is no or not much further improvement in cancer prevention. This was indirectly suggested by the Polish and Austrian follow-up studies [47] [48], which primarily took a cutoff level of 20 % for their data analyses. In contrast, the Californian group suggested a linear correlation in their studies [49] [50]. Their recent publication however includes figures that somehow contradict the conclusions [50]. It appears logical that if more than two-thirds of individuals undergoing endoscopy have a (mostly small/very small) adenoma, then the risk associated with adenomas per se may be of less importance, and finding more and more small adenomas may not be the key to improved cancer prevention.

The third and fourth questions are which adenomas should we chase and what else needs to be done? The relevance of small adenomas has been doubted by many database analyses in the past 10 years, which correlated patients’ prognoses with the stage of adenomas found and removed during colonoscopy, and was reconfirmed in a large review at the end of 2022 [51]. Another recent meta-analysis summarized the evidence of 12 such studies [52]. The authors uniformly showed no relevant disadvantage with regard to colorectal cancer development for individuals with small adenomas only, compared with the wider population or people with a normal colonoscopy. The worrisome finding of all these studies, however, is that the colorectal cancer incidence was fourfold higher when patients had advanced adenomas, which were obviously removed. Should we be ensuring that these patients benefit more from colonoscopy and polypectomy in order to bring their risk level down to “normal”? Therefore, we presume that there is more to colonoscopy quality and cancer prevention than just finding more and more small adenomas. We know that resection can be incomplete, with sometimes worrisome rates reported [53] [54] [55], and adherence to follow-up recommendations is not optimal. Mostly, patients with small adenomas tend to undergo surveillance earlier than recommended, and those with advanced lesions have surveillance intervals that are longer than those recommended in guidelines [56] [57] [58] (and perhaps are initially also under-resected). Is it possible that our continual focus on ADR is misplaced?

Finally, randomized trials, despite the high-quality evidence they provide, may also have shortcomings. The focus of a study and the willingness to produce good results, and to achieve a better publication, may increase the examiner’s awareness and attention in both groups (Hawthorne effect), but perhaps more in the study group, such as an AI-based application. On the other hand, there are recent sobering reports about what happens if AI systems are incorporated into a colonoscopy program. ADR did not increase with AI assistance in a US center [59], and they decreased in an Israeli hospital [60], perhaps due to shorter withdrawal times. The group from Würzburg used eye-tracking technology and showed that trainees focussed their eyes in the middle of the picture when using AI, whereas without AI, their gaze tended to travel further around the endoscopy screen [61]. This phenomenon is known from radiology and is called “deskilling”. This may not be the intention of AI, which is still dependent on the examiner’s ability to maneuver the scope adequately and also to pay attention; however, if the expectation is that AI will do the job, then attention may be reduced.

What do we need? Studies should look at more relevant outcomes than just whether we find more small polyps (we already have abundant evidence that this is the case). Authors and editors should refrain from writing and accepting more and more meta-analyses about AI polyp detection. There are also more challenging applications to be studied, starting with tissue diagnosis (and histology replacement, which may face quite a few legal and credentialing hurdles), but also performance, interpretation, documentation, risk stratification, and follow-up of colonoscopies and patients with more advanced polyps, all parameters that have to be documented before, during, and after colonoscopy. If some of this documentation paperwork were automatic and the burden reduced for endoscopists, we could perhaps focus better on good patient care and finally also on good clinical research.



Publication History

Article published online:
07 March 2023

© 2023. Thieme. All rights reserved.

Georg Thieme Verlag KG
Rüdigerstraße 14, 70469 Stuttgart, Germany

 
  • References

  • 1 Li MD, Huang ZR, Shan QY. et al. Performance and comparison of artificial intelligence and human experts in the detection and classification of colonic polyps. BMC Gastroenterol 2022; 22: 517
  • 2 Rondonotti E, Di Paolo D, Rizzotto ER. et al. Efficacy of a computer-aided detection system in a fecal immunochemical test-based organized colorectal cancer screening program: a randomized controlled trial (AIFIT study). Endoscopy 2022; 54: 1171-1179
  • 3 Gimeno-García AZ, Negrin DH, Hernández A. et al. Usefulness of a novel computer-aided detection system for colorectal neoplasia: a randomized controlled trial. Gastrointest Endosc 2022; DOI: 10.1016/j.gie.2022.09.029.
  • 4 Xu H, Tang RSY, Lam TYT. et al. Artificial intelligence-assisted colonoscopy for colorectal cancer screening: a multicenter randomized controlled trial. Clin Gastroenterol Hepatol 2023; 21: 337-346
  • 5 Yao L, Zhang L, Liu J. et al. Effect of an artificial intelligence-based quality improvement system on efficacy of a computer-aided detection system in colonoscopy: a four-group parallel study. Endoscopy 2022; 54: 757-768
  • 6 Wallace MB, Sharma P, Bhandari P. et al. Impact of artificial intelligence on miss rate of colorectal neoplasia. Gastroenterology 2022; 163: 295-304
  • 7 Glissen Brown JR, Mansour NM, Wang P. et al. Deep learning computer-aided polyp detection reduces adenoma miss rate: a United States multi-center randomized tandem colonoscopy study (CADeT-CS Trial). Clin Gastroenterol Hepatol 2022; 20: 1499-1507
  • 8 Xu L, He X, Zhou J. et al. Artificial intelligence-assisted colonoscopy: a prospective, multicenter, randomized controlled trial of polyp detection. Cancer Med 2021; 10: 7184-7193
  • 9 Kamba S, Tamai N, Saitoh I. et al. Reducing adenoma miss rate of colonoscopy assisted by artificial intelligence: a multicenter randomized controlled trial. J Gastroenterol 2021; 56: 746-757
  • 10 Repici A, Spadaccini M, Antonelli G. et al. Artificial intelligence and colonoscopy experience: lessons from two randomised trials. Gut 2022; 71: 757-765
  • 11 Liu P, Wang P, Glissen Brown JR. et al. The single-monitor trial: an embedded CADe system increased adenoma detection during colonoscopy: a prospective randomized study. Therap Adv Gastroenterol 2020; 13: 1756284820979165
  • 12 Luo Y, Zhang Y, Liu M. et al. Artificial intelligence-assisted colonoscopy for detection of colon polyps: a prospective, randomized cohort study. J Gastrointest Surg 2021; 25: 2011-2018
  • 13 Wang P, Liu P, Glissen Brown JR. et al. Lower adenoma miss rate of computer-aided detection-assisted colonoscopy vs routine white-light colonoscopy in a prospective tandem study. Gastroenterology 2020; 159: 1252-1261
  • 14 Repici A, Badalamenti M, Maselli R. et al. Efficacy of real-time computer-aided detection of colorectal neoplasia in a randomized trial. Gastroenterology 2020; 159: 512-520
  • 15 Gong D, Wu L, Zhang J. et al. Detection of colorectal adenomas with a real-time computer-aided system (ENDOANGEL): a randomised controlled study. Lancet Gastroenterol Hepatol 2020; 5: 352-361
  • 16 Wang P, Liu X, Berzin TM. et al. Effect of a deep-learning computer-aided detection system on adenoma detection during colonoscopy (CADe-DB trial): a double-blind randomised study. Lancet Gastroenterol Hepatol 2020; 5: 343-351
  • 17 Liu WN, Zhang YY, Bian XQ. et al. Study on detection rate of polyps and adenomas in artificial-intelligence-aided colonoscopy. Saudi J Gastroenterol 2020; 26: 13-19
  • 18 Su JR, Li Z, Shao XJ. et al. Impact of a real-time automatic quality control system on colorectal polyp and adenoma detection: a prospective randomized controlled study (with videos). Gastrointest Endosc 2020; 91: 415-424
  • 19 Wang P, Berzin TM, Glissen Brown JR. et al. Real-time automatic detection system increases colonoscopic polyp and adenoma detection rates: a prospective randomised controlled study. Gut 2019; 68: 1813-1819
  • 20 Ashat M, Klair JS, Singh D. et al. Impact of real-time use of artificial intelligence in improving adenoma detection during colonoscopy: a systematic review and meta-analysis. Endosc Int Open 2021; 9: E513-E521
  • 21 Aziz M, Fatima R, Dong C. et al. The impact of deep convolutional neural network-based artificial intelligence on colonoscopy outcomes: a systematic review with meta-analysis. J Gastroenterol Hepatol 2020; 35: 1676-1683
  • 22 Aziz M, Haghbin H, Sayeh W. et al. Comparison of artificial intelligence with other interventions to improve adenoma detection rate for colonoscopy: a network meta-analysis. J Clin Gastroenterol 2022; DOI: 10.1097/MCG.0000000000001813.
  • 23 Barua I, Vinsard DG, Jodal HC. et al. Artificial intelligence for polyp detection during colonoscopy: a systematic review and meta-analysis. Endoscopy 2021; 53: 277-284
  • 24 Deliwala SS, Hamid K, Barbarawi M. et al. Artificial intelligence (AI) real-time detection vs. routine colonoscopy for colorectal neoplasia: a meta-analysis and trial sequential analysis. Int J Colorectal Dis 2021; 36: 2291-2303
  • 25 Hassan C, Spadaccini M, Iannone A. et al. Performance of artificial intelligence in colonoscopy for adenoma and polyp detection: a systematic review and meta-analysis. Gastrointest Endosc 2021; 93: 77-85
  • 26 Huang D, Shen J, Hong J. et al. Effect of artificial intelligence-aided colonoscopy for adenoma and polyp detection: a meta-analysis of randomized clinical trials. Int J Colorectal Dis 2022; 37: 495-506
  • 27 Li J, Lu J, Yan J. et al. Artificial intelligence can increase the detection rate of colorectal polyps and adenomas: a systematic review and meta-analysis. Eur J Gastroenterol Hepatol 2021; 33: 1041-1048
  • 28 Lui TKL, Guo CG, Leung WK. Accuracy of artificial intelligence on histology prediction and detection of colorectal polyps: a systematic review and meta-analysis. Gastrointest Endosc 2020; 92: 11-22
  • 29 Mori Y, Wang P, Løberg M. et al. Impact of artificial intelligence on colonoscopy surveillance after polyp removal: a pooled analysis of randomized trials. Clin Gastroenterol Hepatol 2022; DOI: 10.1016/j.cgh.2022.08.022.
  • 30 Nazarian S, Glover B, Ashrafian H. et al. Diagnostic accuracy of artificial intelligence and computer-aided diagnosis for the detection and characterization of colorectal polyps: systematic review and meta-analysis. J Med Internet Res 2021; 23: e27370
  • 31 Shah S, Park N, Chehade NEH. et al. Effect of computer-aided colonoscopy on adenoma miss rates and polyp detection: a systematic review and meta-analysis. J Gastroenterol Hepatol 2023; 38: 162-176
  • 32 Sivananthan A, Nazarian S, Ayaru L. et al. Does computer-aided diagnostic endoscopy improve the detection of commonly missed polyps? A meta-analysis. Clin Endosc 2022; 55: 355-364
  • 33 Spadaccini M, Iannone A, Maselli R. et al. Computer-aided detection versus advanced imaging for detection of colorectal neoplasia: a systematic review and network meta-analysis. Lancet Gastroenterol Hepatol 2021; 6: 793-802
  • 34 Xu Y, Ding W, Wang Y. et al. Comparison of diagnostic performance between convolutional neural networks and human endoscopists for diagnosis of colorectal polyp: systematic review and meta-analysis. PLoS One 2021; 16: e0246892
  • 35 Zhang Y, Zhang X, Wu Q. et al. Artificial intelligence-aided colonoscopy for polyp detection: a systematic review and meta-analysis of randomized clinical trials. J Laparoendosc Adv Surg Tech A 2021; 31: 1143-1149
  • 36 Shao L, Yan X, Liu C. et al. Effects of AI-assisted colonoscopy on adenoma miss rate/adenoma detection rate: a protocol for systematic review and meta-analysis. Medicine (Baltimore) 2022; 101: e31945
  • 37 Pan H, Cai M, Liao Q. et al. Artificial intelligence-aid colonoscopy vs. conventional colonoscopy for polyp and adenoma detection: a systematic review of 7 discordant meta-analyses. Front Med (Lausanne) 2021; 8: 775604
  • 38 Ahmad A, Wilson A, Haycock A. et al. Evaluation of a real-time computer aided polyp detection system during screening colonoscopy: AI-DETECT study. Endoscopy 2023; 55: 313-319 DOI: 10.1055/a-1966-0661.
  • 39 Walls M, Houwen B, Rice S. et al. The effect of the endoscopic device Endocuff™/Endocuff vision™ on quality standards in colonoscopy: a systematic review and meta-analysis of randomized trials. Colorectal Dis 2022; DOI: 10.1111/codi.16444.
  • 40 Zorzi M, Hassan C, Battagello J. et al. Adenoma detection by Endocuff-assisted versus standard colonoscopy in an organized screening program: the “ItaVision” randomized controlled trial. Endoscopy 2022; 54: 138-147
  • 41 Wisse PHA, Erler NS, de Boer SY. et al. Adenoma detection rate and risk for interval postcolonoscopy colorectal cancer in fecal immunochemical test-based screening: a population-based cohort study. Ann Intern Med 2022; 175: 1366-1373
  • 42 Cubiella J, Castells A, Andreu M. et al. Correlation between adenoma detection rate in colonoscopy- and fecal immunochemical testing-based colorectal cancer screening programs. United European Gastroenterol J 2017; 5: 255-260
  • 43 Bretthauer M, Løberg M, Wieszczy P. et al. Effect of colonoscopy screening on risks of colorectal cancer and related death. N Engl J Med 2022; 387: 1547-1556
  • 44 Brenner H, Altenhofen L, Kretschmann J. et al. Trends in adenoma detection rates during the first 10 y of the German screening colonoscopy program. Gastroenterology 2015; 149: 356-366
  • 45 Hassan C, Piovani D, Spadaccini M. et al. Variability in adenoma detection rate in control groups of randomized colonoscopy trials. Gastrointest Endosc 2023; 97: 212-225
  • 46 Zimmermann-Fraedrich K, Pohl H, Rösch T. et al. Designs of colonoscopic adenoma detection trials: more positive results with tandem than with parallel studies – an analysis of studies on imaging techniques and mechanical devices. Gut 2021; 70: 268-275
  • 47 Kaminski MF, Regula J, Kraszewska E. et al. Quality indicators for colonoscopy and the risk of interval cancer. N Engl J Med 2010; 362: 1795-803
  • 48 Waldmann E, Penz D, Šinkovec H. et al. Interval cancer after colonoscopy in the Austrian National Screening Programme: influence of physician and patient factors. Gut 2021; 70: 1309-1317
  • 49 Corley DA, Jensen CD, Marks AR. et al. Adenoma detection rate and risk of colorectal cancer and death. N Engl J Med 2014; 370: 1298-1306
  • 50 Schottinger JE, Jensen CD, Ghai NR. et al. Association of physician adenoma detection rates with postcolonoscopy colorectal cancer. JAMA 2022; 327: 2114-2122
  • 51 Baile-Maxía S, Mangas-Sanjuán C, Ladabaum U. et al. Risk factors for metachronous colorectal cancer or advanced adenomas after endoscopic resection of high-risk adenomas. Clin Gastroenterol Hepatol 2022; DOI: 10.1016/j.cgh.2022.12.005.
  • 52 Duvvuri A, Chandrasekar VT, Srinivasan S. et al. Risk of colorectal cancer and cancer related mortality after detection of low-risk or high-risk adenomas, compared with no adenoma, at index colonoscopy: a systematic review and meta-analysis. Gastroenterology 2021; 160: 1986-1996
  • 53 Pohl H, Anderson JC, Aguilera-Fish A. et al. Recurrence of colorectal neoplastic polyps after incomplete resection. Ann Intern Med 2021; 174: 1377-1384
  • 54 Djinbachian R, Iratni R, Durand M. et al. Rates of incomplete resection of 1- to 20-mm colorectal polyps: a systematic review and meta-analysis. Gastroenterology 2020; 159: 904-914
  • 55 Rotermund C, Djinbachian R, Taghiakbari M. et al. Recurrence rates after endoscopic resection of large colorectal polyps: a systematic review and meta-analysis. World J Gastroenterol 2022; 28: 4007-4018
  • 56 Schoen RE, Pinsky PF, Weissfeld JL. et al. Utilization of surveillance colonoscopy in community practice. Gastroenterology 2010; 138: 73-81
  • 57 Murphy CC, Sandler RS, Grubber JM. et al. Underuse and overuse of colonoscopy for repeat screening and surveillance in the Veterans Health Administration. Clin Gastroenterol Hepatol 2016; 14: 436-444
  • 58 Schwarz S, Schäfer W, Horenkamp-Sonntag D. et al. Follow-up of 3 million persons undergoing colonoscopy in germany: utilization of repeat colonoscopies and polypectomies within 10 years. Clin Transl Gastroenterol 2020; 12: e00279
  • 59 Ladabaum U, Shepard J, Weng Y. et al. Computer-aided detection of polyps does not improve colonoscopist performance in a pragmatic implementation trial. Gastroenterology 2022; DOI: 10.1053/j.gastro.2022.12.004.
  • 60 Levy I, Bruckmayer L, Klang E. et al. Artificial intelligence-aided colonoscopy does not increase adenoma detection rate in routine clinical practice. Am J Gastroenterol 2022; 117: 1871-1873
  • 61 Troya J, Fitting D, Brand M. et al. The influence of computer-aided polyp detection systems on reaction time for polyp detection and eye gaze. Endoscopy 2022; 54: 1009-1014