Introduction
Barrett’s esophagus (BE) is defined as columnar metaplasia of the distal esophagus.
It involves abnormal cellular changes in the esophageal mucosa and is considered a
precursor of BE adenocarcinoma (BEA) [1], the incidence rate of which has been rapidly increasing in Western countries [2]. In Asian countries, including Japan, the incidence of BEA has also risen owing
to an increase in the prevalence of obesity and gastroesophageal reflux disease as
a consequence of lower Helicobacter pylori infection rates [3]
[4].
The prognosis of BEA is poor when the disease is diagnosed in an advanced stage [5] but is relatively favorable when treated while still in its superficial stage [6]. Therefore, its early detection is critically important; however, identifying superficial
BEA is often difficult. In Western countries, the Seattle protocol for BE surveillance
recommends that four-quadrant biopsy specimens be acquired at intervals of 1 to 2 cm
[7]. However, random biopsies can lead to numerous sampling errors, and only 4 % to
5 % of BEs are discovered using this method [8]
[9]
[10]. Owing to such limitations, various endoscopic imaging techniques such as narrow
band imaging (NBI), acetic acid chromoendoscopy (AAC), and endoscopy-based confocal
laser endomicroscopy were developed to improve the diagnosis of superficial BEA [11]; among these, NBI has been the most widely researched.
To diagnose neoplastic lesions in patients with BE, several groups proposed classifications
based on mucosal and vascular patterns visualized using magnifying endoscopy with
NBI (M-NBI); however, these classifications did not markedly improve lesion detection
owing to the complexity and diversification of vascular patterns [12]
[13]
[14]
[15]. To address these issues, the Japan Esophageal Society Barrett’s esophagus (JES-BE)
working group proposed a simpler M-NBI classification [16] that is based on representative early gastric cancer diagnostic criteria [17]. Although a study to validate this method showed promising diagnostic accuracy and
interobserver agreement, it was conducted only by endoscopists at high-volume academic
centers [18].
In contrast, a recent meta-analysis revealed that AAC was effective [19]. Acetic acid causes reversible acetylation of BE nucleoproteins and vascular congestion,
which leads to highlighting microstructural patterns. However, there have been no
dedicated studies on the usefulness of M-NBI that incorporates acetic acid spraying
(M-AANBI). In addition, we are aware of no study that directly compared the diagnostic
accuracy of M-AANBI with that of M-NBI. Hence, this study aimed to determine the ability
of M-AANBI to detect BEA and compare it to that of M-NBI when used by either expert
or non-expert endoscopists.
Patients and methods
Patients
This retrospective study included consecutive patients with BEA who underwent endoscopic
submucosal dissection (ESD) between July 2005 and November 2021 at Mie University
Hospital. Operable patients with a preoperative diagnosis of early-stage BEA were
eligible for ESD. During the study period, 34 BEA lesions from 33 patients were evaluated
with preoperative M-NBI/M-AANBI. Of these, four samples lacking images of either M-NBI
or M-AANBI and one with low-quality endoscopic images were excluded; hence, 29 BEA
lesions from 28 patients were retrospectively analyzed. The study was approved by
the local ethics committee and was conducted according to the ethical standards laid
out in the Declaration of Helsinki. The requirement for written informed consent was
waived owing to the retrospective nature of the study. Instead, detailed information
about the study was available to the public on our institutional website, and patients
were offered the opportunity to opt out.
Definitions of BE, BEA, and non-neoplastic BE (NNBE)
BE was defined as an esophagus in which any portion of the normal distal squamous
epithelial lining was replaced by metaplastic columnar epithelium as clearly visible
endoscopically ≥ 1 cm above the esophagogastric junction and confirmed histopathologically
[20]. The esophagogastric junction was defined as the end of the lower esophageal palisade
vessels or upper limit of the gastric fold [21]
[22].
BEA was defined as adenocarcinoma that arose from BE as verified endoscopically and
histologically, including non-invasive well-differentiated adenocarcinoma (high grade)
that is equivalent to high-grade dysplasia in Western countries. NNBE was defined
as the mucosa adjacent to the BEA that was resected during the ESD procedure.
Evaluation of endoscopic findings
All endoscopic images were obtained using a magnifying endoscope (GIF-Q240Z, GIF-H260Z,
or GIF-H290Z; Olympus Corp., Tokyo, Japan) and an endoscopic system with NBI (EVIS
LUCERA ELITE or EVIS LUCERA SPECTRUM; Olympus Corp.). A distal attachment (D-201-11804,
D-201-11804, MAJ-1989, or MAJ-1990; Olympus Corp.) was placed on the tip of the endoscope
to maintain a suitable focusing distance during magnification.
Tumor size, surface color, and macroscopic type were evaluated using white-light endoscopy.
M-NBI was performed with optimal foci to evaluate mucosal and surface patterns according
to the JES-BE classification [16]. Subsequently, 1.5 % acetic acid was sprayed onto the lesion with a 20-mL syringe
at low pressure, and M-AANBI images were obtained and evaluated as detailed below.
Non-magnifying NBI was performed before each M-NBI or M-AANBI session; alternating
between magnified and non-magnified images helped identify the lesion section that
was assessed.
Histopathological evaluation
All BEA lesions were resected using the ESD procedure. Each resected specimen was
cut into 2 mm slices after formalin fixation; the histological type, size, depth of
invasion, and margins (horizontal/vertical) were then evaluated. The pathological
diagnosis was performed based on hematoxylin-eosin staining by two expert pathologists,
who were blinded to endoscopic findings, according to the Japanese Classification
of Esophageal Cancer [22]. The depth of tumor invasion was recorded as the T category; T1a and T1b were defined
as tumors confined to the mucosa and submucosa, respectively. The extent of BEA was
pathologically evaluated and compared to the endoscopic demarcation line (DL), which
was confirmed by comparison to the pathological BEA border.
Derivation study for developing BEA/NNBE classifications (phase 1)
In phase 1, two endoscopists (Y.I. and K.T.) evaluated 60 high-quality M-AANBI images
of BEA and NNBE based on the AAC classification for gastric cancer [23]. Disagreements between the raters were resolved through discussion. The images were
independent of those used for validation.
The characteristic surface patterns of BEA/NNBE were classified into five types as
follows: type I, small round pits of uniform size and shape ([Fig. 1a]); type II, slit-like pits ([Fig. 1b]); type III, gyrus and villous patterns ([Fig. 1c, d]); type IV, irregular arrangement and size ([Fig. 1e]); and type V, destructive pattern ([Fig. 1f]).
Fig. 1 Classifications of BEA/NNBE surface patterns using M-AANBI. The surface patterns of
BEA and NNBE obtained using M-AANBI were classified into five types. a Small round pits of uniform size and shape (type I); b Slit-like pits (type II); c Gyrus pattern (type III); d, Villous pattern (type III); e, f Irregular arrangement and size (type IV); g, h Destructive pattern (type V); The characteristic surface patterns of NNBE were types
I-III, while those of BEA were types IV and V. BEA, Barrett’s esophageal adenocarcinoma;
NNBE, non-neoplastic Barrett’s esophagus; M-AANBI, magnifying endoscopy with narrow
band imaging plus acetic acid spraying.
Validation studies (phase 2)
In phase 2, we conducted three validation studies to evaluate the diagnostic performance
of M-AANBI. The images were examined by four expert endoscopists (Y.U., H.Y., Y.H.,
and M.K.) and four non-expert endoscopists (A.H., S.S., I.A., and W.Y.). We defined
experts as operators who had experience with more than 100 M-NBI and M-AANBI procedures
and non-experts as those who had experience with fewer than 20 such procedures. Non-expert
endoscopists were provided a short explanation of M-AANBI-based diagnosis of BEA/NNBE
(according to our developed classification) prior to their participation.
Validation 1 (main validation): Comparing the diagnostic performances of M-NBI and
M-AANBI
An independent validation dataset was prepared to compare the accuracies of M-NBI
and M-AANBI when diagnosing BEA. Based on the sample size (described below), the validation
dataset included 130 each of M-NBI- and M-AANBI-acquired images; 65 BEA and 65 NNBE
images were obtained with each of these two modalities. All images were randomized
and verified, and each endoscopist then made a diagnosis of BEA or NNBE while assigning
a confidence level (high/low) for each image. Agreement among the raters was quantified
using Fleiss’ kappa coefficient.
Validation 2: Changes in diagnostic performance owing to combining M-AANBI and M-NBI
To evaluate the effect of combining M-AANBI with M-NBI on diagnosis, 56 pairs of M-NBI
and M-AANBI images acquired from the same lesion areas (28 BEA and 28 NNBE) were prepared.
First, endoscopists performed their diagnoses using only M-NBI images. Subsequently,
they re-diagnosed the samples using both M-NBI and M-AANBI; the difference in diagnostic
performance between these two methods was then evaluated ([Fig. 2]). Agreement among the raters was quantified using Fleiss’ kappa coefficient.
Fig. 2 Evaluating changes in diagnostic performance when using both M-AANBI and M-NBI. The
first evaluation was performed using M-NBI only: a second evaluation was then performed
using both M-NBI and M-AANBI. Abbreviations: M-AANBI, magnifying endoscopy with narrow
band imaging plus acetic acid spraying; M-NBI, magnifying endoscopy with narrow band
imaging.
Validation 3: Evaluating the ease of recognizing the DL using three methods
The DL was defined as the endoscopic border between the background non-cancerous and
cancerous mucosae [24]
[25]
[26]; this has been confirmed to be consistent with pathological findings. DL recognition
in the BEA samples of 28 patients was evaluated using three different methods: magnifying
endoscopy with white-light imaging (M-WLI), M-NBI, and M-AANBI. For each lesion, three
images of the same DL area were arranged on one slide for examination ([Fig. 3]). Each endoscopist was instructed to score the ease of DL recognition when using each
of these methods according to a visual analog scale (VAS), which was graded from 0
to 10 wherein 0 represented “invisible” and 10 represented “perfect visibility”.
Fig. 3 Magnifying endoscopic images of the DL of BEA (same area). Abbreviations: DL, demarcation
line; BEA, Barrett’s esophageal adenocarcinoma; M-NBI, magnifying endoscopy with narrow
band imaging; M-WLI, magnifying endoscopy with white-light imaging; M-AANBI, magnifying
endoscopy with narrow band imaging plus acetic acid spraying; “Marked,” marked image
of the DL superimposed on the magnifying endoscopic image.
Sample size determination and statistical analysis
A pilot study was conducted to determine the appropriate sample size. Four expert
endoscopists (Y.U., H.Y., Y.H., and M.K.) and four non-expert endoscopists (A.H.,
S.S., I.A., and W.Y.) evaluated 40 M-NBI and 40 M-AANBI images; 20 BEA and 20 NNBE
images were examined using each modality. The mean diagnostic accuracies were 72.5 %
for M-NBI and 85.6 % for M-AANBI. According to McNemar’s test, 130 pairs of M-NBI
and M-AANBI images were required to achieve a power of > 90 % (assuming a two-sided
alpha of 0.05) based on eight diagnostic accuracy ratings from as many endoscopists.
Categorical variables are summarized as frequencies and percentages, while quantitative
variables are presented as means and standard deviations. The performance of BEA diagnosis
was determined by calculating the sensitivity, specificity, and accuracy for each
endoscopist and the overall group. The interobserver agreement among endoscopists
was calculated using Fleiss’ kappa, with the strength of each agreement graded using
the kappa value (< 0.20 = poor, 0.21–0.40 = fair, 0.41–0.60 = moderate, 0.61–0.80 = substantial,
and 0.81–1.00 = almost perfect) [27]. Fisher’s exact test was used to compare M-AANBI findings. The differences in the
diagnostic test results between M-NBI and M-AANBI were analyzed using the McNemar’s
test. The average scores reflecting the ease of DL recognition using the three methods
were compared using the Friedman test and Bonferroni’s multiple comparison test. P < 0.05 was considered indicative of statistical significance in all tests. All calculations
were performed using EZR version 1.27 (Saitama Medical Center, Jichi Medical University,
Japan) [28].
Results
Patient characteristics
The clinical characteristics of patients with BEA and of their lesions are shown in
[Table 1]. The mean patient age was 73.6 years (range, 50–92 years), and 79.2 % were men.
Per the Prague criteria, the mean circumferential and maximal extents of BE were 2.2 cm
and 3.4 cm, respectively. The mean tumor size was 20.5 mm; the most frequent macroscopic
type was depressed (51.7 %), and 65.5 % of lesions had a reddish color. The dominant
histopathology in most cases was well-differentiated adenocarcinoma (86.2 %), and
tumor invasion in most cases was confined to the mucosa (79.3 %).
Table 1
Clinicopathological characteristics of patients with BEA.
Patient characteristics (n = 28)
|
Age, mean ± SD, years (range)
|
73.6 ± 9.6 (50–92)
|
Sex, male/female, n
|
24/4
|
Circumferential length of BE, mean cm (range)
|
2.2 ± 3.1 (0–15)
|
Maximal length of BE, mean cm (range)
|
3.4 ± 2.9 (1–15)
|
Lesion characteristics (n = 29)
|
Size
|
|
|
20.5 ± 12.3 (6–53)
|
|
16 (55.2)
|
|
13 (44.8)
|
Macroscopic type, n (%)
|
|
|
3 (10.3)
|
|
7 (24.1)
|
|
4 (13.8)
|
|
15 (51.7)
|
Color, n (%)
|
|
|
10 (34.5)
|
|
19 (65.5)
|
Dominant histopathology, n (%)
|
|
|
25 (86.2)
|
|
3 (10.3)
|
|
1 (3.4)
|
Depth of invasion
|
|
|
23 (79.3)
|
|
6 (20.7)
|
BEA, Barrett’s esophageal adenocarcinoma; SD, standard deviation; BE, Barrett’s esophagus;
T1a, tumor confined to the mucosa; T1b, submucosal invasion.
Derivation study (phase 1)
The assessment of surface patterns on BEA/NNBE using M-AANBI is shown in [Table 2]. The characteristic surface patterns of NNBE were wholly type I, type II, or type
III. In contrast, almost all BEA images had two characteristic surface patterns (type
IV or type V [P < 0.001]). Therefore, type IV and V surface patterns observed on M-AANBI were deemed
as predictive of BEA.
Table 2
Distribution of surface patterns in BEA and NNBE samples.
Surface patterns of M-AANBI
|
BEA
|
NNBE
|
P value[1]
|
Types I or II or III, n (type I/type II/type III)
|
1 (0/0/1)
|
30 (4/11/15)
|
< 0.001
|
Type IV or V, n (type IV/type V)
|
29 (26/3)
|
0 (0/0)
|
BEA, Barrett’s esophageal adenocarcinoma; NNBE, non-neoplastic Barrett’s esophagus.
1 Fisher’s exact test.
Validation studies (phase 2)
Validation 1 (main validation): Comparing the diagnostic performances of M-NBI and
M-AANBI
[Table 3] shows the diagnostic performance, interobserver agreement (κ value), and confidence
levels for predicting BEA according to the validation 1 scheme. For M-NBI, the mean
accuracy, sensitivity, and specificity among all endoscopists were 65.8 %, 61.7 %,
and 69.8 %, respectively; in comparison, the values for M-AANBI were 91.1 %, 86.3 %,
and 95.2 %, respectively. All parameters were significantly higher when using M-AANBI
than with M-NBI among all eight endoscopists (P < 0.05).
Table 3
Diagnostic performance of M-NBI and M-AANBI in terms of detecting BEA.
Raters
|
Modalities
|
Accuracy,
% (95 % CI)
|
Sensitivity,
% (95 % CI)
|
Specificity,
% (95 % CI)
|
IO agreement
K value
(95 %CI)
|
High confidence rate,
% (95 % CI)
|
All
|
M-NBI
|
65.8
(61.1–70.5)
|
61.7
(50.3–73.1)
|
69.8
(58.9–80.7)
|
0.25
(0.21–0.28)
|
53.3
(41.0–65.5)
|
M-AANBI
|
91.1[1]
(85.7–96.4)
|
86.3[1]
(76.6–96.1)
|
95.2[1]
(91.0–99.4)
|
0.65
(0.61–0.68)
|
75.2[1]
(64.9–85.5)
|
Experts
|
M-NBI
|
70.0
(66.1–73.9)
|
64.6
(38.6–90.7)
|
75.4
(55.1–95.7)
|
0.38
(0.31–0.45)
|
61.2
(34.9–87.4)
|
M-AANBI
|
91.3[1]
(78.3–104.4)
|
85.0[1]
(57.8–112.2)
|
97.7[1]
(95.2–100.1 %)
|
0.64
(0.56–0.72)
|
81.7[1]
(58.4–100.5)
|
Non-experts
|
M-NBI
|
61.5
(54.4–68.7)
|
58.5
(39.7–78.0)
|
64.2
(44.6–83.9)
|
0.33
(0.26–0.40)
|
45.4
(32.9–57.9)
|
M-AANBI
|
90.8[1]
(82.2–99.3)
|
87.7[2]
(81.1–94.3)
|
92.7[1]
(82.6–102.8)
|
0.66
(0.59–0.73)
|
68.7
|
BEA, Barrett’s esophageal adenocarcinoma; M-NBI, magnifying narrow band imaging; M-AANBI,
magnifying narrow band imaging plus acetic acid spraying; CI, confidence interval;
IO, interobserver.
1
P < 0.001, vs. M-NBI using McNemar’s test.
2
P < 0.01, vs. M-NBI using McNemar’s test.
The accuracy of M-NBI was significantly higher in the expert group than in the non-expert
group (70.0 % vs. 61.5 %, P < 0.05). However, both groups achieved high accuracy with M-AANBI, with no difference
between them.
The interobserver agreement among endoscopists was fair for M-NBI (κ = 0.25) and substantial
for M-AANBI (κ = 0.65). Moreover, the proportion of high confidence for M-AANBI was
greater than that for M-NBI (75.2 % vs. 53.3 %, P < 0.001); both the expert and non-expert groups had similar tendencies.
Validation 2: Changes in diagnostic performance owing to combining M-AANBI and M-NBI
The results of validation 2-related analyses are presented in [Table 4]. The mean accuracy, sensitivity, and specificity of BEA diagnosis using M-NBI alone
were 68.5 %, 64.6 %, and 73.2%, respectively. When using both M-NBI and M-AANBI, these
values improved to 87.3 %, 85.7 %, and 90.6 %, respectively; the accuracy was significantly
greater than that of M-NBI alone (P < 0.05). The interobserver agreement when using M-NBI alone was fair (κ = 0.31),
whereas that when combining M-NBI and M-AANBI was substantial (κ = 0.69). The high
confidence rate improved from 57.6 % to 76.3 % when using both modalities.
Table 4
Diagnostic performance parameters before/after adding M-AANBI to M-NBI.
Modalities
|
Accuracy
mean, %
(95 % CI)
|
Sensitivity
mean, %
(95 % CI)
|
Specificity
mean, %
(95 % CI)
|
IO agreement
K value
(95 %CI)
|
High confidence rate, %
(95 % CI)
|
M-NBI
|
68.5
(62.9–74.2)
|
64.6
(56.2–73.2)
|
73.2
(61.3–85.2)
|
0.31
(0.26–0.36)
|
57.6
(45.0–70.2)
|
M-NBI +
M-AANBI
|
87.3[1]
(82.7–91.9)
|
85.7
(77.9–93.5)
|
90.6
(85.6–95.7)
|
0.69
(0.64–0.74)
|
76.3[2]
(63.0–89.6)
|
M-NBI, magnifying endoscopy with narrow band imaging; M-AANBI, magnifying endoscopy
with narrow band imaging plus acetic acid spraying; CI, confidence interval; IO, interobserver.
1
P < 0.05, vs. M-NBI using McNemar’s test.
2
P < 0.01, vs. M-NBI using McNemar’s test.
Validation 3: Evaluating the ease of recognizing the DL using three methods
As shown in [Table 5], the mean VAS scores for the ease of DL recognition as evaluated by all endoscopists
were 3.60 ± 1.27 for M-WLI, 6.16 ± 1.36 for M-NBI, and 8.40 ± 0.76 for M-AANBI; the
differences between these scores (compared using the Friedman test) were significant
(P < 0.001). Using Bonferroni’s multiple comparison test, the mean VAS score of M-AANBI
was significantly higher than those of the other two methods, with M-NBI scoring significantly
higher than M-WLI. Similar trends were observed in both the expert and non-expert
groups (P < 0.001).
Table 5
Ease of demarcation line recognition when using M-WLI, M-NBI, and M-AANBI.
Modalities
|
All endoscopists
|
Experts
|
Non-experts
|
M-WLI,
mean VAS ± SD
|
3.60 ± 1.27
|
3.53 ± 1.28
|
3.68 ± 1.41
|
M-NBI,
mean VAS ± SD
|
6.16 ± 1.36[1]
|
6.16 ± 1.53[1]
|
6.16 ± 1.38[1]
|
M-AANBI,
mean VAS ± SD
|
8.40 ± 0.76[1]
[2]
|
8.41 ± 0.82[1]
[2]
|
8.40 ± 0.92[1]
[2]
|
VAS, visual analog scale; M-WLI, magnifying endoscopy with white-light imaging; M-NBI,
magnifying endoscopy with narrow band imaging; M-AANBI, magnifying endoscopy with
narrow band imaging plus acetic acid spraying; SD, standard deviation.
1
P < 0.001, vs. M-WLI using Friedman’s test.
2
P < 0.001, vs. M-NBI using Friedman’s test.
Discussion
This study was the first to demonstrate the usefulness of M-AANBI in BEA diagnosis
through direct comparison with M-NBI. The diagnostic accuracy, sensitivity, specificity,
interobserver agreement, and confidence level of M-AANBI were significantly higher
than those of M-NBI. Furthermore, combining M-NBI with M-AANBI produced an additive
effect in terms of improving diagnostic accuracy; M-AANBI also tended to facilitate
DL recognition.
To date, several NBI classifications for the diagnosis of BEA have been developed;
however, they have proven to be complicated with limited diagnostic abilities [12]
[13]
[14]
[15], rendering them inadequate for use in clinical practice. In 2016, the Barrett’s
International NBI Group (BING) constructed a simpler NBI classification for differentiating
between dysplasia and non-dysplasia and demonstrated high diagnostic performance for
the former (overall accuracy: 85.4 %, sensitivity: 80.4 %, and specificity: 88.4 %)
and substantial interobserver agreement (κ = 0.681) [29]. However, this validation was performed only by experts, and it remained unclear
whether a similar diagnostic performance could be achieved by non-experts.
Subsequently, the JES-BE classification that was based on the typical M-NBI diagnostic
criteria for early gastric cancer was proposed [16]
[17]. BEA is diagnosed by an irregular mucosal or vascular pattern; the criteria were
further modified to include the flat pattern originally regarded to be physiological.
The JES-BE classification has been reported to have a high diagnostic ability (overall
accuracy, 91 %; sensitivity, 87 %; and specificity, 97 %) and high interobserver agreement
(κ = 0.77); however, that study was single-armed and could not be compared with other
representative modalities such as M-AANBI [18]. Although the JES-BE classification was useful even for non-experts with little
experience in diagnosing BEA, they were engaged in high-volume or academic centers
and had substantial experience using magnification endoscopy for detecting early gastric
cancer. Given that the JES-BE classification is based on the criteria used for detecting
early gastric cancer, the existing familiarity of operators with magnification endoscopy
may have affected the results; hence, the versatility of this classification in clinical
practice remains unknown. With respect to this point, when we performed our tests
on non-experts, the accuracy of M-NBI for BEA diagnosis using the JES-BE classification
was significantly lower than it was among experts.
AAC has been reported to be a useful technique for BEA diagnosis [19]. Acetic acid changes the color of the BE epithelial surface to white, making it
easier to recognize microstructural patterns. AAC is easy, safe, inexpensive, and
effective, particularly for lesions with blood oozing on the surface; however, its
superiority with respect to M-AANBI remains unknown as this question was not the focus
of this study. Moreover, inflammatory changes can lead to incorrect BEA diagnoses
when using AAC because of the rapid loss of acetowhitening [30]. Despite limited studies on the usefulness of M-AANBI in diagnosing BEA [31], it is better suited to avoid such misdiagnoses.
We previously classified surface patterns when using AAC in the stomach into five
types, as described above [23]. When planning our current study, this classification was applied to differentiate
BEAs from NNBEs ([Fig. 1]); its use with M-AANBI showed a high diagnostic ability (overall accuracy, 91.1 %;
sensitivity, 86.3 %; and specificity, 95.2 %). All these parameters were superior
to those of M-NBI. Although we provided a short tutorial for the diagnostic classification
of M-AANBI to non-experts who had little experience with M-NBI and M-AANBI, the accuracy
of BEA diagnosis by non-experts was high when using M-AANBI (equivalent to that of
experts). Furthermore, the diagnostic performance and certainty were significantly
improved by performing both M-AANBI and M-NBI. M-NBI-based diagnosis normally requires
observation at high magnification to evaluate not only the structure but also the
vascular pattern. However, since M-AANBI only examines structural patterns, it is
possible to establish diagnoses with low magnification that emulates non-magnification;
hence, this method is easy to use for endoscopists accustomed only to NBI in Western
countries. Furthermore, M-AANBI may be utilized without magnification when using high-resolution
endoscopes, which are likely to become more common in the future. Notably, the interobserver
agreement was fair for M-NBI (κ = 0.25) whereas the value for M-AANBI was substantial
(κ = 0.65). This indicates that M-NBI is subjective and difficult to interpret, while
M-AANBI is an objective diagnostic method for all endoscopists including non-experts.
Identifying the DL between BEA and NNBE is generally considered difficult [32]. Since BE exhibits heterogeneous mucosal patterns of intestinal metaplasia, false
DLs can be designated in NNBEs; this is also one of the reasons it is difficult to
recognize the precise DL between BEA and NNBE. The ease of DL recognition using M-WLI
was also low in the current study, whereas M-AANBI recognized the DL much more readily
than M-NBI and M-WLI ([Table 5]).
Artificial intelligence (AI) has recently allowed for remarkable progress in image
recognition of gastrointestinal lesions [33]. A Previous study found that AI-based techniques are highly reliable for diagnosing
gastric cancer using M-NBI [34]; furthermore, Ling et al. reported a unique AI system for identifying the DL in
gastric cancer [35]. As such, AI may also be useful for the qualitative diagnosis of BEA and identification
of the DL using M-AANBI.
This study had certain limitations. First, it was a retrospective study that used
selected endoscopic images, and selection bias toward high-quality images existed.
However, considering that physicians can observe all lesions in real-time in clinical
practice, using high-quality images ought to be acceptable. Second, since this was
a detailed comparative study of M-AANBI and M-NBI, validation was only performed on
a cohort from our own institution. Third, the results were based on a small sample
size, and several images were extracted from the same lesion. Despite our best efforts
to avoid it, some images may have been of overlapping sections of a given lesion.
Hence, more data derived from a larger number of cases (including from multicenter
prospective studies) are required to improve the diagnostic yield of such endoscopic
findings. Fourth, the validation 3 scheme, which measured the confidence of recognizing
the DL with each of three modalities, may have a subjective aspect. However, an endoscopic
diagnosis is essentially a series of subjective evaluations; furthermore, the evaluators
were blinded to the greatest extent possible during testing. As such, subjective bias
ought to have been minimized. Fifth, M-AANBI is not a standard practice technique
at present, with concerns that the essential system with magnifying endoscope for
M-NBI is more expensive than the conventional endoscopy system. Furthermore, the M-AANBI
technique may be complicated for endoscopists who are not familiar with the use of
magnifying endoscopes. However, the NBI instruments have increased in popularity and
acetic acid is not expensive due to its use as a threefold vinegar dilution. Moreover,
similar to M-AANBI diagnosis, a high-resolution non-magnifying endoscope with a combination
of AAC and NBI may be sufficient for diagnosis of BEA. Sixth, this study evaluated
images of exposed BEA areas; however, certain BEA areas can sometimes be covered with
normal squamous epithelium in patients treated with proton pump inhibitors. Therefore,
the usefulness of M-AANBI for qualitative diagnosis and DL detection in areas covered
with squamous epithelium is unknown.
Conclusions
In conclusion, we demonstrated the usefulness of M-AANBI in diagnosing BEA and its
superiority to M-NBI. Although the diagnostic ability of M-NBI was low among non-experts,
that of M-AANBI was high, with good interobserver agreement among all endoscopists
and non-experts alike. The results of this study suggest that M-AANBI is useful for
BEA diagnosis by endoscopists of all experience levels, including non-experts who
have little experience with magnifying endoscopy. In clinical practice, the use of
M-AANBI in addition to M-NBI might be beneficial for BEA detection during surveillance
endoscopy for BE and for DL recognition during ESD for BEA.