Introduction
The incidence of Barrett’s esophageal adenocarcinoma (BAC) has increased rapidly in
the Western countries over the past few decades and it now accounts for more than
half of all esophageal cancers [1]
[2]. In contrast, BAC remains rare in Japan. However, the number of Japanese patients
with BAC is expected to increase in the future due to the decreasing rate of Helicobacter pylori infections and the increasing incidence of gastroesophageal reflux disease [3]
[4]. The 5-year survival rate for patients with BAC, including those with locally advanced
disease, is reportedly less than 20 % [5]. The poor survival rate for patients with BAC necessitates its early detection [6]
[7]. To detect a BAC lesion in the early stage, the Seattle protocol is used for surveillance
of Barrett’s lesions in Western countries [8]. However, it is expensive and time-consuming, and involves a high risk of sampling
error [9]. Targeted biopsy using narrow-band imaging (NBI) may overcome these weaknesses,
and several NBI classification systems have been proposed [10]
[11]
[12]
[13]
[14]. However, these classifications have not gained worldwide acceptance, as they involve
complicated and diverse criteria.
The Japan Esophageal Society (JES) proposed a new magnifying endoscopic classification
for predicting the histology of Barrett’s epithelium (JES-BE classification) in 2018
[15]. The diagnostic flow of this classification is simply described as follows: magnifying
endoscopic findings are composed of mucosal and vascular patterns, and each are classified
as “regular” or “irregular.” Then, the histology, non-dysplastic or dysplastic, was
comprehensively predicted. Compared to the other NBI classification, the JES-BE classification
has more simplified criteria including a flat pattern. These characteristics lead
to the high diagnostic accuracy of this classification. The sensitivity and specificity
of this classification were reported to be 87 % and 97 %, respectively [16]. Consequently, this classification is now widely accepted by Japanese endoscopists.
However, in daily clinical practice, we often observe Barrett’s lesions that are difficult
to diagnose as non-dysplastic or dysplastic by magnifying endoscopy with NBI (ME-NBI),
even using the JES-BE classification. In addition, to date, it is unclear for which
cases this classification is useful and for which cases additional careful attention
is needed. Therefore, this retrospective study aimed to clarify the diagnostic accuracy
of this classification and reveal the clinical and pathological characteristics of
Barrett’s esophagus that is difficult to diagnose on ME-NBI using the JES-BE classification.
Patients and methods
Study design
This image assessment study comprised the following two stages: 1) diagnosis by endoscopists
of images as dysplasia or non-dysplasia based on the JES-BE classification; and 2)
assessment of the diagnostic accuracy of the JES-BE classification and investigation
of the features of the misdiagnosed images, such as histology, magnification power,
and location of captured images.
Image selection
First, 1372 ME-NBI images from 98 successive patients with Barrett’s epithelium (BE)
or Barrett’s dysplasia/early cancer obtained between January 2012 and July 2019 were
retrieved from the endoscopic database of the Sendai Kousei Hospital. All images were
captured using a high-definition magnification endoscope (GIF-H260Z, Olympus Corp.,
Tokyo, Japan) and a videoendoscopy system (EVIS LUCERA SPECTRUM, CV-260SL and CLV-260SL,
Olympus Corp.). This scope has optical zoom function that enables adjustment of magnification
level. Second, among these images, only the images captured just before taking biopsy
samples were selected. Therefore, the site of the selected images and the site of
biopsies were in one-to-one correspondence. The images of dysplastic lesion were captured
before taking biopsies for tumor diagnosis. In contrast, the images of non-dysplastic
epithelium were captured before taking negative biopsies necessary for diagnosis of
tumor demarcation. Images of the biopsy site that were taken because of suspicion
for dysplastic lesions but that were pathologically non-dysplastic were not included
in this study. Images were excluded from the study if the view was indistinct (i. e.,
with blood or mucus or out of focus). If several images were taken from a single biopsy
site, the best quality image was selected. Finally, 142 ME-NBI images from 50 patients
diagnosed histopathologically were chosen for the present study. Image collection
and selection were performed by the principal investigator (IT).
JES-BE classification
[Fig. 1] shows the diagnostic flowchart for the JES-BE classification [15]. First, a mucosal pattern is assessed under low-power magnifying observation (regular/irregular).
Then, a vascular pattern is evaluated under fully magnifying observation (regular/irregular).
Finally, a comprehensive diagnosis to predict histology is made based on mucosal and
vascular patterns. If both mucosal and vascular patterns are regular, the lesion is
considered non-dysplastic; if irregular, it is considered dysplastic. If judgment
about regularity of the mucosal and vascular patterns differs, the overall judgment
should be irregular and the lesion considered to dysplastic. In addition, this classification
includes the modified flat pattern, which is defined as flat mucosa (none of pits
and villi: i. e., absent pattern) without clear demarcation with green thick vessels.
It is classified into a “regular” mucosal/vascular pattern and suggested to be non-dysplastic
BE.
Fig. 1 Diagnostic flowchart of the JES-BE classification.
Reviewer recruitment
Five endoscopists at Sendai Kousei Hospital were enrolled as image reviewers. They
had experience with more than 100 cases of magnification endoscopy for superficial
gastric cancer but had fewer than 10 cases of NBI magnifying observation for superficial
BAC, and therefore, were considered general endoscopists in Japan. The principal investigator
(IT) who prepared the ME-NBI images was not included as a reviewer. In addition, to
reflect real-world clinical practice, experts in the field of esophagus cancer were
not included as reviewers.
Image preparation
First, the 142 ME-NBI images were divided into two groups: training images (n = 10)
and test images (n = 132). The 132 images were pooled into a single dataset and inserted
into Microsoft PowerPoint (Windows 2010; Microsoft, Santa Clara, California, United
States) presentation slides against a black background. Each of the 132 images was
numbered and randomly arranged according to random number tables created by Microsoft
Excel (Windows 2010; Microsoft, Santa Clara, California, United States).
Image assessment
A short lecture about JES-BE classification using 10 sheets of training images was
conducted by an expert in magnifying endoscopic diagnosis of Barrett’s esophageal
cancer (DH). After the lecture, each of the five endoscopists diagnosed the mucosal
and vascular pattern of 132 images according to the JES-BE classification, and predicted
the histology of these images (i. e., non-dysplastic or dysplastic). The reviewers
had no access to the patient’s clinical information, histological data, or other imaging
material.
Histology
Biopsy samples were embedded in paraffin and stained with hematoxylin and eosin, and
immunohistologically, such as p53 and Ki-67. All samples were evaluated by two expert
pathologists specializing in BAC diagnosis who were blinded to any endoscopic findings.
When the diagnoses of the lesions by the two pathologists differed, the final diagnosis
was reached after they had a discussion and reach consensus. The histological diagnosis
was established based on the Vienna classification [17]. The pathologist graded the samples as non-dysplastic, low-grade dysplasia (LGD),
high-grade dysplasia (HGD), or adenocarcinoma. Because pathologists in Japan are not
familiar with the diagnosis of LGD, the description and reference images of LGD in
the Vienna classification were always reviewed before diagnosing LGD ([Fig. 2a, b]) [17]. In the current study, LGD, HGD, and adenocarcinoma were classified as dysplastic.
Fig. 2 Representative ME-NBI image of LGD. a ME-NBI revealed that mucosal and vascular patterns are both irregular. The surface
pattern is classified as a non-pit pattern according to the JES-BE classification.
b Histological examination showed cellular atypia, including nuclear hyperchromatism
and absence of goblet cells. Thus, this specimen was diagnosed as LGD. NBI, narrow-band
imaging; ME-NBI, magnifying endoscopy combined with narrow-band imaging; LGD, low-grade
dysplasia.
Outcomes
We analyzed the diagnostic accuracy of the JES-BE classification based on the sensitivity,
specificity, positive predictive value (PPV), negative predictive value (NPV), and
overall accuracy of the predicted histology of BE. The diagnostic accuracy of the
mucosal pattern alone was also evaluated. We analyzed interobserver agreement of the
five reviewers in prediction of histology and interpretation of NBI surface patterns.
To identify risk factors for misdiagnosed images, images with a correct rate of less
than 40 % were defined as difficult-to-diagnose, and those with 60 % or more were
defined as easy-to-diagnose. Logistic regression analysis was performed to identify
risk factors for difficult-to-diagnose images. We hypothesized that histology, magnification
power, and location of captured images are factors involved in misdiagnosis. The magnification
power of each image was evaluated based on the size of the mucosa and vessels ([Fig. 3a, b]). Based on the length of the background BE, each case was classified into the following
two groups: long-segment Barrett’s esophagus (LSBE) or short-segment Barrett’s esophagus
(SSBE). For instance, when an image of a site within 1 cm from the esophago-gastro-jejunum
was evaluated in a patient with LSBE, it was categorized as LSBE.
Fig. 3 Representative ME-NBI images of the same site at high- and low-power magnification. a ME-NBI image at low-power magnification. The mucosal pattern was irregular. b ME-NBI image at high-power magnification. The vascular pattern was irregular. This
area was confirmed to be dysplastic on biopsy. NBI, narrow-band imaging; ME-NBI, magnifying
endoscopy combined with narrow-band imaging.
This study was conducted in accordance with the standards of the Declaration of Helsinki
and the current ethical guidelines, and was approved by the institutional Ethics Board
at Sendai Kousei Hospital.
Statistical analysis
Diagnostic accuracy was calculated based on per image analysis. Observer agreement
between reviewers was evaluated using Fleiss’ kappa statistics for the predicted histology
and surface pattern of the JES-BE classification [18]
[19]. Kappa values were interpreted as previously described [20], as follows: kappa = 0, absence of agreement; kappa < 0.20, slight agreement; kappa = 0.21–0.40,
fair agreement; kappa = 0.41–0.60, moderate agreement; kappa = 0.61–0.80, substantial
agreement; and kappa > 0.81, almost perfect agreement. Continuous variables were compared
using the Mann-Whitney rank-sum test. The ratios of nominal variables were compared
using Fisher’s exact test. Risk factors for misdiagnosis were evaluated using logistic
regression analysis. Candidate variables were histology, magnification power, and
location of the captured image.
Statistical analyses were performed using JMP 13 (SAS Institute Inc., Cary, North
Carolina, United States) and P < 0.05 was considered to indicate statistical significance.
Results
Baseline characteristics
Characteristics of patients and ME-NBI images are shown in [Table 1]. One hundred thirty-two ME-NBI images were retrieved from 44 patients with BE. Twenty-nine
patients had Barrett’s esophageal dysplastic lesion, whereas 15 patients had non-dysplastic
BE. Management of patients was as follows: endoscopic submucosal dissection (ESD)
(n = 28), surgery (n = 1), and no treatment (n = 15). Of the 44 patients, 35 (80 %)
were male with a mean age of 71.0 years (interquartile range [IQR] 66–78 years]).
The median maximum length of BE was 2.0 cm (IQR 1.0 cm-3.0 cm). Thirty-six patients
(81 %) had hiatal hernia, and 31 (70 %) had reflux esophagitis.
Table 1
Demographics of patients and histology associated with NBI-ME images.
Characteristics Value
|
No. patients
|
44
|
Age, years, median (range)
|
71.0 (45–87)
|
Male
|
35 (80 %)
|
Barrett’s esophagus (short segment:long segment)
|
35:9
|
Barrett’s esophagus length, cm, median (range)
|
1.0 (0–10.0)
|
circumferential/maximal extent
|
3.0 (1.0–14.0)
|
Hiatal hernia, n (%)
|
36 (81 %)
|
Reflux esophagitis, n (%)
|
31 (70 %)
|
Treatment method (ESD:surgery:no treatment)
|
28:1:15
|
Number of NBI-ME images
|
132
|
Histology
|
|
|
57
|
|
75 (26/49)
|
Magnification power
|
|
|
46 (35 %)
|
|
86 (65 %)
|
Location of the captured image
|
|
|
81 (61 %)
|
|
51 (49 %)
|
NBI-ME, narrow band imaging-magnification endoscopy; LGD, low-grade dysplasia; HGD,
high-grade dysplasia; LSBE, long-segment Barrett’s esophagus; SSBE, short-segment
Barrett’s esophagus.
Of the 132 ME-NBI images, 75 (57 %) were dysplastic (LGD, n = 26; HGD or adenocarcinoma,
n = 49) and 57 (43 %) were non-dysplastic. The invasion depth of all adenocarcinomas
was limited to the mucosal or submucosal layers. Forty-six images (35 %) were high
magnification, and 86 were low magnification. Eighty-one images (61 %) were captured
from LSBE, and 51 were from SSBE.
Diagnostic accuracy and interobserver agreement of the JES-BE classification
The diagnostic accuracy of the predicted histology with both mucosal and vascular
pattern and with mucosal pattern alone is shown in [Table 2]. Mean values for sensitivity, specificity, PPV, and NPV were 67 %, 80 %, 82 %, and
65 %, respectively. The overall accuracy of the JES-BE classification was 73 %. The
diagnostic accuracy of the predicted histology with mucosal pattern alone was as follows.
Mean values of sensitivity, specificity, PPV, NPV, and overall accuracy were 79 %,
72 %, 79 %, 72 %, and 76 %, respectively.
Table 2
Diagnostic accuracy of predicted histology of Barrett’s esophagus.
|
Sensitivity
|
Specificity
|
PPV
|
NPV
|
Accuracy
|
Comprehensive Diagnosis
|
67
(64–70)
|
80
(76–84)
|
82
(80–84)
|
65
(61–69)
|
73
(70–76)
|
Mucosal pattern only
|
79
(77–81)
|
72
(69–75)
|
79
(77–81)
|
72
(70–74)
|
76
(74–78)
|
Data show mean values with 95 % confidence intervals in parentheses.
PPV, positive predictive value; NPV, negative predictive value.
Interobserver agreement for predicted histology and NBI surface patterns was moderate
and fair, respectively (κ = 0.720 and 0.573, respectively).
Risk factor analysis for misdiagnosed images
Of the 132 ME-NBI images, 34 (26 %) were difficult-to-diagnose and 99 (74 %) were
easy-to-diagnose. The characteristics of these two groups are shown in [Table 3]. [Table 4] shows the results of logistic regression analysis. LGD and high-power magnification
images were each significant risk factors for difficult-to-diagnose images (OR 6.80,
P = 0.0017 and OR 3.31, P = 0.0125, respectively). Images captured from LSBE were more likely to be difficult-to-diagnose
than those from SSBE, but the difference was insignificant (OR 2.13, P = 0.157).
Table 3
Characteristics of difficult-to-diagnose and easy-to-diagnose groups.
|
Difficult-to-diagnose
|
Easy-to-diagnose
|
Histology
(non-dysplastic/LGD/HGD and adenocarcinoma)
|
10/17/7
|
47/11/40
|
Magnification power
(high/low)
|
19/15
|
27/71
|
Location of the captured image
(LSBE/SSBE)
|
26/8
|
55/43
|
LGD, low-grade dysplasia; HGD, high-grade dysplasia; LSBE, long-segment Barrett’s
esophagus; SSBE, short-segment Barrett’s esophagus.
Table 4
Risk factors of misdiagnosis based on logistic regression analysis.
|
Difficult-to-diagnose images
|
OR
|
95 % CI
|
P value
|
Histology
|
|
|
6.81
|
2.05
|
22.58
|
0.001
|
|
1.36
|
0.42
|
4.37
|
0.61
|
|
Reference
|
Magnification power
|
|
|
|
|
|
3.31
|
1.29
|
8.48
|
0.01
|
|
Reference
|
Location of captured image
|
|
|
|
|
|
2.13
|
0.74
|
6.11
|
0.16
|
|
Reference
|
LGD, low-grade dysplasia; HGD, high-grade dysplasia; LSBE, long-segment Barrett’s
esophagus; SSBE, short-segment Barrett’s esophagus
Discussion
In the current study, five endoscopists reviewed 132 ME-NBI images of BE using the
JES-BE classification, and the diagnostic accuracy of this classification was relatively
high (73 %). Moreover, this study revealed that LGD and high-power magnification images
were significant risk factors for misdiagnosis.
The JES-BE classification was proposed in 2018 [15]. Although the diagnostic accuracy of this classification has not been evaluated
on a large scale, a review article described a validation study in which 156 images
were reviewed by 10 endoscopists [16]. This study reported sensitivity and specificity for this classification of 87 %
and 97 %, respectively. The high diagnostic accuracy evaluated in the review article
indicates that the JES-BE classification is practical in a clinical setting. However,
in our study, the diagnostic accuracy of this classification was lower. The main reason
may be the differences in the reviewers. The previous article included experts in
the field of esophageal cancer, which may have contributed to the high diagnostic
accuracy. On the other hand, our study did not include experts, and all reviewers
were general and non-expert endoscopists working in a city hospital. We believe that
our study reflects a real-world situation more accurately. Another reason for the
low diagnostic accuracy is that some patients in whom the diagnosis was extremely
difficult to make were included in this study. Moreover, because patients with difficult-to-diagnose
cases undergo biopsy preoperatively multiple times to check lesion demarcation, the
number of difficult-to-diagnose images would have increased. These are the main reasons
why the diagnostic accuracy in this study was lower than that in the previous study.
LGD was a significant risk factor for misdiagnosis compared to HGD and adenocarcinoma
in the current study (OR: 6.80, P = 0.0017). Dysplasia is defined as a neoplastic epithelium that is confined to the
basement membrane of the gland [21]
[22]. In the Vienna classification, LGD in BE is characterized by relative preservation
of glandular architecture but with cellular atypia, including nuclear hyperchromatism,
pleomorphism, mucin depletion and absence of goblet cells [17]. It is familiar to endoscopists and pathologists in Western countries, but not in
Eastern countries including Japan. Indeed, Japanese pathology guidelines for esophageal
cancer do not include a description of dysplasia in BE. Therefore, in the current
study conducted in Japan, pathological diagnosis of LGD was made carefully. Two expert
pathologists in the field of gastrointestinal cancer made the diagnosis of LGD while
reviewing the description and reference images for LGD in the Vienna classification.
Previous reports showed that histopathological and endoscopic diagnosis of LGD was
difficult [23]. The main reason for this is the presence of inflammation, as the non-dysplastic
epithelium in BE may mimic that of LGD [24]. Moreover, because LGD is a low-grade atypical tumor, it often shows no or weak
irregularity endoscopically [25]. Therefore, it may be difficult to differentiate LGD from non-dysplastic epithelium
even with ME-NBI. Endoscopists should recognize that LGD is a dysplastic lesion that
is difficult to differentiate from non-dysplastic epithelium in Barrett's mucosa.
Consequently, when encountering a difficult lesion to diagnose, we should actively
check pathological diagnosis based on biopsy specimen rather than trying to diagnose
with endoscopy alone. Adopting this practice may help reduce the number of misdiagnoses
of Barrett’s lesions.
In this study, obtaining images with high-power magnification was a significant risk
factor for misdiagnosis. According to the JES-BE classification, high-power magnifying
observation should be performed to evaluate a vascular pattern, and a mucosal pattern
should be assessed under low-power magnifying observation. That is because when Barrett's
mucosa is observed at high-power magnification, the target mucosa is stretched by
the scope, and the mucosal pattern becomes obscured. Unclear mucosal pattern may be
the reason why high-power magnification images were risk factor for misdiagnosis.
In this study, there was no significant difference in accuracy between diagnosis using
mucosal patterns alone and diagnosis including both mucosal and vascular patterns
([Table 2]). In other words, the additional effect of diagnosis of a vascular pattern was insignificant.
Based on these results, high-power magnification images may be better evaluated in
combination with low-power magnification images, or it might be better to consider
simplifying the diagnostic strategy of the JES-BE classification, such as using just
mucosal pattern.
Images captured in LSBE were more likely to be misdiagnosed than those in SSBE, but
there was no significant difference. Probst et al. reported that the R0 resection rate for BAC in LSBE was significantly lower than
that in SSBE (70.4 % vs. 90.0 %, P = 0.029) [26]. The main reason for this difference might be difficulty in recognizing lateral
extension of the lesion. LSBE is often associated with background inflammation due
to reflux esophagitis. Inflamed mucosa sometimes shows mild atypia, which makes it
extremely difficult to distinguish dysplastic mucosa from non-dysplastic mucosa [24]. However, some previous articles reported that R0 resection rates of ESD cases were
comparable between SSBE and LSBE [27]
[28]. Therefore, larger-scale prospective studies should be conducted to evaluate the
difficulty of endoscopic diagnosis of lesions in SSBE and LSBE.
The present study had some limitations. First, it was a single-center study conducted
in Asia. Thus, fewer patients with LSBE were enrolled, which led to the small number
of LSBE images. As a result, the odds ratio of 2.13 for LSBE was relatively high but
it caused a wide 95 % confidence interval and was not statistically significant. Further
studies at multiple institutions with large sample size are recommended. Second, because
the decision about whether an image is high or low magnification was made by a single
endoscopist who prepared the images, which may have added bias to the study. No fair
comparison may be possible, because both images were not selected from the same lesion
as paired views. In addition, some images were taken from the same patient, which
would lead to selection bias and affect the results of statistical analysis. Third,
easy- and difficult-to-diagnose images were defined based on the proportion of accurate
diagnoses (less than 40 % or more than 60 %). This means the definition of these two
groups were relatively determined, not by an absolute criterion. In addition, the
correct rate of 60 %, which was the definition of easy-to-diagnose images, is not
very high compared to that for diagnosis of early gastric cancer [29]. However, considering the difficulty of diagnosing BAC due to background inflammation
and existence of LGD, this criterion may be acceptable. Fourth, LGD is not well known
in Eastern countries, including Japan, either pathologically or endoscopically. In
addition, the diagnosis of LGD can vary among pathologists [30]. Therefore, in the current study, we asked the two pathologists to check the definition
of LGD in the Vienna classification every time before diagnosing LGD. As a result,
the diagnosis of LGD may be nearly identical.
Conclusions
In conclusion, this study elucidated risk factors for misdiagnosis of BE based on
the JES-BE classification. We found that LGD and high-power magnification images were
risk factors for misdiagnosis. Although a prospective validation study is necessary
to confirm these results, we believe that endoscopists keep these risk factors in
mind during endoscopic examination of patients suspected of having BE.