Introduction
The size of colorectal polyps is associated with the risk that a polyp yields advanced
histologic features [1]
[2]. Moreover, a polyp size ≥ 10 mm is associated with an increased risk of metachronous
advanced neoplasia and colorectal cancer [3]. Therefore, accurate polyp size measurement, especially at the 10-mm threshold,
is important for polyp risk stratification and decision making regarding colonoscopy
surveillance intervals [4]
[5]. Polyp size also matters for deciding on optimal resection technique [6]
[7] and the safe implementation of the “leave-in-situ” and “resect-and-discard” optical
diagnosis strategies [8]
[9].
In daily practice, polyp size is mostly measured based on visual estimation by the
endoscopist. However, visual size estimation is known to be inaccurate and prone to
bias and interobserver variability [10]
[11]
[12]
[13]
[14]
[15]
[16]
[17]
[18]
[19]
[20], reportedly leading to considerable missizing at relevant size thresholds and inappropriate
surveillance recommendations for up to 35 % of polyps [10]
[12]
[19]. Using instruments of known size as a visual reference can improve polyp size measurement
accuracy and reduce interobserver variability, but is known to be time consuming and
costly [15]
[18]
[21].
Recently, a new virtual scale function (SCALE EYE; Fujifilm, Tokyo, Japan) was developed
that allows polyp size measurement through projection of an adaptive virtual scale
onto colorectal polyps during real-time endoscopy [22]. The size of the virtual scale is adapted based on the distance between the endoscope
tip and the polyp the endoscopist aims to measure. This distance is calculated using
an endoscope-integrated laser in combination with specific computer (image processing)
software ([Fig. 1]). The laser and the virtual scale can be simultaneously activated with a single
push of a button on the handle of the endoscope.
Fig. 1 Schematic overview of the principle of distance estimation using the virtual scale
endoscope. The reflection of the laser beam, as emitted from the endoscope, can be
used to determine distance between the endoscope tip and the object (mucosal wall
or polyp) the laser is positioned on based on the triangulation method. This information
is used to continuously adapt the size of the virtual scale: the size of the virtual
scale increases whenever the distance between the endoscope tip and the mucosal wall
(or polyp) is shrinking, and decreases as the endoscope tips moves farther away. To
enable real-time polyp size measurement, the virtual scale contains markings at 5,
10, and 20 mm.
Several (pre-)clinical studies have evaluated the performance of the virtual scale.
These studies showed superior measurement accuracy of virtual scale polyp size measurements
compared with both visual and instrument-aided polyp size measurements [17]
[18]
[19]
[20]
[21]
[22]
[23]. However, the clinical relevance of measurement accuracy as the primary outcome
measure in evaluating polyp size measurement methods can be debated due to the absence
of a robust reference standard for polyp size. To more thoroughly evaluate the clinical
potential of the virtual scale, we aimed to evaluate the virtual scale in terms of
variability and systematic differences. In addition, we evaluated feasibility of the
virtual scale in terms of measurement success rate, measurement duration, and user-friendliness.
Methods
Setting and study design
The video-based study was conducted according to the principles of the Declaration
of Helsinki and the Medical Research Involving Human Subjects Act (WMO) and was approved
by the Institutional Review Board of the Academic Medical Center, Amsterdam (2022.084).
The size of consecutively detected polyps was measured in screening colonoscopies.
All measurements were video recorded. Video recording extracts of these measurements
were later presented to a group of both expert and trainee endoscopists. All endoscopists
estimated the size of each polyp based on each of the three measurement methods: (1)
visual estimation (without the aid of a tool), (2) 9-mm polypectomy snare as a visual
reference, and (3) virtual scale as a visual reference. Polyps were also measured
at histopathologic analysis.
Colonoscopy procedures
Consecutive patients undergoing colonoscopy after a positive fecal immunochemical
test within the context of the Dutch national colorectal cancer screening program
at Bergman Clinics, Amsterdam were invited to participate in the study. All participants
provided written informed consent. Patients were recruited between October 2022 and
March 2023.
Colonoscopies were performed by four endoscopists (E.D., H.B., J.G., M.V.). All colonoscopies
were performed using the EC-760S-A/L endoscope (Fujifilm) in combination with the
EX-1 processor with EW10-VM01 software (Fujifilm), facilitating the use of the virtual
scale. Endoscopists had measured at least 10 polyps using the virtual scale before
the start of study inclusions.
As the upper limit of the virtual scale is 20 mm, only polyps of 20 mm or smaller,
according to initial visual size estimation by the endoscopist, were deemed eligible
for inclusion. Polyps were consecutively included, with the start of withdrawal within
the cecum as the starting point. Eligible polyps were endoscopically measured using
three methods and in similar order: (1) visual, (2) virtual scale, (3) snare ([Fig. 2], [Video 1]). Measurements were performed using high definition white-light endoscopy.
Fig. 2 Endoscopic polyp size measurements methods. a Visual measurement (without the aid of a tool). b Measurement with the aid of the linear virtual scale. c Measurement with the aid of the circular virtual scale. d Measurement with the aid of a polypectomy snare of known size (maximum width of 9
mm).
Video 1 Examples of endoscopic polyp size measurements using different measurement methods.
In consecutive order: (1) visual measurement (without the aid of a tool); (2) measurement
with the aid of the linear and circular virtual scale; (3) measurement with the aid
of a polypectomy snare of known size (maximum width 9 mm).
Visual measurements were performed without the use of a tool as a visual reference.
For virtual scale measurements, use of either the linear virtual scale, circular virtual
scale, or both, was left to the discretion of the endoscopist. Duration of virtual
scale measurements, defined as the time from activation of the virtual scale until
the first image freeze with the virtual scale in a feasible position to estimate polyp
size (i. e. on the left center edge of the polyp), was recorded. If no successful
measurement could be performed within 180 seconds, the measurement was recorded as
failed. For snare measurements, the Exacto Cold Snare (Steris, Mentor, Ohio, USA)
with a maximum width of 9 mm (see Fig. 1 s, in the online-only Supplementary material) was used as a visual reference. The snare
was positioned around the polyp while fully extended. If it was not possible to adequately
position the snare around the polyp, the snare was positioned adjacent to the polyp. Snare
measurements were recorded as failed if the snare could not be fully extended with
both the polyp and complete snare clearly visible. All polyps were resected after
snare measurement using the Exacto Cold Snare. Resection specimens were retrieved
through suctioning and collected using a two-drawer polyp trap (ENDO-SAFIER Polyp
Trap; Suzhou GZM Medical Co., Suzhou, China).
Video-based assessment of polyp size
For each polyp, three 10–15-second video extracts were derived from the video recordings.
Each extract included the measurement of a polyp using one of the three measurement
methods. The video extracts were incorporated into an online survey environment (Fig. 2 s) and distributed over six surveys. Each survey contained 60 videos. All videos within
each survey only showed polyp size measurements using one of the three measurement
methods. The order of polyps within each survey was determined using a randomization
tool [24].
All video extracts were presented to eight dedicated colorectal gastroenterologists
specialized in endoscopic diagnosis and treatment of early colorectal cancer and its
precursor lesions (experts), as well as nine gastroenterology residents following
endoscopy training with between 2 and 4 years of endoscopy experience (trainees).
Before participation, all participants completed an e-learning program on use and
interpretation of the virtual scale.
For each video, endoscopists were asked to report the size of the polyp displayed
within the video. Size was reported in mm and at 1-mm increments. Playback and pausing
of videos were allowed, as this mimics (re)positioning of the endoscope and image
freezing during clinical procedures. To reduce the risk of recognition bias, intervals
of at least 2 weeks were maintained between surveys containing video extracts of the
same polyps. For each polyp, videos were presented to all endoscopists in a standardized
order: (1) visual, (2) snare, (3) virtual scale. Endoscopists were also asked to grade
the quality of each video for assessment of polyp size as “good,” “sufficient,” or
“insufficient.” Endoscopists were blinded to the size as determined during clinical
assessment and assessments by other endoscopists, and received no information on any
results until completion of all surveys.
Histopathologic polyp size measurement
Resection specimens of included polyps were collected in formalin and sent to the
pathology laboratory. Resection specimens were measured by a dedicated gastrointestinal
pathologist using a conventional ruler (macroscopic measurement) and during light
microscope examination after embedment in paraffin and sectioning (microscopic measurement).
In cases where the resection specimen was fragmented, this was recorded. Sizes were
reported in mm at 1-mm increments.
Statistical analysis
Continuous data are presented as mean with SD in cases of normally distributed data,
and as median with interquartile range (IQR) in cases of non-normally distributed
data. Categorical data are presented as count with percentages.
To estimate method-specific variance (as measure of variability), as well as systematic
differences between methods, we used a mixed linear model (MLM). Our MLM included
polyp size as the dependent variable, method-specific intercept as fixed effect, polyp-specific
and endoscopist-specific intercepts as random effects, with a random slope to account
for method-specific variance across different endoscopists. To test for a significant
difference in variance between methods, we used the generalized likelihood ratio test
statistic, comparing the model as described to a similar model presuming equal method-specific
variances. In cases of significant differences in variance, MLM analyses were repeated
on datasets comprising data of only two measurement methods. This way, any difference
in variance between two specific methods could be evaluated in detail. To meet MLM
assumptions of normality and homoscedasticity, log-transformed polyp size measurements
were used for all hypothesis tests. To ease interpretation of the results, we report
estimated variances and systematic differences without such transformation.
For subgroup analyses involving diminutive and nondiminutive polyps, mean polyp size
according to snare measurements by expert endoscopists was used to distribute polyps
over different size categories. For subgroup analyses with exclusion of polyps with
videos of insufficient quality, polyps for which at least one video was rated as of
insufficient quality by six or more (≥ 35 %) endoscopists were excluded.
We assessed uniformity of polyp size classification as the percentage of polyps assigned
to the same size category (≤ 5 mm, 6–9 mm, ≥ 10 mm) by all endoscopists with each
endoscopic measurement method. Differences in measured size between the various endoscopic
measurement methods were illustrated using Bland–Altman plots [25]. Mean differences between endoscopic measurements and histopathologic measurements
were calculated using the mean polyp size according to expert endoscopists for each
endoscopic method. Polyps for which macroscopic and microscopic polyp size were not
both available, as well as polyps that were resected in piecemeal or for which the
histopathologic specimen was fragmented, were excluded from these analyses.
All analyses were performed using R version 4.2.1 (R Foundation for Statistical Computing,
Vienna, Austria). Two-sided P values of < 0.05 were considered statistically significant.
Sample size calculation
We based our sample size calculation on measurements by expert endoscopists, anticipating
a 20 % or larger difference in variance between measurement methods. To reach the
desired statistical power of 80 %, sample size calculation using a two-sided F-test
for difference in variance with an alpha of 0.05 showed that inclusion of a total
of 947 polyp measurements per method was required. When involving eight expert endoscopists,
this implied inclusion of at least 119 unique polyps. To facilitate a balanced distribution
of polyps over the six study surveys, we decided to extent the sample size to 120
polyps.
Results
A total of 120 polyps, detected during screening colonoscopy in 52 patients (median
age 65 [IQR 60–71] years; male 62 %) were included (Table 1 s, Fig. 3 s). The success rate for virtual scale measurements was 95 %, which was comparable
to the success rate for snare measurements (97 %). Median virtual scale measurement
duration was 17 (IQR 8–33) seconds (Table 2 s). Median measurement duration was lower for the last 30 included polyps compared
with the first 30 included polyps (12 [IQR 5–23] vs. 22 [IQR 14–44] seconds). Median
measurement durations for flat and nonflat polyps were comparable (15 [IQR 6–30] vs.
18 [IQR 10–33] seconds). Five endoscopists who used the virtual scale in clinical
practice graded virtual scale endoscope user-friendliness on average at 5 on a 10-point
scale (with 1 representing the worst user-friendliness and 10 representing the best
user-friendliness).
Video-based assessment of polyp size
Eight experts (median endoscopy experience 13 [IQR 8–21] years) and nine trainees
(median endoscopy experience 3 [IQR 2–4] years) completed all study surveys. For 97/120
polyps (81 %), all three measurement videos were assessed as being of sufficient to
good quality. The percentage of videos with insufficient quality was comparable between
the three measurement methods (Table 3 s).
Differences in variability for endoscopic measurements methods
Method-specific variances are shown in [Table 1]. Variance for virtual scale measurements was significantly lower than for visual
and snare measurements for both experts (0.52 [95 %CI 0.47 to 0.57] vs. 1.96 [95 %CI
1.88 to 2.06] and 1.59 [95 %CI 1.50 to 1.67), P < 0.001) and trainees (0.59 [95 %CI 0.54 to 0.63] vs. 2.21 [95 %CI 2.12 to 2.30]
and 2.53 [95 %CI 2.43 to 2.62], P < 0.001). One-to-one comparisons of method-specific variances showed statistically
significant differences in all cases, with the exception of comparison of variance
for visual vs. snare measurements for trainees.
Table 1
Variance for different endoscopic polyp size measurement methods as estimated using
mixed linear model analyses.
|
Group
|
No. of measurements per method
|
Variance (95 %CI)[1]
|
|
Visual
|
Snare
|
Virtual scale
|
|
Experts
|
960
|
1.96 (1.88 to 2.06)
|
1.59 (1.50 to 1.67)
|
0.52 (0.47 to 0.57)
|
|
Trainees
|
1080
|
2.21 (2.12 to 2.30)
|
2.53 (2.43 to 2.62)
|
0.59 (0.54 to 0.63)
|
|
All
|
2040
|
2.06 (1.99 to 2.12)
|
2.02 (1.96 to 2.09)
|
0.58 (0.52 to 0.64)
|
Difference in variance between methods (visual vs. snare, visual vs. virtual scale,
and snare vs. virtual scale) was statistically significant in all cases (P < 0.05), with the exception of the difference in variance between visual and snare
measurements for trainees (P = 0.21).
1 Variances are based on the squared deviations from the mean and are therefore reported
in the square of the units of the original data (mm²). Standard deviations, expressed
in the original units of the polyp size measurements (mm), can be calculated by taking
the square root of the reported variances.
Subgroup analyses were conducted to evaluate variances within specific polyp subgroups.
Polyps were subdivided based on polyp size (≤ 5 mm and > 5 mm polyps), morphology
(flat and nonflat polyps), and the virtual scale used (linear or circular). These
analyses showed similar results compared with the main analyses: variance for virtual
scale measurements was lower than for visual and snare measurements in all cases,
for both experts and trainees (Tables 4s–6 s). When excluding polyps with videos of insufficient quality, 97 polyps remained.
Analyses involving this subgroup also revealed lower variance for virtual scale measurements
compared with visual and snare measurements in both endoscopist groups (Table 7 s).
Mean and systematic differences between measurement methods
Mean differences in polyp size between the three endoscopic measurement methods, based
on assessments by expert endoscopists, are illustrated in Bland–Altman plots ([Fig. 3]). These plots indicate that, on average, snare measurements resulted in the largest
polyp size, respectively followed by virtual scale and visual measurements. Systematic
differences between methods, as estimated using MLM analyses, are shown in [Table 2]. For expert endoscopists, virtual scale measurements generally resulted in larger
polyp size compared with visual measurements (+ 0.11 mm [95 %CI 0.00 to 0.23]) and
smaller polyp size compared with snare measurements (–0.09 mm [95 %CI –0.21 to 0.02]).
Fig. 3 Bland–Altman plots illustrating the differences between polyp size measurements by
expert endoscopists using different measurement methods. Within these plots, the polyp
size according to two different measurement methods (x axis) is plotted against the difference in polyp size according to these methods
(y axis). Each plot comprises 960 observations, representing measurements of 120 polyps
by eight different expert endoscopists. Count for each point within the plot is indicated
by the legend on the top of each plot. Dotted blue lines represent mean differences.
Dotted red lines represent upper and lower limits of the 95 %CIs. The plots represent
the following methods (y axis): a Visual and snare measurements. b Visual and virtual scale measurements. c Snare and virtual scale measurements.
Table 2
Systematic differences between different endoscopic polyp size measurement methods
as estimated using mixed linear model analyses
|
Operator (no. of measurements per method)
|
Method one
|
Method two
|
Mean difference (95 %CI), mm[1]
|
|
Experts (n = 960)
|
|
Visual
|
Snare
|
+ 0.21 (0.09 to 0.33)
|
|
Visual
|
Virtual scale
|
+ 0.11 (0.00 to 0.23)
|
|
Snare
|
Virtual scale
|
–0.09 (-0.21 to 0.02)
|
|
Trainees (n = 1080)
|
|
Visual
|
Snare
|
+ 0.36 (0.23 to 0.48)
|
|
Visual
|
Virtual scale
|
+ 0.21 (0.08 to 0.33)
|
|
Snare
|
Virtual scale
|
–0.15 (-0.27 to –0.03)
|
|
All (n = 2040)
|
|
Visual
|
Snare
|
+ 0.29 (0.20 to 0.37)
|
|
Visual
|
Virtual scale
|
+ 0.16 (0.08 to 0.25)
|
|
Snare
|
Virtual scale
|
–0.13 (-0.21 to –0.04)
|
1 Mean difference between polyp size measurements using method two compared with polyp
size measurements using method one.
Mean differences between endoscopic and histopathologic measurement methods, based
on the analysis of 71 nonfragmented polyps with both macroscopic and microscopic histopathologic
measurement available, are shown in Table 8 s.
Uniformity of polyp size classification for the different measurement methods
Uniformity of polyp size classification was assessed through distribution of polyps
over three size categories (≤ 5 mm, 6–9 mm, ≥ 10 mm) based on assessments by the individual
endoscopists. The percentage of polyps assigned to the same size category by all endoscopists
was higher for virtual scale measurements compared with visual and snare measurements
for both experts (69 % vs. 55 % and 59 %), trainees (67 % vs. 51 % and 47 %), and
all endoscopists (58 % vs. 48 % and 43 %) ([Fig. 4a, ]
Table 9 s).
Fig. 4 Bar plots. a The percentage of polyps assigned to the same category (≤ 5 mm, 6–9 mm, ≥ 10 mm)
by all endoscopists using different polyp size measurements methods. b The maximum difference between individual endoscopists regarding the percentage of
polyps assigned to the ≥ 10-mm size category.
Use of the virtual scale resulted in more uniform decision making around the 10-mm
threshold for both experts and trainees, as demonstrated by the maximum differences
between individual endoscopists in the total number of polyps assigned to the ≥ 10-mm
size category; this difference was lower for virtual scale measurements than for visual
and snare measurements. For experts, based on analyses with 120 included polyps, a
maximum difference regarding the number of polyps assigned to the ≥ 10-mm size category
of 2 polyps (1.7 %) was found for virtual scale measurement, compared with 12 polyps
(10.0 %) for visual measurement and 6 polyps (5.0 %) for snare measurement. For trainees,
the maximum differences for virtual scale, visual, and snare measurement respectively
concerned 3 (2.5 %), 8 (6.7 %), and 14 (11.7 %) polyps ([Fig. 4b
]). A more detailed overview of polyp distribution over the different size categories
by individual endoscopists is shown in Table 10 s.
Discussion
This video-based study showed that use of the virtual scale resulted in lower polyp
size measurement variability compared with visual and snare measurements, for both
endoscopy experts and trainees. This resulted in more uniform assignment of polyps
to different size categories by individual endoscopists and a reduction in the differences
in the number of polyps assigned to the ≥ 10-mm size category. Moreover, estimated
systematic differences for virtual scale measurements compared with other methods
were small (< 0.5 mm).
Most studies evaluating polyp size measurement methods use measurement accuracy as
the primary outcome measure. However, the clinical value of measurement accuracy is
compromised by the lack of a robust reference standard for polyp size. Frequently
used reference standards are the measured size of (fresh) resection specimens [12]
[18]
[19] or size as endoscopically determined by in situ comparison of polyp size with the
size of an instrument of known size [13]
[14]
[15]
[26]
[27]. A limitation of the use of resection specimen size as a reference standard is that
resection specimens can become fragmented or lost. Moreover, the size of resection
specimens might not represent real polyp size as these are prone to deformation due
to vascular collapse, compression, cauterization, and suction [28], as well as shrinkage due to formalin fixation [29]
[30]
[31]. Instrument-aided measurements can be erroneous due to problems with adequate positioning
of instruments with the largest diameter in a perpendicular view toward the polyp. Instrument-aided
measurements are also prone to bias owing to changed instrument proportions due to
mechanic factors (e. g. compression, deformation) and distortion or warping of instruments
on the endoscopy monitor due to the endoscope fisheye lens structure [32]. Our study illustrates that the latter may especially affect less experienced endoscopists,
while variability for snare measurements exceeded variability for visual measurements
for trainees. Given the drawbacks of comparing accuracies based on suboptimal reference
standards, we chose to evaluate the different methods for polyp size measurement in
terms of variability and systematic differences.
Lower measurement variability indicates that results of repeated measurements are
concentrated closer around their mean. Use of measurement methods associated with
lower measurement variability, such as the virtual scale in our study, will therefore
result in a reduction of significant outliers and a lower rate of observer disagreement
compared with measurement methods associated with higher measurement variability [33]. In our study, lower measurement variability for virtual scale measurements resulted
in an increase in uniformity of polyp size category assignment up to 20 % compared
with the other methods. The clinical relevance of the more uniform polyp sizing using
the virtual scale is more specifically illustrated through the reduction in the percentual
difference in the total number of polyps assigned to the ≥ 10-mm size category between
individual endoscopists (i. e. experts: 10.0 % for visual to 1.7 % for virtual scale;
trainees: 11.7 % for snare to 2.5 % for virtual scale). Sizing a polyp either under
or over the 10-mm threshold may result in defining an adenoma as advanced (≥ 10 mm)
or nonadvanced (< 10 mm), which in most guidelines implies a difference between either
a 3– or 10-year surveillance interval [4]
[5]. As such, more consistent polyp sizing around the 10-mm threshold using the virtual
scale could aid in preventing unnecessary (early) colonoscopies and erroneously delayed
surveillance.
Despite benefits of the virtual scale in terms of measurement variability, this does
not directly imply that virtual scale measurements also yield a high measurement accuracy.
The small systematic differences (< 0.5 mm) as found in our study, in combination
with results of preliminary preclinical studies evaluating performance of the virtual
scale on artificial polyps (maximum measurement errors of 0.7 mm and relative accuracies
of 82 %–84 %) [17]
[20]
[21]
[22]
[23], do however support the idea that virtual scale measurements generally provide reliable
estimates of real polyp size.
General benefits of the virtual scale include the fact that it is an intuitive push-a-button
tool, which is easy to use and does not require additional (disposable) instruments.
In addition, as shown in our study, virtual scale measurement is possible for the
vast majority of polyps and can be performed in under a minute for 90 % of polyps.
Although we did not compare virtual scale and snare measurement duration, the measurement
duration when using the virtual scale is suggested to be comparable to other instrument-aided
measurements in an ex vivo setting [21]. Nonetheless, there are also several factors compromising the usability of the virtual
scale. First, maneuvering the virtual scale (endoscope) into a perpendicular position
toward the polyp can be a tedious process and is not always possible, especially for
polyps located in difficult positions (e. g. within folds). This is the main reason
endoscopists graded user-friendliness on average at 5 on a 10-point scale. Moreover,
the virtual scale only includes markings at 5, 10, and 20 mm, which hampers measurement
of larger polyps. Finally, virtual scale measurements still require interpretation
by physicians, thereby not completely ruling out bias due to interobserver variability.
In the future, the feasibility of the virtual scale as an alternative reference standard
for clinical polyp size measurement should be evaluated considering its current limitations.
This is particularly relevant given that artificial intelligence (AI) has already
been proposed as an alternative for automated polyp size measurement. AI might facilitate
polyp size measurements without bias due to human factors. However, the lack of robust
datasets with ground truth information is currently still complicating development
of AI-based polyp measurement systems [34]
[35]. In the context of lower measurement variability and proven measurement accuracy
of the virtual scale on artificial polyps (of which exact size is known), the virtual
scale might however open doors to development of certain datasets and further development
of AI-based approaches.
To the best of our knowledge, this is the first study involving the virtual scale
to evaluate multiple endoscopic polyp size measurement methods in terms of variability
and systematic differences. Due to the prospective inclusion of consecutive polyps
of different morphologic subtypes and sizes in a real-time clinical setting, our study
provides realistic insights into the clinical potential of the virtual scale. In addition,
both endoscopy experts and trainees participated in the study. To prevent recognition
bias, we maintained intervals of at least 2 weeks between assessments of different
measurements of the same polyps and applied video sequence randomization within surveys.
The generalizability of our study may however be compromised by the fact that the
majority of polyps were ≤ 10 mm, and by the fact that participating trainee endoscopists
had 2–4 years of endoscopy experience. Therefore, future studies involving more polyps
over 10 mm in size and studies exploring the specific benefits of the virtual scale
for novice endoscopists are still warranted. Moreover, while four endoscopists performed
all study measurements, parameters such as measurement duration were largely dependent
on the endoscopy skills of these endoscopists. Finally, polyp measurements were conducted
based on video extracts with a maximum duration of 15 seconds. Although assessments
based on videos are likely to be more accurate than assessments based on still images,
the use of such short video extracts will always induce some bias. Nonetheless, analyses
with exclusion of polyps with videos of insufficient quality showed results comparable
to the results of the main analyses. This indicates that lower video quality was not
the ground cause for observed differences in variability.
In conclusion, this study showed that use of the virtual scale led to lower polyp
size measurement variability and more uniform polyp sizing by individual endoscopists
compared with other measurement methods. Therefore, use of the virtual scale in daily
clinical practice could reduce the risk of polyp under- or oversizing and polyp misclassification
at relevant size thresholds. This could support better clinical decision making processes
involving polyp size.