Introduction
Gastroesophageal reflux (GER) is a common disorder with an approximate prevalence
of 10 – 20 % in the Western world [1 ]. The clinical manifestations of GER (i. e. heartburn and acid regurgitation) are
due to reflux of gastric content into the esophagus. The present theory for the pathogenesis
of GER is multifactorial involving the lower esophageal sphincter (LES), diaphragmatic
crus, esophageal acid clearance, gastric acid secretion, gastric emptying, and intra-abdominal
pressure [2 ]. The Montreal definition of gastroesophageal reflux disease (GERD) states that GERD
is present when the reflux of stomach contents causes troublesome symptoms and/or
complications [3 ]. Possible complications include esophagitis and Barrett’s esophagus. In Barrett’s
esophagus, the normal squamous epithelium of the distal esophagus has been replaced
by a columnar epithelium, giving rise to a columnar lined esophagus (CLE). The definition
of Barrett’s esophagus has always been controversial and no uniformly accepted criteria
exist. The most widespread definition of Barrett’s esophagus requires histologically
verified intestinal metaplasia in the segment of CLE [4 ].
The gastroesophageal junction (GEJ) is the anatomical area where the distal esophagus
joins the proximal stomach. Under normal conditions, it is located at the level of
the diaphragmatic crus. The location of the GEJ is however not static, and moves several
centimeters during swallowing and breathing [5 ]. During swallowing, the longitudinal smooth muscle of the esophagus contracts which
shortens the esophagus, resulting in a physiological herniation. The GEJ is later
returned to its original location by elastic supporting structures, especially by
the phrenoesophageal membrane. When the GEJ together with the LES and the gastric
cardia are permanently displaced upward into the thoracic cavity through the diaphragmatic
hiatus, a hiatal hernia is present [2 ]
[5 ]
[6 ]. Studies have shown that a hiatal hernia impairs the LES pressure and the sphincter
function of the diaphragm [7 ]
[8 ]
[9 ]. The presence and axial length of a hiatal hernia have also been shown to correlate
with the severity of GER [10 ].
Esophagogastroduodenoscopy (EGD) is the standard investigation method for assessing
the upper gastrointestinal tract. The competence of the mechanical anti-reflux barrier
can be evaluated endoscopically in two ways; one way is to measure the axial length
of any hiatal hernia present (between the hiatus and the GEJ). At endoscopy, the hiatus
is represented by the diaphragmatic pinch and the GEJ is defined by the proximal margin
of the gastric mucosal folds [11 ]. Due to the physiologic dynamics in this area, it can be difficult to measure the
length of a hiatal hernia [5 ]. It has also been shown that the interobserver agreement with regard to endoscopic
measurement is poor even under ideal conditions [12 ]. It is not clear at which length a hiatal hernia becomes clinically significant,
and since the GEJ is not static, most endoscopists use a 2 cm cutoff [6 ]. Another way to assess the GEJ is by grading the gastroesophageal flap valve (GEFV)
using the Hill classification ([Figs. 1 – 4 ]) [13 ]. Studies have shown an association between higher Hill grades and the frequency
of GERD [8 ]
[14 ]. Higher Hill grades are also associated with lower LES pressure [8 ], increased prevalence of hiatal hernia [15 ], and are able to predict poor response to proton pump inhibitor treatment [16 ]. The Hill classification has been proven to be reproducible and provides useful
information when evaluating patients with suspected GERD who are undergoing endoscopy
[8 ]. Esophagitis can be defined endoscopically and classified according to the Los Angeles
(LA) classification [17 ]. The extent of CLE can be evaluated in a standardized manner endoscopically by the
Z-line appearance (ZAP) classification. The ZAP classification has been proven highly
reproducible, and is associated both with the prevalence of intestinal metaplasia,
as well as with GERD [18 ]
[19 ]
[20 ].
Fig. 1 Hill Grade I: a prominent fold of tissue along the lesser curvature next to the endoscope.
Fig. 2 Hill Grade II: the fold is less prominent and there are periods of opening and rapid
closing around the endoscope.
Fig. 3 Hill Grade III: the fold is not prominent and the endoscope is not tightly gripped
by the tissue.
Fig. 4 Hill Grade IV: there is no fold, and the lumen of the esophagus is open, often allowing
the squamous epithelium to be viewed from below. A hiatal hernia is always present.
The aim of the present study was to investigate how the two ways of assessing the
competence of the mechanical anti-reflux barrier of the GEJ correlated with GERD.
We hypothesized that the Hill classification would be superior to measuring the axial
length of any hiatal hernia present in terms of association with GERD.
Materials and methods
A thorough description of the setting, population, endoscopy, and symptom evaluation
has been given elsewhere [21 ].
Study population
A population cohort in Östhammar, Sweden, has been studied for over two decades with
regard to gastrointestinal symptoms. The study population is representative of the
general Swedish population with regard to gender, age, income, and other potential
selection factors. In 1988, a questionnaire with regard to abdominal symptoms (ASQ)
[22 ] was sent by mail to the cohort for the first time. In 1989, 1995, and 2011 slightly
updated forms of questionnaires were sent to the same population. In 2012, the same
population was invited to participate in the present study. To be invited to the EGD,
the subjects had to have participated in the 2011 study. [Fig. 5 ] illustrates the study population and the dropouts. Exclusion criteria for EGD were
angina pectoris, myocardial infarction (in the last 6 months), congestive heart failure,
severe lung disease, severe liver disease, esophageal varices, treatment with anticoagulants,
need for anesthesia for the endoscopy, earlier surgery of the stomach, and those 80
years or above in age.
Fig. 5 Study flow chart, illustrating the study population and the dropouts.
Of the 388 individuals who completed the EGD, 54 were excluded because of missing
data, leaving 334 individuals in the present study.
Endoscopy
Each EGD was performed by one of five experienced endoscopists. The endoscopists were
scheduled to work at the research facility during different weeks. Research assistants,
who were unaware which endoscopist was going to perform the endoscopy, invited the
eligible participants to the endoscopy. Before the study, a consensus meeting led
by an external expert (Professor Lars Lundell) reviewed multiple video recordings
according to the study protocol. Each endoscopist was monitored on the first day by
the project leader (LA) and a part of the procedure (the hiatus and distal esophagus)
was video recorded.
A structured endoscopic protocol was used to investigate the esophagus, stomach, and
upper duodenum. Biopsies for histopathologic examination were taken according to a
strict protocol. The mechanical anti-reflux barrier of the gastroesophageal junction
was assessed in two ways:
The axial length of any hiatus hernia present: The axial length of the hiatal hernia
was defined as the distance between the GEJ and the hiatus. Using the hash marks on
the endoscope, the distance between the GEJ (represented by the transition from the
gastric folds to tubular esophagus) and the hiatus (represented by the diaphragmatic
pinch) was measured at the incisors. The axial length of the hiatal hernia was measured
in centimeters on withdrawal of the endoscope [23 ].
The GEFV or Hill classification was graded I – IV according to the Hill classification
[13 ] ([Figs. 1 – 4 ]).
If esophagitis was present, it was graded according to the Los Angeles (LA) classification
[17 ]. If columnar metaplasia was present, it was graded according to the ZAP classification
[20 ]:
Grade 0: The Z-line is sharp and circular.
Grade I: The Z-line is irregular and shows tongue-like protrusions and/or islands
of columnar epithelium.
Grade II: Distinct tongues of columnar epithelium < 3 cm can be seen.
Grade III: Distinct tongues of columnar epithelium, or a cephaled displacement of
the Z-line, > 3 cm, can be seen.
Definition of GERD
The Montreal Definition states that GERD is present when the reflux of stomach contents
causes troublesome symptoms and/or complications [3 ]. It has previously been shown that reflux symptoms present at least weekly impair
quality of life [24 ] and may thus be considered troublesome. The most common complications of GERD are
esophagitis and Barrett’s esophagus. Hence, for this study, GERD was defined as present
if (1) the subject reported acid regurgitation and/or heartburn on at least a weekly
basis, and/or (2) esophagitis or Barrett’s esophagus was present. Barrett’s esophagus
was considered present if intestinal metaplasia was found in a segment of ZAP Grade
II or III (i. e. distinct, obvious tongues of metaplastic, columnar-appearing epithelium).
Histology
For the present study, biopsies from the squamocolumnar junction were investigated.
To eliminate any doubt concerning the origin of the intestinal metaplasia, the intention
was to obtain biopsies from the Z-line containing both squamous and columnar epithelium.
The biopsy specimens were fixed in 4 % phosphate-buffered formaldehyde then processed
and embedded in paraffin wax. From each specimen, 4-µm sections were cut and stained
with hematoxylin/eosin and periodic acid – Schiff reagent. One of the investigators
(MV), a pathologist, blindly examined the stained sections. Intestinal metaplasia
was considered to be present if goblet cells were identified.
Statistics
Using logistic regression models, we compared the predictive power of the two endoscopic
measurements (hiatal hernia length and Hill classification) on GERD. The evaluation
of hiatal hernia was based on hiatal hernia as a continuous variable and as factor
variables categorized as (0, 1, 2, 3, 4, 5, ≥ 6 cm), (≤ 2 cm, > 2 cm) and (≤ 3 cm,
> 3 cm). The Hill classification was evaluated as a continuous variable (based on
category scores 1 – 4 representing grades I – IV) and as factor variables: grades
(I, II, III, IV) and (≤ II, ≥ III).
The evaluation of the Hill classification and hiatal hernia as a predictor of GERD
was performed in two steps.
The Akaike’s information criterion (AIC) and Bayesian information criterion (BIC)
were used to find the best candidate variable type for each of Hill and hiatal hernia
(from the different types of variables listed above), given no other variables in
the model. AIC and BIC penalize for the addition of parameters and thus we avoid that
the final candidate type of variable automatically is the one with the most parameters
(e. g. hiatal hernia categorized in 1 to ≥ 6 cm vs hiatal hernia dichotomized). For
each endoscopic measurement, a likelihood ratio chi-squared test was performed to
test if a model containing an endoscopic measurement statistically significantly contributed
to the model.
For each of the best candidate variable types of the Hill classification and hiatal
hernia, the predictive power was evaluated by constructing receiver-operating characteristic
(ROC) curves and calculating the area under curve (AUC) statistic with 95 % confidence
intervals. To assess how the results would generalize to an independent data set,
10-fold data cross-validations (removing 10 fold for the testing set and modeling
the remaining training set) of the AUC estimates were performed. Bias corrected and
accelerated (BSa) confidence intervals were estimated by bootstrapping the data and
repeating the cross-validation 1000 times. All two-sided P values < 0.05 were considered statistically significant.
Data analyses were performed with Stata/IC software (Stata Statistical Software: Release
13. StataCorp LP, College Station, TX, United States). Cross-validation was performed
in the statistical software package R (version 3.2.2, R Development Core Team, R Foundation
for Statistical Computing, Vienna, Austria).
Ethics
The 1989 study was approved by the Ethical Review Board of the Medical Faculty of
Uppsala University (Dnr. 1989 /220). Approval for the 2011 – 2012 study was obtained
from the Ethics Committee of Uppsala University (Dnr. 2010 /443), and all participants
gave their informed consent.
Results
A total of 334 subjects were included in the study. Out of these, 86 subjects were
found to have GERD according to the Montreal definition and 211 did not have GERD.
Thirty-seven subjects reported GER less frequently than on a weekly basis, and none
of these 37 were found to have esophagitis or Barrett’s esophagus. Since these 37
subjects reported symptoms suggestive of GER, but did not fulfill the criteria for
GERD, they were excluded from the final analysis so as to define the groups of subjects
with and without GERD more clearly ([Table 1 ]). The subjects with GERD had a mean Hill-grade of 2.7 and a mean hiatal hernia length
of 1.9 cm, while the subjects without GERD had a mean Hill-grade of 2.1 and a mean
hiatal hernia length of 1.3 cm. The 37 subjects with less frequent symptoms had values
in between those with and without GERD, with a mean Hill-grade of 2.2 and a mean hiatal
hernia length of 1.6 cm.
Table 1
Demographic data and endoscopic findings in the subjects with and without GERD, and
in the subjects with more infrequent symptoms of GER.
No GERD n = 211
%
GER n = 37
%
GERD n = 86
%
Female
112
53.1
20
54.0
42
48.8
Mean age, years Esophagitis Barrett’s esophagus
54.5 0 0
55.2 0 0
55.3 52 7
60.5 8.1
Hill I
72
34.1
12
32.4
12
14.0
Hill II
68
32.2
10
27.0
22
25.6
Hill III
47
22.3
9
24.3
33
38.4
Hill IV
24
11.4
6
16.2
19
22.1
Mean Hill-grade
2.1
2.2
2.7
Hiatal hernia
0 cm
70
33.2
12
32.4
19
22.1
1 cm
60
28.4
9
24.3
18
20.1
2 cm
48
22.8
7
18.9
23
26.7
3 cm
22
10.4
4
10.8
14
16.3
4 cm
8
3.8
3
8.1
8
9.3
5 cm
1
0.5
1
2.7
2
2.3
6 cm
1
0.5
0
0
1
1.2
7 cm
0
0
1
2.7
1
1.1
8 cm
1
0.5
0
0
0
0
Mean hiatal hernia, cm
1.3
1.6
1.9
GER, gastroesophageal reflux; GERD, gastroesophageal reflux disease; ZAP, Z-line appearance. All values except mean are given as n and (%). GERD was considered to be present if
(1) the subject reported acid regurgitation and/or heartburn on a weekly basis, and/or
(2) esophagitis or Barrett’s esophagus was present. Barrett’s esophagus was considered
present if intestinal metaplasia was found in a segment of ZAP Grade II or III (i. e.
distinct, obvious tongues of metaplastic, columnar-appearing epithelium). GER was
considered present if the subject reported acid regurgitation and/or heartburn less
frequently than on a weekly basis, without signs of esophagitis or Barrett’s esophagus.
The concordance between the two endoscopic classification systems is presented in
[Table 2 ] (Kendall Tau-β correlation coefficient: 0.38).
Table 2
Concordance between the Hill classification and the axial length of hiatal hernia.
Hiatal hernia,
Hill grade
cm
I
II
III
IV
Total
0
43
40
15
3
101
1
30
33
21
3
87
2
14
16
30
18
78
3
7
8
17
8
40
4
1
1
5
12
19
5
1
2
0
2
5
6
0
0
0
2
2
7
0
0
1
1
2
8
0
0
0
1
1
Total
96
100
89
49
334
Of the different candidate variables representing hiatal hernia from logistic regression,
the candidate with hiatal hernia as a continuous variable resulted in the best prediction
of GERD with AIC: 351 (likelihood ratio chi-squared test: P = 0.0013). Correspondingly, the Hill classification showed the best prediction of
GERD when the model was based on category scores (1 – 4) as a continuous variable
with AIC: 342 (likelihood ratio chi-squared test: P < 0.0001) ([Table 3 ]). Including this best-fitting Hill classification variable given that the corresponding
hiatal hernia variable was also in the model improved the fit of the model (likelihood
ratio chi-squared test: P = 0.0008 for including Hill as a continuous variable based on category scores). Conversely,
including the best-fitting variable of hiatal hernia given that the corresponding
Hill classification was also in the model did not improve the fit of the model (likelihood
ratio chi-squared test: P = 0.1393).
Table 3
Logistic regression with GERD as dependent variable and different ways of looking
at Hill grade (I – IV) and hiatal hernia length (cm), as independent variables. The
group of hiatal hernia ≥ 6 cm consisted of two 6-cm hiatal hernias, one 7-cm hiatal
hernia, and one 8-cm hiatal hernia.
P value
AIC-value
BIC-value
OR
95 %CI
Hiatal hernia (continuous)
0.0013
351.1
358.5
1.34
1.12 – 11.61
Hiatal hernia (ordinal)
0.0771
359.7
385.5
0 (reference)
1
1 cm
1.10
0.53 – 2.30
2 cm
1.76
0.87 – 3.59
3 cm
2.34
1.01 – 5.43
4 cm
3.68
1.22 – 11.10
5 cm
7.37
0.63 – 85.68
≥ 6 cm
3.68
0.48 – 27.90
Hiatal hernia ≥ 2 cm (dichotomous)
0.0035
352.9
360.3
2.12
1.28 – 3.53
Hiatal hernia ≥ 3 cm (dichotomous)
0.0055
353.7
361.1
2.34
1.29 – 4.22
Hill (continuous)
< 0.0001
342.0
349.4
1.75
1.36 – 2.27
Hill (ordinal)
0.0001
344.5
359.2
I (reference)
1
II
1.94
0.89 – 4.11
III
4.21
1.98 – 8.98
IV
4.75
2.01 – 11.20
Hill ≥ III (dichotomous)
< 0.0001
343.5
350.8
3.02
1.80 – 5.06
AIC, Akaike’s information criterion; BIC, Bayesian information criterion; GERD, gastroesophageal
reflux disease.
The predictive powers of the best-fitting model for each classification of hiatal
hernia and Hill are illustrated in [Fig. 6 ] by ROC curves. The area under the ROC curve was 0.61 (95 %CI 0.54 – 0.68) for hiatal
hernia and 0.65 (95 %CI 0.59 – 0.72) for the Hill classification. The difference between
the two AUC estimates was not statistically significant (P = 0.225). The corresponding 10-fold cross-validated estimates were 0.58 (95 %CI 0.51 – 0.65)
for hiatal hernia and 0.62 (95 %CI 0.53 – 0.68) for Hill.
Fig. 6 Receiver-operating characteristic (ROC) curves for each of the hiatal hernia and
Hill classifications that resulted in the best prediction of GERD (hiatal hernia as
a continuous variable and Hill as a continuous variable based on category scores).
Analysis of the endoscopists revealed that one endoscopist (Group A) did 125 endoscopies,
another endoscopist (Group B) did 114 endoscopies, and the remaining 95 endoscopies
were done by three endoscopists (Group C). The three groups did not differ with regard
to demographics or prevalence of GERD. One of the endoscopists rated both the Hill
grade and the axial length of a hiatal hernia significantly higher (Group B) than
the other two groups. Excluding the endoscopies done by this endoscopist did not change
the results with regard to the association between GERD and the two endoscopic classification
systems among the remaining endoscopists. The subjects in Group B did, however, only
reach a statistically significant association between GERD and the Hill classification,
and not with the axial length of a hiatal hernia.
The biopsies from the squamocolumnar junction contained both squamous and columnar
epithelium in 81 % of the subjects.
Discussion
The present study compares two different ways to assess the anti-reflux barrier of
the GEJ (axial length of a hiatal hernia and GEFV graded by the Hill classification)
endoscopically and the association between these two assessments and GERD. Of these
two techniques, the most widely used is to assess the presence and axial length of
any hiatal hernia. It has, however, been shown that the interobserver agreement with
regard to endoscopic length measurement is poor even under ideal conditions. As a
result of this, the concept of hiatal hernia length also suffers from this inherent
weakness. Furthermore, there is no consensus at which length the physiological movement
of the GEJ becomes a hiatal hernia.
An important strength of this study is that the same study population and sample (with
the same abdominal symptom questionnaire (ASQ) and esophagogastroduodenoscopy (EGD))
were used for both of the endoscopic grading methods. Because of this, we did not
adjust for other variables in the logistic regression models (i. e. the need to adjust
for confounding due to heterogeneous populations was taken care of by the design).
Confounding could still be an issue if the causal mechanisms between the two methods
and GERD differed. However, the axial length of a hiatal hernia and the Hill classification
evaluate the same mechanism but in different ways and what affects the hiatal hernia
length will probably also affect the Hill grade. Another strength was that the endoscopies
were done by rather a large group of endoscopists. Even though one of the endoscopists
rated the Hill grade and the axial hiatal hernia length differently, this did not
affect our main finding. The differences between endoscopists are a reflection of
the difficulty in the rating of hiatal hernia and Hill, something that affects the
predictive power of the endoscopic measurements. However, the comparison between hiatal
hernia and Hill as predictors should not be biased, as GERD was not associated with
endoscopist.
A weakness of the present study is the method used to define GERD; including a more
objective way of measuring GERD such as 24-h pH measurement might have been useful
as a complement to the anamnestic information used.
The axial hiatal hernia length and the Hill grade were tested in different ways to
determine the strongest predictive power for GERD. Comparing the association between
GERD and the two endoscopic classification systems based on logistic regression, the
Hill classification showed a slightly stronger association with GERD in our data with
AIC- and BIC-values generally lower than those for the axial length of a hiatal hernia
([Table 3 ]). The estimated AUC for Hill (0.65 {95 %CI 0.59 – 0.72]) was also higher than the
AUC for hiatal hernia (0.61 [95 %CI 0.54 – 0.68]). However, this study could not statistically
significantly verify that Hill was superior as a predictor of GERD compared to hiatal
hernia. Since the Hill classification has been shown to be highly reproducible, and
since it is a known fact that endoscopic length measurement is difficult even under
ideal conditions, it seems reasonable to consider using the Hill classification instead
of hiatal hernia length in assessment of the anti-reflux barrier.