Key words next-generation sequencing (NGS) - quantitative reverse-transcription polymerase chain
reaction (qRT-PCR) - aldosterone-producing adenoma (APA) - cortisol-producing adenoma
(CPA) - adrenal cortical carcinoma (ACC) - FFPE
Introduction
The adrenal cortex plays an important role in maintenance of metabolic homeostasis
through the production of steroid hormones, which help regulate mineral and
carbohydrate balance [1 ]. Inappropriate
adrenal production of these hormones leads to primary aldosteronism (PA) in the case
of aldosterone [2 ] and Cushing’s
syndrome (CS) in the case of cortisol [3 ].
Somatic gene mutations resulting in the formation of benign, hormone-secreting
adrenal cortical tumors [i. e., aldosterone-producing adenomas (APA) or
cortisol-producing adenomas (CPA)] are a frequent underlying cause of PA [4 ] and CS [5 ]. Somatic mutations and other genomic alterations are also common in
adrenal cortical carcinomas (ACC) – rare malignant adrenal cortical tumors,
which may be associated with excess steroid hormone production [6 ]. While the distinction between adenoma and
carcinoma is critical for the clinical management of adrenal cortical tumors, these
entities may demonstrate overlapping clinical and pathologic features, suggesting
a
potential need for ancillary molecular tools to facilitate accurate diagnosis.
Prior genome-wide gene expression studies suggest that APA, CPA, and ACC have
distinct transcriptomic profiles [6 ]
[7 ]
[8 ];
however, the relative rarity of adrenal cortical tumors (ACC, in particular) in
surgical cohorts has been a limiting factor for the study of these tumors.
Furthermore, until recently, comprehensive molecular profiling was essentially
limited to high-quality fresh or frozen tissue samples, which are not routinely
available for most clinical adrenal cortical tumor specimens. Our group has
pioneered the use of targeted amplicon-based next-generation DNA sequencing (DNAseq)
for formalin-fixed paraffin-embedded (FFPE) tissue to characterize the spectrum of
somatic mutations in APA [9 ] and
aldosterone-producing cell clusters (APCC) [10 ] in routine clinical adrenal specimens. We have also recently
developed and validated targeted amplicon-based next-generation RNA sequencing
(RNAseq) approaches for transcriptomic profiling of FFPE tissue and applied these
methods to explore transcriptomic heterogeneity in clinical bladder and prostate
cancer specimens [11 ]
[12 ].
In this study, we describe the development and validation of a targeted RNAseq assay
for transcriptomic analysis of adrenal tumors using clinical-grade FFPE specimens
and demonstrate that this approach has potential utility for differentiating among
benign from malignant adrenal cortical tumors.
Materials and Methods
This study was approved by the Institutional Review Board at the University of
Michigan (HUM00083056) with a waiver of informed consent. The experimental approach
is outlined in [Fig. 1 ] and described in
detail below.
Fig. 1 Immunohistochemistry (IHC)-guided capture of formalin-fixed
paraffin-embedded (FFPE) adrenal tissue for targeted next-generation RNA
sequencing (RNAseq) analysis: Serial 5 μm sections were generated
from archival FFPE adrenal specimens, and IHC for CYP11B2
(aldosterone-producing adenomas) or CYP17A1 (cortisol-producing adenomas)
was performed on the first slide to identify the area for transcriptomic
profiling. Using the IHC-stained slide as a guide, the outlined area was
captured from multiple unstained slides using a scalpel and a dissecting
microscope. [For adrenal cortical carcinoma (ACC) specimens, all FFPE tissue
on unstained slides was captured (see Materials and Methods for details)].
RNA was extracted from pooled captured FFPE tissue, and after sample-level
quality control (QC) with quantitative reverse transcription PCR (qRT-PCR),
the FFPE-extracted RNA was utilized as input for targeted amplicon-based
RNAseq using Ion Torrent technology. Log2-transformed read-level data was
normalized to a set of housekeeping genes for bioinformatics analysis.
Cohort selection
Clinical adrenal specimens from patients who underwent adrenalectomy at Michigan
Medicine were identified from surgical pathology databases and/or
prospectively-maintained clinical adrenal tumor databases. The clinical
diagnosis of PA and CS was made according to the Endocrine Society Clinical
Practice Guidelines or institutional consensus available at the time. For all
cases, available H & E slides and ancillary material was reviewed by an
experienced endocrine pathologist (T.J.G.) to confirm the diagnosis and select
material for analysis, and the corresponding FFPE tissue blocks were retrieved
for subsequent molecular studies. Using 5 μm FFPE sections, CYP11B2
(aldosterone synthase) and CYP17A1 (17α-hydroxylase)
immunohistochemistry (IHC) was performed for APA and CPA specimens,
respectively, as described previously [9 ].
Relative to the adjacent normal adrenal cortical tissue, APA were required to be
CYP11B2 IHC positive, while CPA were required to be CYP17A1 IHC positive. ACC
specimens were included based on histopathologic review alone, without relative
steroidogenic enzyme expression analysis. Clinical information for the cohort
and available mutational data for APA and CPA samples is available in Table 1S .
Sample preparation and RNA extraction
For APA and CPA specimens, CYP11B2 and CYP17A1 IHC staining, respectively, was
used to guide tissue sampling; IHC positive areas were outlined in ink on eight
serial unstained slides and FFPE tissue was scraped for RNA extraction using a
scalpel under a dissecting microscope. In contrast, for ACC specimens, all FFPE
tissue on unstained slides was scraped for RNA extraction. Scraped FFPE tissue
was stored at 4 °C in microcentrifuge tubes prior to RNA extraction
using the AllPrep DNA/RNA FFPE Kit (Qiagen, Hilden, Germany).
Quantitative reverse transcription PCR (qRT-PCR)
For each sample, 200 ng of FFPE-extracted RNA was reverse transcribed using the
Applied Biosystems High-Capacity cDNA Reverse Transcription Kit (Thermo Fisher
Scientific, Waltham, MA, USA). Quantitative PCR (qPCR) was performed using an
Applied Biosystems StepOnePlus Real-Time PCR System (Thermo Fisher Scientific)
with commercially-available or custom-designed TaqMan primers and the TaqMan
Fast Universal PCR Master Mix II (Thermo Fisher Scientific) using standard
conditions (see Table 2S for qPCR assay
information). Threshold cycle (Ct) values were calculated from the resulting
fluorescence curves. PPIA was utilized for sample quality control (QC),
and samples with highly-degraded RNA [based on a Ct (PPIA ) >31]
were excluded from subsequent analyses. ACTB was used as a housekeeping
gene for normalization. For each sample, log2-normalized expression values were
calculated as follows: Ct (ACTB )–Ct (gene of interest). [Samples
with a Ct (gene of interest) ≥40 were assigned a Ct value of 40].
Targeted RNAseq
FFPE-extracted RNA was quantitated using a Qubit fluorometer (Thermo Fisher
Scientific), and for each sample, up to 15 ng of FFPE-extracted RNA was reverse
transcribed using SuperScript VILO (Thermo Fisher Scientific). Amplicon
libraries were then generated for each sample from the resulting cDNA using the
Ion AmpliSeq Library Kit Plus (Thermo Fisher Scientific) and a custom
adrenal-specific AmpliSeq RNA panel, which was designed using the Ion AmpliSeq
Designer (Thermo Fisher Scientific). This custom AmpliSeq RNA panel targets 194
genes, including adrenal and adrenal tumor-related transcripts associated with
development, differentiation, tumorigenesis, and steroidogenesis; the panel also
targets a number of housekeeping genes for sample QC and normalization. Target
gene and amplicon information is available in Table 3S . Amplicon libraries were quantitated on a QuantStudio 12K
Flex (Thermo Fisher Scientific) using the Ion Library TaqMan Quantitation Kit
(Thermo Fisher Scientific) prior to templating with an Ion Chef System (Thermo
Fisher Scientific) and next-generation sequencing with an Ion S5 or Ion
GeneStudio S5 Prime System (Thermo Fisher Scientific).
Next-generation sequencing (NGS) data analysis
NGS read data were processed and aligned using Torrent Suite (Thermo Fisher
Scientific), and for each sample, amplicon-level read count data were generated
using the coverageAnalysis plugin. Samples with less than 500 000 total reads
and/or less than 55% end-to-end (E2E) reads were excluded from
subsequent analyses. Amplicon-level E2E read counts were utilized to generate
gene expression estimates, and amplicons without at least 100 E2E reads in at
least two samples were excluded from subsequent analyses. For initial analysis
of housekeeping genes, log2-transformed amplicon-level E2E read counts were
normalized to sample-level total E2E read counts. Potential housekeeping genes
with high variance across samples (σ >1), high relative average
expression levels (Z-score >1.5), and/or low correlation with
other housekeeping genes (average Pearson correlation coefficient <0.4)
were excluded, leaving a final set of six housekeeping genes for normalization:
BANF1 , CFL1 , NUDFA2 , PSMB4 , RPN1 , and
SUMO2 . For all subsequent analyses, log2-transformed amplicon-level
E2E read counts were normalized to the mean of these six housekeeping genes. For
each sample, a “Proliferation Score” was calculated as:
Proliferation Score=log base 2 {2 ^ [mean (median-centered expression
value of all cell cycle/proliferation transcripts)]}
The following cell cycle/proliferation transcripts were included in the
Proliferation Score calculation: ASPM , BUB1B , CDK1 ,
DLGAP5 , KIAA0101 , MKI67 , PBK , PRC1 ,
RRM2 , TOP2A , and UBE2C . Raw Proliferation Scores were
converted to percentile rank for subsequent visualization and analysis.
Statistical analyses
Exploratory pairwise differential expression analysis of log2-normalized
expression values derived from targeted RNAseq data was performed using a
standard parametric approach (i. e., Student’s t -test)
with Benjamini–Hochberg post-hoc correction for multiple hypothesis
testing [false discovery rate (FDR) <5%]. Pairwise
differentially expressed genes (DEGs) were explored using the Gene List Analysis
tool from PANTHER [13 ]. Subsequently, DEGs
with the potential for distinguishing among tumor types were identified from
targeted RNAseq data using differential expression analysis with a standard
parametric approach (i. e., one-way ANOVA), Bonferroni post-hoc
correction for multiple hypothesis testing, and Tukey’s range test for
significance among pairwise comparisons. Log2-normalized expression values
derived from qRT-PCR data and Proliferation Scores were compared using one-way
ANOVA, and single-gene correlation between paired RNAseq- and qRT-PCR-derived
log2-normalized expression values was assessed using the Pearson correlation
coefficient and Student’s t -test. Differences in clinical
variables across tumor types within the study cohort were examined using
standard statistical methods. Adjusted or unadjusted p-values <0.05 were
considered statistically significant (as applicable). All statistical analyses
were performed in Excel (Microsoft, Redmond, WA, USA) using the XLSTAT add-on
(Addinsoft, Paris, France).
Results
A novel targeted RNAseq assay generates biologically relevant transcriptomic
information from FFPE adrenal tumor specimens
A total of 32 adrenal cortical tumor samples (10 APA, 11 CPA, and 11 ACC) were
selected for initial validation of the RNAseq assay. Of these, 31
(96.8%) passed the PPIA qRT-PCR QC threshold, and 27
(87.1%) passed subsequent RNAseq QC criteria (see Materials and Methods
for details). These 27 samples formed the final study cohort, and included 8 APA
[mean age=50.6 years (range=41–64 years); male-to-female
ratio (M:F)=3:1], 11 CPA [mean age=46.7 years
(range=33–62 years); M:F=1:10], and 8 ACC [mean
age=55.8 years (range=37–83 years); M:F=1:1]
(see Table 1S for details). While a
statistically significant difference in sex distribution across the tumor types
was detected (p <0.05), there was no significant difference in age
distribution (p >0.05). The majority of the included FFPE specimens were
less than 5 years old; however, a subset of the specimens was more than 10 years
old. Median E2E RNAseq reads were 3 082 034 [interquartile range (IQR)=2
128 180–3 742 391], and median %E2E RNAseq reads were
90.81% (IQR=89.61–91.56%) (see Table 4S
for details). Of the 194 amplicons that comprise our targeted adrenal-focused
RNAseq panel, 185 passed initial QC criteria and were included for analysis in
this study (see Materials and Methods for details). Raw read counts and
log2-normalized expression values are available in Table 5S and Table
6S , respectively. Amplicon-level analysis across all samples
demonstrated expected patterns of gene expression correlation (i. e.,
cell cycle/proliferation transcripts, adrenal medulla transcripts, etc.;
see Fig. 1S ). Similarly, sample-level principal component analysis (PCA)
utilizing all gene expression values highlighted discrete clustering of tumor
types using the first two principal components (see Fig. 2S ). Finally,
unsupervised clustering of median-centered gene expression values revealed
distinct transcriptional modules associated with specific tumor subtypes
(i. e., high expression of cell cycle/proliferation genes in
ACC, high expression of subsets of metabolic enzymes in APA and CPA, etc.; see
Fig. 3S ).
Targeted RNAseq can differentiate among benign and malignant adrenal cortical
tumors
These data support the ability of our targeted RNAseq panel to robustly detect
differential gene expression among FFPE adrenal cortical tumor specimens. Given
the potential overlap of clinical and pathologic findings in benign and
malignant adrenal cortical tumors, we hypothesized that the ability to detect
unique transcriptomic signatures may facilitate accurate classification of these
tumors. To identify DEGs from our RNAseq data that may be informative for tumor
type classification, we utilized a separate differential expression analysis
approach with conservative post-hoc corrections for multiple hypothesis testing
and multiple comparisons. This approach identified 40 DEGs across all tumor
types, including CYP11B2 , IGF2 , and AVPR1A (see Table 11S ), and unsupervised hierarchical
clustering of all samples with this subset of 40 DEGs revealed three discrete
sample-level clusters representing APA, CPA, and ACC tumors with corresponding
unique amplicon-level gene expression clusters (see [Fig. 3a ]). Importantly, despite a
significant difference in sex distribution among the tumor types in our cohort,
no sex-related differential gene expression was detected, suggesting that the
observed transcriptomic differences are intrinsic to tumor types. Finally, as
many of the ACC-high DEGs correspond to cell cycle/proliferation
transcripts (e. g., PBK , TOP2A , etc.), a median-weighted
expression average of these transcripts [i. e., “Proliferation
Score”] was calculated for each sample (see Materials and Methods for
details), and as expected, ACC showed significantly higher Proliferation Scores
than APA and CPA (p <0.001) (see [Fig.
3b ]). Indeed, sample-level integration of IGF2 expression and
Proliferation Scores revealed a discrete ACC sample cluster with high
IGF2 expression values and Proliferation Scores (see [Fig. 3c ]). Interestingly, one sample
(“ACC1”) – corresponding to a low-grade ACC specimen
– showed intermediate IGF2 expression but a comparatively high
Proliferation Score, highlighting the potential utility of this integrated
transcriptome-based approach to tumor classification.
Fig. 2 Orthogonal validation of differential gene expression
identified by targeted next-generation RNA sequencing (RNAseq) of archival
formalin-fixed paraffin-embedded (FFPE) adrenal tumor tissue: a and
c Box plots of log2-normalized CYP11B2 and IGF2
expression values generated from targeted RNAseq data highlight differential
gene expression across adrenal tumor types [adrenal cortical carcinoma
(ACC), cortisol-producing adenoma (CPA), aldosterone-producing adenoma
(APA)] (p <0.001), which is confirmed by quantitative reverse
transcription PCR (qRT-PCR) analysis (p <0.001). b and
d Scatter plots of paired RNAseq- and qRT-PCR-derived
log2-normalized CYP11B2 and IGF2 expression values demonstrate
high correlation across adrenal tumor types (Pearson correlation coefficient
> 0.92; p <0.001). ACC: yellow; CPA: blue; and APA: red.
Fig. 3 Targeted next-generation RNA sequencing (RNAseq) of
archival formalin-fixed paraffin-embedded (FFPE) adrenal tumor tissue
generates biologically and clinically relevant transcriptomic data:
a : Unsupervised hierarchical clustering of median-centered
log2-normalized RNAseq expression values for 40 differentially-expressed
genes reveals three discrete sample-level clusters representing adrenal
cortical carcinoma (ACC), cortisol-producing adenoma (CPA), and
aldosterone-producing adenoma (APA) with corresponding unique gene-level
expression clusters. Range=–5 to 5 (blue to red).
b : Box plots of calculated Proliferation Score (percentile
rank) generated from targeted RNAseq data confirm the high level of cell
cycle/proliferation transcript expression in ACC compared to CPA and APA
(p <0.001). c : Integrated visualization of calculated
Proliferation Score (percentile rank; x -axis) and log2-normalized
IGF2 expression values (y -axis) generated from
targeted RNAseq data demonstrate a discrete ACC sample cluster with high
IGF2 and cell cycle/proliferation transcript expression; a
low-grade ACC sample (ACC1) showed intermediate IGF2 expression
with a comparatively high Proliferation Score.
Discussion and Conclusions
Discussion and Conclusions
The development and application of robust NGS approaches to study molecular
alterations in archival FFPE tissue specimens has been integral to our understanding
of the molecular basis of human disease, as the ability to characterize increasing
numbers of rare tumors (e. g., ACC) and/or larger, more diverse
cohorts of common tumors (e. g., APA and CPA) has shed new light on genomic
heterogeneity across human neoplasia. Indeed, over the past decade, our group has
pioneered the use of IHC-guided tissue macrodissection and targeted amplicon-based
DNAseq with clinical FFPE adrenal specimens to characterize the genetic causes of
PA, including APA and APCC [9 ]
[10 ]
[14 ]
[15 ]
[16 ]
[17 ].
It was just under ten years ago that the first somatic aldosterone-driver mutations
(in KCNJ5 ) were described in APA; however, with the recent identification of
somatic CACNA1H and CLCN2 mutations in small subsets of APA [14 ]
[18 ],
aldosterone-driver mutations can now be detected in over 90% of APA.
Similar to the progress made in our understanding of the genomic basis for adrenal
cortical tumors over the past decade, the ability to obtain robust transcriptomic
data in a high-throughput manner from archival FFPE specimens has the potential to
facilitate breakthroughs for translational research of these tumors. In this study,
we validated a novel adrenal-focused targeted RNAseq assay for use with FFPE
clinical adrenal specimens. Despite targeting only 194 genes and utilizing only up
to 15 ng of FFPE-extracted RNA, we demonstrated the ability of this approach to
generate biologically relevant transcriptomic information and highlighted its
potential clinical utility for the classification of benign and malignant adrenal
cortical tumors.
First, utilizing pairwise differential expression analysis, we identified distinct
DEG subsets among the different tumor types: APA-high genes were associated with
GPCR signaling pathways; CPA-high genes were associated with the IGF signaling
pathway and sex hormone biosynthesis; and, ACC-high genes were associated with the
IGF signaling pathway, nucleotide biosynthesis, DNA replication, and the p53 and
ubiquitin proteasome pathways. We also showed that, as expected, ACC had
significantly higher indicators of cellular proliferation (i. e.,
Proliferation Score) than APA and CPA. These results support previously published
genome-wide gene expression data indicating that APA, CPA, and ACC have distinct
transcriptomic profiles [6 ]
[7 ]
[8 ].
One recent study explored transcriptomic differences among APA with different
aldosterone-driver mutations [19 ]; however,
most of the differences were observed between tumors with and without CTNNB1
mutations – which are present in only a small subset of APA. In contrast,
relatively few gene expression differences were detected among APA with other
aldosterone-driver mutations (i. e., KCNJ5 , ATP1A1 ,
ATP2B3 , and CACNA1D ) – although the total number of tumors
profiled was relatively small, and not all aldosterone-driver mutation subgroups
were adequately represented. Ongoing work in our laboratory is focused on applying
our targeted RNAseq approach to analyze potential transcriptomic differences across
the APA genomic subtypes; if necessary to identify non-targeted DEGs among these
tumors, we have also pioneered the application of amplicon-based RNAseq for
whole-transcriptome profiling using archival FFPE tissue [20 ].
We also identified a subset of high-confident DEGs among APA, CPA, and ACC that was
able to accurately group samples by tumor type using an unbiased approach
(i. e., unsupervised hierarchical clustering). Similar to prior genome-wide
microarray-based studies, the ACC transcriptome is distinct from that of adrenal
cortical adenomas (ACA; i. e., APA and CPA) and, in particular, shows
IGF2 upregulation with increased expression of cell
cycle/proliferation transcripts [6 ]
[7 ]
[8 ]
[21 ]
[22 ]
[23 ]. While clearly not the primary focus of
this initial validation study, these data suggest that our targeted RNAseq assay may
have clinical utility for the classification of adrenal cortical tumors. This is
particularly intriguing given its low input RNA requirements (up to 15 ng) and
ability to generate robust transcriptomic information from clinical FFPE specimens.
While the distinction between ACA and ACC is frequently straightforward based on
well-established clinicopathologic parameters (i. e., modified Weiss
criteria, etc.), the diagnosis of ACC may be challenging in low-grade tumors with
oncocytic features, as well as in minute core biopsy specimens. This potential issue
is highlighted by the ACC1 sample in our dataset. Pathologically, ACC1 is a
low-grade ACC with a low mitotic count (less than 20 per 50 high-power fields);
while our targeted RNAseq data shows that its IGF2 expression and
Proliferation Score are low relative to other ACC in the cohort, integrative
analysis clearly supports its classification as ACC (see [Fig. 3a, c ]). Indeed, previous studies have
introduced the notion that this kind of integrative analysis is a possible approach
to improve classification and prognostication of low-grade adrenal cortical tumors
[24 ]
[25 ]. While this possibility remains specifically untested in the current
study, ACC1 is an example which supports the use of such an integrative approach in
classification of low-grade adrenal cortical tumors, and this is an area of future
translational research interest for our group.
Despite initial validation of our targeted RNAseq panel, the relatively small number
of samples analyzed in this study is a clear limitation, and additional
transcriptomic profiling of APA, CPA, and ACC samples is warranted to confirm the
robustness of this approach. Similarly, the small number of genes targeted by the
panel limits its potential utility as a discovery tool; however, as noted above, our
group has pioneered amplicon-based whole-transcriptome RNAseq using archival FFPE
tissue, which could be applied in future studies.
In conclusion, our data support the use of targeted amplicon-based RNAseq for
comprehensive transcriptomic profiling of archival FFPE adrenal tumor material and
indicate that this approach may facilitate important translational research
opportunities for the study of these tumors.