J Neurol Surg B Skull Base 2025; 86(S 01): S1-S576
DOI: 10.1055/s-0045-1803059
Presentation Abstracts
Podium Presentations
Oral Presentations

Use of Machine-Learning to Predict Gross Total Resection of Nonfunctioning Pituitary Adenomas: A Single-Institution Series

Benjamin B. Fixman
1   University of Southern California, Los Angeles, California, United States
,
Ishan Shah
1   University of Southern California, Los Angeles, California, United States
,
Apurva Prasad
1   University of Southern California, Los Angeles, California, United States
,
Gage A. Guerra
1   University of Southern California, Los Angeles, California, United States
,
Jeffrey J. Feng
1   University of Southern California, Los Angeles, California, United States
,
Robert G. Briggs
1   University of Southern California, Los Angeles, California, United States
,
Gabriel Zada
1   University of Southern California, Los Angeles, California, United States
› Institutsangaben
 
Zoom

Pituitary adenomas are a common intracranial neoplasm for which surgical resection remains the definitive treatment. Though the decision to operate is multifactorial, the likelihood of GTR is a particularly notable consideration. The rapid advancement of machine learning (ML) presents an opportunity to determine the likelihood of GTR based on preoperative characteristics and aid in preoperative planning for these patients.

A prospectively maintained database of 219 endoscopic operations completed by one surgeon at our institution between 2013 and 2023 was first analyzed with a logistic regression model for statistical inference of potentially predictive features. Four features, including gender, prior operation, tumor diameter, and Knosp score were selected for model training and prediction. Next, the dataset was stratified by outcome and split 80:20 (training/validation:testing). Various models, including logistic regression, K-nearest neighbors, random forest, and gradient boosting were trained, fivefold cross-validated, and hyperparameter tuned to optimize classification accuracy using mlr3 in R. Finally, the optimal model was trained on the full training/validation dataset and used to make predictions on the test set.

Gradient boosting, which employs successive iterations of decision trees trained on the residuals of previous trees, proved optimal at minimizing cross-validation error (0.27). On an independent test set, this model achieved an error of 0.23, sensitivity of 0.83, specificity of 0.64, AUC of 0.76, PPV of 0.83, and NPV of 0.64.

The use of ML to preoperatively predict probability of GTR may help surgeons and patients make informed decisions regarding risk and benefit of surgical resection. Our models performed well on an independent test set, achieving 77% accuracy. The PPV and NPV highlight their utility.

Table 1 Data characteristics

Gender

Reoperation

Diameter

Knosp score

GTR

Patients

N (%) [SD]

M: 113 (51.6)

No: 175 (79.9)

Median: 25 mm [9.93]

0: 33 (15.1)Low: 109 (49.8)High: 77 (35.2)

Yes: 147 (67.1)

219

Table 2 Inference model

Variable

Coefficient

p-Value

Gender

−0.59

0.098

Reoperation

−0.34

0.40

Maximal tumor diameter

−0.071

0.0010

Knosp (low)

0.58

0.27

Knosp (high)

−1.03

0.061

Table 3 5-Fold cross-validated scoring metrics of tuned models

Model

Error

Sensitivity

Specificity

AUC

Logistic regression

0.27

0.89

0.43

0.78

K nearest neighbors

0.29

0.90

0.34

0.74

Random forest

0.30

0.89

0.32

0.75

Gradient boosting

0.27

0.87

0.49

0.76

Table 4 Test performance of gradient boosting

Model

Error

Sensitivity

Specificity

AUC

PPV

NPV

Gradient boosting

0.23

0.83

0.64

0.76

0.83

0.64



Publikationsverlauf

Artikel online veröffentlicht:
07. Februar 2025

© 2025. Thieme. All rights reserved.

Georg Thieme Verlag KG
Oswald-Hesse-Straße 50, 70469 Stuttgart, Germany