Optimization and validation of 18F-DCFPyL PET radiomics-based machine learning models in intermediate- to high-risk primary prostate cancer

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

Introduction Radiomics extracted from prostate-specific membrane antigen (PSMA)-PET modeled with machine learning (ML) may be used for prediction of disease risk. However, validation of previously proposed approaches is lacking. We aimed to optimize and validate ML models based on 18F-DCFPyL-PET radiomics for the prediction of lymph-node involvement (LNI), extracapsular extension (ECE), and postoperative Gleason score (GS) in primary prostate cancer (PCa) patients. Methods Patients with intermediate- to high-risk PCa who underwent 18F-DCFPyL-PET/CT before radical prostatectomy with pelvic lymph-node dissection were evaluated. The training dataset included 72 patients, the internal validation dataset 24 patients, and the external validation dataset 27 patients. PSMA-avid intra-prostatic lesions were delineated semiautomatically on PET and 480 radiomics features were extracted. Conventional PET-metrics were derived for comparative analysis. Segmentation, preprocessing, and ML methods were optimized in repeated 5-fold cross-validation (CV) on the training dataset. The trained models were tested on the combined validation dataset. Combat harmonization was applied to external radiomics data. Model performance was assessed using the receiver-operatingcharacteristics curve (AUC). Results The CV-AUCs in the training dataset were 0.88, 0.79 and 0.84 for LNI, ECE, and GS, respectively. In the combined validation dataset, the ML models could significantly predict GS with an AUC of 0.78 (p<0.05). However, validation AUCs for LNI and ECE prediction were not significant (0.57 and 0.63, respectively). Conventional PET metrics-based models had comparable AUCs for LNI (0.59, p>0.05) and ECE (0.66, p>0.05), but a lower AUC for GS (0.73, p<0.05). In general, Combat harmonization improved external validation AUCs (-0.03 to +0.18). Conclusion In internal and external validation, 18F-DCFPyL-PET radiomics-based ML models predicted high postoperative GS but not LNI or ECE in intermediate- to high-risk PCa. Therefore, the clinical benefit seems to be limited. These results underline the need for external and/or multicenter validation of PET radiomics-based ML model analyses to assess their generalizability.

Original languageEnglish
Article numbere0293672
JournalPLOS ONE
Volume18
Issue number11 NOVEMBER
DOIs
Publication statusPublished - 1 Nov 2023

Cite this