Journal Article DKFZ-2025-01343

http://join2-wiki.gsi.de/foswiki/pub/Main/Artwork/join2_logo100x88.png
End-to-end prediction of clinical outcomes in head and neck squamous cell carcinoma with foundation model-based multiple instance learning.

 ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;

2025
Springer London

BMC artificial intelligence 1(1), 3 () [10.1186/s44398-025-00003-8]  GO

Abstract: Foundation models have shown promise in medical AI by learning flexible features from large datasets, offering new opportunities for improving endpoint prediction. However, usage of foundation models for endpoint prediction using routine imaging in head and neck squamous cell carcinoma patients remains unexplored. Within this study, we evaluated the potential of foundation-model based multiple instance learning for prediction of 2-year overall survival, locoregional control and freedom from distant metastasis across three external head and neck squamous cell carcinoma patient cohorts using 2D, multiview and 3D approaches while comparing prediction and stratification performance with handcrafted radiomics and clinical baselines.2D multiple-instance learning models achieved 2-year test area under the receiver-operator curve (AUROC) range of 0.75-0.84 for 2-year overall survival, 0.66-0.75 for 2-year locoregional control and 0.71-0.78 for 2-year freedom from distant metastasis across three different external cohorts, outperforming multiview and 3D multiple instance learning models (AUROC range: 0.50-0.77, p ≥ 0.15) and showing comparable or superior performance to handcrafted radiomics (AUROC range: 0.64-0.74, p ≥ 0.012). Significant stratification was observed from the 2D MIL models (hazard ratios: 2.14-4.77, p ≤ 0.039). 2D MIL models were also shown to learn endpoint-specific correlation patterns such as N-stage for 2-year freedom from distant metastasis prognosis. Multimodal enhancement of 2-year OS/FFDM (AUROC range: 0.82-0.87, p ≤ 0.018) for patients without human papilloma virus positive tumors.FM-based 2D MIL demonstrates promise in HNSCC risk prediction as well as stratification of clinical outcomes. The models match or outperform radiomics baselines, learning clinically-related patterns and showing enhancement of clinical baselines in non-human papilloma virus positive patients.The online version contains supplementary material available at 10.1186/s44398-025-00003-8.

Keyword(s): Foundation models ; Head and neck cancer ; Multimodality ; Prognosis ; Radiomics


Note: BMC Artificial Intelligence (BMC Artif. Intell.) = 3005-1924

Contributing Institute(s):
  1. DKTK Koordinierungsstelle Dresden (DD01)
  2. Digitale Prävention, Diagnostik und Therapiesteuerung (C140)
Research Program(s):
  1. 313 - Krebsrisikofaktoren und Prävention (POF4-313) (POF4-313)

Appears in the scientific report 2025
Click to display QR Code for this record

The record appears in these collections:
Document types > Articles > Journal Article
Public records
Publications database

 Record created 2025-07-16, last modified 2025-08-01


Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)