Journal Article DKFZ-2025-01925

http://join2-wiki.gsi.de/foswiki/pub/Main/Artwork/join2_logo100x88.png
Learning the natural history of human disease with generative transformers.

 ;  ;  ;  ;  ;  ;  ;

2025
Nature Publ. Group London [u.a.]

Nature 647(8088), 248-256 () [10.1038/s41586-025-09529-3]
 GO

This record in other databases:  

Please use a persistent id in citations: doi:

Abstract: Decision-making in healthcare relies on understanding patients' past and current health states to predict and, ultimately, change their future course1-3. Artificial intelligence (AI) methods promise to aid this task by learning patterns of disease progression from large corpora of health records4,5. However, their potential has not been fully investigated at scale. Here we modify the GPT6 (generative pretrained transformer) architecture to model the progression and competing nature of human diseases. We train this model, Delphi-2M, on data from 0.4 million UK Biobank participants and validate it using external data from 1.9 million Danish individuals with no change in parameters. Delphi-2M predicts the rates of more than 1,000 diseases, conditional on each individual's past disease history, with accuracy comparable to that of existing single-disease models. Delphi-2M's generative nature also enables sampling of synthetic future health trajectories, providing meaningful estimates of potential disease burden for up to 20 years, and enabling the training of AI models that have never seen actual data. Explainable AI methods7 provide insights into Delphi-2M's predictions, revealing clusters of co-morbidities within and across disease chapters and their time-dependent consequences on future health, but also highlight biases learnt from training data. In summary, transformer-based models appear to be well suited for predictive and generative health-related tasks, are applicable to population-scale datasets and provide insights into temporal dependencies between disease events, potentially improving the understanding of personalized health risks and informing precision medicine approaches.

Classification:

Note: #EA:B450#LA:B450# / 2025 Nov;647(8088):248-256

Contributing Institute(s):
  1. Künstl. Intelligenz in der Onkologie (B450)
Research Program(s):
  1. 312 - Funktionelle und strukturelle Genomforschung (POF4-312) (POF4-312)

Appears in the scientific report 2025
Database coverage:
Medline ; BIOSIS Previews ; Biological Abstracts ; Chemical Reactions ; Clarivate Analytics Master Journal List ; Current Contents - Agriculture, Biology and Environmental Sciences ; Current Contents - Life Sciences ; Current Contents - Physical, Chemical and Earth Sciences ; DEAL Nature ; Ebsco Academic Search ; Essential Science Indicators ; IF >= 60 ; Index Chemicus ; JCR ; SCOPUS ; Science Citation Index Expanded ; Web of Science Core Collection ; Zoological Record
Click to display QR Code for this record

The record appears in these collections:
Document types > Articles > Journal Article
Public records
Publications database

 Record created 2025-09-18, last modified 2025-11-06



Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)