Large language models forecast patient health trajectories enabling digital twins - Report - MDSpire

Large language models forecast patient health trajectories enabling digital twins

  • By

  • Nikita Makarov

  • Maria Bordukova

  • Papichaya Quengdaeng

  • Daniel Garger

  • Raul Rodriguez-Esteban

  • Fabian Schmich

  • Michael P. Menden

  • October 1, 2025

  • 0 min

Share

Predicting Patient Health Pathways Using Large Language Models for Digital Twins

Overview

The Digital Twin—Generative Pretrained Transformer (DT-GPT) leverages large language models to predict longitudinal clinical trajectories from electronic health records without data imputation. Benchmarking on datasets for non-small cell lung cancer, ICU patients, and Alzheimer’s disease demonstrated DT-GPT’s superior forecasting accuracy and explainability compared to state-of-the-art models.

Background

Clinical forecasting is critical for monitoring patient outcomes, guiding treatment decisions, and supporting drug development. Digital twins—virtual patient representations—enable personalized simulation of health trajectories by integrating diverse longitudinal data. Generative AI, particularly large language models (LLMs), offers promising capabilities for creating digital twins but faces challenges such as handling missing data and modeling complex variable interactions. DT-GPT addresses these challenges by fine-tuning LLMs on real-world electronic health record data to forecast multivariate clinical trajectories with improved accuracy and interpretability.

Data Highlights

DatasetScaled Mean Absolute Error Reduction
Non-Small Cell Lung Cancer3.4%
Intensive Care Unit1.3%
Alzheimer’s Disease1.8%

Key Findings

  • DT-GPT forecasts clinical variable trajectories across short-, medium-, and long-term horizons using EHR data without requiring imputation or normalization.
  • It outperforms state-of-the-art machine learning models by reducing scaled mean absolute error by up to 3.4% across diverse clinical datasets.
  • DT-GPT maintains the distributions and cross-correlations of clinical variables, capturing complex interdependencies inherent in patient data.
  • The model demonstrates zero-shot forecasting capabilities, enabling prediction of clinical variables without prior task-specific training.
  • Explainability is enhanced through a human-interpretable chatbot interface, facilitating clinical insight into model predictions.

Clinical Implications

DT-GPT’s ability to accurately predict longitudinal patient trajectories supports personalized medicine by enabling real-time monitoring and treatment optimization. Its zero-shot forecasting and interpretability features may facilitate clinical decision-making and adverse event mitigation without extensive retraining. This approach paves the way for integrating digital twins into clinical trials and routine care to improve patient outcomes.

Conclusion

DT-GPT represents a significant advancement in clinical forecasting by harnessing large language models to create interpretable digital twins that accurately predict patient health pathways. Its robust performance across multiple diseases highlights its potential as a versatile tool for precision medicine applications.

References

  1. Author/Source/2024 -- Predicting Patient Health Pathways through Large Language Models for Digital Twin Applications

Original Source(s)

Related Content