CancerLLM: a large language model in cancer domain

By
Mingchen Li
Zaifu Zhan
Jiatan Huang
Jeremy Yeung
Kai Ding
Anne Blaes
Steven Johnson
Hongfang Liu
Hua Xu
Rui Zhang
February 20, 2026
0 min

Npj Digital Medicine

Objective:

To develop a specialized language model for cancer phenotyping and diagnosis that reduces computational burden while improving performance.

Key Findings:

CancerLLM achieved an F1 score of 91.78% on phenotyping extraction.
The model scored 86.81% on diagnosis generation.
CancerLLM outperformed existing LLMs with an average F1 score improvement of 9.23%.
Demonstrated efficiency in time and GPU usage compared to other LLMs.

Interpretation:

CancerLLM represents a significant advancement in the application of language models in oncology, providing robust and efficient tools for clinical research and practice.

Limitations:

The model's performance is based on internal benchmarks and may require external validation.
The dataset used for training may not encompass all cancer types or variations.

Conclusion:

CancerLLM has the potential to enhance clinical decision-making and research in oncology through its specialized capabilities.

CancerLLM: a large language model in cancer domain

Objective:

Key Findings:

Interpretation:

Limitations:

Conclusion:

Original Source(s)

CancerLLM: a large language model in cancer domain

Related Content

Tumor-localized CD40 agonism with MP0317, a FAP x CD40 DARPin, reprograms the tumor microenvironment in patients with advanced solid tumors: an open-label, nonrandomized, dose-escalation phase 1 study

Silencing TMED2 suppresses cell growth and tumor progression in diffuse large B-cell lymphoma via inducing G0/G1 cell cycle arrest

MYC Gene Amplification Frequently Observed in Stomach and Gastroesophageal Junction Cancers, Associated with Male Gender and Diminished Neoadjuvant Treatment Efficacy