Assessing ChatGPT-4 as a clinical decision support tool in neuro-oncology radiotherapy: a prospective comparative study

By
Paolo Tini
Federica Novi
Flavio Donnini
Armando Perrella
Giulio Bagnacci
Maria Antonietta Mazzei
Giuseppe Minniti
October 15, 2025
0 min

Journal Of Neuro-Oncology

Overview

This prospective study evaluated ChatGPT-4’s ability to support radiotherapy decisions in 101 neuro-oncology cases. The AI showed high concordance with expert tumor board recommendations in low-complexity cases but demonstrated decreased accuracy in intermediate and high-complexity scenarios.

Background

Radiotherapy planning for central nervous system tumors requires integrating evidence-based guidelines with patient-specific factors, often involving multidisciplinary consensus. Errors in treatment planning can significantly affect outcomes and toxicity. Large language models like ChatGPT-4 have shown potential in medical knowledge tasks but their reliability in specialized domains such as neuro-oncology radiotherapy remains uncertain. This study aimed to assess ChatGPT-4’s concordance with expert decisions and clinical guidelines across varying case complexities.

Data Highlights

Case Complexity	Number of Cases	ChatGPT-4 Concordance with MTB (%)	Concordance with Guidelines (%)
Low Complexity	61	92%	95%
Intermediate Complexity	26	65%	N/A
High Complexity	14	43%	N/A

Key Findings

ChatGPT-4 achieved 92% concordance with tumor board decisions in low-complexity neuro-oncology radiotherapy cases.
Concordance dropped to 65% and 43% in intermediate and high-complexity cases, respectively, indicating challenges in ambiguous or guideline-lacking scenarios.
ChatGPT-4’s recommendations aligned closely with established guidelines (e.g., ESMO, NCCN) in straightforward cases.
Repeated queries after 30 days showed high consistency in ChatGPT-4’s responses, suggesting stable performance over time.
Discrepancies often involved under- or over-treatment recommendations, highlighting risks of AI hallucination or lack of nuanced clinical judgment.
Performance varied by tumor histology, with best results in glioblastoma and meningioma cases and lower accuracy in rare tumor types.

Clinical Implications

ChatGPT-4 can serve as a reliable adjunct for radiotherapy decision support in neuro-oncology when cases have clear guideline-based recommendations. However, caution is warranted in complex or borderline cases where AI may not capture nuanced clinical factors, underscoring the need for expert oversight. Integration of such AI tools should complement, not replace, multidisciplinary clinical judgment.

Conclusion

ChatGPT-4 demonstrates promising concordance with expert neuro-oncology radiotherapy decisions in low-complexity cases but shows limitations in more complex scenarios. These findings support cautious, supervised use of LLMs as decision support aids in specialized oncology workflows.

References

OpenAI/ChatGPT-4/2024 -- Evaluating ChatGPT-4 in Neuro-Oncology Radiotherapy Decision Support

Assessing ChatGPT-4 as a clinical decision support tool in neuro-oncology radiotherapy: a prospective comparative study

Clinical Report: ChatGPT-4 as Decision Support in Neuro-Oncology Radiotherapy

Overview

Background

Data Highlights

Key Findings

Clinical Implications

Conclusion

References

Original Source(s)

Assessing ChatGPT-4 as a clinical decision support tool in neuro-oncology radiotherapy: a prospective comparative study

Related Content

A Radiomics Model Utilizing CT Imaging to Forecast Pain Relief Following Radiotherapy in Bone Metastasis Patients: Findings from a Dual-Center Investigation

Combination of Anlotinib and Temozolomide Results in Sustained Response in Multimodal-Resistant Primary Breast Angiosarcoma with Metastasis to Skull Base and Dural Involvement: A Case Study

Induction Immunochemotherapy Followed by Concurrent Chemoradiotherapy Improves Survival in Unresectable Esophageal Cancer: a systematic review, meta-analysis, and network meta-analysis