Assessing ChatGPT-4 as a clinical decision support tool in neuro-oncology radiotherapy: a prospective comparative study - Report - MDSpire

Assessing ChatGPT-4 as a clinical decision support tool in neuro-oncology radiotherapy: a prospective comparative study

  • By

  • Paolo Tini

  • Federica Novi

  • Flavio Donnini

  • Armando Perrella

  • Giulio Bagnacci

  • Maria Antonietta Mazzei

  • Giuseppe Minniti

  • October 15, 2025

  • 0 min

Share

Clinical Report: ChatGPT-4 as Decision Support in Neuro-Oncology Radiotherapy

Overview

This prospective study evaluated ChatGPT-4’s ability to support radiotherapy decisions in 101 neuro-oncology cases. The AI showed high concordance with expert tumor board recommendations in low-complexity cases but demonstrated decreased accuracy in intermediate and high-complexity scenarios.

Background

Radiotherapy planning for central nervous system tumors requires integrating evidence-based guidelines with patient-specific factors, often involving multidisciplinary consensus. Errors in treatment planning can significantly affect outcomes and toxicity. Large language models like ChatGPT-4 have shown potential in medical knowledge tasks but their reliability in specialized domains such as neuro-oncology radiotherapy remains uncertain. This study aimed to assess ChatGPT-4’s concordance with expert decisions and clinical guidelines across varying case complexities.

Data Highlights

Case ComplexityNumber of CasesChatGPT-4 Concordance with MTB (%)Concordance with Guidelines (%)
Low Complexity6192%95%
Intermediate Complexity2665%N/A
High Complexity1443%N/A

Key Findings

  • ChatGPT-4 achieved 92% concordance with tumor board decisions in low-complexity neuro-oncology radiotherapy cases.
  • Concordance dropped to 65% and 43% in intermediate and high-complexity cases, respectively, indicating challenges in ambiguous or guideline-lacking scenarios.
  • ChatGPT-4’s recommendations aligned closely with established guidelines (e.g., ESMO, NCCN) in straightforward cases.
  • Repeated queries after 30 days showed high consistency in ChatGPT-4’s responses, suggesting stable performance over time.
  • Discrepancies often involved under- or over-treatment recommendations, highlighting risks of AI hallucination or lack of nuanced clinical judgment.
  • Performance varied by tumor histology, with best results in glioblastoma and meningioma cases and lower accuracy in rare tumor types.

Clinical Implications

ChatGPT-4 can serve as a reliable adjunct for radiotherapy decision support in neuro-oncology when cases have clear guideline-based recommendations. However, caution is warranted in complex or borderline cases where AI may not capture nuanced clinical factors, underscoring the need for expert oversight. Integration of such AI tools should complement, not replace, multidisciplinary clinical judgment.

Conclusion

ChatGPT-4 demonstrates promising concordance with expert neuro-oncology radiotherapy decisions in low-complexity cases but shows limitations in more complex scenarios. These findings support cautious, supervised use of LLMs as decision support aids in specialized oncology workflows.

References

  1. OpenAI/ChatGPT-4/2024 -- Evaluating ChatGPT-4 in Neuro-Oncology Radiotherapy Decision Support

Original Source(s)

Related Content