Preliminary evaluation of DeepSeek-R1 and GPT-5.3 in selected PET/CT clinical scenarios: patient preparation, report interpretation, and diagnostic reasoning

By
Runze Duan
Jing Pang
Lu Zheng
Ziyu Guo
Tianyue Li
Yanzhu Bian
Yujing Hu
June 11, 2026
0 min

Frontiers In Medicine

Overview

This study evaluates the performance of DeepSeek-R1 and GPT-5.3 in clinical scenarios involving PET/CT.

Background

The integration of [18F]FDG PET/CT imaging is increasingly utilized in clinical practice, necessitating efficient tools to assist nuclear medicine professionals. This study assesses the clinical applicability of DeepSeek-R1 as a cost-effective AI assistant compared to GPT-5.3.

Data Highlights

Model	Appropriateness	Helpfulness	Empathy	Inconsistency	Valid References
DeepSeek-R1	94.9%	100%	91.7%	7.7%	37%
GPT-5.3	94.9%	100%	66.7%	5.1%	33%

Key Findings

DeepSeek-R1 achieved 94.9% appropriateness and 100% helpfulness across 39 tasks.
91.7% of DeepSeek-R1's responses to follow-up inquiries were rated empathetic.
DeepSeek-R1 had a 7.7% inconsistency rate, primarily in tumor staging.
GPT-5.3 showed a lower inconsistency rate of 5.1% but lower empathy at 66.7%.
Both models had a primary diagnosis accuracy of 10% and differential diagnosis accuracy of 60% for difficult cases.
37% of DeepSeek-R1's cited references were fully valid, compared to 33% for GPT-5.3.

Clinical Implications

The findings suggest that while both DeepSeek-R1 and GPT-5.3 can assist in clinical scenarios, they cannot replace clinicians due to reference validity issues and potential inconsistencies. DeepSeek-R1 may serve as a cost-effective auxiliary tool in nuclear medicine.

Conclusion

DeepSeek-R1 and GPT-5.3 exhibit complementary strengths but face challenges with reference validity and consistency.

Preliminary evaluation of DeepSeek-R1 and GPT-5.3 in selected PET/CT clinical scenarios: patient preparation, report interpretation, and diagnostic reasoning

Clinical Report: Initial Assessment of DeepSeek-R1 and GPT-5.3 in PET/CT

Overview

Background

Data Highlights

Key Findings

Clinical Implications

Conclusion

Related Resources & Content

Original Source(s)

Preliminary evaluation of DeepSeek-R1 and GPT-5.3 in selected PET/CT clinical scenarios: patient preparation, report interpretation, and diagnostic reasoning

Related Content

Dynamic consent framework for low-dose CT scan lung cancer screening: autonomy, privacy, ethical data management

Case Report: Fatal pneumonitis caused by Camrelizumab and Erlotinib in a patient with metastatic pancreatic cancer

Multimodal treatment and long-term survival in a rare case of melanoma with pancreatic and splenic metastases: a case report and literature review