Utility of large language models as information tools for nursing care in gout: a comparative study of DeepSeek and ChatGPT - Report - MDSpire

Utility of large language models as information tools for nursing care in gout: a comparative study of DeepSeek and ChatGPT

  • By

  • Xia Pan

  • Yali Wang

  • QiaoLan Yang

  • Jing Wang

  • Yun Tong

  • Duanfeng Zhang

  • Xiaofeng Lv

  • Chun Zheng

  • Miaoyin Wu

  • Tianwang Li

  • Li Tang

  • Zhengping Huang

  • May 7, 2026

  • 0 min

Share

Clinical Report: Evaluating the Effectiveness of DeepSeek-R1 and ChatGPT-4.0

Overview

This study compares the effectiveness of DeepSeek-R1 and ChatGPT-4.0 as informational resources for nursing care in gout. Findings indicate that while both models provide high-quality responses, ChatGPT-4.0 offers greater readability and coherence.

Background

The integration of large language models (LLMs) like DeepSeek-R1 and ChatGPT-4.0 in nursing care represents a significant advancement in utilizing artificial intelligence for clinical decision-making. Understanding their effectiveness is crucial for enhancing evidence-based management of conditions such as gout, which requires precise and accessible information for optimal patient care.

Data Highlights

MetricDeepSeek-R1ChatGPT-4.0p-value
FKGL13.04 ± 1.6211.41 ± 1.740.013
FRE40.50 ± 8.1249.08 ± 8.900.010
mDISCERN Score3.98 ± 0.704.30 ± 0.730.16
Average Publication Age (years)3.57 ± 2.335.42 ± 2.34<0.05

Key Findings

  • DeepSeek-R1 had a higher FKGL than ChatGPT-4.0, indicating lower readability.
  • ChatGPT-4.0 scored better on the mDISCERN quality score, though not statistically significant.
  • DeepSeek-R1 referenced 21 sources, while ChatGPT-4.0 referenced 23, with clinical guidelines being predominant in both.
  • DeepSeek-R1 provided more current citations but had invalid reference links.
  • Both models produced high-quality, professional responses for nursing care in gout.

Clinical Implications

Healthcare professionals should consider the readability and coherence of AI-generated content when utilizing LLMs for patient education and clinical decision support. While both models are valuable, ChatGPT-4.0 may be more suitable for direct patient communication due to its superior readability.

Conclusion

This comparative analysis highlights the strengths and weaknesses of DeepSeek-R1 and ChatGPT-4.0 as informational resources in nursing care for gout, emphasizing the importance of readability and quality in AI-generated responses.

Related Resources & Content

  1. DosSantos et al., Pain Medicine, 2023 -- Investigating Innovative Educational Strategies for Neuropathic Pain
  2. Clinical Rheumatology, 2025 -- Assessment of Large Language Models for Providing Guideline-Adherent Recommendations on Topical NSAID Application
  3. Infection, 2024 -- Evaluating the Theoretical Knowledge and Treatment Recommendations of ChatGPT in Bacterial Infections
  4. Arthritis Care & Research, 2020 -- Gout Guideline
  5. PMC, 2019 -- Efficacy and cost-effectiveness of nurse-led care involving education and engagement of patients
  6. Assessing the Effectiveness of Artificial Intelligence in Urology: An In-Depth Examination of Kidney Stone-Related Inquiries
  7. Arthritis Care & Research
  8. Efficacy and cost-effectiveness of nurse-led care involving education and engagement of patients and a treat-to-target urate-lowering strategy versus usual care for gout: a randomised controlled trial - PMC
  9. Results of phase III MIRROR trial of Krystexxa in gout published in Arthritis & Rheumatology | medthority.com

Original Source(s)

Related Content