Performance of DeepSeek V3.2 and ChatGPT 5.1 in Musculoskeletal Triage and Differential Diagnosis of Outpatients With Low Back Pain: Multidimensional Comparative Study

By
Ziqian Ma
Ruiyuan Chen
Aobo Wang
Yu Xi
Minghui Liang
Shuo Yuan
Ning Fan
Jianwei Zang
Tianyi Wang
Lei Zang
July 3, 2026

Journal of Medical Internet Research (JMIR)

Overview

This study evaluates the diagnostic capabilities of two AI chatbots, DeepSeek V3.2 and ChatGPT 5.1, in the triage and differential diagnosis of outpatients with low back pain (LBP).

Background

Musculoskeletal disorders (MSDs) are prevalent and contribute significantly to healthcare burdens, with a notable increase in incidence over the past two decades. Effective triage and diagnosis of conditions like low back pain are essential for optimizing patient care and resource allocation. The integration of artificial intelligence, particularly large language models (LLMs), is being explored to enhance diagnostic accuracy in outpatient settings.

Data Highlights

No numerical or trial data were provided in the source material.

Key Findings

DeepSeek V3.2 and ChatGPT 5.1 were assessed for their ability to classify MSDs based on patient complaints.
The study utilized standardized questionnaires derived from real outpatient records for evaluation.
Both AI models were evaluated for their performance in the clinical diagnosis and triage of low back pain.
The complexity of MSDs necessitates advanced diagnostic tools.
LLMs like DeepSeek and ChatGPT are being investigated for their roles in the medical field.

Clinical Implications

AI chatbots may support outpatient physicians in the triage and diagnosis of low back pain.

Conclusion

The study evaluates AI chatbots in the triage and diagnostic processes for low back pain, addressing challenges posed by musculoskeletal disorders.

