Development and external validation of a diagnostic model for differentiating major depressive disorder from bipolar disorder

By
Hongxin Zheng
Xialong Cheng
Wenxin Gan
Shuyu Duan
Yizi Liu
Kun Li
Chen Su
Chenxi Xu
Yongcan Zhou
Wenwei Zhang
Runbo Wu
Yu Xie
January 28, 2026
0 min

Bmc Psychiatry (Springer)

Overview

This study developed and externally validated machine learning models using electronic medical record data to distinguish major depressive disorder (MDD) from bipolar disorder (BD). The models demonstrated good performance and generalizability across two independent medical centers, with interpretability enhanced by SHAP analysis.

Background

Bipolar disorder affects approximately 2% of the global population and is characterized by mood fluctuations between manic and depressive episodes. Differentiating BD from MDD is challenging due to overlapping depressive symptoms, leading to frequent misdiagnosis and inappropriate treatment. There is a lack of clear biomarkers and universal diagnostic tools for BD, highlighting the need for reliable screening methods. Machine learning offers potential for improving diagnostic accuracy but requires rigorous external validation to ensure clinical applicability.

Data Highlights

Dataset	MDD Patients	BD Patients
Development and Internal Validation (Fourth People’s Hospital of Hefei)	991	423
External Validation (Tongde Hospital of Zhejiang)	188	161

Key Findings

Machine learning models were constructed using 34 EMR variables including sociodemographic, clinical, and biochemical data.
Feature selection was performed using LASSO regression and Boruta algorithm to identify relevant predictors.
Six different ML models were developed and internally validated on a large single-center dataset.
External validation on an independent dataset demonstrated good generalization ability of the models.
SHAP analysis provided transparent explanations of feature contributions to model predictions.
The study addresses limitations of previous research by including rigorous external validation and interpretability.

Clinical Implications

The validated machine learning framework can assist clinicians in differentiating MDD from BD using routinely collected EMR data, potentially reducing misdiagnosis and improving treatment decisions. The interpretability of the model via SHAP values supports clinical trust and understanding of key diagnostic features. This approach may facilitate earlier and more accurate identification of BD in patients presenting with depressive episodes.

Conclusion

This study presents a rigorously validated, interpretable machine learning diagnostic tool that effectively distinguishes bipolar disorder from major depressive disorder using electronic medical record data. The framework holds promise for enhancing clinical diagnostic accuracy and guiding appropriate treatment strategies.

References

Zhu et al. 2023 -- Diagnostic model to distinguish BD from MDD using XGBoost
Huang et al. 2023 -- Logistic Regression model for BD vs MDD diagnosis
International Statistical Classification of Mental Disorders (ICD-10)
SHAP methodology references (Lundberg & Lee, 2017)

Development and external validation of a diagnostic model for differentiating major depressive disorder from bipolar disorder

Diagnostic Framework to Differentiate Major Depressive Disorder from Bipolar Disorder

Overview

Background

Data Highlights

Key Findings

Clinical Implications

Conclusion

References

Original Source(s)

Development and external validation of a diagnostic model for differentiating major depressive disorder from bipolar disorder

Related Content

Who Decides When a Doctor Steps Back?

Mortality Rates from All Causes and Specific Conditions in Individuals Diagnosed with Bipolar II Disorder

When Crisis Calls Go Unanswered