An XGBoost-based model for detecting undiagnosed type 2 diabetes using routine physical and lifestyle data from a multi-center Chinese population - Report - MDSpire
Advertisement
An XGBoost-based model for detecting undiagnosed type 2 diabetes using routine physical and lifestyle data from a multi-center Chinese population
Clinical Report: Machine Learning for Identifying Undiagnosed Type 2 Diabetes
Overview
This study developed and validated an XGBoost model to identify undiagnosed type 2 diabetes (T2D) using routine health checkup data from a multi-center cohort in China. The model achieved a moderate predictive performance with an AUC of 77.2%.
Background
Diabetes, particularly type 2 diabetes (T2D), is a growing global health concern, with projections indicating a significant rise in prevalence. Early diagnosis is crucial for improving patient outcomes. This study addresses the gap in utilizing routine health checkup data for real-time risk assessment of undiagnosed T2D.
Data Highlights
Predictor
Influence (%)
Fasting blood glucose
50.6
Creatinine
6.6
Triglyceride
5.6
Age
5.1
Low-density lipoprotein
5.0
Key Findings
The XGBoost model was developed using data from 11,382 individuals.
The model was validated on an independent test set of 1,026 individuals.
The area under the receiver operating characteristic curve (AUC) for the model was 77.2% (95% CI: 70.3%–84.1%).
Fasting blood glucose was identified as the most influential predictor of undiagnosed T2D.
The model includes 12 predictors related to metabolic and inflammatory markers.
Clinical Implications
The XGBoost model can assist clinicians in identifying individuals at high risk for undiagnosed T2D during routine health examinations.
Conclusion
The study demonstrates the feasibility of using machine learning to enhance the identification of undiagnosed T2D through routine health data.