Development and validation of an interpretable machine learning-based predictive model for breast cancer bone metastasis - Summary - MDSpire

Development and validation of an interpretable machine learning-based predictive model for breast cancer bone metastasis

  • By

  • Caiyun Fan

  • Ming Tian

  • Zhendong Ding

  • Jun Peng

  • Gulisitan Yiliyiming

  • Mingjiang Fan

  • Abuduaini Tuerxun

  • Binxu Qiu

  • Xiaojuan Zhu

  • July 3, 2026

  • 0 min

Share

Objective:

To develop a machine learning-based predictive model for accurately assessing the risk of bone metastasis in breast cancer patients.

Approach:
  • Data Source: Utilized the Surveillance, Epidemiology, and End Results database for model development.
  • Model Development: Conducted univariate and multivariate logistic regression analyses to identify key predictive variables, followed by constructing eight machine learning algorithms.
  • Model Evaluation: Employed 10-fold cross-validation for hyperparameter optimization and evaluated model performance using AUC, AUPRC, decision curve analysis, and calibration curves.
  • Interpretability: Applied SHAP analysis to enhance model interpretability and developed a web-based calculator for clinical application.
Key Findings:
  • Majority of patients were aged over 50 years, female, with HR+/HER2− subtype, and low incidence of bone metastasis.
  • Key independent risk factors identified included age >50 years, higher tumor grade, advanced T stage, N stage, clinical stage, HR-/HER2- subtype, radiotherapy, and married status.
  • The LGB model achieved an AUC of 0.98 in training and internal validation sets, and 0.91 in the external validation set.
  • SHAP analysis indicated surgery as the primary protective factor, with advanced N stage associated with increased bone metastasis risk.
Interpretation:

The developed LGB model and web-based calculator provide tools for assessing the risk of bone metastasis in breast cancer patients.

Limitations:
  • Potential selection bias due to reliance on public databases.
  • Limited generalizability and external validity due to lack of diverse clinical settings.
Conclusion:

The study presents a transparent machine learning model that can aid in the early detection and management of bone metastasis in breast cancer patients.

Original Source(s)

Related Content