Development and validation of a machine learning model for sperm DNA fragmentation rate in infertile men: a multicenter retrospective study

By
Ke Wang
Jinxia Zheng
Xuanxuan Ge
Jie Bai
Mengmeng Ma
Ningxin Qin
Xin Huang
Hui Jiang
You Zhang
June 22, 2026
0 min

Frontiers In Endocrinology

Overview

This study developed a machine learning-based model to predict sperm DNA fragmentation index (DFI) in infertile men, utilizing clinical and semen parameters. The Random Forest model demonstrated the highest predictive accuracy, but showed miscalibration in external validation.

Background

Infertility is a significant global health issue, with male factors contributing to a substantial proportion of cases. The sperm DNA fragmentation index (DFI) is increasingly recognized as a critical measure of male fertility potential, particularly in the context of assisted reproductive technology (ART). Accurate assessment of sperm quality, including DFI, is essential for improving reproductive outcomes in infertile men.

Data Highlights

Model	Development Cohort AUC	External Validation AUC
Random Forest	0.979 (0.972−0.986)	0.945 (95% CI: 0.916−0.975)

Key Findings

The study included 1,037 patients in the development cohort and 290 in the external validation cohort.
The Random Forest model outperformed other machine learning models in predicting DFI.
Core factors influencing DFI included sperm motility, concentration, viability, lifestyle factors, and stress levels.
The model exhibited miscalibration in external validation, indicating systematic overestimation of risk.
An online prediction platform was developed for practical use based on the model.

Clinical Implications

Further validation of the machine learning model is necessary before clinical implementation.

Conclusion

The machine learning model requires additional validation.

Development and validation of a machine learning model for sperm DNA fragmentation rate in infertile men: a multicenter retrospective study

Clinical Report: Machine Learning Model for Predicting Sperm DNA Fragmentation

Overview

Background

Data Highlights

Key Findings

Clinical Implications

Conclusion

Related Resources & Content

Original Source(s)

Development and validation of a machine learning model for sperm DNA fragmentation rate in infertile men: a multicenter retrospective study

Related Content

Association between HPV-16 and HPV-18 viral loads and severity of cervical pre-invasive lesions in women with and without HIV in Botswana

Rethinking preeclampsia: six paradigm shifts in placental pathophysiology for maternal–foetal medicine

The impact of the integrated labor-delivery-recovery-postpartum unit on maternal–neonatal outcomes and psychological experiences among low-risk parturients: a prospective cohort study from a high-volume tertiary center in china