Assessing ChatGPT vs. evidence-based online responses for polycystic ovary syndrome self-management and education: an international cross-sectional blinded survey of healthcare professionals

By
Sandro Graca
Alexander Dallaway
Folashade Alloh
Harpal S. Randeva
Chris Kite
Ioannis Kyrou
March 31, 2026
0 min

Frontiers In Digital Health

Objective:

To assess the reliability of ChatGPT-generated responses to frequently asked questions about PCOS compared to evidence-based recommendations, focusing on accuracy and clarity.

Key Findings:

ChatGPT responses were rated significantly higher than evidence-based responses for 11 out of 12 questions, indicating a preference for AI-generated content.
Moderate to large effect sizes were observed (rrb = −0.46 to −1.00; all p-values <0.05), suggesting substantial differences in ratings.
The average rating difference was 0.824 units in favor of ChatGPT, highlighting its perceived effectiveness.
Scoring agreement varied from poor to fair, with seven questions showing fair agreement (κ = 0.24–0.37, p < 0.05), indicating variability in evaluations.
No significant differences in readability between ChatGPT and evidence-based responses were found, but simplification of responses improved clarity.

Interpretation:

ChatGPT shows potential as a complementary tool for patient self-education in PCOS, enhancing engagement and simplifying complex medical language, which could improve patient understanding.

Limitations:

Sample size limited to 43 healthcare professionals, which may affect the generalizability of the findings.
Scoring agreement varied, indicating inconsistent evaluations among participants.
Further research is needed to validate clinical applicability and integration of AI tools in diverse healthcare settings.

Conclusion:

ChatGPT may serve as a valuable resource for PCOS education, but further studies are necessary to optimize its use in clinical settings and explore its impact on patient outcomes.

Assessing ChatGPT vs. evidence-based online responses for polycystic ovary syndrome self-management and education: an international cross-sectional blinded survey of healthcare professionals

Objective:

Key Findings:

Interpretation:

Limitations:

Conclusion:

Original Source(s)

Assessing ChatGPT vs. evidence-based online responses for polycystic ovary syndrome self-management and education: an international cross-sectional blinded survey of healthcare professionals

Related Content

Who Decides When a Doctor Steps Back?

Effects of Tai Chi combined with dietary intervention on health-promoting lifestyle and metabolic and reproductive outcomes in female college students with polycystic ovary syndrome: a randomized controlled trial

GLP-1 Drugs Tied to Lower Addiction Risk