Assessing ChatGPT vs. evidence-based online responses for polycystic ovary syndrome self-management and education: an international cross-sectional blinded survey of healthcare professionals - Summary - MDSpire

Assessing ChatGPT vs. evidence-based online responses for polycystic ovary syndrome self-management and education: an international cross-sectional blinded survey of healthcare professionals

  • By

  • Sandro Graca

  • Alexander Dallaway

  • Folashade Alloh

  • Harpal S. Randeva

  • Chris Kite

  • Ioannis Kyrou

  • March 31, 2026

  • 0 min

Share

Objective:

To assess the reliability of ChatGPT-generated responses to frequently asked questions about PCOS compared to evidence-based recommendations, focusing on accuracy and clarity.

Key Findings:
  • ChatGPT responses were rated significantly higher than evidence-based responses for 11 out of 12 questions, indicating a preference for AI-generated content.
  • Moderate to large effect sizes were observed (rrb = −0.46 to −1.00; all p-values <0.05), suggesting substantial differences in ratings.
  • The average rating difference was 0.824 units in favor of ChatGPT, highlighting its perceived effectiveness.
  • Scoring agreement varied from poor to fair, with seven questions showing fair agreement (κ = 0.24–0.37, p < 0.05), indicating variability in evaluations.
  • No significant differences in readability between ChatGPT and evidence-based responses were found, but simplification of responses improved clarity.
Interpretation:

ChatGPT shows potential as a complementary tool for patient self-education in PCOS, enhancing engagement and simplifying complex medical language, which could improve patient understanding.

Limitations:
  • Sample size limited to 43 healthcare professionals, which may affect the generalizability of the findings.
  • Scoring agreement varied, indicating inconsistent evaluations among participants.
  • Further research is needed to validate clinical applicability and integration of AI tools in diverse healthcare settings.
Conclusion:

ChatGPT may serve as a valuable resource for PCOS education, but further studies are necessary to optimize its use in clinical settings and explore its impact on patient outcomes.

Original Source(s)

Related Content