Large language model chatbots as sources of pediatric anesthesia health advice: An evaluation of reliability and readability - Report - MDSpire

Large language model chatbots as sources of pediatric anesthesia health advice: An evaluation of reliability and readability

  • By

  • Xue Zhang

  • Yuchen Dai

  • Xin Zhao

  • Lin Wu

  • Boming Shao

  • Xisheng Shan

  • Fuhai Ji

  • Runzhi Deng

  • Baojian Zhao

  • June 29, 2026

  • 0 min

Share

Clinical Report: Assessing the Reliability and Readability of Pediatric Anesthesia Guidance

Overview

This study evaluates the reliability and readability of pediatric anesthesia information provided by four large language models (LLMs). Findings indicate variability in content quality and readability.

Background

Pediatric anesthesia presents unique challenges due to the physiological differences between children and adults, as well as high levels of perioperative anxiety among pediatric patients. Caregivers often seek online information, but the quality of this information can be inconsistent. The rise of LLMs as sources of health information necessitates an evaluation of their reliability and readability.

Data Highlights

No numerical data was provided in the source material.

Key Findings

  • The study systematically evaluated four widely used LLMs: ChatGPT, Claude, DeepSeek, and Gemini.
  • LLM-generated health information often demonstrates limited readability and occasional factual inconsistencies.
  • Excessive linguistic complexity in LLM responses may hinder caregivers' understanding of anesthesia-related risks.
  • There is a critical need for accurate and clear information for caregivers regarding pediatric anesthesia.

Clinical Implications

Healthcare professionals should be aware of the potential inaccuracies in LLM-generated information regarding pediatric anesthesia.

Conclusion

The evaluation of LLMs in pediatric anesthesia highlights concerns regarding the reliability and readability of the information they provide.

Related Resources & Content

  1. JMIR Medical Informatics, 2026 -- Evaluation of Five Large Language Models for Parental Education in Pediatric Anesthesia: Reliability and Readability Study
  2. npj Digital Medicine, 2026 -- Large language models provide unsafe answers to patient-posed medical questions
  3. Journal of Medical Internet Research (JMIR), 2026 -- Patient Cognitive Bias in Large Language Model–Supported Health Consultations: Simulation-Based Comparative Study
  4. DIGITAL HEALTH, 2026 -- Evaluation of free-access artificial intelligence chatbots in preoperative patient education about general anesthesia: A comparative study of ChatGPT, gemini, and copilot
  5. Critical Elements for the Pediatric Periprocedural Anesthesia Environment: Policy Statement | Pediatrics | American Academy of Pediatrics, 2025
  6. 2023 American Society of Anesthesiologists Practice Guidelines for Preoperative Fasting, PubMed
  7. Relationship between perioperative medications and risk of emergence agitation in children after sevoflurane anesthesia: a network meta-analysis | Pediatric Research, 2026
  8. Guidelines to the Practice of Anesthesia—Revised Edition 2025
  9. Critical Elements for the Pediatric Periprocedural Anesthesia Environment: Policy Statement | Pediatrics | American Academy of Pediatrics
  10. 2023 American Society of Anesthesiologists Practice Guidelines for Preoperative Fasting: Carbohydrate-containing Clear Liquids with or without Protein, Chewing Gum, and Pediatric Fasting Duration-A Modular Update of the 2017 American Society of Anesthesiologists Practice Guidelines for Preoperative Fasting - PubMed
  11. Relationship between perioperative medications and risk of emergence agitation in children after sevoflurane anesthesia: a network meta-analysis | Pediatric Research
  12. Guidelines to the Practice of Anesthesia—Revised Edition 2025

Original Source(s)

Related Content