Performance and usability of retrieval-augmented large language models for stroke patient and caregiver support - Summary - MDSpire

Performance and usability of retrieval-augmented large language models for stroke patient and caregiver support

  • By

  • Jinxia Rong

  • Min Liang

  • Zheyan Wang

  • Zhixue Ye

  • Jingjing Luo

  • Yan Liang

  • June 25, 2026

  • 0 min

Share

Objective:

To develop and systematically evaluate a retrieval-augmented generation (RAG) question-answering system tailored for stroke-related queries.

Approach:
  • Study Design: Comparative experimental design evaluating LLM performance and usability under zero-shot and RAG settings.
  • Knowledge Base Construction: Compiled a localized stroke knowledge base from authoritative sources, including both domestic and international guidelines.
  • Evaluation Metrics: Assessed technical performance, clinical validity, and end-user usability using various metrics including exam questions and clinician-led validation.
Key Findings:
  • RAG technology improves accuracy and robustness of responses by retrieving pertinent information from external knowledge bases.
  • The study highlights trade-offs between performance enhancement and output understandability.
  • End-user usability was assessed using System Usability Scale (SUS) and Net Promoter Score (NPS) scales.
Interpretation:

The integration of RAG with LLMs shows potential to enhance the reliability of AI-assisted stroke care, addressing gaps in user experience and information accuracy.

Limitations:
  • The study did not involve clinical intervention or treatment modification.
  • Limited empirical validation involving end users such as patients or caregivers.
Conclusion:

This research provides a feasible, evidence-based solution for stroke management and lays groundwork for AI integration in chronic disease care.

Original Source(s)

Related Content