Performance of four large language models in addressing family-centered questions on morbidity and outcomes in patients supported by extracorporeal membrane oxygenation

By
Wouter Vankrunkelsven
Leen Vercaemst
Dieter F Dauwe
June 22, 2026
0 min

Critical Care

Objective:

To investigate the performance of four commonly used large language models (LLMs) in answering family-centered questions about morbidity and outcomes for adult patients undergoing ECMO.

Approach:

Key Findings:

All LLMs acknowledged the high-risk nature of ECMO and differentiated between VV-ECMO and VA-ECMO.
Responses generally aligned with ELSO-registry data, but LLMs reported shorter VV-ECMO support durations, specifically indicating a mean duration of X days compared to Y days in ELSO data.
Discrepancies in complication rates were noted between LLMs and ELSO-registry data.
Gemini achieved the highest average scores in both content and expert-perceived communicative quality.
A significant number of responses were rated unsatisfactory by ECMO experts.

Interpretation:

Limitations:

Questions were queried sequentially in a single chat session, which may inflate internal coherence.
Results reflect a snapshot of LLM capabilities in May 2025 and may not represent future performance.

Performance of four large language models in addressing family-centered questions on morbidity and outcomes in patients supported by extracorporeal membrane oxygenation

Objective:

Approach:

Key Findings:

Interpretation:

Limitations:

Conclusion:

Original Source(s)

Performance of four large language models in addressing family-centered questions on morbidity and outcomes in patients supported by extracorporeal membrane oxygenation

Related Content

Recent COVID-19 Shot Linked to Less Spread

CXCR3 ameliorates neutrophil-dependent disease severity in SARS-CoV-2 infection by regulating CD4+ T cell recruitment

ACOG Releases 2026 Maternal Immunization Schedule