One Step Closer to Real-Time Detection of Missed Opportunities for Diagnosis in the ED Using LLMs

By
Fernanda Bellolio
Daniel Cabrera
June 29, 2026

JAMA Network Open

Overview

This study evaluates large language models (LLMs) for identifying missed opportunities for diagnosis in the emergency department (ED). Among 288 analyzed encounters, the prevalence of missed opportunities was 13.5%. Discrimination varied by model and cohort, with AUCs from 0.65 to 0.73 in the 72-hour return cohort and from 0.57 to 0.82 in the floor-to-ICU cohort.

Background

Identifying diagnostic oversights in the ED is crucial for improving patient outcomes, but traditional methods rely on retrospective reviews that are time-consuming and inefficient. Automated tools such as electronic triggers (eTriggers) have been proposed to make this process more efficient.

Data Highlights

Model	AUC (72-hour return)	AUC (floor-to-ICU)
Claude Sonnet 4	0.65	0.57
Claude Sonnet 4.6
Claude Opus 4.6
Gemini 3 Pro
GPT-5
GPT-5mini	0.73	0.82
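
For readers less familiar with the metric, an AUC can be read as the probability that a model scores a randomly chosen encounter with a missed opportunity higher than a randomly chosen encounter without one. The sketch below computes that pairwise quantity in Python. The labels and scores are hypothetical illustration data, not values from the study, and pairwise_auc is a name introduced here.

```python
from itertools import product


def pairwise_auc(labels, scores):
    """AUC as the share of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p, n in product(pos, neg)
    )
    return wins / (len(pos) * len(neg))


# Hypothetical data: 1 = reviewers found a missed opportunity, 0 = none found.
labels = [1, 1, 1, 0, 0, 0, 0, 0, 0, 0]
# Hypothetical model scores (higher = more suspicious encounter).
scores = [0.9, 0.6, 0.3, 0.7, 0.4, 0.2, 0.2, 0.1, 0.1, 0.05]

auc = pairwise_auc(labels, scores)
print(f"AUC = {auc:.2f}")
```

An AUC of 0.5 corresponds to chance-level ranking and 1.0 to perfect ranking, which is the scale on which the values in the table sit.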

Key Findings

The overall prevalence of missed opportunities for diagnosis was 13.5% among 288 encounters.
The number needed to screen was 9.1 in the 72-hour return cohort and 5.4 in the floor-to-ICU cohort.
Model discrimination (AUC) ranged from 0.65 to 0.73 in the 72-hour return cohort and from 0.57 to 0.82 in the floor-to-ICU cohort.
Claude Sonnet 4 favored higher sensitivity, while GPT-5mini favored higher specificity in binary classifications.
Physician interrater agreement was 81.9%.
LLMs can analyze unstructured clinical notes to detect missed diagnostic opportunities.
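
The arithmetic behind the first two findings can be sketched in a few lines of Python. This assumes the number needed to screen is the reciprocal of the proportion of screened encounters that contain a missed opportunity; the summary does not state how the figure was computed, so treat that definition, and the function names, as assumptions.

```python
def implied_prevalence(number_needed_to_screen: float) -> float:
    """Prevalence implied by an NNS, ASSUMING NNS = 1 / prevalence."""
    if number_needed_to_screen < 1:
        raise ValueError("number needed to screen must be at least 1")
    return 1.0 / number_needed_to_screen


# Figures reported in the summary.
total_encounters = 288
overall_prevalence = 0.135

# Approximate count of encounters with a missed opportunity
# (rounded, because the reported prevalence is itself rounded).
approx_cases = round(total_encounters * overall_prevalence)

# Cohort-level prevalence implied by each reported NNS (assumption above).
prev_72h_return = implied_prevalence(9.1)
prev_floor_to_icu = implied_prevalence(5.4)

print(f"approximate cases: {approx_cases}")
print(f"72-hour return: {prev_72h_return:.1%}")
print(f"floor-to-ICU:   {prev_floor_to_icu:.1%}")
```

Under that assumption, the two cohorts would have prevalences of roughly 11% and 18.5%, which is consistent with an overall figure of 13.5% lying between them.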

Clinical Implications

The findings suggest that LLMs could help identify missed diagnostic opportunities in real time, potentially improving patient safety. Because the models differ in how they trade sensitivity against specificity, the choice of model determines how many cases are flagged for review and how many are missed, so it should match the goals of the clinical review process.
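
To make the sensitivity and specificity trade-off concrete, the sketch below applies two different cutoffs to the same set of scores. The data and thresholds are hypothetical and chosen only for illustration; they are not taken from the study and do not represent any of the models named above.

```python
def sensitivity_specificity(labels, scores, threshold):
    """Flag an encounter when score >= threshold, then compare with labels."""
    tp = sum(1 for y, s in zip(labels, scores) if y == 1 and s >= threshold)
    fn = sum(1 for y, s in zip(labels, scores) if y == 1 and s < threshold)
    tn = sum(1 for y, s in zip(labels, scores) if y == 0 and s < threshold)
    fp = sum(1 for y, s in zip(labels, scores) if y == 0 and s >= threshold)
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical data: 1 = missed opportunity confirmed on review, 0 = none.
labels = [1, 1, 1, 0, 0, 0, 0, 0, 0, 0]
scores = [0.9, 0.6, 0.3, 0.7, 0.4, 0.2, 0.2, 0.1, 0.1, 0.05]

# A low cutoff flags more encounters; a high cutoff flags fewer.
sens_low, spec_low = sensitivity_specificity(labels, scores, threshold=0.25)
sens_high, spec_high = sensitivity_specificity(labels, scores, threshold=0.65)

print(f"low cutoff:  sensitivity={sens_low:.2f}, specificity={spec_low:.2f}")
print(f"high cutoff: sensitivity={sens_high:.2f}, specificity={spec_high:.2f}")
```

A sensitivity-leaning configuration catches more true cases at the cost of more charts for physicians to review, while a specificity-leaning one reduces review workload but lets more cases through unflagged.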

Conclusion

The study indicates that LLMs can identify missed diagnostic opportunities in emergency medicine.
