AI Outperforms MDs on Reasoning Tasks - Summary - MDSpire
From the Journals

AI Outperforms MDs on Reasoning Tasks

A study led by Peter G. Brodeur, MD, and colleagues from Harvard Medical School and Beth Israel Deaconess Medical Center found that o1, OpenAI's large language model, surpassed physician baselines in diagnostic and management reasoning across multiple evaluations, including emergency room cases. In a blinded proof-of-concept evaluation, o1 identified accurate diagnoses in 67%-82% of cases, compared with 50%-70% for physicians. The study revealed both strengths and limitations of LLMs, and the authors suggest that further research is needed to assess their impact on clinical practice.
