Benchmark Integrity and Reasoning-Trace Errors in Medical Question Answering With Large Language Models: Mixed Methods Study With Sparse Autoencoders - Top-Institutions - MDSpire

Benchmark Integrity and Reasoning-Trace Errors in Medical Question Answering With Large Language Models: Mixed Methods Study With Sparse Autoencoders

By
Jialin Liu
Siru Liu
Adam Wright
June 12, 2026
0 min

Journal Of Medical Internet Research (Jmir)

Share

Top Institutions in Internal Medicine

Brief introduction explaining scope and methodology.

#1

Stanford University School of Medicine

Stanford, California
Key Differentiators
- Medical Informatics
#2

Harvard Medical School

Boston, Massachusetts
Key Differentiators
- Medical Informatics
#3

Mount Sinai

New York, New York
Key Differentiators
- Medical Informatics
#4

Mass General Brigham

Boston, Massachusetts
Key Differentiators
- Medical Informatics
#5

NYU Langone Health

New York, New York
Key Differentiators
- Medical Informatics