AI Division and Partners publish novel research: Performance of a large language model on the reasoning tasks of a physician

Computational tools for medical decision support have been advancing over time, mainly by serving as resources for limited applications. Machine learning tools for autonomous interpretation of clinical cases have also been gradually improving over time. Brodeur et al. pitted a large language model, the OpenAI o1 series, directly against hundreds of physicians at different levels of training and experience on a variety of clinical cases ranging from published patient vignettes to evaluations of brand-new emergency room patients, as well as on clinical tasks including both diagnosis and planning of clinical management (see the Perspective by Hopkins and Cornelisse). Across a variety of scenarios and applications, the large language model outperformed both human physicians and older models, suggesting its potential utility for clinical care. —Yevgeniya Nusinovich

Justin Mabee

Designer. 15 year web design veteran. 600+ projects completed. Memberships, Courses, Websites, Product Strategy and more.

https://arcandatlas.co
Previous
Previous

Harvard Medical School and the BIDMC Division of AI in Emergency Medicine Launch 1st CME Course “AI in Emergency Medicine” for December 3-4, 2026

Next
Next

Dr. John Lee Awarded National Grant for Emergency AI Project