Research

JAMA Internal Medicine
Oct 14, 2025

How Physicians Can Prepare for Generative AI

Generative artificial intelligence (GenAI) large language models (LLMs) have rapidly integrated into everyday life, from internet search engines to chatbot […]

Preprint
Oct 5, 2025

A global log for medical AI

As hospitals rush to pilot large language models and other AI-based clinical decision support tools, we still lack a standard […]

Preprint
Oct 1, 2025

Advancing Medical Artificial Intelligence Using a Century of Cases

LLMs exceed physician performance on complex text-based differential diagnosis andconvincingly emulate expert medical presentations, but image interpretation and literatureretrieval remain […]

NEJM AI
Sep 25, 2025

Assessment of Large Language Models in Clinical Reasoning: A Novel Benchmarking Study

SCT exposes persistent limitations in LLM clinical reasoning, especially in models optimized for explicit reasoning. Although SCT performance offers analogies […]

Preprint
Aug 16, 2025

Automated Evaluation of Large Language Model Response Concordance with Human Specialist Responses on Physician-to-Physician eConsult Cases

Specialist consults in primary care and inpatient settings typically address complex clinicalquestions beyond standard guidelines. eConsults have been developed as […]

NEJM AI
Aug 14, 2025

MedAgentBench: A Virtual EHR Environment to Benchmark Medical LLM Agents

Recent large language models (LLMs) have demonstrated significant advancements, particularly in their ability to serve as agents, thereby surpassing their […]

Preprint
Aug 13, 2025

Asking the Right Questions: Benchmarking Large Language Models in the Development of Clinical Consultation Templates

This study evaluates the capacity of large language models (LLMs) to generate structured clinicalconsultation templates for electronic consultation. Using 145 […]

JAMA Surgery
Aug 6, 2025

Intelligent, Human-Centric Delivery Is Needed to Maximize AI

Artificial intelligence (AI)–augmented learning is here, and many believe it is superior to all past endeavors. AI’s ability to deliver […]

Preprint
Jul 23, 2025

A typology of physician input approaches to using AI chatbots for clinical decision-making: a mixed methods study

This study aimed to identify how physicians interacted with LLM chatbots on clinical reasoning tasks to create a typology of […]

Preprint
Jun 8, 2025

From Tool to Teammate: A Randomized Controlled Trial of Clinician-AI Collaborative Workflows for Diagnosis

Early studies of large language models (LLMs) in clinical settings have largely treated artificialintelligence (AI) as a tool rather than […]

Latest News

View all

Get the latest on our studies, grant awards, and media coverage.