Join the ARISE mailing list to stay updated on healthcare AI research →

Research

Explore our publications and preprints advancing healthcare through rigorous AI evaluation.

Science

Jun 4, 2026

A narrowing window to understand AI

As capabilities of artificial intelligence (AI) advance rapidly, human understanding of these systems is increasingly falling behind. Several trends are […]

Preprint

May 24, 2026

Teaching large language models to reason like expert diagnosticians

Differential diagnosis is an iterative process that integrates patient information with broader medical knowledge. Clinical case series such as the […]

Nature Medicine

May 14, 2026

Advancing conversational diagnostic AI with multimodal reasoning

Real-world clinical practice is inherently multimodal, relying on the synthesis of patient history with visual information such as medical imagery […]

BMJ Digital Health & AI

May 14, 2026

Context is key: context engineering as the next frontier of medical AI

Large language models (LLMs) have been rapidly adopted for their potential to reduce clinician documentation burden and assist with clinical […]

Nature

May 12, 2026

What the AI era doctor should know: a scoping review of proposed artificial intelligence competencies for medical education

Artificial intelligence (AI) is rapidly reshaping healthcare and the competencies expected of graduating medical students, yet AI curricula and competency […]

BMJ Digital Health

May 11, 2026

When medical AI fails outside English

Large language models (LLMs) are increasingly positioned as general-purpose medical systems, with demonstrated potential in diagnosis and management reasoning for […]

JAMA Network

May 8, 2026

Physician-Reported Safety Outcomes of AI-Generated Hospital Course Summaries

High-quality discharge summaries are essential for safe care transitions but contribute substantially to clinician documentation burden and burnout. While retrospective […]

Preprint

May 4, 2026

PhysicianBench: Evaluating LLM Agents in Real-World EHR Environments

We introduce PhysicianBench, a benchmark for evaluating LLM agents on physician tasks grounded in real clinical setting within electronic health […]

Science

Apr 30, 2026

Performance of a large language model on the reasoning tasks of a physician

More than 65 years ago, complex clinical diagnostic reasoning cases were introduced as the gold standard for the evaluation of […]

BMJ Digital Health & AI

Apr 27, 2026

Why are humans still in the loop with advancing AI capabilities?

Large language model generative artificial intelligence (AI) systems have opened Pandora’s box, beating human benchmarks across a range of tasks. […]

Latest News

View all

Get the latest on our studies, grant awards, and media coverage.

In Harvard study, AI offered more accurate emergency room diagnoses than two human doctors

Tech CrunchMay 3, 2026