Explore our publications and preprints advancing healthcare through rigorous AI evaluation.
Early studies of large language models (LLMs) in clinical settings have largely treated artificial intelligence (AI) as a tool rather […]
Surgical co-management (SCM) is an evidence-based model in which hospitalists jointly manage medically complex perioperative patients alongside surgical teams. Despite […]
Large language models (LLMs) are entering clinician workflows, yet evaluations rarely measure how clinician reasoning shapes model behavior during clinical […]
Clinical evaluation of large language models (LLMs) currently relies on static datasets and isolated scenarios that fail to capture the […]
The large volume of abdominal computed tomography (CT) scans coupled with the shortage of radiologists have intensified the need for […]
General-purpose large language models (LLMs) are now commonplace throughout society, becoming de facto health advisors for millions worldwide1. The public […]
The deployment of artificial intelligence (AI) translation tools in healthcare is accelerating rapidly, yet regulatory frameworks lag dangerously behind clinical […]
mportance: High-quality discharge summaries are essential for safe care transitions but contribute substantially to clinician documentation burden and burnout. While […]
Medical artificial intelligence (AI) tools, including clinical language models, vision–language models and multimodal health record models, are used to summarize […]
Large language model (LLM) chat tools have the potential to transform healthcare workflows by improving efficiency and reducing administrative burdens. […]
Get the latest on our studies, grant awards, and media coverage.