PhysAssistBench
Benchmark for evaluating LLM agents in doctor-patient-EHR clinical assistance workflows
About
PhysAssistBench is a research benchmark designed to evaluate LLM agents in realistic physician assistance scenarios. It tests coordinated capabilities across clinical knowledge, electronic health record (EHR) system interactions, and patient communication. The benchmark provides a standardized evaluation framework for multi-agent medical AI systems that must navigate complex healthcare workflows involving both structured data and natural conversation.
Details
| Type | |
| Integrations | |
| Language |
Tags
Quick Info
- Organization
- Research Team (Du et al.)
- Pricing
- open-source
- Free Tier
- Yes
- Updated
- Jun 18, 2026
Also in Dev Tools
Crawl4AI
Open-source web crawler optimized for LLMs and AI agents — 62K+ stars
Firecrawl
Web scraping API built for LLMs — turn any website into LLM-ready data — 89K+ stars
Headroom Context Optimization
Reduce LLM API costs by 50-90% through advanced context compression