
Pull requests: MedARC-AI/medmarks

Pull requests list

Improve HealthBench parity with openai/simple-evals
#132 opened May 14, 2026 by bofenghuang Contributor
Patch thinking-model grader bugs
#128 opened Apr 13, 2026 by Honyant
2 tasks done
Add RL training configs, scripts, and environment updates
#113 opened Jan 29, 2026 by nikhilk7153 Contributor
Parsed Model Answer logging + MCQ Answer analysis
#110 opened Jan 26, 2026 by mnishant2 Contributor
Add NLG metrics to aci_bench
#107 opened Jan 22, 2026 by jbdel Contributor
Add BioASQ
#105 opened Jan 22, 2026 by Ash-29
Add open-i-summarization environment for radiology report summarization
#102 opened Jan 22, 2026 by jbdel Contributor
Casereportbench environment
#99 opened Jan 20, 2026 by ss8319 Contributor
Add MedSafetyBench environment
#97 opened Jan 18, 2026 by anas-zafar
Added paper citations to environments
#86 opened Dec 24, 2025 by mkieffer1107 Contributor
Feat/medexqa judges
#67 opened Nov 3, 2025 by mnishant2 Contributor Draft
Add MedHALT evaluation environment
#65 opened Oct 30, 2025 by geetua Contributor
K-QA #45 added
#52 opened Oct 13, 2025 by Manishram-ai Contributor
Mebullets changes
#26 opened Oct 3, 2025 by mkieffer1107 Contributor