Skip to content

Pull requests: openai/simple-evals

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add ArxivRollBench eval
#110 opened May 24, 2026 by liangzid Loading…
Anca/b
#92 opened Jun 23, 2025 by anca-loo Loading…
Anca br
#91 opened Jun 23, 2025 by anca-loo Loading…
Enhance simple-evals for beginner to run
#87 opened Jun 5, 2025 by ECNU3D Loading…
Jules version of beginner friendly README
#81 opened May 20, 2025 by rarhs Loading…
feat: add len_var scorer (B-0)
#71 opened May 8, 2025 by Yuu6798 Loading…
fix regex bug in browsecomp
#67 opened Apr 22, 2025 by tengyaolong2000 Loading…
fix: import collision for types
#66 opened Apr 21, 2025 by Ithanil Loading…
Small typo in grader
#57 opened Mar 25, 2025 by chiruu12 Loading…
add aime task
#55 opened Mar 12, 2025 by jason9693 Loading…
Initial commit
#45 opened Feb 1, 2025 by osmanjamalfarag Loading…
correct string spelling error
#37 opened Dec 27, 2024 by owos Loading…
Use correct _pack_message function name
#12 opened May 20, 2024 by andrewmbenton Loading…
Added Chartqa Dataset
#6 opened Apr 14, 2024 by tarunamasa Loading…
ProTip! Filter pull requests by the default branch with base:main.