Domain-specific benchmark for B2B sales agents — 250 tasks, SimPO judge model, published on HuggingFace.
-
Updated
May 2, 2026 - Python
Domain-specific benchmark for B2B sales agents — 250 tasks, SimPO judge model, published on HuggingFace.
Add a description, image, and links to the simpo topic page so that developers can more easily learn about it.
To associate your repository with the simpo topic, visit your repo's landing page and select "manage topics."