Prompt-only boundary prediction for IFEval-style instruction-checker pass/fail behavior.
-
Updated
Jun 16, 2026 - Python
Prompt-only boundary prediction for IFEval-style instruction-checker pass/fail behavior.
Task-dependent benchmark gap between /v1/completions and /v1/chat/completions on instruction-tuned LLMs -- two case studies on Qwen3.x GGUFs, reproduction recipe, and probes.
Add a description, image, and links to the ifeval topic page so that developers can more easily learn about it.
To associate your repository with the ifeval topic, visit your repo's landing page and select "manage topics."