docs: threshold the aggregate rubric metric, not the dimension ids #295

Merged. placerda merged 1 commit into `develop` from `docs/threshold-aggregate-rubric` on Jun 10, 2026
@placerda

Contributor

Step 11.3 instructed readers to set thresholds on rubric dimension ids (`correct_itinerary`, `adherence_to_constraints`, `clear_practical_notes`), but `azd ai agent eval` emits one aggregate pass-rate metric per evaluator (`coherence`, `fluency`, `smoke-core`) and does not surface dimension-level metrics. Following the tutorial as written always failed with `threshold metric(s) not found in azd results`.

This PR switches the example to threshold `smoke-core` / `coherence` / `fluency` on the 0..1 pass-rate scale, adds a callout explaining why dimension-level thresholds aren't supported, and points readers at `aggregate_metrics` in `results.json` for the authoritative list.
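To illustrate the failure mode described above, here is a minimal Python sketch of checking thresholds against the metric names that were actually emitted. It is not the azd implementation. It assumes, purely for illustration, that `aggregate_metrics` in `results.json` is a flat mapping from metric name to a 0..1 pass rate; the helper names `load_aggregate_metrics` and `check_thresholds` and all numeric values are hypothetical.

```python
import json
from pathlib import Path


def load_aggregate_metrics(path):
    """Read the aggregate_metrics section of a results.json file.

    ASSUMPTION: aggregate_metrics is a flat {metric_name: pass_rate} mapping.
    The real file layout may differ; inspect your own results.json first.
    """
    data = json.loads(Path(path).read_text(encoding="utf-8"))
    return data["aggregate_metrics"]


def check_thresholds(aggregate_metrics, thresholds):
    """Return (missing, failing) for the requested thresholds.

    missing: threshold names with no emitted metric (sorted list)
    failing: {name: observed pass rate} for metrics below their minimum
    """
    missing = sorted(name for name in thresholds if name not in aggregate_metrics)
    failing = {
        name: aggregate_metrics[name]
        for name, minimum in thresholds.items()
        if name in aggregate_metrics and aggregate_metrics[name] < minimum
    }
    return missing, failing


# Hypothetical pass rates, one aggregate metric per evaluator.
emitted = {"coherence": 0.9, "fluency": 0.95, "smoke-core": 0.75}

# Thresholds keyed by rubric dimension ids: none of these names are emitted.
dimension_thresholds = {
    "correct_itinerary": 0.8,
    "adherence_to_constraints": 0.8,
    "clear_practical_notes": 0.8,
}
missing_dims, failing_dims = check_thresholds(emitted, dimension_thresholds)

# Thresholds keyed by evaluator names: every name resolves to a metric.
evaluator_thresholds = {"smoke-core": 0.8, "coherence": 0.8, "fluency": 0.9}
missing_evals, failing_evals = check_thresholds(emitted, evaluator_thresholds)

print("missing (dimension ids):", missing_dims)
print("missing (evaluator names):", missing_evals)
print("below threshold:", failing_evals)
```

With the dimension ids every threshold lands in `missing`, which mirrors the "not found" error. With the evaluator names nothing is missing and only genuine pass-rate shortfalls are reported.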

azd ai agent eval emits one aggregate pass-rate metric per evaluator (coherence, fluency, smoke-core), not one metric per rubric dimension. Step 11.3 previously instructed readers to set thresholds on the dimension ids (correct_itinerary, adherence_to_constraints, clear_practical_notes), which always fails with `threshold metric(s) not found in azd results`. Switch the example thresholds to the evaluator names azd actually emits (0..1 pass-rate scale) and add a callout explaining why dimension-level thresholds are not supported today.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
@placerda placerda merged commit 2ae0a2a into develop Jun 10, 2026
12 checks passed
@placerda placerda deleted the docs/threshold-aggregate-rubric branch June 10, 2026 15:39