docs: threshold the aggregate rubric metric, not the dimension ids by placerda · Pull Request #295 · Azure/agentops

placerda · 2026-06-10T15:33:14Z

Step 11.3 instructed readers to set thresholds on rubric dimension ids (correct_itinerary, adherence_to_constraints, clear_practical_notes), but azd ai agent eval emits one aggregate pass-rate metric per evaluator (coherence, fluency, smoke-core) and does not surface dimension-level metrics. Following the tutorial as written always failed with threshold metric(s) not found in azd results. Switches the example to threshold smoke-core / coherence / fluency on the 0..1 pass-rate scale, adds a callout explaining why dimension-level thresholds aren't supported, and points readers at aggregate_metrics in results.json for the authoritative list.

azd ai agent eval emits one aggregate pass-rate metric per evaluator (coherence, fluency, smoke-core), not one metric per rubric dimension. Step 11.3 previously instructed readers to set thresholds on the dimension ids (correct_itinerary, adherence_to_constraints, clear_practical_notes), which always fails with `threshold metric(s) not found in azd results`. Switch the example thresholds to the evaluator names azd actually emits (0..1 pass-rate scale) and add a callout explaining why dimension-level thresholds are not supported today. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

placerda merged commit 2ae0a2a into develop Jun 10, 2026
12 checks passed

placerda deleted the docs/threshold-aggregate-rubric branch June 10, 2026 15:39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs: threshold the aggregate rubric metric, not the dimension ids#295

docs: threshold the aggregate rubric metric, not the dimension ids#295
placerda merged 1 commit into
developfrom
docs/threshold-aggregate-rubric

placerda commented Jun 10, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

placerda commented Jun 10, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant