Add local end-to-end test harness for the Ollama provider by adrianchung · Pull Request #75 · gke-labs/devops-bench

adrianchung · 2026-06-15T18:28:04Z

Summary

Scripts to exercise the local pipeline without model weights or a live cluster.

Scope

scripts/mock_ollama_server.py — mock of Ollama's OpenAI-compatible chat API; classifies calls as agent/steps/score and returns canned responses so DeepEval GEval metrics complete deterministically
scripts/run_ollama_e2e_test.sh — driver: starts the mock server, sets env, runs evaluate.py against a simple task end-to-end
scripts/setup_local_env.sh — one-shot setup of Ollama + kind + node image + model

Context

3 of 4 PRs splitting #72. These scripts depend at runtime on the Ollama provider (#74) and NoOpDeployer/BENCH_NO_INFRA (#73) — no file overlap, so it merges cleanly in any order, but it's only runnable once those two land. Recommended review/merge order: #73, #74, then this.

Scripts to exercise the local pipeline without model weights or a live cluster: - scripts/mock_ollama_server.py: minimal mock of Ollama's OpenAI-compatible chat API; classifies calls as agent/steps/score and returns canned responses so DeepEval GEval metrics complete deterministically. - scripts/run_ollama_e2e_test.sh: driver that starts the mock server, sets the Ollama env vars, and runs evaluate.py against a simple task end-to-end. - scripts/setup_local_env.sh: one-shot setup of Ollama + kind + node image + model. Depends at runtime on the Ollama provider and NoOpDeployer (BENCH_NO_INFRA).

adrianchung mentioned this pull request Jun 15, 2026

Support a local option to test for inner dev loop #72

Closed

3 tasks

adrianchung requested review from itssimrank and pradeepvrd June 15, 2026 20:05

pradeepvrd approved these changes Jun 17, 2026

View reviewed changes

pradeepvrd merged commit ea275e2 into gke-labs:main Jun 18, 2026
6 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add local end-to-end test harness for the Ollama provider#75

Add local end-to-end test harness for the Ollama provider#75
pradeepvrd merged 1 commit into
gke-labs:mainfrom
adrianchung:add-local-e2e-harness

adrianchung commented Jun 15, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

adrianchung commented Jun 15, 2026

Summary

Scope

Context

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants