Skip to content

Add local end-to-end test harness for the Ollama provider#75

Merged
pradeepvrd merged 1 commit into
gke-labs:mainfrom
adrianchung:add-local-e2e-harness
Jun 18, 2026
Merged

Add local end-to-end test harness for the Ollama provider#75
pradeepvrd merged 1 commit into
gke-labs:mainfrom
adrianchung:add-local-e2e-harness

Conversation

@adrianchung

Copy link
Copy Markdown
Collaborator

Summary

Scripts to exercise the local pipeline without model weights or a live cluster.

Scope

  • scripts/mock_ollama_server.py — mock of Ollama's OpenAI-compatible chat API; classifies calls as agent/steps/score and returns canned responses so DeepEval GEval metrics complete deterministically
  • scripts/run_ollama_e2e_test.sh — driver: starts the mock server, sets env, runs evaluate.py against a simple task end-to-end
  • scripts/setup_local_env.sh — one-shot setup of Ollama + kind + node image + model

Context

3 of 4 PRs splitting #72. These scripts depend at runtime on the Ollama provider (#74) and NoOpDeployer/BENCH_NO_INFRA (#73) — no file overlap, so it merges cleanly in any order, but it's only runnable once those two land. Recommended review/merge order: #73, #74, then this.

Scripts to exercise the local pipeline without model weights or a live cluster:

- scripts/mock_ollama_server.py: minimal mock of Ollama's OpenAI-compatible chat
  API; classifies calls as agent/steps/score and returns canned responses so
  DeepEval GEval metrics complete deterministically.
- scripts/run_ollama_e2e_test.sh: driver that starts the mock server, sets the
  Ollama env vars, and runs evaluate.py against a simple task end-to-end.
- scripts/setup_local_env.sh: one-shot setup of Ollama + kind + node image + model.

Depends at runtime on the Ollama provider and NoOpDeployer (BENCH_NO_INFRA).
@pradeepvrd pradeepvrd merged commit ea275e2 into gke-labs:main Jun 18, 2026
6 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants