GitHub - akronim26/match-bench: A judging and benchmarking platform for trading algorithms

Fair, repeatable benchmarking for high-frequency trading algorithms.

A benchmark is only useful if nobody can game the measurement.

match-bench lets participants upload trading algorithms, runs each submission in an isolated environment, sends every participant the same deterministic market workload, measures latency outside the participant's code, validates correctness, and publishes scores on a live leaderboard.

Overview

match-bench is a multi-service benchmark platform for evaluating untrusted trading algorithms. A submitted algorithm is built into a container, deployed into a locked-down Kubernetes pod, driven by deterministic benchmark traffic, measured at the Linux network boundary, checked against a correctness model, and scored.

Benchmark Snapshot

The figures below are measured on EKS (or the local k3s HDR harness where noted), not projected. Methodology, the raw data files, and the bottleneck hunt behind each number are in the architecture doc's benchmarking section.

Measurement	Result
Order generation, telemetry off (drain), per botworker node	~600–790k orders/sec (peak ~748k on one `c6i.xlarge`)
Order generation, telemetry on (single worker → 1 broker)	~445k orders/sec, lossless
Measurement pipeline (eBPF capture → Kafka → ingester)	lossless ≥ ~144k samples/sec (own ceiling not yet reached)
Single echo-contestant pod, cross-node	~150k delivered/sec (caps before the pipeline)
Kernel-stamped service-time p99 (healthy)	~98 µs
Platform self-benchmark tier (target)	~2M orders/sec — 3 botworker nodes, 2-broker Kafka, 96 partitions, 8 ingesters

The load generator is not the bottleneck at contest scale: a single node sustains hundreds of thousands of orders/sec, and the data plane scales out across Kafka partitions (24 in prod, 96 in the bench tier). The 2M/s tier is a documented, partially-validated target — the drain path (generation + telemetry + ingester) is ready to validate; eBPF-capture and echo-contestant capacity at 2M/s are still unverified. Revalidate on the exact node type, kernel, networking mode, and Kafka settings used for the event.

Key Features

Untrusted-code sandboxing: participant algorithms run in isolated Kubernetes pods with hardened security settings.
Deterministic workloads: every participant receives the same logical test stream for a given seed and workload mix.
Kernel-side latency measurement: request and response timing is captured outside the participant's process using Linux eBPF hooks.
Correctness validation: outputs are replayed through a reference model before a score is accepted.
Latency histograms: telemetry is aggregated into HDR histograms for percentile-based analysis.
Live leaderboard: APIs and frontend expose scores, run details, and live updates.
Cloud-ready deployment: Terraform and Kubernetes manifests support AWS EKS deployment.
Local development stack: Docker Compose brings up the core data, Kafka, platform, and observability dependencies.

How It Works

For each submission, match-bench follows this flow:

The participant uploads an algorithm.
The build system creates a runnable container image.
The sandbox orchestrator starts the algorithm in an isolated pod.
The bot fleet sends deterministic market traffic to the algorithm.
The latency service records when requests enter and responses leave the pod.
Telemetry services aggregate latency and throughput metrics.
The correctness validator checks whether the algorithm behaved correctly.
The score computer produces the final leaderboard score.

The important detail is where measurement happens. match-bench does not ask the submitted algorithm how fast it was. It observes network traffic from outside the algorithm and computes latency from those observations.

Architecture

The full, code-derived architecture lives in docs/architecture/ARCHITECTURE.md — a standalone deep reference with a section per microservice, the Kafka topology and partitioning model, the horizontal-scaling design, the benchmarking numbers, the bottleneck hunt, and the consolidated open items. It was written by reading the source directly, with every non-obvious claim cited to a path:line.

It contains fresh, code-derived diagrams: a system-context map, the run-lifecycle sequence, the Kafka topic graph, and the node-pool isolation model. design.md remains the detailed engineering-rationale companion.

Tech Stack

Area	Technology
APIs and orchestration	Go
Load generation and telemetry	Rust
Frontend	Next.js
Event bus	Kafka
Metadata storage	PostgreSQL
Time-series metrics	TimescaleDB
Hot snapshots	Redis
Artifact storage	MinIO / S3
Runtime isolation	Kubernetes
Kernel measurement	eBPF, XDP, tc
Observability	Prometheus, Grafana, Loki
Cloud infrastructure	Terraform, AWS EKS, ECR, IRSA, KEDA

Repository Layout

frontend/                  Web application
services/                  Backend, benchmark, telemetry, and scoring services
libs/                      Shared Go and Rust libraries
schemas/                   Shared event and Kafka topic schemas
k8s/                       Kubernetes manifests
infra/                     AWS Terraform and deployment Makefile
ops/                       Prometheus, Grafana, Kafka, and operational config
bootstrap/                 Secret templates and bootstrap notes
docs/                      Local run and cloud deployment guides
design.md                  Detailed engineering design

Getting Started

There are two local paths, both current:

1. Dependency stack + services from source (fast dev loop). Bring up Postgres, TimescaleDB, Redis, Kafka, and MinIO (optionally the platform APIs and observability too) with Docker Compose, then run individual services from source. Full steps and the local endpoint table are in docs/local-run.md.

cp .env.example .env
docker compose -f docker-compose.yml -f docker-compose.kafka.yml up -d   # core deps + Kafka topic init

2. Full platform on local k3s (no-cost end-to-end demo). deploy-local/up.sh deploys the entire stack — data tier, all services, eBPF capture, telemetry, validator, and frontend — onto a single local k3s node (1-broker Kafka, all replicas 1, gVisor off, tiny scenarios). This mirrors the EKS end-to-end run without the cloud cost and is the demo path.

deploy-local/up.sh        # builds images into the in-cluster registry, then applies the platform

Testing

Test commands are maintained separately:

Testing command reference

The test guide covers Go tests, Rust tests, frontend checks, integration tests, and Kubernetes smoke checks.

Deployment (AWS EKS)

The recommended end-to-end path is the e2e/ suite — clone → Terraform → deploy → run a real benchmark with every service on (load generator, eBPF capture, telemetry, validator, scoring, leaderboard). It is the most current and complete guide:

e2e/README.md — topology, contestants, scenarios, the required capture-fidelity setup, validated ceilings, and the full run order.
e2e/cluster.md — Terraform apply/destroy, kubeconfig, and re-scaling (you run Terraform).

cd infra/terraform && AWS_PROFILE=iicpc terraform apply -var-file="$(git rev-parse --show-toplevel)/e2e/e2e.tfvars"
AWS_PROFILE=iicpc aws eks update-kubeconfig --name iicpc-prod --region us-east-1
e2e/01-images.sh && e2e/02-bootstrap.sh && e2e/04-scenarios.sh   # then run via the UI, or e2e/run.sh <scenario>

For deeper or alternative needs:

DEPLOYMENT_EKS.md — the detailed, manual EKS reference (node groups, network policies, IRSA, KEDA, ALB controller, secrets, dependency-correct deploy order, gVisor).
Scaling tiers: bench/ drives the ~2M orders/s platform self-benchmark (multi-broker Kafka, 96 partitions, 8 ingesters); deploy-bench/ runs the load-generator capacity sweep (drain sink, 1→2 nodes).
Supporting: infra/README.md, the secret bootstrap guide, and the higher-level docs/aws-deployment.md.

Free-plan caveat: new AWS Free-plan accounts cap nodes at 2 vCPU, which cannot run the platform (the data tier alone needs ~5 vCPU and the exclusive-core sandbox needs ≥4 vCPU). Use a paid account for EKS, or the local k3s path above for a no-cost demo. See the architecture doc's deployment section for details.

Architecture Document

Platform Architecture — canonical, code-derived, standalone
DeepWiki — community-generated; mostly correct but ~2–3 days behind on the latest benchmarking and bug-fix work
Google Doc

Demo Video

TODO: Add link to the project demo video.

Suggested target:

Product walkthrough
Benchmark run demo
Leaderboard and telemetry demo

Design Notes

The detailed system design is documented in design.md. It explains the measurement model, deterministic workload generation, sandboxing strategy, telemetry pipeline, correctness validation, and scoring model.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Fair, repeatable benchmarking for high-frequency trading algorithms.

Table Of Contents

Overview

Benchmark Snapshot

Key Features

How It Works

Architecture

Tech Stack

Repository Layout

Getting Started

Testing

Deployment (AWS EKS)

Architecture Document

Demo Video

Design Notes

Development Team

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 154 Commits
.github/workflows		.github/workflows
bench		bench
bootstrap		bootstrap
deploy-bench		deploy-bench
docs		docs
e2e		e2e
frontend		frontend
infra		infra
k8s		k8s
libs		libs
mermaids		mermaids
ops		ops
schemas		schemas
services		services
tools		tools
wiki		wiki
.env.example		.env.example
.gitignore		.gitignore
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
README.md		README.md
design.md		design.md
docker-compose.kafka.yml		docker-compose.kafka.yml
docker-compose.observability.yml		docker-compose.observability.yml
docker-compose.platform.yml		docker-compose.platform.yml
docker-compose.yml		docker-compose.yml
go.work		go.work
go.work.sum		go.work.sum
mkdocs.yml		mkdocs.yml
requirements-docs.txt		requirements-docs.txt

Folders and files

Latest commit

History

Repository files navigation

Fair, repeatable benchmarking for high-frequency trading algorithms.

Table Of Contents

Overview

Benchmark Snapshot

Key Features

How It Works

Architecture

Tech Stack

Repository Layout

Getting Started

Testing

Deployment (AWS EKS)

Architecture Document

Demo Video

Design Notes

Development Team

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages