diff --git a/cookbook/README.md b/cookbook/README.md
index f96677b..4b4f80b 100644
--- a/cookbook/README.md
+++ b/cookbook/README.md
@@ -28,3 +28,4 @@ The agent will walk through each step, ask you for any required inputs (model, t
 | [openai-agents-sdk](openai-agents-sdk/) | Customer-style OpenAI Agents SDK cookbook showing how to keep an existing agent app and add a thin SimLab adapter. |
 | [secure-agent-eval](secure-agent-eval/) | Evaluate agent behavior through OneCLI's credential proxy — compare correctness, audit for credential leakage, and test rate limit resilience. |
 | [simlab-auto-research](simlab-auto-research/) | Autonomous system prompt optimization using the [auto-research](https://github.com/karpathy/autoresearch) pattern. An outer agent iterates on prompts, measured by SimLab task scores. |
+| [prime-rl-training](prime-rl-training/) | Collect SimLab trajectories and train agent models with Prime Intellect's prime-rl (SFT warmup + hosted RL). |
diff --git a/cookbook/prime-rl-training/.gitignore b/cookbook/prime-rl-training/.gitignore
new file mode 100644
index 0000000..6343811
--- /dev/null
+++ b/cookbook/prime-rl-training/.gitignore
@@ -0,0 +1,9 @@
+# Generated artifacts (re-create with run_pipeline.sh)
+output/
+dataset/
+dist/
+.prime/
+__pycache__/
+*.pyc
+taskgen/
+generated-tasks/
diff --git a/cookbook/prime-rl-training/SKILL.md b/cookbook/prime-rl-training/SKILL.md
new file mode 100644
index 0000000..f815a8f
--- /dev/null
+++ b/cookbook/prime-rl-training/SKILL.md
@@ -0,0 +1,161 @@
+# Prime-RL Training with SimLab Trajectories
+
+Train agent models with Prime Intellect's prime-rl using SimLab-collected trajectories.
+
+## Prerequisites
+
+Before starting, confirm:
+
+1. SimLab is installed: `simlab --version`
+2. prime CLI is installed: `prime --version`
+3. `SIMLAB_COLLINEAR_API_KEY` is set
+4. `PRIME_API_KEY` is set
+5. `OPENAI_API_KEY` is set (for baseline agent)
+
+If any prerequisite is missing, tell the user what to set and **wait before proceeding**.
+
+## Workflow
+
+### 1. Install cookbook dependencies
+
+```bash
+cd cookbook/prime-rl-training
+uv sync
+```
+
+### 2. Create SimLab environment
+
+```bash
+simlab templates list
+```
+
+Ask the user which template to use (default: `customer_service`).
+
+```bash
+simlab env init prime-rl-env --template