Skip to content

13point5/swe-grep-oss

Repository files navigation

swe-grep-oss

Overview

  • Environment ID: swe-grep-oss
  • Short description: Environment for evaluating and developing models like SWE-grep

result

Datasets

Task

  • Type: <single-turn | multi-turn | tool use>
  • Parser: <e.g., ThinkParser, XMLParser, custom>
  • Rubric overview:

Quickstart

Run an evaluation with your model of choice (repos are cloned automatically and deleted after each rollout):

  • Default rollout clone root: system temp directory under swe-grep-oss-repos
  • Rollout directories are unique per rollout and look like <repo>_<instance_id>_<random_suffix>
  • Repositories are cloned directly at the target commit with git clone --revision <sha> --depth 1 when supported, with a git init + fetch fallback for older Git versions
  • Set SWE_GREP_ENV_BACKEND=sandbox to switch from the default local env to a sandbox-backed env
  • The sandbox variant uses a minimal public image (python:3.11-slim) with 1 CPU core, 2 GB RAM, and 5 GB disk, then installs git, jq, and ripgrep during setup before checking out the repo into /workspace/repo
uv run vf-eval swe-grep-oss \
  --api-base-url https://api.openai.com/v1 \
  --api-key-var OPENAI_API_KEY \
  --model "gpt-4o-mini" \
  --num-examples 2 \
  --rollouts-per-example 1

About

An RL environment similar to Cognition's SWE-Grep

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages