Releases: 8-Sync-Dev/su-code
v0.24.0 — AI TEAM + /gs handshake + Mnemopi memory + token-bench + eval --project
AI team, made real + proven on real codebases.
Added
8synchelp leads with AI TEAM (8sync harness+/gs) — the two most important commands are now the first section; fixed stale lines (skill sync→skill update,up=8sync-only)./gs <goal>scope-handshake — on a medium+/ambiguous goal it grounds, proposes 2–4 concrete options (scope · team · effort · tradeoff, from the senior bench) + asks before building.autostays unattended.- Mnemopi long-term memory wired into
8sync harness+doctor— API-only (llmMode: smol+noEmbeddings), no local model, runs on any machine. Recall/retain durable project memory across sessions ("deep awareness, never forgets"). token-benchskill (bundled,uv/PEP723, stdlib-only) — proves code-intel token savings on a real repo with a correctness gate. Measured: 8syncdev-pro-v2 −96.6%, gsd-pi −78.6% (range 54–98%, scales with footprint; honestly negative for tiny 1-file symbols). Multi-language (TS/JS/Rust/Go/Py). Correctness gsd-pi 10/10.8sync harness eval --project— agent-team readiness scorecard (% per role: dev/qa/research/ba-po/fe/be/docs/memory/token-opt). Deterministic capability coverage (not output quality). su-code 93%.8sync harness evalreports%(3/3 = 100%).- 6 reference submodules for inspecting/tracking upstream (deinit by default): gstack · gsd-pi · agent-reach · addyosmani/agent-skills · DietrichGebert/ponytail · DeusData/codebase-memory-mcp.
- Docs:
outputs/agent-team-workflow-automation-plan.md(UI/UX Design Lane + Lighthouse gate) ·outputs/omp-customization-memory-platform-research.md(no-train/GGUF reality, API-only memory, omp-native customization).
Changed
- Declutter skill registry — dropped the
companion-inc/feynmanresearch pack (20 on-demand skills, wrong domain for a coding harness; pure prefix noise). on-demand 55 → 36; force-load prefix ~1998 → ~1717 tok; stable-prefix A1 PASS.
Notes
- Loop-quality probe (
harness eval) is non-deterministic (model + network) — a signal, not a CI gate; recent runs 2–3/3. - "Self-learn" = memory accumulation (Mnemopi + cbm + spine), not weight training/fine-tune (omp is inference-only; local models are ONNX q4 for titles/memory, not GGUF).
v0.23.0 — harness eval + concrete /gs L3 worktree isolation
Measure the loop, isolate the loop.
Added
8sync harness eval— loop quality probe. Runs a fixed task-suite through omp (omp -p --no-session --auto-approve) and scores each task with a deterministicverify.shthe agent can't game. Fixtures:fix-failing-test,add-fn-with-test,locate-symbol. JSON scorecard +--baselinereference in the gitignored.cache/8sync/eval/; later runs print the pass delta. Non-deterministic quality signal, not a CI gate. Verified 3/3.
Changed
/gsL3 worktree isolation is now concrete:git worktree add .gs/wt/<slug> -b gs/<slug>→ implement+verify+commit there →git worktree remove; never editmain's tree.
Phase 3b (gstack omp host) deferred — additive with a working bundled fallback; the host lives in the deinitialized gstack submodule, not su-code.
Install: curl -fsSL https://raw.githubusercontent.com/8-Sync-Dev/su-code/main/install.sh | sh
v0.22.0 — doc-hygiene audit + AI-engine health + loop fixes
Make the engineering-loop harness verifiably correct.
Added
8sync harness audit— code-backed doc-hygiene: stale repo-relative paths in docs, oversized docs (>400 lines / >120-line force-load block), 30-day churn hotspots. Report-only.8sync doctorshows a one-line summary;/gsdoc-hygiene runs it.8sync doctorAI-engine health check — codegraph + codebase-memory-mcp + headroom must be installed AND registered in omp (enforces STEP 0 token discipline).
Fixed
- codegraph STEP 0 verbs corrected to
query/callers/callees/impact(weresearch/deps/defs, nonexistent in 0.9.6). - force-load now dedups skills by frontmatter name (the stale
karpathydir double-listedkarpathy-guidelines). - impeccable
.agents/skills→agents/skills(28 refs) so its setup scripts run under 8sync's layout.
Changed
- managed
.gitignoreignores.gs/.
Install: curl -fsSL https://raw.githubusercontent.com/8-Sync-Dev/su-code/main/install.sh | sh
v0.21.0 — right-size /gs (fix post-/gs quality regression)
Eval + deep-research showed the post-/gs quality drop was process over-engineering, not tokens (harness bench: ~8.5k upfront, 79% saved, KV-cache stable). /gs was a 93-line command forcing a team + full Closeout on every task, with auto "never asking".
Changed — /gs redesigned to right-size effort
- Right-size first — trivial/small → solo (no team, no Closeout) · medium → solo + one verifier · large → full loop + roles + Closeout. A team is the exception you justify (Cognition/Anthropic: single-agent default).
- Solo-by-default delegation — subagents only for parallel-independent / context-isolation / specialization; scoped objective + summary return (never free-form, never inline transcript).
- Autonomy confidence-gated — a high-stakes hard-to-undo low-confidence call is now a blocker (Anthropic 2026: "agents learning when to ask"); prefer reversible, never compound.
- Doc-hygiene — detect stale paths / junk / superseded docs → fix or delete (no addition without the matching deletion); keep docs lean.
- Codebase-history — git log/blame + DECISIONS + cbm before load-bearing edits.
- Leaner command — 93 → 56 lines (lower constraint density → better instruction-following).
Research brief: outputs/gs-eval-improve-research-brief.md (+ provenance). Redeploy with 8sync harness. Upgrade: curl -fsSL https://raw.githubusercontent.com/8-Sync-Dev/su-code/main/install.sh | sh
v0.20.1 — /gs truly unattended + QA/Closeout + reference submodules
Fixed
/gs autonow truly runs unattended. Autonomy contract: inautothe team NEVER asks — it resolves unknowns by research (codegraph/cbm → memory/PLAYBOOKS → skills → web_search/autoresearch/deep-research), picks the boring/reversible option, logs it under## Assumptionsin STATE, and proceeds. A "blocker" is now only a missing credential / external approval / destructive-irreversible action. (omp's defaulttools.approvalMode: yoloalready auto-approves tools; a slash command cannot bypass the approval gate.)/gsargument hint when you type/gs(front-loaded description +argument-hintfrontmatter).- QA + test are first-class — per-slice verify-gate runs tests + a QA pass (no skipping/weakening), plus a mandatory Closeout (full test suite + end-to-end QA + independent re-review vs Definition-of-Done + handoff summary) before the loop reports done.
Added
reference/gstack+reference/gsd-pigit submodules for studying the patterns behind/gs. Pointers committed; working trees deinitialized by default so they never bloat the index (codegraph honors no exclude). Study on demand:git submodule update --init reference/<name>.
Bare 8sync harness = full auto-setup (MCP + skills + /gs + memory + inject + index) in one command.
v0.20.0 — /gs autonomous engineering-team loop
Added
/gs— one command, a whole engineering team that runs until done./gs <goal>plans + runs · bare/gsresumes ·/gs autoruns unattended ·/gs status|next|stop. Drives the loop-engineering v2 (A–E) machinery offagents/STATE.md: plan → delegate to specialist roles (tasksubagents / gstack roles if installed) → verify-gate → commit → record (KNOWLEDGE/PLAYBOOKS) → advance until Definition-of-Done or a blocker.- Token-lean + guardrailed. codegraph + codebase-memory-mcp + headroom mandatory; verify-gate before every commit; worktree isolation + no push/PR at L3; hard-stop via
/gs stop. Modeled on gsd-pi/gsd auto. - Team-shared.
8sync harness/init/updeploy it to~/.omp/agent/commands/gs.md(global) and<repo>/.omp/commands/gs.md(committed → whole team gets/gs). Run 24/7 with8sync harness up --timer 30m. New on-demandgsskill documents the protocol.
bench on this repo: A1 stable-prefix PASS, upfront ~7.3k tok, A2 progressive-disclosure saved 81%.
v0.19.0 — loop-engineering v2 + harness bench
Loop-engineering v2 (Phases A–E) grounded in 2026 context-engineering research, plus a deterministic benchmark verb.
Added
8sync harness bench— deterministic loop-engineering scorecard (no model calls): upfront context budget (force-load prefix + CORE skill bodies + memory spine) vs deferred (SPECIALIST + on-demand), A2 progressive-disclosure saving, A1 KV-cache stable-prefix gate. This repo: upfront ~7.1k tok, A2 saved ~32.3k tok (81%), A1 PASS.agents/PLAYBOOKS.md— Voyager-style procedural memory (validated runbooks indexed byWhen:).
Changed (loop engineering v2)
- A token & stable-prefix: CORE/SPECIALIST progressive disclosure, mandatory
headroom_compress(>~50 lines), KV-cache-stable KNOWLEDGE breadcrumb (no volatile epoch). - B live memory: structured
agents/STATE.mdlive-plan recitation + compaction + budget-awareness. - C maker/checker:
taskimplementer ↔ independent verifier (verify-gate before commit) + Reflexionfailure:capture. - D procedural memory + tiering (KNOWLEDGE / PLAYBOOKS / DECISIONS).
- E phased autonomy L1→L3 + guardrails (no auto push/PR at L3); timer-loop per-tick job.
Plan + provenance: outputs/harness-loop-engineering-v2-plan.md.
v0.18.1
Fixed
8sync harness initnow pulls registered manifest skills —initrunsskill updateagainstagents/skills.toml(git collections likefeynman: deep-research, autoresearch, …) before mirroring, making it a true superset of bare8sync harness. Previouslyinitonly deployed the bundled skills + 2 hardcoded external packs (ponytail, addyosmani), so manifest-only skills never reachedagents/skills/viainit— only bare8sync harness/up --pulldid.
v0.18.0 — Headroom context-compression wired as an omp MCP
8sync harness now auto-sets-up Headroom (context compression) as a third token-efficiency engine alongside codegraph + codebase-memory-mcp.
Added
- Headroom MCP —
8sync harness/initauto-installsheadroom-ai[mcp](uv → pipx → pip) and registersheadroom mcp serve(stdio) in~/.omp/agent/mcp.json. Toolsheadroom_compress/headroom_retrieve/headroom_statscompress long tool outputs / logs / diffs 60–95% before they reach the model. Force-injected into STEP 0 +00-force-load.md.
Researched, skipped
- PixelRAG (screenshot-RAG) — overlaps
8sync shot/ ompread/browser. - LocateAnything3D — 3D-vision model, out of scope for a coding harness.
Restart omp (or /mcp reload) to load the new server. Upgrade: curl -fsSL https://raw.githubusercontent.com/8-Sync-Dev/su-code/main/install.sh | sh
v0.17.1 — fix: skills propagate to other machines (committed agents/skills.toml)
Fix: 8sync harness now reproduces your skills on a second machine.
Root cause
The skill registry that records what to pull (~/.config/8sync/skills.toml) is machine-local and never travels. On a fresh machine, 8sync harness re-pulled nothing beyond the 15 binary-embedded skills + 2 hardcoded external packs — every skill add-ed source (e.g. git collections like feynman) was invisible.
Fix
harness / skill update now write a committed project manifest agents/skills.toml (union of the machine-local registry) and read it back on any machine. A fresh git clone + 8sync harness re-pulls the exact same skills. agents/skills.toml is a file, so it travels even when the agents/skills/ directory is gitignored.
To propagate skills to another machine
- On machine A:
8sync harness(writesagents/skills.toml), then commit + push it. - On machine B:
git clone→8sync up(get ≥ v0.17.1) →8sync harness→ skills re-pulled from the manifest.
Upgrade: curl -fsSL https://raw.githubusercontent.com/8-Sync-Dev/su-code/main/install.sh | sh