Skip to content

Latest commit

 

History

History
235 lines (175 loc) · 18.4 KB

File metadata and controls

235 lines (175 loc) · 18.4 KB

Changelog

All notable changes to this skill are documented here.


[Unreleased]


[3.6.0] — 2026-05-05

Added

  • Node observer + npx continuous-improvement backfill (#52) — replaces the bash thin-schema fallback that depended on jq. The new bin/observe.mjs reads stdin, parses the hook payload natively, and writes the rich event schema (tool_input.command for Bash, Edit.file_path for Edit/Write/Read, tool_output for tool_complete) without external dependencies. hooks/observe.sh becomes a two-phase shim: prefer the Node observer when present, fall back to the prior bash thin-schema path when not, so operators who do not re-run npx continuous-improvement install see no behavior change. The companion backfill subcommand walks existing observations.jsonl files and tags every row with schema: "thin" | "rich" so the analysis pass can cleanly skip thin rows and surface a "X% thin" stat to operators. Idempotent; preserves operator data via .bak and observations.corrupt.jsonl quarantine. Closes the audit-derived gap where 22,065 observations across 11 projects on a jq-less host yielded 0 auto-detected instincts. Live backfill against the maintainer's host: 25,077 rows tagged → 24,547 thin (97.9%), 530 rich (2.1%), across 14 projects.
  • Proactive Roadmap Surfacing section in wild-risa-balance (#53) — names the surface-don't-execute boundary explicitly. Trigger conditions (persistent roadmap, finished tasks implying next steps, drift, instinct/memory predictions); hard boundary citing global CLAUDE.md and Auto Mode rules; format with (surfaced — <source>) marker; anti-patterns (nagging, citation-free speculation, bundling surface with execution, inventing roadmaps).
  • meta instinct pack (#50) — promotes the two cross-project reflection-instincts (skip-thin-observation-schema, parallelize-independent-tool-calls) from per-project YAML into a shared starter pack. Test loop drives off PACK_FLOORS so language packs keep ≥5 floor while meta ships at ≥2.

Changed

  • README install ergonomics (#50) — jq listed alongside Node and bash in Preconditions with per-OS install commands; new "Operator modes" section adjacent to install with both bash/zsh and PowerShell export syntax for CLAUDE_THREE_SECTION_CLOSE_DISABLED.
  • CONTRIBUTING.md Source of truth: src/ callout (#50) — hoisted as a one-line warning at the top of ## Architecture; the existing edit-src-then-build workflow at lines 101-118 was correct but buried.

Fixed

  • hooks/observe.sh jq-missing one-shot warning (#50) — emits a single stderr line per host on the first invocation when jq is absent on PATH, so operators learn the auto-instinct gap at install time instead of discovering weeks of thin-schema collection. Marker lives at ~/.claude/.continuous-improvement-jq-warned, deliberately outside ~/.claude/instincts/ so directory iterators are unaffected.

[3.5.0] — 2026-05-04

Added

  • Audience-tier system in wild-risa-balance — beginner emits 3–5 goal-driven items with no WILD/RISA labels; expert keeps the ≥7 floor (2 WILD + ≥5 RISA). proceed-with-the-recommendation Phase 1 validates against the tier contract instead of a flat floor, and the Phase 7 close renders the tier suffix in the heading (## Recommendation (expert|beginner)) so the audit trail records which tier produced the list.
  • Recommendation: no escape valve in both tiers of wild-risa-balance and proceed-with-the-recommendation. When no real recommendation can be produced without padding, the close ships a literal no body under the tier-suffixed heading — an explicit operator handoff signal (switch session, switch specialist agent, switch framing, sleep on it), never a silent skip. Padding to hit the floor is the failure mode this prevents.
  • CLAUDE_THREE_SECTION_CLOSE_DISABLED=1 operator opt-out for hooks/three-section-close.mjs. When set, the hook short-circuits before any enforcement or telemetry. Per-operator escape hatch for cases where end-of-turn reflection should run as internal thinking instead of visible output. Public default unchanged — the rule still fires for everyone else. Test infrastructure (buildIsolatedEnv() + 5 manual env constructions) now strips the env var before spawning the hook so existing enforcement tests cannot silently disable themselves when the developer has the flag set.

Tests

  • Locked the no escape valve literals in verify:docs-substrings (102 → 112 assertions) and in the wild-risa-tiers test (22 → 32 assertions, 5 literals × 2 mirrors). Each lock carries a rationale string naming the behavior it defends so a future maintainer reading a failure understands why the literal is locked.

[3.4.1] — 2026-05-03

Added

  • verify:everything-mirror lint (npm run verify:everything-mirror) — walks plugins/continuous-improvement/ and asserts every non-skill file has a byte-identical sibling at the same relative path under the repo root. Closes the drift-protection gap that check-skill-mirror.mjs left open for commands/, hooks/, instinct-packs/, templates/, lib/, bin/mcp-server.mjs, and LICENSE. Currently passes 23 mirrored files. Allowlists the four plugin-only surfaces (.claude-plugin/ manifests, the bundle README, hooks/hooks.json, the generated skills/README.md).
  • verify:all umbrella script — single contributor gate that runs all 5 verify lints + typecheck cheapest-first. Replaces the six-line per-command checklist in CONTRIBUTING.md.
  • bin/pre-commit-block-strays.sh pre-commit hook — refuses any commit whose staged paths match ^.tmp-stop-e2e/ or ^nanobanana-output/. Belt-and-suspenders alongside the new .gitignore entries; .gitignore is silent if bypassed with git add -f, this hook is loud. Install snippet documented in CONTRIBUTING.md.

Changed

  • gateguard Law tag reconciled to "Law 1" only across README.md Tier-1 row and skills/README.md. The skill's frontmatter source-of-truth declares only Law 1; the Tier-1 rows previously over-claimed "Law 1 + Law 3". The three views now agree.
  • skills/README.md superpowers row rewritten to match the 3.4.0 "Law activator" reframe already in the source skill frontmatter and root README.md. The 3.4.0 reframe (commit 2ddea8a) missed this row.
  • README.md plugin-marketplace row — fixed stale "4 skills" claim → "13 skills" (the actual bundle size verified by check-skill-mirror.mjs).
  • docs/testing/proceed-with-the-recommendation.TESTING.md footnote added next to the historical RED/GREEN URLs explaining the 3.4.0 rename (proceed-with-claude-recommendationproceed-with-the-recommendation) so the pre-rename URLs in test artifacts no longer surprise readers.

Removed

  • 6 unreferenced nanobanana-output/*.jpg images (3.1 MB) — committed in 3f0dbcb as a one-off save and never referenced anywhere in the repo (verified via grep across non-vendor paths). Recoverable from git history if ever needed.
  • .tmp-stop-e2e/transcript.jsonl — leftover test artifact from a stop-hook E2E run.

Fixed

  • .gitignore — was a single line (node_modules/). Now also blocks .tmp-stop-e2e/, nanobanana-output/, *.tmp, and dist/ so the deleted artifacts (and similar future cruft) cannot re-enter via a careless stage.

[3.4.0] — 2026-05-03

⚠️ Breaking

  • Marketplace dropped 8 third-party PM plugin entries (pm-data-analytics, pm-execution, pm-go-to-market, pm-market-research, pm-marketing-growth, pm-product-discovery, pm-product-strategy, pm-toolkit) to refocus the marketplace on the 7 Laws of AI Agent Discipline. After updating, anyone with those plugins installed from this marketplace loses the update source — the plugins keep working until uninstalled, but /plugin marketplace update continuous-improvement will no longer resolve them. To keep them, install from a separate marketplace or re-add the entries downstream.

Added

  • /seven-laws slash command — brand-aligned alias to /continuous-improvement, so the 7 Laws name surfaces directly in the command palette without breaking the existing entrypoint
  • Skill Law-tag lint (npm run verify:skill-law-tag) — CI lint that requires every non-core skill description to lead with the Law it enforces, preventing description drift from the 7-Laws frame
  • README Law Coverage matrix — explicit map from each bundled skill / command / hook / instinct pack to the Law it serves, so contributors can see at a glance which Laws are well-covered and which need work

Changed

  • Skill descriptions lead with their Law — Laws 1–7 source skills and the orchestrator now open with the Law they enforce, replacing generic blurbs with intent-first framing that matches the lint
  • superpowers reframed as a Law activator, not a peer skill — clarified in skill description and README so users stop treating it as one option among many
  • Renamed proceed-with-claude-recommendationproceed-with-the-recommendation — drops Claude-specific branding from the skill identifier so the same skill can be installed into non-Claude agents (Codex, Gemini CLI, etc.). Identifier-only rename: file paths, frontmatter name:, slash command, install snippets, and cross-references updated. Body language about "Claude-emitted recommendation" is intentionally untouched in this release; that agent-genericization pass is a separate follow-up. Old-name installations need to re-run the install snippet under the new path.
  • Version bump to 3.4.0

Migration

  • If you depend on any of the 8 dropped PM plugins, pin them via a separate marketplace before running /plugin marketplace update continuous-improvement. Existing installs continue to work; only the update path is removed.

[3.3.0] — 2026-04-25

Added

  • proceed-with-claude-recommendation companion skill — walks a Claude recommendation list top-to-bottom, routes each item to the right specialist (superpowers:*, schedule, loop, simplify, security-review, etc.), falls back to inline behavior when the specialist is not installed, verifies per item, and stops at items that need user approval
  • /proceed-with-claude-recommendation slash command — entrypoint that runs the companion skill on the most recent list of Claude recommendations
  • Skill hardening — explicit guardrails against the three most common rationalizations (skipping verification, bundling items, silently deferring approval-needed items)
  • Pressure-test baseline log — recorded under reports/ so regressions in skill behavior are detectable

Changed

  • 7-Laws engine integrationproceed-with-claude-recommendation now routes through the same research → plan → verify → reflect flow enforced by SKILL.md, so per-item behavior matches the core 7 Laws rather than running as a parallel track
  • Plugin bundleplugins/continuous-improvement/ now ships the new skill and command (generated by npm run build), so Claude Code marketplace installs and Codex plugin bundles pick them up without extra steps
  • Installernpx continuous-improvement install now deploys /proceed-with-claude-recommendation alongside the existing core commands
  • Skills bundler filter hardened — the generator now only treats kebab-case *.md files as skills, so reference logs like *.TESTING.md no longer leak into the plugin as fake skills
  • Version bump to 3.3.0

Docs

  • Clarified in README.md that ci is a separate unified workflow CLI and not a shorthand for the continuous-improvement installer
  • Aligned docs/unified-plugin-guide.md version reference with the current package version

[3.2.0] — 2026-04-19

Added

  • Planning-With-Files workflow — opt-in persistent project-root planning via task_plan.md, findings.md, and progress.md
  • /planning-with-files command — initialize, inspect, checkpoint, and recover file-based plans in Claude Code
  • Planning templates — packaged repo-owned templates under templates/planning-with-files/
  • Expert MCP planning toolsci_plan_init and ci_plan_status for initializing and summarizing planning files programmatically

Changed

  • Expert plugin surface — expert mode now exposes 12 tools instead of 10
  • Installer — now installs /planning-with-files alongside the existing Claude commands
  • Docs and metadata — updated README, quickstart, skill docs, marketplace metadata, and translations to document the opt-in planning workflow
  • Version bump to 3.2.0

[3.0.0] — 2026-04-06

Added

  • MCP server (bin/mcp-server.mjs) — zero-dependency JSON-RPC stdio server exposing instincts as MCP tools and resources. Works with Claude Code, Claude Desktop, Cursor, Zed, Windsurf, VS Code.
  • Beginner / Expert modes--mode beginner (default, 3 tools) vs --mode expert (8 tools with import/export, observation viewer, manual instinct creation, confidence tuning)
  • MCP-only mode--mode mcp for editors that support MCP but not Claude Code hooks
  • Plugin manifestsplugins/beginner.json and plugins/expert.json describe available tools per mode
  • Session hooks (hooks/session.sh) — SessionStart loads instincts and shows status; SessionEnd reminds to reflect
  • Import/export toolsci_export and ci_import for sharing instincts as JSON between team members
  • 34-test suite — added MCP server tests (beginner + expert mode) and plugin config validation. Up from 20 tests.
  • Multi-editor MCP support — installer patches both settings.json and claude_desktop_config.json

Changed

  • Installer upgraded--mode flag replaces single-mode install. Supports beginner, expert, mcp
  • Uninstaller upgraded — cleans up MCP server config, session hooks, and desktop config
  • Version bump to 3.0.0 — breaking change: new install modes and MCP server architecture

[2.3.0] — 2026-04-06

Changed

  • Public npm — package name changed from @naimkatiman/continuous-improvement to continuous-improvement. Removed GitHub Packages publishConfig. npx continuous-improvement install now works for everyone.
  • Expanded keywords — added claude-code-skill, agent-skill, gemini-cli for better npm discoverability
  • Improved description — package description now leads with the value prop, lists supported platforms

Added

  • Test suite — 20 tests covering installer, hook, and SKILL.md validation. Zero dependencies (Node.js built-in test runner).
  • Before/after examples in README — collapsible terminal output showing the framework in action vs. without it
  • Real-world examplesexamples/ directory with 3 detailed scenarios (bug fix, feature build, refactor)
  • Platform badges — Claude Code, Cursor, Codex compatibility badges in README
  • Gemini CLI to supported platforms list
  • Roadmap — "Roadmap to 1000 Stars" section in README with phased plan

[2.1.0] — 2026-04-05

Changed

  • Auto-leveling — system promotes itself from CAPTURE → ANALYZE → SUGGEST → AUTO-APPLY based on observation count and instinct confidence. No user action needed.
  • No background daemon — analysis runs inline at session start. Removed start-observer.sh, observer-loop.sh, PID management.
  • Simplified directory~/.claude/instincts/ replaces ~/.claude/mulahazah/ with flat structure
  • jq optional — observe.sh works with pure bash fallback if jq isn't installed
  • Leaner installer — no daemon files copied, no config.json, no analyze.sh. Just skill + hook + command.
  • Smaller package — removed agents/ and config.json from published files

Removed

  • Background observer daemon (deferred to v2.2 as opt-in)
  • ~/.claude/mulahazah/ directory structure (replaced by ~/.claude/instincts/)
  • rules.md (replaced by YAML instinct files with confidence scoring)
  • bin/analyze.sh (analysis now inline via SKILL.md prompt)

[2.0.1] — 2026-04-05

Fixed

  • SKILL.md rewritten to be honest — removed claims about features that didn't work (YAML instinct auto-loading, graduated confidence enforcement)
  • Law 7 now uses ~/.claude/mulahazah/rules.md — a markdown file Claude can actually read/write reliably
  • Added bin/analyze.sh — the actual analysis pipeline that was missing (calls Haiku to extract rules from observations)
  • Added commands/continuous-improvement.md — the actual /continuous-improvement command file
  • Installer now copies analyze.sh, command file, and initializes rules.md
  • observer-loop.sh rewritten to use analyze.sh instead of broken YAML instinct pipeline
  • README rewritten to match what the tool actually does

[2.0.0] — 2026-04-05

Added

  • Law 7: Learn From Every Session — Mulahazah learning system
  • PreToolUse/PostToolUse hooks for session observation
  • Background Haiku observer agent for pattern detection
  • Project-scoped observation (per-project JSONL files)
  • /continuous-improvement command
  • hooks/observe.sh — lightweight observation hook (<50ms)
  • agents/ — observer agent scripts
  • config.json — observer configuration

Changed

  • Upgraded from 5-phase framework to 7-Law system
  • The Loop: Research → Plan → Execute → Verify → Reflect → Learn → Iterate
  • Installer sets up Mulahazah hooks and directories for Claude Code

Improved

  • Law 6 (Iterate) now explicit — one change → verify → next change

[1.1.0] — 2026-04-04

Improved

  • README completely rewritten for cleaner onboarding — hook first, install in 30 seconds, first-task prompt
  • Added real example of a successful agent run (rate limiting walkthrough)
  • Added QUICKSTART.md for step-by-step first-use guide
  • Added CHANGELOG.md for version tracking
  • Removed internal references from SKILL.md (now works for any user, any project)
  • Red flags section now in README for discoverability before install

Fixed

  • Fake Claude Code marketplace install command removed
  • Internal variable references (Naim, PROJECT_REGISTRY, STATE_TEMPLATE) made universal

[1.0.0] — 2026-04-04

Added

  • Initial release
  • 5-phase framework: Research → Plan → Execute → Verify → Reflect
  • Iron Law with 3 hard constraints
  • Phase gates (explicit conditions before proceeding)
  • Red Flags list (thought patterns that indicate a skip)
  • Common Rationalizations table
  • Subagent delegation rules with 4 status handlers
  • Pre-completion self-review checklist
  • marketplace.json for plugin discoverability