A local-first workbench for authoring, evaluating, versioning, and hardening Agent Skills before they enter real agent workflows.
SkillOps Studio started as a lightweight prompt QA tool. It is now oriented around the emerging Agent Skills pattern: a portable SKILL.md manifest plus instructions, scripts, references, assets, eval cases, and regression snapshots.
Agent Skills are becoming a reusable capability layer for coding agents and workflow agents. They are more structured than prompts, lighter than model fine-tuning, and complementary to tools or MCP servers.
This app focuses on the practical middle layer:
- Author a standards-friendly Agent Skill draft.
- Generate a copyable
SKILL.md. - Track bundled scripts, references, and assets.
- Define eval cases with expected outcomes, assertions, and evidence requirements.
- Run lightweight readiness checks without calling a model API.
- Save snapshots and compare score, metadata, instruction, and resource changes.
- Skill manifest editor for
name,description,compatibility,allowed-tools, and instruction body. - Generated
SKILL.mdpreview with copy-to-clipboard. - Resource inventory for
scripts/,references/, andassets/. - Eval case editor for realistic user requests, expected outcomes, assertions, and required evidence.
- Deterministic readiness checks for routing clarity, workflow structure, validation language, eval quality, resource completeness, and risky-action review language.
- Version snapshots with average score and lightweight diff.
- Browser
localStoragepersistence.
- It does not execute an agent or call a model API.
- It does not run bundled scripts.
- It does not sandbox third-party skills.
- It does not perform trace-driven automatic skill revision.
- It does not import or export full skill folders yet.
Those are intentionally left out of the current refactor because they need deeper debugging, security review, or runtime design.
npm installnpm run devThe default Vite URL is:
http://localhost:5173
npm run lint
npm run build- React 19
- TypeScript
- Vite
- Mantine UI
- Tabler Icons
src/
App.tsx Main SkillOps Studio interface
domain/
skillEvaluation.ts SKILL.md generation and readiness checks
skillVersioning.ts Skill snapshot and diff logic
storage/
skillOpsStorage.ts localStorage persistence
types/
skillOps.ts Skill, eval, result, and version types
The previous prompt-oriented modules were removed during the SkillOps refactor.
- Add Vitest coverage for
skillEvaluation.tsandskillVersioning.ts. - Add JSON import/export for the full workspace state.
- Add Agent Skills folder export:
SKILL.md,scripts/,references/, andassets/. - Add run history and baseline comparison across eval iterations.
- Add a stronger static security linter for risky commands, network calls, dependency installation, and credential handling.
- Add optional trace-driven revision once real agent run logs are available.
SkillOps Studio should become a lightweight CI surface for Agent Skills:
- Every skill is portable.
- Every workflow has eval cases.
- Every change is comparable.
- Every regression is visible before the skill is used by an agent.