VEGA Agent

Vector Encoded General Agent — a local-first AI workspace for people who want power-user LLM workflows without becoming terminal-native developers first.

VEGA sits in the missing middle between easy-but-limited desktop AI apps and developer-only terminal/CLI agent setups. It gives non-developer power users the kind of control that technical users already build for themselves with shells, MCP servers, scripts, local daemons, and custom model routing.

VEGA Agent is an open, generalized agent harness — the workspace, context, memory, and workflow layer that wraps interchangeable models. It ships with the harness infrastructure; you supply the domain knowledge via data/agents/_default.md and let users evolve behavior through natural language.

Product thesis

Most people now use several AI apps: ChatGPT, Claude, Codex-style coding agents, local models, OpenRouter models, and specialized tools. The problem is not a lack of models. The problem is that context, memory, permissions, files, and workflows are scattered across apps.

Technical users can bridge that gap with terminals, CLI tools, MCP configs, shell scripts, local services, and custom glue code. Most power users cannot or do not want to pay that setup tax.

VEGA exists to make that terminal-level AI setup feel like a desktop app:

- Easy like a desktop app: install, open, connect accounts, and work.
- Powerful like a terminal setup: local tools, files, model routing, MCP, workflow memory, command execution, and approval gates.
- Local-first by default: the user's working state, data, and authority stay on their machine unless they explicitly opt into sync or remote features.

In one sentence:

VEGA is the AI workspace between chat apps and command lines.

Possible product language:

Power-user AI, without the terminal.

Terminal-level AI power. Desktop-app simple.

The last AI workspace you need.

The long-term business shape follows from this: the local workspace can be free and useful on its own, while paid tiers can unlock account sync, encrypted backup, remote access, managed connectors, team policy, and support.

What it is

- LLM-agnostic: ChatGPT (Codex Responses), OpenRouter, LM Studio / local models. Swap providers without touching the rest of the stack.
- Persistent identity: persona, memory, and rules live in SQLite + Markdown files, not inside the model. Sessions start with full context every time.
- Three-layer self-evolution:
  - Operator sets domain knowledge → data/agents/_default.md (immutable constitution)
  - Workflows → /skill creates slash commands, saved to data/commands/*.md
  - Behavior correction → "from now on do X" triggers rule_save → data/agents/RULES.md
- Compaction-based retrospect: every 20 turns, the agent summarizes history, updates memory, and auto-saves behavior rules it detected from user corrections.
- Tool telemetry: /audit shows per-tool call/failure rates from SQLite.
- Desktop app: Tauri v2 native window (tray, global hotkey, drag-and-drop, vision).
- Multi-access levels: local (full) / enterprise (X-VEGA-Key header) / CE (restricted).
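
The 20-turn compaction trigger described above can be pictured with a small sketch. This is not the code in pipeline/compaction.py; the `Session` class, the `add_turn` helper, and the `summarize` callback are hypothetical names used only to show the counting logic, and the real module also updates memory and saves rules.

```python
from dataclasses import dataclass, field
from typing import Callable, List

COMPACT_EVERY = 20  # interval taken from the README; everything else is illustrative


@dataclass
class Session:
    """Hypothetical in-memory session: raw turns plus a rolling summary."""
    turns: List[str] = field(default_factory=list)
    summary: str = ""


def add_turn(session: Session, text: str,
             summarize: Callable[[str, List[str]], str]) -> bool:
    """Append a turn; fold history into the summary once the interval is reached.

    Returns True when a compaction happened on this call.
    """
    session.turns.append(text)
    if len(session.turns) >= COMPACT_EVERY:
        # The callback receives the previous summary and the raw turns.
        session.summary = summarize(session.summary, session.turns)
        session.turns.clear()
        return True
    return False
```

In a real setup the `summarize` callback would be a model call; here it is left as a parameter so the trigger can be exercised without one.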

Quick start

1. Server (FastAPI)

cp .env.example .env        # fill in your API keys
pip install -r requirements.txt
python -m uvicorn web.server:app --host 0.0.0.0 --port 8100

Open http://localhost:8100 in your browser.
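
If you prefer to confirm the server is listening before opening the browser, a small standard-library check is enough. This is a sketch: `server_is_up` is a hypothetical helper, and it only assumes the root URL above answers HTTP at all, not that any particular health endpoint exists.

```python
import urllib.error
import urllib.request


def server_is_up(base_url: str = "http://localhost:8100", timeout: float = 2.0) -> bool:
    """Return True if something answers HTTP at base_url without a 5xx error."""
    try:
        with urllib.request.urlopen(base_url, timeout=timeout) as resp:
            return resp.status < 500
    except urllib.error.HTTPError as exc:
        # The server answered, but with an error status (4xx still means "up").
        return exc.code < 500
    except (urllib.error.URLError, OSError):
        # Connection refused, DNS failure, timeout, and similar.
        return False


if __name__ == "__main__":
    print("VEGA reachable:", server_is_up())
```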

2. Desktop app (optional)

cd desktop
cargo tauri dev

Code execution (python_exec / bash_exec / host_exec) runs host-first, out of the box — no Docker or extra setup required.

3. Web search (optional — required for `web_search`)

Run a SearXNG instance and point VEGA_SEARXNG_URL at it. For team deployments, one shared instance is enough for all users.

# Quick local instance
docker run -d -p 18888:8080 searxng/searxng
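
With the port mapping above, the local instance would be reachable at http://localhost:18888, so that is the value to try for VEGA_SEARXNG_URL. The sketch below builds a query URL against SearXNG's `/search` endpoint so you can test the instance by hand. `searxng_query_url` is a hypothetical helper, not part of VEGA, and JSON output may need to be enabled in your SearXNG settings before `format=json` is accepted.

```python
import os
from typing import Optional
from urllib.parse import urlencode


def searxng_query_url(query: str, base_url: Optional[str] = None) -> str:
    """Build a SearXNG search URL from VEGA_SEARXNG_URL (or an explicit base)."""
    # Fallback matches the docker port mapping shown above (an assumption).
    base = base_url or os.environ.get("VEGA_SEARXNG_URL", "http://localhost:18888")
    params = urlencode({"q": query, "format": "json"})
    return f"{base.rstrip('/')}/search?{params}"


if __name__ == "__main__":
    print(searxng_query_url("tauri v2 global hotkey"))
```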

Key features

| Feature | Command / API |
|---|---|
| Multi-model chat + tool use | `POST /api/chat/stream` |
| Persistent memory | `memory_persona_update`, `memory_event_add` tools |
| Behavior rules | Say "from now on ~" → auto `rule_save` to `RULES.md` |
| View saved rules | `/rules` |
| Tool telemetry | `/audit` |
| Research mode | `/research [topic]`: web-first, cite sources |
| Plan mode | `/plan`: no writes, planning only |
| YOLO mode | `/yolo`: auto-approve `host_exec` (hard blocks still apply) |
| Custom slash commands | `/skill` wizard |
| Dashboard widgets | `/widget` wizard |
| MCP server integration | Edit `data/mcp.json` (Claude Desktop format) |
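
`POST /api/chat/stream` answers over Server-Sent Events, so a client needs to split the response into events. The sketch below handles only the generic SSE framing (`data:` lines grouped by blank lines); `iter_sse_data` is a hypothetical helper, and the shape of each payload is not documented here, so it is treated as an opaque string.

```python
from typing import Iterable, Iterator, List


def iter_sse_data(lines: Iterable[str]) -> Iterator[str]:
    """Yield the data payload of each complete SSE event from decoded text lines."""
    buffer: List[str] = []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line.startswith(":"):
            continue  # SSE comment, often used as a keep-alive
        if line.startswith("data:"):
            value = line[len("data:"):]
            # One optional space after the colon is part of the framing.
            buffer.append(value[1:] if value.startswith(" ") else value)
        elif line == "" and buffer:
            # A blank line closes the event; multi-line data joins with "\n".
            yield "\n".join(buffer)
            buffer = []
    # An event left unterminated at end of stream is dropped, as the SSE
    # format requires a closing blank line.
```

Feed it the decoded lines of the HTTP response body; each yielded string is one event's data, ready for whatever decoding the server's payload format calls for.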

Configuration

| File | Purpose |
|---|---|
| `.env` | API keys, endpoints. Never commit. |
| `data/agents/_default.md` | Operator constitution: domain knowledge, tool rules, persona. Edit this to customize for your team. |
| `data/agents/RULES.md` | User rules: auto-updated via `rule_save`, or edit directly. |
| `data/agents/{provider}.md` | Per-provider overrides (chatgpt / openrouter / lmstudio) |
| `data/commands/*.md` | Custom slash commands (created via `/skill`) |
| `data/mcp.json` | MCP server config (Claude Desktop format) |
| `data/widgets.json` | Agent View dashboard widgets |
| `data/user_profile.json` | User display name, email accounts |
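
Since `data/mcp.json` follows the Claude Desktop format, its top level is an `mcpServers` object keyed by server name, each entry carrying a `command` and optional `args`. The sketch below builds one such config and prints it. `build_mcp_config` is a hypothetical helper, the filesystem server and its path are placeholders, and whether VEGA reads any fields beyond this shape is not covered here.

```python
import json


def build_mcp_config(servers: dict) -> dict:
    """Wrap server specs in the Claude Desktop layout after a basic shape check."""
    for name, spec in servers.items():
        if not isinstance(spec.get("command"), str):
            raise ValueError(f"MCP server {name!r} needs a 'command' string")
        if not isinstance(spec.get("args", []), list):
            raise ValueError(f"MCP server {name!r}: 'args' must be a list")
    return {"mcpServers": servers}


config = build_mcp_config({
    "filesystem": {
        "command": "npx",
        # Placeholder path: point this at the directory the server may read.
        "args": ["-y", "@modelcontextprotocol/server-filesystem", "/path/to/allowed/dir"],
    }
})

if __name__ == "__main__":
    print(json.dumps(config, indent=2))  # paste the output into data/mcp.json
```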

`_default.md` — the most important file

This is your agent's constitution. Before deploying to a team:

- Replace the generic response rules with your domain conventions.
- Add tool usage rules specific to your stack.
- Add a memory update section matching your entity types.
- Leave RULES.md empty; users will evolve it through conversation.
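
To see why an empty RULES.md is harmless, it helps to picture how the Markdown layers might combine into one system prompt. The sketch below is an assumption, not the harness's actual loader: `build_system_prompt` is a hypothetical name, and the order (constitution, then provider override, then user rules) is a guess at a sensible layering.

```python
from pathlib import Path


def build_system_prompt(agents_dir: Path, provider: str) -> str:
    """Concatenate the non-empty agent files in a fixed (assumed) order."""
    parts = []
    for name in ("_default.md", f"{provider}.md", "RULES.md"):
        path = agents_dir / name
        if path.is_file():
            text = path.read_text(encoding="utf-8").strip()
            if text:  # empty or whitespace-only files contribute nothing
                parts.append(text)
    return "\n\n".join(parts)


if __name__ == "__main__":
    print(build_system_prompt(Path("data/agents"), "lmstudio"))
```

Under this layering a fresh deployment behaves exactly as `_default.md` dictates, and user rules only start to matter once `rule_save` has written something.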

Architecture

web/server.py             FastAPI — REST + SSE streaming, access control, mode flags
pipeline/streaming.py     GPT tool-use SSE loop (plan/research/CE modes + RULES synthesis)
pipeline/llm_gateway.py   Multi-provider router (ChatGPT / OpenRouter / LM Studio)
pipeline/tools*.py        Tool layer (Google / Code / Office / Memory / RULES)
pipeline/compaction.py    20-turn compaction — summary + memory update + rule retrospect
pipeline/tool_telemetry.py  Per-tool call stats (SQLite)
pipeline/self_improve.py  Tool failure → GPT patch → host verify → user approval
pipeline/session_store.py Session + message persistence (SQLite)
desktop/                  Tauri v2 desktop app
data/agents/_default.md   Operator constitution (immutable)
data/agents/RULES.md      User behavior rules (mutable via rule_save)

See ARCHITECTURE.md for the full module reference.
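
As a rough picture of what per-tool telemetry in SQLite involves, the sketch below records one row per tool call and aggregates call and failure counts. The table name, columns, and function names are all hypothetical; the real schema lives in pipeline/tool_telemetry.py and may differ.

```python
import sqlite3


def open_telemetry(path: str = ":memory:") -> sqlite3.Connection:
    """Open (or create) a telemetry database with a single calls table."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS tool_calls ("
        "tool TEXT NOT NULL, ok INTEGER NOT NULL, "
        "ts TEXT DEFAULT CURRENT_TIMESTAMP)"
    )
    return conn


def record_call(conn: sqlite3.Connection, tool: str, ok: bool) -> None:
    """Store one tool invocation and whether it succeeded."""
    conn.execute("INSERT INTO tool_calls (tool, ok) VALUES (?, ?)", (tool, int(ok)))
    conn.commit()


def audit(conn: sqlite3.Connection) -> list:
    """Return (tool, calls, failures, failure_rate) per tool, sorted by name."""
    rows = conn.execute(
        "SELECT tool, COUNT(*), SUM(1 - ok) FROM tool_calls "
        "GROUP BY tool ORDER BY tool"
    ).fetchall()
    return [(tool, calls, failures, failures / calls) for tool, calls, failures in rows]
```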

Deployment patterns

Single user (default)

Run the server locally, connect from localhost:8100 or via Tailscale.

Team deployment

- Host the server on an internal machine.
- Set VEGA_SEARXNG_URL to a shared SearXNG instance.
- Set enterprise keys via POST /api/admin/keys (loopback only).
- Distribute the desktop app (DMG) to team members; they connect to the server URL.
- Customize data/agents/_default.md with your team's domain knowledge.
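
For a scripted client on the team network, the enterprise key travels in the X-VEGA-Key header. The sketch below only constructs the request. `build_chat_request` is a hypothetical helper, the host name is a placeholder, and the JSON body shape (`{"message": ...}`) is an assumption, since the request schema of `/api/chat/stream` is not documented in this README.

```python
import json
import urllib.request


def build_chat_request(server_url: str, api_key: str, message: str) -> urllib.request.Request:
    """Build (but do not send) a POST to the chat stream with the enterprise key."""
    body = json.dumps({"message": message}).encode("utf-8")  # assumed body shape
    return urllib.request.Request(
        f"{server_url.rstrip('/')}/api/chat/stream",
        data=body,
        method="POST",
        headers={
            "Content-Type": "application/json",
            "Accept": "text/event-stream",
            "X-VEGA-Key": api_key,  # header name taken from the access-level list
        },
    )


if __name__ == "__main__":
    req = build_chat_request("http://vega.internal:8100", "YOUR_KEY", "hello")
    print(req.get_method(), req.full_url)
```

Passing the returned object to `urllib.request.urlopen` would send it; keep the key out of source control, the same way `.env` is handled.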

Multi-user isolation

Each user gets their own VEGA_DATA_DIR. See scripts/init_user_db.py.
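
One way to read this is one server process per user, each started with its own VEGA_DATA_DIR. The sketch below prepares the environment and command for such a process without launching it. The one-process-per-user layout, the helper names, and the port choice are assumptions; scripts/init_user_db.py is not invoked because its arguments are not documented here.

```python
import os
from pathlib import Path
from typing import Dict, List, Optional


def user_env(root: Path, username: str,
             base_env: Optional[Dict[str, str]] = None) -> Dict[str, str]:
    """Copy the environment and point VEGA_DATA_DIR at a per-user directory."""
    env = dict(os.environ if base_env is None else base_env)
    env["VEGA_DATA_DIR"] = str(root / username)
    return env


def server_command(port: int) -> List[str]:
    """The uvicorn invocation from Quick start, with a per-user port (assumed)."""
    return ["python", "-m", "uvicorn", "web.server:app",
            "--host", "0.0.0.0", "--port", str(port)]


if __name__ == "__main__":
    env = user_env(Path("/srv/vega"), "alice")
    print(env["VEGA_DATA_DIR"], " ".join(server_command(8101)))
    # To actually start it: subprocess.Popen(server_command(8101), env=env)
```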

Channel bots (Telegram / Slack)

Drive VEGA from familiar messengers so the in-house AI experience matches a normal chat app. Both bots share one agent core (pipeline/channels/core.py → run_agent_turn) and stream the answer progressively (edit-in-place).

# Telegram — token from @BotFather
export TELEGRAM_BOT_TOKEN=...
python -m pipeline.channels.telegram_bot

# Slack — Socket Mode (xoxb bot token + xapp app-level token)
export SLACK_BOT_TOKEN=xoxb-... SLACK_APP_TOKEN=xapp-...
python -m pipeline.channels.slack_bot

- DMs are always handled; group and channel messages are handled only when the bot is @-mentioned.
- Each conversation (Telegram chat_id, Slack thread_ts) maps to its own VEGA session, so history persists across turns (data/channel_sessions.json).
- Remote channels run in CE mode (restricted tool allowlist: no local file/exec).
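
The routing rules above amount to a small filter plus a persistent lookup table. The sketch below shows one way to express them. `should_handle` and `session_for` are hypothetical names, and the JSON layout (a flat map from `channel:conversation_id` to a session id) is an assumption about data/channel_sessions.json rather than its actual schema.

```python
import json
import uuid
from pathlib import Path


def should_handle(is_dm: bool, mentioned: bool) -> bool:
    """DMs always pass; group or channel messages need an @-mention."""
    return is_dm or mentioned


def session_for(store: Path, channel: str, conversation_id: str) -> str:
    """Return the session id for a conversation, creating and saving one if new."""
    mapping = json.loads(store.read_text(encoding="utf-8")) if store.exists() else {}
    key = f"{channel}:{conversation_id}"  # e.g. "telegram:<chat_id>" or "slack:<thread_ts>"
    if key not in mapping:
        mapping[key] = uuid.uuid4().hex
        store.write_text(json.dumps(mapping, indent=2), encoding="utf-8")
    return mapping[key]
```

Because the map is written to disk on first contact, a bot restart picks up the same session for an existing chat or thread.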

License

MIT — see LICENSE.
