Oh My Agent

Multi-platform bot that routes messages to CLI-based AI agents (Claude, Gemini, Codex) with persistent memory, autonomous task execution, and scheduled automations.

Inspired by OpenClaw.

Features

Multi-Agent Fallback — Claude, Gemini, and Codex run as CLI subprocesses; if one fails, the next in line takes over automatically
Persistent Memory — SQLite conversation history with FTS5 search, plus an event-driven Judge that maintains a single-tier memories.yaml and injects scoped context across sessions
Autonomous Runtime — durable task state machine with merge gates, worktree isolation, HITL prompts, and Discord approval buttons
Skill System — bidirectional skill sync across agent directories, skill evaluation with auto-disable, and agent-driven skill creation
Scheduled Automations — cron / interval jobs defined as YAML files with hot-reload, per-job auto_approve, and /automation_run manual trigger
Workspace Isolation — three-layer sandbox: workspace cwd confinement, env-var whitelisting, and CLI-native sandboxing
Intent Router — optional LLM-based classification routes messages to reply, skill invocation, task proposal, or skill creation
Image Support — Discord attachment download, per-agent image handling, and temp file lifecycle management
Platform Adapters — Discord (full-featured with slash commands); extensible via BaseChannel ABC
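
The first two layers of the workspace isolation listed above (cwd confinement and env-var whitelisting) could be sketched roughly as follows. This is an illustration only: `ALLOWED_ENV`, `build_env`, and `run_in_workspace` are hypothetical names, and the allowlist contents are an assumption rather than the project's actual list.

```python
import os
import subprocess
from pathlib import Path

# ASSUMPTION: a hypothetical allowlist; the real one lives in the project's config.
ALLOWED_ENV = frozenset({"PATH", "HOME", "LANG"})


def build_env(parent_env, allowed=ALLOWED_ENV):
    """Return a copy of parent_env that keeps only allowlisted variables."""
    return {key: value for key, value in parent_env.items() if key in allowed}


def run_in_workspace(cmd, workspace, parent_env=None):
    """Run cmd with cwd pinned to the workspace and a filtered environment."""
    source = dict(os.environ) if parent_env is None else parent_env
    return subprocess.run(
        cmd,
        cwd=Path(workspace),    # layer 1: the child starts inside the workspace
        env=build_env(source),  # layer 2: secrets such as bot tokens are dropped
        capture_output=True,
        text=True,
        check=False,
    )
```

With this shape, a variable like DISCORD_BOT_TOKEN never reaches the agent subprocess unless it is explicitly allowlisted. The third layer (CLI-native sandboxing) depends on each agent's own flags and is not shown.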

Architecture

User (Discord)
         │ message, @agent prefix, or /ask command
         ▼
   GatewayManager
         │ routes to ChannelSession (per channel, isolated)
         ▼
   AgentRegistry ── [claude, gemini, codex]
         │ fallback order, or force specific agent
         ▼
   BaseCLIAgent.run(prompt, history)
     ├── ClaudeAgent      (session resume via --resume)
     ├── GeminiCLIAgent   (--yolo mode)
     └── CodexCLIAgent    (--full-auto, JSONL output)
         │
         ▼   cwd = workspace/ (sandbox-isolated)
   Response → Markdown-aware chunk → thread.send()
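
The fallback step in the diagram above can be pictured with a minimal sketch like the one below. It is not the project's `AgentRegistry` or `BaseCLIAgent`: the `CLIAgent` class, the `run_with_fallback` function, and the way the prompt is passed as the last argument are all assumptions made for illustration. Real CLI flags differ per agent and are deliberately left out.

```python
import subprocess
from dataclasses import dataclass


@dataclass
class CLIAgent:
    """Hypothetical wrapper: a name plus the argv prefix used to launch a CLI."""
    name: str
    command: list[str]

    def run(self, prompt: str, timeout: float = 300.0) -> str:
        # ASSUMPTION: the prompt is passed as the final argument.
        result = subprocess.run(
            [*self.command, prompt],
            capture_output=True,
            text=True,
            timeout=timeout,
            check=True,  # a non-zero exit raises CalledProcessError
        )
        return result.stdout


def run_with_fallback(agents, prompt, force=None):
    """Try agents in order and return (agent_name, output) from the first success.

    If force is given, only the agent with that name is tried.
    """
    candidates = [a for a in agents if a.name == force] if force else list(agents)
    errors = []
    for agent in candidates:
        try:
            return agent.name, agent.run(prompt)
        except (subprocess.SubprocessError, OSError) as exc:
            # Covers non-zero exits, timeouts, and a missing binary.
            errors.append(f"{agent.name}: {exc}")
    raise RuntimeError("all agents failed: " + "; ".join(errors))
```

Passing `force="gemini"` mirrors the "force specific agent" branch: the list is narrowed to one candidate, so no fallback happens for that turn.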

Nine subsystems: Gateway (platform adapters, slash commands, message routing), Agents (CLI subprocess wrappers with fallback), Memory (SQLite history plus an event-driven Judge that writes memories.yaml), Skills (bidirectional sync, evaluation, creation), Runtime (autonomous task orchestration), Router (LLM intent classification), Automation (cron/interval scheduler), Auth (per-provider QR flows and cached credential reuse), and Push notifications (off-platform delivery via Bark and similar services). A scripted offline E2E harness under tests/harness/ exercises the full stack through BaseChannel-only contracts, so cross-platform regressions are caught without a human in the loop.

→ Full architecture walkthrough: EN · 中文

Quick Start

Prerequisites

Python 3.11+
At least one CLI agent installed: claude, gemini, or codex
A Discord bot token with Message Content Intent enabled

Install

git clone https://github.com/TCoherence/oh-my-agent.git
cd oh-my-agent
python3 -m venv .venv && source .venv/bin/activate
pip install -e .

Configure

cp .env.example .env          # put secrets here (DISCORD_BOT_TOKEN, etc.)
cp config.yaml.example config.yaml   # adjust channels, agents, features

Key config sections: gateway (platform + agents), memory, skills, runtime, automations, workspace, router. Secrets use ${ENV_VAR} substitution from .env.
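
To illustrate the ${ENV_VAR} substitution mentioned above, here is a minimal sketch that walks an already-parsed config structure and expands placeholders. The function name `substitute_env` is hypothetical, and raising an error on a missing variable is an assumption; the project's loader may behave differently.

```python
import os
import re

# Matches ${NAME} where NAME is a conventional environment variable name.
_PLACEHOLDER = re.compile(r"\$\{([A-Za-z_][A-Za-z0-9_]*)\}")


def substitute_env(value, env=None):
    """Recursively expand ${NAME} placeholders in strings, dicts, and lists."""
    env = os.environ if env is None else env
    if isinstance(value, dict):
        return {key: substitute_env(item, env) for key, item in value.items()}
    if isinstance(value, list):
        return [substitute_env(item, env) for item in value]
    if isinstance(value, str):
        def replace(match):
            name = match.group(1)
            if name not in env:
                # ASSUMPTION: fail loudly instead of leaving the placeholder in place.
                raise KeyError(f"missing environment variable: {name}")
            return env[name]
        return _PLACEHOLDER.sub(replace, value)
    return value  # numbers, booleans, and None pass through unchanged
```

For example, `substitute_env({"token": "${DISCORD_BOT_TOKEN}"}, {"DISCORD_BOT_TOKEN": "abc"})` yields `{"token": "abc"}`.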

→ Full config reference: config.yaml.example

Run

oh-my-agent           # start the bot
oh-my-agent --version # check installed version

Docker

./scripts/docker-build.sh                # build image
./scripts/docker-run.sh                  # dev/foreground mode
./scripts/docker-start.sh                # long-running/detached mode
./scripts/docker-logs.sh                 # inspect logs

The Docker image preinstalls claude, gemini, and codex CLIs. The host repo is mounted at /repo and runtime state at /home.

→ Full Docker & deployment guide: EN · 中文

Usage

Messages

Post in the configured channel → auto-creates a thread with an AI reply
Reply inside the thread to continue the conversation with full context
Prefix with @claude, @gemini, or @codex to force a specific agent for that turn
Attach images (≤10 MB) for visual analysis
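
The @agent prefix rule above can be sketched as a small parser. The name `split_agent_prefix` is hypothetical, and treating the prefix as case-insensitive is an assumption not stated in this README.

```python
import re

KNOWN_AGENTS = ("claude", "gemini", "codex")

# An optional leading "@name" followed by the rest of the message.
_PREFIX = re.compile(r"^\s*@(\w+)\b\s*(.*)$", re.DOTALL)


def split_agent_prefix(text):
    """Return (agent_name_or_None, remaining_text) for one incoming message."""
    match = _PREFIX.match(text)
    if match and match.group(1).lower() in KNOWN_AGENTS:
        return match.group(1).lower(), match.group(2)
    # No prefix, or an @mention that is not a known agent: leave the text as-is.
    return None, text
```

Only a prefix at the start of the message counts, so a mention in the middle of a sentence does not force an agent.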

Slash Commands

Category	Commands
Conversation	`/ask`, `/reset`, `/history`, `/agent`, `/search`
Runtime Tasks	`/task_start`, `/task_status`, `/task_list`, `/task_approve`, `/task_reject`, `/task_suggest`, `/task_resume`, `/task_stop`, `/task_merge`, `/task_discard`, `/task_replace`, `/task_changes`, `/task_logs`, `/task_cleanup`
Skills	`/reload-skills`, `/skill_stats`, `/skill_enable`
Automations	`/automation_status`, `/automation_reload`, `/automation_enable`, `/automation_disable`, `/automation_run`
Memory	`/memories`, `/forget`, `/memorize`
Auth	`/auth_login`, `/auth_status`, `/auth_clear`
Operator	`/doctor`

Automations

Automation jobs are defined as YAML files in ~/.oh-my-agent/automations/. The scheduler hot-reloads on file changes — no restart needed.

name: daily-ai-briefing
enabled: true
platform: discord
channel_id: "${DISCORD_CHANNEL_ID}"
prompt: "Run the market-briefing-ai skill for today's AI digest."
agent: claude
skill_name: market-briefing-ai
cron: "0 9 * * *"
auto_approve: true

→ Full automation reference: automation.yaml.example
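
This README does not say how the scheduler detects file changes. One simple way to get the hot-reload behavior described above is to poll modification times and diff two snapshots, as in the sketch below. Both function names are hypothetical, and polling itself is an assumption; the project may use a different mechanism.

```python
from pathlib import Path


def snapshot(directory):
    """Map each YAML file name in directory to its mtime in nanoseconds."""
    return {path.name: path.stat().st_mtime_ns for path in Path(directory).glob("*.yaml")}


def diff_snapshots(old, new):
    """Return (added, removed, changed) file names between two snapshots."""
    added = sorted(new.keys() - old.keys())
    removed = sorted(old.keys() - new.keys())
    changed = sorted(name for name in new.keys() & old.keys() if new[name] != old[name])
    return added, removed, changed
```

A scheduler loop would take a snapshot every few seconds, then register added jobs, drop removed ones, and re-parse changed ones.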

Autonomous Runtime

Long-running tasks are orchestrated through a durable state machine:

DRAFT → RUNNING → VALIDATING → WAITING_MERGE → MERGED / COMPLETED
                              ↕ PAUSED          ↕ FAILED / STOPPED

Task types: artifact (no merge gate), repo_change (merge required), skill_change (validate + merge)
Isolation: each task runs in its own git worktree under ~/.oh-my-agent/runtime/tasks/
HITL: tasks can pause for owner approval, QR auth, or custom single-choice questions
Controls: Discord buttons for approval + slash command fallback + natural language stop/pause/resume
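
The state diagram and task types above can be modelled as a transition table. The sketch below is one possible reading, not the project's implementation: the diagram does not spell out which states may enter PAUSED, FAILED, or STOPPED, so the edges in `TRANSITIONS` are assumptions, and `can_transition` is a hypothetical helper.

```python
from enum import Enum


class TaskState(Enum):
    DRAFT = "DRAFT"
    RUNNING = "RUNNING"
    PAUSED = "PAUSED"
    VALIDATING = "VALIDATING"
    WAITING_MERGE = "WAITING_MERGE"
    MERGED = "MERGED"
    COMPLETED = "COMPLETED"
    FAILED = "FAILED"
    STOPPED = "STOPPED"


S = TaskState

# ASSUMPTION: these edges are one plausible interpretation of the diagram.
TRANSITIONS = {
    S.DRAFT: {S.RUNNING, S.STOPPED},
    S.RUNNING: {S.VALIDATING, S.PAUSED, S.FAILED, S.STOPPED},
    S.PAUSED: {S.RUNNING, S.STOPPED},
    S.VALIDATING: {S.WAITING_MERGE, S.COMPLETED, S.FAILED},
    S.WAITING_MERGE: {S.MERGED, S.STOPPED},
}

# Task types that must pass through the merge gate (from the list above).
MERGE_GATED = {"repo_change", "skill_change"}


def can_transition(task_type, src, dst):
    """Check an edge against the table, then apply the merge-gate rule."""
    if dst not in TRANSITIONS.get(src, set()):
        return False
    if src is S.VALIDATING and dst is S.WAITING_MERGE:
        return task_type in MERGE_GATED      # only gated types wait for a merge
    if src is S.VALIDATING and dst is S.COMPLETED:
        return task_type not in MERGE_GATED  # artifact tasks skip the gate
    return True
```

Keeping the table as data makes it straightforward to persist the current state and validate every change before it is written, which is what makes a state machine durable across restarts.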

Built-in Skills

Skill	Description
`market-briefing-ai`	Chinese-language AI daily briefing (frontier labs / paper layer / people pool / 5-layer / cross-layer). Reads paper-digest's daily JSON and persists a 4-section checkpoint. Reports are stored under `~/.oh-my-agent/reports/market-briefing/`.
`market-briefing-finance`	Chinese-language finance daily briefing (China macro / US macro / volatility / China-HK / property / tracked holdings) with podcast prefetch. Reports are stored under `~/.oh-my-agent/reports/market-briefing/`.
`market-briefing-politics`	Chinese-language politics daily briefing (China central / US federal / China-US geopolitics). Reports are stored under `~/.oh-my-agent/reports/market-briefing/`.
`market-briefing-weekly`	Chinese-language cross-domain weekly synthesis that reads the past 7 days of stored daily reports. Reports are stored under `~/.oh-my-agent/reports/market-briefing/weekly/`.
`paper-digest`	Daily arXiv + HuggingFace + Semantic Scholar paper radar (Chinese)
`youtube-video-summary`	Single-link YouTube video summary via transcript-first extraction
`youtube-podcast-digest`	Weekly digest of subscribed YouTube podcast channels (VC / AI / markets)
`bilibili-video-summary`	Single-link Bilibili video summary with cookie-based auth fallback
`deals-scanner`	Chinese-first deal scans across US credit cards, Rakuten, Slickdeals, Dealmoon
`seattle-metro-housing-watch`	Seattle metro housing snapshots and deep-dives with stored report history
`scheduler`	Create, update, and validate recurring automation YAML jobs
`skill-creator`	Guide and scaffolding for creating new skills
`adapt-community-skill`	Rewrite an imported / community skill to fit this workspace's conventions

Skills live in skills/<name>/SKILL.md. The SkillSync system distributes them to all CLI agent directories automatically.

→ Adding a new skill: create skills/<name>/SKILL.md (+ optional scripts/); it will be picked up on next startup or /reload-skills.
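
The skills/<name>/SKILL.md layout described above lends itself to a simple directory scan. The sketch below shows one way discovery could work; `discover_skills` and the shape of its return value are hypothetical and do not describe the actual SkillSync code.

```python
from pathlib import Path


def discover_skills(root):
    """Return {skill_name: info} for every direct subdirectory with a SKILL.md."""
    skills = {}
    for skill_file in sorted(Path(root).glob("*/SKILL.md")):
        skill_dir = skill_file.parent
        skills[skill_dir.name] = {
            "skill_file": skill_file,
            # The scripts/ directory is optional, as noted above.
            "has_scripts": (skill_dir / "scripts").is_dir(),
        }
    return skills
```

Directories without a SKILL.md are ignored, so a half-created skill folder does not get picked up by accident.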

Documentation

Document	EN	中文
Architecture	architecture.md	architecture.md
Operator Guide	operator-guide.md	operator-guide.md
Dev Environment	dev-environment.md	dev-environment.md
Config Reference	config-reference.md	config-reference.md
Troubleshooting	troubleshooting.md	troubleshooting.md
Monitoring	monitoring.md	monitoring.md
Upgrade Guide	upgrade-guide.md	upgrade-guide.md
Development Log	development.md	development.md
Roadmap	todo.md	todo.md
v1.0 Plan	v1.0-plan.md	v1.0-plan.md
Changelog	CHANGELOG.md	—

Versioning

The package version is sourced from src/oh_my_agent/_version.py. CHANGELOG.md tracks released and unreleased changes.

License

MIT. See LICENSE.

