Kaya -- Personal AI Infrastructure

A production-grade AI agent framework with 60+ composable skills, autonomous task execution, voice interaction, and persistent memory. Built on Anthropic's Claude Code as the foundation for a fully autonomous personal AI assistant.

Why I Built This

After working extensively with AI assistants, I noticed a fundamental gap: every session starts from zero. There is no continuity, no memory of preferences, no ability to proactively take action. I wanted an AI system that:

Remembers everything -- past decisions, preferences, learnings across sessions
Acts autonomously -- executes multi-step workflows without constant supervision
Composes capabilities -- chains specialized skills together for complex tasks
Speaks and listens -- bidirectional voice interaction, not just text

Kaya is the result: a skill-based architecture where each capability is a self-contained module that Claude Code can discover, load, and execute. The system handles everything from calendar management and grocery shopping to security reconnaissance and multi-agent debates.

Architecture

kaya/
  skills/             # 60+ composable skill modules
    Agents/           # Multi-agent orchestration and composition
    AutonomousWork/   # Parallel task execution engine
    CalendarAssistant/# Google Calendar automation
    VoiceInteraction/ # Bidirectional voice (desktop + mobile)
    Browser/          # Playwright-based browser automation
    ...               # 55+ more skills
  agents/             # Agent personality definitions and traits
  bin/                # CLI tools and cron scripts
  hooks/              # Git hooks and lifecycle automation
  lib/                # Shared libraries (cron, daemon, messaging)
  VoiceServer/        # ElevenLabs-powered TTS server
  MEMORY/             # Persistent state, learnings, and context
  Observability/      # System monitoring and health checks
  KAYASECURITYSYSTEM/  # Security protocols and threat models

Key Capabilities

Autonomous Task Execution

The AutonomousWork skill orchestrates parallel agent execution -- multiple Claude instances working on independent tasks simultaneously with branch-isolated git operations.

Skill Composition

Skills are composable modules with standardized interfaces. Each skill exposes:

A SKILL.md manifest with triggers, workflows, and integration points
Optional TypeScript tooling in Tools/ directories
Workflow definitions in Workflows/ directories
Context files that load domain knowledge on demand

Voice Interaction

Bidirectional voice system supporting desktop (local mic/speaker) and mobile (Telegram) channels, powered by ElevenLabs TTS with configurable voice personalities per agent.

Persistent Memory

The MEMORY/ subsystem provides:

Learning signals -- Pattern recognition across sessions with sentiment tracking
State management -- Persistent JSON state for skills, work queues, and cron jobs
Validation logs -- Configuration and work integrity checks
Voice event history -- Timestamped voice interaction logs

Multi-Agent System

The Agents/ skill enables dynamic agent composition with:

Specialized agent roles (Engineer, Designer, Researcher)
Personality trait mapping and voice assignment
Parallel orchestration with branch isolation
Council-style multi-agent debates

Skill Catalog

Category	Skills	Description
Core	System, lib/core	System kernel and maintenance
Agents	Agents, AgentMonitor, Council, Simulation	Multi-agent orchestration and evaluation
Productivity	CalendarAssistant, Gmail, Kaya, DailyBriefing	Personal assistant capabilities
Development	AgentProjectSetup, CreateCLI, CreateSkill, Browser	Engineering and automation tools
Research	OSINT, Recon, FirstPrinciples, RedTeam	Intelligence gathering and analysis
Content	ContentAggregator, Fabric, Obsidian, KnowledgeGraph	Knowledge management and synthesis
Commerce	Shopping, Instacart, Cooking	Consumer automation
Communication	Telegram, VoiceInteraction, CommunityOutreach	Messaging and outreach
Security	WebAssessment, PromptInjection, KAYASECURITYSYSTEM	Security testing and protocols
Meta	SkillAudit, SpecSheet, Evals, KayaUpgrade	Self-improvement and quality

Tech Stack

Runtime: Bun (TypeScript/JavaScript)
AI Foundation: Claude Code (Anthropic)
Voice: ElevenLabs TTS with WebSocket streaming
Browser Automation: Playwright CLI (Browse.ts)
Messaging: Telegram Bot API
Calendar: Google Calendar CLI
State: JSON-based persistent state with validation
Scheduling: macOS launchd for cron-style automation

Quick Start

# Clone and install
git clone https://github.com/[user]/kaya.git ~/.claude
cd ~/.claude
bun run install.ts

# Start the voice server
cd VoiceServer && ./start.sh

# Launch Claude Code with Kaya loaded
claude

See INSTALL.md for detailed setup instructions.

How Skills Work

Each skill follows a standardized structure:

skills/ExampleSkill/
  SKILL.md            # Manifest: triggers, workflows, integration
  _Context.md         # Domain knowledge loaded on demand
  Tools/              # TypeScript utilities
  Workflows/          # Step-by-step workflow definitions

Skills are discovered and loaded dynamically by the CORE router based on keyword matching in user requests. The router reads each skill's USE WHEN trigger clause to determine relevance.

Development

# Run the installer wizard
bun run install.ts

# Validate system integrity
# (within a Claude Code session)
/system integrity check

# Audit skill quality
/skill-audit

Documentation

Installation Guide -- Prerequisites, setup, and configuration
Architecture -- System design and data flow
ADR-001: Skill-based Architecture
ADR-002: Memory Persistence
Voice Server -- TTS server setup and usage

License

MIT

Related Projects

ai-assistant — Autonomous AI assistant powered by Claude Code
mcp-toolkit-server — MCP server toolkit for Claude AI integration
context-engineering-toolkit — Context window optimization tools

Name		Name	Last commit message	Last commit date
Latest commit History 60 Commits
.checkpoints		.checkpoints
.github		.github
.playwright-mcp		.playwright-mcp
Commands		Commands
KAYASECURITYSYSTEM		KAYASECURITYSYSTEM
MEMORY		MEMORY
SYSTEM		SYSTEM
VoiceServer		VoiceServer
agents		agents
bin		bin
context		context
docs		docs
hooks		hooks
lib		lib
scripts		scripts
sessions		sessions
skills		skills
tests		tests
tools		tools
.current-session		.current-session
.env.example		.env.example
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
CONTEXT-ROUTING.md		CONTEXT-ROUTING.md
Customize		Customize
INSTALL.md		INSTALL.md
README.md		README.md
SECURITY.md		SECURITY.md
bun.lock		bun.lock
install.ts		install.ts
mcp-needs-auth-cache.json		mcp-needs-auth-cache.json
package.json		package.json
ralph-loop.local.md		ralph-loop.local.md
secrets.example.json		secrets.example.json
set		set
settings.example.json		settings.example.json
settings.json		settings.json
stats-cache.json		stats-cache.json
statusline-command.sh		statusline-command.sh
statusline.sh		statusline.sh
tsconfig.ci.json		tsconfig.ci.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Kaya -- Personal AI Infrastructure

Why I Built This

Architecture

Key Capabilities

Autonomous Task Execution

Skill Composition

Voice Interaction

Persistent Memory

Multi-Agent System

Skill Catalog

Tech Stack

Quick Start

How Skills Work

Development

Documentation

License

Related Projects

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Kaya -- Personal AI Infrastructure

Why I Built This

Architecture

Key Capabilities

Autonomous Task Execution

Skill Composition

Voice Interaction

Persistent Memory

Multi-Agent System

Skill Catalog

Tech Stack

Quick Start

How Skills Work

Development

Documentation

License

Related Projects

About

Topics

Resources

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages