diff --git a/.gitignore b/.gitignore
index 9f86bc3f..b4c8f6f3 100644
--- a/.gitignore
+++ b/.gitignore
@@ -85,3 +85,4 @@ yarn-error.log*
 /docs/marketing/
 /docs/tasks/
 /.gradle-user-home-fix/
+/.intellijPlatform/
diff --git a/CHANGELOG.md b/CHANGELOG.md
index 7af2d82b..5287401b 100644
--- a/CHANGELOG.md
+++ b/CHANGELOG.md
@@ -9,6 +9,70 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
 
 ## [Unreleased]
 
+### Added
+
+- **`sleep` tool** — Pause agent execution for up to 30 seconds; useful for rate limiting or waiting on external processes.
+- **`ask_user` tool** — Agent can ask the user a question mid-execution and wait for a response; supports free-text and predefined options. Backed by `UserQuestionService` with CompletableDeferred pattern (same as ToolApprovalService).
+- **`web_search` tool** — Search the web via Brave Search, SerpAPI, or DuckDuckGo Instant Answers; configurable provider and API key in `config.yaml`.
+- **`fetch_webpage` tool** — Fetch a URL, convert HTML to Markdown (via Jsoup), then process with the weak LLM model using a custom prompt; for extracting specific information from web pages.
+- **`run_process_background` tool** — Start a shell command in the background and return immediately with a `process_id`; for long-running builds or dev servers.
+- **`monitor_process` tool** — Read stdout from a background process started with `run_process_background`; call repeatedly to stream output.
+- **`code_intelligence` tool** — Analyze code structure without IDE: find symbol usages (grep), find definitions (ctags/grep), list symbols (ctags), run compiler diagnostics; works in CLI and plugin.
+- `ProcessManager` service for managing long-running background processes with start/stop/monitor lifecycle.
+- `UserQuestionService` for suspending the agent loop while waiting for user answers.
+- `CommandLimits` helper for centralized terminal command output/size limits.
+- DB migration `V3RenameSlashCommandToSlashPrompt` that remaps existing `prompts.type='SLASH_COMMAND'` rows to `SLASH_PROMPT` in-place; preserves user-defined and built-in entries.
+- **Multi-agent orchestration pipeline** — dedicated `core/agents/orchestration/` package with `OrchestrationDispatcher`, `TaskDecomposer`, and `ResultMerger`. Supports three strategies: `ORCHESTRATOR` (single turn with `multi-agent-coordinator` subagent dynamically spawning via `invoke_subagent`), `PARALLEL` (independent specs executed concurrently via `MultiAgentRunner`), and `PIPELINE` (linear chain using `{{prev.output}}` placeholder between stages).
+- **`CoreSessionService` in `:core`** — session lifecycle, state, and dispatch moved out of the IntelliJ plugin. Backed by `CoreSessionServiceFactory` and `DefaultWorkflowStreamingListener`. The IntelliJ `SessionManager` is now a thin adapter (886 → 704 LOC).
+- **Reusable tool-call listeners** — `AbstractToolCallLifecycleListener` base plus `CoreMessageToolCallListener` (core persistence) and `TuiToolCallListener` (TUI rendering); eliminates duplicated tool-call plumbing between plugin and CLI.
+- `BuiltinSubagentOverrides` — user-level override mechanism for built-in subagent definitions without forking the Markdown source.
+- `example-config.yaml` shipped under `core/src/main/resources/config/` as a reference for the new typed YAML schema.
+- `OpenAICompatibleAdapter` base class consolidating shared `/v1/chat/completions` protocol, SSE streaming, and per-provider quirks for all OpenAI-compatible backends.
+- Test fixtures `SubtaskResponseFixtures` and `TaskResponseFixtures` plus new adapter characterization tests (`OpenAICompatibleBaseTest`, `GenericOpenAIAdapterMalformedTest`).
+
+### Changed
+
+- **Renamed feature "Slash Commands" → "Prompts"** across code and UI. The `/name` invocation syntax is unchanged; these are prompt templates (not plugin/CLI commands), so everything now reflects that:
+  - API model `SlashCommand` → `SlashPrompt` (`core/src/main/kotlin/pl/jclab/refio/api/models/SlashPrompt.kt`)
+  - DB enum `PromptType.SLASH_COMMAND` → `SLASH_PROMPT` (migrated automatically via V3)
+  - Router API: `getEnabledCommands`/`findCommand`/`saveCommand` → `getEnabledSlashPrompts`/`findSlashPrompt`/`saveSlashPrompt`; request model `SaveCommandRequest` → `SaveSlashPromptRequest`
+  - IntelliJ dialog `CommandEditDialog` → `PromptEditDialog`; intention action `RefioSlashCommandIntentionAction` → `RefioSlashPromptIntentionAction`
+  - Settings UI: tab label `Commands` → `Prompts`, column `Command` → `Prompt`
+  - Chat input placeholder now reads `(@context, /prompt, !subagent)`
+  - CLI/TUI methods `getSlashCommands`/`processSlashCommands` → `getSlashPrompts`/`processSlashPrompts`
+- **CommandWhitelist system replaced by unified `CommandRule`** (regex-based `ALLOW` / `BLOCK` / `ASK`). The legacy `CommandWhitelist`, `CommandWhitelistConfig`, `CommandWhitelistDefaults`, and `CommandDenylist` classes were removed; default ruleset is expanded in `CommandRuleDefaults`. Terminal command decisions are now a single-pass regex evaluation with a catch-all `ASK` rule.
+- **Models settings tab no longer auto-refreshes from providers on open.** The panel now reads from the in-memory model cache (or DB visibility settings) and never blocks the UI on slow/unreachable providers. Remote model discovery is triggered only by the **Refresh** button. New `fetchIfMissing: Boolean` parameter was added to `ModelRegistry.getAllModels` / `ConfigRouter.getModelsWithVisibility` / `CoreApiClient.getModelsWithVisibility` (defaults preserve previous behavior for other callers). `ModelRegistry.getCachedModelsSnapshot()` exposes the cache without triggering fetches.
+- **Shorter `listModels` timeout for local providers.** Ollama and LM Studio now use a 3-second per-provider timeout when listing models (down from 15s) since they are localhost services and 15s just extended UI freezes when the daemon wasn't running. Cloud providers keep 15s.
+- **Tools settings tab layout is now responsive.** Fixed column widths (180/320/90/90) were replaced with flexible bounds: `Plan Mode` and `Agent Mode` columns are clamped to 80–110 px, while Tool Name and Description stretch with the panel. The Terminal Command Rules table uses the same flexible layout. Removes the forced 700 px horizontal overflow that cropped column headers on narrow sidebars.
+- Built-in system-agent prompt (`system-agent.md`) refreshed.
+- `ToolDescriptionBuilder` adjusted for the new tool set and prompt-template renames.
+- **`CoreApiRouter` modularized from ~987 → ~300 LOC.** Composition-root concerns split into dedicated modules under `core/api/modules/`: `CoreApiRouterBootstrap` (system tool registration, Ollama concurrency, core init), `DomainRouters` (12 domain router wiring), `AgentExecutionModule`, `ChatPlanningModule`, `PersistenceModule`, `ProjectRouterFactory`, `SupportServicesModule`, `EmbeddingProviderFactory`, `AnalysisStack`, and `AgentTurnLoopFactory`. `CoreApiRouter` is now a readable composition root with no business logic.
+- **`ConfigService` split (2175 → 290 LOC, –87%).** Responsibilities extracted to focused services: `ModelSelectionService` (logical-slot resolution, model visibility), `ConfigResolver` (precedence-aware reads), `ConfigValidator`, `ConfigDefaultsInitializer`, `ConfigYamlApplier` (DB ← YAML application), and `ContextBudgetResolver`. Public API remains stable via `ConfigService` delegation.
+- **`ConfigYaml` redesigned around a typed schema.** The 1300+ LOC monolithic loader/emitter replaced by `ConfigYamlModel` (`kotlinx.serialization` data classes with typed sections), `ConfigYamlEmitter`, `ConfigYamlIO`, `ConfigYamlMerger`, and `YamlSanitizer`. `ConfigYaml.kt` shrank to 76 LOC. Unknown keys now fail loud (aligned with the no-fallbacks policy).
+- **OpenAI-compatible adapter consolidation.** Per-provider classes now extend the new `OpenAICompatibleAdapter` base:
+  - `GenericOpenAIAdapter` 547 → 189 LOC
+  - `LMStudioAdapter` 461 → 108 LOC
+  - `OpenRouterAdapter` 869 → 180 LOC (keeps its `data:`-prefix/mid-stream-error quirks)
+  - `ZAIAdapter` reduced to 23 LOC with endpoint resolution delegated to the new `ZAIUrls` helper
+  - `OpenAICompatibleHelpers` gained `consumeChatCompletionsSSE`, `buildMessages`, `addCommonKwargs`, and `resolveEffectiveMaxTokens` used across the family.
+- **Coverage gate moved to `:core`.** `:core:jacocoTestCoverageVerification` enforces a 35% instructions minimum and is wired into `:core:check`. `:intellij-plugin:jacocoTestReport` still produces HTML/XML but no longer enforces a threshold (tests live in `:core`).
+- **Session layer migrated from `:intellij-plugin` to `:core`.** `SessionStateManager`, `SessionLifecycleService`, `MessageDispatcher`, `SubtaskTracker`, `PromptStateTracker`, and `ExecutionMonitor` live in `core/session/` and are reused by both the IntelliJ plugin and the TUI.
+- **TUI state layer simplified.** `TuiViewModel`, `TuiSessionViewModel`, `TuiChatViewModel`, `TuiState`, `TuiStepsView`, and `TuiWorkflowListener` aligned with the new core session service; silent subtask-operation fallbacks removed; shared factory companions (`TuiSessionEntry.fromTaskResponse`, `TuiSubtask.fromSubtaskResponse`) eliminate inline DTO mapping.
+- Settings panels (`AdvancedSettingsPanel`, `ContextSettingsPanel`, `GeneralSettingsPanel`, `ModelsSettingsPanel`, `ProvidersSettingsPanel`, `SettingsView`) rewritten against the typed config model.
+- `AgentTurnLoop` + `TurnLLMCaller`/`TurnToolExecutor`/`TurnFinalizer`/`TurnEventListener`/`TurnPromptBuilder`/`TurnPrompt` updated for listener-based tool-call lifecycle, cleaner prompt assembly, and richer event propagation.
+
+### Removed
+
+- `core/src/main/kotlin/pl/jclab/refio/api/models/TaskPlanModels.kt` — superseded by the subtask pipeline.
+- `TurnLoopConfigAliases.kt` and `TurnPromptAliases.kt` backcompat shims (replaced by a single `TurnPrompt.kt`).
+- Legacy `CoreApiClient` passthroughs and `DualLogger` backcompat shim under `services/logging/` (call sites migrated to domain routers / `core.logging.dualLogger`).
+
+### Fixed
+
+- Four EDT-blocking `runBlocking` calls in `SessionLifecycleService` converted to `withContext`, eliminating UI-thread stalls during session persistence.
+- Silent fallbacks removed from OpenAI response parsing, `CommandRule` regex compilation, and `HttpRequestTool` content-type detection — errors now surface instead of masking bad input.
+- Duplicate `runTurnCallback` wiring in `CoreApiRouter` (InvokeSubagent + DelegateToStrong) consolidated into a single source of truth for agent turn dispatch.
+
 ## [0.0.1.6] - 2025-04-12
 
 ### Added
diff --git a/CLAUDE.md b/CLAUDE.md
index fe3c479d..bea2d8e1 100644
--- a/CLAUDE.md
+++ b/CLAUDE.md
@@ -25,7 +25,8 @@ Local-first AI coding assistant for IntelliJ IDEA and the terminal. Kotlin/JVM p
 
 # Quality
 ./gradlew :intellij-plugin:check           # Includes detectSensitiveLogging task
-./gradlew :intellij-plugin:jacocoTestReport # Coverage report (40% minimum)
+./gradlew :core:jacocoTestCoverageVerification # Coverage gate (35% minimum instructions, wired into :core:check)
+./gradlew :intellij-plugin:jacocoTestReport    # Plugin coverage report (HTML/XML only, no threshold — tests live in :core)
 ```
 
 ## Module Architecture
@@ -46,17 +47,17 @@ UI (IntelliJ Swing / TUI Mordant)
     → Domain Routers (12 routers: Task, Chat, Agent, Subtask, Config, Prompts, Tool, RAG, ApiLogs, MultiAgent, ProjectContext, Subagent)
     → CoreApiRouter (composition root — creates dependencies, exposes routers, no business logic)
       → Execution (WorkflowOrchestrator → ChatService for CHAT | AgentTurnLoop for PLAN/AGENT)
-        → LLMClient (8 provider adapters) + ToolRegistry (15 tools) + ContextService (14 providers)
+        → LLMClient (8 provider adapters) + ToolRegistry (24 tools) + ContextService (14 providers)
           → Infrastructure (SQLite via Exposed ORM, Ktor HTTP, Caffeine cache)
 ```
 
-Callers access domain routers directly via `coreApiRouter.taskRouter`, `coreApiRouter.chatRouter`, etc. CoreApiRouter itself is a thin composition root (~987 LOC) with no facade methods.
+Callers access domain routers directly via `coreApiRouter.taskRouter`, `coreApiRouter.chatRouter`, etc. CoreApiRouter itself is a thin composition root (~300 LOC) with no facade methods.
 
 ## Three Execution Modes
 
 - **CHAT** — No tools. Conversation-only via WorkflowOrchestrator → ChatService.
-- **PLAN** — Read-only tools (6). AgentTurnLoop with max 25 iterations.
-- **AGENT** — Full read/write tools (15). AgentTurnLoop with max 50 iterations. File snapshots before edits.
+- **PLAN** — Read-only tools (14). AgentTurnLoop with max 25 iterations.
+- **AGENT** — Full read/write tools (24). AgentTurnLoop with max 50 iterations. File snapshots before edits.
 
 Subagents use a nested invocation model (max depth 3) with custom system prompts and tool filtering.
 
@@ -100,7 +101,7 @@ JUnit 5 + MockK + Turbine (Flow testing). Tests mirror source structure under `s
 
 ## Important Patterns
 
-- **Thin router pattern**: CoreApiRouter is a composition root (~987 LOC) that creates dependencies and exposes 12 domain routers. Callers use domain routers directly (e.g., `coreApiRouter.taskRouter.createTask()`). No facade methods — zero business logic in CoreApiRouter.
+- **Thin router pattern**: CoreApiRouter is a composition root (~300 LOC) that creates dependencies and exposes 12 domain routers. Callers use domain routers directly (e.g., `coreApiRouter.taskRouter.createTask()`). No facade methods — zero business logic in CoreApiRouter.
 - **StateFlow reactivity**: SessionManager exposes 11 StateFlows; UI observes via `Flow.collect`.
 - **Separate source trees**: Each module has its own `src/main/kotlin`. When adding new core files, ensure they don't depend on IntelliJ Platform APIs — the `:core` module has no IntelliJ dependency.
 - **Security layers**: PathSandbox restricts file ops to project root; CommandRule (regex-based ALLOW/BLOCK/ASK) replaces legacy CommandWhitelist for terminal commands; FileLimits enforces size/extension restrictions. ToolPermissionsService provides 3-level (ON/ASK/OFF) per-mode access control. ToolApprovalService handles user approval flow with session trust rules.
diff --git a/README.md b/README.md
index f5e4b346..0322b80c 100644
--- a/README.md
+++ b/README.md
@@ -138,7 +138,7 @@ Refio includes a standalone CLI with a full-screen TUI that mirrors the IntelliJ
 
 - **Split-pane layout** — Chat on the left (55%), active tab on the right (45%). Full-width when no tab is selected.
 - **8 tabs + 2 screens** — F1 Help, F2-F7 tabs (Steps, Context, RAG, Logs, Debug, API), F8 Files, F9 Settings. Tab bar shows mode, cost, and streaming status.
-- **Two input modes** — Raw TTY (real terminal: F-keys, Shift+Tab, single-char dispatch) and line mode (IDE terminal, pipes: /commands, :tab shortcuts).
+- **Two input modes** — Raw TTY (real terminal: F-keys, Shift+Tab, single-char dispatch) and line mode (IDE terminal, pipes: `/prompt`, `:tab` shortcuts).
 - **@context autocomplete** — Type `@` for a popup with context prefixes (file, folder, grep, diff, etc.). 9 of 14 providers available in CLI mode.
 - **Settings screen** — 11 sub-tabs covering providers, models, prompts, context/RAG, MCP, tools, subagents, and more.
 - **Resize-responsive** — UI adapts to terminal window size changes in real time.
@@ -199,7 +199,7 @@ Refio includes a standalone CLI with a full-screen TUI that mirrors the IntelliJ
 |-----|--------|
 | `@` | Open context autocomplete (@file, @folder, @grep, @diff, etc.) |
 | `!` | Open subagent autocomplete (!review, !security, etc.) |
-| `/` | Open slash command autocomplete (/explain, /refactor, etc.) |
+| `/` | Open slash prompt autocomplete (/explain, /refactor, etc.) |
 | Tab / Arrow Down | Next autocomplete candidate |
 | Arrow Up | Previous autocomplete candidate |
 | Enter | Accept autocomplete selection |
@@ -209,7 +209,7 @@ Refio includes a standalone CLI with a full-screen TUI that mirrors the IntelliJ
 
 - **Multi-line input** — input expands up to 4 visible lines as you type
 - **Paste support** — large pastes (>200 chars) show a preview marker
-- **Slash commands** — type `/` at the start for prompt templates (/explain, /test, /fix, /refactor, etc.)
+- **Slash prompts** — type `/` at the start for reusable prompt templates (/explain, /test, /fix, /refactor, etc.). Manage them under **Settings → Prompts**.
 - **System commands** — /help, /quit, /clear, /history, /export, /resend, /rewind, /edit, and more (type `/help` for full list)
 
 ### Architecture
diff --git a/cli/src/main/kotlin/pl/jclab/refio/cli/StandaloneCoreBootstrap.kt b/cli/src/main/kotlin/pl/jclab/refio/cli/StandaloneCoreBootstrap.kt
index 5bb78216..dc1a556d 100644
--- a/cli/src/main/kotlin/pl/jclab/refio/cli/StandaloneCoreBootstrap.kt
+++ b/cli/src/main/kotlin/pl/jclab/refio/cli/StandaloneCoreBootstrap.kt
@@ -101,7 +101,7 @@ class StandaloneCoreBootstrap(
         val projectRouter = appRouter.createProjectRouter(
             projectRoot = absolutePath,
             projectHandle = projectHandle,
-            ideProject = null
+            platformProject = null
         )
         projectRouter.initialize(dbPath)
         this.projectRouter = projectRouter
diff --git a/cli/src/main/kotlin/pl/jclab/refio/cli/tui/input/TuiCompleter.kt b/cli/src/main/kotlin/pl/jclab/refio/cli/tui/input/TuiCompleter.kt
index d5a1ce64..3f003714 100644
--- a/cli/src/main/kotlin/pl/jclab/refio/cli/tui/input/TuiCompleter.kt
+++ b/cli/src/main/kotlin/pl/jclab/refio/cli/tui/input/TuiCompleter.kt
@@ -4,21 +4,21 @@ import org.jline.reader.Candidate
 import org.jline.reader.Completer
 import org.jline.reader.LineReader
 import org.jline.reader.ParsedLine
-import pl.jclab.refio.api.models.SlashCommand
+import pl.jclab.refio.api.models.SlashPrompt
 
 /**
  * JLine3 Completer for TUI prompt input.
- * Provides completion for @mentions, /slash commands (prompt templates), and !subagents.
+ * Provides completion for @mentions, /slash prompts (prompt templates), and !subagents.
  */
 class TuiCompleter : Completer {
 
     /** System commands removed — all operations accessed through GUI keybindings/screens */
     private val systemCommands = emptyList<String>()
 
-    /** Slash command prompt templates from SlashCommand.BUILTINS */
-    private val slashCommands: List<Pair<String, String>> by lazy {
+    /** Slash prompt templates from SlashPrompt.BUILTINS */
+    private val slashPrompts: List<Pair<String, String>> by lazy {
         try {
-            SlashCommand.BUILTINS.map { "/${it.name}" to it.description }
+            SlashPrompt.BUILTINS.map { "/${it.name}" to it.description }
         } catch (_: Exception) {
             emptyList()
         }
@@ -34,8 +34,8 @@ class TuiCompleter : Completer {
 
         when {
             word.startsWith("/") -> {
-                // Slash commands (prompt templates) first
-                slashCommands.filter { it.first.startsWith(word) }
+                // Slash prompts (prompt templates) first
+                slashPrompts.filter { it.first.startsWith(word) }
                     .forEach { candidates.add(Candidate(it.first, it.first, null, it.second, null, null, true)) }
                 // Then system commands
                 systemCommands.filter { it.startsWith(word) }
diff --git a/cli/src/main/kotlin/pl/jclab/refio/cli/tui/input/TuiInputHandler.kt b/cli/src/main/kotlin/pl/jclab/refio/cli/tui/input/TuiInputHandler.kt
index b72eb8fc..e0de96b9 100644
--- a/cli/src/main/kotlin/pl/jclab/refio/cli/tui/input/TuiInputHandler.kt
+++ b/cli/src/main/kotlin/pl/jclab/refio/cli/tui/input/TuiInputHandler.kt
@@ -666,17 +666,17 @@ class TuiInputHandler(private val terminal: Terminal) {
     }
 
     /**
-     * Handle slash commands (prompt templates only).
+     * Handle slash prompts (prompt templates only).
      * Returns true if the input was handled.
      *
      * All system operations (history, settings, export, etc.) are accessed
-     * through GUI keybindings and screens, not slash commands.
-     * Slash commands: /explain, /refactor, etc. — prompt templates from SlashCommand.BUILTINS
+     * through GUI keybindings and screens, not slash prompts.
+     * Slash prompts: /explain, /refactor, etc. — prompt templates from SlashPrompt.BUILTINS
      */
     internal fun handleCommand(input: String, viewModel: TuiViewModel): Boolean {
-        // Slash commands (prompt templates) are NOT handled here.
-        // They are processed inline in TuiViewModel.sendMessage() — same as the plugin.
-        // Unknown /commands are passed through as normal messages.
+        // Slash prompts (prompt templates) are NOT handled here.
+        // They are expanded inline in TuiViewModel.sendMessage() — same as the plugin.
+        // Unknown /names are passed through as normal messages.
         return false
     }
 }
diff --git a/cli/src/main/kotlin/pl/jclab/refio/cli/tui/screens/TuiSettingsScreen.kt b/cli/src/main/kotlin/pl/jclab/refio/cli/tui/screens/TuiSettingsScreen.kt
index fcbcb7b1..9e9b4ecd 100644
--- a/cli/src/main/kotlin/pl/jclab/refio/cli/tui/screens/TuiSettingsScreen.kt
+++ b/cli/src/main/kotlin/pl/jclab/refio/cli/tui/screens/TuiSettingsScreen.kt
@@ -173,9 +173,9 @@ object TuiSettingsScreen {
                 "lmstudio.lmstudio_context_size" to "Context size"
             ),
             "Custom OpenAI" to listOf(
-                "custom_openai.custom_openai_base_url" to "Base URL",
-                "custom_openai.custom_openai_api_key" to "API Key",
-                "custom_openai.custom_openai_model" to "Model"
+                "generic_openai.generic_openai_base_url" to "Base URL",
+                "generic_openai.generic_openai_api_key" to "API Key",
+                "generic_openai.generic_openai_model" to "Model"
             ),
             "Z.AI" to listOf(
                 "zai.zai_base_url" to "Base URL",
diff --git a/cli/src/main/kotlin/pl/jclab/refio/cli/tui/state/TuiChatViewModel.kt b/cli/src/main/kotlin/pl/jclab/refio/cli/tui/state/TuiChatViewModel.kt
index 7632b5e4..30e07c58 100644
--- a/cli/src/main/kotlin/pl/jclab/refio/cli/tui/state/TuiChatViewModel.kt
+++ b/cli/src/main/kotlin/pl/jclab/refio/cli/tui/state/TuiChatViewModel.kt
@@ -8,7 +8,7 @@ import kotlinx.coroutines.flow.StateFlow
 import kotlinx.coroutines.flow.update
 import kotlinx.coroutines.launch
 import mu.KotlinLogging
-import pl.jclab.refio.api.models.SlashCommand
+import pl.jclab.refio.api.models.SlashPrompt
 import pl.jclab.refio.cli.tui.input.TuiContextValidator
 import pl.jclab.refio.core.agents.events.AgentEvent
 import pl.jclab.refio.core.agents.events.AgentEventBus
@@ -224,31 +224,31 @@ class TuiChatViewModel(
         }
     }
 
-    // --- Slash commands (prompt templates) ---
+    // --- Slash prompts (prompt templates) ---
 
-    fun getSlashCommands(): List<SlashCommand> {
-        return SlashCommand.BUILTINS
+    fun getSlashPrompts(): List<SlashPrompt> {
+        return SlashPrompt.BUILTINS
     }
 
     /**
-     * Process slash commands inline (same as plugin's PromptInputPanel.processSlashCommand).
-     * Replaces each "/command" with its template, supporting multiple commands anywhere in text.
-     * Only matches /command after whitespace or at start (not in URLs like https://example.com).
+     * Expand slash prompts inline (same as plugin's PromptInputPanel.processSlashPrompt).
+     * Replaces each "/name" with its template, supporting multiple occurrences anywhere in text.
+     * Only matches /name after whitespace or at start (not in URLs like https://example.com).
      */
-    fun processSlashCommands(text: String): String {
-        val commandRegex = Regex("""(?<=\s|^)/([\w-]+)""")
-        val matches = commandRegex.findAll(text).toList()
+    fun processSlashPrompts(text: String): String {
+        val slashRegex = Regex("""(?<=\s|^)/([\w-]+)""")
+        val matches = slashRegex.findAll(text).toList()
         if (matches.isEmpty()) return text
 
-        val commands = getSlashCommands()
+        val slashPrompts = getSlashPrompts()
         var result = text
         var offset = 0
 
         for (match in matches) {
-            val commandName = match.groupValues[1]
-            val cmd = commands.find { it.name.equals(commandName, ignoreCase = true) } ?: continue
+            val promptName = match.groupValues[1]
+            val sp = slashPrompts.find { it.name.equals(promptName, ignoreCase = true) } ?: continue
 
-            var template = cmd.template
+            var template = sp.template
 
             // Substitute template variables
             template = template
@@ -396,7 +396,7 @@ class TuiChatViewModel(
                     for (p in prompts.prompts.take(5)) {
                         appendLine()
                         appendLine("--- ${p.type} ${if (p.isEnabled) "✓" else "✗"} ---")
-                        val content = p.content ?: "(empty)"
+                        val content = p.content
                         appendLine(content.take(500))
                         if (content.length > 500) appendLine("... (${content.length} chars)")
                     }
@@ -583,8 +583,8 @@ class TuiChatViewModel(
             return
         }
 
-        // Process slash commands inline (like the plugin does)
-        val processedInput = processSlashCommands(input)
+        // Expand slash prompts inline (like the plugin does)
+        val processedInput = processSlashPrompts(input)
 
         // Validate context references before sending
         val contextWarning = validateContextReferences(processedInput)
@@ -651,7 +651,7 @@ class TuiChatViewModel(
                         // Finalize stream with the full response
                         workflowListener.onStreamComplete(response.output)
                         // Update metrics
-                        response.costs?.let { costs ->
+                        response.costs.let { costs ->
                             onUpdateTotalTokens((costs.tokensIn + costs.tokensOut).toLong())
                             onUpdateTotalCost(costs.usdEst)
                         }
@@ -660,110 +660,14 @@ class TuiChatViewModel(
                     CoreTaskMode.PLAN, CoreTaskMode.AGENT -> {
                         onUpdateExecutionStatus(if (taskMode == CoreTaskMode.PLAN) "Planning..." else "Agent executing...")
 
-                        // Track temp tool message IDs for real-time updates
-                        val toolCallMessageIds = mutableMapOf<String, String>()
-
-                        val turnListener = object : AgentTurnLoop.TurnEventListener {
-                            override fun onTurnStarted(
-                                taskId: String,
-                                mode: pl.jclab.refio.core.db.TaskMode,
-                                runId: String,
-                                parentRunId: String?,
-                                depth: Int
-                            ) {
-                                logger.info { "[TURN] Started: mode=$mode, depth=$depth" }
-                            }
-
-                            override fun onToolExecutionStarted(taskId: String, toolCall: ToolCallData) {
-                                logger.info { "[TURN] Tool started: ${toolCall.name}" }
-                                workflowListener.onToolStarted(toolCall.name)
-
-                                // Add temporary tool message for real-time display
-                                val tempId = "temp-${toolCall.id}"
-                                val argsSummary = try {
-                                    val args = toolCall.arguments
-                                    if (args.length <= 120) args else "${args.take(120)}..."
-                                } catch (_: Exception) { "" }
-
-                                _messages.update { messages ->
-                                    messages + TuiChatMessage(
-                                        id = tempId,
-                                        timestamp = System.currentTimeMillis(),
-                                        role = "tool",
-                                        content = "Running ${toolCall.name}...",
-                                        messageType = TuiMessageType.TOOL_CALL,
-                                        toolName = toolCall.name,
-                                        isStreaming = true,
-                                        metadata = mapOf("args" to argsSummary)
-                                    )
-                                }
-                                toolCallMessageIds[toolCall.id] = tempId
-
-                                // Reload subtasks from DB for steps panel
-                                scope.launch {
-                                    try { onLoadSubtasksFromDb(r, tid) } catch (_: Exception) {}
-                                }
-                            }
-
-                            override fun onToolStreamChunk(
-                                taskId: String,
-                                toolCallId: String,
-                                delta: String,
-                                accumulated: String
-                            ) {
-                                val msgId = toolCallMessageIds[toolCallId] ?: return
-                                _messages.update { messages ->
-                                    messages.map { msg ->
-                                        if (msg.id == msgId) msg.copy(content = accumulated, isStreaming = true)
-                                        else msg
-                                    }
-                                }
-                            }
-
-                            override fun onToolExecutionCompleted(
-                                taskId: String,
-                                toolCall: ToolCallData,
-                                result: String,
-                                success: Boolean
-                            ) {
-                                logger.info { "[TURN] Tool completed: ${toolCall.name}, success=$success" }
-                                val msgId = toolCallMessageIds.remove(toolCall.id) ?: return
-                                val resultSummary = if (result.isNotBlank()) {
-                                    val trimmed = result.trim()
-                                    if (trimmed.length <= 200) trimmed else "${trimmed.take(200)}..."
-                                } else if (success) "Done" else "Failed"
-
-                                _messages.update { messages ->
-                                    messages.map { msg ->
-                                        if (msg.id == msgId) msg.copy(
-                                            content = resultSummary,
-                                            isStreaming = false,
-                                            metadata = msg.metadata + ("success" to success)
-                                        )
-                                        else msg
-                                    }
-                                }
-
-                                // Reload subtasks for steps panel
-                                scope.launch {
-                                    try { onLoadSubtasksFromDb(r, tid) } catch (_: Exception) {}
-                                }
-                            }
-
-                            override fun onStreamChunk(taskId: String, delta: String, accumulated: String) {
-                                // Handled by streamCallback
-                            }
-
-                            override fun onTurnCompleted(
-                                taskId: String,
-                                result: TurnResult,
-                                runId: String,
-                                parentRunId: String?,
-                                depth: Int
-                            ) {
-                                logger.info { "[TURN] Completed: success=${result.success}" }
-                            }
-                        }
+                        val turnListener = TuiToolCallListener(
+                            scope = scope,
+                            messagesState = _messages,
+                            onToolStarted = { toolName -> workflowListener.onToolStarted(toolName) },
+                            onReloadSubtasks = {
+                                try { onLoadSubtasksFromDb(r, tid) } catch (_: Exception) {}
+                            },
+                        )
 
                         val turnRequest = TurnRequest(
                             taskId = tid,
@@ -801,6 +705,22 @@ class TuiChatViewModel(
 
                 // Refresh API logs from database
                 onRefreshApiLogs(r)
+            } catch (e: pl.jclab.refio.core.errors.RefioError.MalformedResponse) {
+                logger.error(e) {
+                    "Malformed response from provider=${e.provider}/${e.model}: reason=${e.reason}, " +
+                            "bodyPreview=${e.bodyPreview.take(500)}"
+                }
+                _isStreaming.value = false
+                onUpdateExecutionStatus("Error")
+                _messages.update { messages ->
+                    messages + TuiChatMessage(
+                        id = UUID.randomUUID().toString(),
+                        timestamp = System.currentTimeMillis(),
+                        role = "system",
+                        content = "Provider ${e.provider} returned an invalid response — check CLI logs for details.",
+                        messageType = TuiMessageType.AGENT_FAILED
+                    )
+                }
             } catch (e: Exception) {
                 logger.error(e) { "Workflow error" }
                 _isStreaming.value = false
@@ -880,10 +800,10 @@ class TuiChatViewModel(
     private var cachedSubagentNames: List<String> = emptyList()
     private var subagentCacheTime: Long = 0
 
-    /** Built-in slash commands for autocomplete */
+    /** Built-in slash prompts for autocomplete */
     private val builtinCommandNames: List<String> by lazy {
         try {
-            pl.jclab.refio.api.models.SlashCommand.BUILTINS.map { "/${it.name}" }
+            pl.jclab.refio.api.models.SlashPrompt.BUILTINS.map { "/${it.name}" }
         } catch (_: Exception) {
             listOf("/explain", "/fix", "/test", "/refactor", "/optimize", "/simplify",
                 "/document", "/security-review", "/translate", "/implement")
diff --git a/cli/src/main/kotlin/pl/jclab/refio/cli/tui/state/TuiObservabilityViewModel.kt b/cli/src/main/kotlin/pl/jclab/refio/cli/tui/state/TuiObservabilityViewModel.kt
index e7e417b8..622f223c 100644
--- a/cli/src/main/kotlin/pl/jclab/refio/cli/tui/state/TuiObservabilityViewModel.kt
+++ b/cli/src/main/kotlin/pl/jclab/refio/cli/tui/state/TuiObservabilityViewModel.kt
@@ -327,17 +327,15 @@ class TuiObservabilityViewModel(
                 }
 
                 // Always load indexed files for the RAG files table
-                try {
-                    val files = r.ragRouter.getRagIndexedFiles()
-                    _ragIndexedFiles.value = files.map { f ->
-                        TuiRagFile(
-                            filePath = f.filePath,
-                            chunks = f.chunksCount,
-                            embeddings = f.embeddingsCount,
-                            sizeBytes = f.fileSize
-                        )
-                    }
-                } catch (_: Exception) {}
+                val files = r.ragRouter.getRagIndexedFiles()
+                _ragIndexedFiles.value = files.map { f ->
+                    TuiRagFile(
+                        filePath = f.filePath,
+                        chunks = f.chunksCount,
+                        embeddings = f.embeddingsCount,
+                        sizeBytes = f.fileSize
+                    )
+                }
             } catch (e: Exception) {
                 logger.debug(e) { "RAG stats not available (indexing may not be configured)" }
             }
@@ -350,7 +348,7 @@ class TuiObservabilityViewModel(
         scope.launch {
             val r = getRouter() ?: return@launch
             try {
-                val source = r.addDocumentationSource(url, depth)
+                val source = r.ragRouter.addDocumentationSource(url, depth)
                 addSystemMessageFn("Added documentation source: $url (ID: ${source.id})")
             } catch (e: Exception) {
                 addSystemMessageFn("Failed to add docs: ${e.message}")
@@ -362,7 +360,7 @@ class TuiObservabilityViewModel(
         scope.launch {
             val r = getRouter() ?: return@launch
             try {
-                r.deleteDocumentationSource(docId)
+                r.ragRouter.deleteDocumentationSource(docId)
                 addSystemMessageFn("Deleted documentation source #$docId")
             } catch (e: Exception) {
                 addSystemMessageFn("Failed to delete docs: ${e.message}")
@@ -375,7 +373,7 @@ class TuiObservabilityViewModel(
             val r = getRouter() ?: return@launch
             try {
                 addSystemMessageFn("Indexing documentation #$docId...")
-                r.indexDocumentation(docId).collect { progress ->
+                r.ragRouter.indexDocumentation(docId).collect { progress ->
                     _ragIndexingStatus.value = progress.statusMessage
                     _ragIndexingProgress.value = progress.progressPercent / 100.0
                 }
@@ -419,7 +417,6 @@ class TuiObservabilityViewModel(
         "code_analysis" to 3, "framework_analysis" to 3,
         "current_task" to 4, "subtasks" to 4,
         "conversation_history" to 5, "conversation" to 5, "recent_work" to 5,
-        "rag_fragments" to 6, "rag_index" to 6,
         "user_context" to 7, "user_requirements" to 7, "task_requirements" to 7,
         "key_components" to 8, "domain_analysis" to 8,
         "working_memory" to 9, "mcp_resources" to 9,
@@ -437,7 +434,6 @@ class TuiObservabilityViewModel(
             key.startsWith("project") || key == "semantic_summary" -> "project"
             key.startsWith("user") -> "user"
             key == "task_requirements" -> "user"
-            key.startsWith("rag") -> "rag"
             key.startsWith("conversation") || key == "recent_work" -> "conversation"
             key.startsWith("mcp") || key.startsWith("tool") -> "tools"
             else -> "project"
@@ -475,7 +471,6 @@ class TuiObservabilityViewModel(
         "user_context" to "USER_PROVIDED_CONTEXT",
         "working_memory" to "WORKING_MEMORY",
         "mcp_resources" to "MCP_RESOURCES",
-        "rag_fragments" to "RAG_FRAGMENTS",
         "conversation" to "CONVERSATION_HISTORY",
         "recent_work" to "RECENT_WORK",
         "subtasks" to "SUBTASKS_STATUS",
diff --git a/cli/src/main/kotlin/pl/jclab/refio/cli/tui/state/TuiSessionViewModel.kt b/cli/src/main/kotlin/pl/jclab/refio/cli/tui/state/TuiSessionViewModel.kt
index 90164c0c..22e5f345 100644
--- a/cli/src/main/kotlin/pl/jclab/refio/cli/tui/state/TuiSessionViewModel.kt
+++ b/cli/src/main/kotlin/pl/jclab/refio/cli/tui/state/TuiSessionViewModel.kt
@@ -10,6 +10,7 @@ import kotlinx.coroutines.launch
 import mu.KotlinLogging
 import pl.jclab.refio.core.api.CoreApiRouter
 import pl.jclab.refio.core.api.ModelOperation
+import pl.jclab.refio.core.api.SubtaskResponse
 import pl.jclab.refio.core.api.TurnRequest
 import pl.jclab.refio.core.api.UpdateSubtaskRequest
 import pl.jclab.refio.core.api.UpdateTaskRequest
@@ -32,22 +33,19 @@ class TuiSessionViewModel(
     internal val mode: MutableStateFlow<String>,
     internal val model: MutableStateFlow<String?>,
     internal val projectPath: Path,
-    internal val projectId: String
+    internal val projectId: String,
+    internal val stateManager: pl.jclab.refio.core.session.SessionStateManager
 ) {
     // --- StateFlows owned by this sub-VM ---
 
-    /** Exposed as internal for coordinator wiring (workflowListener, clearSteps callback). */
-    internal val _stepsInternal = MutableStateFlow<List<TuiStep>>(emptyList())
-    val steps: StateFlow<List<TuiStep>> = _stepsInternal.asStateFlow()
+    /** Subtasks sourced from core [pl.jclab.refio.core.session.SessionStateManager]. */
+    val subtasks: StateFlow<List<SubtaskResponse>> = stateManager.subtasks
 
-    private val _subtasks = MutableStateFlow<List<TuiSubtask>>(emptyList())
-    val subtasks: StateFlow<List<TuiSubtask>> = _subtasks.asStateFlow()
+    private val _activePlan = MutableStateFlow<List<SubtaskResponse>?>(null)
+    val activePlan: StateFlow<List<SubtaskResponse>?> = _activePlan.asStateFlow()
 
-    private val _activePlan = MutableStateFlow<TuiPlan?>(null)
-    val activePlan: StateFlow<TuiPlan?> = _activePlan.asStateFlow()
-
-    private val _isPaused = MutableStateFlow(false)
-    val isPaused: StateFlow<Boolean> = _isPaused.asStateFlow()
+    /** Paused flag sourced from core [pl.jclab.refio.core.session.SessionStateManager]. */
+    val isPaused: StateFlow<Boolean> = stateManager.isPaused
 
     private val _pendingPlanApproval = MutableStateFlow<TuiPlanApproval?>(null)
     val pendingPlanApproval: StateFlow<TuiPlanApproval?> = _pendingPlanApproval.asStateFlow()
@@ -61,8 +59,8 @@ class TuiSessionViewModel(
     internal val _executionMode = MutableStateFlow("AUTO")
     val executionMode: StateFlow<String> = _executionMode.asStateFlow()
 
-    private val _sessions = MutableStateFlow<List<TuiSessionEntry>>(emptyList())
-    val sessions: StateFlow<List<TuiSessionEntry>> = _sessions.asStateFlow()
+    private val _sessions = MutableStateFlow<List<pl.jclab.refio.core.api.TaskResponse>>(emptyList())
+    val sessions: StateFlow<List<pl.jclab.refio.core.api.TaskResponse>> = _sessions.asStateFlow()
 
     private val _selectedHistoryIndex = MutableStateFlow(0)
     val selectedHistoryIndex: StateFlow<Int> = _selectedHistoryIndex.asStateFlow()
@@ -85,11 +83,9 @@ class TuiSessionViewModel(
     private val _totalTokens = MutableStateFlow(0L)
     val totalTokens: StateFlow<Long> = _totalTokens.asStateFlow()
 
-    private val _thinkingEnabled = MutableStateFlow(false)
-    val thinkingEnabled: StateFlow<Boolean> = _thinkingEnabled.asStateFlow()
-
-    private val _noEgressEnabled = MutableStateFlow(false)
-    val noEgressEnabled: StateFlow<Boolean> = _noEgressEnabled.asStateFlow()
+    /** Thinking/no-egress flags sourced from core [pl.jclab.refio.core.session.SessionStateManager]. */
+    val thinkingEnabled: StateFlow<Boolean> = stateManager.thinkingEnabled
+    val noEgressEnabled: StateFlow<Boolean> = stateManager.noEgressEnabled
 
     // --- Callbacks to parent TuiViewModel ---
 
@@ -142,11 +138,11 @@ class TuiSessionViewModel(
     var resolveContextWindow: (String?) -> Unit = {}
 
     fun setNoEgressEnabled(enabled: Boolean) {
-        _noEgressEnabled.value = enabled
+        stateManager.setNoEgressEnabled(enabled)
     }
 
     fun setThinkingEnabled(enabled: Boolean) {
-        _thinkingEnabled.value = enabled
+        stateManager.setThinkingEnabled(enabled)
     }
 
     // =============================================
@@ -157,21 +153,8 @@ class TuiSessionViewModel(
         val r = getRouter() ?: return
         scope.launch(Dispatchers.IO) {
             try {
-                val tasks = r.taskRouter.listTasks().tasks
-                _sessions.value = tasks.map { task ->
-                    TuiSessionEntry(
-                        id = task.id,
-                        name = task.name,
-                        mode = task.mode,
-                        status = task.status,
-                        tokensIn = task.tokensIn,
-                        tokensOut = task.tokensOut,
-                        costUsd = task.costUsd,
-                        createdAt = task.createdAt,
-                        updatedAt = task.updatedAt,
-                        pinned = task.pinned
-                    )
-                }.sortedByDescending { it.updatedAt }
+                _sessions.value = r.taskRouter.listTasks().tasks
+                    .sortedByDescending { it.updatedAt }
             } catch (e: Exception) {
                 logger.warn(e) { "Failed to load sessions" }
             }
@@ -234,7 +217,7 @@ class TuiSessionViewModel(
         _selectedHistoryIndex.value = 0
     }
 
-    private fun filteredSessions(): List<TuiSessionEntry> {
+    private fun filteredSessions(): List<pl.jclab.refio.core.api.TaskResponse> {
         val filter = _historyFilter.value
         return if (filter == "*") _sessions.value
         else _sessions.value.filter { it.mode == filter }
@@ -247,7 +230,7 @@ class TuiSessionViewModel(
         setTaskId(newId)
         clearMessages()
         clearSteps()
-        _subtasks.value = emptyList()
+        stateManager.setSubtasks(emptyList())
         _activePlan.value = null
         _pendingPlanApproval.value = null
         clearContextSections()
@@ -301,29 +284,29 @@ class TuiSessionViewModel(
         _executionStatus.value = status
     }
 
-    fun setSubtasks(subtasks: List<TuiSubtask>) {
-        _subtasks.value = subtasks
+    fun setSubtasks(subtasks: List<SubtaskResponse>) {
+        stateManager.setSubtasks(subtasks)
     }
 
     fun updateSubtaskStatus(subtaskId: String, status: String, error: String? = null) {
-        _subtasks.update { list ->
-            list.map {
-                if (it.id == subtaskId) it.copy(status = status, error = error) else it
+        stateManager.setSubtasks(
+            stateManager.getSubtasks().map {
+                if (it.id == subtaskId) it.copy(status = status, errorMessage = error) else it
             }
-        }
+        )
     }
 
-    fun setPendingPlanApproval(plan: TuiPlan) {
+    fun setPendingPlanApproval(taskId: String, steps: List<SubtaskResponse>) {
         _pendingPlanApproval.value = TuiPlanApproval(
-            taskId = plan.taskId,
-            plan = plan
+            taskId = taskId,
+            steps = steps,
         )
     }
 
     fun approvePlan() {
         val approval = _pendingPlanApproval.value ?: return
-        _activePlan.value = approval.plan
-        _subtasks.value = approval.plan.steps
+        _activePlan.value = approval.steps
+        stateManager.setSubtasks(approval.steps)
         _pendingPlanApproval.value = null
     }
 
@@ -336,93 +319,52 @@ class TuiSessionViewModel(
     // Subtask operations
     // =============================================
 
-    fun approveSubtask(subtaskId: String) {
+    private fun updateSubtaskApproval(subtaskId: String, status: ApprovalStatus) {
         scope.launch {
             val r = getRouter() ?: return@launch
             val tid = getTaskId() ?: return@launch
-            try {
-                r.subtaskRouter.updateSubtask(tid, subtaskId, UpdateSubtaskRequest(approvalStatus = ApprovalStatus.APPROVED))
-                loadSubtasksFromDb(r, tid)
-            } catch (e: Exception) {
-                logger.warn(e) { "Failed to approve subtask: $subtaskId" }
-                updateSubtaskStatus(subtaskId, "APPROVED") // fallback to local
-            }
+            r.subtaskRouter.updateSubtask(tid, subtaskId, UpdateSubtaskRequest(approvalStatus = status))
+            loadSubtasksFromDb(r, tid)
         }
     }
 
-    fun skipSubtask(subtaskId: String) {
-        scope.launch {
-            val r = getRouter() ?: return@launch
-            val tid = getTaskId() ?: return@launch
-            try {
-                r.subtaskRouter.updateSubtask(tid, subtaskId, UpdateSubtaskRequest(approvalStatus = ApprovalStatus.SKIPPED))
-                loadSubtasksFromDb(r, tid)
-            } catch (e: Exception) {
-                logger.warn(e) { "Failed to skip subtask: $subtaskId" }
-                updateSubtaskStatus(subtaskId, "SKIPPED") // fallback to local
-            }
-        }
-    }
+    fun approveSubtask(subtaskId: String) = updateSubtaskApproval(subtaskId, ApprovalStatus.APPROVED)
+
+    fun skipSubtask(subtaskId: String) = updateSubtaskApproval(subtaskId, ApprovalStatus.SKIPPED)
 
     fun deleteSubtask(subtaskId: String) {
         scope.launch {
             val r = getRouter() ?: return@launch
             val tid = getTaskId() ?: return@launch
-            try {
-                r.subtaskRouter.deleteSubtask(tid, subtaskId)
-                loadSubtasksFromDb(r, tid)
-            } catch (e: Exception) {
-                logger.warn(e) { "Failed to delete subtask: $subtaskId" }
-                _subtasks.update { it.filter { s -> s.id != subtaskId } } // fallback
-            }
+            r.subtaskRouter.deleteSubtask(tid, subtaskId)
+            loadSubtasksFromDb(r, tid)
         }
     }
 
     fun moveStepUp(index: Int) {
         if (index <= 0) return
-        val list = _subtasks.value
+        val list = stateManager.getSubtasks()
         val current = list.getOrNull(index) ?: return
         val above = list.getOrNull(index - 1) ?: return
         scope.launch {
             val r = getRouter() ?: return@launch
             val tid = getTaskId() ?: return@launch
-            try {
-                r.subtaskRouter.swapSubtaskOrder(tid, current.id, above.id)
-                loadSubtasksFromDb(r, tid)
-            } catch (e: Exception) {
-                logger.warn(e) { "Failed to swap subtask order" }
-                // Fallback to local swap
-                _subtasks.update { l ->
-                    l.toMutableList().apply {
-                        val item = removeAt(index)
-                        add(index - 1, item)
-                    }
-                }
-            }
+            r.subtaskRouter.swapSubtaskOrder(tid, current.id, above.id)
+            loadSubtasksFromDb(r, tid)
         }
         _selectedStepIndex.update { (it - 1).coerceAtLeast(0) }
     }
 
     fun moveStepDown(index: Int) {
-        val list = _subtasks.value
+        val list = stateManager.getSubtasks()
         if (index >= list.size - 1) return
         val current = list.getOrNull(index) ?: return
         val below = list.getOrNull(index + 1) ?: return
         scope.launch {
             val r = getRouter() ?: return@launch
             val tid = getTaskId() ?: return@launch
-            try {
-                r.subtaskRouter.swapSubtaskOrder(tid, current.id, below.id)
-                loadSubtasksFromDb(r, tid)
-            } catch (e: Exception) {
-                logger.warn(e) { "Failed to swap subtask order" }
-                _subtasks.update { l ->
-                    l.toMutableList().apply {
-                        val item = removeAt(index)
-                        add(index + 1, item)
-                    }
-                }
-            }
+            r.subtaskRouter.swapSubtaskOrder(tid, current.id, below.id)
+            loadSubtasksFromDb(r, tid)
         }
         _selectedStepIndex.update { (it + 1).coerceAtMost(list.size - 1) }
     }
@@ -431,34 +373,27 @@ class TuiSessionViewModel(
         scope.launch {
             val r = getRouter() ?: return@launch
             val tid = getTaskId() ?: return@launch
-            try {
-                r.subtaskRouter.deletePendingSubtasks(tid)
-                loadSubtasksFromDb(r, tid)
-            } catch (e: Exception) {
-                logger.warn(e) { "Failed to cancel all pending" }
-                _subtasks.update { list ->
-                    list.map {
-                        if (it.status in listOf("NEW", "PENDING", "APPROVED")) it.copy(status = "SKIPPED") else it
-                    }
-                }
-            }
+            r.subtaskRouter.deletePendingSubtasks(tid)
+            loadSubtasksFromDb(r, tid)
         }
     }
 
     fun executeStep(index: Int) {
-        val subtask = _subtasks.value.getOrNull(index) ?: return
+        val subtask = stateManager.getSubtasks().getOrNull(index) ?: return
         if (subtask.status !in listOf("NEW", "PENDING", "APPROVED")) {
-            addSystemMessage("Step '${subtask.name}' is ${subtask.status}, cannot execute")
+            addSystemMessage("Step '${subtask.description}' is ${subtask.status}, cannot execute")
             return
         }
         scope.launch {
             val r = getRouter() ?: return@launch
             val tid = getTaskId() ?: return@launch
             try {
-                _executionStatus.value = "Executing: ${subtask.name}"
-                _subtasks.update { list ->
-                    list.map { if (it.id == subtask.id) it.copy(status = "RUNNING") else it }
-                }
+                _executionStatus.value = "Executing: ${subtask.description}"
+                stateManager.setSubtasks(
+                    stateManager.getSubtasks().map {
+                        if (it.id == subtask.id) it.copy(status = "RUNNING") else it
+                    }
+                )
                 val result = r.agentRouter.executeSubtaskStep(tid, subtask.id)
                 loadSubtasksFromDb(r, tid)
                 loadMessagesFromDb(r, tid)
@@ -466,9 +401,11 @@ class TuiSessionViewModel(
                 _executionStatus.value = "Idle"
             } catch (e: Exception) {
                 logger.error(e) { "Failed to execute step: ${subtask.id}" }
-                _subtasks.update { list ->
-                    list.map { if (it.id == subtask.id) it.copy(status = "FAILED", error = e.message) else it }
-                }
+                stateManager.setSubtasks(
+                    stateManager.getSubtasks().map {
+                        if (it.id == subtask.id) it.copy(status = "FAILED", errorMessage = e.message) else it
+                    }
+                )
                 _executionStatus.value = "Error"
                 addSystemMessage("Step execution failed: ${e.message}")
             }
@@ -546,16 +483,16 @@ class TuiSessionViewModel(
     }
 
     fun togglePause() {
-        val wasPaused = _isPaused.value
-        _isPaused.update { !it }
+        val wasPaused = stateManager.getIsPaused()
+        stateManager.setPaused(!wasPaused)
         // On resume, check for pending subtasks needing approval
         if (wasPaused) {
-            val subtasks = _subtasks.value
+            val subtasks = stateManager.getSubtasks()
             val nextPending = subtasks.indexOfFirst { it.status in listOf("NEW", "PENDING") }
             if (nextPending >= 0) {
                 setActiveTab(TuiTab.STEPS)
                 selectStep(nextPending)
-                addSystemMessage("Resumed. Next step awaiting approval: ${subtasks[nextPending].name}")
+                addSystemMessage("Resumed. Next step awaiting approval: ${subtasks[nextPending].description}")
             } else {
                 addSystemMessage("Resumed. No pending steps.")
             }
@@ -563,7 +500,7 @@ class TuiSessionViewModel(
     }
 
     fun selectStep(index: Int) {
-        _selectedStepIndex.value = index.coerceIn(0, (_subtasks.value.size - 1).coerceAtLeast(0))
+        _selectedStepIndex.value = index.coerceIn(0, (stateManager.getSubtasks().size - 1).coerceAtLeast(0))
     }
 
     fun selectStepUp() {
@@ -571,7 +508,7 @@ class TuiSessionViewModel(
     }
 
     fun selectStepDown() {
-        _selectedStepIndex.update { (it + 1).coerceAtMost((_subtasks.value.size - 1).coerceAtLeast(0)) }
+        _selectedStepIndex.update { (it + 1).coerceAtMost((stateManager.getSubtasks().size - 1).coerceAtLeast(0)) }
     }
 
     // =============================================
@@ -647,7 +584,7 @@ class TuiSessionViewModel(
      * (e.g. no providers configured, endpoints unreachable).
      */
     private fun getStaticModelList(): List<String> {
-        val providers = listOf("openai", "anthropic", "openrouter", "gemini", "ollama", "lmstudio", "custom_openai", "zai")
+        val providers = listOf("openai", "anthropic", "openrouter", "gemini", "ollama", "lmstudio", "generic_openai", "zai")
         val result = mutableListOf<String>()
         for (provider in providers) {
             val definitions = pl.jclab.refio.core.llm.ModelDefinitions.getProviderDefinitions(provider)
@@ -680,9 +617,7 @@ class TuiSessionViewModel(
             val newId = createNewTaskInDb(r)
             setTaskId(newId)
             mode.value = newMode
-            try {
-                r.configRouter.updateConfig("ui", "app", null, mapOf("selected_mode" to newMode))
-            } catch (_: Exception) {}
+            persistUiSetting("selected_mode", newMode)
             updateDebugInfo(newId, newMode)
             addSystemMessage("Mode switched to $newMode (new session)")
             return
@@ -698,7 +633,7 @@ class TuiSessionViewModel(
             val newId = createNewTaskInDb(r)
             setTaskId(newId)
             clearMessages()
-            _subtasks.value = emptyList()
+            stateManager.setSubtasks(emptyList())
             clearSteps()
             _activePlan.value = null
             _pendingPlanApproval.value = null
@@ -706,17 +641,11 @@ class TuiSessionViewModel(
         }
 
         mode.value = newMode
-
-        // Persist to config (same key as IntelliJ: ui.selected_mode)
-        try {
-            getRouter()?.configRouter?.updateConfig("ui", "app", null, mapOf("selected_mode" to newMode))
-        } catch (e: Exception) {
-            logger.debug(e) { "Failed to persist selected mode" }
-        }
+        persistUiSetting("selected_mode", newMode)
 
         // Clear plan/step state when switching to CHAT (no tools in chat mode)
         if (newMode == "CHAT") {
-            _subtasks.value = emptyList()
+            stateManager.setSubtasks(emptyList())
             _activePlan.value = null
             _pendingPlanApproval.value = null
         }
@@ -725,34 +654,25 @@ class TuiSessionViewModel(
         addSystemMessage("Mode switched to $newMode")
     }
 
+    private fun persistUiSetting(key: String, value: String) {
+        getRouter()?.configRouter?.updateConfig("ui", "app", null, mapOf(key to value))
+    }
+
     fun toggleThinking() {
-        _thinkingEnabled.update { !it }
-        // Persist to config
-        try {
-            getRouter()?.configRouter?.updateConfig("ui", "app", null, mapOf("thinking_enabled" to _thinkingEnabled.value.toString()))
-        } catch (e: Exception) {
-            logger.debug(e) { "Failed to persist thinking state" }
-        }
+        val next = !stateManager.thinkingEnabled.value
+        stateManager.setThinkingEnabled(next)
+        persistUiSetting("thinking_enabled", next.toString())
     }
 
     fun toggleNoEgress() {
-        _noEgressEnabled.update { !it }
-        // Persist to config
-        try {
-            getRouter()?.configRouter?.updateConfig("ui", "app", null, mapOf("no_egress_enabled" to _noEgressEnabled.value.toString()))
-        } catch (e: Exception) {
-            logger.debug(e) { "Failed to persist no-egress state" }
-        }
+        val next = !stateManager.noEgressEnabled.value
+        stateManager.setNoEgressEnabled(next)
+        persistUiSetting("no_egress_enabled", next.toString())
     }
 
     fun toggleExecutionMode() {
         _executionMode.update { if (it == "AUTO") "INTERACTIVE" else "AUTO" }
-        // Persist to config (same key as IntelliJ: ui.execution_mode)
-        try {
-            getRouter()?.configRouter?.updateConfig("ui", "app", null, mapOf("execution_mode" to _executionMode.value))
-        } catch (e: Exception) {
-            logger.debug(e) { "Failed to persist execution mode" }
-        }
+        persistUiSetting("execution_mode", _executionMode.value)
     }
 
     // =============================================
diff --git a/cli/src/main/kotlin/pl/jclab/refio/cli/tui/state/TuiState.kt b/cli/src/main/kotlin/pl/jclab/refio/cli/tui/state/TuiState.kt
index cbe58265..7fa88ceb 100644
--- a/cli/src/main/kotlin/pl/jclab/refio/cli/tui/state/TuiState.kt
+++ b/cli/src/main/kotlin/pl/jclab/refio/cli/tui/state/TuiState.kt
@@ -1,8 +1,13 @@
 package pl.jclab.refio.cli.tui.state
 
+import pl.jclab.refio.core.api.SubtaskResponse
+
 /**
  * Unified TUI state — all data needed to render the full UI.
  * Pure data classes, no Compose/Swing dependencies.
+ *
+ * Subtasks / plan use the core [SubtaskResponse] type directly — no TUI-specific
+ * projection (DTO de-duplication, 2026-04-16).
  */
 data class TuiState(
     val screen: TuiScreen = TuiScreen.MAIN,
@@ -10,9 +15,8 @@ data class TuiState(
     val messages: List<TuiChatMessage> = emptyList(),
     val isStreaming: Boolean = false,
     val agents: List<TuiAgentState> = emptyList(),
-    val steps: List<TuiStep> = emptyList(),
-    val subtasks: List<TuiSubtask> = emptyList(),
-    val activePlan: TuiPlan? = null,
+    val subtasks: List<SubtaskResponse> = emptyList(),
+    val activePlan: List<SubtaskResponse>? = null,
     val isPaused: Boolean = false,
     val pendingPlanApproval: TuiPlanApproval? = null,
     val selectedStepIndex: Int = 0,
@@ -22,7 +26,7 @@ data class TuiState(
     val debugInfo: TuiDebugInfo = TuiDebugInfo(),
     val pendingApprovals: List<TuiPendingApproval> = emptyList(),
     val pendingToolApproval: TuiToolApprovalRequest? = null,
-    val sessions: List<TuiSessionEntry> = emptyList(),
+    val sessions: List<pl.jclab.refio.core.api.TaskResponse> = emptyList(),
     val activeSessionId: String? = null,
     val selectedHistoryIndex: Int = 0,
     val historyFilter: String = "*", // *, CHAT, PLAN, AGENT
@@ -158,14 +162,6 @@ data class TuiAgentState(
     val costUsd: Double = 0.0
 )
 
-data class TuiStep(
-    val id: String,
-    val name: String,
-    val status: String,
-    val details: String = "",
-    val expanded: Boolean = false
-)
-
 data class TuiContextSection(
     val name: String,
     val category: String,
@@ -247,50 +243,10 @@ data class TuiToolApprovalRequest(
     val arguments: Map<String, Any>
 )
 
-data class TuiSessionEntry(
-    val id: String,
-    val name: String,
-    val mode: String,
-    val status: String,
-    val tokensIn: Int,
-    val tokensOut: Int,
-    val costUsd: Double,
-    val createdAt: Long,
-    val updatedAt: Long,
-    val pinned: Boolean = false
-)
-
-data class TuiSubtask(
-    val id: String,
-    val name: String,
-    val description: String = "",
-    val status: String = "NEW", // NEW, PENDING, APPROVED, RUNNING, COMPLETED, FAILED, SKIPPED
-    val toolName: String? = null,
-    val toolArgs: String? = null,
-    val result: String? = null,
-    val error: String? = null,
-    val tokensIn: Long = 0,
-    val tokensOut: Long = 0,
-    val costUsd: Double = 0.0,
-    val order: Int = 0,
-    val model: String? = null,
-    val provider: String? = null,
-    val startedAt: Long? = null,
-    val finishedAt: Long? = null,
-    val resultSummary: String? = null
-)
-
-data class TuiPlan(
-    val taskId: String,
-    val steps: List<TuiSubtask>,
-    val totalReadSteps: Int = 0,
-    val totalWriteSteps: Int = 0
-)
-
 data class TuiPlanApproval(
     val taskId: String,
-    val plan: TuiPlan,
-    val isVisible: Boolean = true
+    val steps: List<SubtaskResponse>,
+    val isVisible: Boolean = true,
 )
 
 data class TuiFileEntry(
diff --git a/cli/src/main/kotlin/pl/jclab/refio/cli/tui/state/TuiToolCallListener.kt b/cli/src/main/kotlin/pl/jclab/refio/cli/tui/state/TuiToolCallListener.kt
new file mode 100644
index 00000000..af4d93ae
--- /dev/null
+++ b/cli/src/main/kotlin/pl/jclab/refio/cli/tui/state/TuiToolCallListener.kt
@@ -0,0 +1,90 @@
+package pl.jclab.refio.cli.tui.state
+
+import kotlinx.coroutines.CoroutineScope
+import kotlinx.coroutines.flow.MutableStateFlow
+import kotlinx.coroutines.flow.update
+import pl.jclab.refio.core.db.ToolCallData
+import pl.jclab.refio.core.session.AbstractToolCallLifecycleListener
+
+/**
+ * [AbstractToolCallLifecycleListener] variant that renders tool-call lifecycle
+ * as [TuiChatMessage] entries in the TUI chat stream.
+ *
+ * Keeps TUI-specific concerns (role="tool" messages, `TOOL_CALL` message type,
+ * `Running ...` placeholder copy, success/failure metadata) out of the
+ * shared base class.
+ */
+class TuiToolCallListener(
+    scope: CoroutineScope,
+    private val messagesState: MutableStateFlow<List<TuiChatMessage>>,
+    private val onToolStarted: (String) -> Unit,
+    private val onReloadSubtasks: suspend () -> Unit,
+) : AbstractToolCallLifecycleListener(scope) {
+
+    override fun onCreateTempMessage(taskId: String, toolCall: ToolCallData): String {
+        onToolStarted(toolCall.name)
+
+        val tempId = "temp-${toolCall.id}"
+        val argsSummary = try {
+            val args = toolCall.arguments
+            if (args.length <= 120) args else "${args.take(120)}..."
+        } catch (_: Exception) { "" }
+
+        messagesState.update { messages ->
+            messages + TuiChatMessage(
+                id = tempId,
+                timestamp = System.currentTimeMillis(),
+                role = "tool",
+                content = "Running ${toolCall.name}...",
+                messageType = TuiMessageType.TOOL_CALL,
+                toolName = toolCall.name,
+                isStreaming = true,
+                metadata = mapOf("args" to argsSummary),
+            )
+        }
+        return tempId
+    }
+
+    override fun onUpdateTempMessage(
+        messageId: String,
+        toolCallId: String,
+        delta: String,
+        accumulated: String,
+    ) {
+        messagesState.update { messages ->
+            messages.map { msg ->
+                if (msg.id == messageId) {
+                    msg.copy(content = accumulated, isStreaming = true)
+                } else msg
+            }
+        }
+    }
+
+    override fun onFinalizeTempMessage(
+        messageId: String,
+        toolCall: ToolCallData,
+        result: String,
+        success: Boolean,
+    ) {
+        val resultSummary = if (result.isNotBlank()) {
+            val trimmed = result.trim()
+            if (trimmed.length <= 200) trimmed else "${trimmed.take(200)}..."
+        } else if (success) "Done" else "Failed"
+
+        messagesState.update { messages ->
+            messages.map { msg ->
+                if (msg.id == messageId) {
+                    msg.copy(
+                        content = resultSummary,
+                        isStreaming = false,
+                        metadata = msg.metadata + ("success" to success),
+                    )
+                } else msg
+            }
+        }
+    }
+
+    override suspend fun onAfterToolLifecycleEvent(taskId: String) {
+        onReloadSubtasks()
+    }
+}
diff --git a/cli/src/main/kotlin/pl/jclab/refio/cli/tui/state/TuiViewModel.kt b/cli/src/main/kotlin/pl/jclab/refio/cli/tui/state/TuiViewModel.kt
index 248e857c..07e76523 100644
--- a/cli/src/main/kotlin/pl/jclab/refio/cli/tui/state/TuiViewModel.kt
+++ b/cli/src/main/kotlin/pl/jclab/refio/cli/tui/state/TuiViewModel.kt
@@ -66,6 +66,18 @@ class TuiViewModel(
     // --- Infrastructure ---
     private var bootstrap: StandaloneCoreBootstrap? = null
     private var router: CoreApiRouter? = null
+    /**
+     * Core session facade — created after router is up in [initialize]. Sub-VMs read it through
+     * [getCoreSession] so they can observe [pl.jclab.refio.core.session.SessionStateManager] flows
+     * instead of keeping their own copies of execution state.
+     */
+    /**
+     * Shared session execution state. Created eagerly so sub-VMs can bind their observers
+     * before [CoreSessionService] is wired up in [initialize].
+     */
+    internal val sessionStateManager = pl.jclab.refio.core.session.SessionStateManager()
+    private var coreSession: pl.jclab.refio.core.session.CoreSessionService? = null
+    fun getCoreSession(): pl.jclab.refio.core.session.CoreSessionService? = coreSession
     var taskId: String? = null
         private set
     val agentEventBus = AgentEventBus()
@@ -86,7 +98,8 @@ class TuiViewModel(
         mode = _mode,
         model = _model,
         projectPath = projectPath,
-        projectId = projectId
+        projectId = projectId,
+        stateManager = sessionStateManager
     )
 
     // chat needs workflowListener but workflowListener needs chat._messages.
@@ -107,7 +120,6 @@ class TuiViewModel(
         agentColorIndex = 0,
         messagesState = chat._messages,
         streamingState = chat._isStreaming,
-        stepsState = session._stepsInternal,
         scope = scope,
         viewModel = this
     )
@@ -161,7 +173,6 @@ class TuiViewModel(
         chat._autocompleteCandidates.map { Unit },
         chat._autocompleteSelectedIndex.map { Unit },
         // Session sub-VM flows
-        session.steps.map { Unit },
         session.subtasks.map { Unit },
         session.activePlan.map { Unit },
         session.isPaused.map { Unit },
@@ -251,7 +262,6 @@ class TuiViewModel(
         sessionTokensIn = chat._messages.value.sumOf { it.tokensIn.toLong() },
         sessionTokensOut = chat._messages.value.sumOf { it.tokensOut.toLong() },
         // Session sub-VM
-        steps = session.steps.value,
         subtasks = session.subtasks.value,
         activePlan = session.activePlan.value,
         isPaused = session.isPaused.value,
@@ -318,6 +328,13 @@ class TuiViewModel(
             val r = boot.initialize()
             bootstrap = boot
             router = r
+            coreSession = pl.jclab.refio.core.session.CoreSessionServiceFactory.create(
+                projectRouter = r,
+                projectId = projectId,
+                projectPath = projectPath,
+                scope = scope,
+                stateManager = sessionStateManager,
+            )
             _isInitialized.value = true
 
             // Wire sub-VM callbacks
@@ -455,7 +472,7 @@ class TuiViewModel(
         session.onStreamChunk = { delta -> workflowListener.onStreamChunk(delta) }
         session.onStreamComplete = { response -> workflowListener.onStreamComplete(response) }
         session.clearMessages = { chat._messages.value = emptyList() }
-        session.clearSteps = { session._stepsInternal.value = emptyList() }
+        session.clearSteps = { /* no-op: steps removed, subtasks flow owns UI state */ }
         session.clearContextSections = { obs._contextSections.value = emptyList() }
         session.clearInputBuffer = { chat._inputBuffer.value = "" }
         session.setStreaming = { chat._isStreaming.value = it }
@@ -558,11 +575,9 @@ class TuiViewModel(
         if (tcJson != null && tcJson.isNotBlank()) return TuiMessageType.TOOL_CALL
         val meta = msg.metadata
         if (meta != null) {
-            try {
-                if (meta.contains("\"orchestrator_question\"")) return TuiMessageType.ORCHESTRATOR_QUESTION
-                if (meta.contains("\"execution_summary\"")) return TuiMessageType.EXECUTION_SUMMARY
-                if (meta.contains("\"plan\"")) return TuiMessageType.PLAN
-            } catch (_: Exception) {}
+            if (meta.contains("\"orchestrator_question\"")) return TuiMessageType.ORCHESTRATOR_QUESTION
+            if (meta.contains("\"execution_summary\"")) return TuiMessageType.EXECUTION_SUMMARY
+            if (meta.contains("\"plan\"")) return TuiMessageType.PLAN
         }
         return TuiMessageType.TEXT
     }
@@ -579,25 +594,7 @@ class TuiViewModel(
         try {
             val response = r.subtaskRouter.getSubtasks(tid)
             if (response.subtasks.isNotEmpty()) {
-                session.setSubtasks(response.subtasks.map { st ->
-                    TuiSubtask(
-                        id = st.id,
-                        name = st.description,
-                        description = st.description,
-                        status = st.status,
-                        toolName = st.kind,
-                        tokensIn = st.tokensIn.toLong(),
-                        tokensOut = st.tokensOut.toLong(),
-                        costUsd = st.costUsd,
-                        order = st.orderIndex,
-                        model = st.model,
-                        provider = st.provider,
-                        startedAt = st.startedAt,
-                        finishedAt = st.finishedAt,
-                        resultSummary = st.resultSummary,
-                        error = st.errorMessage
-                    )
-                })
+                session.setSubtasks(response.subtasks)
                 logger.info { "Loaded ${response.subtasks.size} subtasks from DB" }
             }
         } catch (e: Exception) {
@@ -962,8 +959,8 @@ class TuiViewModel(
 
     // --- Chat delegations ---
     fun sendMessage(input: String) = chat.sendMessage(input)
-    fun getSlashCommands() = chat.getSlashCommands()
-    fun processSlashCommands(text: String) = chat.processSlashCommands(text)
+    fun getSlashPrompts() = chat.getSlashPrompts()
+    fun processSlashPrompts(text: String) = chat.processSlashPrompts(text)
     fun updateInputBuffer(input: String) = chat.updateInputBuffer(input)
     fun moveCursorLeft() = chat.moveCursorLeft()
     fun moveCursorRight() = chat.moveCursorRight()
@@ -1026,7 +1023,7 @@ class TuiViewModel(
 
     // --- Session delegations ---
     fun updateExecutionStatus(status: String) = session.updateExecutionStatus(status)
-    fun setSubtasks(subtasks: List<TuiSubtask>) = session.setSubtasks(subtasks)
+    fun setSubtasks(subtasks: List<pl.jclab.refio.core.api.SubtaskResponse>) = session.setSubtasks(subtasks)
     fun updateSubtaskStatus(subtaskId: String, status: String, error: String? = null) = session.updateSubtaskStatus(subtaskId, status, error)
     fun cycleMode() = session.cycleMode()
     fun toggleThinking() = session.toggleThinking()
diff --git a/cli/src/main/kotlin/pl/jclab/refio/cli/tui/state/TuiWorkflowListener.kt b/cli/src/main/kotlin/pl/jclab/refio/cli/tui/state/TuiWorkflowListener.kt
index aa87c0da..c9897bf7 100644
--- a/cli/src/main/kotlin/pl/jclab/refio/cli/tui/state/TuiWorkflowListener.kt
+++ b/cli/src/main/kotlin/pl/jclab/refio/cli/tui/state/TuiWorkflowListener.kt
@@ -28,7 +28,6 @@ class TuiWorkflowListener(
     private val agentColorIndex: Int,
     private val messagesState: MutableStateFlow<List<TuiChatMessage>>,
     private val streamingState: MutableStateFlow<Boolean>,
-    private val stepsState: MutableStateFlow<List<TuiStep>>,
     private val scope: CoroutineScope,
     private var viewModel: TuiViewModel? = null
 ) : WorkflowEventListener {
@@ -191,21 +190,11 @@ class TuiWorkflowListener(
 
     override fun onToolStarted(toolName: String) {
         viewModel?.updateExecutionStatus("Tool: $toolName")
-        stepsState.update { steps ->
-            val existing = steps.indexOfLast { it.id == "tool-$toolName" && it.status == "RUNNING" }
-            if (existing >= 0) steps // already tracking this tool
-            else steps + TuiStep(
-                id = "tool-$toolName-${System.currentTimeMillis()}",
-                name = toolName,
-                status = "RUNNING"
-            )
-        }
-        // Also update subtask status if we're tracking subtasks
+        // Update subtask execution status if subtasks are being tracked
         viewModel?.let { vm ->
             val subtasks = vm.stateFlow.value.subtasks
             val running = subtasks.indexOfFirst { it.status == "RUNNING" }
             if (running >= 0) {
-                // Currently executing subtask — update execution status with step count
                 val total = subtasks.size
                 val completed = subtasks.count { it.status in listOf("COMPLETED", "SKIPPED", "FAILED") }
                 vm.updateExecutionStatus("Executing step ${completed + 1}/$total: $toolName")
@@ -214,27 +203,10 @@ class TuiWorkflowListener(
     }
 
     override fun onStepStarted(subtaskId: String) {
-        stepsState.update { steps ->
-            val existing = steps.indexOfLast { it.id == subtaskId }
-            if (existing >= 0) {
-                steps.toMutableList().also { it[existing] = it[existing].copy(status = "RUNNING") }
-            } else {
-                steps + TuiStep(id = subtaskId, name = subtaskId, status = "RUNNING")
-            }
-        }
-        // Update corresponding subtask status
         viewModel?.updateSubtaskStatus(subtaskId, "RUNNING")
     }
 
     override fun onIntentCompleted(intent: WorkflowIntent, result: IntentResult) {
-        // Mark last RUNNING step as completed
-        stepsState.update { steps ->
-            val running = steps.indexOfLast { it.status == "RUNNING" }
-            if (running >= 0) {
-                steps.toMutableList().also { it[running] = it[running].copy(status = "COMPLETED") }
-            } else steps
-        }
-
         // In INTERACTIVE mode, auto-switch to Steps tab for next approval
         val vm = viewModel ?: return
         val state = vm.stateFlow.value
@@ -244,7 +216,7 @@ class TuiWorkflowListener(
             if (nextPending >= 0) {
                 vm.setActiveTab(TuiTab.STEPS)
                 vm.selectStep(nextPending)
-                vm.addSystemMessage("Step completed. Next step awaiting approval: ${subtasks[nextPending].name}")
+                vm.addSystemMessage("Step completed. Next step awaiting approval: ${subtasks[nextPending].description}")
             }
         }
     }
@@ -252,10 +224,6 @@ class TuiWorkflowListener(
     override fun onWorkflowComplete(result: IntentResult) {
         viewModel?.updateExecutionStatus("Idle")
         streamingState.value = false
-        // Mark all remaining RUNNING steps as completed
-        stepsState.update { steps ->
-            steps.map { if (it.status == "RUNNING") it.copy(status = "COMPLETED") else it }
-        }
         // Extract per-message metrics from result and update last assistant message
         val costs = extractCosts(result)
         if (costs != null) {
diff --git a/cli/src/main/kotlin/pl/jclab/refio/cli/tui/views/TuiStepsView.kt b/cli/src/main/kotlin/pl/jclab/refio/cli/tui/views/TuiStepsView.kt
index aef97579..8526043b 100644
--- a/cli/src/main/kotlin/pl/jclab/refio/cli/tui/views/TuiStepsView.kt
+++ b/cli/src/main/kotlin/pl/jclab/refio/cli/tui/views/TuiStepsView.kt
@@ -17,151 +17,128 @@ object TuiStepsView {
         // Plan approval overlay takes priority
         val planApproval = state.pendingPlanApproval
         if (planApproval != null && planApproval.isVisible) {
-            renderPlanApproval(buf, planApproval.plan, width, height)
+            renderPlanApproval(buf, planApproval.steps, width, height)
             return buf
         }
 
-        // Show subtasks if available, otherwise fall back to legacy steps
         val subtasks = state.subtasks
-        val steps = state.steps
-
-        if (subtasks.isEmpty() && steps.isEmpty()) {
+        if (subtasks.isEmpty()) {
             buf.addLine(TuiColors.muted("No active steps."))
             buf.addLine(TuiColors.muted("Start a task in PLAN or AGENT mode."))
             return buf
         }
 
-        // Subtask-based view (PLAN/AGENT mode)
-        if (subtasks.isNotEmpty()) {
-            val completed = subtasks.count { it.status in listOf("COMPLETED", "SKIPPED") }
-            val failed = subtasks.count { it.status == "FAILED" }
-            val running = subtasks.count { it.status == "RUNNING" }
-
-            buf.addLine(TuiColors.highlight("Steps (${completed}/${subtasks.size} done" +
-                    (if (failed > 0) ", ${failed} failed" else "") +
-                    (if (running > 0) ", ${running} running" else "") +
-                    ")") +
-                    (if (state.isPaused) TuiColors.statusPending(" [PAUSED]") else "")
-            )
-            buf.addLine()
-
-            for ((index, subtask) in subtasks.withIndex()) {
-                if (buf.lineCount >= height - 4) break
-
-                val isSelected = index == state.selectedStepIndex
-                val prefix = if (isSelected) "> " else "  "
-                val orderNum = "${index + 1}".padStart(2)
-                val statusIcon = statusIcon(subtask.status)
-                val statusStyle = statusStyle(subtask.status)
-
-                // Data preservation badge for completed steps
-                val dataBadge = if (subtask.status == "COMPLETED" && subtask.result != null && subtask.resultSummary != null) {
-                    val resultLen = subtask.result.length
-                    val summaryLen = subtask.resultSummary.length
-                    if (resultLen <= 32_000 && resultLen > summaryLen) {
-                        TuiColors.statusSuccess(" [FULL]")
-                    } else if (resultLen > 32_000) {
-                        TuiColors.statusPending(" [SUMM]")
-                    } else ""
-                } else ""
+        val completed = subtasks.count { it.status in listOf("COMPLETED", "SKIPPED") }
+        val failed = subtasks.count { it.status == "FAILED" }
+        val running = subtasks.count { it.status == "RUNNING" }
 
-                // Model + duration suffix for completed/failed steps
-                val modelSuffix = if (subtask.status in listOf("COMPLETED", "FAILED")) {
-                    val parts = mutableListOf<String>()
-                    if (subtask.model != null) parts.add(subtask.model)
-                    if (subtask.startedAt != null && subtask.finishedAt != null) {
-                        val durationSec = (subtask.finishedAt - subtask.startedAt) / 1000.0
-                        parts.add("${String.format("%.1f", durationSec)}s")
-                    }
-                    if (parts.isNotEmpty()) TuiColors.muted(" [${parts.joinToString(" ")}]") else ""
-                } else ""
+        buf.addLine(TuiColors.highlight("Steps (${completed}/${subtasks.size} done" +
+                (if (failed > 0) ", ${failed} failed" else "") +
+                (if (running > 0) ", ${running} running" else "") +
+                ")") +
+                (if (state.isPaused) TuiColors.statusPending(" [PAUSED]") else "")
+        )
+        buf.addLine()
 
-                val line = "$prefix[$orderNum] $statusIcon ${statusStyle(subtask.status.padEnd(9))} ${subtask.name}$dataBadge$modelSuffix"
-                buf.addLine(line)
-
-                // Show expanded details for selected step
-                if (isSelected && subtask.status in listOf("COMPLETED", "FAILED", "RUNNING")) {
-                    if (subtask.description.isNotBlank()) {
-                        buf.addLine(TuiColors.muted("       Desc: ${subtask.description.take(120)}"))
-                    }
-                    if (subtask.provider != null) {
-                        buf.addLine(TuiColors.muted("       Provider: ${subtask.provider}"))
-                    }
-                    if (subtask.model != null) {
-                        buf.addLine(TuiColors.muted("       Model: ${subtask.model}"))
-                    }
-                    if (subtask.tokensIn > 0 || subtask.tokensOut > 0 || subtask.costUsd > 0.0) {
-                        val totalTokens = subtask.tokensIn + subtask.tokensOut
-                        buf.addLine(TuiColors.muted(
-                            "       Tokens: ${subtask.tokensIn} in / ${subtask.tokensOut} out ($totalTokens total)"
-                        ))
-                        buf.addLine(TuiColors.muted(
-                            "       Cost: $${String.format("%.6f", subtask.costUsd)}"
-                        ))
-                    }
-                    if (subtask.startedAt != null && subtask.finishedAt != null) {
-                        val durationSec = (subtask.finishedAt - subtask.startedAt) / 1000.0
-                        buf.addLine(TuiColors.muted("       Duration: ${String.format("%.1f", durationSec)}s"))
-                    }
-                    if (subtask.toolName != null) {
-                        buf.addLine(TuiColors.muted("       Tool: ${subtask.toolName}"))
-                    }
-                    if (subtask.toolArgs != null) {
-                        buf.addLine(TuiColors.muted("       Args: ${subtask.toolArgs.take(150)}"))
-                    }
-                    if (subtask.resultSummary != null) {
-                        val summary = subtask.resultSummary.take(200)
-                        buf.addLine(TuiColors.muted("       Result: $summary"))
-                    }
-                }
+        for ((index, subtask) in subtasks.withIndex()) {
+            if (buf.lineCount >= height - 4) break
 
-                // Show error for failed steps (always visible, not just when selected)
-                if (subtask.status == "FAILED" && subtask.error != null) {
-                    buf.addLine(TuiColors.statusFailed("       Error: ${subtask.error}"))
+            val isSelected = index == state.selectedStepIndex
+            val prefix = if (isSelected) "> " else "  "
+            val orderNum = "${index + 1}".padStart(2)
+            val statusIcon = statusIcon(subtask.status)
+            val statusStyle = statusStyle(subtask.status)
+
+            // Data preservation badge for completed steps
+            val dataBadge = if (subtask.status == "COMPLETED" && subtask.result != null && subtask.resultSummary != null) {
+                val resultLen = subtask.result!!.length
+                val summaryLen = subtask.resultSummary!!.length
+                if (resultLen <= 32_000 && resultLen > summaryLen) {
+                    TuiColors.statusSuccess(" [FULL]")
+                } else if (resultLen > 32_000) {
+                    TuiColors.statusPending(" [SUMM]")
+                } else ""
+            } else ""
+
+            // Model + duration suffix for completed/failed steps
+            val modelSuffix = if (subtask.status in listOf("COMPLETED", "FAILED")) {
+                val parts = mutableListOf<String>()
+                subtask.model?.let { parts.add(it) }
+                if (subtask.startedAt != null && subtask.finishedAt != null) {
+                    val durationSec = (subtask.finishedAt!! - subtask.startedAt!!) / 1000.0
+                    parts.add("${String.format("%.1f", durationSec)}s")
                 }
-            }
-
-            // Toolbar hint
-            buf.addLine()
-            buf.addLine(TuiColors.muted("[a]pprove [s]kip [d]elete [u/j]move [p]ause [r]eplan [R]un [C]ancel-all"))
-            return buf
-        }
-
-        // Legacy step view (from workflow listener)
-        buf.addLine(TuiColors.highlight("Steps (${steps.size})"))
-        buf.addLine()
+                if (parts.isNotEmpty()) TuiColors.muted(" [${parts.joinToString(" ")}]") else ""
+            } else ""
 
-        for (step in steps) {
-            if (buf.lineCount >= height - 2) break
-            val statusStyle = statusStyle(step.status)
-            buf.addLine("${statusStyle("[${step.status}]")} ${step.name}")
+            val line = "$prefix[$orderNum] $statusIcon ${statusStyle(subtask.status.padEnd(9))} ${subtask.description}$dataBadge$modelSuffix"
+            buf.addLine(line)
 
-            if (step.expanded && step.details.isNotBlank()) {
-                for (line in step.details.lines()) {
-                    if (buf.lineCount >= height - 1) break
-                    buf.addLine(TuiColors.muted("  $line"))
+            // Show expanded details for selected step
+            if (isSelected && subtask.status in listOf("COMPLETED", "FAILED", "RUNNING")) {
+                if (subtask.description.isNotBlank()) {
+                    buf.addLine(TuiColors.muted("       Desc: ${subtask.description.take(120)}"))
+                }
+                subtask.provider?.let {
+                    buf.addLine(TuiColors.muted("       Provider: $it"))
+                }
+                subtask.model?.let {
+                    buf.addLine(TuiColors.muted("       Model: $it"))
+                }
+                if (subtask.tokensIn > 0 || subtask.tokensOut > 0 || subtask.costUsd > 0.0) {
+                    val totalTokens = subtask.tokensIn + subtask.tokensOut
+                    buf.addLine(TuiColors.muted(
+                        "       Tokens: ${subtask.tokensIn} in / ${subtask.tokensOut} out ($totalTokens total)"
+                    ))
+                    buf.addLine(TuiColors.muted(
+                        "       Cost: $${String.format("%.6f", subtask.costUsd)}"
+                    ))
+                }
+                if (subtask.startedAt != null && subtask.finishedAt != null) {
+                    val durationSec = (subtask.finishedAt!! - subtask.startedAt!!) / 1000.0
+                    buf.addLine(TuiColors.muted("       Duration: ${String.format("%.1f", durationSec)}s"))
+                }
+                if (subtask.kind.isNotEmpty()) {
+                    buf.addLine(TuiColors.muted("       Tool: ${subtask.kind}"))
                 }
+                subtask.paramsJson?.let {
+                    buf.addLine(TuiColors.muted("       Args: ${it.take(150)}"))
+                }
+                subtask.resultSummary?.let {
+                    buf.addLine(TuiColors.muted("       Result: ${it.take(200)}"))
+                }
+            }
+
+            // Show error for failed steps (always visible, not just when selected)
+            if (subtask.status == "FAILED" && subtask.errorMessage != null) {
+                buf.addLine(TuiColors.statusFailed("       Error: ${subtask.errorMessage}"))
             }
         }
 
+        // Toolbar hint
+        buf.addLine()
+        buf.addLine(TuiColors.muted("[a]pprove [s]kip [d]elete [u/j]move [p]ause [r]eplan [R]un [C]ancel-all"))
         return buf
     }
 
     private fun renderPlanApproval(
         buf: TuiRenderBuffer,
-        plan: pl.jclab.refio.cli.tui.state.TuiPlan,
+        steps: List<pl.jclab.refio.core.api.SubtaskResponse>,
         width: Int,
         height: Int
     ) {
+        val readSteps = steps.count { it.kind.startsWith("read") || it.kind == "grep_search" || it.kind == "file_search" }
+        val writeSteps = steps.size - readSteps
+
         buf.addLine(TuiColors.highlight("=== Plan Approval ==="))
         buf.addLine()
-        buf.addLine("${plan.steps.size} steps: ${plan.totalReadSteps} read-only, ${plan.totalWriteSteps} write")
+        buf.addLine("${steps.size} steps: $readSteps read-only, $writeSteps write")
         buf.addLine()
 
-        for ((index, step) in plan.steps.withIndex()) {
+        for ((index, step) in steps.withIndex()) {
             if (buf.lineCount >= height - 4) break
-            buf.addLine("  ${index + 1}. ${step.name}" +
-                    (if (step.toolName != null) " (${step.toolName})" else ""))
+            val toolSuffix = if (step.kind.isNotEmpty()) " (${step.kind})" else ""
+            buf.addLine("  ${index + 1}. ${step.description}$toolSuffix")
         }
 
         buf.addLine()
diff --git a/cli/src/test/kotlin/pl/jclab/refio/cli/tui/TuiAppTest.kt b/cli/src/test/kotlin/pl/jclab/refio/cli/tui/TuiAppTest.kt
index 7c5c1e55..3325210e 100644
--- a/cli/src/test/kotlin/pl/jclab/refio/cli/tui/TuiAppTest.kt
+++ b/cli/src/test/kotlin/pl/jclab/refio/cli/tui/TuiAppTest.kt
@@ -68,9 +68,9 @@ class TuiAppTest {
                 TuiChatMessage("1", 1000L, "user", "Hello"),
                 TuiChatMessage("2", 2000L, "assistant", "Plan created")
             ),
-            steps = listOf(
-                TuiStep("s1", "Analyze code", "RUNNING"),
-                TuiStep("s2", "Write tests", "PENDING")
+            subtasks = listOf(
+                pl.jclab.refio.cli.tui.state.subtaskFixture(id = "s1", description = "Analyze code", status = "RUNNING"),
+                pl.jclab.refio.cli.tui.state.subtaskFixture(id = "s2", description = "Write tests", status = "PENDING"),
             )
         )
         renderer.render(state)
diff --git a/cli/src/test/kotlin/pl/jclab/refio/cli/tui/components/TuiPlanApprovalTest.kt b/cli/src/test/kotlin/pl/jclab/refio/cli/tui/components/TuiPlanApprovalTest.kt
index 7a6a3abb..0344a747 100644
--- a/cli/src/test/kotlin/pl/jclab/refio/cli/tui/components/TuiPlanApprovalTest.kt
+++ b/cli/src/test/kotlin/pl/jclab/refio/cli/tui/components/TuiPlanApprovalTest.kt
@@ -17,21 +17,16 @@ class TuiPlanApprovalTest {
     private val handler = TuiInputHandler(terminal)
     private val viewModel = mockk<TuiViewModel>(relaxed = true)
 
-    private val plan = TuiPlan(
-        taskId = "task1",
-        steps = listOf(
-            TuiSubtask(id = "s1", name = "Read project", status = "NEW", toolName = "read_file"),
-            TuiSubtask(id = "s2", name = "Edit code", status = "NEW", toolName = "code_editing")
-        ),
-        totalReadSteps = 1,
-        totalWriteSteps = 1
+    private val planSteps = listOf(
+        subtaskFixture(id = "s1", description = "Read project", status = "NEW", kind = "read_file"),
+        subtaskFixture(id = "s2", description = "Edit code", status = "NEW", kind = "code_editing"),
     )
 
     @BeforeEach
     fun setup() {
         every { viewModel.stateFlow } returns mockk {
             every { value } returns TuiState(
-                pendingPlanApproval = TuiPlanApproval(taskId = "task1", plan = plan)
+                pendingPlanApproval = TuiPlanApproval(taskId = "task1", steps = planSteps)
             )
         }
     }
@@ -70,7 +65,7 @@ class TuiPlanApprovalTest {
     @Test
     fun `renderToBuffer should show plan approval overlay`() {
         val state = TuiState(
-            pendingPlanApproval = TuiPlanApproval(taskId = "task1", plan = plan)
+            pendingPlanApproval = TuiPlanApproval(taskId = "task1", steps = planSteps)
         )
         val buf = TuiStepsView.renderToBuffer(state, 80, 20)
         val lines = buf.getLines().joinToString("\n")
diff --git a/cli/src/test/kotlin/pl/jclab/refio/cli/tui/input/TuiCompleterSlashCommandsTest.kt b/cli/src/test/kotlin/pl/jclab/refio/cli/tui/input/TuiCompleterSlashCommandsTest.kt
deleted file mode 100644
index 9163c523..00000000
--- a/cli/src/test/kotlin/pl/jclab/refio/cli/tui/input/TuiCompleterSlashCommandsTest.kt
+++ /dev/null
@@ -1,41 +0,0 @@
-package pl.jclab.refio.cli.tui.input
-
-import org.junit.jupiter.api.Test
-import org.junit.jupiter.api.Assertions.*
-
-class TuiCompleterSlashCommandsTest {
-
-    @Test
-    fun `slash command names should be available from SlashCommand BUILTINS`() {
-        val builtins = pl.jclab.refio.api.models.SlashCommand.BUILTINS
-        assertTrue(builtins.isNotEmpty(), "BUILTINS should not be empty")
-        assertTrue(builtins.any { it.name == "explain" }, "Should have /explain")
-        assertTrue(builtins.any { it.name == "fix" }, "Should have /fix")
-        assertTrue(builtins.any { it.name == "test" }, "Should have /test")
-        assertTrue(builtins.any { it.name == "refactor" }, "Should have /refactor")
-    }
-
-    @Test
-    fun `slash commands should have templates`() {
-        val builtins = pl.jclab.refio.api.models.SlashCommand.BUILTINS
-        for (cmd in builtins) {
-            assertTrue(cmd.template.isNotBlank(), "Command ${cmd.name} should have a template")
-        }
-    }
-
-    @Test
-    fun `slash commands should have descriptions`() {
-        val builtins = pl.jclab.refio.api.models.SlashCommand.BUILTINS
-        for (cmd in builtins) {
-            assertTrue(cmd.description.isNotBlank(), "Command ${cmd.name} should have a description")
-        }
-    }
-
-    @Test
-    fun `slash command names should not have leading slash`() {
-        val builtins = pl.jclab.refio.api.models.SlashCommand.BUILTINS
-        for (cmd in builtins) {
-            assertFalse(cmd.name.startsWith("/"), "Command name '${cmd.name}' should not start with /")
-        }
-    }
-}
diff --git a/cli/src/test/kotlin/pl/jclab/refio/cli/tui/input/TuiCompleterSlashPromptsTest.kt b/cli/src/test/kotlin/pl/jclab/refio/cli/tui/input/TuiCompleterSlashPromptsTest.kt
new file mode 100644
index 00000000..ae580bfc
--- /dev/null
+++ b/cli/src/test/kotlin/pl/jclab/refio/cli/tui/input/TuiCompleterSlashPromptsTest.kt
@@ -0,0 +1,41 @@
+package pl.jclab.refio.cli.tui.input
+
+import org.junit.jupiter.api.Test
+import org.junit.jupiter.api.Assertions.*
+
+class TuiCompleterSlashPromptsTest {
+
+    @Test
+    fun `slash prompt names should be available from SlashPrompt BUILTINS`() {
+        val builtins = pl.jclab.refio.api.models.SlashPrompt.BUILTINS
+        assertTrue(builtins.isNotEmpty(), "BUILTINS should not be empty")
+        assertTrue(builtins.any { it.name == "explain" }, "Should have /explain")
+        assertTrue(builtins.any { it.name == "fix" }, "Should have /fix")
+        assertTrue(builtins.any { it.name == "test" }, "Should have /test")
+        assertTrue(builtins.any { it.name == "refactor" }, "Should have /refactor")
+    }
+
+    @Test
+    fun `slash prompts should have templates`() {
+        val builtins = pl.jclab.refio.api.models.SlashPrompt.BUILTINS
+        for (sp in builtins) {
+            assertTrue(sp.template.isNotBlank(), "Prompt ${sp.name} should have a template")
+        }
+    }
+
+    @Test
+    fun `slash prompts should have descriptions`() {
+        val builtins = pl.jclab.refio.api.models.SlashPrompt.BUILTINS
+        for (sp in builtins) {
+            assertTrue(sp.description.isNotBlank(), "Prompt ${sp.name} should have a description")
+        }
+    }
+
+    @Test
+    fun `slash prompt names should not have leading slash`() {
+        val builtins = pl.jclab.refio.api.models.SlashPrompt.BUILTINS
+        for (sp in builtins) {
+            assertFalse(sp.name.startsWith("/"), "Prompt name '${sp.name}' should not start with /")
+        }
+    }
+}
diff --git a/cli/src/test/kotlin/pl/jclab/refio/cli/tui/input/TuiCompleterTest.kt b/cli/src/test/kotlin/pl/jclab/refio/cli/tui/input/TuiCompleterTest.kt
index 73284316..dd0940c8 100644
--- a/cli/src/test/kotlin/pl/jclab/refio/cli/tui/input/TuiCompleterTest.kt
+++ b/cli/src/test/kotlin/pl/jclab/refio/cli/tui/input/TuiCompleterTest.kt
@@ -23,7 +23,7 @@ class TuiCompleterTest {
     @Test
     fun `should complete slash commands from prompt templates`() {
         val candidates = complete("/")
-        // Only prompt templates (SlashCommand.BUILTINS), no system commands
+        // Only prompt templates (SlashPrompt.BUILTINS), no system commands
         assertFalse(candidates.contains("/help"), "System commands should not appear")
         assertFalse(candidates.contains("/quit"), "System commands should not appear")
     }
diff --git a/cli/src/test/kotlin/pl/jclab/refio/cli/tui/input/TuiInputHandlerSlashCommandsTest.kt b/cli/src/test/kotlin/pl/jclab/refio/cli/tui/input/TuiInputHandlerSlashPromptsTest.kt
similarity index 70%
rename from cli/src/test/kotlin/pl/jclab/refio/cli/tui/input/TuiInputHandlerSlashCommandsTest.kt
rename to cli/src/test/kotlin/pl/jclab/refio/cli/tui/input/TuiInputHandlerSlashPromptsTest.kt
index ef6ea312..7544021b 100644
--- a/cli/src/test/kotlin/pl/jclab/refio/cli/tui/input/TuiInputHandlerSlashCommandsTest.kt
+++ b/cli/src/test/kotlin/pl/jclab/refio/cli/tui/input/TuiInputHandlerSlashPromptsTest.kt
@@ -5,11 +5,11 @@ import com.github.ajalt.mordant.terminal.TerminalRecorder
 import io.mockk.*
 import org.junit.jupiter.api.BeforeEach
 import org.junit.jupiter.api.Test
-import pl.jclab.refio.api.models.SlashCommand
+import pl.jclab.refio.api.models.SlashPrompt
 import pl.jclab.refio.cli.tui.state.TuiState
 import pl.jclab.refio.cli.tui.state.TuiViewModel
 
-class TuiInputHandlerSlashCommandsTest {
+class TuiInputHandlerSlashPromptsTest {
 
     private val terminal = Terminal(terminalInterface = TerminalRecorder())
     private val handler = TuiInputHandler(terminal)
@@ -20,29 +20,29 @@ class TuiInputHandlerSlashCommandsTest {
         every { viewModel.stateFlow } returns mockk {
             every { value } returns TuiState()
         }
-        every { viewModel.getSlashCommands() } returns SlashCommand.BUILTINS
+        every { viewModel.getSlashPrompts() } returns SlashPrompt.BUILTINS
     }
 
     @Test
-    fun `handleCommand should pass slash command explain to sendMessage`() {
-        // Slash commands (prompt templates) are processed inline in sendMessage(),
+    fun `handleCommand should pass slash prompt explain to sendMessage`() {
+        // Slash prompts (prompt templates) are expanded inline in sendMessage(),
         // not intercepted by handleCommand(). handleCommand always returns false.
         val result = handler.handleCommand("/explain some code here", viewModel)
-        assert(!result) { "Slash commands should not be intercepted by handleCommand" }
+        assert(!result) { "Slash prompts should not be intercepted by handleCommand" }
         verify(exactly = 0) { viewModel.sendMessage(any()) }
     }
 
     @Test
-    fun `handleCommand should pass slash command fix to sendMessage`() {
+    fun `handleCommand should pass slash prompt fix to sendMessage`() {
         val result = handler.handleCommand("/fix this bug", viewModel)
-        assert(!result) { "Slash commands should not be intercepted by handleCommand" }
+        assert(!result) { "Slash prompts should not be intercepted by handleCommand" }
         verify(exactly = 0) { viewModel.sendMessage(any()) }
     }
 
     @Test
-    fun `handleCommand should not handle unknown slash command`() {
-        val result = handler.handleCommand("/nonexistent-command", viewModel)
-        assert(!result) { "Should not handle unknown command" }
+    fun `handleCommand should not handle unknown slash prompt`() {
+        val result = handler.handleCommand("/nonexistent-prompt", viewModel)
+        assert(!result) { "Should not handle unknown prompt" }
         verify(exactly = 0) { viewModel.sendMessage(any()) }
     }
 
diff --git a/cli/src/test/kotlin/pl/jclab/refio/cli/tui/rendering/TuiRendererTest.kt b/cli/src/test/kotlin/pl/jclab/refio/cli/tui/rendering/TuiRendererTest.kt
index 4b421e74..6302dfa0 100644
--- a/cli/src/test/kotlin/pl/jclab/refio/cli/tui/rendering/TuiRendererTest.kt
+++ b/cli/src/test/kotlin/pl/jclab/refio/cli/tui/rendering/TuiRendererTest.kt
@@ -82,7 +82,7 @@ class TuiRendererTest {
             messages = listOf(
                 TuiChatMessage("1", System.currentTimeMillis(), "user", "Hello")
             ),
-            steps = listOf(TuiStep("s1", "Analyze code", "RUNNING"))
+            subtasks = listOf(pl.jclab.refio.cli.tui.state.subtaskFixture(id = "s1", description = "Analyze code", status = "RUNNING"))
         )
         renderer.render(state)
     }
diff --git a/cli/src/test/kotlin/pl/jclab/refio/cli/tui/screens/TuiHistoryScreenInteractiveTest.kt b/cli/src/test/kotlin/pl/jclab/refio/cli/tui/screens/TuiHistoryScreenInteractiveTest.kt
index ad596bb8..6811f3bb 100644
--- a/cli/src/test/kotlin/pl/jclab/refio/cli/tui/screens/TuiHistoryScreenInteractiveTest.kt
+++ b/cli/src/test/kotlin/pl/jclab/refio/cli/tui/screens/TuiHistoryScreenInteractiveTest.kt
@@ -17,9 +17,9 @@ class TuiHistoryScreenInteractiveTest {
     private val viewModel = mockk<TuiViewModel>(relaxed = true)
 
     private val sessions = listOf(
-        TuiSessionEntry("id1", "Session 1", "CHAT", "SUCCESS", 100, 50, 0.01, 1000L, 2000L),
-        TuiSessionEntry("id2", "Session 2", "AGENT", "RUNNING", 200, 100, 0.02, 1100L, 2100L, pinned = true),
-        TuiSessionEntry("id3", "Session 3", "PLAN", "SUCCESS", 50, 25, 0.005, 900L, 1900L)
+        taskResponseFixture(id = "id1", name = "Session 1", mode = "CHAT", tokensIn = 100, tokensOut = 50, costUsd = 0.01, createdAt = 1000L, updatedAt = 2000L),
+        taskResponseFixture(id = "id2", name = "Session 2", mode = "AGENT", status = "RUNNING", tokensIn = 200, tokensOut = 100, costUsd = 0.02, createdAt = 1100L, updatedAt = 2100L, pinned = true),
+        taskResponseFixture(id = "id3", name = "Session 3", mode = "PLAN", tokensIn = 50, tokensOut = 25, costUsd = 0.005, createdAt = 900L, updatedAt = 1900L),
     )
 
     @BeforeEach
diff --git a/cli/src/test/kotlin/pl/jclab/refio/cli/tui/state/SubtaskResponseFixtures.kt b/cli/src/test/kotlin/pl/jclab/refio/cli/tui/state/SubtaskResponseFixtures.kt
new file mode 100644
index 00000000..cf31391b
--- /dev/null
+++ b/cli/src/test/kotlin/pl/jclab/refio/cli/tui/state/SubtaskResponseFixtures.kt
@@ -0,0 +1,54 @@
+package pl.jclab.refio.cli.tui.state
+
+import pl.jclab.refio.core.api.SubtaskResponse
+
+/**
+ * Test helper — builds a [SubtaskResponse] with sensible defaults so tests only
+ * specify the fields they care about. Used by TuiStepsView tests and related
+ * fixtures after TUI DTO de-duplication (TuiSubtask removed).
+ */
+fun subtaskFixture(
+    id: String,
+    description: String,
+    status: String = "NEW",
+    kind: String = "",
+    tokensIn: Int = 0,
+    tokensOut: Int = 0,
+    costUsd: Double = 0.0,
+    model: String? = null,
+    provider: String? = null,
+    startedAt: Long? = null,
+    finishedAt: Long? = null,
+    resultSummary: String? = null,
+    errorMessage: String? = null,
+    result: String? = null,
+    orderIndex: Int = 0,
+    paramsJson: String? = null,
+): SubtaskResponse = SubtaskResponse(
+    id = id,
+    taskId = "t-fixture",
+    orderIndex = orderIndex,
+    kind = kind,
+    status = status,
+    approvalStatus = "NONE",
+    requiresApproval = false,
+    approvedByUser = false,
+    description = description,
+    paramsJson = paramsJson,
+    stepPlanJson = null,
+    summary = null,
+    result = result,
+    startedAt = startedAt,
+    finishedAt = finishedAt,
+    errorCode = null,
+    errorMessage = errorMessage,
+    tokensIn = tokensIn,
+    tokensOut = tokensOut,
+    costUsd = costUsd,
+    latencyMs = 0,
+    model = model,
+    provider = provider,
+    resultSummary = resultSummary,
+    createdAt = 0L,
+    updatedAt = 0L,
+)
diff --git a/cli/src/test/kotlin/pl/jclab/refio/cli/tui/state/TaskResponseFixtures.kt b/cli/src/test/kotlin/pl/jclab/refio/cli/tui/state/TaskResponseFixtures.kt
new file mode 100644
index 00000000..58038b5b
--- /dev/null
+++ b/cli/src/test/kotlin/pl/jclab/refio/cli/tui/state/TaskResponseFixtures.kt
@@ -0,0 +1,35 @@
+package pl.jclab.refio.cli.tui.state
+
+import pl.jclab.refio.core.api.TaskResponse
+
+/**
+ * Test fixture helper — covers the TaskResponse defaults that TUI tests don't care about
+ * (readOnly/executionMode/uiState/…). Keeps call sites short while letting tests override
+ * only the fields they actually assert on.
+ */
+fun taskResponseFixture(
+    id: String = "task-1",
+    name: String = "Session",
+    mode: String = "CHAT",
+    status: String = "SUCCESS",
+    tokensIn: Int = 0,
+    tokensOut: Int = 0,
+    costUsd: Double = 0.0,
+    createdAt: Long = 0,
+    updatedAt: Long = 0,
+    pinned: Boolean = false,
+): TaskResponse = TaskResponse(
+    id = id,
+    name = name,
+    mode = mode,
+    status = status,
+    readOnly = false,
+    pinned = pinned,
+    executionMode = "AUTO",
+    uiState = null,
+    createdAt = createdAt,
+    updatedAt = updatedAt,
+    tokensIn = tokensIn,
+    tokensOut = tokensOut,
+    costUsd = costUsd,
+)
diff --git a/cli/src/test/kotlin/pl/jclab/refio/cli/tui/state/TuiViewModelSessionTest.kt b/cli/src/test/kotlin/pl/jclab/refio/cli/tui/state/TuiViewModelSessionTest.kt
index 339ab9c4..6d50dd87 100644
--- a/cli/src/test/kotlin/pl/jclab/refio/cli/tui/state/TuiViewModelSessionTest.kt
+++ b/cli/src/test/kotlin/pl/jclab/refio/cli/tui/state/TuiViewModelSessionTest.kt
@@ -10,8 +10,8 @@ import org.junit.jupiter.api.Assertions.*
 class TuiViewModelSessionTest {
 
     @Test
-    fun `TuiSessionEntry should have default pinned false`() {
-        val session = TuiSessionEntry(
+    fun `TaskResponse fixture should have default pinned false`() {
+        val session = taskResponseFixture(
             id = "s1", name = "Test", mode = "CHAT", status = "SUCCESS",
             tokensIn = 100, tokensOut = 50, costUsd = 0.01,
             createdAt = 1000L, updatedAt = 2000L
@@ -40,10 +40,10 @@ class TuiViewModelSessionTest {
     @Test
     fun `session filtering by mode should work`() {
         val sessions = listOf(
-            TuiSessionEntry("id1", "S1", "CHAT", "SUCCESS", 0, 0, 0.0, 0, 0),
-            TuiSessionEntry("id2", "S2", "AGENT", "SUCCESS", 0, 0, 0.0, 0, 0),
-            TuiSessionEntry("id3", "S3", "CHAT", "SUCCESS", 0, 0, 0.0, 0, 0),
-            TuiSessionEntry("id4", "S4", "PLAN", "SUCCESS", 0, 0, 0.0, 0, 0)
+            taskResponseFixture(id = "id1", name = "S1", mode = "CHAT"),
+            taskResponseFixture(id = "id2", name = "S2", mode = "AGENT"),
+            taskResponseFixture(id = "id3", name = "S3", mode = "CHAT"),
+            taskResponseFixture(id = "id4", name = "S4", mode = "PLAN"),
         )
         val chatOnly = sessions.filter { it.mode == "CHAT" }
         assertEquals(2, chatOnly.size)
@@ -55,9 +55,9 @@ class TuiViewModelSessionTest {
     @Test
     fun `session sorting by updatedAt should work`() {
         val sessions = listOf(
-            TuiSessionEntry("id1", "Old", "CHAT", "SUCCESS", 0, 0, 0.0, 0, 1000L),
-            TuiSessionEntry("id2", "New", "CHAT", "SUCCESS", 0, 0, 0.0, 0, 3000L),
-            TuiSessionEntry("id3", "Mid", "CHAT", "SUCCESS", 0, 0, 0.0, 0, 2000L)
+            taskResponseFixture(id = "id1", name = "Old", updatedAt = 1000L),
+            taskResponseFixture(id = "id2", name = "New", updatedAt = 3000L),
+            taskResponseFixture(id = "id3", name = "Mid", updatedAt = 2000L),
         )
         val sorted = sessions.sortedByDescending { it.updatedAt }
         assertEquals("New", sorted[0].name)
diff --git a/cli/src/test/kotlin/pl/jclab/refio/cli/tui/state/TuiViewModelSubtaskTest.kt b/cli/src/test/kotlin/pl/jclab/refio/cli/tui/state/TuiViewModelSubtaskTest.kt
index 121d8e17..75d734ee 100644
--- a/cli/src/test/kotlin/pl/jclab/refio/cli/tui/state/TuiViewModelSubtaskTest.kt
+++ b/cli/src/test/kotlin/pl/jclab/refio/cli/tui/state/TuiViewModelSubtaskTest.kt
@@ -6,57 +6,36 @@ import org.junit.jupiter.api.Assertions.*
 /**
  * Tests subtask state management in TuiState (pure state tests).
  * ViewModel methods that call router are tested via integration tests.
+ *
+ * Uses core [pl.jclab.refio.core.api.SubtaskResponse] directly — TUI no longer has
+ * its own subtask/plan DTOs (DTO de-duplication, 2026-04-16).
  */
 class TuiViewModelSubtaskTest {
 
     @Test
-    fun `TuiSubtask should have correct default values`() {
-        val subtask = TuiSubtask(id = "s1", name = "test step")
-        assertEquals("NEW", subtask.status)
-        assertEquals("", subtask.description)
-        assertNull(subtask.toolName)
-        assertNull(subtask.error)
-        assertEquals(0L, subtask.tokensIn)
-        assertEquals(0L, subtask.tokensOut)
-        assertEquals(0.0, subtask.costUsd)
-    }
-
-    @Test
-    fun `TuiSubtask copy should update status`() {
-        val original = TuiSubtask(id = "s1", name = "test", status = "NEW")
+    fun `SubtaskResponse copy should update status`() {
+        val original = subtaskFixture(id = "s1", description = "test", status = "NEW")
         val updated = original.copy(status = "RUNNING")
         assertEquals("NEW", original.status)
         assertEquals("RUNNING", updated.status)
     }
 
     @Test
-    fun `TuiSubtask copy should update error`() {
-        val original = TuiSubtask(id = "s1", name = "test", status = "RUNNING")
-        val failed = original.copy(status = "FAILED", error = "Connection timeout")
+    fun `SubtaskResponse copy should update errorMessage`() {
+        val original = subtaskFixture(id = "s1", description = "test", status = "RUNNING")
+        val failed = original.copy(status = "FAILED", errorMessage = "Connection timeout")
         assertEquals("FAILED", failed.status)
-        assertEquals("Connection timeout", failed.error)
+        assertEquals("Connection timeout", failed.errorMessage)
     }
 
     @Test
-    fun `TuiPlan should track read and write steps`() {
-        val plan = TuiPlan(
-            taskId = "t1",
-            steps = listOf(
-                TuiSubtask(id = "s1", name = "read", toolName = "read_file"),
-                TuiSubtask(id = "s2", name = "edit", toolName = "code_editing")
-            ),
-            totalReadSteps = 1,
-            totalWriteSteps = 1
+    fun `TuiPlanApproval should carry plan steps`() {
+        val steps = listOf(
+            subtaskFixture(id = "s1", description = "read", kind = "read_file"),
+            subtaskFixture(id = "s2", description = "edit", kind = "code_editing"),
         )
-        assertEquals(2, plan.steps.size)
-        assertEquals(1, plan.totalReadSteps)
-        assertEquals(1, plan.totalWriteSteps)
-    }
-
-    @Test
-    fun `TuiPlanApproval should be visible by default`() {
-        val plan = TuiPlan(taskId = "t1", steps = emptyList())
-        val approval = TuiPlanApproval(taskId = "t1", plan = plan)
+        val approval = TuiPlanApproval(taskId = "t1", steps = steps)
+        assertEquals(2, approval.steps.size)
         assertTrue(approval.isVisible)
     }
 
@@ -73,11 +52,11 @@ class TuiViewModelSubtaskTest {
     }
 
     @Test
-    fun `subtask list operations should work correctly`() {
+    fun `subtask list counts should work correctly`() {
         val subtasks = listOf(
-            TuiSubtask(id = "s1", name = "a", status = "COMPLETED"),
-            TuiSubtask(id = "s2", name = "b", status = "RUNNING"),
-            TuiSubtask(id = "s3", name = "c", status = "NEW")
+            subtaskFixture(id = "s1", description = "a", status = "COMPLETED"),
+            subtaskFixture(id = "s2", description = "b", status = "RUNNING"),
+            subtaskFixture(id = "s3", description = "c", status = "NEW"),
         )
         val completed = subtasks.count { it.status == "COMPLETED" }
         val running = subtasks.count { it.status == "RUNNING" }
@@ -88,7 +67,7 @@ class TuiViewModelSubtaskTest {
     @Test
     fun `subtask status update should be immutable`() {
         val list = listOf(
-            TuiSubtask(id = "s1", name = "test", status = "NEW")
+            subtaskFixture(id = "s1", description = "test", status = "NEW"),
         )
         val updated = list.map {
             if (it.id == "s1") it.copy(status = "APPROVED") else it
diff --git a/cli/src/test/kotlin/pl/jclab/refio/cli/tui/state/TuiWorkflowListenerTest.kt b/cli/src/test/kotlin/pl/jclab/refio/cli/tui/state/TuiWorkflowListenerTest.kt
index cb33f782..5b7338e3 100644
--- a/cli/src/test/kotlin/pl/jclab/refio/cli/tui/state/TuiWorkflowListenerTest.kt
+++ b/cli/src/test/kotlin/pl/jclab/refio/cli/tui/state/TuiWorkflowListenerTest.kt
@@ -13,7 +13,6 @@ class TuiWorkflowListenerTest {
 
     private val messagesState = MutableStateFlow<List<TuiChatMessage>>(emptyList())
     private val streamingState = MutableStateFlow(false)
-    private val stepsState = MutableStateFlow<List<TuiStep>>(emptyList())
     private val scope = CoroutineScope(Dispatchers.Unconfined)
 
     private val listener = TuiWorkflowListener(
@@ -22,7 +21,6 @@ class TuiWorkflowListenerTest {
         agentColorIndex = 0,
         messagesState = messagesState,
         streamingState = streamingState,
-        stepsState = stepsState,
         scope = scope
     )
 
diff --git a/cli/src/test/kotlin/pl/jclab/refio/cli/tui/views/TuiStepsViewInteractiveTest.kt b/cli/src/test/kotlin/pl/jclab/refio/cli/tui/views/TuiStepsViewInteractiveTest.kt
index 6ce94b53..5700adcb 100644
--- a/cli/src/test/kotlin/pl/jclab/refio/cli/tui/views/TuiStepsViewInteractiveTest.kt
+++ b/cli/src/test/kotlin/pl/jclab/refio/cli/tui/views/TuiStepsViewInteractiveTest.kt
@@ -17,9 +17,9 @@ class TuiStepsViewInteractiveTest {
     private val viewModel = mockk<TuiViewModel>(relaxed = true)
 
     private val sampleSubtasks = listOf(
-        TuiSubtask(id = "s1", name = "Read files", status = "COMPLETED", tokensIn = 100, tokensOut = 50, costUsd = 0.001),
-        TuiSubtask(id = "s2", name = "Edit code", status = "PENDING"),
-        TuiSubtask(id = "s3", name = "Verify", status = "NEW")
+        subtaskFixture(id = "s1", description = "Read files", status = "COMPLETED", tokensIn = 100, tokensOut = 50, costUsd = 0.001),
+        subtaskFixture(id = "s2", description = "Edit code", status = "PENDING"),
+        subtaskFixture(id = "s3", description = "Verify", status = "NEW"),
     )
 
     @BeforeEach
diff --git a/cli/src/test/kotlin/pl/jclab/refio/cli/tui/views/TuiStepsViewTest.kt b/cli/src/test/kotlin/pl/jclab/refio/cli/tui/views/TuiStepsViewTest.kt
index 49aa93f9..6100b6b9 100644
--- a/cli/src/test/kotlin/pl/jclab/refio/cli/tui/views/TuiStepsViewTest.kt
+++ b/cli/src/test/kotlin/pl/jclab/refio/cli/tui/views/TuiStepsViewTest.kt
@@ -3,7 +3,8 @@ package pl.jclab.refio.cli.tui.views
 import com.github.ajalt.mordant.terminal.Terminal
 import com.github.ajalt.mordant.terminal.TerminalRecorder
 import org.junit.jupiter.api.Test
-import pl.jclab.refio.cli.tui.state.*
+import pl.jclab.refio.cli.tui.state.TuiState
+import pl.jclab.refio.cli.tui.state.subtaskFixture
 
 class TuiStepsViewTest {
 
@@ -16,40 +17,29 @@ class TuiStepsViewTest {
     }
 
     @Test
-    fun `should render steps with various statuses`() {
+    fun `should render subtasks with various statuses`() {
         val state = TuiState(
-            steps = listOf(
-                TuiStep(id = "1", name = "Analyze code", status = "COMPLETED"),
-                TuiStep(id = "2", name = "Write tests", status = "RUNNING"),
-                TuiStep(id = "3", name = "Deploy", status = "PENDING"),
-                TuiStep(id = "4", name = "Broken step", status = "FAILED")
+            subtasks = listOf(
+                subtaskFixture(id = "1", description = "Analyze code", status = "COMPLETED"),
+                subtaskFixture(id = "2", description = "Write tests", status = "RUNNING"),
+                subtaskFixture(id = "3", description = "Deploy", status = "PENDING"),
+                subtaskFixture(id = "4", description = "Broken step", status = "FAILED"),
             )
         )
         TuiStepsView.render(terminal, state, 20)
     }
 
     @Test
-    fun `should render expanded step with details`() {
+    fun `should render expanded subtask with details`() {
         val state = TuiState(
-            steps = listOf(
-                TuiStep(
-                    id = "1", name = "Read files", status = "COMPLETED",
-                    details = "Read 5 files, total 1200 lines", expanded = true
-                )
-            )
-        )
-        TuiStepsView.render(terminal, state, 20)
-    }
-
-    @Test
-    fun `should not show details for collapsed step`() {
-        val state = TuiState(
-            steps = listOf(
-                TuiStep(
-                    id = "1", name = "Read files", status = "COMPLETED",
-                    details = "Some details", expanded = false
-                )
-            )
+            subtasks = listOf(
+                subtaskFixture(
+                    id = "1", description = "Read files", status = "COMPLETED",
+                    resultSummary = "Read 5 files, total 1200 lines",
+                    tokensIn = 100, tokensOut = 50,
+                ),
+            ),
+            selectedStepIndex = 0,
         )
         TuiStepsView.render(terminal, state, 20)
     }
@@ -57,15 +47,15 @@ class TuiStepsViewTest {
     @Test
     fun `should render NEW status`() {
         val state = TuiState(
-            steps = listOf(TuiStep(id = "1", name = "New task", status = "NEW"))
+            subtasks = listOf(subtaskFixture(id = "1", description = "New task", status = "NEW"))
         )
         TuiStepsView.render(terminal, state, 20)
     }
 
     @Test
-    fun `should render OK status`() {
+    fun `should render SKIPPED status`() {
         val state = TuiState(
-            steps = listOf(TuiStep(id = "1", name = "Done task", status = "OK"))
+            subtasks = listOf(subtaskFixture(id = "1", description = "Skipped task", status = "SKIPPED"))
         )
         TuiStepsView.render(terminal, state, 20)
     }
diff --git a/core/build.gradle.kts b/core/build.gradle.kts
index bd089f7e..bfd861a6 100644
--- a/core/build.gradle.kts
+++ b/core/build.gradle.kts
@@ -118,4 +118,20 @@ tasks {
             csv.required.set(false)
         }
     }
+
+    jacocoTestCoverageVerification {
+        dependsOn(test)
+        violationRules {
+            rule {
+                limit {
+                    counter = "INSTRUCTION"
+                    minimum = "0.35".toBigDecimal()
+                }
+            }
+        }
+    }
+
+    check {
+        dependsOn(jacocoTestCoverageVerification)
+    }
 }
diff --git a/core/src/main/kotlin/pl/jclab/refio/api/models/SlashCommand.kt b/core/src/main/kotlin/pl/jclab/refio/api/models/SlashPrompt.kt
similarity index 98%
rename from core/src/main/kotlin/pl/jclab/refio/api/models/SlashCommand.kt
rename to core/src/main/kotlin/pl/jclab/refio/api/models/SlashPrompt.kt
index 6cf537eb..05f9edae 100644
--- a/core/src/main/kotlin/pl/jclab/refio/api/models/SlashCommand.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/api/models/SlashPrompt.kt
@@ -1,12 +1,12 @@
 package pl.jclab.refio.api.models
 
 /**
- * Slash command definition.
- * Stored in PromptsTable with kind = 'command'.
+ * Slash prompt definition - reusable prompt template invoked via `/name` in chat input.
+ * Stored in PromptsTable with type = SLASH_PROMPT.
  */
-data class SlashCommand(
+data class SlashPrompt(
     val id: String,
-    val name: String,              // Command name (without /)
+    val name: String,              // Prompt name (without /)
     val description: String,       // Shown in autocomplete
     val template: String,          // Prompt template
     val variables: List<String> = emptyList(),  // Template variables
@@ -187,13 +187,13 @@ Then adapt your questions based on their answer, always asking one at a time.
         """.trimIndent()
 
         /**
-         * Built-in commands organized by category.
+         * Built-in slash prompts organized by category.
          */
         val BUILTINS = listOf(
             // =====================================================================
             // UNDERSTANDING
             // =====================================================================
-            SlashCommand(
+            SlashPrompt(
                 id = "explain",
                 name = "explain",
                 description = "Explain what this code does, how it works, and what to watch out for",
@@ -249,7 +249,7 @@ Then adapt your questions based on their answer, always asking one at a time.
             // =====================================================================
             // IMPROVEMENT
             // =====================================================================
-            SlashCommand(
+            SlashPrompt(
                 id = "refactor",
                 name = "refactor",
                 description = "Suggest focused refactoring with concrete safe changes",
@@ -300,7 +300,7 @@ Then adapt your questions based on their answer, always asking one at a time.
                 showInEditor = true
             ),
 
-            SlashCommand(
+            SlashPrompt(
                 id = "simplify",
                 name = "simplify",
                 description = "Simplify code while preserving behavior",
@@ -343,7 +343,7 @@ Then adapt your questions based on their answer, always asking one at a time.
                 showInEditor = true
             ),
 
-            SlashCommand(
+            SlashPrompt(
                 id = "optimize",
                 name = "optimize",
                 description = "Analyze performance and propose practical optimizations",
@@ -392,7 +392,7 @@ Then adapt your questions based on their answer, always asking one at a time.
             // =====================================================================
             // TESTING
             // =====================================================================
-            SlashCommand(
+            SlashPrompt(
                 id = "test",
                 name = "test",
                 description = "Generate high-value unit tests",
@@ -438,7 +438,7 @@ Then adapt your questions based on their answer, always asking one at a time.
                 showInEditor = true
             ),
 
-            SlashCommand(
+            SlashPrompt(
                 id = "test-integration",
                 name = "test-integration",
                 description = "Generate integration tests for real component collaboration",
@@ -477,7 +477,7 @@ Then adapt your questions based on their answer, always asking one at a time.
                 isBuiltin = true
             ),
 
-            SlashCommand(
+            SlashPrompt(
                 id = "test-edge-cases",
                 name = "test-edge-cases",
                 description = "Generate tests focused on edge cases and boundaries",
@@ -513,7 +513,7 @@ Then adapt your questions based on their answer, always asking one at a time.
             // =====================================================================
             // FIXING
             // =====================================================================
-            SlashCommand(
+            SlashPrompt(
                 id = "fix",
                 name = "fix",
                 description = "Fix a bug with root cause analysis and regression check",
@@ -555,7 +555,7 @@ Then adapt your questions based on their answer, always asking one at a time.
             // =====================================================================
             // DOCUMENTATION
             // =====================================================================
-            SlashCommand(
+            SlashPrompt(
                 id = "document",
                 name = "document",
                 description = "Add useful KDoc and non-obvious inline documentation",
@@ -585,7 +585,7 @@ Then adapt your questions based on their answer, always asking one at a time.
             // =====================================================================
             // TRANSLATION
             // =====================================================================
-            SlashCommand(
+            SlashPrompt(
                 id = "translate-comments",
                 name = "translate-comments",
                 description = "Translate comments and docs to professional English",
@@ -609,7 +609,7 @@ Then adapt your questions based on their answer, always asking one at a time.
                 isBuiltin = true
             ),
 
-            SlashCommand(
+            SlashPrompt(
                 id = "translate-messages",
                 name = "translate-messages",
                 description = "Translate user-facing messages and logs to English",
@@ -639,7 +639,7 @@ Then adapt your questions based on their answer, always asking one at a time.
                 isBuiltin = true
             ),
 
-            SlashCommand(
+            SlashPrompt(
                 id = "translate-all",
                 name = "translate-all",
                 description = "Translate all human-readable text in code to English",
@@ -672,7 +672,7 @@ Then adapt your questions based on their answer, always asking one at a time.
             // =====================================================================
             // ENHANCEMENT
             // =====================================================================
-            SlashCommand(
+            SlashPrompt(
                 id = "add-logging",
                 name = "add-logging",
                 description = "Add useful structured logging without noise",
@@ -705,7 +705,7 @@ Then adapt your questions based on their answer, always asking one at a time.
                 isBuiltin = true
             ),
 
-            SlashCommand(
+            SlashPrompt(
                 id = "add-error-handling",
                 name = "add-error-handling",
                 description = "Add meaningful error handling at real failure points",
@@ -738,7 +738,7 @@ Then adapt your questions based on their answer, always asking one at a time.
                 isBuiltin = true
             ),
 
-            SlashCommand(
+            SlashPrompt(
                 id = "add-validation",
                 name = "add-validation",
                 description = "Add input validation at real system boundaries",
@@ -775,7 +775,7 @@ Then adapt your questions based on their answer, always asking one at a time.
             // =====================================================================
             // REFACTORING
             // =====================================================================
-            SlashCommand(
+            SlashPrompt(
                 id = "extract-method",
                 name = "extract-method",
                 description = "Extract selected logic into a well-named method",
@@ -808,7 +808,7 @@ Then adapt your questions based on their answer, always asking one at a time.
             // =====================================================================
             // SECURITY
             // =====================================================================
-            SlashCommand(
+            SlashPrompt(
                 id = "security-review",
                 name = "security-review",
                 description = "Review code for realistic security issues with severity",
@@ -855,7 +855,7 @@ Then adapt your questions based on their answer, always asking one at a time.
                 showInEditor = true
             ),
 
-            SlashCommand(
+            SlashPrompt(
                 id = "threat-model",
                 name = "threat-model",
                 description = "Perform a lightweight STRIDE threat model",
@@ -905,7 +905,7 @@ Then adapt your questions based on their answer, always asking one at a time.
                 isBuiltin = true
             ),
 
-            SlashCommand(
+            SlashPrompt(
                 id = "security-fix",
                 name = "security-fix",
                 description = "Fix visible security issues with safe-by-default changes",
@@ -943,7 +943,7 @@ Then adapt your questions based on their answer, always asking one at a time.
             // =====================================================================
             // ANALYSIS
             // =====================================================================
-            SlashCommand(
+            SlashPrompt(
                 id = "analyze",
                 name = "analyze",
                 description = "Perform a deep code quality analysis",
@@ -988,7 +988,7 @@ Then adapt your questions based on their answer, always asking one at a time.
                 isBuiltin = true
             ),
 
-            SlashCommand(
+            SlashPrompt(
                 id = "architecture",
                 name = "architecture",
                 description = "Review the architecture role, boundaries, and coupling",
@@ -1027,7 +1027,7 @@ Then adapt your questions based on their answer, always asking one at a time.
                 isBuiltin = true
             ),
 
-            SlashCommand(
+            SlashPrompt(
                 id = "implementation-analysis",
                 name = "implementation-analysis",
                 description = "Analyze a topic and prepare an implementation-ready markdown document",
@@ -1095,7 +1095,7 @@ Then adapt your questions based on their answer, always asking one at a time.
                 isBuiltin = true
             ),
 
-            SlashCommand(
+            SlashPrompt(
                 id = "code-review",
                 name = "code-review",
                 description = "Review code like a pull request",
@@ -1134,7 +1134,7 @@ Then adapt your questions based on their answer, always asking one at a time.
                 showInEditor = true
             ),
 
-            SlashCommand(
+            SlashPrompt(
                 id = "dependencies",
                 name = "dependencies",
                 description = "Analyze dependencies, hidden coupling, and decoupling options",
@@ -1174,7 +1174,7 @@ Then adapt your questions based on their answer, always asking one at a time.
             // =====================================================================
             // IMPLEMENTATION
             // =====================================================================
-            SlashCommand(
+            SlashPrompt(
                 id = "implement",
                 name = "implement",
                 description = "Implement a requested change with plan, code, and verification",
@@ -1224,7 +1224,7 @@ Then adapt your questions based on their answer, always asking one at a time.
                 isBuiltin = true
             ),
 
-            SlashCommand(
+            SlashPrompt(
                 id = "migrate",
                 name = "migrate",
                 description = "Migrate code to a new API, pattern, or framework version",
@@ -1266,7 +1266,7 @@ Then adapt your questions based on their answer, always asking one at a time.
             // =====================================================================
             // UI
             // =====================================================================
-            SlashCommand(
+            SlashPrompt(
                 id = "ui-review",
                 name = "ui-review",
                 description = "Review UI for usability, layout, and maintainability",
@@ -1302,7 +1302,7 @@ Then adapt your questions based on their answer, always asking one at a time.
                 isBuiltin = true
             ),
 
-            SlashCommand(
+            SlashPrompt(
                 id = "ui-improve",
                 name = "ui-improve",
                 description = "Improve a UI component with concrete code changes",
@@ -1337,7 +1337,7 @@ Then adapt your questions based on their answer, always asking one at a time.
             // =====================================================================
             // CLEANUP
             // =====================================================================
-            SlashCommand(
+            SlashPrompt(
                 id = "clean",
                 name = "clean",
                 description = "Clean up code without changing behavior",
@@ -1371,7 +1371,7 @@ Then adapt your questions based on their answer, always asking one at a time.
                 isBuiltin = true
             ),
 
-            SlashCommand(
+            SlashPrompt(
                 id = "decompose",
                 name = "decompose",
                 description = "Split a large file into smaller focused parts",
@@ -1417,7 +1417,7 @@ Then adapt your questions based on their answer, always asking one at a time.
             // =====================================================================
             // CREATION
             // =====================================================================
-            SlashCommand(
+            SlashPrompt(
                 id = "create-agent",
                 name = "create-agent",
                 description = "Create a custom AI subagent through guided conversation",
diff --git a/core/src/main/kotlin/pl/jclab/refio/api/models/TaskPlanModels.kt b/core/src/main/kotlin/pl/jclab/refio/api/models/TaskPlanModels.kt
deleted file mode 100644
index 26c07c71..00000000
--- a/core/src/main/kotlin/pl/jclab/refio/api/models/TaskPlanModels.kt
+++ /dev/null
@@ -1,65 +0,0 @@
-package pl.jclab.refio.api.models
-
-import com.google.gson.annotations.SerializedName
-
-/**
- * Sub‑task DTO used by the UI to render the Steps Queue.
- * Contains only fields required for read‑only display.
- * Mirrors Python SubtaskResponse from agent/core/schema/schemas.py:164
- */
-data class SubtaskDto(
-    @SerializedName("id")
-    val id: String,
-    @SerializedName("task_id")
-    val taskId: String,
-    @SerializedName("order_index")
-    val orderIndex: Int,
-    @SerializedName("kind")
-    val kind: String,
-    @SerializedName("status")
-    val status: String,
-    @SerializedName("approval_status")
-    val approvalStatus: String = "not_required",
-    @SerializedName("requires_approval")
-    val requiresApproval: Boolean = false,
-    @SerializedName("approved_by_user")
-    val approvedByUser: Boolean = false,
-    @SerializedName("description")
-    val description: String? = null,
-    @SerializedName("params_json")
-    val paramsJson: String? = null,
-    @SerializedName("step_plan_json")
-    val stepPlanJson: String? = null,
-    @SerializedName("summary")
-    val summary: String? = null,
-    @SerializedName("result")
-    val result: String? = null,
-    @SerializedName("started_at")
-    val startedAt: Long? = null,
-    @SerializedName("finished_at")
-    val finishedAt: Long? = null,
-    @SerializedName("error_code")
-    val errorCode: String? = null,
-    @SerializedName("error_message")
-    val errorMessage: String? = null,
-    @SerializedName("tokens_in")
-    val tokensIn: Int? = null,
-    @SerializedName("tokens_out")
-    val tokensOut: Int? = null,
-    @SerializedName("cost_usd")
-    val costUsd: Double? = null,
-    @SerializedName("latency_ms")
-    val latencyMs: Int? = null,
-    @SerializedName("model")
-    val model: String? = null,
-    @SerializedName("provider")
-    val provider: String? = null,
-    @SerializedName("result_summary")
-    val resultSummary: String? = null,
-    @SerializedName("created_at")
-    val createdAt: Long? = null,
-    @SerializedName("updated_at")
-    val updatedAt: Long? = null,
-    @SerializedName("completed_at")
-    val completedAt: Long? = null
-)
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/agents/events/AgentEvent.kt b/core/src/main/kotlin/pl/jclab/refio/core/agents/events/AgentEvent.kt
index 8acf2cce..9272ca02 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/agents/events/AgentEvent.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/agents/events/AgentEvent.kt
@@ -153,7 +153,13 @@ sealed interface AgentEvent {
         override val correlationId: String,
         val delta: String,
         val accumulated: String,
-        val isComplete: Boolean
+        val isComplete: Boolean,
+        // Per-turn identity for routing to a per-agent streaming bubble. runId is unique per
+        // subagent invocation (even when sourceAgentId is shared with the parent), depth lets
+        // the renderer indent, and agentName populates ChatMessage.agentName on persistence.
+        val runId: String? = null,
+        val depth: Int = 0,
+        val agentName: String? = null,
     ) : AgentEvent
 
     // ── TURN LIFECYCLE — per-iteration events for Session Trace panel ──
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/agents/orchestration/OrchestrationDispatcher.kt b/core/src/main/kotlin/pl/jclab/refio/core/agents/orchestration/OrchestrationDispatcher.kt
new file mode 100644
index 00000000..090aee66
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/agents/orchestration/OrchestrationDispatcher.kt
@@ -0,0 +1,343 @@
+package pl.jclab.refio.core.agents.orchestration
+
+import kotlinx.coroutines.sync.Mutex
+import kotlinx.coroutines.sync.withLock
+import pl.jclab.refio.api.models.ContextReference
+import pl.jclab.refio.api.models.MultiAgentStrategy
+import pl.jclab.refio.core.agents.AgentResult
+import pl.jclab.refio.core.agents.AgentSpec
+import pl.jclab.refio.core.agents.MultiAgentRunner
+import pl.jclab.refio.core.api.CreateTaskRequest
+import pl.jclab.refio.core.api.LEGACY_PROJECT_ID
+import pl.jclab.refio.core.api.LEGACY_PROJECT_PATH
+import pl.jclab.refio.core.api.StreamCallback
+import pl.jclab.refio.core.api.TaskResponse
+import pl.jclab.refio.core.api.TurnProfileOverrides
+import pl.jclab.refio.core.api.TurnRequest
+import pl.jclab.refio.core.api.TurnRunProfile
+import pl.jclab.refio.core.db.ExecutionMode
+import pl.jclab.refio.core.db.MessageRole
+import pl.jclab.refio.core.db.TaskMode
+import pl.jclab.refio.core.db.repositories.ChatMessageRepository
+import pl.jclab.refio.core.logging.dualLogger
+import pl.jclab.refio.core.services.ConfigService
+import pl.jclab.refio.core.services.TurnResult
+import pl.jclab.refio.core.subagents.SubagentRouter
+import pl.jclab.refio.core.subagents.models.SubagentDefinition
+
+private val logger = dualLogger("OrchestrationDispatcher")
+
+/**
+ * Routes a user turn to a multi-agent flow selected by [MultiAgentStrategy].
+ *
+ * - [MultiAgentStrategy.ORCHESTRATOR] runs a single turn with the `multi-agent-coordinator`
+ *   subagent's system prompt + tool whitelist. The LLM decides dynamically which subagents
+ *   to spawn via `invoke_subagent`.
+ * - [MultiAgentStrategy.PARALLEL] asks [TaskDecomposer] for independent specs, runs them
+ *   concurrently via [MultiAgentRunner], then [ResultMerger] collapses N outputs to one.
+ * - [MultiAgentStrategy.PIPELINE] decomposes into a linear chain where each stage consumes
+ *   the previous stage's output via the `{{prev.output}}` placeholder. Returns the last
+ *   stage's response as-is.
+ *
+ * Each sub-agent run emits an assistant message to the parent chat with `agentName` populated,
+ * so [pl.jclab.refio.ui.components.chat.bubble.ChatMessageBubbleRouter] renders a per-agent
+ * bubble header. The sub-agent's internal turn-loop messages land in an isolated child task
+ * so they don't pollute the main conversation.
+ */
+class OrchestrationDispatcher(
+    private val configService: ConfigService,
+    private val subagentRouter: SubagentRouter,
+    private val multiAgentRunner: MultiAgentRunner,
+    private val chatMessageRepository: ChatMessageRepository,
+    private val taskDecomposer: TaskDecomposer,
+    private val resultMerger: ResultMerger,
+    private val createTaskFn: (CreateTaskRequest) -> TaskResponse,
+    private val runTurnFn: suspend (TurnRequest, StreamCallback?) -> TurnResult,
+    private val projectId: String?,
+    private val projectPath: String?,
+) {
+
+    data class OrchestrationResult(
+        val response: String,
+        val agents: Map<String, AgentResult>,
+        val totalTokensIn: Int,
+        val totalTokensOut: Int,
+        val totalCost: Double,
+    )
+
+    suspend fun dispatch(
+        parentTaskId: String,
+        input: String,
+        contextRefs: List<ContextReference>,
+        parentModel: String?,
+        parentProvider: String?,
+        stream: Boolean,
+        streamCallback: StreamCallback?,
+        strategy: MultiAgentStrategy,
+    ): OrchestrationResult {
+        logger.info { "[ORCHESTRATE] strategy=$strategy taskId=$parentTaskId" }
+
+        return when (strategy) {
+            MultiAgentStrategy.ORCHESTRATOR -> runOrchestrator(
+                parentTaskId, input, contextRefs, parentModel, parentProvider, stream, streamCallback
+            )
+            MultiAgentStrategy.PARALLEL -> runParallel(
+                parentTaskId, input, parentModel, parentProvider, stream, streamCallback
+            )
+            MultiAgentStrategy.PIPELINE -> runPipeline(
+                parentTaskId, input, parentModel, parentProvider
+            )
+            MultiAgentStrategy.SINGLE -> throw IllegalArgumentException(
+                "OrchestrationDispatcher must not be called for SINGLE strategy"
+            )
+        }
+    }
+
+    // ========== ORCHESTRATOR ==========
+
+    private suspend fun runOrchestrator(
+        parentTaskId: String,
+        input: String,
+        contextRefs: List<ContextReference>,
+        parentModel: String?,
+        parentProvider: String?,
+        stream: Boolean,
+        streamCallback: StreamCallback?,
+    ): OrchestrationResult {
+        val coordinator = subagentRouter.getSubagent(COORDINATOR_NAME)
+            ?: throw IllegalStateException("Subagent '$COORDINATOR_NAME' not found — install built-in subagents")
+        if (!coordinator.enabled) {
+            throw IllegalStateException("Subagent '$COORDINATOR_NAME' is disabled — enable it in Settings")
+        }
+
+        val (resolvedModel, resolvedProvider) = coordinator.resolveModel(
+            configService, joinModel(parentModel, parentProvider)
+        )
+
+        val request = buildSubagentTurnRequest(
+            taskId = parentTaskId,
+            userInput = input,
+            definition = coordinator,
+            resolvedModel = resolvedModel,
+            resolvedProvider = resolvedProvider,
+            contextRefs = contextRefs,
+            emitSessionId = parentTaskId,
+            emitSourceAgentId = "orchestrator",
+        )
+
+        val result = runTurnFn(request, streamCallback.takeIf { stream })
+        return OrchestrationResult(
+            response = result.response,
+            agents = mapOf(
+                COORDINATOR_NAME to AgentResult(
+                    agentName = COORDINATOR_NAME,
+                    success = result.success,
+                    response = result.response,
+                    tokensUsed = (result.tokensIn + result.tokensOut).toLong(),
+                    costUsd = result.cost,
+                )
+            ),
+            totalTokensIn = result.tokensIn,
+            totalTokensOut = result.tokensOut,
+            totalCost = result.cost,
+        )
+    }
+
+    // ========== PARALLEL ==========
+
+    private suspend fun runParallel(
+        parentTaskId: String,
+        input: String,
+        parentModel: String?,
+        parentProvider: String?,
+        stream: Boolean,
+        streamCallback: StreamCallback?,
+    ): OrchestrationResult {
+        val specs = taskDecomposer.decompose(input, TaskDecomposer.Mode.PARALLEL)
+        if (specs.isEmpty()) {
+            throw DecompositionFailedException(
+                "Decomposer produced no plan for PARALLEL — user should re-run with SINGLE strategy"
+            )
+        }
+
+        logger.info { "[ORCHESTRATE/PARALLEL] ${specs.size} agents: ${specs.map { it.name }}" }
+
+        val results = multiAgentRunner.run(parentTaskId, specs) { spec, agentId ->
+            executeSpec(parentTaskId, spec, agentId, parentModel, parentProvider, originalInput = input)
+        }
+
+        val merged = resultMerger.merge(input, results, stream = stream, onChunk = streamCallback.takeIf { stream })
+        return buildResult(merged, results)
+    }
+
+    // ========== PIPELINE ==========
+
+    private suspend fun runPipeline(
+        parentTaskId: String,
+        input: String,
+        parentModel: String?,
+        parentProvider: String?,
+    ): OrchestrationResult {
+        val specs = taskDecomposer.decompose(input, TaskDecomposer.Mode.PIPELINE)
+        if (specs.isEmpty()) {
+            throw DecompositionFailedException(
+                "Decomposer produced no plan for PIPELINE — user should re-run with SINGLE strategy"
+            )
+        }
+
+        logger.info { "[ORCHESTRATE/PIPELINE] ${specs.size} stages: ${specs.map { it.name }}" }
+
+        val outputs = mutableMapOf<String, String>()
+        val outputsMutex = Mutex()
+
+        val results = multiAgentRunner.run(parentTaskId, specs) { spec, agentId ->
+            val resolvedTask = resolvePipelinePlaceholders(spec, outputs, outputsMutex)
+            val resolvedSpec = spec.copy(task = resolvedTask)
+            val result = executeSpec(parentTaskId, resolvedSpec, agentId, parentModel, parentProvider, originalInput = input)
+            outputsMutex.withLock { outputs[spec.name] = result.response }
+            result
+        }
+
+        // For PIPELINE, the last stage's response is the final answer — no LLM merge needed.
+        val finalResponse = specs.lastOrNull()?.let { results[it.name]?.response }.orEmpty()
+        return buildResult(finalResponse, results)
+    }
+
+    private suspend fun resolvePipelinePlaceholders(
+        spec: AgentSpec,
+        outputs: Map<String, String>,
+        outputsMutex: Mutex,
+    ): String {
+        val prevName = spec.dependsOn.firstOrNull() ?: return spec.task
+        val prevOutput = outputsMutex.withLock { outputs[prevName].orEmpty() }
+        return spec.task.replace("{{prev.output}}", prevOutput)
+    }
+
+    // ========== Per-spec execution ==========
+
+    private suspend fun executeSpec(
+        parentTaskId: String,
+        spec: AgentSpec,
+        agentId: String,
+        parentModel: String?,
+        parentProvider: String?,
+        originalInput: String,
+    ): AgentResult {
+        val subagentName = spec.profile ?: spec.name
+        val definition = subagentRouter.getSubagent(subagentName)
+            ?: throw IllegalStateException("Subagent '$subagentName' not found during orchestration")
+
+        val (resolvedModel, resolvedProvider) = definition.resolveModel(
+            configService, joinModel(parentModel, parentProvider)
+        )
+
+        val childTask = createTaskFn(
+            CreateTaskRequest(
+                name = "orchestrate: ${spec.name}",
+                mode = TaskMode.AGENT,
+                projectId = projectId ?: LEGACY_PROJECT_ID,
+                projectPath = projectPath ?: LEGACY_PROJECT_PATH,
+            )
+        )
+
+        val request = buildSubagentTurnRequest(
+            taskId = childTask.id,
+            userInput = spec.task,
+            definition = definition,
+            resolvedModel = resolvedModel,
+            resolvedProvider = resolvedProvider,
+            contextRefs = emptyList(),
+            emitSessionId = parentTaskId,
+            emitSourceAgentId = agentId,
+        )
+
+        val turnResult = runTurnFn(request, null)
+
+        // Publish per-agent bubble in the parent conversation.
+        chatMessageRepository.create(
+            taskId = parentTaskId,
+            role = MessageRole.ASSISTANT,
+            content = turnResult.response,
+            agentName = spec.name,
+            agentDepth = 1,
+            tokensIn = turnResult.tokensIn,
+            tokensOut = turnResult.tokensOut,
+            cost = turnResult.cost,
+        )
+
+        return AgentResult(
+            agentName = spec.name,
+            success = turnResult.success,
+            response = turnResult.response,
+            tokensUsed = (turnResult.tokensIn + turnResult.tokensOut).toLong(),
+            costUsd = turnResult.cost,
+        )
+    }
+
+    private fun buildSubagentTurnRequest(
+        taskId: String,
+        userInput: String,
+        definition: SubagentDefinition,
+        resolvedModel: String,
+        resolvedProvider: String,
+        contextRefs: List<ContextReference>,
+        emitSessionId: String,
+        emitSourceAgentId: String,
+    ): TurnRequest {
+        return TurnRequest(
+            taskId = taskId,
+            userInput = userInput,
+            mode = TaskMode.AGENT,
+            executionMode = ExecutionMode.AUTO,
+            model = resolvedModel,
+            provider = resolvedProvider,
+            userContextRefs = contextRefs,
+            runProfile = TurnRunProfile.SUBAGENT,
+            profileOverrides = TurnProfileOverrides(
+                subagentName = definition.name,
+                systemPromptOverride = definition.systemPrompt,
+                allowedTools = definition.allowedTools,
+                disallowedTools = definition.disallowedTools,
+                modelOverride = resolvedModel,
+                providerOverride = resolvedProvider,
+                maxIterationsOverride = definition.maxSteps,
+                depth = 0,
+                subagentChain = emptyList(),
+                contextProfile = definition.contextProfile,
+                reasoningEffort = definition.reasoningEffort,
+            ),
+            emitSessionId = emitSessionId,
+            emitSourceAgentId = emitSourceAgentId,
+        )
+    }
+
+    private fun buildResult(
+        response: String,
+        agents: Map<String, AgentResult>,
+    ): OrchestrationResult {
+        val totalTokens = agents.values.sumOf { it.tokensUsed }
+        // AgentResult.tokensUsed bundles in+out; preserve that split cheaply by halving — callers
+        // use the totals for session accounting, the exact in/out split isn't load-bearing here.
+        return OrchestrationResult(
+            response = response,
+            agents = agents,
+            totalTokensIn = (totalTokens / 2).toInt(),
+            totalTokensOut = (totalTokens - totalTokens / 2).toInt(),
+            totalCost = agents.values.sumOf { it.costUsd },
+        )
+    }
+
+    private fun joinModel(model: String?, provider: String?): String? {
+        return when {
+            model != null && provider != null -> "$provider/$model"
+            model != null -> model
+            else -> null
+        }
+    }
+
+    companion object {
+        private const val COORDINATOR_NAME = "multi-agent-coordinator"
+    }
+}
+
+class DecompositionFailedException(message: String) : Exception(message)
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/agents/orchestration/ResultMerger.kt b/core/src/main/kotlin/pl/jclab/refio/core/agents/orchestration/ResultMerger.kt
new file mode 100644
index 00000000..3584b1f4
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/agents/orchestration/ResultMerger.kt
@@ -0,0 +1,78 @@
+package pl.jclab.refio.core.agents.orchestration
+
+import pl.jclab.refio.core.agents.AgentResult
+import pl.jclab.refio.core.api.ModelOperation
+import pl.jclab.refio.core.api.StreamCallback
+import pl.jclab.refio.core.llm.LLMClient
+import pl.jclab.refio.core.llm.LLMMessage
+import pl.jclab.refio.core.logging.dualLogger
+import pl.jclab.refio.core.services.ConfigService
+
+private val logger = dualLogger("ResultMerger")
+
+/**
+ * Collapses N subagent outputs back into a single coherent answer for the user.
+ *
+ * Uses the configured default model (user expects high-quality synthesis, not a cheap model).
+ * Streams the merged response so the main chat bubble fills in real time, after the per-agent
+ * bubbles have already been written.
+ */
+class ResultMerger(
+    private val llmClient: LLMClient,
+    private val configService: ConfigService,
+) {
+    suspend fun merge(
+        originalPrompt: String,
+        results: Map<String, AgentResult>,
+        stream: Boolean = false,
+        onChunk: StreamCallback? = null,
+    ): String {
+        if (results.isEmpty()) {
+            logger.warn { "No results to merge — returning empty response" }
+            return ""
+        }
+
+        val successful = results.values.filter { it.success && it.response.isNotBlank() }
+        if (successful.isEmpty()) {
+            logger.warn { "All agent runs failed — returning concatenated errors" }
+            return results.values.joinToString("\n\n") { "[${it.agentName}] FAILED: ${it.error ?: "unknown error"}" }
+        }
+
+        val (model, provider) = configService.getModel(ModelOperation.DEFAULT)
+        val systemPrompt = """
+            You are a synthesis assistant. Multiple specialist subagents have analyzed the user's
+            request independently. Merge their findings into a single coherent response.
+
+            Rules:
+            - Do not lose concrete findings; preserve file references, code snippets, and metrics.
+            - If agents disagree, surface the disagreement and note both positions.
+            - Remove redundancy when multiple agents report the same thing.
+            - Use the same tone a senior engineer would when handing a review back to a colleague.
+        """.trimIndent()
+
+        val bundle = buildString {
+            appendLine("Original user request:")
+            appendLine(originalPrompt)
+            appendLine()
+            appendLine("Agent outputs:")
+            successful.forEach { result ->
+                appendLine()
+                appendLine("--- ${result.agentName} ---")
+                appendLine(result.response)
+            }
+        }
+
+        val response = llmClient.complete(
+            provider = provider,
+            model = model,
+            systemPrompt = systemPrompt,
+            messages = listOf(LLMMessage(role = "user", content = bundle)),
+            temperature = 0.3,
+            stream = stream,
+            onChunk = onChunk,
+            source = "ResultMerger",
+        )
+
+        return response.content
+    }
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/agents/orchestration/TaskDecomposer.kt b/core/src/main/kotlin/pl/jclab/refio/core/agents/orchestration/TaskDecomposer.kt
new file mode 100644
index 00000000..64377b61
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/agents/orchestration/TaskDecomposer.kt
@@ -0,0 +1,181 @@
+package pl.jclab.refio.core.agents.orchestration
+
+import kotlinx.serialization.SerialName
+import kotlinx.serialization.Serializable
+import kotlinx.serialization.json.Json
+import pl.jclab.refio.core.agents.AgentSpec
+import pl.jclab.refio.core.api.ModelOperation
+import pl.jclab.refio.core.llm.LLMClient
+import pl.jclab.refio.core.llm.LLMMessage
+import pl.jclab.refio.core.logging.dualLogger
+import pl.jclab.refio.core.services.ConfigService
+import pl.jclab.refio.core.subagents.SubagentRouter
+import pl.jclab.refio.core.subagents.models.SubagentInfo
+
+private val logger = dualLogger("TaskDecomposer")
+
+/**
+ * Turns a user prompt into a list of subagent invocations for PARALLEL or PIPELINE strategies.
+ *
+ * Calls an LLM with the list of available subagents and asks it to produce a JSON plan.
+ * For PIPELINE, `dependsOn` chains are linearized; for PARALLEL they remain independent.
+ *
+ * Returns an empty list when the task clearly does not need multi-agent orchestration
+ * (e.g. trivial request) — the dispatcher then falls back to the single-agent path.
+ */
+class TaskDecomposer(
+    private val llmClient: LLMClient,
+    private val configService: ConfigService,
+    private val subagentRouter: SubagentRouter?,
+) {
+    private val json = Json { ignoreUnknownKeys = true; isLenient = true }
+
+    enum class Mode { PARALLEL, PIPELINE }
+
+    suspend fun decompose(userInput: String, mode: Mode): List<AgentSpec> {
+        val router = subagentRouter ?: run {
+            logger.warn { "SubagentRouter unavailable, skipping decomposition" }
+            return emptyList()
+        }
+
+        val available = router.listSubagents(includeDisabled = false)
+        if (available.isEmpty()) {
+            logger.warn { "No subagents available — decomposition impossible" }
+            return emptyList()
+        }
+
+        val (model, provider) = configService.getModel(ModelOperation.PLAN)
+        val systemPrompt = buildSystemPrompt(available, mode)
+
+        val response = llmClient.complete(
+            provider = provider,
+            model = model,
+            systemPrompt = systemPrompt,
+            messages = listOf(LLMMessage(role = "user", content = userInput)),
+            temperature = 0.2,
+            kwargs = mapOf("response_format" to mapOf("type" to "json_object")),
+            source = "TaskDecomposer",
+        )
+
+        return parseResponse(response.content, available, mode)
+    }
+
+    private fun buildSystemPrompt(available: List<SubagentInfo>, mode: Mode): String {
+        val catalogue = available.joinToString("\n") { "- ${it.name}: ${it.description}" }
+        val modeRules = when (mode) {
+            Mode.PARALLEL -> """
+                Mode: PARALLEL
+                - Agents run concurrently with no inter-dependencies.
+                - Leave `depends_on` empty for every agent.
+                - Pick 2-4 agents that cover complementary angles of the user's request.
+            """.trimIndent()
+            Mode.PIPELINE -> """
+                Mode: PIPELINE
+                - Agents run sequentially; each agent consumes the previous agent's output.
+                - `depends_on` must be a list with the single previous agent name, or empty for the first.
+                - Use the placeholder {{prev.output}} in the `task` field to reference the previous agent's response.
+                - Usually 2-3 stages (e.g. analysis → implementation → review).
+            """.trimIndent()
+        }
+
+        return """
+            You are a task decomposition planner. Given a user request and a catalogue of available
+            subagents, produce a JSON plan that delegates the work across agents.
+
+            Available subagents:
+            $catalogue
+
+            $modeRules
+
+            Respond with **valid JSON only** matching this schema:
+            {
+              "agents": [
+                {
+                  "name": "<unique-spec-name>",
+                  "subagent": "<subagent name from the catalogue>",
+                  "task": "<concrete instruction for that agent>",
+                  "depends_on": ["<other spec name>"]
+                }
+              ]
+            }
+
+            If the request is trivial and a single agent would suffice, respond with {"agents": []}.
+            Do NOT invent subagents that are not in the catalogue.
+        """.trimIndent()
+    }
+
+    private fun parseResponse(
+        content: String,
+        available: List<SubagentInfo>,
+        mode: Mode,
+    ): List<AgentSpec> {
+        val cleaned = extractJson(content) ?: run {
+            logger.warn { "Decomposer returned non-JSON content: ${content.take(200)}" }
+            return emptyList()
+        }
+
+        val plan = try {
+            json.decodeFromString(DecompositionPlan.serializer(), cleaned)
+        } catch (e: Exception) {
+            logger.error(e) { "Failed to parse decomposition JSON: ${cleaned.take(300)}" }
+            return emptyList()
+        }
+
+        val availableNames = available.map { it.name.lowercase() }.toSet()
+        val valid = plan.agents.filter { it.subagent.lowercase() in availableNames }
+        if (valid.size < plan.agents.size) {
+            val dropped = plan.agents.filter { it.subagent.lowercase() !in availableNames }.map { it.subagent }
+            logger.warn { "Dropped unknown subagents from plan: $dropped" }
+        }
+
+        // Guard against degenerate plans: 0 or 1 agent → no orchestration needed.
+        if (valid.size < 2) {
+            logger.info { "Decomposition produced ${valid.size} agents — falling back to single-agent path" }
+            return emptyList()
+        }
+
+        val specs = valid.map { entry ->
+            AgentSpec(
+                name = entry.name.ifBlank { entry.subagent },
+                profile = entry.subagent,
+                task = entry.task,
+                dependsOn = entry.dependsOn,
+            )
+        }
+
+        return if (mode == Mode.PIPELINE) linearizePipeline(specs) else specs.map { it.copy(dependsOn = emptyList()) }
+    }
+
+    /**
+     * For PIPELINE mode, force a linear chain so each agent depends only on its predecessor —
+     * even if the LLM returned a fan-out graph.
+     */
+    private fun linearizePipeline(specs: List<AgentSpec>): List<AgentSpec> {
+        return specs.mapIndexed { index, spec ->
+            val deps = if (index == 0) emptyList() else listOf(specs[index - 1].name)
+            spec.copy(dependsOn = deps)
+        }
+    }
+
+    private fun extractJson(raw: String): String? {
+        val trimmed = raw.trim()
+        if (trimmed.startsWith("{") && trimmed.endsWith("}")) return trimmed
+        val start = trimmed.indexOf('{')
+        val end = trimmed.lastIndexOf('}')
+        return if (start in 0 until end) trimmed.substring(start, end + 1) else null
+    }
+
+    @Serializable
+    private data class DecompositionPlan(
+        val agents: List<DecomposedAgent> = emptyList(),
+    )
+
+    @Serializable
+    private data class DecomposedAgent(
+        val name: String,
+        val subagent: String,
+        val task: String,
+        @SerialName("depends_on")
+        val dependsOn: List<String> = emptyList(),
+    )
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/api/ApiModels.kt b/core/src/main/kotlin/pl/jclab/refio/core/api/ApiModels.kt
index 57a4f33b..17941782 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/api/ApiModels.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/api/ApiModels.kt
@@ -8,7 +8,6 @@ import pl.jclab.refio.core.llm.ModelConfig
 import pl.jclab.refio.core.models.context.CurrentTaskDTO
 import pl.jclab.refio.core.models.context.SubtaskDTO
 import pl.jclab.refio.core.models.context.ExecutedStepDTO
-import pl.jclab.refio.core.models.context.CodeFragmentDTO
 import pl.jclab.refio.core.models.context.ConversationMessageDTO
 import pl.jclab.refio.api.models.ContextReference
 
@@ -416,15 +415,11 @@ data class ProjectContextResponse(
     val contextBuiltAt: Long,
     // User requirements extracted from task description
     val userRequirements: Map<String, Any> = emptyMap(),
-    // RAG (Retrieval-Augmented Generation) fragments
-    val ragFragments: List<CodeFragmentDTO> = emptyList(),
     val mcpResources: List<MCPResourceResponse> = emptyList(),
     // User context references from @mentions (files, selections, providers)
     val userContextRefs: List<UserContextRefDTO> = emptyList(),
     // Conversation history
     val conversationHistory: List<ConversationMessageDTO> = emptyList(),
-    // Previous subtask summaries
-    val previousSubtasks: List<String> = emptyList(),
     // Domain analysis scores
     val domainAnalysis: Map<String, Any> = emptyMap(),
     // Project structure details
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/api/CoreApiRouter.kt b/core/src/main/kotlin/pl/jclab/refio/core/api/CoreApiRouter.kt
index 6d6b4722..d375fd3f 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/api/CoreApiRouter.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/api/CoreApiRouter.kt
@@ -4,35 +4,12 @@ import kotlinx.coroutines.CoroutineScope
 import kotlinx.coroutines.Dispatchers
 import kotlinx.coroutines.SupervisorJob
 import kotlinx.coroutines.cancel
-import kotlinx.coroutines.flow.Flow
-import kotlinx.coroutines.sync.Mutex
 import pl.jclab.refio.core.db.*
-import pl.jclab.refio.core.db.repositories.*
 import pl.jclab.refio.core.llm.LLMClient
 import pl.jclab.refio.core.services.*
-import pl.jclab.refio.core.config.ConfigKeys
-import pl.jclab.refio.core.services.analysis.EmbeddingsService
 import pl.jclab.refio.core.services.turn.*
-import pl.jclab.refio.core.services.analysis.FileAnalyzerService
-import pl.jclab.refio.core.services.analysis.CppLanguageAnalyzer
-import pl.jclab.refio.core.services.analysis.CssLanguageAnalyzer
-import pl.jclab.refio.core.services.analysis.GoLanguageAnalyzer
-import pl.jclab.refio.core.services.analysis.HtmlLanguageAnalyzer
-import pl.jclab.refio.core.services.analysis.JavaLanguageAnalyzer
-import pl.jclab.refio.core.services.analysis.KotlinLanguageAnalyzer
-import pl.jclab.refio.core.services.analysis.PythonLanguageAnalyzer
-import pl.jclab.refio.core.services.analysis.RustLanguageAnalyzer
-import pl.jclab.refio.core.services.analysis.TypeScriptLanguageAnalyzer
-import pl.jclab.refio.core.services.analysis.project.RichProjectAnalysisEngine
-import pl.jclab.refio.core.services.context.WorkingMemoryService
 import pl.jclab.refio.core.tools.base.ToolRegistry
 import pl.jclab.refio.core.utils.ProjectIdGenerator
-import pl.jclab.refio.core.workflow.IntentRouter
-import pl.jclab.refio.core.workflow.WorkflowOrchestrator
-import pl.jclab.refio.core.workflow.executors.ChatExecutor
-import pl.jclab.refio.core.workflow.executors.PlanExecutor
-import pl.jclab.refio.core.workflow.executors.StepExecutor
-import pl.jclab.refio.core.workflow.executors.SubagentExecutor
 import pl.jclab.refio.core.logging.dualLogger
 
 private val logger = dualLogger("CoreApiRouter")
@@ -49,9 +26,9 @@ private val logger = dualLogger("CoreApiRouter")
 class CoreApiRouter(
     private val toolRegistry: ToolRegistry? = null,
     private val projectRoot: java.nio.file.Path? = null,
-    private val ideProject: Any? = null,
+    private val platformProjectOverride: Any? = null,
     private val llmClientOverride: LLMClient? = null,
-    /** Platform-agnostic project handle. When provided, ideProject is derived from platformProject. */
+    /** Platform-agnostic project handle. When provided, platformProject is derived from projectHandle.platformProject. */
     val projectHandle: pl.jclab.refio.core.project.ProjectHandle? = null,
     /** Callback to invalidate codebase context cache after RAG operations. Set by plugin layer. */
     private val codebaseCacheInvalidator: (projectRoot: String) -> Unit = {}
@@ -60,79 +37,58 @@ class CoreApiRouter(
     private val routerProjectPath: String? = projectHandle?.rootPath?.toAbsolutePath()?.normalize()?.toString()
         ?: projectRoot?.toAbsolutePath()?.normalize()?.toString()
 
-    /** Resolved IDE project — from projectHandle.platformProject or direct ideProject param */
-    private val resolvedIdeProject: Any?
-        get() = ideProject ?: projectHandle?.platformProject
+    /** Resolved platform-specific project — from projectHandle.platformProject or direct override */
+    private val resolvedPlatformProject: Any?
+        get() = platformProjectOverride ?: projectHandle?.platformProject
 
-    // Repositories
-    private val chatMessageRepository = ChatMessageRepository()
-    private val subtaskRepository = SubtaskRepository()
-    private val configRepository = ConfigRepository()
-    private val apiLogRepository = ApiLogRepository()
-    private val promptsRepository = PromptsRepository()
-    private val ragRepository = RagRepository()
-    private val documentationRepository = DocumentationRepository()
-    private val snapshotRepository = SnapshotRepository()
-    private val projectAnalysisReportRepository = ProjectAnalysisReportRepository()
-    private val agentSessionRepository = pl.jclab.refio.core.db.repositories.AgentSessionRepository()
-    private val agentInstanceRepository = pl.jclab.refio.core.db.repositories.AgentInstanceRepository()
+    // Persistence layer — all repositories centralized in PersistenceModule.
+    private val persistence = pl.jclab.refio.core.api.modules.PersistenceModule()
 
-    // Multi-agent infrastructure
+    // Multi-agent event bus — persists events so Session Trace / Timeline / Graph can replay history.
     val agentEventBus = pl.jclab.refio.core.agents.events.AgentEventBus().apply {
-        // Persist all events so Session Trace / Timeline / Graph can be replayed
-        // when the user reloads a session from history.
-        setRepository(pl.jclab.refio.core.db.repositories.AgentEventSqlRepository())
+        setRepository(persistence.agentEventSqlRepository)
     }
 
-    // Hook system
-    private val hookExecutor = pl.jclab.refio.core.services.hooks.HookExecutor()
-    private val hookService = pl.jclab.refio.core.services.hooks.HookService(
-        configProvider = { pl.jclab.refio.core.config.HierarchicalConfigLoader.getInstance(projectRoot).getHooks() },
-        hookExecutor = hookExecutor
-    )
-
-    // Single source of truth for prompt section providers.
-    // IMPORTANT: this same list must be used by both TurnPromptBuilder (runtime)
-    // and ProjectContextRouter (preview) so the Context panel shows the actual
-    // prompt the model receives. Previously preview used a stripped-down path
-    // and e.g. <system_environment> was invisible in the Context panel even
-    // though the real agent call included it.
-    val promptSectionProviders: List<pl.jclab.refio.core.services.turn.PromptSectionProvider> by lazy {
-        listOf(
-            AgentPlansSectionProvider(agentPlanService),
-            pl.jclab.refio.core.services.turn.providers.SystemEnvironmentPromptProvider(projectRoot)
-        )
-    }
-
-    // Services (public for cross-module access by plugin services)
-    val taskRepository = TaskRepository()
+    // Core services (public for cross-module access by plugin services)
+    val taskRepository get() = persistence.taskRepository
     val configService = ConfigService(
-        configRepository = configRepository,
+        configRepository = persistence.configRepository,
         defaultProjectId = routerProjectId
     )
     private val promptRegistry = pl.jclab.refio.core.prompts.PromptRegistry(projectRoot)
-    val promptsService = PromptsService(promptsRepository, promptRegistry)
+    val promptsService = PromptsService(persistence.promptsRepository, promptRegistry)
     val toolPermissionsService = ToolPermissionsService(
-        configRepository = configRepository,
+        configRepository = persistence.configRepository,
         toolRegistry = toolRegistry
     )
     val toolApprovalService = ToolApprovalService()
-    val pendingUserMessageQueue = PendingUserMessageQueue(chatMessageRepository)
     val llmClient = llmClientOverride ?: LLMClient(configService)
-    private val workingMemoryService = WorkingMemoryService()
-    private val workingMemoryIntegration = WorkingMemoryIntegration(workingMemoryService)
-    val agentPlanService = AgentPlanService()
-    private val conversationSummaryService = ConversationSummaryService(
+
+    // Support services (hooks, working memory, agent plans, conversation summary, user interaction, queue).
+    private val supportServices = pl.jclab.refio.core.api.modules.SupportServicesModule(
+        projectRoot = projectRoot,
+        chatMessageRepository = persistence.chatMessageRepository,
         llmClient = llmClient,
         promptsService = promptsService,
         configService = configService,
-        chatMessageRepository = chatMessageRepository
     )
+    private val workingMemoryService get() = supportServices.workingMemoryService
+    val agentPlanService get() = supportServices.agentPlanService
+    private val conversationSummaryService get() = supportServices.conversationSummaryService
+    val userInteraction get() = supportServices.userInteraction
+    val pendingUserMessageQueue get() = supportServices.pendingUserMessageQueue
 
-    // User interaction service (public for UI to detect waiting state)
-    val userInteraction = pl.jclab.refio.core.services.orchestration.UserInteraction(
-        chatMessageRepository = chatMessageRepository
-    )
+    /**
+     * Shared between [pl.jclab.refio.core.services.turn.TurnPromptBuilder] (runtime)
+     * and [pl.jclab.refio.core.api.routers.ProjectContextRouter] (preview) so the
+     * Context panel mirrors the exact system prompt the model receives.
+     */
+    val promptSectionProviders: List<pl.jclab.refio.core.services.turn.PromptSectionProvider> by lazy {
+        listOf(
+            AgentPlansSectionProvider(agentPlanService),
+            pl.jclab.refio.core.services.turn.providers.SystemEnvironmentPromptProvider(projectRoot)
+        )
+    }
 
     /**
      * Get the ToolRegistry for this router.
@@ -143,78 +99,34 @@ class CoreApiRouter(
     }
 
     fun hasIdeProject(): Boolean {
-        return resolvedIdeProject != null
+        return resolvedPlatformProject != null
     }
 
     private val routerScope = CoroutineScope(SupervisorJob() + Dispatchers.IO)
-    private val embeddingsMutex = Mutex()
-    private val languageAnalyzers = listOf(
-        KotlinLanguageAnalyzer(),
-        JavaLanguageAnalyzer(),
-        PythonLanguageAnalyzer(),
-        TypeScriptLanguageAnalyzer(),
-        GoLanguageAnalyzer(),
-        RustLanguageAnalyzer(),
-        HtmlLanguageAnalyzer(),
-        CppLanguageAnalyzer(),
-        CssLanguageAnalyzer()
-    )
-
-    private val embeddingsService: EmbeddingsService? = if (projectRoot != null) {
-        EmbeddingsService(
-            configService = configService,
-            providerFactory = ::embeddingProviderFor
-        )
-    } else null
-
-    private val ragChunkingStrategy: ChunkingStrategy = when (ChunkingMode.fromConfig(configService.getTyped(ConfigKeys.RAG_CHUNKING_MODE))) {
-        ChunkingMode.LINE_BASED -> DefaultChunkingStrategy()
-        ChunkingMode.SEMANTIC -> SemanticChunkingStrategy()
-    }
 
-    private val fileAnalyzerService: FileAnalyzerService? = if (projectRoot != null && embeddingsService != null) {
-        FileAnalyzerService(
-            configService = configService,
-            ragRepository = ragRepository,
-            chunkingStrategy = ragChunkingStrategy,
-            embeddingsService = embeddingsService,
-            analyzers = languageAnalyzers,
-            scope = routerScope
-        )
-    } else null
-
-    private val richProjectAnalysisEngine: RichProjectAnalysisEngine? =
-        if (projectRoot != null && fileAnalyzerService != null) {
-            RichProjectAnalysisEngine(
-                fileAnalyzerService = fileAnalyzerService,
-                configService = configService,
-                repository = projectAnalysisReportRepository,
-                languageAnalyzers = languageAnalyzers
-            )
-        } else null
-
-    // ProjectAnalyzerService and ContextService (optional, requires projectRoot)
-    private val projectAnalyzer: ProjectAnalyzerService? = if (projectRoot != null) {
-        ProjectAnalyzerService(configService, richProjectAnalysisEngine)
-    } else null
-
-    private val contextService: ContextService? = if (projectRoot != null && projectAnalyzer != null) {
-        ContextService(
-            projectAnalyzer = projectAnalyzer,
-            taskRepository = taskRepository,
-            chatMessageRepository = chatMessageRepository,
-            subtaskRepository = subtaskRepository,
-            fileAnalyzerService = fileAnalyzerService,
-            configService = configService,
-            workingMemoryService = workingMemoryService,
-            conversationSummaryService = conversationSummaryService
-        )
-    } else null
+    private val embeddingProviderFactory = pl.jclab.refio.core.api.modules.EmbeddingProviderFactory(configService)
 
-    // SnapshotService (optional, requires projectRoot)
-    private val snapshotService: SnapshotService? = if (projectRoot != null) {
-        SnapshotService(snapshotRepository, projectRoot)
-    } else null
+    // Analysis stack — embeddings / RAG / analyzer / context / snapshot.
+    // Null-valued fields when projectRoot is absent (app-level router).
+    private val analysisStack = pl.jclab.refio.core.api.modules.AnalysisStack(
+        projectRoot = projectRoot,
+        configService = configService,
+        ragRepository = persistence.ragRepository,
+        snapshotRepository = persistence.snapshotRepository,
+        analysisReportRepository = persistence.projectAnalysisReportRepository,
+        taskRepository = taskRepository,
+        chatMessageRepository = persistence.chatMessageRepository,
+        subtaskRepository = persistence.subtaskRepository,
+        workingMemoryService = workingMemoryService,
+        conversationSummaryService = conversationSummaryService,
+        scope = routerScope,
+        embeddingProviderFactory = embeddingProviderFactory,
+        platformProject = resolvedPlatformProject,
+    )
+    val projectAnalyzer get() = analysisStack.projectAnalyzer
+    private val contextService get() = analysisStack.contextService
+    private val snapshotService get() = analysisStack.snapshotService
+    private val ragSearchService get() = analysisStack.ragSearchService
 
     // Tool description builder (needs ToolRegistry and ToolPermissionsService)
     private val toolDescriptionBuilder = pl.jclab.refio.core.prompts.ToolDescriptionBuilder(
@@ -222,182 +134,55 @@ class CoreApiRouter(
         toolPermissionsService = toolPermissionsService
     )
 
-    private val chatService = ChatService(
-        taskRepository = taskRepository,
-        chatMessageRepository = chatMessageRepository,
+    private val chatPlanning = pl.jclab.refio.core.api.modules.ChatPlanningModule(
+        persistence = persistence,
         configService = configService,
         llmClient = llmClient,
         promptsService = promptsService,
         toolDescriptionBuilder = toolDescriptionBuilder,
+        toolRegistry = toolRegistry,
+        toolPermissionsService = toolPermissionsService,
         contextService = contextService,
         projectRoot = projectRoot,
-        ideProject = resolvedIdeProject
     )
-    private val planningService = PlanningService(
-        taskRepository = taskRepository,
-        chatMessageRepository = chatMessageRepository,
-        subtaskRepository = subtaskRepository,
-        configService = configService,
+
+    // Agent execution stack (StepPlanner, ToolExecutor, AgentExecutor) — null
+    // when toolRegistry is absent (app-level router without a project selected).
+    private val agentExecutionModule = pl.jclab.refio.core.api.modules.AgentExecutionModule(
+        persistence = persistence,
         llmClient = llmClient,
+        configService = configService,
         promptsService = promptsService,
         toolDescriptionBuilder = toolDescriptionBuilder,
-        toolRegistry = toolRegistry,
         toolPermissionsService = toolPermissionsService,
+        toolApprovalService = toolApprovalService,
         contextService = contextService,
+        snapshotService = snapshotService,
         projectRoot = projectRoot,
-        ideProject = resolvedIdeProject
-    )
-
-    // Agent execution services (optional if toolRegistry not provided)
-    private val stepPlanner: StepPlanner? = if (toolRegistry != null) {
-        StepPlanner(
-            taskRepository,
-            subtaskRepository,
-            toolRegistry,
-            llmClient,
-            promptsService,
-            toolDescriptionBuilder,
-            configService,
-            toolPermissionsService,
-            contextService,
-            projectRoot
-        )
-    } else null
-
-    private val stepSummarizer = StepSummarizer(
-        llmClient = llmClient,
-        promptsService = promptsService,
-        configService = configService,
-        taskRepository = taskRepository
+        toolRegistry = toolRegistry
     )
-
-    private val toolExecutor: ToolExecutor? = if (toolRegistry != null) {
-        ToolExecutor(
-            toolRegistry = toolRegistry,
-            taskRepository = taskRepository,
-            subtaskRepository = subtaskRepository,
-            snapshotService = snapshotService,
-            toolPermissionsService = toolPermissionsService,
-            mode = TaskMode.AGENT,
-            executionMode = pl.jclab.refio.api.models.ExecutionMode.AUTO
-        )
-    } else null
-
-    private val agentExecutor: AgentExecutor? = if (toolExecutor != null && stepPlanner != null) {
-        AgentExecutor(
-            taskRepository = taskRepository,
-            subtaskRepository = subtaskRepository,
-            toolExecutor = toolExecutor,
-            llmClient = llmClient,
-            promptsService = promptsService,
-            configService = configService,
-            stepPlanner = stepPlanner
-        )
-    } else null
+    private val toolExecutor get() = agentExecutionModule.toolExecutor
+    private val agentExecutor get() = agentExecutionModule.agentExecutor
 
     /**
-     * AgentTurnLoop - Turn-based execution loop implementing Codex CLI-style pattern.
-     * Optional service (requires toolRegistry).
-     * Uses turn/ package components for prompt building, tool parsing, and tool execution.
+     * AgentTurnLoop — Turn-based execution loop implementing the Codex CLI-style pattern.
+     * Null when [toolRegistry] or [toolExecutor] are unavailable (app-level router).
      */
-    private val agentTurnLoop: AgentTurnLoop? = if (toolRegistry != null && toolExecutor != null) {
-        val toolResultSummarizer = ToolResultSummarizer(
-            llmClient = llmClient,
-            configService = configService,
-            taskRepository = taskRepository
-        )
-        val taskVerifier = LlmTaskVerifier(
-            llmClient = llmClient,
-            configService = configService,
-            chatMessageRepository = chatMessageRepository
-        )
-        val tokenEstimator = PromptTokenEstimator()
-
-        // Create turn/ package components
-        val turnPromptBuilder = TurnPromptBuilder(
-            promptsService = promptsService,
-            chatMessageRepository = chatMessageRepository,
-            toolDescriptionBuilder = toolDescriptionBuilder,
-            contextService = contextService,
-            workingMemoryService = workingMemoryService,
-            projectRoot = projectRoot,
-            tokenEstimator = tokenEstimator,
-            promptCache = null,  // Could be added later if needed
-            sectionProviders = promptSectionProviders,
-            configService = configService
-        )
-
-        val toolCallParser = ToolCallParser(
-            toolRegistry = toolRegistry,
-            toolPermissionsService = toolPermissionsService,
-            getJsonThinkingXmlTags = { taskId -> configService.getTyped(ConfigKeys.JSON_THINKING_XML_TAGS, taskId) }
-        )
-
-        val turnToolExecutor = TurnToolExecutor(
-            toolExecutor = toolExecutor,
-            toolRegistry = toolRegistry,
-            subtaskRepository = subtaskRepository,
-            toolResultSummarizer = toolResultSummarizer,
-            snapshotService = snapshotService,
-            workingMemoryIntegration = workingMemoryIntegration,
-            taskRepository = taskRepository,
-            chatMessageRepository = chatMessageRepository,
-            approvalService = toolApprovalService,
-            permissionsService = toolPermissionsService,
-            hookService = hookService
-        )
-
-        val turnLLMCaller = TurnLLMCaller(
-            llmClient = llmClient,
-            configService = configService
-        )
-
-        val turnResponseProcessor = TurnResponseProcessor(
-            subtaskRepository = subtaskRepository,
-            toolRegistry = toolRegistry,
-            toolDescriptionBuilder = toolDescriptionBuilder
-        )
-
-        val turnFinalizer = TurnFinalizer(
-            chatMessageRepository = chatMessageRepository
-        )
-
-        // beforeFinish guardian registry — empty by default, no behavior change.
-        // Plug in domain-specific guardians here (e.g. write-without-verification, missing-deliverable).
-        val completionGuardians = pl.jclab.refio.core.services.turn.GuardianRegistry()
-
-        val turnSubagentValidator = TurnSubagentValidator(
-            maxSubagentDepth = 3
-        )
-
-        AgentTurnLoop(
-            // Core dependencies
-            llmClient = llmClient,
-            chatMessageRepository = chatMessageRepository,
-            taskRepository = taskRepository,
-            subtaskRepository = subtaskRepository,
-            configService = configService,
-            toolRegistry = toolRegistry,
-            toolDescriptionBuilder = toolDescriptionBuilder,
-            taskVerifier = taskVerifier,
-            // turn/ package components
-            turnPromptBuilder = turnPromptBuilder,
-            toolCallParser = toolCallParser,
-            turnToolExecutor = turnToolExecutor,
-            turnLLMCaller = turnLLMCaller,
-            turnResponseProcessor = turnResponseProcessor,
-            turnFinalizer = turnFinalizer,
-            turnSubagentValidator = turnSubagentValidator,
-            completionGuardians = completionGuardians,
-            // ADR-0028: Optional dependencies
-            tokenEstimator = tokenEstimator,
-            conversationCompactor = null,
-            llmRetryHandler = null,
-            workingMemoryIntegration = workingMemoryIntegration,
-            agentEventBus = agentEventBus,
-            hookService = hookService
-        )
-    } else null
+    private val agentTurnLoop: AgentTurnLoop? = pl.jclab.refio.core.api.modules.AgentTurnLoopFactory(
+        persistence = persistence,
+        support = supportServices,
+        llmClient = llmClient,
+        configService = configService,
+        promptsService = promptsService,
+        toolDescriptionBuilder = toolDescriptionBuilder,
+        contextService = contextService,
+        snapshotService = snapshotService,
+        toolApprovalService = toolApprovalService,
+        toolPermissionsService = toolPermissionsService,
+        agentEventBus = agentEventBus,
+        promptSectionProviders = promptSectionProviders,
+        projectRoot = projectRoot,
+    ).build(toolRegistry, toolExecutor)
 
     /**
      * Observable state of the current turn execution (phase, iteration, tokens, active tool).
@@ -414,676 +199,95 @@ class CoreApiRouter(
     val lastPromptSnapshot: kotlinx.coroutines.flow.StateFlow<pl.jclab.refio.core.services.turn.PromptSnapshot?>?
         get() = agentTurnLoop?.lastPromptSnapshot
 
-    // ========== RAG Services ==========
-
-    private val ragSearchService: RagSearchService? by lazy {
-        try {
-            val embeddingModelSetting = configService.getEmbeddingModel()
-            val (providerId, _) = resolveEmbeddingProvider(embeddingModelSetting)
-            val embeddingProvider = embeddingProviderFor(providerId)
-            RagSearchService(ragRepository, embeddingProvider)
-        } catch (e: Exception) {
-            logger.warn(e) { "Failed to initialize RagSearchService: ${e.message}" }
-            null
-        }
-    }
-
     // ========== Domain Routers (RFC 0005) - Public API ==========
+    // All 12 router lazy vals + workflow plumbing live in [DomainRouters] so
+    // composition-root concerns stay separated from public API wiring.
 
-    /**
-     * Chat operations router (messages, summarization).
-     * Direct access for clients that want to bypass facade methods.
-     */
-    val chatRouter by lazy {
-        pl.jclab.refio.core.api.routers.ChatRouter(
-            chatService = chatService,
-            chatMessageRepository = chatMessageRepository,
-            taskRepository = taskRepository
-        )
-    }
-
-    /**
-     * Configuration router (models, settings).
-     * Direct access for clients that want to bypass facade methods.
-     */
-    val configRouter by lazy {
-        pl.jclab.refio.core.api.routers.ConfigRouter(
-            configService = configService,
-            llmClient = llmClient,
-            configRepository = configRepository
-        )
-    }
-
-    /**
-     * Tool management router (permissions, registry).
-     * Direct access for clients that want to bypass facade methods.
-     */
-    val toolRouter by lazy {
-        pl.jclab.refio.core.api.routers.ToolRouter(
-            toolRegistry = toolRegistry,
-            toolPermissionsService = toolPermissionsService
-        )
-    }
-
-    /**
-     * Agent execution router (step planning, execution).
-     * Direct access for clients that want to bypass facade methods.
-     */
-    val agentRouter by lazy {
-        pl.jclab.refio.core.api.routers.AgentRouter(
-            agentExecutor = agentExecutor,
-            taskRepository = taskRepository,
-            subtaskRepository = subtaskRepository,
-            chatMessageRepository = chatMessageRepository,
-            configService = configService,
-            llmClient = llmClient,
-            promptsService = promptsService,
-            contextService = contextService,
-            projectRoot = projectRoot,
-            ideProject = resolvedIdeProject,
-            toolDescriptionBuilder = toolDescriptionBuilder,
-            agentTurnLoop = agentTurnLoop
-        )
-    }
-
-    /**
-     * RAG operations router (indexing, search).
-     * Direct access for clients that want to bypass facade methods.
-     */
-    val ragRouter by lazy {
-        pl.jclab.refio.core.api.routers.RagRouter(
-            ragRepository = ragRepository,
-            documentationRepository = documentationRepository,
-            ragSearchService = ragSearchService,
-            embeddingsService = embeddingsService,
-            fileAnalyzerService = fileAnalyzerService,
-            projectRoot = projectRoot,
-            configService = configService,
-            embeddingProviderFactory = { model -> createEmbeddingProvider(model) },
-            codebaseCacheInvalidator = codebaseCacheInvalidator
-        )
-    }
-
-    /**
-     * Task management router (CRUD, queries).
-     * Direct access for clients that want to bypass facade methods.
-     */
-    val taskRouter by lazy {
-        pl.jclab.refio.core.api.routers.TaskRouter(
-            taskRepository = taskRepository,
-            configService = configService,
-            defaultProjectId = routerProjectId,
-            defaultProjectPath = routerProjectPath
-        )
-    }
-
-    /**
-     * Subtask management router (CRUD, approval, ordering).
-     * Direct access for clients that want to bypass facade methods.
-     */
-    val subtaskRouter by lazy {
-        pl.jclab.refio.core.api.routers.SubtaskRouter(
-            subtaskRepository = subtaskRepository
-        )
-    }
-
-    /**
-     * Prompts management router (system prompts, rules, slash commands).
-     * Direct access for clients that want to bypass facade methods.
-     */
-    val promptsRouter by lazy {
-        pl.jclab.refio.core.api.routers.PromptsRouter(
-            promptsService = promptsService
-        )
-    }
-
-    /**
-     * API logs management router (logging, statistics, export).
-     * Direct access for clients that want to bypass facade methods.
-     */
-    val apiLogsRouter by lazy {
-        pl.jclab.refio.core.api.routers.ApiLogsRouter(
-            apiLogRepository = apiLogRepository
-        )
-    }
-
-    /**
-     * Subagent management router (subagent definitions, execution).
-     * Direct access for clients that want to bypass facade methods.
-     */
-    val subagentRouter: pl.jclab.refio.core.subagents.SubagentRouter? by lazy {
-        if (projectRoot != null && toolRegistry != null) {
-            pl.jclab.refio.core.subagents.SubagentRouter(
-                projectRoot = projectRoot,
-                toolRegistry = toolRegistry,
-                configService = configService,
-                llmClient = llmClient,
-                toolPermissionsService = toolPermissionsService,
-                chatMessageRepository = chatMessageRepository,
-                contextService = contextService,
-                ideProject = resolvedIdeProject,
-                runTurnCallback = { request, callback ->
-                    agentRouter.runTurn(
-                        request = request,
-                        streamCallback = callback,
-                        listener = null
-                    )
-                }
-            )
-        } else {
-            null
-        }
-    }
-
-    /**
-     * Project context and analysis router (context panel, prompt preview).
-     */
-    val projectContextRouter by lazy {
-        pl.jclab.refio.core.api.routers.ProjectContextRouter(
-            contextService = contextService,
-            projectRoot = projectRoot,
-            ideProject = resolvedIdeProject,
-            taskRepository = taskRepository,
-            chatMessageRepository = chatMessageRepository,
-            promptsService = promptsService,
-            toolDescriptionBuilder = toolDescriptionBuilder,
-            projectAnalyzer = projectAnalyzer,
-            richProjectAnalysisEngine = richProjectAnalysisEngine,
-            promptSectionProviders = promptSectionProviders
-        )
-    }
-
-    /**
-     * Workflow orchestrator (unified intent routing + execution adapters).
-     */
-    val workflowOrchestrator by lazy {
-        val intentRouter = IntentRouter(
-            subtaskRepository = subtaskRepository,
-            subagentRouter = subagentRouter
-        )
-        val chatExecutor = ChatExecutor(chatService)
-        val planExecutor = PlanExecutor(planningService)
-        val stepExecutor = StepExecutor(agentRouter)
-        val subagentExecutor = subagentRouter?.let { SubagentExecutor(it) }
-
-        WorkflowOrchestrator(
-            intentRouter = intentRouter,
-            chatExecutor = chatExecutor,
-            planExecutor = planExecutor,
-            stepExecutor = stepExecutor,
-            subagentExecutor = subagentExecutor,
-            userInteraction = userInteraction
-        )
-    }
-
-    /**
-     * Multi-agent runner for parallel agent orchestration.
-     */
     val multiAgentRunner by lazy {
         pl.jclab.refio.core.agents.MultiAgentRunner(agentEventBus)
     }
 
-    /**
-     * Multi-agent session management router.
-     */
-    val multiAgentRouter by lazy {
-        pl.jclab.refio.core.api.routers.MultiAgentRouter(
-            defaultProjectId = routerProjectId,
-            defaultProjectPath = routerProjectPath,
-            agentSessionRepository = agentSessionRepository,
-            agentInstanceRepository = agentInstanceRepository,
-            multiAgentRunner = multiAgentRunner,
-            createTaskFn = { request -> taskRouter.createTask(request) },
-            runTurnFn = { request, callback -> agentRouter.runTurn(request, callback) }
-        )
-    }
-
-    init {
-        if (toolRegistry != null && projectRoot != null) {
-            try {
-                if (!toolRegistry.hasTool("invoke_subagent")) {
-                    val invokeSubagentTool = pl.jclab.refio.core.tools.implementations.InvokeSubagentTool(
-                        subagentRouterProvider = { subagentRouter },
-                        runTurnCallback = { request, turnEventListener, streamCallback ->
-                            agentRouter.runTurn(
-                                request = request,
-                                streamCallback = streamCallback,
-                                listener = turnEventListener?.let {
-                                    pl.jclab.refio.core.services.AgentTurnLoop.TurnEventListener.fromTurnEventListener(it)
-                                }
-                            )
-                        },
-                        configServiceProvider = { configService }
-                    )
-                    toolRegistry.register(invokeSubagentTool)
-                    logger.info { "CoreApiRouter: invoke_subagent tool registered" }
-                }
-
-                // Register delegate_to_strong_model only if a strong model is configured
-                val strongModel = configService.getStrongModel()
-                if (strongModel != null && !toolRegistry.hasTool("delegate_to_strong_model")) {
-                    val delegateToStrongModelTool = pl.jclab.refio.core.tools.implementations.DelegateToStrongModelTool(
-                        llmClient = llmClient,
-                        configServiceProvider = { configService },
-                        runTurnCallback = { request, turnEventListener, streamCallback ->
-                            agentRouter.runTurn(
-                                request = request,
-                                streamCallback = streamCallback,
-                                listener = turnEventListener?.let {
-                                    pl.jclab.refio.core.services.AgentTurnLoop.TurnEventListener.fromTurnEventListener(it)
-                                }
-                            )
-                        }
-                    )
-                    toolRegistry.register(delegateToStrongModelTool)
-                    logger.info { "CoreApiRouter: delegate_to_strong_model tool registered (strong model: ${strongModel.second}/${strongModel.first})" }
-                }
-
-                // Register SYSTEM tools for multi-agent orchestration
-                val tasksTool = pl.jclab.refio.core.tools.implementations.TasksTool(agentPlanService)
-                val memoryTool = pl.jclab.refio.core.tools.implementations.MemoryTool(
-                    workingMemoryService = workingMemoryService,
-                    subtaskRepository = subtaskRepository
-                )
-                val manageSubagentTool = pl.jclab.refio.core.tools.implementations.ManageSubagentTool { subagentRouter }
-                val sendMessageTool = pl.jclab.refio.core.tools.implementations.SendMessageTool(agentEventBus)
-
-                listOf(tasksTool, memoryTool, manageSubagentTool, sendMessageTool).forEach { tool ->
-                    if (!toolRegistry.hasTool(tool.name)) {
-                        toolRegistry.register(tool)
-                    }
-                }
-                logger.info { "CoreApiRouter: SYSTEM tools registered (tasks, memory, manage_subagent, send_message)" }
-            } catch (e: Exception) {
-                logger.warn(e) { "CoreApiRouter: failed to register invoke_subagent tool" }
-            }
-        }
+    private val domainRouters = pl.jclab.refio.core.api.modules.DomainRouters(
+        persistence = persistence,
+        analysisStack = analysisStack,
+        chatPlanning = chatPlanning,
+        configService = configService,
+        promptsService = promptsService,
+        llmClient = llmClient,
+        toolRegistry = toolRegistry,
+        toolPermissionsService = toolPermissionsService,
+        toolDescriptionBuilder = toolDescriptionBuilder,
+        agentExecutor = agentExecutor,
+        agentTurnLoop = agentTurnLoop,
+        userInteraction = userInteraction,
+        multiAgentRunner = multiAgentRunner,
+        projectRoot = projectRoot,
+        promptSectionProviders = promptSectionProviders,
+        routerProjectId = routerProjectId,
+        routerProjectPath = routerProjectPath,
+        embeddingProviderFactory = embeddingProviderFactory::create,
+        codebaseCacheInvalidator = codebaseCacheInvalidator,
+    )
 
-        // Apply Ollama concurrency from config
-        val ollamaMaxConcurrent = configService.get(ConfigService.KEY_OLLAMA_MAX_CONCURRENT)?.toIntOrNull()
-        if (ollamaMaxConcurrent != null && ollamaMaxConcurrent > 0) {
-            OllamaRequestGate.maxConcurrentPerEndpoint = ollamaMaxConcurrent
-            logger.info { "CoreApiRouter: Ollama maxConcurrent set to $ollamaMaxConcurrent" }
-        }
+    val chatRouter get() = domainRouters.chatRouter
+    val configRouter get() = domainRouters.configRouter
+    val toolRouter get() = domainRouters.toolRouter
+    val agentRouter get() = domainRouters.agentRouter
+    val ragRouter get() = domainRouters.ragRouter
+    val taskRouter get() = domainRouters.taskRouter
+    val subtaskRouter get() = domainRouters.subtaskRouter
+    val promptsRouter get() = domainRouters.promptsRouter
+    val apiLogsRouter get() = domainRouters.apiLogsRouter
+    val subagentRouter get() = domainRouters.subagentRouter
+    val snapshotRouter get() = domainRouters.snapshotRouter
+    val projectContextRouter get() = domainRouters.projectContextRouter
+    val workflowOrchestrator get() = domainRouters.workflowOrchestrator
+    val multiAgentRouter get() = domainRouters.multiAgentRouter
+    val orchestrationDispatcher get() = domainRouters.orchestrationDispatcher
+
+    // Internal accessors for modules in `api.modules` package
+    internal val toolRegistryOrNull: ToolRegistry? get() = toolRegistry
+    internal val projectRootOrNull: java.nio.file.Path? get() = projectRoot
+    internal val persistenceInternal get() = persistence
+    internal val workingMemoryServiceInternal get() = workingMemoryService
+    internal val ragSearchServiceInternal get() = ragSearchService
+    internal val embeddingProviderFactoryInternal get() = embeddingProviderFactory
 
-        logger.info { "CoreApiRouter initialized with services" }
-        if (contextService != null) {
-            logger.info { "CoreApiRouter: ContextService initialized with projectRoot=$projectRoot" }
-        } else {
-            logger.warn { "CoreApiRouter: ContextService NOT available (projectRoot not provided)" }
-        }
-        logger.info { "CoreApiRouter: ideProject available=${resolvedIdeProject != null}, projectHandle=${projectHandle != null}" }
-        if (toolRegistry != null) {
-            logger.info { "CoreApiRouter: Agent execution services initialized" }
-        } else {
-            logger.warn { "CoreApiRouter: Agent execution services NOT available (toolRegistry not provided)" }
+    init {
+        pl.jclab.refio.core.api.modules.CoreApiRouterBootstrap.registerSystemTools(this)
+        pl.jclab.refio.core.api.modules.CoreApiRouterBootstrap.applyOllamaConcurrency(configService)
+        logger.info {
+            "CoreApiRouter init: projectRoot=$projectRoot, contextService=${contextService != null}, " +
+                "tools=${toolRegistry != null}, platformProject=${resolvedPlatformProject != null}, " +
+                "projectHandle=${projectHandle != null}"
         }
     }
 
     // configService is accessible directly as a public property
 
-    /**
-     * Get ProjectAnalyzerService (for startup analysis and caching)
-     * Returns null if projectRoot was not provided during router creation
-     */
-    fun getProjectAnalyzerService(): ProjectAnalyzerService? {
-        return projectAnalyzer
-    }
-
     /**
      * Create a project-level router from this app-level router.
      *
      * Shares the same database but creates project-specific tools and services.
      * Used by StandaloneCoreBootstrap and CoreConnectionManager.
-     *
-     * @param projectRoot Project root directory
-     * @param projectHandle Platform-agnostic project handle (optional)
-     * @param ideProject Platform-specific project instance (optional)
-     * @return Configured project-level CoreApiRouter
      */
     fun createProjectRouter(
         projectRoot: java.nio.file.Path,
         projectHandle: pl.jclab.refio.core.project.ProjectHandle? = null,
-        ideProject: Any? = null
-    ): CoreApiRouter {
-        val toolRegistry = ToolRegistry()
-
-        val maxFileSizeBytes = configService.getTyped(ConfigKeys.MAX_FILE_SIZE).toLong() * 1024 * 1024
-        val fileLimits = pl.jclab.refio.core.tools.security.FileLimits(maxFileSize = maxFileSizeBytes)
-
-        val toolFactory = pl.jclab.refio.core.tools.base.ToolFactory(
-            projectRoot = projectRoot,
-            toolRegistry = toolRegistry,
-            llmClient = llmClient,
-            configService = configService,
-            promptsService = promptsService,
-            taskRepository = taskRepository,
-            fileLimits = fileLimits
-        )
-        val tools = toolFactory.createAllTools()
-        tools.forEach { tool -> toolRegistry.register(tool) }
-
-        return CoreApiRouter(
-            toolRegistry = toolRegistry,
-            projectRoot = projectRoot,
-            ideProject = ideProject,
-            projectHandle = projectHandle
-        )
-    }
+        platformProject: Any? = null
+    ): CoreApiRouter = pl.jclab.refio.core.api.modules.ProjectRouterFactory.create(
+        projectRoot = projectRoot,
+        projectHandle = projectHandle,
+        platformProject = platformProject,
+        llmClient = llmClient,
+        configService = configService,
+        promptsService = promptsService,
+        taskRepository = taskRepository,
+    )
 
-    /**
-     * Initialize core components (database, etc.)
-     */
+    /** Initialize core components (database, prompt defaults, RAG tool). */
     fun initialize(dbPath: String = "database.sqlite") {
-        logger.info { "Initializing core with dbPath=$dbPath" }
-        DatabaseFactory.init(dbPath)
-        promptsService.initializeDefaults()
-        if (projectRoot != null && contextService != null) {
-            val ragComponents = initializeRagSearchService()
-            ragComponents?.let { (service, modelId, providerId) ->
-                contextService.updateRagSearchConfig(service, modelId, providerId)
-                // Register on-demand rag_search tool now that the embedding stack is wired.
-                // Done here (not in ToolFactory) because RagSearchService is project-scoped and
-                // only available after embedding model resolution succeeds.
-                if (toolRegistry != null) {
-                    try {
-                        val ragTool = pl.jclab.refio.core.tools.implementations.RagSearchTool(
-                            ragSearchService = service,
-                            embeddingModel = modelId,
-                            projectRoot = projectRoot
-                        )
-                        toolRegistry.register(ragTool)
-                        logger.info { "Registered rag_search tool (model=$modelId, provider=$providerId)" }
-                    } catch (e: IllegalArgumentException) {
-                        logger.debug { "rag_search tool already registered" }
-                    } catch (e: Exception) {
-                        logger.warn(e) { "Failed to register rag_search tool: ${e.message}" }
-                    }
-                }
-            }
-        }
-    }
-
-    // ========== Snapshot API (kept — no domain router) ==========
-
-    // ========== Snapshot API ==========
-
-    /**
-     * Get snapshot content for given snapshot ID.
-     *
-     * @param snapshotId Snapshot ID (subtask_id)
-     * @return Map of file path to content
-     */
-    suspend fun getSnapshot(snapshotId: String): SnapshotResponse {
-        if (snapshotService == null) {
-            throw IllegalStateException("Snapshot operations require project context")
-        }
-
-        try {
-            val files = snapshotService.getSnapshot(snapshotId)
-            return SnapshotResponse(
-                snapshotId = snapshotId,
-                files = files
-            )
-        } catch (e: Exception) {
-            logger.error(e) { "Failed to get snapshot: $snapshotId" }
-            throw e
-        }
-    }
-
-    /**
-     * Get content of a specific file from snapshot.
-     *
-     * @param snapshotId Snapshot ID
-     * @param filePath Relative file path
-     * @return File content or null if not found
-     */
-    suspend fun getSnapshotFileContent(snapshotId: String, filePath: String): String? {
-        if (snapshotService == null) {
-            throw IllegalStateException("Snapshot operations require project context")
-        }
-
-        try {
-            val snapshot = snapshotService.getSnapshot(snapshotId)
-            return snapshot[filePath]
-        } catch (e: Exception) {
-            logger.error(e) { "Failed to get snapshot file content: $snapshotId/$filePath" }
-            return null
-        }
-    }
-
-    /**
-     * Delete all snapshots for a task.
-     * Useful for "rewind conversation" to avoid leaving orphaned snapshots after deleting subtasks.
-     */
-    fun deleteSnapshotsByTaskId(taskId: String): Int {
-        return snapshotRepository.deleteByTaskId(taskId)
-    }
-
-    private fun initializeRagSearchService(): Triple<RagSearchService, String, String>? {
-        return try {
-            val embeddingModelSetting = configService.getEmbeddingModel()
-            val (providerId, modelId) = resolveEmbeddingProvider(embeddingModelSetting)
-            val embeddingProvider = embeddingProviderFor(providerId)
-            Triple(RagSearchService(ragRepository, embeddingProvider), modelId, providerId)
-        } catch (e: Exception) {
-            logger.warn(e) { "Failed to initialize RagSearchService: ${e.message}" }
-            null
-        }
-    }
-
-    /**
-     * Create embedding provider based on model name
-     * Supports formats: "provider/modelId" (e.g., "ollama/nomic-embed-text") or just "modelId"
-     */
-    private fun createEmbeddingProvider(model: String): EmbeddingProvider {
-        val (provider, modelId) = if (model.contains("/")) {
-            val parts = model.split("/", limit = 2)
-            parts[0].lowercase() to parts[1]
-        } else {
-            null to model
-        }
-
-        return provider?.let { embeddingProviderFor(it) } ?: embeddingProviderFromModel(modelId)
-    }
-
-    private fun embeddingProviderFromModel(modelId: String?): EmbeddingProvider {
-        if (modelId == null) {
-            logger.warn { "Embedding model not specified, defaulting to OpenAI provider" }
-            return embeddingProviderFor("openai")
-        }
-
-        return when {
-            modelId.startsWith("text-embedding") -> embeddingProviderFor("openai")
-            modelId in setOf(
-                "nomic-embed-text",
-                "mxbai-embed-large",
-                "all-minilm",
-                "all-MiniLM-L6-v2"
-            ) -> embeddingProviderFor("ollama")
-
-            else -> {
-                logger.warn { "Unknown embedding model: $modelId, defaulting to OpenAI provider" }
-                embeddingProviderFor("openai")
-            }
-        }
-    }
-
-    private fun resolveEmbeddingProvider(model: String): Pair<String, String> {
-        return if (model.contains("/")) {
-            val parts = model.split("/", limit = 2)
-            parts[0].lowercase() to parts[1]
-        } else {
-            val provider = when {
-                model.startsWith("text-embedding") -> "openai"
-                model in setOf("nomic-embed-text", "mxbai-embed-large", "all-minilm", "all-MiniLM-L6-v2") -> "ollama"
-                else -> "openai"
-            }
-            provider to model
-        }
-    }
-
-    private fun embeddingProviderFor(providerId: String): EmbeddingProvider {
-        return when (providerId.lowercase()) {
-            "ollama" -> {
-                val ollamaEndpoint = configService.getTyped(ConfigKeys.PROVIDER_OLLAMA_ENDPOINT)
-                OllamaEmbeddingProvider(ollamaEndpoint)
-            }
-
-            "openai" -> OpenAIEmbeddingProvider()
-
-            else -> {
-                logger.warn { "Unknown embedding provider: $providerId, defaulting to OpenAI" }
-                OpenAIEmbeddingProvider()
-            }
-        }
-    }
-
-    private fun extractModelId(model: String): String {
-        return if (model.contains("/")) {
-            model.split("/", limit = 2)[1]
-        } else {
-            model
-        }
-    }
-
-    // ========== Documentation API ==========
-
-    /**
-     * Get all documentation sources for current project
-     *
-     * @return List of documentation sources with indexing status
-     */
-    fun getDocumentationSources(): List<DocumentationSource> {
-        return ragRouter.getDocumentationSources()
-    }
-
-    /**
-     * Add documentation source for current project (create but don't index yet)
-     *
-     * @param url Documentation base URL
-     * @param crawlDepth Maximum crawl depth (default: 2)
-     * @return Created documentation source
-     */
-    fun addDocumentationSource(
-        url: String,
-        crawlDepth: Int = 2
-    ): DocumentationSource {
-        return ragRouter.addDocumentationSource(url, crawlDepth)
-    }
-
-    fun addDocumentationFile(
-        filePath: String
-    ): DocumentationSource {
-        return ragRouter.addDocumentationFile(filePath)
-    }
-
-    /**
-     * Index documentation from source
-     *
-     * @param docId Documentation source ID
-     * @return Flow of indexing progress
-     */
-    fun indexDocumentation(docId: Int): Flow<DocIndexingProgress> {
-        return ragRouter.indexDocumentation(docId)
-    }
-
-    /**
-     * Delete documentation source and all indexed pages
-     *
-     * @param docId Documentation source ID
-     */
-    fun deleteDocumentationSource(docId: Int) {
-        ragRouter.deleteDocumentationSource(docId)
-    }
-
-    /**
-     * Delete documentation index (indexed pages) but keep the source
-     *
-     * @param docId Documentation source ID
-     */
-    fun deleteDocumentationIndex(docId: Int) {
-        ragRouter.deleteDocumentationIndex(docId)
-    }
-
-    /**
-     * Get documentation statistics for task
-     *
-     * @param taskId Task ID
-     * @return Documentation statistics
-     */
-    fun getDocumentationStatistics(taskId: String): DocStatistics {
-        return ragRouter.getDocumentationStatistics(taskId)
-    }
-
-    // ========== API Logs Management ==========
-
-    /**
-     * Get recent API logs.
-     */
-    fun getRecentApiLogs(limit: Int = 50): List<ApiLog> {
-        return apiLogsRouter.getRecentApiLogs(limit)
-    }
-
-    /**
-     * Get filtered API logs.
-     */
-    fun getFilteredApiLogs(
-        provider: String? = null,
-        model: String? = null,
-        source: String? = null,
-        limit: Int = 50
-    ): List<ApiLog> {
-        return apiLogsRouter.getFilteredApiLogs(provider, model, source, limit)
-    }
-
-    /**
-     * Get global API log statistics.
-     */
-    fun getApiLogStatistics(): ApiLogStatistics {
-        return apiLogsRouter.getApiLogStatistics()
-    }
-
-    /**
-     * Get distinct providers from API logs.
-     */
-    fun getDistinctProviders(): List<String> {
-        return apiLogsRouter.getDistinctProviders()
-    }
-
-    /**
-     * Get distinct models from API logs.
-     */
-    fun getDistinctModels(): List<String> {
-        return apiLogsRouter.getDistinctModels()
-    }
-
-    /**
-     * Get distinct sources from API logs.
-     */
-    fun getDistinctSources(): List<String> {
-        return apiLogsRouter.getDistinctSources()
-    }
-
-    /**
-     * Delete all API logs.
-     */
-    fun deleteAllApiLogs(): Int {
-        return apiLogsRouter.deleteAllApiLogs()
-    }
-
-    /**
-     * Export all API logs to JSON.
-     */
-    fun exportAllApiLogsToJson(): String {
-        return apiLogsRouter.exportAllApiLogsToJson()
-    }
-
-    /**
-     * Export all API logs to CSV.
-     */
-    fun exportAllApiLogsToCsv(): String {
-        return apiLogsRouter.exportAllApiLogsToCsv()
+        pl.jclab.refio.core.api.modules.CoreApiRouterBootstrap.initializeCore(this, dbPath)
     }
 
     fun close() {
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/api/PromptsModels.kt b/core/src/main/kotlin/pl/jclab/refio/core/api/PromptsModels.kt
index ff6ca87b..105be829 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/api/PromptsModels.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/api/PromptsModels.kt
@@ -34,9 +34,9 @@ data class SaveRuleRequest(
 )
 
 /**
- * Request to create or update a slash command
+ * Request to create or update a slash prompt
  */
-data class SaveCommandRequest(
+data class SaveSlashPromptRequest(
     val id: String? = null,
     val name: String,  // e.g., "/refactor" or "refactor" (will be normalized)
     val content: String,
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/api/modules/AgentExecutionModule.kt b/core/src/main/kotlin/pl/jclab/refio/core/api/modules/AgentExecutionModule.kt
new file mode 100644
index 00000000..73adc502
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/api/modules/AgentExecutionModule.kt
@@ -0,0 +1,88 @@
+package pl.jclab.refio.core.api.modules
+
+import pl.jclab.refio.api.models.ExecutionMode
+import pl.jclab.refio.core.db.TaskMode
+import pl.jclab.refio.core.db.repositories.ChatMessageRepository
+import pl.jclab.refio.core.db.repositories.SubtaskRepository
+import pl.jclab.refio.core.db.repositories.TaskRepository
+import pl.jclab.refio.core.llm.LLMClient
+import pl.jclab.refio.core.prompts.ToolDescriptionBuilder
+import pl.jclab.refio.core.services.AgentExecutor
+import pl.jclab.refio.core.services.ConfigService
+import pl.jclab.refio.core.services.ContextService
+import pl.jclab.refio.core.services.PromptsService
+import pl.jclab.refio.core.services.SnapshotService
+import pl.jclab.refio.core.services.StepPlanner
+import pl.jclab.refio.core.services.StepSummarizer
+import pl.jclab.refio.core.services.ToolExecutor
+import pl.jclab.refio.core.services.turn.ToolApprovalService
+import pl.jclab.refio.core.services.ToolPermissionsService
+import pl.jclab.refio.core.tools.base.ToolRegistry
+import java.nio.file.Path
+
+/**
+ * Construction of the step execution stack — [StepPlanner], [ToolExecutor],
+ * [AgentExecutor], [StepSummarizer] — used by AgentRouter for PLAN/AGENT mode.
+ *
+ * All three downstream components are optional: they are null when the router
+ * has no `toolRegistry` (app-level router, no project selected).
+ */
+class AgentExecutionModule(
+    private val persistence: PersistenceModule,
+    private val llmClient: LLMClient,
+    private val configService: ConfigService,
+    private val promptsService: PromptsService,
+    private val toolDescriptionBuilder: ToolDescriptionBuilder,
+    private val toolPermissionsService: ToolPermissionsService,
+    private val toolApprovalService: ToolApprovalService,
+    private val contextService: ContextService?,
+    private val snapshotService: SnapshotService?,
+    private val projectRoot: Path?,
+    private val toolRegistry: ToolRegistry?
+) {
+    val stepPlanner: StepPlanner? = if (toolRegistry != null) {
+        StepPlanner(
+            persistence.taskRepository,
+            persistence.subtaskRepository,
+            toolRegistry,
+            llmClient,
+            promptsService,
+            toolDescriptionBuilder,
+            configService,
+            toolPermissionsService,
+            contextService,
+            projectRoot
+        )
+    } else null
+
+    val stepSummarizer: StepSummarizer = StepSummarizer(
+        llmClient = llmClient,
+        promptsService = promptsService,
+        configService = configService,
+        taskRepository = persistence.taskRepository
+    )
+
+    val toolExecutor: ToolExecutor? = if (toolRegistry != null) {
+        ToolExecutor(
+            toolRegistry = toolRegistry,
+            taskRepository = persistence.taskRepository,
+            subtaskRepository = persistence.subtaskRepository,
+            snapshotService = snapshotService,
+            toolPermissionsService = toolPermissionsService,
+            mode = TaskMode.AGENT,
+            executionMode = ExecutionMode.AUTO
+        )
+    } else null
+
+    val agentExecutor: AgentExecutor? = if (toolExecutor != null && stepPlanner != null) {
+        AgentExecutor(
+            taskRepository = persistence.taskRepository,
+            subtaskRepository = persistence.subtaskRepository,
+            toolExecutor = toolExecutor,
+            llmClient = llmClient,
+            promptsService = promptsService,
+            configService = configService,
+            stepPlanner = stepPlanner
+        )
+    } else null
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/api/modules/AgentTurnLoopFactory.kt b/core/src/main/kotlin/pl/jclab/refio/core/api/modules/AgentTurnLoopFactory.kt
new file mode 100644
index 00000000..6dc27023
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/api/modules/AgentTurnLoopFactory.kt
@@ -0,0 +1,131 @@
+package pl.jclab.refio.core.api.modules
+
+import pl.jclab.refio.core.config.ConfigKeys
+import pl.jclab.refio.core.llm.LLMClient
+import pl.jclab.refio.core.services.*
+import pl.jclab.refio.core.services.turn.*
+import pl.jclab.refio.core.tools.base.ToolRegistry
+
+/**
+ * Wires AgentTurnLoop with all its turn/ package sub-components.
+ *
+ * Extracted from CoreApiRouter to keep composition root readable.
+ * Returns null when pre-requisites (toolRegistry, toolExecutor) are not available.
+ */
+internal class AgentTurnLoopFactory(
+    private val persistence: PersistenceModule,
+    private val support: SupportServicesModule,
+    private val llmClient: LLMClient,
+    private val configService: ConfigService,
+    private val promptsService: PromptsService,
+    private val toolDescriptionBuilder: pl.jclab.refio.core.prompts.ToolDescriptionBuilder,
+    private val contextService: ContextService?,
+    private val snapshotService: SnapshotService?,
+    private val toolApprovalService: ToolApprovalService,
+    private val toolPermissionsService: ToolPermissionsService,
+    private val agentEventBus: pl.jclab.refio.core.agents.events.AgentEventBus,
+    private val promptSectionProviders: List<PromptSectionProvider>,
+    private val projectRoot: java.nio.file.Path?,
+) {
+    private val chatMessageRepository get() = persistence.chatMessageRepository
+    private val taskRepository get() = persistence.taskRepository
+    private val subtaskRepository get() = persistence.subtaskRepository
+    private val workingMemoryService get() = support.workingMemoryService
+    private val workingMemoryIntegration get() = support.workingMemoryIntegration
+    private val hookService get() = support.hookService
+
+    fun build(toolRegistry: ToolRegistry?, toolExecutor: ToolExecutor?): AgentTurnLoop? {
+        if (toolRegistry == null || toolExecutor == null) return null
+
+        val toolResultSummarizer = ToolResultSummarizer(
+            llmClient = llmClient,
+            configService = configService,
+            taskRepository = taskRepository
+        )
+        val taskVerifier = LlmTaskVerifier(
+            llmClient = llmClient,
+            configService = configService,
+            chatMessageRepository = chatMessageRepository
+        )
+        val tokenEstimator = PromptTokenEstimator()
+
+        val turnPromptBuilder = TurnPromptBuilder(
+            promptsService = promptsService,
+            chatMessageRepository = chatMessageRepository,
+            toolDescriptionBuilder = toolDescriptionBuilder,
+            contextService = contextService,
+            workingMemoryService = workingMemoryService,
+            projectRoot = projectRoot,
+            tokenEstimator = tokenEstimator,
+            promptCache = null,
+            sectionProviders = promptSectionProviders,
+            configService = configService
+        )
+
+        val toolCallParser = ToolCallParser(
+            toolRegistry = toolRegistry,
+            toolPermissionsService = toolPermissionsService,
+            getJsonThinkingXmlTags = { taskId -> configService.getTyped(ConfigKeys.JSON_THINKING_XML_TAGS, taskId) }
+        )
+
+        val turnToolExecutor = TurnToolExecutor(
+            toolExecutor = toolExecutor,
+            toolRegistry = toolRegistry,
+            subtaskRepository = subtaskRepository,
+            toolResultSummarizer = toolResultSummarizer,
+            snapshotService = snapshotService,
+            workingMemoryIntegration = workingMemoryIntegration,
+            taskRepository = taskRepository,
+            chatMessageRepository = chatMessageRepository,
+            approvalService = toolApprovalService,
+            permissionsService = toolPermissionsService,
+            hookService = hookService
+        )
+
+        val turnLLMCaller = TurnLLMCaller(
+            llmClient = llmClient,
+            configService = configService
+        )
+
+        val turnResponseProcessor = TurnResponseProcessor(
+            subtaskRepository = subtaskRepository,
+            toolRegistry = toolRegistry,
+            toolDescriptionBuilder = toolDescriptionBuilder
+        )
+
+        val turnFinalizer = TurnFinalizer(
+            chatMessageRepository = chatMessageRepository
+        )
+
+        val completionGuardians = GuardianRegistry()
+
+        val turnSubagentValidator = TurnSubagentValidator(
+            maxSubagentDepth = 3
+        )
+
+        return AgentTurnLoop(
+            llmClient = llmClient,
+            chatMessageRepository = chatMessageRepository,
+            taskRepository = taskRepository,
+            subtaskRepository = subtaskRepository,
+            configService = configService,
+            toolRegistry = toolRegistry,
+            toolDescriptionBuilder = toolDescriptionBuilder,
+            taskVerifier = taskVerifier,
+            turnPromptBuilder = turnPromptBuilder,
+            toolCallParser = toolCallParser,
+            turnToolExecutor = turnToolExecutor,
+            turnLLMCaller = turnLLMCaller,
+            turnResponseProcessor = turnResponseProcessor,
+            turnFinalizer = turnFinalizer,
+            turnSubagentValidator = turnSubagentValidator,
+            completionGuardians = completionGuardians,
+            tokenEstimator = tokenEstimator,
+            conversationCompactor = null,
+            llmRetryHandler = null,
+            workingMemoryIntegration = workingMemoryIntegration,
+            agentEventBus = agentEventBus,
+            hookService = hookService
+        )
+    }
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/api/modules/AnalysisStack.kt b/core/src/main/kotlin/pl/jclab/refio/core/api/modules/AnalysisStack.kt
new file mode 100644
index 00000000..b97ec57c
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/api/modules/AnalysisStack.kt
@@ -0,0 +1,123 @@
+package pl.jclab.refio.core.api.modules
+
+import kotlinx.coroutines.CoroutineScope
+import pl.jclab.refio.core.config.ConfigKeys
+import pl.jclab.refio.core.db.repositories.ChatMessageRepository
+import pl.jclab.refio.core.db.repositories.ProjectAnalysisReportRepository
+import pl.jclab.refio.core.db.repositories.RagRepository
+import pl.jclab.refio.core.db.repositories.SnapshotRepository
+import pl.jclab.refio.core.db.repositories.SubtaskRepository
+import pl.jclab.refio.core.db.repositories.TaskRepository
+import pl.jclab.refio.core.logging.dualLogger
+import pl.jclab.refio.core.services.*
+import pl.jclab.refio.core.services.analysis.*
+import pl.jclab.refio.core.services.analysis.project.RichProjectAnalysisEngine
+import pl.jclab.refio.core.services.context.WorkingMemoryService
+
+/**
+ * Bundles the "analysis stack" — embeddings, RAG chunking, file analyzer,
+ * rich project analysis, project analyzer, context service and snapshot service.
+ *
+ * All fields are null when projectRoot is not provided (app-level router).
+ * Extracted from CoreApiRouter.
+ */
+internal class AnalysisStack(
+    projectRoot: java.nio.file.Path?,
+    private val configService: ConfigService,
+    private val ragRepository: RagRepository,
+    snapshotRepository: SnapshotRepository,
+    analysisReportRepository: ProjectAnalysisReportRepository,
+    taskRepository: TaskRepository,
+    chatMessageRepository: ChatMessageRepository,
+    subtaskRepository: SubtaskRepository,
+    workingMemoryService: WorkingMemoryService,
+    conversationSummaryService: ConversationSummaryService,
+    scope: CoroutineScope,
+    private val embeddingProviderFactory: EmbeddingProviderFactory,
+    platformProject: Any? = null,
+) {
+    private val logger = dualLogger("AnalysisStack")
+
+    private val embeddingProviderById: (providerId: String) -> EmbeddingProvider =
+        { providerId -> embeddingProviderFactory.forProvider(providerId) }
+    val languageAnalyzers: List<LanguageAnalyzer> = listOf(
+        KotlinLanguageAnalyzer(),
+        JavaLanguageAnalyzer(),
+        PythonLanguageAnalyzer(),
+        TypeScriptLanguageAnalyzer(),
+        GoLanguageAnalyzer(),
+        RustLanguageAnalyzer(),
+        HtmlLanguageAnalyzer(),
+        CppLanguageAnalyzer(),
+        CssLanguageAnalyzer()
+    )
+
+    val embeddingsService: EmbeddingsService? = if (projectRoot != null) {
+        EmbeddingsService(
+            configService = configService,
+            providerFactory = embeddingProviderById
+        )
+    } else null
+
+    val ragChunkingStrategy: ChunkingStrategy = when (ChunkingMode.fromConfig(configService.getTyped(ConfigKeys.RAG_CHUNKING_MODE))) {
+        ChunkingMode.LINE_BASED -> DefaultChunkingStrategy()
+        ChunkingMode.SEMANTIC -> SemanticChunkingStrategy()
+    }
+
+    val fileAnalyzerService: FileAnalyzerService? = if (projectRoot != null && embeddingsService != null) {
+        FileAnalyzerService(
+            configService = configService,
+            ragRepository = ragRepository,
+            chunkingStrategy = ragChunkingStrategy,
+            embeddingsService = embeddingsService,
+            analyzers = languageAnalyzers,
+            scope = scope
+        )
+    } else null
+
+    val richProjectAnalysisEngine: RichProjectAnalysisEngine? =
+        if (projectRoot != null && fileAnalyzerService != null) {
+            RichProjectAnalysisEngine(
+                fileAnalyzerService = fileAnalyzerService,
+                configService = configService,
+                repository = analysisReportRepository,
+                languageAnalyzers = languageAnalyzers
+            )
+        } else null
+
+    val projectAnalyzer: ProjectAnalyzerService? = if (projectRoot != null) {
+        ProjectAnalyzerService(configService, richProjectAnalysisEngine)
+    } else null
+
+    val contextService: ContextService? = if (projectRoot != null && projectAnalyzer != null) {
+        ContextService(
+            projectAnalyzer = projectAnalyzer,
+            taskRepository = taskRepository,
+            chatMessageRepository = chatMessageRepository,
+            subtaskRepository = subtaskRepository,
+            fileAnalyzerService = fileAnalyzerService,
+            configService = configService,
+            workingMemoryService = workingMemoryService,
+            conversationSummaryService = conversationSummaryService,
+            platformProject = platformProject,
+        )
+    } else null
+
+    val snapshotService: SnapshotService? = if (projectRoot != null) {
+        SnapshotService(snapshotRepository, projectRoot)
+    } else null
+
+    /**
+     * Lazy `rag_search` backend — resolves the configured embedding provider on first
+     * access and returns null (with a WARN) if that fails.
+     */
+    val ragSearchService: RagSearchService? by lazy {
+        try {
+            val (providerId, _) = embeddingProviderFactory.resolve(configService.getEmbeddingModel())
+            RagSearchService(ragRepository, embeddingProviderFactory.forProvider(providerId))
+        } catch (e: Exception) {
+            logger.warn(e) { "Failed to initialize RagSearchService: ${e.message}" }
+            null
+        }
+    }
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/api/modules/ChatPlanningModule.kt b/core/src/main/kotlin/pl/jclab/refio/core/api/modules/ChatPlanningModule.kt
new file mode 100644
index 00000000..b5df35ee
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/api/modules/ChatPlanningModule.kt
@@ -0,0 +1,57 @@
+package pl.jclab.refio.core.api.modules
+
+import pl.jclab.refio.core.llm.LLMClient
+import pl.jclab.refio.core.prompts.ToolDescriptionBuilder
+import pl.jclab.refio.core.services.ChatService
+import pl.jclab.refio.core.services.ConfigService
+import pl.jclab.refio.core.services.ContextService
+import pl.jclab.refio.core.services.PlanningService
+import pl.jclab.refio.core.services.PromptsService
+import pl.jclab.refio.core.services.ToolPermissionsService
+import pl.jclab.refio.core.tools.base.ToolRegistry
+
+/**
+ * Bundles the two "conversational-only" services: [ChatService] (CHAT mode) and
+ * [PlanningService] (PLAN mode preview + step generation).
+ *
+ * Extracted from [pl.jclab.refio.core.api.CoreApiRouter] so the composition root
+ * doesn't have to hold every service field. Both services share the same
+ * repositories, config, and tool metadata, so bundling them avoids repeating the
+ * constructor arguments in the composition root.
+ */
+internal class ChatPlanningModule(
+    persistence: PersistenceModule,
+    configService: ConfigService,
+    llmClient: LLMClient,
+    promptsService: PromptsService,
+    toolDescriptionBuilder: ToolDescriptionBuilder,
+    toolRegistry: ToolRegistry?,
+    toolPermissionsService: ToolPermissionsService,
+    contextService: ContextService?,
+    projectRoot: java.nio.file.Path?,
+) {
+    val chatService = ChatService(
+        taskRepository = persistence.taskRepository,
+        chatMessageRepository = persistence.chatMessageRepository,
+        configService = configService,
+        llmClient = llmClient,
+        promptsService = promptsService,
+        toolDescriptionBuilder = toolDescriptionBuilder,
+        contextService = contextService,
+        projectRoot = projectRoot,
+    )
+
+    val planningService = PlanningService(
+        taskRepository = persistence.taskRepository,
+        chatMessageRepository = persistence.chatMessageRepository,
+        subtaskRepository = persistence.subtaskRepository,
+        configService = configService,
+        llmClient = llmClient,
+        promptsService = promptsService,
+        toolDescriptionBuilder = toolDescriptionBuilder,
+        toolRegistry = toolRegistry,
+        toolPermissionsService = toolPermissionsService,
+        contextService = contextService,
+        projectRoot = projectRoot,
+    )
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/api/modules/CoreApiRouterBootstrap.kt b/core/src/main/kotlin/pl/jclab/refio/core/api/modules/CoreApiRouterBootstrap.kt
new file mode 100644
index 00000000..e1031707
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/api/modules/CoreApiRouterBootstrap.kt
@@ -0,0 +1,86 @@
+package pl.jclab.refio.core.api.modules
+
+import pl.jclab.refio.core.api.CoreApiRouter
+import pl.jclab.refio.core.config.ConfigKeys
+import pl.jclab.refio.core.db.DatabaseFactory
+import pl.jclab.refio.core.services.OllamaRequestGate
+import pl.jclab.refio.core.logging.dualLogger
+import pl.jclab.refio.core.services.AgentTurnLoop
+import pl.jclab.refio.core.services.ConfigService
+import pl.jclab.refio.core.tools.implementations.RagSearchTool
+
+/**
+ * One-shot bootstrap side-effects for [CoreApiRouter] that don't belong in
+ * the composition root itself:
+ *
+ *  - [registerSystemTools] — installs `invoke_subagent`, `delegate_to_strong_model`,
+ *    `tasks`, `memory`, `manage_subagent`, `send_message` into a project-scoped
+ *    router's [pl.jclab.refio.core.tools.base.ToolRegistry]. No-op when
+ *    `toolRegistry` or `projectRoot` is absent (app-level router).
+ *  - [applyOllamaConcurrency] — sets the `OllamaRequestGate` global tuned from
+ *    `providers.ollama_max_concurrent`.
+ *  - [initializeCore] — database init, prompt defaults, and RAG tool wiring.
+ *
+ * Kept as free functions so `CoreApiRouter.init {}` and `initialize()` stay
+ * short enough to read at a glance.
+ */
+internal object CoreApiRouterBootstrap {
+    private val logger = dualLogger("CoreApiRouterBootstrap")
+
+    fun registerSystemTools(router: CoreApiRouter) {
+        val toolRegistry = router.toolRegistryOrNull ?: return
+        val projectRoot = router.projectRootOrNull ?: return
+
+        SystemToolsRegistrar(
+            configService = router.configService,
+            llmClient = router.llmClient,
+            agentPlanService = router.agentPlanService,
+            workingMemoryService = router.workingMemoryServiceInternal,
+            subtaskRepository = router.persistenceInternal.subtaskRepository,
+            agentEventBus = router.agentEventBus,
+            subagentRouterProvider = { router.subagentRouter },
+            runTurnCallback = { request, listener, stream ->
+                router.agentRouter.runTurn(
+                    request = request,
+                    streamCallback = stream,
+                    listener = listener,
+                )
+            },
+        ).register(toolRegistry)
+    }
+
+    fun applyOllamaConcurrency(configService: ConfigService) {
+        val value = configService.get(ConfigKeys.OLLAMA_MAX_CONCURRENT.key)?.toIntOrNull()
+            ?.takeIf { it > 0 } ?: return
+        OllamaRequestGate.maxConcurrentPerEndpoint = value
+        logger.info { "CoreApiRouter: Ollama maxConcurrent=$value" }
+    }
+
+    fun initializeCore(router: CoreApiRouter, dbPath: String) {
+        logger.info { "Initializing core with dbPath=$dbPath" }
+        DatabaseFactory.init(dbPath)
+        router.promptsService.initializeDefaults()
+
+        val toolRegistry = router.toolRegistryOrNull ?: return
+        val projectRoot = router.projectRootOrNull ?: return
+        val ragSearchService = router.ragSearchServiceInternal ?: return
+
+        try {
+            val (providerId, modelId) = router.embeddingProviderFactoryInternal.resolve(
+                router.configService.getEmbeddingModel()
+            )
+            toolRegistry.register(
+                RagSearchTool(
+                    ragSearchService = ragSearchService,
+                    embeddingModel = modelId,
+                    projectRoot = projectRoot,
+                )
+            )
+            logger.info { "Registered rag_search tool (model=$modelId, provider=$providerId)" }
+        } catch (e: IllegalArgumentException) {
+            logger.debug { "rag_search tool already registered" }
+        } catch (e: Exception) {
+            logger.warn(e) { "Failed to register rag_search tool: ${e.message}" }
+        }
+    }
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/api/modules/DomainRouters.kt b/core/src/main/kotlin/pl/jclab/refio/core/api/modules/DomainRouters.kt
new file mode 100644
index 00000000..370e0e4d
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/api/modules/DomainRouters.kt
@@ -0,0 +1,244 @@
+package pl.jclab.refio.core.api.modules
+
+import pl.jclab.refio.core.agents.MultiAgentRunner
+import pl.jclab.refio.core.agents.events.AgentEventBus
+import pl.jclab.refio.core.agents.orchestration.OrchestrationDispatcher
+import pl.jclab.refio.core.agents.orchestration.ResultMerger
+import pl.jclab.refio.core.agents.orchestration.TaskDecomposer
+import pl.jclab.refio.core.api.CoreApiRouter
+import pl.jclab.refio.core.api.StreamCallback
+import pl.jclab.refio.core.api.TurnRequest
+import pl.jclab.refio.core.api.routers.AgentRouter
+import pl.jclab.refio.core.api.routers.ApiLogsRouter
+import pl.jclab.refio.core.api.routers.ChatRouter
+import pl.jclab.refio.core.api.routers.ConfigRouter
+import pl.jclab.refio.core.api.routers.MultiAgentRouter
+import pl.jclab.refio.core.api.routers.ProjectContextRouter
+import pl.jclab.refio.core.api.routers.PromptsRouter
+import pl.jclab.refio.core.api.routers.RagRouter
+import pl.jclab.refio.core.api.routers.SnapshotRouter
+import pl.jclab.refio.core.api.routers.SubtaskRouter
+import pl.jclab.refio.core.api.routers.TaskRouter
+import pl.jclab.refio.core.api.routers.ToolRouter
+import pl.jclab.refio.core.llm.LLMClient
+import pl.jclab.refio.core.prompts.ToolDescriptionBuilder
+import pl.jclab.refio.core.services.AgentExecutor
+import pl.jclab.refio.core.services.AgentTurnLoop
+import pl.jclab.refio.core.services.ChatService
+import pl.jclab.refio.core.services.ConfigService
+import pl.jclab.refio.core.services.ContextService
+import pl.jclab.refio.core.services.EmbeddingProvider
+import pl.jclab.refio.core.services.PlanningService
+import pl.jclab.refio.core.services.PromptsService
+import pl.jclab.refio.core.services.SnapshotService
+import pl.jclab.refio.core.services.ToolPermissionsService
+import pl.jclab.refio.core.services.analysis.EmbeddingsService
+import pl.jclab.refio.core.services.analysis.FileAnalyzerService
+import pl.jclab.refio.core.services.analysis.project.RichProjectAnalysisEngine
+import pl.jclab.refio.core.services.orchestration.UserInteraction
+import pl.jclab.refio.core.services.turn.PromptSectionProvider
+import pl.jclab.refio.core.subagents.SubagentRouter
+import pl.jclab.refio.core.tools.base.ToolRegistry
+import pl.jclab.refio.core.workflow.IntentRouter
+import pl.jclab.refio.core.workflow.WorkflowOrchestrator
+import java.nio.file.Path
+
+/**
+ * Lazy holder for the 12 domain routers + workflow plumbing exposed by [CoreApiRouter].
+ *
+ * Pulling the router wiring out of [CoreApiRouter] keeps the composition root focused
+ * on *creating* dependencies; this module *assembles* them into the public surface.
+ * Each router is lazy so callers that never touch (e.g.) the RAG surface don't pay
+ * its construction cost.
+ */
+internal class DomainRouters(
+    private val persistence: PersistenceModule,
+    private val analysisStack: AnalysisStack,
+    private val chatPlanning: ChatPlanningModule,
+    private val configService: ConfigService,
+    private val promptsService: PromptsService,
+    private val llmClient: LLMClient,
+    private val toolRegistry: ToolRegistry?,
+    private val toolPermissionsService: ToolPermissionsService,
+    private val toolDescriptionBuilder: ToolDescriptionBuilder,
+    private val agentExecutor: AgentExecutor?,
+    private val agentTurnLoop: AgentTurnLoop?,
+    private val userInteraction: UserInteraction,
+    private val multiAgentRunner: MultiAgentRunner,
+    private val projectRoot: Path?,
+    private val promptSectionProviders: List<PromptSectionProvider>,
+    private val routerProjectId: String?,
+    private val routerProjectPath: String?,
+    private val embeddingProviderFactory: (String) -> EmbeddingProvider,
+    private val codebaseCacheInvalidator: (projectRoot: String) -> Unit,
+) {
+    private val chatService get() = chatPlanning.chatService
+    private val planningService get() = chatPlanning.planningService
+    private val ragSearchService get() = analysisStack.ragSearchService
+    private val embeddingsService get() = analysisStack.embeddingsService
+    private val fileAnalyzerService get() = analysisStack.fileAnalyzerService
+    private val snapshotService get() = analysisStack.snapshotService
+    private val contextService get() = analysisStack.contextService
+    private val projectAnalyzer get() = analysisStack.projectAnalyzer
+    private val richProjectAnalysisEngine get() = analysisStack.richProjectAnalysisEngine
+    val chatRouter: ChatRouter by lazy {
+        ChatRouter(
+            chatService = chatService,
+            chatMessageRepository = persistence.chatMessageRepository,
+            taskRepository = persistence.taskRepository,
+        )
+    }
+
+    val configRouter: ConfigRouter by lazy {
+        ConfigRouter(
+            configService = configService,
+            llmClient = llmClient,
+            configRepository = persistence.configRepository,
+        )
+    }
+
+    val toolRouter: ToolRouter by lazy {
+        ToolRouter(
+            toolRegistry = toolRegistry,
+            toolPermissionsService = toolPermissionsService,
+        )
+    }
+
+    val agentRouter: AgentRouter by lazy {
+        AgentRouter(
+            agentExecutor = agentExecutor,
+            taskRepository = persistence.taskRepository,
+            subtaskRepository = persistence.subtaskRepository,
+            chatMessageRepository = persistence.chatMessageRepository,
+            configService = configService,
+            llmClient = llmClient,
+            promptsService = promptsService,
+            contextService = contextService,
+            projectRoot = projectRoot,
+            toolDescriptionBuilder = toolDescriptionBuilder,
+            agentTurnLoop = agentTurnLoop,
+        )
+    }
+
+    val ragRouter: RagRouter by lazy {
+        RagRouter(
+            ragRepository = persistence.ragRepository,
+            documentationRepository = persistence.documentationRepository,
+            ragSearchService = ragSearchService,
+            embeddingsService = embeddingsService,
+            fileAnalyzerService = fileAnalyzerService,
+            projectRoot = projectRoot,
+            configService = configService,
+            embeddingProviderFactory = embeddingProviderFactory,
+            codebaseCacheInvalidator = codebaseCacheInvalidator,
+        )
+    }
+
+    val taskRouter: TaskRouter by lazy {
+        TaskRouter(
+            taskRepository = persistence.taskRepository,
+            configService = configService,
+            defaultProjectId = routerProjectId,
+            defaultProjectPath = routerProjectPath,
+        )
+    }
+
+    val subtaskRouter: SubtaskRouter by lazy {
+        SubtaskRouter(
+            subtaskRepository = persistence.subtaskRepository,
+        )
+    }
+
+    val promptsRouter: PromptsRouter by lazy {
+        PromptsRouter(promptsService = promptsService)
+    }
+
+    val apiLogsRouter: ApiLogsRouter by lazy {
+        ApiLogsRouter(apiLogRepository = persistence.apiLogRepository)
+    }
+
+    val subagentRouter: SubagentRouter? by lazy {
+        if (projectRoot != null && toolRegistry != null) {
+            SubagentRouter(
+                projectRoot = projectRoot,
+                toolRegistry = toolRegistry,
+                configService = configService,
+                llmClient = llmClient,
+                toolPermissionsService = toolPermissionsService,
+                chatMessageRepository = persistence.chatMessageRepository,
+                contextService = contextService,
+                runTurnCallback = { request, callback ->
+                    agentRouter.runTurn(
+                        request = request,
+                        streamCallback = callback,
+                        listener = null,
+                    )
+                },
+            )
+        } else null
+    }
+
+    val snapshotRouter: SnapshotRouter by lazy {
+        SnapshotRouter(
+            snapshotService = snapshotService,
+            snapshotRepository = persistence.snapshotRepository,
+        )
+    }
+
+    val projectContextRouter: ProjectContextRouter by lazy {
+        ProjectContextRouter(
+            contextService = contextService,
+            projectRoot = projectRoot,
+            taskRepository = persistence.taskRepository,
+            chatMessageRepository = persistence.chatMessageRepository,
+            promptsService = promptsService,
+            toolDescriptionBuilder = toolDescriptionBuilder,
+            projectAnalyzer = projectAnalyzer,
+            richProjectAnalysisEngine = richProjectAnalysisEngine,
+            promptSectionProviders = promptSectionProviders,
+        )
+    }
+
+    val workflowOrchestrator: WorkflowOrchestrator by lazy {
+        val intentRouter = IntentRouter(
+            subtaskRepository = persistence.subtaskRepository,
+            subagentRouter = subagentRouter,
+        )
+        WorkflowOrchestrator(
+            intentRouter = intentRouter,
+            chatService = chatService,
+            planningService = planningService,
+            agentRouter = agentRouter,
+            subagentRouter = subagentRouter,
+            userInteraction = userInteraction,
+        )
+    }
+
+    val multiAgentRouter: MultiAgentRouter by lazy {
+        MultiAgentRouter(
+            defaultProjectId = routerProjectId,
+            defaultProjectPath = routerProjectPath,
+            agentSessionRepository = persistence.agentSessionRepository,
+            agentInstanceRepository = persistence.agentInstanceRepository,
+            multiAgentRunner = multiAgentRunner,
+            createTaskFn = { request -> taskRouter.createTask(request) },
+            runTurnFn = { request: TurnRequest, callback: StreamCallback? -> agentRouter.runTurn(request, callback) },
+        )
+    }
+
+    val orchestrationDispatcher: OrchestrationDispatcher? by lazy {
+        val router = subagentRouter ?: return@lazy null
+        OrchestrationDispatcher(
+            configService = configService,
+            subagentRouter = router,
+            multiAgentRunner = multiAgentRunner,
+            chatMessageRepository = persistence.chatMessageRepository,
+            taskDecomposer = TaskDecomposer(llmClient, configService, router),
+            resultMerger = ResultMerger(llmClient, configService),
+            createTaskFn = { request -> taskRouter.createTask(request) },
+            runTurnFn = { request, callback -> agentRouter.runTurn(request, callback) },
+            projectId = routerProjectId,
+            projectPath = routerProjectPath,
+        )
+    }
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/api/modules/EmbeddingProviderFactory.kt b/core/src/main/kotlin/pl/jclab/refio/core/api/modules/EmbeddingProviderFactory.kt
new file mode 100644
index 00000000..ff23d8ca
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/api/modules/EmbeddingProviderFactory.kt
@@ -0,0 +1,67 @@
+package pl.jclab.refio.core.api.modules
+
+import pl.jclab.refio.core.config.ConfigKeys
+import pl.jclab.refio.core.logging.dualLogger
+import pl.jclab.refio.core.services.EmbeddingProvider
+import pl.jclab.refio.core.services.OllamaEmbeddingProvider
+import pl.jclab.refio.core.services.OpenAIEmbeddingProvider
+import pl.jclab.refio.core.services.ConfigService
+
+private val logger = dualLogger("EmbeddingProviderFactory")
+
+/**
+ * Resolves `"provider/model"` strings and produces the concrete [EmbeddingProvider].
+ *
+ * Kept as a small helper rather than inlined in CoreApiRouter so both RagModule
+ * and direct callers share the same resolution rules.
+ */
+class EmbeddingProviderFactory(private val configService: ConfigService) {
+
+    /**
+     * Create a provider by the configured model string (supports "provider/model" or bare model).
+     */
+    fun create(model: String): EmbeddingProvider {
+        val (providerId, _) = resolve(model)
+        return forProvider(providerId)
+    }
+
+    /**
+     * Parse "provider/model" or a bare model name; default to a reasonable provider.
+     */
+    fun resolve(model: String): Pair<String, String> {
+        return if (model.contains("/")) {
+            val parts = model.split("/", limit = 2)
+            parts[0].lowercase() to parts[1]
+        } else {
+            val provider = when {
+                model.startsWith("text-embedding") -> "openai"
+                model in OLLAMA_DEFAULT_MODELS -> "ollama"
+                else -> "openai"
+            }
+            provider to model
+        }
+    }
+
+    fun forProvider(providerId: String): EmbeddingProvider {
+        return when (providerId.lowercase()) {
+            "ollama" -> {
+                val ollamaEndpoint = configService.getTyped(ConfigKeys.PROVIDER_OLLAMA_ENDPOINT)
+                OllamaEmbeddingProvider(ollamaEndpoint)
+            }
+            "openai" -> OpenAIEmbeddingProvider()
+            else -> {
+                logger.warn { "Unknown embedding provider: $providerId, defaulting to OpenAI" }
+                OpenAIEmbeddingProvider()
+            }
+        }
+    }
+
+    companion object {
+        private val OLLAMA_DEFAULT_MODELS = setOf(
+            "nomic-embed-text",
+            "mxbai-embed-large",
+            "all-minilm",
+            "all-MiniLM-L6-v2"
+        )
+    }
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/api/modules/PersistenceModule.kt b/core/src/main/kotlin/pl/jclab/refio/core/api/modules/PersistenceModule.kt
new file mode 100644
index 00000000..1897d078
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/api/modules/PersistenceModule.kt
@@ -0,0 +1,37 @@
+package pl.jclab.refio.core.api.modules
+
+import pl.jclab.refio.core.db.repositories.AgentEventSqlRepository
+import pl.jclab.refio.core.db.repositories.AgentInstanceRepository
+import pl.jclab.refio.core.db.repositories.AgentSessionRepository
+import pl.jclab.refio.core.db.repositories.ApiLogRepository
+import pl.jclab.refio.core.db.repositories.ChatMessageRepository
+import pl.jclab.refio.core.db.repositories.ConfigRepository
+import pl.jclab.refio.core.db.repositories.DocumentationRepository
+import pl.jclab.refio.core.db.repositories.ProjectAnalysisReportRepository
+import pl.jclab.refio.core.db.repositories.PromptsRepository
+import pl.jclab.refio.core.db.repositories.RagRepository
+import pl.jclab.refio.core.db.repositories.SnapshotRepository
+import pl.jclab.refio.core.db.repositories.SubtaskRepository
+import pl.jclab.refio.core.db.repositories.TaskRepository
+
+/**
+ * Persistence layer — all repositories in one place.
+ *
+ * Repositories are stateless SQL wrappers; creating them is cheap,
+ * but centralizing their construction here keeps CoreApiRouter free of boilerplate.
+ */
+class PersistenceModule {
+    val chatMessageRepository = ChatMessageRepository()
+    val subtaskRepository = SubtaskRepository()
+    val configRepository = ConfigRepository()
+    val apiLogRepository = ApiLogRepository()
+    val promptsRepository = PromptsRepository()
+    val ragRepository = RagRepository()
+    val documentationRepository = DocumentationRepository()
+    val snapshotRepository = SnapshotRepository()
+    val projectAnalysisReportRepository = ProjectAnalysisReportRepository()
+    val agentSessionRepository = AgentSessionRepository()
+    val agentInstanceRepository = AgentInstanceRepository()
+    val taskRepository = TaskRepository()
+    val agentEventSqlRepository = AgentEventSqlRepository()
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/api/modules/ProjectRouterFactory.kt b/core/src/main/kotlin/pl/jclab/refio/core/api/modules/ProjectRouterFactory.kt
new file mode 100644
index 00000000..8641a204
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/api/modules/ProjectRouterFactory.kt
@@ -0,0 +1,59 @@
+package pl.jclab.refio.core.api.modules
+
+import pl.jclab.refio.core.api.CoreApiRouter
+import pl.jclab.refio.core.config.ConfigKeys
+import pl.jclab.refio.core.llm.LLMClient
+import pl.jclab.refio.core.project.ProjectHandle
+import pl.jclab.refio.core.services.ConfigService
+import pl.jclab.refio.core.services.PromptsService
+import pl.jclab.refio.core.tools.base.ToolFactory
+import pl.jclab.refio.core.tools.base.ToolRegistry
+import pl.jclab.refio.core.tools.security.FileLimits
+import pl.jclab.refio.core.db.repositories.TaskRepository
+
+/**
+ * Builds a **project-scoped** [CoreApiRouter] from an **app-scoped** parent.
+ *
+ * The parent router is created with no [projectRoot] (app/global router); when the
+ * plugin opens a project, it calls this factory with the project path and receives
+ * a project-scoped router that shares the same `LLMClient`, `ConfigService`,
+ * `PromptsService`, and `TaskRepository`, but registers its own `ToolRegistry`
+ * with the full set of tools loaded.
+ *
+ * Kept out of `CoreApiRouter` so the composition root stays focused on wiring the
+ * current router's own services rather than cross-scope bootstrap.
+ */
+internal object ProjectRouterFactory {
+    fun create(
+        projectRoot: java.nio.file.Path,
+        projectHandle: ProjectHandle?,
+        platformProject: Any?,
+        llmClient: LLMClient,
+        configService: ConfigService,
+        promptsService: PromptsService,
+        taskRepository: TaskRepository,
+    ): CoreApiRouter {
+        val toolRegistry = ToolRegistry()
+
+        val maxFileSizeBytes = configService.getTyped(ConfigKeys.MAX_FILE_SIZE).toLong() * 1024 * 1024
+        val fileLimits = FileLimits(maxFileSize = maxFileSizeBytes)
+
+        val toolFactory = ToolFactory(
+            projectRoot = projectRoot,
+            toolRegistry = toolRegistry,
+            llmClient = llmClient,
+            configService = configService,
+            promptsService = promptsService,
+            taskRepository = taskRepository,
+            fileLimits = fileLimits,
+        )
+        toolFactory.createAllTools().forEach { toolRegistry.register(it) }
+
+        return CoreApiRouter(
+            toolRegistry = toolRegistry,
+            projectRoot = projectRoot,
+            platformProjectOverride = platformProject,
+            projectHandle = projectHandle,
+        )
+    }
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/api/modules/SupportServicesModule.kt b/core/src/main/kotlin/pl/jclab/refio/core/api/modules/SupportServicesModule.kt
new file mode 100644
index 00000000..5b96db67
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/api/modules/SupportServicesModule.kt
@@ -0,0 +1,51 @@
+package pl.jclab.refio.core.api.modules
+
+import pl.jclab.refio.core.db.repositories.ChatMessageRepository
+import pl.jclab.refio.core.llm.LLMClient
+import pl.jclab.refio.core.services.AgentPlanService
+import pl.jclab.refio.core.services.ConfigService
+import pl.jclab.refio.core.services.ConversationSummaryService
+import pl.jclab.refio.core.services.PendingUserMessageQueue
+import pl.jclab.refio.core.services.PromptsService
+import pl.jclab.refio.core.services.WorkingMemoryIntegration
+import pl.jclab.refio.core.services.context.WorkingMemoryService
+import pl.jclab.refio.core.services.hooks.HookExecutor
+import pl.jclab.refio.core.services.hooks.HookService
+import pl.jclab.refio.core.services.orchestration.UserInteraction
+import java.nio.file.Path
+
+/**
+ * Supporting services that sit above [PersistenceModule] but below domain routers —
+ * hook runtime, working-memory caches, agent-plan registry, conversation summarizer,
+ * user-interaction signaller, and the pending-user-message queue.
+ *
+ * Kept together so [pl.jclab.refio.core.api.CoreApiRouter] can wire them in a
+ * single line instead of eight field declarations.
+ */
+class SupportServicesModule(
+    projectRoot: Path?,
+    chatMessageRepository: ChatMessageRepository,
+    llmClient: LLMClient,
+    promptsService: PromptsService,
+    configService: ConfigService,
+) {
+    private val hookExecutor = HookExecutor()
+    val hookService = HookService(
+        configProvider = { pl.jclab.refio.core.config.HierarchicalConfigLoader.getInstance(projectRoot).getHooks() },
+        hookExecutor = hookExecutor,
+    )
+
+    val workingMemoryService = WorkingMemoryService()
+    val workingMemoryIntegration = WorkingMemoryIntegration(workingMemoryService)
+    val agentPlanService = AgentPlanService()
+
+    val conversationSummaryService = ConversationSummaryService(
+        llmClient = llmClient,
+        promptsService = promptsService,
+        configService = configService,
+        chatMessageRepository = chatMessageRepository,
+    )
+
+    val userInteraction = UserInteraction(chatMessageRepository = chatMessageRepository)
+    val pendingUserMessageQueue = PendingUserMessageQueue(chatMessageRepository)
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/api/modules/SystemToolsRegistrar.kt b/core/src/main/kotlin/pl/jclab/refio/core/api/modules/SystemToolsRegistrar.kt
new file mode 100644
index 00000000..f3f89faa
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/api/modules/SystemToolsRegistrar.kt
@@ -0,0 +1,77 @@
+package pl.jclab.refio.core.api.modules
+
+import pl.jclab.refio.core.agents.events.AgentEventBus
+import pl.jclab.refio.core.db.repositories.SubtaskRepository
+import pl.jclab.refio.core.llm.LLMClient
+import pl.jclab.refio.core.logging.dualLogger
+import pl.jclab.refio.core.services.AgentPlanService
+import pl.jclab.refio.core.services.ConfigService
+import pl.jclab.refio.core.services.context.WorkingMemoryService
+import pl.jclab.refio.core.subagents.SubagentRouter
+import pl.jclab.refio.core.tools.base.ToolRegistry
+
+private val logger = dualLogger("SystemToolsRegistrar")
+
+/**
+ * Registers system-level tools (invoke_subagent, delegate_to_strong_model,
+ * tasks, memory, manage_subagent, send_message) into a ToolRegistry.
+ *
+ * Extracted from CoreApiRouter.init to keep the composition root readable.
+ */
+internal class SystemToolsRegistrar(
+    private val configService: ConfigService,
+    private val llmClient: LLMClient,
+    private val agentPlanService: AgentPlanService,
+    private val workingMemoryService: WorkingMemoryService,
+    private val subtaskRepository: SubtaskRepository,
+    private val agentEventBus: AgentEventBus,
+    private val subagentRouterProvider: () -> SubagentRouter?,
+    private val runTurnCallback: suspend (
+        pl.jclab.refio.core.api.TurnRequest,
+        pl.jclab.refio.core.services.turn.TurnEventListener?,
+        pl.jclab.refio.core.api.StreamCallback?
+    ) -> pl.jclab.refio.core.services.TurnResult
+) {
+
+    fun register(toolRegistry: ToolRegistry) {
+        try {
+            if (!toolRegistry.hasTool("invoke_subagent")) {
+                val invokeSubagentTool = pl.jclab.refio.core.tools.implementations.InvokeSubagentTool(
+                    subagentRouterProvider = subagentRouterProvider,
+                    runTurnCallback = runTurnCallback,
+                    configServiceProvider = { configService }
+                )
+                toolRegistry.register(invokeSubagentTool)
+                logger.info { "invoke_subagent tool registered" }
+            }
+
+            val strongModel = configService.getStrongModel()
+            if (strongModel != null && !toolRegistry.hasTool("delegate_to_strong_model")) {
+                val delegateToStrongModelTool = pl.jclab.refio.core.tools.implementations.DelegateToStrongModelTool(
+                    llmClient = llmClient,
+                    configServiceProvider = { configService },
+                    runTurnCallback = runTurnCallback
+                )
+                toolRegistry.register(delegateToStrongModelTool)
+                logger.info { "delegate_to_strong_model tool registered (strong model: ${strongModel.second}/${strongModel.first})" }
+            }
+
+            val tasksTool = pl.jclab.refio.core.tools.implementations.TasksTool(agentPlanService)
+            val memoryTool = pl.jclab.refio.core.tools.implementations.MemoryTool(
+                workingMemoryService = workingMemoryService,
+                subtaskRepository = subtaskRepository
+            )
+            val manageSubagentTool = pl.jclab.refio.core.tools.implementations.ManageSubagentTool(subagentRouterProvider)
+            val sendMessageTool = pl.jclab.refio.core.tools.implementations.SendMessageTool(agentEventBus)
+
+            listOf(tasksTool, memoryTool, manageSubagentTool, sendMessageTool).forEach { tool ->
+                if (!toolRegistry.hasTool(tool.name)) {
+                    toolRegistry.register(tool)
+                }
+            }
+            logger.info { "SYSTEM tools registered (tasks, memory, manage_subagent, send_message)" }
+        } catch (e: Exception) {
+            logger.warn(e) { "Failed to register system tools" }
+        }
+    }
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/api/routers/AgentRouter.kt b/core/src/main/kotlin/pl/jclab/refio/core/api/routers/AgentRouter.kt
index 30aef4db..44077716 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/api/routers/AgentRouter.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/api/routers/AgentRouter.kt
@@ -18,6 +18,7 @@ import pl.jclab.refio.core.api.ModelOperation
 import pl.jclab.refio.core.services.AgentExecutor
 import pl.jclab.refio.core.services.AgentTurnLoop
 import pl.jclab.refio.core.services.ConfigService
+import pl.jclab.refio.core.services.turn.TurnEventListener
 import pl.jclab.refio.core.services.ContextService
 import pl.jclab.refio.core.services.TurnResult
 import pl.jclab.refio.core.db.PromptType
@@ -48,7 +49,6 @@ private val logger = dualLogger("AgentRouter")
  * @property promptsService Prompts service
  * @property contextService Context service (for execution summaries)
  * @property projectRoot Project root path (for execution summaries)
- * @property ideProject IntelliJ project instance (for context building)
  */
 class AgentRouter(
     private val agentExecutor: AgentExecutor?,
@@ -60,7 +60,6 @@ class AgentRouter(
     private val promptsService: pl.jclab.refio.core.services.PromptsService,
     private val contextService: ContextService?,
     private val projectRoot: Path?,
-    private val ideProject: Any?,
     private val toolDescriptionBuilder: pl.jclab.refio.core.prompts.ToolDescriptionBuilder,
     private val agentTurnLoop: pl.jclab.refio.core.services.AgentTurnLoop? = null
 ) : Router {
@@ -428,8 +427,7 @@ _Execution time: ${result.durationMs}ms_
         // Build full project context using ContextService
         val projectContext = contextService.buildProjectContext(
             projectRoot = projectRoot,
-            taskId = taskId,
-            project = ideProject
+            taskId = taskId
         )
 
         // Calculate statistics
@@ -730,7 +728,7 @@ _Execution time: ${result.durationMs}ms_
     suspend fun runTurn(
         request: TurnRequest,
         streamCallback: StreamCallback? = null,
-        listener: AgentTurnLoop.TurnEventListener? = null
+        listener: TurnEventListener? = null
     ): TurnResult {
         val turnLoop = agentTurnLoop
             ?: throw IllegalStateException("AgentTurnLoop not available - toolRegistry is not configured")
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/api/routers/ConfigRouter.kt b/core/src/main/kotlin/pl/jclab/refio/core/api/routers/ConfigRouter.kt
index 4fe83f9b..e236651c 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/api/routers/ConfigRouter.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/api/routers/ConfigRouter.kt
@@ -76,13 +76,16 @@ class ConfigRouter(
      * @param provider Optional provider filter (ollama, openai, anthropic)
      * @return List of ModelInfo with visibility settings
      */
-    suspend fun getModelsWithVisibility(provider: String? = null): List<ModelInfo> {
-        logger.info { "[ConfigRouter] Getting models with visibility: provider=${provider ?: "all"}" }
+    suspend fun getModelsWithVisibility(
+        provider: String? = null,
+        fetchIfMissing: Boolean = true
+    ): List<ModelInfo> {
+        logger.info { "[ConfigRouter] Getting models with visibility: provider=${provider ?: "all"}, fetchIfMissing=$fetchIfMissing" }
 
         val models = if (provider != null) {
             getModelsByProvider(provider, configService)
         } else {
-            getAllModels(configService)
+            getAllModels(configService, fetchIfMissing = fetchIfMissing)
         }
 
         // Get visibility settings
@@ -241,11 +244,11 @@ class ConfigRouter(
                 if (provider.equals("lmstudio", ignoreCase = true) && resolvedBaseUrl.isNullOrEmpty()) {
                     resolvedBaseUrl = "http://localhost:1234/v1"
                 }
-                if (provider.equals("custom_openai", ignoreCase = true) && resolvedBaseUrl.isNullOrEmpty()) {
+                if (provider.equals("generic_openai", ignoreCase = true) && resolvedBaseUrl.isNullOrEmpty()) {
                     resolvedBaseUrl = configService.getTyped(ConfigKeys.PROVIDER_CUSTOM_OPENAI_BASE_URL)
                 }
                 if (provider.equals("zai", ignoreCase = true)) {
-                    resolvedBaseUrl = configService.normalizeZAIBaseUrl(resolvedBaseUrl)
+                    resolvedBaseUrl = pl.jclab.refio.core.llm.adapters.ZAIUrls.normalize(resolvedBaseUrl)
                 }
 
                 when (provider.lowercase()) {
@@ -272,7 +275,7 @@ class ConfigRouter(
                     "lmstudio" -> {
                         resolvedBaseUrl = resolvedBaseUrl ?: "http://localhost:1234/v1"
                     }
-                    "custom_openai" -> {
+                    "generic_openai" -> {
                         if (resolvedBaseUrl.isNullOrEmpty()) {
                             return@withTimeout TestConnectionResult(
                                 success = false,
@@ -285,26 +288,26 @@ class ConfigRouter(
                 }
 
                 val tempConfigKey = when (provider.lowercase()) {
-                    "ollama" -> ConfigService.KEY_PROVIDER_OLLAMA_ENDPOINT to (baseUrl ?: "")
-                    "anthropic" -> ConfigService.KEY_PROVIDER_ANTHROPIC_API_KEY to (apiKey ?: "")
-                    "openai" -> ConfigService.KEY_PROVIDER_OPENAI_API_KEY to (apiKey ?: "")
-                    "openrouter" -> ConfigService.KEY_PROVIDER_OPENROUTER_API_KEY to (apiKey ?: "")
-                    "gemini" -> ConfigService.KEY_PROVIDER_GEMINI_API_KEY to (apiKey ?: "")
-                    "lmstudio" -> ConfigService.KEY_PROVIDER_LM_STUDIO_API_KEY to (apiKey ?: "")
-                    "custom_openai" -> ConfigService.KEY_PROVIDER_CUSTOM_OPENAI_API_KEY to (apiKey ?: "")
-                    "zai" -> ConfigService.KEY_PROVIDER_ZAI_API_KEY to (apiKey ?: "")
+                    "ollama" -> ConfigKeys.PROVIDER_OLLAMA_ENDPOINT.key to (baseUrl ?: "")
+                    "anthropic" -> ConfigKeys.PROVIDER_ANTHROPIC_API_KEY.key to (apiKey ?: "")
+                    "openai" -> ConfigKeys.PROVIDER_OPENAI_API_KEY.key to (apiKey ?: "")
+                    "openrouter" -> ConfigKeys.PROVIDER_OPENROUTER_API_KEY.key to (apiKey ?: "")
+                    "gemini" -> ConfigKeys.PROVIDER_GEMINI_API_KEY.key to (apiKey ?: "")
+                    "lmstudio" -> ConfigKeys.PROVIDER_LM_STUDIO_API_KEY.key to (apiKey ?: "")
+                    "generic_openai" -> ConfigKeys.PROVIDER_CUSTOM_OPENAI_API_KEY.key to (apiKey ?: "")
+                    "zai" -> ConfigKeys.PROVIDER_ZAI_API_KEY.key to (apiKey ?: "")
                     else -> null
                 }
 
                 when (provider.lowercase()) {
-                    "custom_openai" -> {
-                        resolvedBaseUrl?.let { configService.set(ConfigService.KEY_PROVIDER_CUSTOM_OPENAI_BASE_URL, it, ConfigScope.APP) }
+                    "generic_openai" -> {
+                        resolvedBaseUrl?.let { configService.set(ConfigKeys.PROVIDER_CUSTOM_OPENAI_BASE_URL.key, it, ConfigScope.APP) }
                         config["model"]?.takeIf { it.isNotBlank() }?.let {
-                            configService.set(ConfigService.KEY_PROVIDER_CUSTOM_OPENAI_MODEL, it, ConfigScope.APP)
+                            configService.set(ConfigKeys.PROVIDER_CUSTOM_OPENAI_MODEL.key, it, ConfigScope.APP)
                         }
                     }
                     "zai" -> {
-                        resolvedBaseUrl?.let { configService.set(ConfigService.KEY_PROVIDER_ZAI_BASE_URL, it, ConfigScope.APP) }
+                        resolvedBaseUrl?.let { configService.set(ConfigKeys.PROVIDER_ZAI_BASE_URL.key, it, ConfigScope.APP) }
                     }
                 }
 
@@ -323,7 +326,7 @@ class ConfigRouter(
                         "openai" -> System.setProperty("OPENAI_API_KEY", apiKey)
                         "openrouter" -> System.setProperty("OPENROUTER_API_KEY", apiKey)
                         "lmstudio" -> System.setProperty("LM_STUDIO_API_KEY", apiKey)
-                        "custom_openai" -> System.setProperty("CUSTOM_OPENAI_API_KEY", apiKey)
+                        "generic_openai" -> System.setProperty("CUSTOM_OPENAI_API_KEY", apiKey)
                         "zai" -> System.setProperty("ZAI_API_KEY", apiKey)
                     }
                 }
@@ -331,8 +334,8 @@ class ConfigRouter(
                 when (provider.lowercase()) {
                     "ollama" -> resolvedBaseUrl?.let { System.setProperty("OLLAMA_BASE_URL", it) }
                     "lmstudio" -> resolvedBaseUrl?.let { System.setProperty("LM_STUDIO_BASE_URL", it) }
-                    "custom_openai" -> resolvedBaseUrl?.let { System.setProperty("CUSTOM_OPENAI_BASE_URL", it) }
-                    "zai" -> resolvedBaseUrl?.let { System.setProperty("ZAI_BASE_URL", configService.normalizeZAIBaseUrl(it)) }
+                    "generic_openai" -> resolvedBaseUrl?.let { System.setProperty("CUSTOM_OPENAI_BASE_URL", it) }
+                    "zai" -> resolvedBaseUrl?.let { System.setProperty("ZAI_BASE_URL", pl.jclab.refio.core.llm.adapters.ZAIUrls.normalize(it)) }
                 }
 
                 try {
@@ -464,6 +467,10 @@ class ConfigRouter(
         logger.info { "[ConfigRouter] Refreshing models for provider: $provider" }
 
         try {
+            // Force fresh fetch: model metadata (e.g. Ollama maxContext) depends on
+            // current provider config and would otherwise be served from stale cache.
+            clearModelsCache()
+
             logger.info { "Fetching models dynamically for $provider" }
             val models = getModelsByProvider(provider, configService)
 
@@ -503,7 +510,7 @@ class ConfigRouter(
     suspend fun refreshAllModels(): List<ModelInfo> {
         logger.info { "[ConfigRouter] Refreshing models for all providers" }
 
-        val allProviders = listOf("ollama", "anthropic", "openai", "openrouter", "gemini", "lmstudio", "custom_openai", "zai")
+        val allProviders = listOf("ollama", "anthropic", "openai", "openrouter", "gemini", "lmstudio", "generic_openai", "zai")
         val allModels = mutableListOf<ModelInfo>()
 
         for (provider in allProviders) {
@@ -564,6 +571,7 @@ class ConfigRouter(
     fun initializeProviderKeys() {
         logger.info { "[ConfigRouter] Initializing provider keys on startup" }
         syncProviderKeysToSystemProperties()
+        clearModelsCache()
     }
 
     /**
@@ -608,6 +616,11 @@ class ConfigRouter(
                 description = "Setting for $section"
             )
 
+            // Invalidate ConfigResolver cache — writing via configRepository bypasses it,
+            // which would otherwise keep serving the stale value (e.g. Ollama context size
+            // staying 32768 after the user bumped it to 65536).
+            configService.invalidateConfigCache(fullKey)
+
             logger.info { "Config updated: $fullKey = $value" }
         }
 
@@ -784,12 +797,12 @@ class ConfigRouter(
                         logger.debug { "Set LM_STUDIO_BASE_URL from database" }
                     }
 
-                    providerName == "custom_openai" && settingType == "custom_openai_api_key" -> {
+                    providerName == "generic_openai" && settingType == "generic_openai_api_key" -> {
                         System.setProperty("CUSTOM_OPENAI_API_KEY", config.value)
                         logger.debug { "Set CUSTOM_OPENAI_API_KEY from database" }
                     }
 
-                    providerName == "custom_openai" && settingType == "custom_openai_base_url" -> {
+                    providerName == "generic_openai" && settingType == "generic_openai_base_url" -> {
                         System.setProperty("CUSTOM_OPENAI_BASE_URL", config.value)
                         logger.debug { "Set CUSTOM_OPENAI_BASE_URL from database" }
                     }
@@ -800,7 +813,7 @@ class ConfigRouter(
                     }
 
                     providerName == "zai" && settingType == "zai_base_url" -> {
-                        System.setProperty("ZAI_BASE_URL", configService.normalizeZAIBaseUrl(config.value))
+                        System.setProperty("ZAI_BASE_URL", pl.jclab.refio.core.llm.adapters.ZAIUrls.normalize(config.value))
                         logger.debug { "Set ZAI_BASE_URL from database" }
                     }
 
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/api/routers/ProjectContextRouter.kt b/core/src/main/kotlin/pl/jclab/refio/core/api/routers/ProjectContextRouter.kt
index c351d528..49d1e8f6 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/api/routers/ProjectContextRouter.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/api/routers/ProjectContextRouter.kt
@@ -25,7 +25,6 @@ private val logger = dualLogger("ProjectContextRouter")
 class ProjectContextRouter(
     private val contextService: ContextService?,
     private val projectRoot: Path?,
-    private val ideProject: Any?,
     private val taskRepository: TaskRepository,
     private val chatMessageRepository: ChatMessageRepository,
     private val promptsService: PromptsService,
@@ -76,7 +75,6 @@ class ProjectContextRouter(
             val context = contextService.buildProjectContext(
                 projectRoot = projectRoot,
                 taskId = taskId,
-                project = ideProject,
                 query = effectiveQuery,
                 userContextRefs = userContextRefs
             )
@@ -305,9 +303,13 @@ class ProjectContextRouter(
         contextSectionTokens: Map<String, ContextSectionTokenInfo>
     ): RuntimePromptPreview {
         val toolDescriptions = toolDescriptionBuilder.getToolDescriptions(TaskMode.AGENT, task.id)
+        val toolSelectionMatrix = toolDescriptionBuilder.getToolSelectionMatrix(TaskMode.AGENT, task.id)
         val baseSystemPromptRaw = promptsService.getSystemPrompt(
             type = PromptType.SYSTEM_AGENT,
-            variables = mapOf("tool_descriptions" to toolDescriptions)
+            variables = mapOf(
+                "tool_descriptions" to toolDescriptions,
+                "tool_selection_matrix" to toolSelectionMatrix
+            )
         )
         // Apply the same section providers used by TurnPromptBuilder in runtime
         // so the preview reflects the real prompt (including <system_environment>).
@@ -358,7 +360,7 @@ class ProjectContextRouter(
         if (contextService != null && projectRoot != null) {
             return try {
                 val turnMessages = contextService.buildAgentTurnMessages(
-                    taskId = taskId, projectRoot = projectRoot, project = null,
+                    taskId = taskId, projectRoot = projectRoot,
                     userContextRefs = userContextRefs, query = query
                 ).messages.toMutableList()
                 appendPendingUserMessage(turnMessages, pendingUserInput)
@@ -613,13 +615,11 @@ class ProjectContextRouter(
             analyzedAt = context.contextGeneratedAt.toEpochMilli(),
             contextBuiltAt = context.contextGeneratedAt.toEpochMilli(),
             userRequirements = context.userRequirements,
-            ragFragments = context.ragFragments,
             mcpResources = context.mcpResources.map {
                 MCPResourceResponse(serverId = it.serverId, uri = it.uri, name = it.name, description = it.description, mimeType = it.mimeType)
             },
             userContextRefs = userContextRefDTOs,
             conversationHistory = conversationDTOs,
-            previousSubtasks = context.executedSteps.map { it.displayContent },
             domainAnalysis = context.domainAnalysis,
             directoryCount = context.structure.directoryCount,
             maxDepth = context.structure.maxDepth,
@@ -659,7 +659,7 @@ class ProjectContextRouter(
     private fun findNextKnownSectionStart(prompt: String, fromIndex: Int): Int? {
         val knownSectionTags = listOf(
             "PROJECT_CONTEXT", "CURRENT_TASK", "USER_REQUIREMENTS", "USER_PROVIDED_CONTEXT",
-            "WORKING_MEMORY", "MCP_RESOURCES", "RAG_FRAGMENTS", "CONVERSATION_HISTORY",
+            "WORKING_MEMORY", "MCP_RESOURCES", "CONVERSATION_HISTORY",
             "RECENT_WORK", "SUBTASKS_STATUS", "KEY_COMPONENTS", "PROJECT_DEPENDENCIES", "CODE_ANALYSIS"
         )
         var nextIndex: Int? = null
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/api/routers/PromptsRouter.kt b/core/src/main/kotlin/pl/jclab/refio/core/api/routers/PromptsRouter.kt
index d121b05a..c2cd10fd 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/api/routers/PromptsRouter.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/api/routers/PromptsRouter.kt
@@ -9,7 +9,7 @@ private val logger = dualLogger("PromptsRouter")
 
 /**
  * Router for prompts management operations.
- * Handles system prompts, rules, and slash commands.
+ * Handles system prompts, rules, and slash prompts.
  *
  * @property promptsService Prompts management service
  */
@@ -118,48 +118,48 @@ class PromptsRouter(
         return PromptResponse(prompt = rule.toDto())
     }
 
-    // ===== Commands =====
+    // ===== Slash Prompts =====
 
     /**
-     * Get all enabled slash commands.
+     * Get all enabled slash prompts.
      */
-    fun getEnabledCommands(): PromptsListResponse {
-        logger.info { "[PromptsRouter] Getting enabled commands" }
-        val commands = promptsService.getEnabledCommands()
+    fun getEnabledSlashPrompts(): PromptsListResponse {
+        logger.info { "[PromptsRouter] Getting enabled slash prompts" }
+        val slashPrompts = promptsService.getEnabledSlashPrompts()
         return PromptsListResponse(
-            prompts = commands.map { it.toDto() },
-            count = commands.size
+            prompts = slashPrompts.map { it.toDto() },
+            count = slashPrompts.size
         )
     }
 
     /**
-     * Find slash command by name.
+     * Find slash prompt by name.
      */
-    fun findCommand(commandName: String): PromptResponse? {
-        logger.info { "[PromptsRouter] Finding command: $commandName" }
-        val command = promptsService.findCommand(commandName)
-        return command?.let { PromptResponse(prompt = it.toDto()) }
+    fun findSlashPrompt(name: String): PromptResponse? {
+        logger.info { "[PromptsRouter] Finding slash prompt: $name" }
+        val slashPrompt = promptsService.findSlashPrompt(name)
+        return slashPrompt?.let { PromptResponse(prompt = it.toDto()) }
     }
 
     /**
-     * Save (create or update) a slash command.
+     * Save (create or update) a slash prompt.
      */
-    fun saveCommand(request: SaveCommandRequest): PromptResponse {
-        logger.info { "[PromptsRouter] Saving command: id=${request.id}, name=${request.name}" }
-        val command = promptsService.saveCommand(
+    fun saveSlashPrompt(request: SaveSlashPromptRequest): PromptResponse {
+        logger.info { "[PromptsRouter] Saving slash prompt: id=${request.id}, name=${request.name}" }
+        val slashPrompt = promptsService.saveSlashPrompt(
             id = request.id,
             name = request.name,
             content = request.content,
             description = request.description,
             isEnabled = request.isEnabled
         )
-        return PromptResponse(prompt = command.toDto())
+        return PromptResponse(prompt = slashPrompt.toDto())
     }
 
     // ===== General Operations =====
 
     /**
-     * Delete rule or command by ID.
+     * Delete rule or slash prompt by ID.
      */
     fun deletePrompt(id: String): DeletePromptResponse {
         logger.info { "[PromptsRouter] Deleting prompt: id=$id" }
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/api/routers/SnapshotRouter.kt b/core/src/main/kotlin/pl/jclab/refio/core/api/routers/SnapshotRouter.kt
new file mode 100644
index 00000000..534d537d
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/api/routers/SnapshotRouter.kt
@@ -0,0 +1,57 @@
+package pl.jclab.refio.core.api.routers
+
+import pl.jclab.refio.core.api.Router
+import pl.jclab.refio.core.api.SnapshotResponse
+import pl.jclab.refio.core.db.repositories.SnapshotRepository
+import pl.jclab.refio.core.logging.dualLogger
+import pl.jclab.refio.core.services.SnapshotService
+
+private val logger = dualLogger("SnapshotRouter")
+
+/**
+ * Router for file snapshot operations.
+ * Snapshots are taken before destructive file edits in AGENT mode for rollback.
+ *
+ * @property snapshotService Present only when CoreApiRouter has a projectRoot.
+ *                           When null, snapshot operations throw IllegalStateException.
+ */
+class SnapshotRouter(
+    private val snapshotService: SnapshotService?,
+    private val snapshotRepository: SnapshotRepository
+) : Router {
+
+    override suspend fun initialize() {
+        logger.info { "[SnapshotRouter] Initialized (snapshotService=${snapshotService != null})" }
+    }
+
+    override suspend fun shutdown() {
+        logger.info { "[SnapshotRouter] Shutting down" }
+    }
+
+    suspend fun getSnapshot(snapshotId: String): SnapshotResponse {
+        val service = snapshotService
+            ?: throw IllegalStateException("Snapshot operations require project context")
+        try {
+            val files = service.getSnapshot(snapshotId)
+            return SnapshotResponse(snapshotId = snapshotId, files = files)
+        } catch (e: Exception) {
+            logger.error(e) { "Failed to get snapshot: $snapshotId" }
+            throw e
+        }
+    }
+
+    suspend fun getSnapshotFileContent(snapshotId: String, filePath: String): String? {
+        val service = snapshotService
+            ?: throw IllegalStateException("Snapshot operations require project context")
+        return try {
+            service.getSnapshot(snapshotId)[filePath]
+        } catch (e: Exception) {
+            logger.error(e) { "Failed to get snapshot file content: $snapshotId/$filePath" }
+            null
+        }
+    }
+
+    fun deleteSnapshotsByTaskId(taskId: String): Int {
+        return snapshotRepository.deleteByTaskId(taskId)
+    }
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/config/ConfigKeys.kt b/core/src/main/kotlin/pl/jclab/refio/core/config/ConfigKeys.kt
index f062a7df..5f84e85c 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/config/ConfigKeys.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/config/ConfigKeys.kt
@@ -1,6 +1,5 @@
 package pl.jclab.refio.core.config
 
-import pl.jclab.refio.core.utils.GsonInstance.gson
 
 /**
  * Typed configuration key descriptor.
@@ -20,7 +19,13 @@ data class ConfigKey<T>(
     val parser: (String) -> T?,
     val default: T,
     val serializer: (T) -> String = { it.toString() },
-    val yamlAccessor: ((HierarchicalConfigLoader) -> Any?)? = null
+    val yamlAccessor: ((HierarchicalConfigLoader) -> Any?)? = null,
+    /**
+     * Optional validator invoked by [pl.jclab.refio.core.services.ConfigValidator] at startup
+     * and after YAML reloads. Return `false` to reject a value — the startup then fails loud
+     * with the offending key/value, rather than silently using a broken config.
+     */
+    val validator: (T) -> Boolean = { true }
 )
 
 /**
@@ -69,7 +74,8 @@ object ConfigKeys {
         key = "limits.api_call_timeout",
         parser = String::toIntOrNull,
         default = 360,
-        yamlAccessor = { it.getApiCallTimeout() }
+        yamlAccessor = { it.getApiCallTimeout() },
+        validator = { it > 0 }
     )
 
     val STREAMING_READ_TIMEOUT = ConfigKey(
@@ -97,21 +103,24 @@ object ConfigKeys {
         key = "limits.max_context_size",
         parser = String::toIntOrNull,
         default = 128000,
-        yamlAccessor = { it.getMaxContextSize() }
+        yamlAccessor = { it.getMaxContextSize() },
+        validator = { it in 1024..4_194_304 }
     )
 
     val MAX_OUTPUT_SIZE = ConfigKey(
         key = "limits.max_output_size",
         parser = String::toIntOrNull,
         default = 16384,
-        yamlAccessor = { it.getMaxOutputSize() }
+        yamlAccessor = { it.getMaxOutputSize() },
+        validator = { it > 0 }
     )
 
     val MAX_FILE_SIZE = ConfigKey(
         key = "limits.max_file_size",
         parser = String::toIntOrNull,
         default = 10,
-        yamlAccessor = { it.getMaxFileSize() }
+        yamlAccessor = { it.getMaxFileSize() },
+        validator = { it > 0 }
     )
 
     val MAX_RETRIES = ConfigKey(
@@ -168,6 +177,12 @@ object ConfigKeys {
         default = false
     )
 
+    val UI_MULTI_AGENT_STRATEGY = ConfigKey(
+        key = "ui.multi_agent_strategy",
+        parser = { it.trim().uppercase().takeIf { s -> s.isNotBlank() } },
+        default = "SINGLE"
+    )
+
     val UI_EXECUTION_MODE = ConfigKey(
         key = "ui.execution_mode",
         parser = { it.trim().uppercase().takeIf { s -> s.isNotBlank() } },
@@ -231,6 +246,18 @@ object ConfigKeys {
         default = "qwen2.5:7b"
     )
 
+    val STRONG_MODEL = ConfigKey<String?>(
+        key = "default_model.strong",
+        parser = { it.takeIf { s -> s.isNotBlank() } },
+        default = null
+    )
+
+    val OLLAMA_MAX_CONCURRENT = ConfigKey(
+        key = "providers.ollama_max_concurrent",
+        parser = String::toIntOrNull,
+        default = 1
+    )
+
     // ==================== RAG ====================
 
     val RAG_ENABLED = ConfigKey(
@@ -360,7 +387,8 @@ object ConfigKeys {
         key = "rag.search_semantic_weight",
         parser = String::toFloatOrNull,
         default = 0.7f,
-        yamlAccessor = { it.getRagSearchSemanticWeight() }
+        yamlAccessor = { it.getRagSearchSemanticWeight() },
+        validator = { it in 0f..1f }
     )
 
     val RAG_SEARCH_INCLUDE_CONTEXT_CHUNKS = ConfigKey(
@@ -427,31 +455,6 @@ object ConfigKeys {
         default = true
     )
 
-    val TERMINAL_WHITELIST = ConfigKey(
-        key = "terminal.whitelist",
-        parser = { it.takeIf { s -> s.isNotBlank() } },
-        default = "",
-        yamlAccessor = { loader ->
-            loader.getTerminalWhitelist()?.let {
-                gson.toJson(it)
-            }
-        }
-    )
-
-    val TERMINAL_WHITELIST_ENABLED = ConfigKey(
-        key = "terminal.whitelist.enabled",
-        parser = String::toBooleanStrictOrNull,
-        default = true,
-        yamlAccessor = { it.getTerminalWhitelistEnabled() }
-    )
-
-    val TERMINAL_WHITELIST_MODE = ConfigKey(
-        key = "terminal.whitelist.mode",
-        parser = { it.trim().uppercase().takeIf { s -> s.isNotBlank() } },
-        default = "WHITELIST_ONLY",
-        yamlAccessor = { it.getTerminalWhitelistMode()?.trim()?.uppercase() }
-    )
-
     // ==================== TOOL SUMMARY ====================
 
     val TOOL_SUMMARY_ENABLED = ConfigKey(
@@ -497,7 +500,8 @@ object ConfigKeys {
     val CONTEXT_BUDGET_INPUT_RATIO = ConfigKey(
         key = "context.budget.input_ratio",
         parser = String::toDoubleOrNull,
-        default = 0.85
+        default = 0.85,
+        validator = { it > 0.0 && it <= 1.0 }
     )
 
     val WORKING_MEMORY_MAX_FACTS = ConfigKey(
@@ -593,27 +597,27 @@ object ConfigKeys {
     )
 
     val PROVIDER_CUSTOM_OPENAI_API_KEY = ConfigKey<String?>(
-        key = "providers.custom_openai.custom_openai_api_key",
+        key = "providers.generic_openai.generic_openai_api_key",
         parser = { it.takeIf { s -> s.isNotBlank() } },
         default = null,
         serializer = { it ?: "" },
-        yamlAccessor = { it.getCustomOpenAIApiKey() }
+        yamlAccessor = { it.getGenericOpenAIApiKey() }
     )
 
     val PROVIDER_CUSTOM_OPENAI_BASE_URL = ConfigKey<String?>(
-        key = "providers.custom_openai.custom_openai_base_url",
+        key = "providers.generic_openai.generic_openai_base_url",
         parser = { it.takeIf { s -> s.isNotBlank() } },
         default = null,
         serializer = { it ?: "" },
-        yamlAccessor = { it.getCustomOpenAIBaseUrl() }
+        yamlAccessor = { it.getGenericOpenAIBaseUrl() }
     )
 
     val PROVIDER_CUSTOM_OPENAI_MODEL = ConfigKey<String?>(
-        key = "providers.custom_openai.custom_openai_model",
+        key = "providers.generic_openai.generic_openai_model",
         parser = { it.takeIf { s -> s.isNotBlank() } },
         default = null,
         serializer = { it ?: "" },
-        yamlAccessor = { it.getCustomOpenAIModel() }
+        yamlAccessor = { it.getGenericOpenAIModel() }
     )
 
     val PROVIDER_ZAI_API_KEY = ConfigKey<String?>(
@@ -707,6 +711,7 @@ object ConfigKeys {
             UI_NO_EGRESS_ENABLED,
             UI_ORCHESTRATION_ENABLED,
             UI_INTENT_CLASSIFICATION_ENABLED,
+            UI_MULTI_AGENT_STRATEGY,
             UI_EXECUTION_MODE,
             UI_SELECTED_MODE,
             UI_SELECTED_MODEL,
@@ -717,6 +722,7 @@ object ConfigKeys {
             DEFAULT_MODEL_PLAN,
             DEFAULT_MODEL_AGENT,
             WEAK_MODEL,
+            STRONG_MODEL,
             // RAG
             RAG_ENABLED,
             RAG_AUTO_INDEX_ON_CONTEXT,
@@ -747,9 +753,6 @@ object ConfigKeys {
             // Tools
             TOOLS_PERMISSIONS,
             TOOL_PERMISSION_RUN_TERMINAL,
-            TERMINAL_WHITELIST,
-            TERMINAL_WHITELIST_ENABLED,
-            TERMINAL_WHITELIST_MODE,
             // Tool summary
             TOOL_SUMMARY_ENABLED,
             TOOL_SUMMARY_MIN_LENGTH,
@@ -767,6 +770,7 @@ object ConfigKeys {
             PROVIDER_OLLAMA_ENDPOINT,
             PROVIDER_OLLAMA_CONTEXT_SIZE,
             PROVIDER_OLLAMA_KEEP_ALIVE,
+            OLLAMA_MAX_CONCURRENT,
             PROVIDER_ANTHROPIC_API_KEY,
             PROVIDER_OPENAI_API_KEY,
             PROVIDER_OPENROUTER_API_KEY,
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/config/ConfigYaml.kt b/core/src/main/kotlin/pl/jclab/refio/core/config/ConfigYaml.kt
index c7363504..a14f39dc 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/config/ConfigYaml.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/config/ConfigYaml.kt
@@ -1,8 +1,5 @@
 package pl.jclab.refio.core.config
 
-import com.charleskorn.kaml.Yaml
-import com.charleskorn.kaml.YamlConfiguration
-import kotlinx.serialization.SerialName
 import kotlinx.serialization.Serializable
 import java.io.File
 import java.nio.file.Path
@@ -33,1464 +30,47 @@ import java.nio.file.Path
  */
 @Serializable
 data class ConfigYaml(
-    /**
-     * General UI and behavior settings
-     */
     val general: GeneralConfig? = null,
-
-    /**
-     * Provider configurations (API keys, endpoints)
-     */
     val providers: ProvidersConfig? = null,
-
-    /**
-     * Default model selection per mode and visibility settings
-     */
     val models: ModelsConfig? = null,
-
-    /**
-     * System limits (timeouts, context size, etc.)
-     */
     val limits: LimitsConfig? = null,
-
-    /**
-     * Advanced settings (security, optimization)
-     */
     val advanced: AdvancedConfig? = null,
-
-    /**
-     * Tool permissions per mode
-     */
     val tools: ToolsConfig? = null,
-
-    /**
-     * Terminal command whitelist configuration
-     */
-    val terminal: TerminalConfig? = null,
-
-    /**
-     * RAG indexing configuration
-     */
     val rag: RagConfig? = null,
-
-    /**
-     * UI state settings (persisted between sessions)
-     */
     val ui: UiConfig? = null,
-
-    /**
-     * Custom prompts configuration (project-specific)
-     */
     val prompts: PromptsConfig? = null,
-
-    /**
-     * MCP server configurations (project-specific)
-     */
     val mcp: McpConfig? = null,
-
-    /**
-     * Hooks configuration (user-defined lifecycle actions)
-     */
     val hooks: HooksConfig? = null
 ) {
     companion object {
-        /**
-         * Get the path to the user's config YAML file in home directory
-         */
         fun getUserConfigPath(): File {
             val userHome = System.getProperty("user.home")
             return File(userHome, ".refio${File.separator}config.yaml")
         }
 
-        /**
-         * Get the path to the project's config YAML file
-         */
-        fun getProjectConfigPath(projectRoot: Path): File {
-            return projectRoot.resolve(".refio").resolve("config.yaml").toFile()
-        }
+        fun getProjectConfigPath(projectRoot: Path): File =
+            projectRoot.resolve(".refio").resolve("config.yaml").toFile()
 
-        /**
-         * Legacy alias for getUserConfigPath()
-         */
+        /** Legacy alias for [getUserConfigPath]. */
         fun getConfigPath(): File = getUserConfigPath()
 
-        /**
-         * Load user-level configuration from YAML file.
-         * Returns null if file doesn't exist or can't be parsed.
-         */
-        fun load(): ConfigYaml? = loadFromPath(getUserConfigPath())
-
-        /**
-         * Load user-level configuration from YAML file.
-         * Returns null if file doesn't exist or can't be parsed.
-         */
-        fun loadUserConfig(): ConfigYaml? = loadFromPath(getUserConfigPath())
-
-        /**
-         * Load project-level configuration from YAML file.
-         * Returns null if file doesn't exist or can't be parsed.
-         */
-        fun loadProjectConfig(projectRoot: Path): ConfigYaml? = loadFromPath(getProjectConfigPath(projectRoot))
-
-        /**
-         * Load configuration from a specific path.
-         * Returns null if file doesn't exist or can't be parsed.
-         */
-        private fun loadFromPath(configFile: File): ConfigYaml? {
-            if (!configFile.exists()) {
-                return null
-            }
-
-            return try {
-                val yamlContent = configFile.readText()
-                decodeYamlContent(yamlContent)
-            } catch (e: Exception) {
-                // Log error but don't fail - return null to indicate failure
-                println("Error loading config YAML from ${configFile.absolutePath}: ${e.message}")
-                null
-            }
-        }
-
-        private fun decodeYamlContent(yamlContent: String): ConfigYaml {
-            val yaml = Yaml(
-                configuration = YamlConfiguration(
-                    strictMode = false  // Allow unknown fields for forward compatibility
-                )
-            )
-
-            val firstAttempt = runCatching {
-                yaml.decodeFromString(serializer(), yamlContent)
-            }
-            if (firstAttempt.isSuccess) {
-                return firstAttempt.getOrThrow()
-            }
-
-            val sanitizedEscapes = sanitizeInvalidDoubleQuotedEscapes(yamlContent)
-            val sanitizedBrokenLines = sanitizeBrokenStandaloneEmptyQuotedLines(sanitizedEscapes)
-
-            if (sanitizedBrokenLines == yamlContent) {
-                throw firstAttempt.exceptionOrNull() ?: IllegalStateException("Unknown YAML parsing error")
-            }
-
-            val secondAttempt = runCatching {
-                yaml.decodeFromString(serializer(), sanitizedBrokenLines)
-            }
-            if (secondAttempt.isSuccess) {
-                if (sanitizedEscapes != yamlContent) {
-                    println("Config YAML parser fallback: sanitized invalid double-quoted escape sequences")
-                }
-                if (sanitizedBrokenLines != sanitizedEscapes) {
-                    println("Config YAML parser fallback: repaired broken standalone empty-quoted lines")
-                }
-                return secondAttempt.getOrThrow()
-            }
-
-            throw secondAttempt.exceptionOrNull()
-                ?: firstAttempt.exceptionOrNull()
-                ?: IllegalStateException("Unknown YAML parsing error")
-        }
-
-        private fun sanitizeInvalidDoubleQuotedEscapes(input: String): String {
-            val out = StringBuilder(input.length + 32)
-            var inDoubleQuoted = false
-            var inSingleQuoted = false
-            var inComment = false
-            var i = 0
-
-            while (i < input.length) {
-                val ch = input[i]
-
-                if (inComment) {
-                    out.append(ch)
-                    if (ch == '\n') {
-                        inComment = false
-                    }
-                    i++
-                    continue
-                }
-
-                if (inSingleQuoted) {
-                    out.append(ch)
-                    if (ch == '\'') {
-                        if (i + 1 < input.length && input[i + 1] == '\'') {
-                            out.append('\'')
-                            i += 2
-                            continue
-                        }
-                        inSingleQuoted = false
-                    }
-                    i++
-                    continue
-                }
-
-                if (inDoubleQuoted) {
-                    if (ch == '"') {
-                        inDoubleQuoted = false
-                        out.append(ch)
-                        i++
-                        continue
-                    }
-
-                    if (ch == '\\') {
-                        val next = input.getOrNull(i + 1)
-                        if (next == null || !isValidYamlEscape(input, i + 1)) {
-                            out.append("\\\\")
-                            i++
-                            continue
-                        }
-                    }
-
-                    out.append(ch)
-                    i++
-                    continue
-                }
-
-                when (ch) {
-                    '#' -> inComment = true
-                    '"' -> inDoubleQuoted = true
-                    '\'' -> inSingleQuoted = true
-                }
-                out.append(ch)
-                i++
-            }
-
-            return out.toString()
-        }
-
-        private fun isValidYamlEscape(input: String, escapeCharIndex: Int): Boolean {
-            val escapeChar = input.getOrNull(escapeCharIndex) ?: return false
-            return when (escapeChar) {
-                '0', 'a', 'b', 't', 'n', 'v', 'f', 'r', 'e', ' ', '"', '/', '\\', 'N', '_', 'L', 'P' -> true
-                'x' -> hasHexDigits(input, escapeCharIndex + 1, 2)
-                'u' -> hasHexDigits(input, escapeCharIndex + 1, 4)
-                'U' -> hasHexDigits(input, escapeCharIndex + 1, 8)
-                else -> false
-            }
-        }
-
-        private fun hasHexDigits(input: String, start: Int, length: Int): Boolean {
-            if (start + length > input.length) {
-                return false
-            }
-            for (idx in start until start + length) {
-                if (!input[idx].isDigit() && input[idx].lowercaseChar() !in 'a'..'f') {
-                    return false
-                }
-            }
-            return true
-        }
-
-        private fun sanitizeBrokenStandaloneEmptyQuotedLines(input: String): String {
-            val lines = input.split('\n')
-            val out = ArrayList<String>(lines.size)
-
-            var previousSignificant: String? = null
-            var lastListIndent: Int? = null
-            var changed = false
-
-            for (line in lines) {
-                val trimmed = line.trim()
-
-                if (trimmed == "''") {
-                    val replacement = when {
-                        lastListIndent != null -> "${" ".repeat(lastListIndent)}- ''"
-                        previousSignificant?.trimEnd()?.endsWith(":") == true -> {
-                            val baseIndent = previousSignificant.takeWhile { it == ' ' }.length
-                            "${" ".repeat(baseIndent + 2)}- ''"
-                        }
-                        else -> "- ''"
-                    }
-                    out.add(replacement)
-                    previousSignificant = replacement
-                    lastListIndent = replacement.takeWhile { it == ' ' }.length
-                    changed = true
-                    continue
-                }
-
-                out.add(line)
-                if (trimmed.isNotEmpty() && !trimmed.startsWith("#")) {
-                    previousSignificant = line
-                    if (trimmed.startsWith("- ")) {
-                        lastListIndent = line.takeWhile { it == ' ' }.length
-                    } else if (trimmed.endsWith(":")) {
-                        lastListIndent = null
-                    }
-                }
-            }
-
-            if (!changed) {
-                return input
-            }
-            return out.joinToString("\n")
-        }
-
-        /**
-         * Merge two configs - values from 'override' take precedence over 'base'
-         */
-        fun merge(base: ConfigYaml?, override: ConfigYaml?): ConfigYaml {
-            if (base == null) return override ?: ConfigYaml()
-            if (override == null) return base
-
-            return ConfigYaml(
-                general = mergeGeneral(base.general, override.general),
-                providers = mergeProviders(base.providers, override.providers),
-                models = mergeModels(base.models, override.models),
-                limits = mergeLimits(base.limits, override.limits),
-                advanced = mergeAdvanced(base.advanced, override.advanced),
-                tools = mergeTools(base.tools, override.tools),
-                terminal = mergeTerminal(base.terminal, override.terminal),
-                rag = mergeRag(base.rag, override.rag),
-                ui = mergeUi(base.ui, override.ui),
-                prompts = mergePrompts(base.prompts, override.prompts),
-                mcp = mergeMcp(base.mcp, override.mcp),
-                hooks = mergeHooks(base.hooks, override.hooks)
-            )
-        }
-
-        private fun mergeGeneral(base: GeneralConfig?, override: GeneralConfig?): GeneralConfig? {
-            if (base == null) return override
-            if (override == null) return base
-
-            return GeneralConfig(
-                formatMarkdown = override.formatMarkdown ?: base.formatMarkdown,
-                streamingEnabled = override.streamingEnabled ?: base.streamingEnabled,
-                advancedView = override.advancedView ?: base.advancedView
-            )
-        }
-
-        private fun mergeProviders(base: ProvidersConfig?, override: ProvidersConfig?): ProvidersConfig? {
-            if (base == null) return override
-            if (override == null) return base
-
-            return ProvidersConfig(
-                ollama = override.ollama ?: base.ollama,
-                anthropic = override.anthropic ?: base.anthropic,
-                openai = override.openai ?: base.openai,
-                openrouter = override.openrouter ?: base.openrouter,
-                gemini = override.gemini ?: base.gemini,
-                lmstudio = override.lmstudio ?: base.lmstudio,
-                customOpenai = override.customOpenai ?: base.customOpenai,
-                zai = override.zai ?: base.zai
-            )
-        }
-
-        private fun mergeModels(base: ModelsConfig?, override: ModelsConfig?): ModelsConfig? {
-            if (base == null) return override
-            if (override == null) return base
-
-            return ModelsConfig(
-                default = override.default ?: base.default,
-                defaults = override.defaults ?: base.defaults,
-                visibility = mergeVisibility(base.visibility, override.visibility),
-                presets = override.presets ?: base.presets
-            )
-        }
-
-        private fun mergeVisibility(base: Map<String, Boolean>?, override: Map<String, Boolean>?): Map<String, Boolean>? {
-            if (base == null) return override
-            if (override == null) return base
-            return base + override  // override wins on conflicts
-        }
-
-        private fun mergeLimits(base: LimitsConfig?, override: LimitsConfig?): LimitsConfig? {
-            if (base == null) return override
-            if (override == null) return base
-
-            return LimitsConfig(
-                apiCallTimeout = override.apiCallTimeout ?: base.apiCallTimeout,
-                toolExecutionTimeout = override.toolExecutionTimeout ?: base.toolExecutionTimeout,
-                streamingReadTimeout = override.streamingReadTimeout ?: base.streamingReadTimeout,
-                streamingRequestTimeout = override.streamingRequestTimeout ?: base.streamingRequestTimeout,
-                maxContextSize = override.maxContextSize ?: base.maxContextSize,
-                maxOutputSize = override.maxOutputSize ?: base.maxOutputSize,
-                maxFileSize = override.maxFileSize ?: base.maxFileSize
-            )
-        }
-
-        private fun mergeAdvanced(base: AdvancedConfig?, override: AdvancedConfig?): AdvancedConfig? {
-            if (base == null) return override
-            if (override == null) return base
-
-            return AdvancedConfig(
-                noEgressDefault = override.noEgressDefault ?: base.noEgressDefault,
-                readOnlyMode = override.readOnlyMode ?: base.readOnlyMode,
-                autoOptimizePercentage = override.autoOptimizePercentage ?: base.autoOptimizePercentage,
-                orchestrationEnabled = override.orchestrationEnabled ?: base.orchestrationEnabled
-            )
-        }
-
-        private fun mergeRag(base: RagConfig?, override: RagConfig?): RagConfig? {
-            if (base == null) return override
-            if (override == null) return base
-
-            return RagConfig(
-                enabled = override.enabled ?: base.enabled,
-                indexOnStartup = override.indexOnStartup ?: base.indexOnStartup,
-                autoIndexOnContextBuild = override.autoIndexOnContextBuild ?: base.autoIndexOnContextBuild,
-                maxFileSizeMB = override.maxFileSizeMB ?: base.maxFileSizeMB,
-                maxChunksPerFile = override.maxChunksPerFile ?: base.maxChunksPerFile,
-                indexBatchSize = override.indexBatchSize ?: base.indexBatchSize,
-                embeddingsBatchSize = override.embeddingsBatchSize ?: base.embeddingsBatchSize,
-                cacheTtlMs = override.cacheTtlMs ?: base.cacheTtlMs,
-                maxConcurrentJobs = override.maxConcurrentJobs ?: base.maxConcurrentJobs,
-                ignoredDirectories = mergeList(base.ignoredDirectories, override.ignoredDirectories),
-                searchSimilarityThreshold = override.searchSimilarityThreshold ?: base.searchSimilarityThreshold,
-                searchTopK = override.searchTopK ?: base.searchTopK,
-                searchHybridEnabled = override.searchHybridEnabled ?: base.searchHybridEnabled,
-                searchSemanticWeight = override.searchSemanticWeight ?: base.searchSemanticWeight,
-                searchIncludeContextChunks = override.searchIncludeContextChunks ?: base.searchIncludeContextChunks
-            )
-        }
-
-        private fun mergeUi(base: UiConfig?, override: UiConfig?): UiConfig? {
-            if (base == null) return override
-            if (override == null) return base
-
-            return UiConfig(
-                thinkingEnabled = override.thinkingEnabled ?: base.thinkingEnabled,
-                noEgressEnabled = override.noEgressEnabled ?: base.noEgressEnabled,
-                executionMode = override.executionMode ?: base.executionMode,
-                selectedMode = override.selectedMode ?: base.selectedMode,
-                selectedModel = override.selectedModel ?: base.selectedModel
-            )
-        }
-
-        private fun mergeTools(base: ToolsConfig?, override: ToolsConfig?): ToolsConfig? {
-            if (base == null) return override
-            if (override == null) return base
-
-            val mergedPermissions = (base.permissions ?: emptyMap()) + (override.permissions ?: emptyMap())
-            return ToolsConfig(permissions = mergedPermissions)
-        }
-
-        private fun mergeTerminal(base: TerminalConfig?, override: TerminalConfig?): TerminalConfig? {
-            if (base == null) return override
-            if (override == null) return base
-
-            val baseWhitelist = base.whitelist
-            val overrideWhitelist = override.whitelist
-            if (baseWhitelist == null && overrideWhitelist == null) {
-                return TerminalConfig()
-            }
-
-            val mergedCommands = mergeTerminalCommands(
-                baseWhitelist?.commands,
-                overrideWhitelist?.commands
-            )
-
-            val mergedWhitelist = TerminalWhitelistConfig(
-                enabled = overrideWhitelist?.enabled ?: baseWhitelist?.enabled,
-                mode = overrideWhitelist?.mode ?: baseWhitelist?.mode,
-                globalBlockedPatterns = ((baseWhitelist?.globalBlockedPatterns ?: emptyList()) +
-                    (overrideWhitelist?.globalBlockedPatterns ?: emptyList())).distinct(),
-                commands = mergedCommands
-            )
-
-            return TerminalConfig(whitelist = mergedWhitelist)
-        }
-
-        private fun mergeTerminalCommands(
-            base: List<TerminalCommandConfig>?,
-            override: List<TerminalCommandConfig>?
-        ): List<TerminalCommandConfig>? {
-            if (base == null) return override
-            if (override == null) return base
-
-            val byProgram = linkedMapOf<String, TerminalCommandConfig>()
-            base.forEach { byProgram[it.program.lowercase()] = it }
-            override.forEach { byProgram[it.program.lowercase()] = it }
-            return byProgram.values.toList()
-        }
-
-        private fun mergePrompts(base: PromptsConfig?, override: PromptsConfig?): PromptsConfig? {
-            if (base == null) return override
-            if (override == null) return base
-
-            return PromptsConfig(
-                systemChat = override.systemChat ?: base.systemChat,
-                systemPlan = override.systemPlan ?: base.systemPlan,
-                systemAgent = override.systemAgent ?: base.systemAgent,
-                commands = mergeList(base.commands, override.commands),
-                rules = mergeList(base.rules, override.rules)
-            )
-        }
-
-        private fun <T> mergeList(base: List<T>?, override: List<T>?): List<T>? {
-            if (base == null) return override
-            if (override == null) return base
-            return base + override
-        }
-
-        private fun mergeMcp(base: McpConfig?, override: McpConfig?): McpConfig? {
-            if (base == null) return override
-            if (override == null) return base
-
-            val mergedServers = (base.servers ?: emptyList()) + (override.servers ?: emptyList())
-            return McpConfig(servers = mergedServers.distinctBy { it.id })
-        }
-
-        private fun mergeHooks(base: HooksConfig?, override: HooksConfig?): HooksConfig? {
-            if (base == null) return override
-            if (override == null) return base
-
-            return HooksConfig(
-                beforeTurnLoop = override.beforeTurnLoop ?: base.beforeTurnLoop,
-                afterTurnLoop = override.afterTurnLoop ?: base.afterTurnLoop,
-                beforeTool = override.beforeTool ?: base.beforeTool,
-                afterTool = override.afterTool ?: base.afterTool,
-                onAgentComplete = override.onAgentComplete ?: base.onAgentComplete,
-                onAgentError = override.onAgentError ?: base.onAgentError
-            )
-        }
-
-        /**
-         * Serialize ConfigYaml to YAML string.
-         */
-        fun toYamlString(config: ConfigYaml): String {
-            return Yaml(
-                configuration = YamlConfiguration(
-                    strictMode = false
-                )
-            ).encodeToString(serializer(), config)
-        }
-
-        /**
-         * Save configuration to a file.
-         *
-         * @param config Configuration to save
-         * @param file Target file
-         * @param withComments If true, adds helpful comments to the output
-         */
-        fun saveToFile(config: ConfigYaml, file: File, withComments: Boolean = true) {
-            // Ensure parent directory exists
-            file.parentFile?.mkdirs()
-
-            val content = if (withComments) {
-                createCommentedYaml(config)
-            } else {
-                toYamlString(config)
-            }
-
-            file.writeText(content)
-        }
-
-        /**
-         * Create YAML string with helpful comments.
-         */
-        private fun createCommentedYaml(config: ConfigYaml): String {
-            val sb = StringBuilder()
-
-            sb.appendLine("# ═══════════════════════════════════════════════════════════════════════════════")
-            sb.appendLine("# Refio Configuration File")
-            sb.appendLine("# Generated: ${java.time.LocalDateTime.now()}")
-            sb.appendLine("# ═══════════════════════════════════════════════════════════════════════════════")
-            sb.appendLine()
-
-            // General
-            config.general?.let { general ->
-                sb.appendLine("# ─────────────────────────────────────────────────────────────────────────────")
-                sb.appendLine("# General Settings")
-                sb.appendLine("# ─────────────────────────────────────────────────────────────────────────────")
-                sb.appendLine("general:")
-                general.formatMarkdown?.let { sb.appendLine("  formatMarkdown: $it") }
-                general.streamingEnabled?.let { sb.appendLine("  streamingEnabled: $it") }
-                general.advancedView?.let { sb.appendLine("  advancedView: $it") }
-                sb.appendLine()
-            }
-
-            // Providers
-            config.providers?.let { providers ->
-                sb.appendLine("# ─────────────────────────────────────────────────────────────────────────────")
-                sb.appendLine("# LLM Provider Configuration")
-                sb.appendLine("# ─────────────────────────────────────────────────────────────────────────────")
-                sb.appendLine("providers:")
-
-                providers.ollama?.let { ollama ->
-                    sb.appendLine("  ollama:")
-                    ollama.endpoint?.let { sb.appendLine("    endpoint: \"$it\"") }
-                    ollama.contextSize?.let { sb.appendLine("    contextSize: $it") }
-                }
-
-                providers.anthropic?.let { anthropic ->
-                    sb.appendLine("  anthropic:")
-                    anthropic.apiKey?.let {
-                        sb.appendLine("    apiKey: \"$it\"")
-                    }
-                }
-
-                providers.openai?.let { openai ->
-                    sb.appendLine("  openai:")
-                    openai.apiKey?.let {
-                        sb.appendLine("    apiKey: \"$it\"")
-                    }
-                }
-
-                providers.openrouter?.let { openrouter ->
-                    sb.appendLine("  openrouter:")
-                    openrouter.apiKey?.let {
-                        sb.appendLine("    apiKey: \"$it\"")
-                    }
-                }
-
-                providers.gemini?.let { gemini ->
-                    sb.appendLine("  gemini:")
-                    gemini.apiKey?.let {
-                        sb.appendLine("    apiKey: \"$it\"")
-                    }
-                }
-
-                providers.lmstudio?.let { lmstudio ->
-                    sb.appendLine("  lmstudio:")
-                    lmstudio.apiKey?.let { sb.appendLine("    apiKey: \"$it\"") }
-                    lmstudio.baseUrl?.let { sb.appendLine("    baseUrl: \"$it\"") }
-                    lmstudio.contextSize?.let { sb.appendLine("    contextSize: $it") }
-                }
-
-                providers.customOpenai?.let { customOpenai ->
-                    sb.appendLine("  custom_openai:")
-                    customOpenai.apiKey?.let { sb.appendLine("    apiKey: \"$it\"") }
-                    customOpenai.baseUrl?.let { sb.appendLine("    baseUrl: \"$it\"") }
-                    customOpenai.model?.let { sb.appendLine("    model: \"$it\"") }
-                }
-
-                providers.zai?.let { zai ->
-                    sb.appendLine("  zai:")
-                    zai.apiKey?.let { sb.appendLine("    apiKey: \"$it\"") }
-                    zai.baseUrl?.let { sb.appendLine("    baseUrl: \"$it\"") }
-                }
-
-                sb.appendLine()
-            }
-
-            // Models
-            config.models?.let { models ->
-                sb.appendLine("# ─────────────────────────────────────────────────────────────────────────────")
-                sb.appendLine("# Model Configuration")
-                sb.appendLine("# ─────────────────────────────────────────────────────────────────────────────")
-                sb.appendLine("models:")
-
-                models.defaults?.let { defaults ->
-                    sb.appendLine("  defaults:")
-                    defaults.chat?.let { sb.appendLine("    chat: \"$it\"") }
-                    defaults.plan?.let { sb.appendLine("    plan: \"$it\"") }
-                    defaults.coding?.let { sb.appendLine("    coding: \"$it\"") }
-                    defaults.weak?.let { sb.appendLine("    weak: \"$it\"") }
-                    defaults.embedding?.let { sb.appendLine("    embedding: \"$it\"") }
-                }
-
-                models.visibility?.let { visibility ->
-                    if (visibility.isNotEmpty()) {
-                        sb.appendLine("  visibility:")
-                        visibility.forEach { (model, visible) ->
-                            sb.appendLine("    \"$model\": $visible")
-                        }
-                    }
-                }
-
-                sb.appendLine()
-            }
-
-            // Limits
-            config.limits?.let { limits ->
-                sb.appendLine("# ─────────────────────────────────────────────────────────────────────────────")
-                sb.appendLine("# System Limits")
-                sb.appendLine("# ─────────────────────────────────────────────────────────────────────────────")
-                sb.appendLine("limits:")
-                limits.apiCallTimeout?.let { sb.appendLine("  apiCallTimeout: $it") }
-                limits.toolExecutionTimeout?.let { sb.appendLine("  toolExecutionTimeout: $it") }
-                limits.streamingReadTimeout?.let { sb.appendLine("  streamingReadTimeout: $it") }
-                limits.streamingRequestTimeout?.let { sb.appendLine("  streamingRequestTimeout: $it") }
-                limits.maxContextSize?.let { sb.appendLine("  maxContextSize: $it") }
-                limits.maxOutputSize?.let { sb.appendLine("  maxOutputSize: $it") }
-                limits.maxFileSize?.let { sb.appendLine("  maxFileSize: $it") }
-                sb.appendLine()
-            }
+        fun load(): ConfigYaml? = ConfigYamlIO.loadFromPath(getUserConfigPath())
 
-            // Advanced
-            config.advanced?.let { advanced ->
-                sb.appendLine("# ─────────────────────────────────────────────────────────────────────────────")
-                sb.appendLine("# Advanced Settings")
-                sb.appendLine("# ─────────────────────────────────────────────────────────────────────────────")
-                sb.appendLine("advanced:")
-                advanced.noEgressDefault?.let { sb.appendLine("  noEgressDefault: $it") }
-                advanced.readOnlyMode?.let { sb.appendLine("  readOnlyMode: $it") }
-                advanced.autoOptimizePercentage?.let { sb.appendLine("  autoOptimizePercentage: $it") }
-                sb.appendLine()
-            }
+        fun loadUserConfig(): ConfigYaml? = ConfigYamlIO.loadFromPath(getUserConfigPath())
 
-            // Tools
-            config.tools?.let { tools ->
-                tools.permissions?.let { permissions ->
-                    if (permissions.isNotEmpty()) {
-                        sb.appendLine("# ─────────────────────────────────────────────────────────────────────────────")
-                        sb.appendLine("# Tool Permissions")
-                        sb.appendLine("# ─────────────────────────────────────────────────────────────────────────────")
-                        sb.appendLine("tools:")
-                        sb.appendLine("  permissions:")
-                        permissions.forEach { (tool, perm) ->
-                            sb.appendLine("    $tool:")
-                            perm.planMode?.let { sb.appendLine("      planMode: \"$it\"") }
-                            perm.agentMode?.let { sb.appendLine("      agentMode: \"$it\"") }
-                        }
-                        sb.appendLine()
-                    }
-                }
-            }
+        fun loadProjectConfig(projectRoot: Path): ConfigYaml? =
+            ConfigYamlIO.loadFromPath(getProjectConfigPath(projectRoot))
 
-            // Terminal whitelist
-            config.terminal?.whitelist?.let { whitelist ->
-                sb.appendLine("# Terminal Command Whitelist")
-                sb.appendLine("terminal:")
-                sb.appendLine("  whitelist:")
-                whitelist.enabled?.let { sb.appendLine("    enabled: $it") }
-                whitelist.mode?.let { sb.appendLine("    mode: ${yamlDoubleQuoted(it)}") }
-                whitelist.globalBlockedPatterns?.let { patterns ->
-                    if (patterns.isNotEmpty()) {
-                        sb.appendLine("    globalBlockedPatterns:")
-                        patterns.forEach { pattern ->
-                            sb.appendLine("      - ${yamlDoubleQuoted(pattern)}")
-                        }
-                    }
-                }
-                whitelist.commands?.let { commands ->
-                    if (commands.isNotEmpty()) {
-                        sb.appendLine("    commands:")
-                        commands.forEach { command ->
-                            sb.appendLine("      - program: ${yamlDoubleQuoted(command.program)}")
-                            command.description?.let { sb.appendLine("        description: ${yamlDoubleQuoted(it)}") }
-                            command.aliases?.let { aliases ->
-                                if (aliases.isNotEmpty()) {
-                                    sb.appendLine("        aliases:")
-                                    aliases.forEach { alias -> sb.appendLine("          - ${yamlDoubleQuoted(alias)}") }
-                                }
-                            }
-                            command.allowedSubcommands?.let { subs ->
-                                if (subs.isNotEmpty()) {
-                                    sb.appendLine("        allowedSubcommands:")
-                                    subs.forEach { sub -> sb.appendLine("          - ${yamlDoubleQuoted(sub)}") }
-                                }
-                            }
-                            command.blockedSubcommands?.let { subs ->
-                                if (subs.isNotEmpty()) {
-                                    sb.appendLine("        blockedSubcommands:")
-                                    subs.forEach { sub -> sb.appendLine("          - ${yamlDoubleQuoted(sub)}") }
-                                }
-                            }
-                            command.blockedFlags?.let { flags ->
-                                if (flags.isNotEmpty()) {
-                                    sb.appendLine("        blockedFlags:")
-                                    flags.forEach { flag -> sb.appendLine("          - ${yamlDoubleQuoted(flag)}") }
-                                }
-                            }
-                            command.blockedArgPatterns?.let { argPatterns ->
-                                if (argPatterns.isNotEmpty()) {
-                                    sb.appendLine("        blockedArgPatterns:")
-                                    argPatterns.forEach { argPattern -> sb.appendLine("          - ${yamlDoubleQuoted(argPattern)}") }
-                                }
-                            }
-                            command.maxArgs?.let { sb.appendLine("        maxArgs: $it") }
-                            command.requireConfirmation?.let { sb.appendLine("        requireConfirmation: $it") }
-                        }
-                    }
-                }
-                sb.appendLine()
-            }
+        /** Merge two configs — values from `override` take precedence over `base`. */
+        fun merge(base: ConfigYaml?, override: ConfigYaml?): ConfigYaml =
+            ConfigYamlMerger.merge(base, override)
 
-            // RAG
-            config.rag?.let { rag ->
-                sb.appendLine("# ─────────────────────────────────────────────────────────────────────────────")
-                sb.appendLine("# RAG Configuration")
-                sb.appendLine("# ─────────────────────────────────────────────────────────────────────────────")
-                sb.appendLine("rag:")
-                rag.enabled?.let { sb.appendLine("  enabled: $it") }
-                rag.indexOnStartup?.let { sb.appendLine("  indexOnStartup: $it") }
-                rag.autoIndexOnContextBuild?.let { sb.appendLine("  autoIndexOnContextBuild: $it") }
-                rag.maxFileSizeMB?.let { sb.appendLine("  maxFileSizeMB: $it") }
-                rag.maxChunksPerFile?.let { sb.appendLine("  maxChunksPerFile: $it") }
-                rag.indexBatchSize?.let { sb.appendLine("  indexBatchSize: $it") }
-                rag.embeddingsBatchSize?.let { sb.appendLine("  embeddingsBatchSize: $it") }
-                rag.cacheTtlMs?.let { sb.appendLine("  cacheTtlMs: $it") }
-                rag.maxConcurrentJobs?.let { sb.appendLine("  maxConcurrentJobs: $it") }
-                rag.ignoredDirectories?.let { dirs ->
-                    if (dirs.isNotEmpty()) {
-                        sb.appendLine("  ignoredDirectories:")
-                        dirs.forEach { sb.appendLine("    - \"$it\"") }
-                    }
-                }
-                sb.appendLine()
-            }
+        fun toYamlString(config: ConfigYaml): String = ConfigYamlIO.toYamlString(config)
 
-            // UI
-            config.ui?.let { ui ->
-                sb.appendLine("# ─────────────────────────────────────────────────────────────────────────────")
-                sb.appendLine("# UI State")
-                sb.appendLine("# ─────────────────────────────────────────────────────────────────────────────")
-                sb.appendLine("ui:")
-                ui.thinkingEnabled?.let { sb.appendLine("  thinkingEnabled: $it") }
-                ui.noEgressEnabled?.let { sb.appendLine("  noEgressEnabled: $it") }
-                ui.executionMode?.let { sb.appendLine("  executionMode: \"$it\"") }
-                ui.selectedMode?.let { sb.appendLine("  selectedMode: \"$it\"") }
-                ui.selectedModel?.let { sb.appendLine("  selectedModel: \"$it\"") }
-                sb.appendLine()
-            }
+        fun saveToFile(config: ConfigYaml, file: File, withComments: Boolean = true) =
+            ConfigYamlIO.saveToFile(config, file, withComments)
 
-            // Prompts (project-specific)
-            config.prompts?.let { prompts ->
-                sb.appendLine("# ─────────────────────────────────────────────────────────────────────────────")
-                sb.appendLine("# Custom Prompts (project-specific)")
-                sb.appendLine("# ─────────────────────────────────────────────────────────────────────────────")
-                sb.appendLine("prompts:")
-
-                prompts.systemChat?.let {
-                    sb.appendLine("  systemChat: |")
-                    it.lines().forEach { line -> sb.appendLine("    $line") }
-                }
-
-                prompts.systemPlan?.let {
-                    sb.appendLine("  systemPlan: |")
-                    it.lines().forEach { line -> sb.appendLine("    $line") }
-                }
-
-                prompts.systemAgent?.let {
-                    sb.appendLine("  systemAgent: |")
-                    it.lines().forEach { line -> sb.appendLine("    $line") }
-                }
-
-                prompts.commands?.let { commands ->
-                    if (commands.isNotEmpty()) {
-                        sb.appendLine("  commands:")
-                        commands.forEach { cmd ->
-                            sb.appendLine("    - name: \"${cmd.name}\"")
-                            cmd.description?.let { sb.appendLine("      description: \"$it\"") }
-                            sb.appendLine("      content: \"${cmd.content.replace("\"", "\\\"")}\"")
-                            sb.appendLine("      enabled: ${cmd.enabled}")
-                        }
-                    }
-                }
-
-                prompts.rules?.let { rules ->
-                    if (rules.isNotEmpty()) {
-                        sb.appendLine("  rules:")
-                        rules.forEach { rule ->
-                            sb.appendLine("    - name: \"${rule.name}\"")
-                            sb.appendLine("      content: \"${rule.content.replace("\"", "\\\"")}\"")
-                            sb.appendLine("      enabled: ${rule.enabled}")
-                        }
-                    }
-                }
-
-                sb.appendLine()
-            }
-
-            // MCP (project-specific)
-            config.mcp?.let { mcp ->
-                mcp.servers?.let { servers ->
-                    if (servers.isNotEmpty()) {
-                        sb.appendLine("# ─────────────────────────────────────────────────────────────────────────────")
-                        sb.appendLine("# MCP Server Configuration (project-specific)")
-                        sb.appendLine("# ─────────────────────────────────────────────────────────────────────────────")
-                        sb.appendLine("mcp:")
-                        sb.appendLine("  servers:")
-                        servers.forEach { server ->
-                            sb.appendLine("    - id: \"${server.id}\"")
-                            server.displayName?.let { sb.appendLine("      displayName: \"$it\"") }
-                            sb.appendLine("      type: \"${server.type}\"")
-                            server.command?.let { sb.appendLine("      command: \"$it\"") }
-                            server.args?.let { args ->
-                                if (args.isNotEmpty()) {
-                                    sb.appendLine("      args: [${args.joinToString(", ") { "\"$it\"" }}]")
-                                }
-                            }
-                            server.url?.let { sb.appendLine("      url: \"$it\"") }
-                            sb.appendLine("      accessMode: \"${server.accessMode}\"")
-                            sb.appendLine("      enabled: ${server.enabled}")
-                            server.env?.let { envs ->
-                                if (envs.isNotEmpty()) {
-                                    sb.appendLine("      env:")
-                                    envs.forEach { env ->
-                                        sb.appendLine("        - name: \"${env.name}\"")
-                                        val value = if (env.isSecret) "***" else env.value
-                                        sb.appendLine("          value: \"$value\"")
-                                        if (env.isSecret) sb.appendLine("          isSecret: true")
-                                    }
-                                }
-                            }
-                        }
-                        sb.appendLine()
-                    }
-                }
-            }
-
-            return sb.toString()
-        }
-
-        private fun yamlDoubleQuoted(value: String): String {
-            val escaped = value
-                .replace("\\", "\\\\")
-                .replace("\"", "\\\"")
-                .replace("\r", "\\r")
-                .replace("\n", "\\n")
-                .replace("\t", "\\t")
-            return "\"$escaped\""
-        }
-
-        /**
-         * Create example config file with all available options documented
-         */
-        fun createExampleConfig(): String {
-            return """
-# ═══════════════════════════════════════════════════════════════════════════════
-# Refio Configuration File
-# ═══════════════════════════════════════════════════════════════════════════════
-#
-# Location:
-#   User config:    ~/.refio/config.yaml (Linux/macOS) or %USERPROFILE%\.refio\config.yaml (Windows)
-#   Project config: <project_root>/.refio/config.yaml
-#
-# Configuration Hierarchy (lowest to highest priority):
-#   1. Built-in defaults (hardcoded)
-#   2. User config file (~/.refio/config.yaml)
-#   3. Project config file (<project>/.refio/config.yaml)
-#   4. Database overrides (changes made in Settings UI)
-#
-# All fields are optional. Missing fields use built-in defaults.
-# ═══════════════════════════════════════════════════════════════════════════════
-
-# ─────────────────────────────────────────────────────────────────────────────
-# General Settings
-# ─────────────────────────────────────────────────────────────────────────────
-general:
-  formatMarkdown: true          # Format responses as markdown
-  streamingEnabled: true        # Stream LLM responses in real-time
-  advancedView: false           # Show advanced UI tabs (Steps, Context, RAG, Debug)
-
-# ─────────────────────────────────────────────────────────────────────────────
-# LLM Provider Configuration
-# ─────────────────────────────────────────────────────────────────────────────
-providers:
-  ollama:
-    endpoint: "http://localhost:11434"
-    contextSize: 32768          # Context window size in tokens
-
-  anthropic:
-    apiKey: ""                  # sk-ant-...
-
-  openai:
-    apiKey: ""                  # sk-...
-
-  openrouter:
-    apiKey: ""                  # sk-or-...
-
-  gemini:
-    apiKey: ""                  # AIza...
-
-  lmstudio:
-    baseUrl: "http://localhost:1234/v1"
-    contextSize: 32768
-
-# ─────────────────────────────────────────────────────────────────────────────
-# Model Configuration
-# ─────────────────────────────────────────────────────────────────────────────
-models:
-  # Default models per operation mode (format: "provider/model-id")
-  defaults:
-    chat: "ollama/qwen2.5:7b"       # Default chat/conversation model
-    plan: "ollama/qwen2.5:7b"       # Model for planning operations
-    coding: "ollama/qwen2.5-coder:7b"  # Model for coding/agent tasks
-    weak: "ollama/qwen2.5:3b"       # Cheap model for auxiliary operations
-    embedding: "ollama/nomic-embed-text"  # Model for embeddings (RAG)
-
-  # Model visibility in dropdown (format: "provider/model-id": true/false)
-  visibility:
-    "ollama/qwen2.5:7b": true
-    "ollama/qwen2.5:14b": true
-    "ollama/qwen2.5-coder:7b": true
-    "openai/gpt-4o-mini": true
-    "openai/gpt-4o": false          # Hidden by default (expensive)
-    "anthropic/claude-3-5-sonnet-20241022": true
-    "anthropic/claude-3-opus-20240229": false  # Hidden by default (expensive)
-
-# ─────────────────────────────────────────────────────────────────────────────
-# System Limits
-# ─────────────────────────────────────────────────────────────────────────────
-limits:
-  apiCallTimeout: 240           # API call timeout in seconds
-  toolExecutionTimeout: 240     # Tool execution timeout in seconds
-  streamingReadTimeout: 240     # Streaming read timeout in seconds
-  streamingRequestTimeout: 1800 # Total streaming request timeout in seconds
-  maxContextSize: 128000        # Maximum context size in tokens
-  maxOutputSize: 16384          # Maximum output size in tokens
-  maxFileSize: 10               # Maximum file size in MB
-
-# ─────────────────────────────────────────────────────────────────────────────
-# Advanced Settings
-# ─────────────────────────────────────────────────────────────────────────────
-advanced:
-  noEgressDefault: false        # Block external network calls by default
-  readOnlyMode: false           # Prevent all file write operations
-  autoOptimizePercentage: 85    # Auto-optimize context at this % of limit
-
-# ─────────────────────────────────────────────────────────────────────────────
-# Tool Permissions
-# ─────────────────────────────────────────────────────────────────────────────
-tools:
-  # Permission format: { planMode: "ON"|"OFF", agentMode: "ON"|"OFF" }
-  permissions:
-    read_file:
-      planMode: "ON"
-      agentMode: "ON"
-    read_directory:
-      planMode: "ON"
-      agentMode: "ON"
-    file_search:
-      planMode: "ON"
-      agentMode: "ON"
-    grep_search:
-      planMode: "ON"
-      agentMode: "ON"
-    view_diff:
-      planMode: "ON"
-      agentMode: "ON"
-    create_new_file:
-      planMode: "OFF"
-      agentMode: "ON"
-    code_editing:
-      planMode: "OFF"
-      agentMode: "ON"
-    advance_code_editing:
-      planMode: "OFF"
-      agentMode: "ON"
-    multi_edit:
-      planMode: "OFF"
-      agentMode: "ON"
-    run_terminal_command:
-      planMode: "OFF"
-      agentMode: "ON"          # Enabled by default in AGENT mode
-
-# Terminal command whitelist configuration
-terminal:
-  whitelist:
-    enabled: true
-    mode: "WHITELIST_ONLY"     # WHITELIST_ONLY | WHITELIST_PLUS_DENY
-    globalBlockedPatterns:
-      - "\\|\\s*(sh|bash|zsh|powershell|cmd)\\b"
-      - "\\$\\("
-      - "`[^`]+`"
-    commands:
-      - program: "git"
-        allowedSubcommands: ["status", "log", "diff", "add", "commit"]
-        blockedSubcommands: ["push", "remote"]
-        blockedFlags: ["--force", "--no-verify"]
-        blockedArgPatterns: [".*\\.env$"]
-      - program: "gradle"
-        aliases: ["gradlew", "gradlew.bat"]
-        blockedFlags: ["--init-script"]
-
-# ─────────────────────────────────────────────────────────────────────────────
-# RAG (Retrieval-Augmented Generation) Configuration
-# ─────────────────────────────────────────────────────────────────────────────
-rag:
-  enabled: true                 # Enable RAG features
-  indexOnStartup: true          # Index project at IDE startup
-  autoIndexOnContextBuild: true # Auto-index when building context
-  maxFileSizeMB: 2              # Max file size for indexing
-  maxChunksPerFile: 100         # Max chunks per file
-  indexBatchSize: 10            # Files per indexing batch
-  embeddingsBatchSize: 50       # Embeddings per batch
-  cacheTtlMs: 300000            # RAG cache TTL (5 minutes)
-  maxConcurrentJobs: 4          # Max concurrent indexing jobs
-
-  # Directories to ignore during indexing
-  ignoredDirectories:
-    - ".git"
-    - ".idea"
-    - ".vscode"
-    - "node_modules"
-    - "build"
-    - "dist"
-    - "__pycache__"
-    - ".venv"
-    - "target"
-    - "out"
-
-# ─────────────────────────────────────────────────────────────────────────────
-# UI State (persisted between sessions)
-# ─────────────────────────────────────────────────────────────────────────────
-ui:
-  thinkingEnabled: false        # Show LLM thinking process
-  noEgressEnabled: false        # Block external network calls
-  orchestrationEnabled: true    # Enable orchestration toggle
-  intentClassificationEnabled: false # Enable LLM intent classification
-  executionMode: "AUTO"         # AUTO or INTERACTIVE
-  selectedMode: "CHAT"          # CHAT, PLAN, or AGENT
-  selectedModel: ""             # Currently selected model (empty = auto)
-
-# ═══════════════════════════════════════════════════════════════════════════════
-# PROJECT-SPECIFIC SETTINGS (only in <project>/.refio/config.yaml)
-# ═══════════════════════════════════════════════════════════════════════════════
-
-# ─────────────────────────────────────────────────────────────────────────────
-# Custom Prompts (project-specific)
-# ─────────────────────────────────────────────────────────────────────────────
-# prompts:
-#   systemChat: |
-#     You are a helpful coding assistant for this specific project.
-#     Always follow the project's coding conventions.
-#
-#   systemPlan: |
-#     You are a planning assistant. Create detailed plans for tasks.
-#
-#   systemAgent: |
-#     You are an autonomous coding agent.
-#
-#   commands:
-#     - name: "fix"
-#       description: "Fix code issues"
-#       content: "Analyze and fix any issues in the selected code."
-#       enabled: true
-#
-#     - name: "refactor"
-#       description: "Refactor code"
-#       content: "Refactor the selected code for better readability."
-#       enabled: true
-#
-#   rules:
-#     - name: "coding-style"
-#       content: "Always use 4-space indentation."
-#       enabled: true
-
-# ─────────────────────────────────────────────────────────────────────────────
-# MCP Server Configuration (project-specific)
-# ─────────────────────────────────────────────────────────────────────────────
-# mcp:
-#   servers:
-#     - id: "github"
-#       displayName: "GitHub"
-#       type: "STDIO"
-#       command: "npx"
-#       args: ["-y", "@modelcontextprotocol/server-github"]
-#       accessMode: "READ"
-#       enabled: true
-#       env:
-#         - name: "GITHUB_TOKEN"
-#           value: ""
-#           isSecret: true
-#
-#     - id: "filesystem"
-#       displayName: "Filesystem"
-#       type: "STDIO"
-#       command: "npx"
-#       args: ["-y", "@modelcontextprotocol/server-filesystem", "/path/to/allowed"]
-#       accessMode: "READ_WRITE"
-#       enabled: true
-""".trimIndent()
-        }
+        /** Static, fully-documented template for the "Example config" UI panel. */
+        fun createExampleConfig(): String = ConfigYamlEmitter.createExampleConfig()
     }
 }
-
-// ═══════════════════════════════════════════════════════════════════════════════
-// Configuration Data Classes
-// ═══════════════════════════════════════════════════════════════════════════════
-
-/**
- * General UI and behavior settings
- */
-@Serializable
-data class GeneralConfig(
-    val formatMarkdown: Boolean? = null,
-    val streamingEnabled: Boolean? = null,
-    val advancedView: Boolean? = null
-)
-
-/**
- * Provider configurations
- */
-@Serializable
-data class ProvidersConfig(
-    val ollama: OllamaConfig? = null,
-    val anthropic: AnthropicConfig? = null,
-    val openai: OpenAIConfig? = null,
-    val openrouter: OpenRouterConfig? = null,
-    val gemini: GeminiConfig? = null,
-    val lmstudio: LMStudioConfig? = null,
-    @SerialName("custom_openai")
-    val customOpenai: CustomOpenAIConfig? = null,
-    val zai: ZAIConfig? = null
-)
-
-@Serializable
-data class OllamaConfig(
-    val endpoint: String? = null,
-    val contextSize: Int? = null,
-    val keepAlive: Int? = null
-)
-
-@Serializable
-data class AnthropicConfig(
-    val apiKey: String? = null
-)
-
-@Serializable
-data class OpenAIConfig(
-    val apiKey: String? = null
-)
-
-@Serializable
-data class OpenRouterConfig(
-    val apiKey: String? = null
-)
-
-@Serializable
-data class GeminiConfig(
-    val apiKey: String? = null
-)
-
-@Serializable
-data class LMStudioConfig(
-    val apiKey: String? = null,
-    val baseUrl: String? = null,
-    val contextSize: Int? = null
-)
-
-@Serializable
-data class CustomOpenAIConfig(
-    val apiKey: String? = null,
-    val baseUrl: String? = null,
-    val model: String? = null
-)
-
-@Serializable
-data class ZAIConfig(
-    val apiKey: String? = null,
-    val baseUrl: String? = null
-)
-
-/**
- * Model configuration
- */
-@Serializable
-data class ModelsConfig(
-    val default: String? = null,  // Legacy single default model
-    val defaults: ModelDefaultsConfig? = null,
-    val visibility: Map<String, Boolean>? = null,
-    val presets: List<ModelPresetConfig>? = null
-)
-
-@Serializable
-data class ModelDefaultsConfig(
-    val chat: String? = null,
-    val plan: String? = null,
-    val coding: String? = null,
-    val weak: String? = null,
-    val embedding: String? = null,
-    val strong: String? = null
-)
-
-@Serializable
-data class ModelPresetConfig(
-    val name: String,
-    val description: String? = null,
-    val defaultModel: String,
-    val planModel: String? = null,
-    val codingModel: String? = null,
-    val weakModel: String? = null,
-    val strongModel: String? = null,
-    val visibleModels: List<String>? = null
-)
-
-/**
- * System limits configuration
- */
-@Serializable
-data class LimitsConfig(
-    val apiCallTimeout: Int? = null,
-    val toolExecutionTimeout: Int? = null,
-    val streamingReadTimeout: Int? = null,
-    val streamingRequestTimeout: Int? = null,
-    val maxContextSize: Int? = null,
-    val maxOutputSize: Int? = null,
-    val maxFileSize: Int? = null
-)
-
-/**
- * Advanced settings
- */
-@Serializable
-data class AdvancedConfig(
-    val noEgressDefault: Boolean? = null,
-    val readOnlyMode: Boolean? = null,
-    val autoOptimizePercentage: Int? = null,
-    val orchestrationEnabled: Boolean? = null
-)
-
-/**
- * Tool permissions configuration
- */
-@Serializable
-data class ToolsConfig(
-    val permissions: Map<String, ToolPermissionConfig>? = null
-)
-
-@Serializable
-data class TerminalConfig(
-    val whitelist: TerminalWhitelistConfig? = null
-)
-
-@Serializable
-data class TerminalWhitelistConfig(
-    val enabled: Boolean? = null,
-    val mode: String? = null,
-    val globalBlockedPatterns: List<String>? = null,
-    @SerialName("commands")
-    val commands: List<TerminalCommandConfig>? = null
-)
-
-@Serializable
-data class TerminalCommandConfig(
-    val program: String,
-    val description: String? = null,
-    val aliases: List<String>? = null,
-    val blockedFlags: List<String>? = null,
-    val blockedSubcommands: List<String>? = null,
-    val blockedArgPatterns: List<String>? = null,
-    val allowedSubcommands: List<String>? = null,
-    val maxArgs: Int? = null,
-    val requireConfirmation: Boolean? = null
-)
-
-@Serializable
-data class ToolPermissionConfig(
-    val planMode: String? = null,  // "ON" or "OFF"
-    val agentMode: String? = null  // "ON" or "OFF"
-)
-
-/**
- * RAG configuration
- */
-@Serializable
-data class RagConfig(
-    val enabled: Boolean? = null,
-    val indexOnStartup: Boolean? = null,
-    val autoIndexOnContextBuild: Boolean? = null,
-    val maxFileSizeMB: Long? = null,
-    val maxChunksPerFile: Int? = null,
-    val indexBatchSize: Int? = null,
-    val embeddingsBatchSize: Int? = null,
-    val cacheTtlMs: Long? = null,
-    val maxConcurrentJobs: Int? = null,
-    val ignoredDirectories: List<String>? = null,
-    val searchSimilarityThreshold: Float? = null,
-    val searchTopK: Int? = null,
-    val searchHybridEnabled: Boolean? = null,
-    val searchSemanticWeight: Float? = null,
-    val searchIncludeContextChunks: Boolean? = null
-)
-
-/**
- * UI state configuration
- */
-@Serializable
-data class UiConfig(
-    val thinkingEnabled: Boolean? = null,
-    val noEgressEnabled: Boolean? = null,
-    val executionMode: String? = null,
-    val selectedMode: String? = null,
-    val selectedModel: String? = null
-)
-
-/**
- * Prompts configuration (project-specific)
- */
-@Serializable
-data class PromptsConfig(
-    val systemChat: String? = null,
-    val systemPlan: String? = null,
-    val systemAgent: String? = null,
-    val commands: List<CommandConfig>? = null,
-    val rules: List<RuleConfig>? = null
-)
-
-@Serializable
-data class CommandConfig(
-    val name: String,
-    val description: String? = null,
-    val content: String,
-    val enabled: Boolean = true
-)
-
-@Serializable
-data class RuleConfig(
-    val name: String,
-    val content: String,
-    val enabled: Boolean = true
-)
-
-/**
- * MCP server configuration (project-specific)
- */
-@Serializable
-data class McpConfig(
-    val servers: List<McpServerConfig>? = null
-)
-
-@Serializable
-data class McpServerConfig(
-    val id: String,
-    val displayName: String? = null,
-    val description: String? = null,
-    val type: String = "STDIO",  // STDIO or HTTP
-    val command: String? = null,
-    val args: List<String>? = null,
-    val workingDirectory: String? = null,
-    val url: String? = null,
-    val accessMode: String = "READ",  // READ or READ_WRITE
-    val enabled: Boolean = true,
-    val env: List<McpEnvConfig>? = null,
-    val httpHeaders: List<McpHeaderConfig>? = null,
-    val timeout: Int? = null,
-    val retryAttempts: Int? = null
-)
-
-@Serializable
-data class McpEnvConfig(
-    val name: String,
-    val value: String,
-    val isSecret: Boolean = false
-)
-
-@Serializable
-data class McpHeaderConfig(
-    val name: String,
-    val value: String,
-    val isSecret: Boolean = false
-)
-
-/**
- * Hooks configuration — user-defined actions triggered on agent lifecycle events.
- * Configured in .refio/config.yaml under the `hooks` key.
- */
-@Serializable
-data class HooksConfig(
-    @SerialName("before_turn_loop")
-    val beforeTurnLoop: List<HookDefinition>? = null,
-    @SerialName("after_turn_loop")
-    val afterTurnLoop: List<HookDefinition>? = null,
-    @SerialName("before_tool")
-    val beforeTool: List<HookDefinition>? = null,
-    @SerialName("after_tool")
-    val afterTool: List<HookDefinition>? = null,
-    @SerialName("on_agent_complete")
-    val onAgentComplete: List<HookDefinition>? = null,
-    @SerialName("on_agent_error")
-    val onAgentError: List<HookDefinition>? = null
-)
-
-@Serializable
-data class HookDefinition(
-    val action: String,
-    val command: String? = null,
-    val message: String? = null,
-    val match: String? = null,
-    val modes: List<String>? = null,
-    val timeout: Long? = null
-)
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/config/ConfigYamlEmitter.kt b/core/src/main/kotlin/pl/jclab/refio/core/config/ConfigYamlEmitter.kt
new file mode 100644
index 00000000..33e112d5
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/config/ConfigYamlEmitter.kt
@@ -0,0 +1,251 @@
+package pl.jclab.refio.core.config
+
+/**
+ * Pretty-printers for [ConfigYaml]:
+ *  - [createCommentedYaml] serialises an in-memory config with section-header
+ *    comments for the Settings UI "Export" flow.
+ *  - [createExampleConfig] returns a static, fully-documented template shown
+ *    to users in the "Example config" panel.
+ *
+ * Pulled out of [ConfigYaml] so the data class stays focused on (de)serialisation.
+ * Both functions are pure (no I/O).
+ */
+internal object ConfigYamlEmitter {
+
+    fun createCommentedYaml(config: ConfigYaml): String {
+        val sb = StringBuilder()
+
+        sb.appendLine("# ═══════════════════════════════════════════════════════════════════════════════")
+        sb.appendLine("# Refio Configuration File")
+        sb.appendLine("# Generated: ${java.time.LocalDateTime.now()}")
+        sb.appendLine("# ═══════════════════════════════════════════════════════════════════════════════")
+        sb.appendLine()
+
+        config.general?.let { general ->
+            sb.appendLine("# ─────────────────────────────────────────────────────────────────────────────")
+            sb.appendLine("# General Settings")
+            sb.appendLine("# ─────────────────────────────────────────────────────────────────────────────")
+            sb.appendLine("general:")
+            general.formatMarkdown?.let { sb.appendLine("  formatMarkdown: $it") }
+            general.streamingEnabled?.let { sb.appendLine("  streamingEnabled: $it") }
+            general.advancedView?.let { sb.appendLine("  advancedView: $it") }
+            sb.appendLine()
+        }
+
+        config.providers?.let { providers ->
+            sb.appendLine("# ─────────────────────────────────────────────────────────────────────────────")
+            sb.appendLine("# LLM Provider Configuration")
+            sb.appendLine("# ─────────────────────────────────────────────────────────────────────────────")
+            sb.appendLine("providers:")
+
+            providers.ollama?.let { ollama ->
+                sb.appendLine("  ollama:")
+                ollama.endpoint?.let { sb.appendLine("    endpoint: \"$it\"") }
+                ollama.contextSize?.let { sb.appendLine("    contextSize: $it") }
+            }
+            providers.anthropic?.let { it.apiKey?.let { k -> sb.appendLine("  anthropic:"); sb.appendLine("    apiKey: \"$k\"") } }
+            providers.openai?.let { it.apiKey?.let { k -> sb.appendLine("  openai:"); sb.appendLine("    apiKey: \"$k\"") } }
+            providers.openrouter?.let { it.apiKey?.let { k -> sb.appendLine("  openrouter:"); sb.appendLine("    apiKey: \"$k\"") } }
+            providers.gemini?.let { it.apiKey?.let { k -> sb.appendLine("  gemini:"); sb.appendLine("    apiKey: \"$k\"") } }
+            providers.lmstudio?.let { lmstudio ->
+                sb.appendLine("  lmstudio:")
+                lmstudio.apiKey?.let { sb.appendLine("    apiKey: \"$it\"") }
+                lmstudio.baseUrl?.let { sb.appendLine("    baseUrl: \"$it\"") }
+                lmstudio.contextSize?.let { sb.appendLine("    contextSize: $it") }
+            }
+            providers.genericOpenai?.let { gen ->
+                sb.appendLine("  generic_openai:")
+                gen.apiKey?.let { sb.appendLine("    apiKey: \"$it\"") }
+                gen.baseUrl?.let { sb.appendLine("    baseUrl: \"$it\"") }
+                gen.model?.let { sb.appendLine("    model: \"$it\"") }
+            }
+            providers.zai?.let { zai ->
+                sb.appendLine("  zai:")
+                zai.apiKey?.let { sb.appendLine("    apiKey: \"$it\"") }
+                zai.baseUrl?.let { sb.appendLine("    baseUrl: \"$it\"") }
+            }
+            sb.appendLine()
+        }
+
+        config.models?.let { models ->
+            sb.appendLine("# ─────────────────────────────────────────────────────────────────────────────")
+            sb.appendLine("# Model Configuration")
+            sb.appendLine("# ─────────────────────────────────────────────────────────────────────────────")
+            sb.appendLine("models:")
+            models.defaults?.let { defaults ->
+                sb.appendLine("  defaults:")
+                defaults.chat?.let { sb.appendLine("    chat: \"$it\"") }
+                defaults.plan?.let { sb.appendLine("    plan: \"$it\"") }
+                defaults.coding?.let { sb.appendLine("    coding: \"$it\"") }
+                defaults.weak?.let { sb.appendLine("    weak: \"$it\"") }
+                defaults.embedding?.let { sb.appendLine("    embedding: \"$it\"") }
+            }
+            models.visibility?.takeIf { it.isNotEmpty() }?.let { visibility ->
+                sb.appendLine("  visibility:")
+                visibility.forEach { (model, visible) ->
+                    sb.appendLine("    \"$model\": $visible")
+                }
+            }
+            sb.appendLine()
+        }
+
+        config.limits?.let { limits ->
+            sb.appendLine("# ─────────────────────────────────────────────────────────────────────────────")
+            sb.appendLine("# System Limits")
+            sb.appendLine("# ─────────────────────────────────────────────────────────────────────────────")
+            sb.appendLine("limits:")
+            limits.apiCallTimeout?.let { sb.appendLine("  apiCallTimeout: $it") }
+            limits.toolExecutionTimeout?.let { sb.appendLine("  toolExecutionTimeout: $it") }
+            limits.streamingReadTimeout?.let { sb.appendLine("  streamingReadTimeout: $it") }
+            limits.streamingRequestTimeout?.let { sb.appendLine("  streamingRequestTimeout: $it") }
+            limits.maxContextSize?.let { sb.appendLine("  maxContextSize: $it") }
+            limits.maxOutputSize?.let { sb.appendLine("  maxOutputSize: $it") }
+            limits.maxFileSize?.let { sb.appendLine("  maxFileSize: $it") }
+            sb.appendLine()
+        }
+
+        config.advanced?.let { advanced ->
+            sb.appendLine("# ─────────────────────────────────────────────────────────────────────────────")
+            sb.appendLine("# Advanced Settings")
+            sb.appendLine("# ─────────────────────────────────────────────────────────────────────────────")
+            sb.appendLine("advanced:")
+            advanced.noEgressDefault?.let { sb.appendLine("  noEgressDefault: $it") }
+            advanced.readOnlyMode?.let { sb.appendLine("  readOnlyMode: $it") }
+            advanced.autoOptimizePercentage?.let { sb.appendLine("  autoOptimizePercentage: $it") }
+            sb.appendLine()
+        }
+
+        config.tools?.permissions?.takeIf { it.isNotEmpty() }?.let { permissions ->
+            sb.appendLine("# ─────────────────────────────────────────────────────────────────────────────")
+            sb.appendLine("# Tool Permissions")
+            sb.appendLine("# ─────────────────────────────────────────────────────────────────────────────")
+            sb.appendLine("tools:")
+            sb.appendLine("  permissions:")
+            permissions.forEach { (tool, perm) ->
+                sb.appendLine("    $tool:")
+                perm.planMode?.let { sb.appendLine("      planMode: \"$it\"") }
+                perm.agentMode?.let { sb.appendLine("      agentMode: \"$it\"") }
+            }
+            sb.appendLine()
+        }
+
+        config.rag?.let { rag ->
+            sb.appendLine("# ─────────────────────────────────────────────────────────────────────────────")
+            sb.appendLine("# RAG Configuration")
+            sb.appendLine("# ─────────────────────────────────────────────────────────────────────────────")
+            sb.appendLine("rag:")
+            rag.enabled?.let { sb.appendLine("  enabled: $it") }
+            rag.indexOnStartup?.let { sb.appendLine("  indexOnStartup: $it") }
+            rag.autoIndexOnContextBuild?.let { sb.appendLine("  autoIndexOnContextBuild: $it") }
+            rag.maxFileSizeMB?.let { sb.appendLine("  maxFileSizeMB: $it") }
+            rag.maxChunksPerFile?.let { sb.appendLine("  maxChunksPerFile: $it") }
+            rag.indexBatchSize?.let { sb.appendLine("  indexBatchSize: $it") }
+            rag.embeddingsBatchSize?.let { sb.appendLine("  embeddingsBatchSize: $it") }
+            rag.cacheTtlMs?.let { sb.appendLine("  cacheTtlMs: $it") }
+            rag.maxConcurrentJobs?.let { sb.appendLine("  maxConcurrentJobs: $it") }
+            rag.ignoredDirectories?.takeIf { it.isNotEmpty() }?.let { dirs ->
+                sb.appendLine("  ignoredDirectories:")
+                dirs.forEach { sb.appendLine("    - \"$it\"") }
+            }
+            sb.appendLine()
+        }
+
+        config.ui?.let { ui ->
+            sb.appendLine("# ─────────────────────────────────────────────────────────────────────────────")
+            sb.appendLine("# UI State")
+            sb.appendLine("# ─────────────────────────────────────────────────────────────────────────────")
+            sb.appendLine("ui:")
+            ui.thinkingEnabled?.let { sb.appendLine("  thinkingEnabled: $it") }
+            ui.noEgressEnabled?.let { sb.appendLine("  noEgressEnabled: $it") }
+            ui.executionMode?.let { sb.appendLine("  executionMode: \"$it\"") }
+            ui.selectedMode?.let { sb.appendLine("  selectedMode: \"$it\"") }
+            ui.selectedModel?.let { sb.appendLine("  selectedModel: \"$it\"") }
+            sb.appendLine()
+        }
+
+        config.prompts?.let { prompts ->
+            sb.appendLine("# ─────────────────────────────────────────────────────────────────────────────")
+            sb.appendLine("# Custom Prompts (project-specific)")
+            sb.appendLine("# ─────────────────────────────────────────────────────────────────────────────")
+            sb.appendLine("prompts:")
+
+            prompts.systemChat?.let {
+                sb.appendLine("  systemChat: |")
+                it.lines().forEach { line -> sb.appendLine("    $line") }
+            }
+            prompts.systemPlan?.let {
+                sb.appendLine("  systemPlan: |")
+                it.lines().forEach { line -> sb.appendLine("    $line") }
+            }
+            prompts.systemAgent?.let {
+                sb.appendLine("  systemAgent: |")
+                it.lines().forEach { line -> sb.appendLine("    $line") }
+            }
+
+            prompts.commands?.takeIf { it.isNotEmpty() }?.let { commands ->
+                sb.appendLine("  commands:")
+                commands.forEach { cmd ->
+                    sb.appendLine("    - name: \"${cmd.name}\"")
+                    cmd.description?.let { sb.appendLine("      description: \"$it\"") }
+                    sb.appendLine("      content: \"${cmd.content.replace("\"", "\\\"")}\"")
+                    sb.appendLine("      enabled: ${cmd.enabled}")
+                }
+            }
+            prompts.rules?.takeIf { it.isNotEmpty() }?.let { rules ->
+                sb.appendLine("  rules:")
+                rules.forEach { rule ->
+                    sb.appendLine("    - name: \"${rule.name}\"")
+                    sb.appendLine("      content: \"${rule.content.replace("\"", "\\\"")}\"")
+                    sb.appendLine("      enabled: ${rule.enabled}")
+                }
+            }
+            sb.appendLine()
+        }
+
+        config.mcp?.servers?.takeIf { it.isNotEmpty() }?.let { servers ->
+            sb.appendLine("# ─────────────────────────────────────────────────────────────────────────────")
+            sb.appendLine("# MCP Server Configuration (project-specific)")
+            sb.appendLine("# ─────────────────────────────────────────────────────────────────────────────")
+            sb.appendLine("mcp:")
+            sb.appendLine("  servers:")
+            servers.forEach { server ->
+                sb.appendLine("    - id: \"${server.id}\"")
+                server.displayName?.let { sb.appendLine("      displayName: \"$it\"") }
+                sb.appendLine("      type: \"${server.type}\"")
+                server.command?.let { sb.appendLine("      command: \"$it\"") }
+                server.args?.takeIf { it.isNotEmpty() }?.let { args ->
+                    sb.appendLine("      args: [${args.joinToString(", ") { "\"$it\"" }}]")
+                }
+                server.url?.let { sb.appendLine("      url: \"$it\"") }
+                sb.appendLine("      accessMode: \"${server.accessMode}\"")
+                sb.appendLine("      enabled: ${server.enabled}")
+                server.env?.takeIf { it.isNotEmpty() }?.let { envs ->
+                    sb.appendLine("      env:")
+                    envs.forEach { env ->
+                        sb.appendLine("        - name: \"${env.name}\"")
+                        val value = if (env.isSecret) "***" else env.value
+                        sb.appendLine("          value: \"$value\"")
+                        if (env.isSecret) sb.appendLine("          isSecret: true")
+                    }
+                }
+            }
+            sb.appendLine()
+        }
+
+        return sb.toString()
+    }
+
+    private const val EXAMPLE_CONFIG_RESOURCE = "/config/example-config.yaml"
+
+    /**
+     * Loads the static example config YAML from [EXAMPLE_CONFIG_RESOURCE].
+     * Centralizing the ~230 LOC template as a resource removes boilerplate from
+     * this file and lets editors treat it as actual YAML.
+     */
+    fun createExampleConfig(): String {
+        val resource = ConfigYamlEmitter::class.java.getResourceAsStream(EXAMPLE_CONFIG_RESOURCE)
+            ?: error("Missing classpath resource: $EXAMPLE_CONFIG_RESOURCE")
+        return resource.bufferedReader(Charsets.UTF_8).use { it.readText() }
+    }
+
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/config/ConfigYamlIO.kt b/core/src/main/kotlin/pl/jclab/refio/core/config/ConfigYamlIO.kt
new file mode 100644
index 00000000..75544c9f
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/config/ConfigYamlIO.kt
@@ -0,0 +1,69 @@
+package pl.jclab.refio.core.config
+
+import com.charleskorn.kaml.Yaml
+import com.charleskorn.kaml.YamlConfiguration
+import java.io.File
+
+/**
+ * Load / save / serialize operations for [ConfigYaml].
+ *
+ * Kept separate from the data-class declaration so that `ConfigYaml.kt` stays a thin record
+ * of the schema, while parsing/encoding details live here.
+ */
+internal object ConfigYamlIO {
+
+    private val yaml = Yaml(
+        configuration = YamlConfiguration(strictMode = false)
+    )
+
+    fun loadFromPath(configFile: File): ConfigYaml? {
+        if (!configFile.exists()) return null
+
+        return try {
+            decode(configFile.readText())
+        } catch (e: Exception) {
+            println("Error loading config YAML from ${configFile.absolutePath}: ${e.message}")
+            null
+        }
+    }
+
+    fun toYamlString(config: ConfigYaml): String =
+        yaml.encodeToString(ConfigYaml.serializer(), config)
+
+    fun saveToFile(config: ConfigYaml, file: File, withComments: Boolean) {
+        file.parentFile?.mkdirs()
+        val content = if (withComments) ConfigYamlEmitter.createCommentedYaml(config) else toYamlString(config)
+        file.writeText(content)
+    }
+
+    private fun decode(yamlContent: String): ConfigYaml {
+        val firstAttempt = runCatching {
+            yaml.decodeFromString(ConfigYaml.serializer(), yamlContent)
+        }
+        if (firstAttempt.isSuccess) return firstAttempt.getOrThrow()
+
+        val sanitizedEscapes = YamlSanitizer.sanitizeInvalidDoubleQuotedEscapes(yamlContent)
+        val sanitizedBrokenLines = YamlSanitizer.sanitizeBrokenStandaloneEmptyQuotedLines(sanitizedEscapes)
+
+        if (sanitizedBrokenLines == yamlContent) {
+            throw firstAttempt.exceptionOrNull() ?: IllegalStateException("Unknown YAML parsing error")
+        }
+
+        val secondAttempt = runCatching {
+            yaml.decodeFromString(ConfigYaml.serializer(), sanitizedBrokenLines)
+        }
+        if (secondAttempt.isSuccess) {
+            if (sanitizedEscapes != yamlContent) {
+                println("Config YAML parser fallback: sanitized invalid double-quoted escape sequences")
+            }
+            if (sanitizedBrokenLines != sanitizedEscapes) {
+                println("Config YAML parser fallback: repaired broken standalone empty-quoted lines")
+            }
+            return secondAttempt.getOrThrow()
+        }
+
+        throw secondAttempt.exceptionOrNull()
+            ?: firstAttempt.exceptionOrNull()
+            ?: IllegalStateException("Unknown YAML parsing error")
+    }
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/config/ConfigYamlMerger.kt b/core/src/main/kotlin/pl/jclab/refio/core/config/ConfigYamlMerger.kt
new file mode 100644
index 00000000..14f5e5db
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/config/ConfigYamlMerger.kt
@@ -0,0 +1,178 @@
+package pl.jclab.refio.core.config
+
+/**
+ * Merges two [ConfigYaml] snapshots (user + project) using section-aware rules:
+ * scalars use override-wins; visibility maps and hook command lists concatenate;
+ * MCP servers dedupe by `id`.
+ *
+ * Lives outside [ConfigYaml] so the data class isn't burdened by 150 LOC of
+ * per-section null-coalescing code. Pure function; no hidden state.
+ */
+internal object ConfigYamlMerger {
+
+    /** Merge two configs — values from [override] take precedence over [base]. */
+    fun merge(base: ConfigYaml?, override: ConfigYaml?): ConfigYaml {
+        if (base == null) return override ?: ConfigYaml()
+        if (override == null) return base
+
+        return ConfigYaml(
+            general = mergeGeneral(base.general, override.general),
+            providers = mergeProviders(base.providers, override.providers),
+            models = mergeModels(base.models, override.models),
+            limits = mergeLimits(base.limits, override.limits),
+            advanced = mergeAdvanced(base.advanced, override.advanced),
+            tools = mergeTools(base.tools, override.tools),
+            rag = mergeRag(base.rag, override.rag),
+            ui = mergeUi(base.ui, override.ui),
+            prompts = mergePrompts(base.prompts, override.prompts),
+            mcp = mergeMcp(base.mcp, override.mcp),
+            hooks = mergeHooks(base.hooks, override.hooks),
+        )
+    }
+
+    private fun mergeGeneral(base: GeneralConfig?, override: GeneralConfig?): GeneralConfig? {
+        if (base == null) return override
+        if (override == null) return base
+        return GeneralConfig(
+            formatMarkdown = override.formatMarkdown ?: base.formatMarkdown,
+            streamingEnabled = override.streamingEnabled ?: base.streamingEnabled,
+            advancedView = override.advancedView ?: base.advancedView,
+        )
+    }
+
+    private fun mergeProviders(base: ProvidersConfig?, override: ProvidersConfig?): ProvidersConfig? {
+        if (base == null) return override
+        if (override == null) return base
+        return ProvidersConfig(
+            ollama = override.ollama ?: base.ollama,
+            anthropic = override.anthropic ?: base.anthropic,
+            openai = override.openai ?: base.openai,
+            openrouter = override.openrouter ?: base.openrouter,
+            gemini = override.gemini ?: base.gemini,
+            lmstudio = override.lmstudio ?: base.lmstudio,
+            genericOpenai = override.genericOpenai ?: base.genericOpenai,
+            zai = override.zai ?: base.zai,
+        )
+    }
+
+    private fun mergeModels(base: ModelsConfig?, override: ModelsConfig?): ModelsConfig? {
+        if (base == null) return override
+        if (override == null) return base
+        return ModelsConfig(
+            default = override.default ?: base.default,
+            defaults = override.defaults ?: base.defaults,
+            visibility = mergeVisibility(base.visibility, override.visibility),
+            presets = override.presets ?: base.presets,
+        )
+    }
+
+    private fun mergeVisibility(base: Map<String, Boolean>?, override: Map<String, Boolean>?): Map<String, Boolean>? {
+        if (base == null) return override
+        if (override == null) return base
+        return base + override
+    }
+
+    private fun mergeLimits(base: LimitsConfig?, override: LimitsConfig?): LimitsConfig? {
+        if (base == null) return override
+        if (override == null) return base
+        return LimitsConfig(
+            apiCallTimeout = override.apiCallTimeout ?: base.apiCallTimeout,
+            toolExecutionTimeout = override.toolExecutionTimeout ?: base.toolExecutionTimeout,
+            streamingReadTimeout = override.streamingReadTimeout ?: base.streamingReadTimeout,
+            streamingRequestTimeout = override.streamingRequestTimeout ?: base.streamingRequestTimeout,
+            maxContextSize = override.maxContextSize ?: base.maxContextSize,
+            maxOutputSize = override.maxOutputSize ?: base.maxOutputSize,
+            maxFileSize = override.maxFileSize ?: base.maxFileSize,
+        )
+    }
+
+    private fun mergeAdvanced(base: AdvancedConfig?, override: AdvancedConfig?): AdvancedConfig? {
+        if (base == null) return override
+        if (override == null) return base
+        return AdvancedConfig(
+            noEgressDefault = override.noEgressDefault ?: base.noEgressDefault,
+            readOnlyMode = override.readOnlyMode ?: base.readOnlyMode,
+            autoOptimizePercentage = override.autoOptimizePercentage ?: base.autoOptimizePercentage,
+            orchestrationEnabled = override.orchestrationEnabled ?: base.orchestrationEnabled,
+        )
+    }
+
+    private fun mergeRag(base: RagConfig?, override: RagConfig?): RagConfig? {
+        if (base == null) return override
+        if (override == null) return base
+        return RagConfig(
+            enabled = override.enabled ?: base.enabled,
+            indexOnStartup = override.indexOnStartup ?: base.indexOnStartup,
+            autoIndexOnContextBuild = override.autoIndexOnContextBuild ?: base.autoIndexOnContextBuild,
+            maxFileSizeMB = override.maxFileSizeMB ?: base.maxFileSizeMB,
+            maxChunksPerFile = override.maxChunksPerFile ?: base.maxChunksPerFile,
+            indexBatchSize = override.indexBatchSize ?: base.indexBatchSize,
+            embeddingsBatchSize = override.embeddingsBatchSize ?: base.embeddingsBatchSize,
+            cacheTtlMs = override.cacheTtlMs ?: base.cacheTtlMs,
+            maxConcurrentJobs = override.maxConcurrentJobs ?: base.maxConcurrentJobs,
+            ignoredDirectories = mergeList(base.ignoredDirectories, override.ignoredDirectories),
+            searchSimilarityThreshold = override.searchSimilarityThreshold ?: base.searchSimilarityThreshold,
+            searchTopK = override.searchTopK ?: base.searchTopK,
+            searchHybridEnabled = override.searchHybridEnabled ?: base.searchHybridEnabled,
+            searchSemanticWeight = override.searchSemanticWeight ?: base.searchSemanticWeight,
+            searchIncludeContextChunks = override.searchIncludeContextChunks ?: base.searchIncludeContextChunks,
+        )
+    }
+
+    private fun mergeUi(base: UiConfig?, override: UiConfig?): UiConfig? {
+        if (base == null) return override
+        if (override == null) return base
+        return UiConfig(
+            thinkingEnabled = override.thinkingEnabled ?: base.thinkingEnabled,
+            noEgressEnabled = override.noEgressEnabled ?: base.noEgressEnabled,
+            executionMode = override.executionMode ?: base.executionMode,
+            selectedMode = override.selectedMode ?: base.selectedMode,
+            selectedModel = override.selectedModel ?: base.selectedModel,
+        )
+    }
+
+    private fun mergeTools(base: ToolsConfig?, override: ToolsConfig?): ToolsConfig? {
+        if (base == null) return override
+        if (override == null) return base
+        val mergedPermissions = (base.permissions ?: emptyMap()) + (override.permissions ?: emptyMap())
+        return ToolsConfig(permissions = mergedPermissions)
+    }
+
+    private fun mergePrompts(base: PromptsConfig?, override: PromptsConfig?): PromptsConfig? {
+        if (base == null) return override
+        if (override == null) return base
+        return PromptsConfig(
+            systemChat = override.systemChat ?: base.systemChat,
+            systemPlan = override.systemPlan ?: base.systemPlan,
+            systemAgent = override.systemAgent ?: base.systemAgent,
+            commands = mergeList(base.commands, override.commands),
+            rules = mergeList(base.rules, override.rules),
+        )
+    }
+
+    private fun <T> mergeList(base: List<T>?, override: List<T>?): List<T>? {
+        if (base == null) return override
+        if (override == null) return base
+        return base + override
+    }
+
+    private fun mergeMcp(base: McpConfig?, override: McpConfig?): McpConfig? {
+        if (base == null) return override
+        if (override == null) return base
+        val mergedServers = (base.servers ?: emptyList()) + (override.servers ?: emptyList())
+        return McpConfig(servers = mergedServers.distinctBy { it.id })
+    }
+
+    private fun mergeHooks(base: HooksConfig?, override: HooksConfig?): HooksConfig? {
+        if (base == null) return override
+        if (override == null) return base
+        return HooksConfig(
+            beforeTurnLoop = override.beforeTurnLoop ?: base.beforeTurnLoop,
+            afterTurnLoop = override.afterTurnLoop ?: base.afterTurnLoop,
+            beforeTool = override.beforeTool ?: base.beforeTool,
+            afterTool = override.afterTool ?: base.afterTool,
+            onAgentComplete = override.onAgentComplete ?: base.onAgentComplete,
+            onAgentError = override.onAgentError ?: base.onAgentError,
+        )
+    }
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/config/ConfigYamlModel.kt b/core/src/main/kotlin/pl/jclab/refio/core/config/ConfigYamlModel.kt
new file mode 100644
index 00000000..7b909835
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/config/ConfigYamlModel.kt
@@ -0,0 +1,234 @@
+package pl.jclab.refio.core.config
+
+import kotlinx.serialization.SerialName
+import kotlinx.serialization.Serializable
+
+@Serializable
+data class GeneralConfig(
+    val formatMarkdown: Boolean? = null,
+    val streamingEnabled: Boolean? = null,
+    val advancedView: Boolean? = null
+)
+
+@Serializable
+data class ProvidersConfig(
+    val ollama: OllamaConfig? = null,
+    val anthropic: AnthropicConfig? = null,
+    val openai: OpenAIConfig? = null,
+    val openrouter: OpenRouterConfig? = null,
+    val gemini: GeminiConfig? = null,
+    val lmstudio: LMStudioConfig? = null,
+    @SerialName("generic_openai")
+    val genericOpenai: GenericOpenAIConfig? = null,
+    val zai: ZAIConfig? = null
+)
+
+@Serializable
+data class OllamaConfig(
+    val endpoint: String? = null,
+    val contextSize: Int? = null,
+    val keepAlive: Int? = null
+)
+
+@Serializable
+data class AnthropicConfig(val apiKey: String? = null)
+
+@Serializable
+data class OpenAIConfig(val apiKey: String? = null)
+
+@Serializable
+data class OpenRouterConfig(val apiKey: String? = null)
+
+@Serializable
+data class GeminiConfig(val apiKey: String? = null)
+
+@Serializable
+data class LMStudioConfig(
+    val apiKey: String? = null,
+    val baseUrl: String? = null,
+    val contextSize: Int? = null
+)
+
+@Serializable
+data class GenericOpenAIConfig(
+    val apiKey: String? = null,
+    val baseUrl: String? = null,
+    val model: String? = null
+)
+
+@Serializable
+data class ZAIConfig(
+    val apiKey: String? = null,
+    val baseUrl: String? = null
+)
+
+@Serializable
+data class ModelsConfig(
+    val default: String? = null,
+    val defaults: ModelDefaultsConfig? = null,
+    val visibility: Map<String, Boolean>? = null,
+    val presets: List<ModelPresetConfig>? = null
+)
+
+@Serializable
+data class ModelDefaultsConfig(
+    val chat: String? = null,
+    val plan: String? = null,
+    val coding: String? = null,
+    val weak: String? = null,
+    val embedding: String? = null,
+    val strong: String? = null
+)
+
+@Serializable
+data class ModelPresetConfig(
+    val name: String,
+    val description: String? = null,
+    val defaultModel: String,
+    val planModel: String? = null,
+    val codingModel: String? = null,
+    val weakModel: String? = null,
+    val strongModel: String? = null,
+    val visibleModels: List<String>? = null
+)
+
+@Serializable
+data class LimitsConfig(
+    val apiCallTimeout: Int? = null,
+    val toolExecutionTimeout: Int? = null,
+    val streamingReadTimeout: Int? = null,
+    val streamingRequestTimeout: Int? = null,
+    val maxContextSize: Int? = null,
+    val maxOutputSize: Int? = null,
+    val maxFileSize: Int? = null
+)
+
+@Serializable
+data class AdvancedConfig(
+    val noEgressDefault: Boolean? = null,
+    val readOnlyMode: Boolean? = null,
+    val autoOptimizePercentage: Int? = null,
+    val orchestrationEnabled: Boolean? = null
+)
+
+@Serializable
+data class ToolsConfig(val permissions: Map<String, ToolPermissionConfig>? = null)
+
+@Serializable
+data class ToolPermissionConfig(
+    val planMode: String? = null,
+    val agentMode: String? = null
+)
+
+@Serializable
+data class RagConfig(
+    val enabled: Boolean? = null,
+    val indexOnStartup: Boolean? = null,
+    val autoIndexOnContextBuild: Boolean? = null,
+    val maxFileSizeMB: Long? = null,
+    val maxChunksPerFile: Int? = null,
+    val indexBatchSize: Int? = null,
+    val embeddingsBatchSize: Int? = null,
+    val cacheTtlMs: Long? = null,
+    val maxConcurrentJobs: Int? = null,
+    val ignoredDirectories: List<String>? = null,
+    val searchSimilarityThreshold: Float? = null,
+    val searchTopK: Int? = null,
+    val searchHybridEnabled: Boolean? = null,
+    val searchSemanticWeight: Float? = null,
+    val searchIncludeContextChunks: Boolean? = null
+)
+
+@Serializable
+data class UiConfig(
+    val thinkingEnabled: Boolean? = null,
+    val noEgressEnabled: Boolean? = null,
+    val executionMode: String? = null,
+    val selectedMode: String? = null,
+    val selectedModel: String? = null
+)
+
+@Serializable
+data class PromptsConfig(
+    val systemChat: String? = null,
+    val systemPlan: String? = null,
+    val systemAgent: String? = null,
+    val commands: List<CommandConfig>? = null,
+    val rules: List<RuleConfig>? = null
+)
+
+@Serializable
+data class CommandConfig(
+    val name: String,
+    val description: String? = null,
+    val content: String,
+    val enabled: Boolean = true
+)
+
+@Serializable
+data class RuleConfig(
+    val name: String,
+    val content: String,
+    val enabled: Boolean = true
+)
+
+@Serializable
+data class McpConfig(val servers: List<McpServerConfig>? = null)
+
+@Serializable
+data class McpServerConfig(
+    val id: String,
+    val displayName: String? = null,
+    val description: String? = null,
+    val type: String = "STDIO",
+    val command: String? = null,
+    val args: List<String>? = null,
+    val workingDirectory: String? = null,
+    val url: String? = null,
+    val accessMode: String = "READ",
+    val enabled: Boolean = true,
+    val env: List<McpEnvConfig>? = null,
+    val httpHeaders: List<McpHeaderConfig>? = null,
+    val timeout: Int? = null,
+    val retryAttempts: Int? = null
+)
+
+@Serializable
+data class McpEnvConfig(
+    val name: String,
+    val value: String,
+    val isSecret: Boolean = false
+)
+
+@Serializable
+data class McpHeaderConfig(
+    val name: String,
+    val value: String,
+    val isSecret: Boolean = false
+)
+
+@Serializable
+data class HooksConfig(
+    @SerialName("before_turn_loop")
+    val beforeTurnLoop: List<HookDefinition>? = null,
+    @SerialName("after_turn_loop")
+    val afterTurnLoop: List<HookDefinition>? = null,
+    @SerialName("before_tool")
+    val beforeTool: List<HookDefinition>? = null,
+    @SerialName("after_tool")
+    val afterTool: List<HookDefinition>? = null,
+    @SerialName("on_agent_complete")
+    val onAgentComplete: List<HookDefinition>? = null,
+    @SerialName("on_agent_error")
+    val onAgentError: List<HookDefinition>? = null
+)
+
+@Serializable
+data class HookDefinition(
+    val action: String,
+    val command: String? = null,
+    val message: String? = null,
+    val match: String? = null,
+    val modes: List<String>? = null,
+    val timeout: Long? = null
+)
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/config/HierarchicalConfigLoader.kt b/core/src/main/kotlin/pl/jclab/refio/core/config/HierarchicalConfigLoader.kt
index 76bcf09f..deb91d21 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/config/HierarchicalConfigLoader.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/config/HierarchicalConfigLoader.kt
@@ -110,9 +110,9 @@ class HierarchicalConfigLoader private constructor(
     fun getLMStudioApiKey(): String? = getConfig().providers?.lmstudio?.apiKey
     fun getLMStudioBaseUrl(): String? = getConfig().providers?.lmstudio?.baseUrl
     fun getLMStudioContextSize(): Int? = getConfig().providers?.lmstudio?.contextSize
-    fun getCustomOpenAIApiKey(): String? = getConfig().providers?.customOpenai?.apiKey
-    fun getCustomOpenAIBaseUrl(): String? = getConfig().providers?.customOpenai?.baseUrl
-    fun getCustomOpenAIModel(): String? = getConfig().providers?.customOpenai?.model
+    fun getGenericOpenAIApiKey(): String? = getConfig().providers?.genericOpenai?.apiKey
+    fun getGenericOpenAIBaseUrl(): String? = getConfig().providers?.genericOpenai?.baseUrl
+    fun getGenericOpenAIModel(): String? = getConfig().providers?.genericOpenai?.model
     fun getZAIApiKey(): String? = getConfig().providers?.zai?.apiKey
     fun getZAIBaseUrl(): String? = getConfig().providers?.zai?.baseUrl
 
@@ -167,14 +167,6 @@ class HierarchicalConfigLoader private constructor(
         return getConfig().tools?.permissions
     }
 
-    fun getTerminalWhitelist(): TerminalWhitelistConfig? {
-        return getConfig().terminal?.whitelist
-    }
-
-    fun getTerminalWhitelistEnabled(): Boolean? = getConfig().terminal?.whitelist?.enabled
-
-    fun getTerminalWhitelistMode(): String? = getConfig().terminal?.whitelist?.mode
-
     // ═══════════════════════════════════════════════════════════════════════════════
     // RAG Settings
     // ═══════════════════════════════════════════════════════════════════════════════
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/config/YamlSanitizer.kt b/core/src/main/kotlin/pl/jclab/refio/core/config/YamlSanitizer.kt
new file mode 100644
index 00000000..fc689568
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/config/YamlSanitizer.kt
@@ -0,0 +1,152 @@
+package pl.jclab.refio.core.config
+
+/**
+ * Fallback YAML sanitization used when Kaml's strict parse fails.
+ *
+ * Handles two historical problems in user-authored `~/.refio/config.yaml` files:
+ *  - invalid double-quoted escape sequences (e.g. `"C:\Users"` with an
+ *    unescaped backslash) — we double the backslash so Kaml accepts it.
+ *  - broken `''` list items that some legacy emitters produced without a
+ *    preceding `-` marker — we re-indent them as proper list entries.
+ *
+ * Kept out of [ConfigYaml] so the data class doesn't drown in string-bashing.
+ */
+internal object YamlSanitizer {
+
+    /**
+     * Escape any raw `\` inside a double-quoted scalar whose following char
+     * is not a valid YAML escape indicator. Leaves everything else untouched.
+     */
+    fun sanitizeInvalidDoubleQuotedEscapes(input: String): String {
+        val out = StringBuilder(input.length + 32)
+        var inDoubleQuoted = false
+        var inSingleQuoted = false
+        var inComment = false
+        var i = 0
+
+        while (i < input.length) {
+            val ch = input[i]
+
+            if (inComment) {
+                out.append(ch)
+                if (ch == '\n') {
+                    inComment = false
+                }
+                i++
+                continue
+            }
+
+            if (inSingleQuoted) {
+                out.append(ch)
+                if (ch == '\'') {
+                    if (i + 1 < input.length && input[i + 1] == '\'') {
+                        out.append('\'')
+                        i += 2
+                        continue
+                    }
+                    inSingleQuoted = false
+                }
+                i++
+                continue
+            }
+
+            if (inDoubleQuoted) {
+                if (ch == '"') {
+                    inDoubleQuoted = false
+                    out.append(ch)
+                    i++
+                    continue
+                }
+
+                if (ch == '\\') {
+                    val next = input.getOrNull(i + 1)
+                    if (next == null || !isValidYamlEscape(input, i + 1)) {
+                        out.append("\\\\")
+                        i++
+                        continue
+                    }
+                }
+
+                out.append(ch)
+                i++
+                continue
+            }
+
+            when (ch) {
+                '#' -> inComment = true
+                '"' -> inDoubleQuoted = true
+                '\'' -> inSingleQuoted = true
+            }
+            out.append(ch)
+            i++
+        }
+
+        return out.toString()
+    }
+
+    /**
+     * Repair lonely `''` lines (broken "empty scalar in a list" output) by
+     * making them explicit `- ''` entries at the correct indent. When the
+     * input has no such lines, returns [input] unchanged to preserve identity.
+     */
+    fun sanitizeBrokenStandaloneEmptyQuotedLines(input: String): String {
+        val lines = input.split('\n')
+        val out = ArrayList<String>(lines.size)
+
+        var previousSignificant: String? = null
+        var lastListIndent: Int? = null
+        var changed = false
+
+        for (line in lines) {
+            val trimmed = line.trim()
+
+            if (trimmed == "''") {
+                val replacement = when {
+                    lastListIndent != null -> "${" ".repeat(lastListIndent)}- ''"
+                    previousSignificant?.trimEnd()?.endsWith(":") == true -> {
+                        val baseIndent = previousSignificant.takeWhile { it == ' ' }.length
+                        "${" ".repeat(baseIndent + 2)}- ''"
+                    }
+                    else -> "- ''"
+                }
+                out.add(replacement)
+                previousSignificant = replacement
+                lastListIndent = replacement.takeWhile { it == ' ' }.length
+                changed = true
+                continue
+            }
+
+            out.add(line)
+            if (trimmed.isNotEmpty() && !trimmed.startsWith("#")) {
+                previousSignificant = line
+                if (trimmed.startsWith("- ")) {
+                    lastListIndent = line.takeWhile { it == ' ' }.length
+                } else if (trimmed.endsWith(":")) {
+                    lastListIndent = null
+                }
+            }
+        }
+
+        if (!changed) return input
+        return out.joinToString("\n")
+    }
+
+    private fun isValidYamlEscape(input: String, escapeCharIndex: Int): Boolean {
+        val escapeChar = input.getOrNull(escapeCharIndex) ?: return false
+        return when (escapeChar) {
+            '0', 'a', 'b', 't', 'n', 'v', 'f', 'r', 'e', ' ', '"', '/', '\\', 'N', '_', 'L', 'P' -> true
+            'x' -> hasHexDigits(input, escapeCharIndex + 1, 2)
+            'u' -> hasHexDigits(input, escapeCharIndex + 1, 4)
+            'U' -> hasHexDigits(input, escapeCharIndex + 1, 8)
+            else -> false
+        }
+    }
+
+    private fun hasHexDigits(input: String, start: Int, length: Int): Boolean {
+        if (start + length > input.length) return false
+        for (idx in start until start + length) {
+            if (!input[idx].isDigit() && input[idx].lowercaseChar() !in 'a'..'f') return false
+        }
+        return true
+    }
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/db/PromptsTable.kt b/core/src/main/kotlin/pl/jclab/refio/core/db/PromptsTable.kt
index 21ed05e8..aed244b5 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/db/PromptsTable.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/db/PromptsTable.kt
@@ -34,7 +34,7 @@ enum class PromptType {
 
     // ========== User-Defined Content ==========
     RULE,                   // User-defined rule (appended to system prompts)
-    SLASH_COMMAND           // User-defined slash command (triggered by /command)
+    SLASH_PROMPT            // User-defined slash prompt template (triggered by /name)
     ;
 
     companion object {
@@ -58,12 +58,12 @@ enum class PromptType {
 
     fun isSystemPrompt(): Boolean = SYSTEM_PROMPT_TYPES.contains(this)
 
-    fun isSlashCommand(): Boolean = this == SLASH_COMMAND
+    fun isSlashPrompt(): Boolean = this == SLASH_PROMPT
 }
 
 /**
  * Prompts table definition using Exposed ORM DSL
- * Stores system prompts, rules, and slash commands
+ * Stores system prompts, rules, and slash prompts
  */
 object PromptsTable : Table("prompts") {
     val id = varchar("id", 36).clientDefault { UUID.randomUUID().toString() }
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/db/migrations/MigrationRunner.kt b/core/src/main/kotlin/pl/jclab/refio/core/db/migrations/MigrationRunner.kt
index 1577c648..c8e5a1a6 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/db/migrations/MigrationRunner.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/db/migrations/MigrationRunner.kt
@@ -17,6 +17,7 @@ object MigrationRunner {
         // SeedTestDataMigration removed from production runtime (Phase 7 refactor).
         // Seed data was for UI development only. Existing databases retain v1 data.
         V2DropAgentEventsSessionFk(),
+        V3RenameSlashCommandToSlashPrompt(),
     )
 
     fun run(database: Database) {
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/db/migrations/V3RenameSlashCommandToSlashPrompt.kt b/core/src/main/kotlin/pl/jclab/refio/core/db/migrations/V3RenameSlashCommandToSlashPrompt.kt
new file mode 100644
index 00000000..82900747
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/db/migrations/V3RenameSlashCommandToSlashPrompt.kt
@@ -0,0 +1,31 @@
+package pl.jclab.refio.core.db.migrations
+
+import org.jetbrains.exposed.sql.Database
+import org.jetbrains.exposed.sql.transactions.transaction
+import pl.jclab.refio.core.logging.dualLogger
+
+private val logger = dualLogger("V3RenameSlashCommandToSlashPrompt")
+
+/**
+ * Renames PromptType value `SLASH_COMMAND` to `SLASH_PROMPT` for existing rows.
+ *
+ * The feature was renamed from "slash commands" to "slash prompts" because these
+ * are reusable prompt templates invoked via `/name`, not shell/CLI commands.
+ * Existing databases stored the enum as its string name in `prompts.type`, so
+ * we remap them in-place to keep user data (both built-in and custom entries).
+ */
+class V3RenameSlashCommandToSlashPrompt : Migration {
+    override val version: Int = 3
+
+    override fun migrate(database: Database) {
+        transaction(database) {
+            val jdbc = (connection.connection as java.sql.Connection)
+            jdbc.createStatement().use { st ->
+                val updated = st.executeUpdate(
+                    "UPDATE prompts SET type = 'SLASH_PROMPT' WHERE type = 'SLASH_COMMAND'"
+                )
+                logger.info { "Renamed SLASH_COMMAND → SLASH_PROMPT for $updated row(s)" }
+            }
+        }
+    }
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/db/repositories/ChatMessageRepository.kt b/core/src/main/kotlin/pl/jclab/refio/core/db/repositories/ChatMessageRepository.kt
index caff8cd0..ad5470f9 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/db/repositories/ChatMessageRepository.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/db/repositories/ChatMessageRepository.kt
@@ -76,7 +76,9 @@ class ChatMessageRepository {
         result: String,
         isSummarized: Boolean = false,
         rawOutput: String? = null,
-        metadata: String? = null
+        metadata: String? = null,
+        agentName: String? = null,
+        agentDepth: Int? = null,
     ): ChatMessage {
         return create(
             taskId = taskId,
@@ -86,7 +88,9 @@ class ChatMessageRepository {
             toolCallId = toolCallId,
             subtaskId = subtaskId,
             isSummarized = isSummarized,
-            rawOutput = rawOutput
+            rawOutput = rawOutput,
+            agentName = agentName,
+            agentDepth = agentDepth,
         )
     }
 
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/db/repositories/ConfigRepository.kt b/core/src/main/kotlin/pl/jclab/refio/core/db/repositories/ConfigRepository.kt
index 3b95b335..cb1006d8 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/db/repositories/ConfigRepository.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/db/repositories/ConfigRepository.kt
@@ -9,10 +9,12 @@ import org.jetbrains.exposed.exceptions.ExposedSQLException
 import org.jetbrains.exposed.sql.deleteWhere
 import org.jetbrains.exposed.sql.insert
 import org.jetbrains.exposed.sql.selectAll
+import org.jetbrains.exposed.sql.transactions.TransactionManager
 import org.jetbrains.exposed.sql.transactions.transaction
 import org.jetbrains.exposed.sql.update
 import org.jetbrains.exposed.sql.and
 import pl.jclab.refio.core.db.Config
+import pl.jclab.refio.core.db.DatabaseFactory
 import pl.jclab.refio.core.db.ConfigScope
 import pl.jclab.refio.core.db.ConfigTable
 import pl.jclab.refio.core.logging.dualLogger
@@ -60,6 +62,10 @@ class ConfigRepository {
     }
 
     fun get(key: String, scope: ConfigScope, projectId: String? = null, taskId: String? = null): Config? {
+        if (!canAccessConfigTable()) {
+            logger.debug { "Skipping config lookup before database init: key=$key, scope=$scope" }
+            return null
+        }
         return try {
             transaction {
                 validateScope(scope, projectId, taskId)
@@ -80,6 +86,10 @@ class ConfigRepository {
     }
 
     fun getWithPrecedence(key: String, taskId: String? = null, projectId: String? = null): Config? {
+        if (!canAccessConfigTable()) {
+            logger.debug { "Skipping precedence config lookup before database init: key=$key" }
+            return null
+        }
         return try {
             transaction {
                 logger.debug { "[ORCHESTRATION-DEBUG] getWithPrecedence: key=$key, taskId=$taskId, projectId=$projectId" }
@@ -115,6 +125,10 @@ class ConfigRepository {
     }
 
     fun findByScope(scope: ConfigScope, projectId: String? = null, taskId: String? = null): List<Config> {
+        if (!canAccessConfigTable()) {
+            logger.debug { "Skipping config scope lookup before database init: scope=$scope" }
+            return emptyList()
+        }
         return try {
             transaction {
                 validateScope(scope, projectId, taskId)
@@ -143,6 +157,10 @@ class ConfigRepository {
         projectId: String? = null,
         taskId: String? = null
     ): List<Config> {
+        if (!canAccessConfigTable()) {
+            logger.debug { "Skipping config search before database init: pattern=$keyPattern" }
+            return emptyList()
+        }
         return try {
             transaction {
                 val conditions = mutableListOf<Op<Boolean>>(ConfigTable.key like keyPattern)
@@ -197,6 +215,10 @@ class ConfigRepository {
     }
 
     fun count(scope: ConfigScope? = null, projectId: String? = null, taskId: String? = null): Long {
+        if (!canAccessConfigTable()) {
+            logger.debug { "Skipping config count before database init" }
+            return 0L
+        }
         return try {
             transaction {
                 val conditions = mutableListOf<Op<Boolean>>()
@@ -286,6 +308,21 @@ class ConfigRepository {
         }
     }
 
+    private fun canAccessConfigTable(): Boolean {
+        if (DatabaseFactory.isInitialized()) return true
+        if (TransactionManager.defaultDatabase == null) return false
+
+        return try {
+            transaction {
+                exec("SELECT 1 FROM sqlite_master WHERE type = 'table' AND name = 'config' LIMIT 1") { rs ->
+                    rs.next()
+                } ?: false
+            }
+        } catch (_: Exception) {
+            false
+        }
+    }
+
     private fun isDatabaseNotReady(error: Throwable): Boolean {
         return generateSequence(error) { it.cause }.any { cause ->
             when (cause) {
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/db/repositories/PromptsRepository.kt b/core/src/main/kotlin/pl/jclab/refio/core/db/repositories/PromptsRepository.kt
index 9b37efc4..9d3cfe1e 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/db/repositories/PromptsRepository.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/db/repositories/PromptsRepository.kt
@@ -11,7 +11,7 @@ private val logger = dualLogger("PromptsRepository")
 
 /**
  * Repository for Prompts database operations
- * Manages system prompts, rules, and slash commands
+ * Manages system prompts, rules, and slash prompts
  */
 class PromptsRepository {
 
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/errors/RefioError.kt b/core/src/main/kotlin/pl/jclab/refio/core/errors/RefioError.kt
index 6fb68fa1..aab86ba6 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/errors/RefioError.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/errors/RefioError.kt
@@ -37,4 +37,20 @@ sealed class RefioError(
         val provider: String,
         val key: String
     ) : RefioError("Provider '$provider' is not configured. Missing: $key")
+
+    /**
+     * Thrown when provider returns a response that doesn't match expected structure.
+     *
+     * `bodyPreview` is the first ~500 chars of raw JSON body (for debugging).
+     */
+    class MalformedResponse(
+        val provider: String,
+        val model: String,
+        val reason: String,
+        val bodyPreview: String,
+        cause: Throwable? = null,
+    ) : RefioError(
+        "Malformed response from $provider/$model: $reason. Preview: ${bodyPreview.take(500)}",
+        cause,
+    )
 }
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/llm/Base.kt b/core/src/main/kotlin/pl/jclab/refio/core/llm/Base.kt
index c714eebc..01a6ac01 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/llm/Base.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/llm/Base.kt
@@ -2,7 +2,7 @@ package pl.jclab.refio.core.llm
 
 import kotlinx.coroutines.flow.Flow
 import kotlinx.coroutines.flow.flow
-import pl.jclab.refio.core.services.ConfigService
+import pl.jclab.refio.core.config.ConfigKeys
 
 /**
  * Base classes for LLM adapters in Refio.
@@ -91,7 +91,7 @@ abstract class BaseLLMAdapter(
     val model: String,
     val provider: String
 ) {
-    protected fun toOpenAiMessageContent(message: LLMMessage): Any {
+    internal fun toOpenAiMessageContent(message: LLMMessage): Any {
         val normalizedParts = normalizeMessageParts(message)
         if (normalizedParts.size == 1 && normalizedParts.first() is LLMContentPart.Text) {
             return (normalizedParts.first() as LLMContentPart.Text).text
@@ -267,7 +267,7 @@ abstract class BaseLLMAdapter(
             if (maxTokensKey != null) {
                 val baseMaxTokens = (mutableParams[maxTokensKey] as? Number)?.toInt()
                     ?: definition.maxOutputTokens
-                    ?: ConfigService.DEFAULT_MAX_OUTPUT_SIZE
+                    ?: ConfigKeys.MAX_OUTPUT_SIZE.default
                 val adjustedMaxTokens = (baseMaxTokens * definition.reasoningTokensMultiplier).toInt()
                 mutableParams[maxTokensKey] = adjustedMaxTokens
             }
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/llm/HttpClientConfig.kt b/core/src/main/kotlin/pl/jclab/refio/core/llm/HttpClientConfig.kt
index bd415ed6..ebd398d9 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/llm/HttpClientConfig.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/llm/HttpClientConfig.kt
@@ -1,5 +1,7 @@
 package pl.jclab.refio.core.llm
 
+import pl.jclab.refio.core.config.ConfigKeys
+
 /**
  * HTTP client configuration for LLM adapters.
  * Loaded from database (ConfigService) and passed to adapters.
@@ -57,13 +59,13 @@ data class HttpClientConfig(
             }
 
             // Load timeout values (stored in seconds, convert to ms)
-            val apiCallTimeoutSec = configService.get(pl.jclab.refio.core.services.ConfigService.KEY_API_CALL_TIMEOUT)?.toIntOrNull() ?: 60
-            val toolTimeoutSec = configService.get(pl.jclab.refio.core.services.ConfigService.KEY_TOOL_EXECUTION_TIMEOUT)?.toIntOrNull() ?: 240
+            val apiCallTimeoutSec = configService.get(ConfigKeys.API_CALL_TIMEOUT.key)?.toIntOrNull() ?: 60
+            val toolTimeoutSec = configService.get(ConfigKeys.TOOL_EXECUTION_TIMEOUT.key)?.toIntOrNull() ?: 240
 
             // Load other limits
-            val maxRetries = configService.get(pl.jclab.refio.core.services.ConfigService.KEY_MAX_RETRIES)?.toIntOrNull() ?: 3
-            val retryDelay = configService.get(pl.jclab.refio.core.services.ConfigService.KEY_RETRY_DELAY_MS)?.toLongOrNull() ?: 1000
-            val rateLimit = configService.get(pl.jclab.refio.core.services.ConfigService.KEY_RATE_LIMIT_RPM)?.toIntOrNull() ?: 60
+            val maxRetries = configService.get(ConfigKeys.MAX_RETRIES.key)?.toIntOrNull() ?: 3
+            val retryDelay = configService.get(ConfigKeys.RETRY_DELAY_MS.key)?.toLongOrNull() ?: 1000
+            val rateLimit = configService.get(ConfigKeys.RATE_LIMIT_RPM.key)?.toIntOrNull() ?: 60
 
             return HttpClientConfig(
                 requestTimeoutMs = apiCallTimeoutSec.toLong() * 1000,  // Convert sec to ms
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/llm/JsonExtractor.kt b/core/src/main/kotlin/pl/jclab/refio/core/llm/JsonExtractor.kt
index 1eb47786..b7d54ec3 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/llm/JsonExtractor.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/llm/JsonExtractor.kt
@@ -192,7 +192,7 @@ object JsonExtractor {
                 ?: "Execution plan"
         }
 
-        logger.info { "[JSON] Normalized to ${(subtasks as? List<*>)?.size ?: 0} subtasks" }
+        logger.info { "[JSON] Normalized to ${subtasks.size} subtasks" }
 
         return mapOf(
             "plan" to planDescription,
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/llm/LLMClient.kt b/core/src/main/kotlin/pl/jclab/refio/core/llm/LLMClient.kt
index 1c1bc811..a25843fb 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/llm/LLMClient.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/llm/LLMClient.kt
@@ -4,7 +4,7 @@ import pl.jclab.refio.core.api.StreamCallback
 import pl.jclab.refio.core.api.StreamChunk
 import pl.jclab.refio.core.errors.RefioError
 import pl.jclab.refio.core.llm.adapters.AnthropicAdapter
-import pl.jclab.refio.core.llm.adapters.CustomOpenAIAdapter
+import pl.jclab.refio.core.llm.adapters.GenericOpenAIAdapter
 import pl.jclab.refio.core.llm.adapters.GeminiAdapter
 import pl.jclab.refio.core.llm.adapters.LMStudioAdapter
 import pl.jclab.refio.core.llm.adapters.OllamaAdapter
@@ -194,7 +194,7 @@ class LLMClient(
             // Even for local providers, verify the endpoint is actually localhost
             val endpoint = when (providerLower) {
                 "ollama" -> configService?.getTyped(pl.jclab.refio.core.config.ConfigKeys.PROVIDER_OLLAMA_ENDPOINT) ?: "http://localhost:11434"
-                "lmstudio" -> configService?.get(pl.jclab.refio.core.services.ConfigService.KEY_PROVIDER_LM_STUDIO_BASE_URL) ?: "http://localhost:1234"
+                "lmstudio" -> configService?.get(pl.jclab.refio.core.config.ConfigKeys.PROVIDER_LM_STUDIO_BASE_URL.key) ?: "http://localhost:1234"
                 else -> ""
             }
             if (endpoint.isNotBlank() && !isLocalEndpoint(endpoint)) {
@@ -293,7 +293,14 @@ class LLMClient(
             // Provider-agnostic guardrails — detect repetition loops, runaway output
             // size, and wall-clock deadlines. Instantiated per-request (stateful).
             // See core/llm/streaming/StreamGuardrails.kt for details.
-            val guardrails = if (stream) StreamGuardrails.defaults() else null
+            val guardrails = if (stream) {
+                val streamingTimeoutSec = configService?.getTyped(
+                    pl.jclab.refio.core.config.ConfigKeys.STREAMING_REQUEST_TIMEOUT
+                ) ?: pl.jclab.refio.core.config.ConfigKeys.STREAMING_REQUEST_TIMEOUT.default
+                // Wall clock = 90% of streaming timeout (10% buffer for cleanup/logging)
+                val wallClockMs = (streamingTimeoutSec * 900L).coerceIn(60_000, 1_800_000)
+                StreamGuardrails.defaults(wallClockMs)
+            } else null
 
             val streamCallback: ((pl.jclab.refio.core.llm.StreamChunk) -> Unit)? = if (stream) { llmChunk ->
                 contentBuilder.append(llmChunk.delta)
@@ -444,7 +451,7 @@ class LLMClient(
                 "anthropic" -> AnthropicAdapter(model = model, configService = configService, taskId = taskId, subtaskId = subtaskId, source = source)
                 "gemini" -> GeminiAdapter(model = model, configService = configService, taskId = taskId, subtaskId = subtaskId, source = source)
                 "lmstudio" -> LMStudioAdapter(model = model, configService = configService, taskId = taskId, subtaskId = subtaskId, source = source)
-                "custom_openai" -> CustomOpenAIAdapter(model = model, providerName = "custom_openai", configService = configService, taskId = taskId, subtaskId = subtaskId, source = source)
+                "generic_openai" -> GenericOpenAIAdapter(model = model, providerName = "generic_openai", configService = configService, taskId = taskId, subtaskId = subtaskId, source = source)
                 "zai" -> ZAIAdapter(model = model, configService = configService, taskId = taskId, subtaskId = subtaskId, source = source)
                 "openrouter" -> OpenRouterAdapter(model = model, configService = configService, taskId = taskId, subtaskId = subtaskId, source = source)
                 else -> {
@@ -483,7 +490,7 @@ class LLMClient(
      * Get list of supported providers.
      */
     fun getSupportedProviders(): List<String> {
-        return listOf("ollama", "openai", "anthropic", "gemini", "openrouter", "lmstudio", "custom_openai", "zai")
+        return listOf("ollama", "openai", "anthropic", "gemini", "openrouter", "lmstudio", "generic_openai", "zai")
     }
 
     /**
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/llm/LLMMessageMapper.kt b/core/src/main/kotlin/pl/jclab/refio/core/llm/LLMMessageMapper.kt
index 3f3be8d8..4fc1b3cc 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/llm/LLMMessageMapper.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/llm/LLMMessageMapper.kt
@@ -74,8 +74,8 @@ object LLMMessageMapper {
             parts = listOf(
                 LLMContentPart.Text(textContent),
                 LLMContentPart.Image(
-                    mediaType = mediaType!!,
-                    base64Data = base64Data!!,
+                    mediaType = mediaType,
+                    base64Data = base64Data,
                     detail = "auto"
                 )
             )
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/llm/ModelDefinitions.kt b/core/src/main/kotlin/pl/jclab/refio/core/llm/ModelDefinitions.kt
index c9f424ad..ca6e686d 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/llm/ModelDefinitions.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/llm/ModelDefinitions.kt
@@ -1020,6 +1020,52 @@ object ModelDefinitions {
      */
     val ANTHROPIC_MODELS = mapOf(
         // Sonnet models (latest, most capable)
+        "claude-sonnet-4-6" to ModelDefinition(
+            id = "claude-sonnet-4-6",
+            name = "Claude Sonnet 4.6",
+            provider = "anthropic",
+            description = "Best combination of speed and intelligence with extended and adaptive thinking, 1M context",
+            capabilities = listOf(
+                ModelCapability.CHAT_COMPLETION,
+                ModelCapability.TEXT_COMPLETION,
+                ModelCapability.VISION
+            ),
+            modelType = ModelType.MULTIMODAL,
+            maxContext = 1_000_000,
+            maxOutputTokens = 64_000,
+            costPer1MInput = 3.00,
+            costPer1MOutput = 15.00,
+            supportsVision = true,
+            supportsReasoning = false,
+            supportsStreaming = true,
+            supportsFunctionCalling = true,
+            supportsThinking = true,
+            active = true
+        ),
+
+        "anthropic.claude-sonnet-4-6" to ModelDefinition(
+            id = "anthropic.claude-sonnet-4-6",
+            name = "Claude Sonnet 4.6 (Bedrock)",
+            provider = "anthropic",
+            description = "AWS Bedrock alias for Claude Sonnet 4.6",
+            capabilities = listOf(
+                ModelCapability.CHAT_COMPLETION,
+                ModelCapability.TEXT_COMPLETION,
+                ModelCapability.VISION
+            ),
+            modelType = ModelType.MULTIMODAL,
+            maxContext = 1_000_000,
+            maxOutputTokens = 64_000,
+            costPer1MInput = 3.00,
+            costPer1MOutput = 15.00,
+            supportsVision = true,
+            supportsReasoning = false,
+            supportsStreaming = true,
+            supportsFunctionCalling = true,
+            supportsThinking = true,
+            active = true
+        ),
+
         "claude-sonnet-4-5-20250929" to ModelDefinition(
             id = "claude-sonnet-4-5-20250929",
             name = "Claude Sonnet 4.5",
@@ -1159,6 +1205,52 @@ object ModelDefinitions {
         ),
 
         // Opus models (premium intelligence)
+        "claude-opus-4-7" to ModelDefinition(
+            id = "claude-opus-4-7",
+            name = "Claude Opus 4.7",
+            provider = "anthropic",
+            description = "Most capable Claude model for complex reasoning and agentic coding, 1M context with adaptive thinking",
+            capabilities = listOf(
+                ModelCapability.CHAT_COMPLETION,
+                ModelCapability.TEXT_COMPLETION,
+                ModelCapability.VISION
+            ),
+            modelType = ModelType.MULTIMODAL,
+            maxContext = 1_000_000,
+            maxOutputTokens = 128_000,
+            costPer1MInput = 5.00,
+            costPer1MOutput = 25.00,
+            supportsVision = true,
+            supportsReasoning = false,
+            supportsStreaming = true,
+            supportsFunctionCalling = true,
+            supportsThinking = true,
+            active = true
+        ),
+
+        "anthropic.claude-opus-4-7" to ModelDefinition(
+            id = "anthropic.claude-opus-4-7",
+            name = "Claude Opus 4.7 (Bedrock)",
+            provider = "anthropic",
+            description = "AWS Bedrock alias for Claude Opus 4.7 (research preview on Bedrock)",
+            capabilities = listOf(
+                ModelCapability.CHAT_COMPLETION,
+                ModelCapability.TEXT_COMPLETION,
+                ModelCapability.VISION
+            ),
+            modelType = ModelType.MULTIMODAL,
+            maxContext = 1_000_000,
+            maxOutputTokens = 128_000,
+            costPer1MInput = 5.00,
+            costPer1MOutput = 25.00,
+            supportsVision = true,
+            supportsReasoning = false,
+            supportsStreaming = true,
+            supportsFunctionCalling = true,
+            supportsThinking = true,
+            active = true
+        ),
+
         "anthropic.claude-opus-4-6-v1" to ModelDefinition(
             id = "anthropic.claude-opus-4-6-v1",
             name = "Claude Opus 4.6",
@@ -2740,6 +2832,53 @@ object ModelDefinitions {
             active = true
         ),
 
+        // Qwen 3.6 - 35B MoE (3B active), 256K context, multimodal with tool use
+        "qwen3.6:latest" to ModelDefinition(
+            id = "qwen3.6:latest",
+            name = "Qwen 3.6 35B MoE",
+            provider = "ollama",
+            description = "Latest Qwen 3.6 MoE (35B total, 3B active) multimodal model with 256K context",
+            capabilities = listOf(
+                ModelCapability.CHAT_COMPLETION,
+                ModelCapability.VISION,
+                ModelCapability.TOOL_USE
+            ),
+            modelType = ModelType.MULTIMODAL,
+            maxContext = 256_000,
+            maxOutputTokens = null,
+            costPer1MInput = 0.0,
+            costPer1MOutput = 0.0,
+            supportsVision = true,
+            supportsReasoning = true,
+            supportsStreaming = true,
+            supportsFunctionCalling = true,
+            defaultParams = mapOf("temperature" to 0.7),
+            active = true
+        ),
+
+        "qwen3.6:35b-a3b" to ModelDefinition(
+            id = "qwen3.6:35b-a3b",
+            name = "Qwen 3.6 35B MoE (A3B)",
+            provider = "ollama",
+            description = "Qwen 3.6 MoE (35B total, 3B active) multimodal model with 256K context",
+            capabilities = listOf(
+                ModelCapability.CHAT_COMPLETION,
+                ModelCapability.VISION,
+                ModelCapability.TOOL_USE
+            ),
+            modelType = ModelType.MULTIMODAL,
+            maxContext = 256_000,
+            maxOutputTokens = null,
+            costPer1MInput = 0.0,
+            costPer1MOutput = 0.0,
+            supportsVision = true,
+            supportsReasoning = true,
+            supportsStreaming = true,
+            supportsFunctionCalling = true,
+            defaultParams = mapOf("temperature" to 0.7),
+            active = true
+        ),
+
         "qwen3-coder:30b" to ModelDefinition(
             id = "qwen3-coder:30b",
             name = "Qwen 3 Coder 30B",
@@ -3736,6 +3875,118 @@ object ModelDefinitions {
             active = true
         ),
 
+        // MedGemma - Gemma 3 variants trained for medical text and image comprehension, 128K context
+        "medgemma:latest" to ModelDefinition(
+            id = "medgemma:latest",
+            name = "MedGemma 4B",
+            provider = "ollama",
+            description = "Gemma 3 variant fine-tuned for medical text and image comprehension",
+            capabilities = listOf(
+                ModelCapability.CHAT_COMPLETION,
+                ModelCapability.VISION
+            ),
+            modelType = ModelType.MULTIMODAL,
+            maxContext = 128_000,
+            maxOutputTokens = null,
+            costPer1MInput = 0.0,
+            costPer1MOutput = 0.0,
+            supportsVision = true,
+            supportsReasoning = false,
+            supportsStreaming = true,
+            supportsFunctionCalling = false,
+            defaultParams = mapOf("temperature" to 0.7),
+            active = true
+        ),
+
+        "medgemma:4b" to ModelDefinition(
+            id = "medgemma:4b",
+            name = "MedGemma 4B",
+            provider = "ollama",
+            description = "4B Gemma 3 variant for medical text and image comprehension",
+            capabilities = listOf(
+                ModelCapability.CHAT_COMPLETION,
+                ModelCapability.VISION
+            ),
+            modelType = ModelType.MULTIMODAL,
+            maxContext = 128_000,
+            maxOutputTokens = null,
+            costPer1MInput = 0.0,
+            costPer1MOutput = 0.0,
+            supportsVision = true,
+            supportsReasoning = false,
+            supportsStreaming = true,
+            supportsFunctionCalling = false,
+            defaultParams = mapOf("temperature" to 0.7),
+            active = true
+        ),
+
+        "medgemma:27b" to ModelDefinition(
+            id = "medgemma:27b",
+            name = "MedGemma 27B",
+            provider = "ollama",
+            description = "27B Gemma 3 variant for medical text and image comprehension",
+            capabilities = listOf(
+                ModelCapability.CHAT_COMPLETION,
+                ModelCapability.VISION
+            ),
+            modelType = ModelType.MULTIMODAL,
+            maxContext = 128_000,
+            maxOutputTokens = null,
+            costPer1MInput = 0.0,
+            costPer1MOutput = 0.0,
+            supportsVision = true,
+            supportsReasoning = false,
+            supportsStreaming = true,
+            supportsFunctionCalling = false,
+            defaultParams = mapOf("temperature" to 0.7),
+            active = true
+        ),
+
+        // MedGemma 1.5 - updated MedGemma 4B with improved medical performance, 128K context
+        "medgemma1.5:latest" to ModelDefinition(
+            id = "medgemma1.5:latest",
+            name = "MedGemma 1.5 4B",
+            provider = "ollama",
+            description = "Updated MedGemma 4B with improved medical text and image comprehension",
+            capabilities = listOf(
+                ModelCapability.CHAT_COMPLETION,
+                ModelCapability.VISION
+            ),
+            modelType = ModelType.MULTIMODAL,
+            maxContext = 128_000,
+            maxOutputTokens = null,
+            costPer1MInput = 0.0,
+            costPer1MOutput = 0.0,
+            supportsVision = true,
+            supportsReasoning = false,
+            supportsStreaming = true,
+            supportsFunctionCalling = false,
+            defaultParams = mapOf("temperature" to 0.7),
+            active = true
+        ),
+
+        "medgemma1.5:4b" to ModelDefinition(
+            id = "medgemma1.5:4b",
+            name = "MedGemma 1.5 4B",
+            provider = "ollama",
+            description = "4B updated MedGemma with improved medical text and image comprehension",
+            capabilities = listOf(
+                ModelCapability.CHAT_COMPLETION,
+                ModelCapability.VISION
+            ),
+            modelType = ModelType.MULTIMODAL,
+            maxContext = 128_000,
+            maxOutputTokens = null,
+            costPer1MInput = 0.0,
+            costPer1MOutput = 0.0,
+            supportsVision = true,
+            supportsReasoning = false,
+            supportsStreaming = true,
+            supportsFunctionCalling = false,
+            defaultParams = mapOf("temperature" to 0.7),
+            active = true
+        ),
+
         "gemma2:2b" to ModelDefinition(
             id = "gemma2:2b",
             name = "Gemma 2 2B",
@@ -3802,6 +4053,62 @@ object ModelDefinitions {
             active = true
         ),
 
+        // ═══════════════════════════════════════════════════════════════════
+        // ZHIPU GLM FAMILY (Ollama cloud variants)
+        // ═══════════════════════════════════════════════════════════════════
+
+        // GLM-5.1 - next-generation flagship for agentic engineering with strong coding
+        "glm-5.1:cloud" to ModelDefinition(
+            id = "glm-5.1:cloud",
+            name = "GLM-5.1 (Cloud)",
+            provider = "ollama",
+            description = "Flagship agentic engineering model with strong coding capabilities, 198K context",
+            capabilities = listOf(
+                ModelCapability.CHAT_COMPLETION,
+                ModelCapability.CODE_COMPLETION,
+                ModelCapability.TOOL_USE
+            ),
+            modelType = ModelType.TEXT,
+            maxContext = 198_000,
+            maxOutputTokens = null,
+            costPer1MInput = 0.0,
+            costPer1MOutput = 0.0,
+            supportsVision = false,
+            supportsReasoning = true,
+            supportsStreaming = true,
+            supportsFunctionCalling = true,
+            defaultParams = mapOf("temperature" to 0.7),
+            active = true
+        ),
+
+        // ═══════════════════════════════════════════════════════════════════
+        // MINIMAX FAMILY (Ollama cloud variants)
+        // ═══════════════════════════════════════════════════════════════════
+
+        // MiniMax M2.7 - coding, agentic workflows, and professional productivity
+        "minimax-m2.7:cloud" to ModelDefinition(
+            id = "minimax-m2.7:cloud",
+            name = "MiniMax M2.7 (Cloud)",
+            provider = "ollama",
+            description = "MiniMax M2-series model for coding, agentic workflows, and professional productivity, 200K context",
+            capabilities = listOf(
+                ModelCapability.CHAT_COMPLETION,
+                ModelCapability.CODE_COMPLETION,
+                ModelCapability.TOOL_USE
+            ),
+            modelType = ModelType.TEXT,
+            maxContext = 200_000,
+            maxOutputTokens = null,
+            costPer1MInput = 0.0,
+            costPer1MOutput = 0.0,
+            supportsVision = false,
+            supportsReasoning = true,
+            supportsStreaming = true,
+            supportsFunctionCalling = true,
+            defaultParams = mapOf("temperature" to 0.7),
+            active = true
+        ),
+
         // ═══════════════════════════════════════════════════════════════════
         // LIQUID AI FAMILY
         // ═══════════════════════════════════════════════════════════════════
@@ -4180,15 +4487,19 @@ object ModelDefinitions {
     }
 
     /**
-     * Create fallback ModelDefinition for unknown models.
-     * Used when model is available from API but not in our registry.
+     * Create synthetic ModelDefinition for models not yet in registry.
+     *
+     * Semantyka: provider API zwraca model którego nie ma w naszym statycznym rejestrze
+     * (nowy release). Tworzymy best-effort definicję z defaultami — **nie jest to
+     * silent default**, callery muszą zalogować WARN. Zgodne z regułą "no fallbacks":
+     * to nie fallback w execution path, tylko enumeracja zewnętrznych zasobów.
      *
      * @param provider Provider name
      * @param modelId Model identifier
      * @param maxContext Context window size (default: 32768)
-     * @return ModelDefinition with basic configuration
+     * @return ModelDefinition with conservative defaults
      */
-    fun createFallback(
+    fun syntheticDefinitionFor(
         provider: String,
         modelId: String,
         maxContext: Int = DEFAULT_CONTEXT_SIZE
@@ -4197,7 +4508,7 @@ object ModelDefinitions {
             id = modelId,
             name = modelId,
             provider = provider,
-            description = "Unknown model (fallback configuration)",
+            description = "Unknown model (synthetic definition)",
             capabilities = listOf(ModelCapability.CHAT_COMPLETION),
             modelType = ModelType.TEXT,
             maxContext = maxContext,
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/llm/ModelRegistry.kt b/core/src/main/kotlin/pl/jclab/refio/core/llm/ModelRegistry.kt
index 2eae5870..4c426add 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/llm/ModelRegistry.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/llm/ModelRegistry.kt
@@ -7,7 +7,7 @@ import kotlinx.coroutines.sync.Mutex
 import kotlinx.coroutines.sync.withLock
 import kotlinx.coroutines.withTimeoutOrNull
 import pl.jclab.refio.core.llm.adapters.AnthropicAdapter
-import pl.jclab.refio.core.llm.adapters.CustomOpenAIAdapter
+import pl.jclab.refio.core.llm.adapters.GenericOpenAIAdapter
 import pl.jclab.refio.core.llm.adapters.GeminiAdapter
 import pl.jclab.refio.core.llm.adapters.LMStudioAdapter
 import pl.jclab.refio.core.llm.adapters.OllamaAdapter
@@ -129,9 +129,15 @@ fun ModelDefinition.toModelConfig(): ModelConfig {
 private var modelsCache: Map<String, List<ModelConfig>>? = null
 private var cacheTimestamp: Long = 0L
 private const val CACHE_TTL_MS = 300_000L // 5 minutes
-private const val LIST_MODELS_TIMEOUT_MS = 15_000L // 15s per-provider timeout for listing models
+private const val LIST_MODELS_TIMEOUT_MS = 15_000L // 15s per-provider timeout for cloud providers
+private const val LIST_MODELS_TIMEOUT_LOCAL_MS = 3_000L // 3s for local providers (ollama, lmstudio)
 private val modelsCacheMutex = Mutex()
 
+private fun listModelsTimeoutFor(provider: String): Long = when (provider) {
+    "ollama", "lmstudio" -> LIST_MODELS_TIMEOUT_LOCAL_MS
+    else -> LIST_MODELS_TIMEOUT_MS
+}
+
 private fun getCachedModelsIfFresh(now: Long = System.currentTimeMillis()): List<ModelConfig>? {
     val cached = modelsCache ?: return null
     if ((now - cacheTimestamp) >= CACHE_TTL_MS) return null
@@ -179,21 +185,39 @@ fun inferProvider(model: String, default: String = "ollama"): String {
     }
 }
 
+/**
+ * Returns the last cached model snapshot regardless of TTL freshness, or empty if
+ * nothing has ever been fetched. Never triggers remote calls. Use this for read-only
+ * UI listings that should not block on slow providers.
+ */
+fun getCachedModelsSnapshot(): List<ModelConfig> {
+    return modelsCache?.values?.flatten() ?: emptyList()
+}
+
 /**
  * Gets all models from all providers dynamically.
  * Results are cached for 5 minutes to avoid excessive API calls.
  *
  * @param configService Optional ConfigService for API keys (uses env vars as fallback)
+ * @param fetchIfMissing When false, returns the current cache (even if stale or empty)
+ *                      without performing any remote calls. UI screens that just want
+ *                      to display the last known state should pass false.
  * @return List of all available models from all providers
  */
 suspend fun getAllModels(
-    configService: pl.jclab.refio.core.services.ConfigService? = null
+    configService: pl.jclab.refio.core.services.ConfigService? = null,
+    fetchIfMissing: Boolean = true
 ): List<ModelConfig> {
     getCachedModelsIfFresh()?.let {
         GlobalMetrics.recordCacheAccess("model_registry", hit = true)
         return it
     }
 
+    if (!fetchIfMissing) {
+        GlobalMetrics.recordCacheAccess("model_registry", hit = false)
+        return getCachedModelsSnapshot()
+    }
+
     return modelsCacheMutex.withLock {
         getCachedModelsIfFresh()?.let {
             GlobalMetrics.recordCacheAccess("model_registry", hit = true)
@@ -206,13 +230,13 @@ suspend fun getAllModels(
 
         data class ProviderFetch(val name: String, val models: List<ModelConfig>)
 
-        val providerNames = listOf("ollama", "openai", "anthropic", "openrouter", "gemini", "lmstudio", "custom_openai", "zai")
+        val providerNames = listOf("ollama", "openai", "anthropic", "openrouter", "gemini", "lmstudio", "generic_openai", "zai")
 
         val results = coroutineScope {
             providerNames.map { name ->
                 async {
                     try {
-                        val models = withTimeoutOrNull(LIST_MODELS_TIMEOUT_MS) {
+                        val models = withTimeoutOrNull(listModelsTimeoutFor(name)) {
                             when (name) {
                                 "ollama" -> OllamaAdapter(configService = configService).listModels()
                                 "openai" -> OpenAIAdapter(configService = configService).listModels()
@@ -220,9 +244,9 @@ suspend fun getAllModels(
                                 "openrouter" -> OpenRouterAdapter(configService = configService).listModels()
                                 "gemini" -> GeminiAdapter(configService = configService).listModels()
                                 "lmstudio" -> LMStudioAdapter(configService = configService).listModels()
-                                "custom_openai" -> CustomOpenAIAdapter(
+                                "generic_openai" -> GenericOpenAIAdapter(
                                     model = configService?.getTyped(ConfigKeys.PROVIDER_CUSTOM_OPENAI_MODEL) ?: "custom-openai",
-                                    providerName = "custom_openai",
+                                    providerName = "generic_openai",
                                     configService = configService
                                 ).listModels()
                                 "zai" -> ZAIAdapter(
@@ -233,7 +257,7 @@ suspend fun getAllModels(
                             }
                         }
                         if (models == null) {
-                            logger.warn { "[ModelRegistry] Timeout fetching $name models (${LIST_MODELS_TIMEOUT_MS}ms)" }
+                            logger.warn { "[ModelRegistry] Timeout fetching $name models (${listModelsTimeoutFor(name)}ms)" }
                             ProviderFetch(name, emptyList())
                         } else {
                             logger.info { "[ModelRegistry] Fetched ${models.size} models from $name" }
@@ -271,13 +295,14 @@ suspend fun getModelsByProvider(
     // Check cache first
     val now = System.currentTimeMillis()
     if (modelsCache != null && (now - cacheTimestamp) < CACHE_TTL_MS) {
-        return modelsCache!![provider] ?: emptyList()
+        val cached = modelsCache!![provider]
+        if (cached != null) return cached
     }
 
     logger.info { "[ModelRegistry] Fetching models from provider: $provider" }
 
     return try {
-        val models = withTimeoutOrNull(LIST_MODELS_TIMEOUT_MS) {
+        val models = withTimeoutOrNull(listModelsTimeoutFor(provider.lowercase())) {
             when (provider.lowercase()) {
                 "ollama" -> {
                     val adapter = OllamaAdapter(
@@ -305,10 +330,10 @@ suspend fun getModelsByProvider(
                     val adapter = LMStudioAdapter(configService = configService)
                     adapter.listModels()
                 }
-                "custom_openai" -> {
-                    val adapter = CustomOpenAIAdapter(
+                "generic_openai" -> {
+                    val adapter = GenericOpenAIAdapter(
                         model = configService?.getTyped(ConfigKeys.PROVIDER_CUSTOM_OPENAI_MODEL) ?: "custom-openai",
-                        providerName = "custom_openai",
+                        providerName = "generic_openai",
                         configService = configService
                     )
                     adapter.listModels()
@@ -326,12 +351,22 @@ suspend fun getModelsByProvider(
                 }
             }
         }
-        if (models == null) {
-            logger.warn { "[ModelRegistry] Timeout fetching $provider models (${LIST_MODELS_TIMEOUT_MS}ms)" }
+        val fetched = if (models == null) {
+            logger.warn { "[ModelRegistry] Timeout fetching $provider models (${listModelsTimeoutFor(provider.lowercase())}ms)" }
             emptyList()
         } else {
             models
         }
+
+        // Merge result into shared cache so subsequent cache-only reads
+        // (e.g. StatusBar via getModelConfigFromCache) see fresh data.
+        val existing = modelsCache ?: emptyMap()
+        modelsCache = existing + (provider to fetched)
+        if (cacheTimestamp == 0L) {
+            cacheTimestamp = System.currentTimeMillis()
+        }
+
+        fetched
     } catch (e: Exception) {
         logger.error(e) { "[ModelRegistry] Failed to fetch models from $provider: ${e.message}" }
         emptyList()
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/llm/SupportedModels.kt b/core/src/main/kotlin/pl/jclab/refio/core/llm/SupportedModels.kt
index 0ad2322f..a3c36c71 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/llm/SupportedModels.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/llm/SupportedModels.kt
@@ -82,6 +82,8 @@ object SupportedModels {
      */
     private val ANTHROPIC_SUPPORTED = setOf<String>(
         // Opus models
+        "claude-opus-4-7",
+        "anthropic.claude-opus-4-7",
         "claude-opus-4-6",
         "anthropic.claude-opus-4-6-v1",
         "claude-opus-4-5",
@@ -100,6 +102,8 @@ object SupportedModels {
         // Haiku models
         "claude-haiku-4-5-20251001",
         "claude-haiku-4-5",
+        "claude-haiku-4-5@20251001",
+        "anthropic.claude-haiku-4-5-20251001-v1:0",
         "claude-3-5-haiku-20241022",
         "claude-3-5-haiku-latest",
         "claude-3-haiku-20240307",
@@ -141,6 +145,8 @@ object SupportedModels {
      */
     private val GEMINI_SUPPORTED = setOf(
         "gemini-3-pro-preview",
+        "gemini-3-flash",
+        "gemini-3-flash-lite",
         "gemini-2.5-pro",
         "gemini-2.5-flash",
         "gemini-2.5-flash-lite",
@@ -232,7 +238,7 @@ object SupportedModels {
                 supported
             }
 
-            "custom_openai" -> {
+            "generic_openai" -> {
                 val supported = CUSTOM_OPENAI_SUPPORTED.any { it.matches(modelId) }
                 if (!supported) {
                     logger.debug { "[WHITELIST] Custom OpenAI model not supported: $modelId" }
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/AnthropicAdapter.kt b/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/AnthropicAdapter.kt
index 61e9584e..3ee21884 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/AnthropicAdapter.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/AnthropicAdapter.kt
@@ -23,6 +23,7 @@ import io.ktor.http.*
 import io.ktor.serialization.gson.*
 import pl.jclab.refio.core.config.ConfigKeys
 import pl.jclab.refio.core.errors.LLMErrorMapper
+import pl.jclab.refio.core.errors.RefioError
 import pl.jclab.refio.core.security.SecureLogger
 import pl.jclab.refio.core.logging.dualLogger
 import java.util.UUID
@@ -62,33 +63,10 @@ class AnthropicAdapter(
     private val baseUrl: String
         get() = baseUrlOverride ?: DEFAULT_BASE_URL
 
-    private val client = httpClientOverride ?: HttpClient(CIO) {
-        install(ContentNegotiation) {
-            gson {
-                setPrettyPrinting()
-                serializeNulls()
-            }
-        }
-        install(Logging) {
-            level = LogLevel.INFO
-            logger = object : KtorLogger {
-                override fun log(message: String) {
-                    this@AnthropicAdapter.logger.debug { message }
-                }
-            }
-            sanitizeHeader { header ->
-                header.equals(HttpHeaders.Authorization, ignoreCase = true) ||
-                    header.equals("x-api-key", ignoreCase = true) ||
-                    header.equals("x-goog-api-key", ignoreCase = true)
-            }
-        }
-        install(HttpTimeout) {
-            val timeoutMs = configService?.getTyped(ConfigKeys.API_CALL_TIMEOUT, taskId)?.toLong()?.times(1000L)
-                ?: ConfigKeys.API_CALL_TIMEOUT.default.toLong() * 1000L
-            requestTimeoutMillis = timeoutMs
-            connectTimeoutMillis = 30000
-            socketTimeoutMillis = timeoutMs
-        }
+    private val client = httpClientOverride ?: run {
+        val socketTimeoutMs = configService?.getTyped(ConfigKeys.API_CALL_TIMEOUT, taskId)?.toLong()?.times(1000L)
+            ?: ConfigKeys.API_CALL_TIMEOUT.default.toLong() * 1000L
+        LLMKtorClientFactory.create(socketTimeoutMs, logger)
     }
 
     override suspend fun chat(
@@ -124,7 +102,7 @@ class AnthropicAdapter(
     ): LLMResponse {
         // Get API key from ConfigService (single source of truth)
         val apiKeyToUse = configService?.get(
-            key = pl.jclab.refio.core.services.ConfigService.KEY_PROVIDER_ANTHROPIC_API_KEY,
+            key = ConfigKeys.PROVIDER_ANTHROPIC_API_KEY.key,
             scope = pl.jclab.refio.core.db.ConfigScope.APP
         )
             ?: System.getProperty("ANTHROPIC_API_KEY")
@@ -343,6 +321,14 @@ class AnthropicAdapter(
             )
 
             // Parse response (handle content blocks including thinking)
+            if (response["content"] !is List<*>) {
+                throw RefioError.MalformedResponse(
+                    provider = provider,
+                    model = model,
+                    reason = "Missing or non-list 'content' in Anthropic response",
+                    bodyPreview = gson.toJson(response)
+                )
+            }
             @Suppress("UNCHECKED_CAST")
             val contentBlocks = response["content"] as? List<Map<String, Any?>> ?: emptyList()
 
@@ -743,7 +729,7 @@ class AnthropicAdapter(
         try {
             // Get API key from ConfigService (single source of truth)
             val apiKeyToUse = configService?.get(
-                key = pl.jclab.refio.core.services.ConfigService.KEY_PROVIDER_ANTHROPIC_API_KEY,
+                key = ConfigKeys.PROVIDER_ANTHROPIC_API_KEY.key,
                 scope = pl.jclab.refio.core.db.ConfigScope.APP
             )
                 ?: System.getProperty("ANTHROPIC_API_KEY")
@@ -779,12 +765,14 @@ class AnthropicAdapter(
                     return@mapNotNull null
                 }
 
-                // Get static definition from ModelDefinitions or create fallback
+                // Get static definition from ModelDefinitions or synthesize for unknown models.
                 val definition = pl.jclab.refio.core.llm.ModelDefinitions.getDefinition("anthropic", modelId)
                     ?: run {
-                        logger.debug { "[ANTHROPIC] Model $modelId not in registry, using fallback" }
+                        logger.warn {
+                            "[ANTHROPIC] Model $modelId not in registry — using synthetic definition (context=200000)"
+                        }
 
-                        pl.jclab.refio.core.llm.ModelDefinitions.createFallback(
+                        pl.jclab.refio.core.llm.ModelDefinitions.syntheticDefinitionFor(
                             provider = "anthropic",
                             modelId = modelId,
                             maxContext = 200_000  // Claude models typically have 200K context
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/CustomOpenAIAdapter.kt b/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/CustomOpenAIAdapter.kt
deleted file mode 100644
index 71d0f5da..00000000
--- a/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/CustomOpenAIAdapter.kt
+++ /dev/null
@@ -1,664 +0,0 @@
-package pl.jclab.refio.core.llm.adapters
-
-import io.ktor.client.HttpClient
-import io.ktor.client.call.body
-import io.ktor.client.engine.cio.CIO
-import io.ktor.client.plugins.HttpTimeout
-import io.ktor.client.plugins.HttpRequestTimeoutException
-import io.ktor.client.plugins.contentnegotiation.ContentNegotiation
-import io.ktor.client.plugins.logging.LogLevel
-import io.ktor.client.plugins.logging.Logger as KtorLogger
-import io.ktor.client.plugins.logging.Logging
-import io.ktor.client.request.get
-import io.ktor.client.request.header
-import io.ktor.client.request.post
-import io.ktor.client.request.preparePost
-import io.ktor.client.request.setBody
-import io.ktor.http.ContentType
-import io.ktor.http.HttpHeaders
-import io.ktor.http.contentType
-import io.ktor.serialization.gson.gson
-import kotlinx.coroutines.CancellationException
-import kotlinx.coroutines.Dispatchers
-import kotlinx.coroutines.delay
-import kotlinx.coroutines.withContext
-import kotlinx.coroutines.sync.Mutex
-import kotlinx.coroutines.sync.withLock
-import pl.jclab.refio.core.config.ConfigKeys
-import pl.jclab.refio.core.errors.RefioError
-import pl.jclab.refio.core.llm.BaseLLMAdapter
-import pl.jclab.refio.core.llm.LLMMessage
-import pl.jclab.refio.core.llm.LLMResponse
-import pl.jclab.refio.core.llm.LLMUsage
-import pl.jclab.refio.core.llm.ModelConfig
-import pl.jclab.refio.core.llm.ModelDefinitions
-import pl.jclab.refio.core.llm.StreamChunk
-import pl.jclab.refio.core.llm.toModelConfig
-import pl.jclab.refio.core.security.SecureLogger
-import pl.jclab.refio.core.services.ConfigService
-import pl.jclab.refio.core.services.ConfigService.Companion.DEFAULT_CONTEXT_SIZE
-import pl.jclab.refio.core.utils.GsonInstance.gson
-import pl.jclab.refio.core.logging.dualLogger
-import java.util.UUID
-
-open class CustomOpenAIAdapter(
-    model: String,
-    private val providerName: String = "custom_openai",
-    private val configService: ConfigService? = null,
-    private val taskId: String? = null,
-    private val subtaskId: String? = null,
-    private val source: String? = null,
-    private val baseUrlOverride: String? = null,
-    private val apiKeyOverride: String? = null,
-    private val requireApiKey: Boolean = false,
-    private val defaultBaseUrl: String? = null
-) : BaseLLMAdapter(model, providerName) {
-
-    companion object {
-        private const val CHAT_ENDPOINT = "/chat/completions"
-        private const val MODELS_ENDPOINT = "/models"
-        private const val ZAI_COOLDOWN_MS = 5_000L
-        private const val ZAI_RATE_LIMIT_RETRY_DELAY_MS = 15_000L
-        private val zaiRequestMutex = Mutex()
-        private var zaiNextAllowedAtMs: Long = 0L
-    }
-
-    private val logger = dualLogger("CustomOpenAIAdapter")
-
-    private val client = HttpClient(CIO) {
-        install(ContentNegotiation) {
-            gson {
-                setPrettyPrinting()
-                serializeNulls()
-            }
-        }
-        install(Logging) {
-            level = LogLevel.INFO
-            logger = object : KtorLogger {
-                override fun log(message: String) {
-                    this@CustomOpenAIAdapter.logger.debug { message }
-                }
-            }
-            sanitizeHeader { header ->
-                header.equals(HttpHeaders.Authorization, ignoreCase = true)
-            }
-        }
-        install(HttpTimeout) {
-            val timeoutMs = configService?.getTyped(ConfigKeys.API_CALL_TIMEOUT, taskId)?.toLong()?.times(1000L)
-                ?: ConfigKeys.API_CALL_TIMEOUT.default.toLong() * 1000L
-            requestTimeoutMillis = timeoutMs
-            connectTimeoutMillis = 30_000L
-            socketTimeoutMillis = timeoutMs
-        }
-    }
-
-    private fun resolveBaseUrl(): String {
-        val configured = baseUrlOverride?.takeIf { it.isNotBlank() }
-            ?: when (providerName) {
-                "zai" -> configService?.getTyped(ConfigKeys.PROVIDER_ZAI_BASE_URL)
-                else -> configService?.getTyped(ConfigKeys.PROVIDER_CUSTOM_OPENAI_BASE_URL)
-            }
-            ?: when (providerName) {
-                "zai" -> System.getProperty("ZAI_BASE_URL") ?: System.getenv("ZAI_BASE_URL")
-                else -> System.getProperty("CUSTOM_OPENAI_BASE_URL") ?: System.getenv("CUSTOM_OPENAI_BASE_URL")
-            }
-            ?: defaultBaseUrl
-
-        return configured?.trimEnd('/')
-            ?: throw RefioError.ProviderNotConfigured(providerName, "base_url")
-    }
-
-    private fun resolveApiKey(): String? {
-        val key = apiKeyOverride?.takeIf { it.isNotBlank() }
-            ?: when (providerName) {
-                "zai" -> configService?.getTyped(ConfigKeys.PROVIDER_ZAI_API_KEY)
-                else -> configService?.getTyped(ConfigKeys.PROVIDER_CUSTOM_OPENAI_API_KEY)
-            }
-            ?: when (providerName) {
-                "zai" -> System.getProperty("ZAI_API_KEY") ?: System.getenv("ZAI_API_KEY")
-                else -> System.getProperty("CUSTOM_OPENAI_API_KEY") ?: System.getenv("CUSTOM_OPENAI_API_KEY")
-            }
-
-        if (requireApiKey && key.isNullOrBlank()) {
-            throw RefioError.ProviderNotConfigured(providerName, "api_key")
-        }
-        return key
-    }
-
-    override suspend fun chat(
-        messages: List<LLMMessage>,
-        systemMessages: List<String>,
-        maxTokens: Int?,
-        temperature: Double,
-        streaming: Boolean,
-        onStreamChunk: ((StreamChunk) -> Unit)?,
-        kwargs: Map<String, Any>
-    ): LLMResponse {
-        val baseUrl = resolveBaseUrl()
-        val apiKey = resolveApiKey()
-        val requestMessages = buildList<Map<String, Any>> {
-            systemMessages.filter { it.isNotBlank() }.forEach { add(mapOf("role" to "system", "content" to it)) }
-            // Remap "tool" (used by LLMMessageMapper for tool results) to "assistant" — OpenAI-compatible
-            // APIs require tool_call_id alongside role="tool", which this adapter does not currently emit.
-            messages.filter { it.role != "system" }.forEach {
-                val mappedRole = if (it.role == "tool") "assistant" else it.role
-                add(mapOf("role" to mappedRole, "content" to toOpenAiMessageContent(it)))
-            }
-        }
-        val maxOutputLimit = configService?.getTyped(ConfigKeys.MAX_OUTPUT_SIZE, taskId) ?: ConfigKeys.MAX_OUTPUT_SIZE.default
-        val effectiveMaxTokens = when {
-            maxTokens != null && maxTokens > 0 -> minOf(maxTokens, maxOutputLimit)
-            else -> maxOutputLimit
-        }
-        val requestId = UUID.randomUUID().toString()
-        val requestBody = buildMap<String, Any> {
-            put("request_id", requestId)
-            put("model", model)
-            put("messages", requestMessages)
-            put("temperature", temperature)
-            put("max_tokens", effectiveMaxTokens)
-            if (streaming) put("stream", true)
-            (kwargs["top_p"] as? Number)?.let { put("top_p", it) }
-            (kwargs["frequency_penalty"] as? Number)?.let { put("frequency_penalty", it) }
-            (kwargs["presence_penalty"] as? Number)?.let { put("presence_penalty", it) }
-            kwargs["stop"]?.let { put("stop", it) }
-            kwargs["response_format"]?.let { put("response_format", it) }
-        }
-        val requestJson = gson.toJson(requestBody)
-        val startTime = System.currentTimeMillis()
-        val logPrefix = "[${providerName.uppercase()}][$requestId]"
-
-        return try {
-            if (streaming && onStreamChunk != null) {
-                executeStreaming(baseUrl, apiKey, requestBody, requestJson, startTime, onStreamChunk, logPrefix)
-            } else {
-                executeStandard(baseUrl, apiKey, requestBody, requestJson, startTime, logPrefix)
-            }
-        } catch (e: HttpRequestTimeoutException) {
-            throw RefioError.LLMTimeout(providerName, model, configService?.getTyped(ConfigKeys.API_CALL_TIMEOUT, taskId)?.toLong()?.times(1000L) ?: 0L, e)
-        }
-    }
-
-    private suspend fun executeStandard(
-        baseUrl: String,
-        apiKey: String?,
-        requestBody: Map<String, Any>,
-        requestJson: String,
-        startTime: Long,
-        logPrefix: String
-    ): LLMResponse {
-        var httpStatus: Int? = null
-
-        try {
-            logger.info { "$logPrefix Request start: endpoint=$baseUrl$CHAT_ENDPOINT, body=${SecureLogger.redactAndTruncate(requestJson)}" }
-            val response = executeWithZaiRateLimitRetry("$baseUrl$CHAT_ENDPOINT") {
-                withProviderRateLimit("$baseUrl$CHAT_ENDPOINT") {
-                    client.post("$baseUrl$CHAT_ENDPOINT") {
-                        contentType(ContentType.Application.Json)
-                        apiKey?.let { header(HttpHeaders.Authorization, "Bearer $it") }
-                        setBody(requestBody)
-                    }
-                }
-            }
-
-            httpStatus = response.status.value
-            val rawResponse: Map<String, Any?> = response.body()
-            ensureSuccess(httpStatus, rawResponse, baseUrl)
-
-            val usage = extractUsage(rawResponse)
-            @Suppress("UNCHECKED_CAST")
-            val choices = rawResponse["choices"] as? List<Map<String, Any?>> ?: emptyList()
-            val firstChoice = choices.firstOrNull() ?: emptyMap()
-            @Suppress("UNCHECKED_CAST")
-            val message = firstChoice["message"] as? Map<String, Any?> ?: emptyMap()
-            val content = message["content"] as? String ?: ""
-            val normalizedToolCallsJson = if (content.isBlank()) {
-                ToolCallContentNormalizer.fromOpenAiToolCalls(message["tool_calls"])
-            } else {
-                null
-            }
-
-            logger.apiResponse(
-                provider = provider,
-                model = model,
-                endpoint = "$baseUrl$CHAT_ENDPOINT",
-                requestJson = requestJson,
-                responseJson = gson.toJson(rawResponse),
-                httpStatus = httpStatus,
-                inputTokens = usage.inputTokens,
-                outputTokens = usage.outputTokens,
-                costUsd = 0.0,
-                latencyMs = (System.currentTimeMillis() - startTime).toInt(),
-                taskId = taskId,
-                subtaskId = subtaskId,
-                source = source
-            )
-
-            return LLMResponse(
-                content = normalizedToolCallsJson ?: content,
-                usage = usage,
-                model = model,
-                provider = provider,
-                cost = 0.0,
-                finishReason = firstChoice["finish_reason"] as? String,
-                rawResponse = rawResponse
-            )
-        } catch (e: RefioError) {
-            logger.apiError(
-                provider = provider,
-                model = model,
-                endpoint = "$baseUrl$CHAT_ENDPOINT",
-                requestJson = requestJson,
-                httpStatus = httpStatus,
-                error = e,
-                latencyMs = (System.currentTimeMillis() - startTime).toInt(),
-                taskId = taskId,
-                subtaskId = subtaskId,
-                source = source
-            )
-            throw e
-        } catch (e: Exception) {
-            logger.apiError(
-                provider = provider,
-                model = model,
-                endpoint = "$baseUrl$CHAT_ENDPOINT",
-                requestJson = requestJson,
-                httpStatus = httpStatus,
-                error = e,
-                latencyMs = (System.currentTimeMillis() - startTime).toInt(),
-                taskId = taskId,
-                subtaskId = subtaskId,
-                source = source
-            )
-            throw RefioError.LLMError(providerName, model, e)
-        }
-    }
-
-    private suspend fun executeStreaming(
-        baseUrl: String,
-        apiKey: String?,
-        requestBody: Map<String, Any>,
-        requestJson: String,
-        startTime: Long,
-        onStreamChunk: (StreamChunk) -> Unit,
-        logPrefix: String
-    ): LLMResponse {
-        val contentBuilder = StringBuilder()
-        val toolCallAccumulator = ToolCallContentNormalizer.OpenAiStreamingToolCallAccumulator()
-        var httpStatus: Int? = null
-        var finalFinishReason: String? = null
-
-        try {
-            logger.info { "$logPrefix Request start: endpoint=$baseUrl$CHAT_ENDPOINT, body=${SecureLogger.redactAndTruncate(requestJson)}" }
-            executeWithZaiRateLimitRetry("$baseUrl$CHAT_ENDPOINT") {
-                withProviderRateLimit("$baseUrl$CHAT_ENDPOINT") {
-                    client.preparePost("$baseUrl$CHAT_ENDPOINT") {
-                        contentType(ContentType.Application.Json)
-                        apiKey?.let { header(HttpHeaders.Authorization, "Bearer $it") }
-                        setBody(requestBody)
-                    }.execute { httpResponse ->
-                        httpStatus = httpResponse.status.value
-                        if (httpStatus !in 200..299) {
-                            val errorBody = httpResponse.body<String>()
-                            throw mapHttpError(httpStatus ?: 500, errorBody)
-                        }
-
-                        val channel: io.ktor.utils.io.ByteReadChannel = httpResponse.body()
-                        while (!channel.isClosedForRead) {
-                            val line = channel.readUTF8Line(limit = Int.MAX_VALUE) ?: continue
-                            if (line.isBlank() || !line.startsWith("data: ")) continue
-
-                            val data = line.removePrefix("data: ").trim()
-                            if (data == "[DONE]") break
-
-                            try {
-                                @Suppress("UNCHECKED_CAST")
-                                val chunk = gson.fromJson(data, Map::class.java) as Map<String, Any?>
-                                @Suppress("UNCHECKED_CAST")
-                                val choices = chunk["choices"] as? List<Map<String, Any?>> ?: emptyList()
-                                val first = choices.firstOrNull() ?: emptyMap()
-                                @Suppress("UNCHECKED_CAST")
-                                val delta = first["delta"] as? Map<String, Any?>
-                                toolCallAccumulator.consumeDelta(delta)
-                                val content = delta?.get("content") as? String
-                                if (!content.isNullOrEmpty()) {
-                                    contentBuilder.append(content)
-                                    onStreamChunk(StreamChunk(delta = content))
-                                }
-                                finalFinishReason = first["finish_reason"] as? String ?: finalFinishReason
-                            } catch (e: CancellationException) {
-                                // Let stream abort (guardrail trip) propagate out of the loop.
-                                // NOTE: replaced an earlier `runCatching { }` here — runCatching
-                                // swallowed CancellationException into a Result and the abort was lost.
-                                throw e
-                            } catch (_: Exception) {
-                                // Match previous behavior of silently skipping malformed chunks.
-                            }
-                        }
-                    }
-                }
-            }
-
-            if (contentBuilder.isEmpty()) {
-                toolCallAccumulator.toCanonicalJson()?.let { contentBuilder.append(it) }
-            }
-
-            @Suppress("UNCHECKED_CAST")
-            val inputTokensEstimate = (requestBody["messages"] as? List<Map<String, Any?>>)?.sumOf {
-                when (val content = it["content"]) {
-                    is String -> content.length
-                    is List<*> -> content.sumOf { part ->
-                        @Suppress("UNCHECKED_CAST")
-                        val partMap = part as? Map<String, Any?>
-                        (partMap?.get("text") as? String)?.length ?: 0
-                    }
-                    else -> 0
-                }
-            } ?: 0
-            val usage = LLMUsage(
-                inputTokens = inputTokensEstimate,
-                outputTokens = contentBuilder.length / 4,
-                totalTokens = inputTokensEstimate + contentBuilder.length / 4
-            )
-            onStreamChunk(StreamChunk(delta = "", finishReason = finalFinishReason, usage = usage))
-
-            logger.apiResponse(
-                provider = provider,
-                model = model,
-                endpoint = "$baseUrl$CHAT_ENDPOINT",
-                requestJson = requestJson,
-                responseJson = gson.toJson(mapOf("content" to contentBuilder.toString())),
-                httpStatus = httpStatus ?: 200,
-                inputTokens = usage.inputTokens,
-                outputTokens = usage.outputTokens,
-                costUsd = 0.0,
-                latencyMs = (System.currentTimeMillis() - startTime).toInt(),
-                taskId = taskId,
-                subtaskId = subtaskId,
-                source = source
-            )
-
-            return LLMResponse(
-                content = contentBuilder.toString(),
-                usage = usage,
-                model = model,
-                provider = provider,
-                cost = 0.0,
-                finishReason = finalFinishReason,
-                rawResponse = mapOf("content" to contentBuilder.toString())
-            )
-        } catch (e: RefioError) {
-            logger.apiError(
-                provider = provider,
-                model = model,
-                endpoint = "$baseUrl$CHAT_ENDPOINT",
-                requestJson = requestJson,
-                httpStatus = httpStatus,
-                error = e,
-                latencyMs = (System.currentTimeMillis() - startTime).toInt(),
-                taskId = taskId,
-                subtaskId = subtaskId,
-                source = source
-            )
-            throw e
-        } catch (e: CancellationException) {
-            // Guardrail-triggered abort — log and rethrow as-is, do NOT wrap.
-            logger.apiError(
-                provider = provider,
-                model = model,
-                endpoint = "$baseUrl$CHAT_ENDPOINT",
-                requestJson = requestJson,
-                httpStatus = httpStatus,
-                error = e,
-                latencyMs = (System.currentTimeMillis() - startTime).toInt(),
-                taskId = taskId,
-                subtaskId = subtaskId,
-                source = source
-            )
-            throw e
-        } catch (e: Exception) {
-            logger.apiError(
-                provider = provider,
-                model = model,
-                endpoint = "$baseUrl$CHAT_ENDPOINT",
-                requestJson = requestJson,
-                httpStatus = httpStatus,
-                error = e,
-                latencyMs = (System.currentTimeMillis() - startTime).toInt(),
-                taskId = taskId,
-                subtaskId = subtaskId,
-                source = source
-            )
-            throw RefioError.LLMError(providerName, model, e)
-        }
-    }
-
-    open suspend fun listModels(): List<ModelConfig> = withContext(Dispatchers.IO) {
-        val baseUrl = resolveBaseUrl()
-        val apiKey = resolveApiKey()
-        val startTime = System.currentTimeMillis()
-
-        try {
-            logger.info { "[${providerName.uppercase()}] Request start: endpoint=$baseUrl$MODELS_ENDPOINT" }
-            val response = withProviderRateLimit("$baseUrl$MODELS_ENDPOINT") {
-                client.get("$baseUrl$MODELS_ENDPOINT") {
-                    apiKey?.let { header(HttpHeaders.Authorization, "Bearer $it") }
-                }
-            }
-            val rawBody = response.body<String>()
-
-            if (response.status.value !in 200..299) {
-                throw mapHttpError(response.status.value, rawBody)
-            }
-
-            parseModelsPayload(rawBody)
-        } catch (e: RefioError) {
-            logger.apiError(
-                provider = provider,
-                model = "models",
-                endpoint = "$baseUrl$MODELS_ENDPOINT",
-                requestJson = "",
-                httpStatus = null,
-                error = e,
-                latencyMs = (System.currentTimeMillis() - startTime).toInt(),
-                taskId = taskId,
-                subtaskId = subtaskId,
-                source = source
-            )
-            throw e
-        } catch (e: Exception) {
-            logger.apiError(
-                provider = provider,
-                model = "models",
-                endpoint = "$baseUrl$MODELS_ENDPOINT",
-                requestJson = "",
-                httpStatus = null,
-                error = e,
-                latencyMs = (System.currentTimeMillis() - startTime).toInt(),
-                taskId = taskId,
-                subtaskId = subtaskId,
-                source = source
-            )
-            throw RefioError.LLMError(providerName, model, e)
-        }
-    }
-
-    internal fun parseModelsPayload(rawBody: String): List<ModelConfig> {
-        val parsed = gson.fromJson(rawBody, Any::class.java)
-        val modelsData: List<*> = when (parsed) {
-            is Map<*, *> -> parsed["data"] as? List<*> ?: emptyList<Any?>()
-            is List<*> -> parsed
-            else -> emptyList<Any?>()
-        }
-
-        return modelsData.mapNotNull { item ->
-            val modelData = item as? Map<*, *> ?: return@mapNotNull null
-            val modelId = modelData["id"] as? String ?: return@mapNotNull null
-            val contextLength = (modelData["context_length"] as? Number)?.toInt() ?: DEFAULT_CONTEXT_SIZE
-            val definition = ModelDefinitions.getDefinition(providerName, modelId)
-                ?: ModelDefinitions.createFallback(providerName, modelId, contextLength)
-            definition.toModelConfig()
-        }
-    }
-
-    override fun estimateCost(usage: LLMUsage): Double = 0.0
-
-    override suspend fun close() {
-        client.close()
-    }
-
-    private fun extractUsage(rawResponse: Map<String, Any?>): LLMUsage {
-        @Suppress("UNCHECKED_CAST")
-        val usageMap = rawResponse["usage"] as? Map<String, Any?> ?: emptyMap()
-        val promptTokens = (usageMap["prompt_tokens"] as? Number)?.toInt() ?: 0
-        val completionTokens = (usageMap["completion_tokens"] as? Number)?.toInt() ?: 0
-        val totalTokens = (usageMap["total_tokens"] as? Number)?.toInt() ?: promptTokens + completionTokens
-        return LLMUsage(promptTokens, completionTokens, totalTokens)
-    }
-
-    @Suppress("UNUSED_PARAMETER")
-    private fun ensureSuccess(httpStatus: Int, rawResponse: Map<String, Any?>, _baseUrl: String) {
-        if (httpStatus in 200..299) return
-
-        val message = (rawResponse["error"] as? Map<*, *>)?.get("message") as? String
-            ?: "OpenAI-compatible API error (HTTP $httpStatus)"
-        val code = (rawResponse["error"] as? Map<*, *>)?.get("code")?.toString()
-        throw mapHttpError(httpStatus, message, code)
-    }
-
-    private fun mapHttpError(httpStatus: Int, message: String): RefioError {
-        val parsed = parseProviderError(message)
-        return mapHttpError(
-            httpStatus = httpStatus,
-            message = parsed.message ?: message,
-            businessCode = parsed.code
-        )
-    }
-
-    private fun mapHttpError(httpStatus: Int, message: String, businessCode: String?): RefioError {
-        val zaiMessage = if (providerName == "zai") {
-            buildZAIErrorMessage(httpStatus, businessCode, message)
-        } else {
-            message
-        }
-
-        return when (httpStatus) {
-            401, 403 -> RefioError.LLMAuthentication(providerName, model, IllegalStateException(zaiMessage))
-            429 -> RefioError.LLMRateLimit(providerName, null, IllegalStateException(zaiMessage))
-            434 -> RefioError.LLMAuthentication(providerName, model, IllegalStateException(zaiMessage))
-            else -> RefioError.LLMError(providerName, model, IllegalStateException(zaiMessage))
-        }
-    }
-
-    internal data class ProviderErrorPayload(
-        val code: String? = null,
-        val message: String? = null
-    )
-
-    internal fun parseProviderError(rawBody: String): ProviderErrorPayload {
-        return runCatching {
-            val parsed = gson.fromJson(rawBody, Map::class.java)
-            val error = parsed?.get("error") as? Map<*, *>
-            ProviderErrorPayload(
-                code = error?.get("code")?.toString(),
-                message = error?.get("message")?.toString()
-            )
-        }.getOrDefault(ProviderErrorPayload(message = rawBody))
-    }
-
-    internal fun buildZAIErrorMessage(httpStatus: Int, businessCode: String?, message: String): String {
-        val normalized = businessCode?.trim()
-        val detail = when (normalized) {
-            "1000", "1001", "1002", "1003", "1004" -> "Authentication failed or token expired"
-            "1110" -> "Account is inactive"
-            "1111" -> "Account does not exist"
-            "1112", "1121" -> "Account has been locked"
-            "1113" -> "Account balance exhausted"
-            "1120" -> "Unable to access account temporarily"
-            "1210", "1213", "1214", "1215" -> "Invalid request parameters"
-            "1211" -> "Model does not exist"
-            "1212" -> "Model does not support this API method"
-            "1220" -> "No permission to access this API"
-            "1221" -> "API has been taken offline"
-            "1222" -> "API does not exist"
-            "1230" -> "API call process error"
-            "1231" -> "An identical request is already in progress"
-            "1234" -> "Network error on provider side"
-            "1301" -> "Request blocked by safety policy"
-            "1302" -> "API concurrency limit exceeded"
-            "1303" -> "API frequency limit exceeded"
-            "1304" -> "Daily API call limit reached"
-            "1305" -> "API rate limit triggered"
-            "1308" -> "Usage limit reached"
-            "1309" -> "GLM Coding Plan expired"
-            "1310" -> "Weekly or monthly limit exhausted"
-            else -> when (httpStatus) {
-                401, 403 -> "Authentication failure or token timeout"
-                429 -> "Rate limit or account quota restriction"
-                434 -> "No API permission"
-                else -> null
-            }
-        }
-
-        return buildString {
-            if (!normalized.isNullOrBlank()) {
-                append("Z.AI error ")
-                append(normalized)
-                append(": ")
-            }
-            if (!detail.isNullOrBlank()) {
-                append(detail)
-                if (message.isNotBlank() && !message.contains(detail, ignoreCase = true)) {
-                    append(". ")
-                }
-            }
-            if (message.isNotBlank()) {
-                append(message)
-            } else if (detail.isNullOrBlank()) {
-                append("HTTP ")
-                append(httpStatus)
-            }
-        }
-    }
-
-    private suspend fun <T> withProviderRateLimit(endpoint: String, block: suspend () -> T): T {
-        if (providerName != "zai") {
-            return block()
-        }
-
-        return zaiRequestMutex.withLock {
-            val now = System.currentTimeMillis()
-            val waitMs = (zaiNextAllowedAtMs - now).coerceAtLeast(0L)
-            if (waitMs > 0) {
-                logger.info { "[ZAI] Waiting ${waitMs}ms before next request: $endpoint" }
-                delay(waitMs)
-            }
-
-            try {
-                block()
-            } finally {
-                zaiNextAllowedAtMs = System.currentTimeMillis() + ZAI_COOLDOWN_MS
-            }
-        }
-    }
-
-    private suspend fun <T> executeWithZaiRateLimitRetry(endpoint: String, block: suspend () -> T): T {
-        if (providerName != "zai") {
-            return block()
-        }
-
-        return try {
-            block()
-        } catch (e: RefioError.LLMRateLimit) {
-            logger.warn {
-                "[ZAI] Rate limit hit for $endpoint. Waiting ${ZAI_RATE_LIMIT_RETRY_DELAY_MS}ms before retry"
-            }
-            zaiNextAllowedAtMs = maxOf(
-                zaiNextAllowedAtMs,
-                System.currentTimeMillis() + ZAI_RATE_LIMIT_RETRY_DELAY_MS
-            )
-            delay(ZAI_RATE_LIMIT_RETRY_DELAY_MS)
-            block()
-        }
-    }
-}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/GeminiAdapter.kt b/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/GeminiAdapter.kt
index 3096af22..84575323 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/GeminiAdapter.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/GeminiAdapter.kt
@@ -27,6 +27,7 @@ import kotlinx.coroutines.withContext
 import kotlinx.coroutines.Dispatchers
 import io.ktor.utils.io.ByteReadChannel
 import pl.jclab.refio.core.errors.LLMErrorMapper
+import pl.jclab.refio.core.errors.RefioError
 import java.util.UUID
 
 /**
@@ -60,33 +61,10 @@ class GeminiAdapter(
 
     private val ownsHttpClient = httpClientOverride == null
 
-    private val client = httpClientOverride ?: HttpClient(CIO) {
-        install(ContentNegotiation) {
-            gson {
-                setPrettyPrinting()
-                serializeNulls()
-            }
-        }
-        install(Logging) {
-            level = LogLevel.INFO
-            logger = object : KtorLogger {
-                override fun log(message: String) {
-                    this@GeminiAdapter.logger.debug { message }
-                }
-            }
-            sanitizeHeader { header ->
-                header.equals(HttpHeaders.Authorization, ignoreCase = true) ||
-                    header.equals("x-api-key", ignoreCase = true) ||
-                    header.equals("x-goog-api-key", ignoreCase = true)
-            }
-        }
-        install(HttpTimeout) {
-            val timeoutMs = configService?.getTyped(ConfigKeys.API_CALL_TIMEOUT, taskId)?.toLong()?.times(1000L)
-                ?: ConfigKeys.API_CALL_TIMEOUT.default.toLong() * 1000L
-            requestTimeoutMillis = timeoutMs
-            connectTimeoutMillis = 30_000
-            socketTimeoutMillis = timeoutMs
-        }
+    private val client = httpClientOverride ?: run {
+        val socketTimeoutMs = configService?.getTyped(ConfigKeys.API_CALL_TIMEOUT, taskId)?.toLong()?.times(1000L)
+            ?: ConfigKeys.API_CALL_TIMEOUT.default.toLong() * 1000L
+        LLMKtorClientFactory.create(socketTimeoutMs, logger)
     }
 
     override suspend fun chat(
@@ -99,7 +77,7 @@ class GeminiAdapter(
         kwargs: Map<String, Any>
     ): LLMResponse {
         val apiKey = configService?.get(
-            key = pl.jclab.refio.core.services.ConfigService.KEY_PROVIDER_GEMINI_API_KEY,
+            key = ConfigKeys.PROVIDER_GEMINI_API_KEY.key,
             scope = pl.jclab.refio.core.db.ConfigScope.APP
         )
             ?: System.getProperty("GEMINI_API_KEY")
@@ -260,6 +238,14 @@ class GeminiAdapter(
                 throw LLMErrorMapper.fromHttpStatus(provider, model, httpStatus, "Gemini API error (HTTP $httpStatus)")
             }
 
+            if (rawResponse["candidates"] !is List<*>) {
+                throw RefioError.MalformedResponse(
+                    provider = provider,
+                    model = model,
+                    reason = "Missing or non-list 'candidates' in Gemini response",
+                    bodyPreview = gson.toJson(rawResponse)
+                )
+            }
             val usage = extractUsage(rawResponse)
             val cost = estimateCost(usage)
             val content = extractContent(rawResponse)
@@ -543,7 +529,7 @@ class GeminiAdapter(
      */
     suspend fun listModels(): List<ModelConfig> = withContext(Dispatchers.IO) {
         val apiKey = configService?.get(
-            key = pl.jclab.refio.core.services.ConfigService.KEY_PROVIDER_GEMINI_API_KEY,
+            key = ConfigKeys.PROVIDER_GEMINI_API_KEY.key,
             scope = pl.jclab.refio.core.db.ConfigScope.APP
         )
             ?: System.getProperty("GEMINI_API_KEY")
@@ -566,7 +552,12 @@ class GeminiAdapter(
                 }
 
                 val definition = pl.jclab.refio.core.llm.ModelDefinitions.getDefinition("gemini", shortId)
-                    ?: pl.jclab.refio.core.llm.ModelDefinitions.createFallback("gemini", shortId)
+                    ?: run {
+                        logger.warn {
+                            "[GEMINI] Model $shortId not in registry — using synthetic definition with defaults"
+                        }
+                        pl.jclab.refio.core.llm.ModelDefinitions.syntheticDefinitionFor("gemini", shortId)
+                    }
                 definition.toModelConfig()
             }
         } catch (e: Exception) {
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/GenericOpenAIAdapter.kt b/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/GenericOpenAIAdapter.kt
new file mode 100644
index 00000000..9a894047
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/GenericOpenAIAdapter.kt
@@ -0,0 +1,189 @@
+package pl.jclab.refio.core.llm.adapters
+
+import io.ktor.client.HttpClient
+import kotlinx.coroutines.delay
+import kotlinx.coroutines.sync.Mutex
+import kotlinx.coroutines.sync.withLock
+import pl.jclab.refio.core.config.ConfigKeys
+import pl.jclab.refio.core.errors.RefioError
+import pl.jclab.refio.core.services.ConfigService
+
+/**
+ * Open, generic OpenAI-compatible chat adapter for:
+ * - `provider: custom_openai` (any OpenAI-compatible endpoint),
+ * - `provider: zai` via the [ZAIAdapter] subclass (adds rate-limit serialization).
+ *
+ * Most logic lives in [OpenAICompatibleAdapter]; this class only supplies the
+ * per-provider config keys, enforces `api_key` when required, and (for `zai`)
+ * serializes requests through a mutex with retry after HTTP 429.
+ */
+open class GenericOpenAIAdapter(
+    model: String,
+    providerName: String = "generic_openai",
+    configService: ConfigService? = null,
+    taskId: String? = null,
+    subtaskId: String? = null,
+    source: String? = null,
+    private val baseUrlOverride: String? = null,
+    private val apiKeyOverride: String? = null,
+    requireApiKey: Boolean = false,
+    private val defaultBaseUrl: String? = null,
+    httpClientOverride: HttpClient? = null,
+) : OpenAICompatibleAdapter(
+    model = model,
+    providerName = providerName,
+    configService = configService,
+    taskId = taskId,
+    subtaskId = subtaskId,
+    source = source,
+    requireApiKey = requireApiKey,
+    httpClientOverride = httpClientOverride,
+) {
+
+    companion object {
+        private const val ZAI_COOLDOWN_MS = 5_000L
+        private const val ZAI_RATE_LIMIT_RETRY_DELAY_MS = 15_000L
+        private val zaiRequestMutex = Mutex()
+        private var zaiNextAllowedAtMs: Long = 0L
+    }
+
+    override fun resolveBaseUrl(): String {
+        val configured = baseUrlOverride?.takeIf { it.isNotBlank() }
+            ?: when (providerName) {
+                "zai" -> configService?.getTyped(ConfigKeys.PROVIDER_ZAI_BASE_URL)
+                else -> configService?.getTyped(ConfigKeys.PROVIDER_CUSTOM_OPENAI_BASE_URL)
+            }
+            ?: when (providerName) {
+                "zai" -> System.getProperty("ZAI_BASE_URL") ?: System.getenv("ZAI_BASE_URL")
+                else -> System.getProperty("CUSTOM_OPENAI_BASE_URL") ?: System.getenv("CUSTOM_OPENAI_BASE_URL")
+            }
+            ?: defaultBaseUrl
+
+        return configured?.trimEnd('/')
+            ?: throw RefioError.ProviderNotConfigured(providerName, "base_url")
+    }
+
+    override fun resolveApiKey(): String? {
+        val key = apiKeyOverride?.takeIf { it.isNotBlank() }
+            ?: when (providerName) {
+                "zai" -> configService?.getTyped(ConfigKeys.PROVIDER_ZAI_API_KEY)
+                else -> configService?.getTyped(ConfigKeys.PROVIDER_CUSTOM_OPENAI_API_KEY)
+            }
+            ?: when (providerName) {
+                "zai" -> System.getProperty("ZAI_API_KEY") ?: System.getenv("ZAI_API_KEY")
+                else -> System.getProperty("CUSTOM_OPENAI_API_KEY") ?: System.getenv("CUSTOM_OPENAI_API_KEY")
+            }
+
+        if (requireApiKey && key.isNullOrBlank()) {
+            throw RefioError.ProviderNotConfigured(providerName, "api_key")
+        }
+        return key
+    }
+
+    override suspend fun <T> withProviderRateLimit(endpoint: String, block: suspend () -> T): T {
+        if (providerName != "zai") return block()
+        return zaiRequestMutex.withLock {
+            val now = System.currentTimeMillis()
+            val waitMs = (zaiNextAllowedAtMs - now).coerceAtLeast(0L)
+            if (waitMs > 0) {
+                logger.info { "[ZAI] Waiting ${waitMs}ms before next request: $endpoint" }
+                delay(waitMs)
+            }
+            try {
+                block()
+            } finally {
+                zaiNextAllowedAtMs = System.currentTimeMillis() + ZAI_COOLDOWN_MS
+            }
+        }
+    }
+
+    override suspend fun <T> executeWithRateLimitRetry(endpoint: String, block: suspend () -> T): T {
+        if (providerName != "zai") return block()
+        return try {
+            block()
+        } catch (e: RefioError.LLMRateLimit) {
+            logger.warn {
+                "[ZAI] Rate limit hit for $endpoint. Waiting ${ZAI_RATE_LIMIT_RETRY_DELAY_MS}ms before retry"
+            }
+            zaiNextAllowedAtMs = maxOf(
+                zaiNextAllowedAtMs,
+                System.currentTimeMillis() + ZAI_RATE_LIMIT_RETRY_DELAY_MS
+            )
+            delay(ZAI_RATE_LIMIT_RETRY_DELAY_MS)
+            block()
+        }
+    }
+
+    override fun mapHttpError(httpStatus: Int, rawBody: String): RefioError {
+        val parsed = parseProviderError(rawBody)
+        val message = parsed.message ?: rawBody
+        val businessCode = parsed.code
+        val finalMessage = if (providerName == "zai") {
+            buildZAIErrorMessage(httpStatus, businessCode, message)
+        } else {
+            message
+        }
+        return when (httpStatus) {
+            401, 403 -> RefioError.LLMAuthentication(providerName, model, IllegalStateException(finalMessage))
+            429 -> RefioError.LLMRateLimit(providerName, null, IllegalStateException(finalMessage))
+            434 -> RefioError.LLMAuthentication(providerName, model, IllegalStateException(finalMessage))
+            else -> RefioError.LLMError(providerName, model, IllegalStateException(finalMessage))
+        }
+    }
+
+    internal fun buildZAIErrorMessage(httpStatus: Int, businessCode: String?, message: String): String {
+        val normalized = businessCode?.trim()
+        val detail = when (normalized) {
+            "1000", "1001", "1002", "1003", "1004" -> "Authentication failed or token expired"
+            "1110" -> "Account is inactive"
+            "1111" -> "Account does not exist"
+            "1112", "1121" -> "Account has been locked"
+            "1113" -> "Account balance exhausted"
+            "1120" -> "Unable to access account temporarily"
+            "1210", "1213", "1214", "1215" -> "Invalid request parameters"
+            "1211" -> "Model does not exist"
+            "1212" -> "Model does not support this API method"
+            "1220" -> "No permission to access this API"
+            "1221" -> "API has been taken offline"
+            "1222" -> "API does not exist"
+            "1230" -> "API call process error"
+            "1231" -> "An identical request is already in progress"
+            "1234" -> "Network error on provider side"
+            "1301" -> "Request blocked by safety policy"
+            "1302" -> "API concurrency limit exceeded"
+            "1303" -> "API frequency limit exceeded"
+            "1304" -> "Daily API call limit reached"
+            "1305" -> "API rate limit triggered"
+            "1308" -> "Usage limit reached"
+            "1309" -> "GLM Coding Plan expired"
+            "1310" -> "Weekly or monthly limit exhausted"
+            else -> when (httpStatus) {
+                401, 403 -> "Authentication failure or token timeout"
+                429 -> "Rate limit or account quota restriction"
+                434 -> "No API permission"
+                else -> null
+            }
+        }
+
+        return buildString {
+            if (!normalized.isNullOrBlank()) {
+                append("Z.AI error ")
+                append(normalized)
+                append(": ")
+            }
+            if (!detail.isNullOrBlank()) {
+                append(detail)
+                if (message.isNotBlank() && !message.contains(detail, ignoreCase = true)) {
+                    append(". ")
+                }
+            }
+            if (message.isNotBlank()) {
+                append(message)
+            } else if (detail.isNullOrBlank()) {
+                append("HTTP ")
+                append(httpStatus)
+            }
+        }
+    }
+
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/LLMKtorClientFactory.kt b/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/LLMKtorClientFactory.kt
new file mode 100644
index 00000000..240f538a
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/LLMKtorClientFactory.kt
@@ -0,0 +1,55 @@
+package pl.jclab.refio.core.llm.adapters
+
+import io.ktor.client.HttpClient
+import io.ktor.client.engine.cio.CIO
+import io.ktor.client.plugins.HttpTimeout
+import io.ktor.client.plugins.contentnegotiation.ContentNegotiation
+import io.ktor.client.plugins.logging.LogLevel
+import io.ktor.client.plugins.logging.Logger as KtorLogger
+import io.ktor.client.plugins.logging.Logging
+import io.ktor.http.HttpHeaders
+import io.ktor.serialization.gson.gson
+import pl.jclab.refio.core.logging.DualLogger
+
+/**
+ * Fabryka [HttpClient] dla adapterów OpenAI-compatible (OpenAI, OpenRouter, LMStudio, GenericOpenAI).
+ *
+ * Eliminuje duplikację ~30 LOC per adapter:
+ * - ContentNegotiation z Gson (pretty print, serialize nulls),
+ * - Logging z auth headers sanitizacją i delegacją do per-adapter [DualLogger],
+ * - HttpTimeout skonfigurowany pod streaming (`requestTimeoutMillis = INFINITE`,
+ *   `socketTimeoutMillis` resetuje się per chunk — wykrywa tylko martwe połączenia).
+ *
+ * @param socketTimeoutMs socket timeout (reset per chunk). Typowo `configService.get(API_CALL_TIMEOUT) * 1000`.
+ * @param adapterLogger logger per-adapter do którego Ktor logging deleguje.
+ */
+internal object LLMKtorClientFactory {
+
+    fun create(socketTimeoutMs: Long, adapterLogger: DualLogger): HttpClient =
+        HttpClient(CIO) {
+            install(ContentNegotiation) {
+                gson {
+                    setPrettyPrinting()
+                    serializeNulls()
+                }
+            }
+            install(Logging) {
+                level = LogLevel.INFO
+                logger = object : KtorLogger {
+                    override fun log(message: String) {
+                        adapterLogger.debug { message }
+                    }
+                }
+                sanitizeHeader { header ->
+                    header.equals(HttpHeaders.Authorization, ignoreCase = true) ||
+                        header.equals("x-api-key", ignoreCase = true) ||
+                        header.equals("x-goog-api-key", ignoreCase = true)
+                }
+            }
+            install(HttpTimeout) {
+                requestTimeoutMillis = HttpTimeout.INFINITE_TIMEOUT_MS
+                connectTimeoutMillis = 30_000L
+                socketTimeoutMillis = socketTimeoutMs
+            }
+        }
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/LMStudioAdapter.kt b/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/LMStudioAdapter.kt
index a6671d4e..8a0f5829 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/LMStudioAdapter.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/LMStudioAdapter.kt
@@ -1,517 +1,67 @@
 package pl.jclab.refio.core.llm.adapters
 
-import kotlinx.coroutines.CancellationException
+import io.ktor.client.HttpClient
+import io.ktor.client.call.body
+import io.ktor.client.request.get
+import io.ktor.client.request.header
 import kotlinx.coroutines.Dispatchers
 import kotlinx.coroutines.withContext
+import pl.jclab.refio.core.config.ConfigKeys
 import pl.jclab.refio.core.db.ConfigScope
-import pl.jclab.refio.core.llm.BaseLLMAdapter
-import pl.jclab.refio.core.llm.LLMMessage
-import pl.jclab.refio.core.llm.LLMResponse
-import pl.jclab.refio.core.llm.LLMUsage
+import pl.jclab.refio.core.errors.LLMErrorMapper
 import pl.jclab.refio.core.llm.ModelConfig
-import pl.jclab.refio.core.llm.StreamChunk
+import pl.jclab.refio.core.llm.ModelDefinitions
+import pl.jclab.refio.core.llm.SupportedModels
 import pl.jclab.refio.core.llm.toModelConfig
-import pl.jclab.refio.core.config.ConfigKeys
 import pl.jclab.refio.core.services.ConfigService
-import pl.jclab.refio.core.services.ConfigService.Companion.DEFAULT_CONTEXT_SIZE
-import pl.jclab.refio.core.utils.GsonInstance.gson
-import pl.jclab.refio.core.logging.dualLogger
-import pl.jclab.refio.core.errors.LLMErrorMapper
-import pl.jclab.refio.core.security.SecureLogger
-import io.ktor.client.HttpClient
-import io.ktor.client.call.body
-import io.ktor.client.engine.cio.CIO
-import io.ktor.client.plugins.HttpTimeout
-import io.ktor.client.plugins.contentnegotiation.ContentNegotiation
-import io.ktor.client.plugins.logging.*
-import io.ktor.client.plugins.logging.Logger as KtorLogger
-import io.ktor.client.request.get
-import io.ktor.client.request.header
-import io.ktor.client.request.preparePost
-import io.ktor.client.request.post
-import io.ktor.client.request.setBody
-import io.ktor.http.ContentType
-import io.ktor.http.HttpHeaders
-import io.ktor.http.contentType
-import io.ktor.serialization.gson.gson
-import java.util.UUID
 
 /**
- * Adapter for LM Studio (local) using OpenAI-compatible API.
+ * LM Studio adapter for OpenAI-compatible local endpoints (defaults to
+ * `http://localhost:1234/v1`). Filters `listModels()` through [SupportedModels]
+ * and applies the user-configured context size when the model's own metadata
+ * does not include one.
  */
 class LMStudioAdapter(
     model: String = "local",
-    private val baseUrl: String? = null,
-    private val configService: ConfigService? = null,
-    private val taskId: String? = null,
-    private val subtaskId: String? = null,
-    private val source: String? = null
-) : BaseLLMAdapter(model, "lmstudio") {
+    private val baseUrlOverride: String? = null,
+    configService: ConfigService? = null,
+    taskId: String? = null,
+    subtaskId: String? = null,
+    source: String? = null,
+    httpClientOverride: HttpClient? = null,
+) : OpenAICompatibleAdapter(
+    model = model,
+    providerName = "lmstudio",
+    configService = configService,
+    taskId = taskId,
+    subtaskId = subtaskId,
+    source = source,
+    httpClientOverride = httpClientOverride,
+) {
 
     companion object {
         const val DEFAULT_BASE_URL = "http://localhost:1234/v1"
-        private const val CHAT_ENDPOINT = "/chat/completions"
-        private const val MODELS_ENDPOINT = "/models"
-    }
-
-    private val logger = dualLogger("LMStudioAdapter")
-
-    private val timeout: Long
-        get() = configService?.getTyped(ConfigKeys.API_CALL_TIMEOUT, taskId)?.toLong()?.times(1000L)
-            ?: ConfigKeys.API_CALL_TIMEOUT.default.toLong() * 1000L
-
-    private val client = HttpClient(CIO) {
-        install(ContentNegotiation) {
-            gson {
-                setPrettyPrinting()
-                serializeNulls()
-            }
-        }
-        install(Logging) {
-            level = LogLevel.INFO
-            logger = object : KtorLogger {
-                override fun log(message: String) {
-                    this@LMStudioAdapter.logger.debug { message }
-                }
-            }
-            sanitizeHeader { header ->
-                header.equals(HttpHeaders.Authorization, ignoreCase = true) ||
-                    header.equals("x-api-key", ignoreCase = true) ||
-                    header.equals("x-goog-api-key", ignoreCase = true)
-            }
-        }
-        install(HttpTimeout) {
-            val timeoutMs = configService?.getTyped(ConfigKeys.API_CALL_TIMEOUT, taskId)?.toLong()?.times(1000L)
-                ?: ConfigKeys.API_CALL_TIMEOUT.default.toLong() * 1000L
-            requestTimeoutMillis = timeoutMs
-            connectTimeoutMillis = 30000
-            socketTimeoutMillis = timeoutMs
-        }
     }
 
-    private fun resolveBaseUrl(): String {
-        return baseUrl
-            ?: configService?.get(ConfigService.KEY_PROVIDER_LM_STUDIO_BASE_URL, ConfigScope.APP)
+    override fun resolveBaseUrl(): String {
+        return baseUrlOverride
+            ?: configService?.get(ConfigKeys.PROVIDER_LM_STUDIO_BASE_URL.key, ConfigScope.APP)
             ?: System.getProperty("LM_STUDIO_BASE_URL")
             ?: System.getenv("LM_STUDIO_BASE_URL")
             ?: DEFAULT_BASE_URL
     }
 
-    private fun resolveApiKey(): String? {
-        return configService?.get(ConfigService.KEY_PROVIDER_LM_STUDIO_API_KEY, ConfigScope.APP)
+    override fun resolveApiKey(): String? {
+        return configService?.get(ConfigKeys.PROVIDER_LM_STUDIO_API_KEY.key, ConfigScope.APP)
             ?: System.getProperty("LM_STUDIO_API_KEY")
             ?: System.getenv("LM_STUDIO_API_KEY")
     }
 
-    override suspend fun chat(
-        messages: List<LLMMessage>,
-        systemMessages: List<String>,
-        maxTokens: Int?,
-        temperature: Double,
-        streaming: Boolean,
-        onStreamChunk: ((StreamChunk) -> Unit)?,
-        kwargs: Map<String, Any>
-    ): LLMResponse {
-        logger.info { "[LMStudio] Sending ${if (streaming) "streaming" else "standard"} chat request: model=$model, messages=${messages.size}, systemMessages=${systemMessages.size}" }
-
-        val resolvedBaseUrl = resolveBaseUrl()
-        val apiKey = resolveApiKey()
-
-        val lmMessages = mutableListOf<Map<String, Any>>()
-
-        // Add system messages from systemMessages parameter
-        systemMessages.filter { it.isNotBlank() }.forEach { sysMsg ->
-            lmMessages.add(mapOf("role" to "system", "content" to sysMsg))
-        }
-
-        // Add conversation messages (filter out any system messages as they should be in systemMessages parameter).
-        // Remap "tool" (used by LLMMessageMapper for tool results) to "assistant" — LM Studio uses the
-        // OpenAI-compatible chat schema where role="tool" requires tool_call_id, which this adapter does not emit.
-        messages.filter { it.role != "system" }.forEach { msg ->
-            val mappedRole = if (msg.role == "tool") "assistant" else msg.role
-            lmMessages.add(mapOf("role" to mappedRole, "content" to toOpenAiMessageContent(msg)))
-        }
-
-        val maxOutputLimit = configService?.getTyped(ConfigKeys.MAX_OUTPUT_SIZE, taskId)
-            ?: ConfigKeys.MAX_OUTPUT_SIZE.default
-        val requestedMaxTokens = when {
-            maxTokens != null && maxTokens > 0 -> minOf(maxTokens, maxOutputLimit)
-            else -> maxOutputLimit
-        }
-        val modelLimit = pl.jclab.refio.core.llm.ModelDefinitions
-            .getDefinition("lmstudio", model)
-            ?.maxOutputTokens
-        val effectiveMaxTokens = if (modelLimit != null && modelLimit > 0 && requestedMaxTokens > modelLimit) {
-            logger.warn {
-                "[LMStudio] Requested max_tokens=$requestedMaxTokens exceeds model limit ($modelLimit) for $model - clamping to safe value"
-            }
-            modelLimit
-        } else {
-            requestedMaxTokens
-        }
-
-        val requestBody = buildMap<String, Any> {
-            put("model", model)
-            put("messages", lmMessages)
-            put("temperature", temperature)
-            put("max_tokens", effectiveMaxTokens)
-            if (streaming) put("stream", true)
-
-            // LM Studio uses OpenAI-compatible API which doesn't support thinking parameter
-            // Log if thinking was requested for user awareness
-            val thinking = kwargs["thinking"] as? Boolean ?: false
-            if (thinking) {
-                logger.info { "[LMStudio] Thinking mode requested but not supported by OpenAI-compatible API - parameter ignored" }
-            }
-
-            (kwargs["top_p"] as? Number)?.let { put("top_p", it) }
-            (kwargs["frequency_penalty"] as? Number)?.let { put("frequency_penalty", it) }
-            (kwargs["presence_penalty"] as? Number)?.let { put("presence_penalty", it) }
-            kwargs["stop"]?.let { put("stop", it) }
-        }
-
-        val requestJson = gson.toJson(requestBody)
-        val requestId = UUID.randomUUID().toString()
-        val logPrefix = "[LMStudio][$requestId]"
-        logger.debug { "$logPrefix Request: ${SecureLogger.redactAndTruncate(requestJson)}" }
-
-        val startTime = System.currentTimeMillis()
-
-        return try {
-            if (streaming && onStreamChunk != null) {
-                executeStreaming(resolvedBaseUrl, apiKey, requestBody, requestJson, startTime, onStreamChunk, logPrefix)
-            } else {
-                executeStandard(resolvedBaseUrl, apiKey, requestBody, requestJson, startTime, logPrefix)
-            }
-        } catch (e: CancellationException) {
-            // Stream aborted by a guardrail (see core/llm/streaming/) — must propagate
-            // so the caller can see StreamAbortedException instead of RefioError.LLMError.
-            throw e
-        } catch (e: Exception) {
-            throw LLMErrorMapper.fromThrowable(provider, model, timeout, e)
-        }
-    }
-
-    private suspend fun executeStandard(
-        baseUrl: String,
-        apiKey: String?,
-        requestBody: Map<String, Any>,
-        requestJson: String,
-        startTime: Long,
-        logPrefix: String
-    ): LLMResponse {
-        var httpStatus: Int? = null
-
-        try {
-            logger.info { "$logPrefix Request start: endpoint=$baseUrl$CHAT_ENDPOINT, body=${SecureLogger.redactAndTruncate(requestJson)}" }
-            val response = client.post("$baseUrl$CHAT_ENDPOINT") {
-                contentType(ContentType.Application.Json)
-                apiKey?.let { header("Authorization", "Bearer $it") }
-                setBody(requestBody)
-            }
-
-            httpStatus = response.status.value
-            val rawResponse: Map<String, Any?> = response.body()
-
-            if (httpStatus !in 200..299) {
-                val latencyMs = (System.currentTimeMillis() - startTime).toInt()
-                val errorMessage = (rawResponse["error"] as? Map<*, *>)?.get("message") as? String
-                    ?: "LM Studio API error (HTTP $httpStatus)"
-
-                logger.apiError(
-                    provider = provider,
-                    model = model,
-                    endpoint = "$baseUrl$CHAT_ENDPOINT",
-                    requestJson = requestJson,
-                    httpStatus = httpStatus,
-                    error = Exception(errorMessage),
-                    latencyMs = latencyMs,
-                    taskId = taskId,
-                    subtaskId = subtaskId,
-                    source = source
-                )
-                throw LLMErrorMapper.fromHttpStatus(provider, model, httpStatus, errorMessage)
-            }
-
-            @Suppress("UNCHECKED_CAST")
-            val usageMap = rawResponse["usage"] as? Map<String, Any?> ?: emptyMap()
-            val promptTokens = (usageMap["prompt_tokens"] as? Number)?.toInt() ?: 0
-            val completionTokens = (usageMap["completion_tokens"] as? Number)?.toInt() ?: 0
-            val totalTokens = (usageMap["total_tokens"] as? Number)?.toInt() ?: promptTokens + completionTokens
-
-            val usage = LLMUsage(
-                inputTokens = promptTokens,
-                outputTokens = completionTokens,
-                totalTokens = totalTokens
-            )
-
-            @Suppress("UNCHECKED_CAST")
-            val choices = rawResponse["choices"] as? List<Map<String, Any?>> ?: emptyList()
-            val firstChoice = choices.firstOrNull() ?: emptyMap()
-            @Suppress("UNCHECKED_CAST")
-            val message = firstChoice["message"] as? Map<String, Any?> ?: emptyMap()
-            val content = message["content"] as? String ?: ""
-            val normalizedToolCallsJson = if (content.isBlank()) {
-                ToolCallContentNormalizer.fromOpenAiToolCalls(message["tool_calls"])
-            } else {
-                null
-            }
-            val finalContent = normalizedToolCallsJson ?: content
-            if (normalizedToolCallsJson != null) {
-                logger.info { "$logPrefix [TOOL_CALLS_NORMALIZED] Converted LM Studio tool_calls to canonical JSON content" }
-            }
-            val finishReason = firstChoice["finish_reason"] as? String
-
-            val latencyMs = (System.currentTimeMillis() - startTime).toInt()
-            val responseJson = gson.toJson(rawResponse)
-            logger.info {
-                "$logPrefix Response received: status=$httpStatus, durationMs=${System.currentTimeMillis() - startTime}, " +
-                    "bodySize=${responseJson.length}"
-            }
-
-            logger.apiResponse(
-                provider = provider,
-                model = model,
-                endpoint = "$baseUrl$CHAT_ENDPOINT",
-                requestJson = requestJson,
-                responseJson = responseJson,
-                httpStatus = httpStatus,
-                inputTokens = usage.inputTokens,
-                outputTokens = usage.outputTokens,
-                costUsd = 0.0,
-                latencyMs = latencyMs,
-                taskId = taskId,
-                subtaskId = subtaskId,
-                source = source
-            )
-
-            return LLMResponse(
-                content = finalContent,
-                usage = usage,
-                model = model,
-                provider = provider,
-                cost = 0.0,
-                finishReason = finishReason,
-                rawResponse = rawResponse
-            )
-        } catch (e: Exception) {
-            val latencyMs = (System.currentTimeMillis() - startTime).toInt()
-            logger.apiError(
-                provider = provider,
-                model = model,
-                endpoint = "$baseUrl$CHAT_ENDPOINT",
-                requestJson = requestJson,
-                httpStatus = httpStatus,
-                error = e,
-                latencyMs = latencyMs,
-                taskId = taskId,
-                subtaskId = subtaskId,
-                source = source
-            )
-            throw LLMErrorMapper.fromThrowable(provider, model, timeout, e)
-        }
-    }
-
-    private suspend fun executeStreaming(
-        baseUrl: String,
-        apiKey: String?,
-        requestBody: Map<String, Any>,
-        requestJson: String,
-        startTime: Long,
-        onStreamChunk: (StreamChunk) -> Unit,
-        logPrefix: String
-    ): LLMResponse {
-        val contentBuilder = StringBuilder()
-        val toolCallAccumulator = ToolCallContentNormalizer.OpenAiStreamingToolCallAccumulator()
-        var httpStatus: Int? = null
-        var finalFinishReason: String? = null
-
-        try {
-            logger.info { "$logPrefix Request start: endpoint=$baseUrl$CHAT_ENDPOINT, body=${SecureLogger.redactAndTruncate(requestJson)}" }
-            client.preparePost("$baseUrl$CHAT_ENDPOINT") {
-                contentType(ContentType.Application.Json)
-                apiKey?.let { header("Authorization", "Bearer $it") }
-                setBody(requestBody)
-            }.execute { httpResponse ->
-                httpStatus = httpResponse.status.value
-
-                if (httpStatus !in 200..299) {
-                    val errorBody = httpResponse.body<String>()
-                    val latencyMs = (System.currentTimeMillis() - startTime).toInt()
-                    val errorMessage = "LM Studio API error (HTTP $httpStatus): $errorBody"
-
-                    logger.apiError(
-                        provider = provider,
-                        model = model,
-                        endpoint = "$baseUrl$CHAT_ENDPOINT",
-                        requestJson = requestJson,
-                        httpStatus = httpStatus,
-                        error = Exception(errorMessage),
-                        latencyMs = latencyMs,
-                        taskId = taskId,
-                        subtaskId = subtaskId,
-                        source = source
-                    )
-                    throw LLMErrorMapper.fromHttpStatus(provider, model, httpStatus ?: 500, errorMessage)
-                }
-
-                val channel: io.ktor.utils.io.ByteReadChannel = httpResponse.body()
-                while (!channel.isClosedForRead) {
-                    if (pl.jclab.refio.core.services.monitoring.GlobalMetrics.isCancelled()) {
-                        finalFinishReason = "cancelled"
-                        break
-                    }
-
-                    val line = channel.readUTF8Line(limit = Int.MAX_VALUE) ?: continue
-                    if (line.isBlank() || !line.startsWith("data: ")) continue
-
-                    val data = line.removePrefix("data: ").trim()
-                    if (data == "[DONE]") break
-
-                    try {
-                        @Suppress("UNCHECKED_CAST")
-                        val chunk = gson.fromJson(data, Map::class.java) as Map<String, Any?>
-                        @Suppress("UNCHECKED_CAST")
-                        val choices = chunk["choices"] as? List<Map<String, Any?>> ?: emptyList()
-                        val first = choices.firstOrNull() ?: emptyMap()
-                        @Suppress("UNCHECKED_CAST")
-                        val delta = first["delta"] as? Map<String, Any?>
-                        toolCallAccumulator.consumeDelta(delta)
-                        val content = delta?.get("content") as? String
-                        val finishReason = first["finish_reason"] as? String
-
-                        if (!content.isNullOrEmpty()) {
-                            contentBuilder.append(content)
-                            onStreamChunk(StreamChunk(delta = content, finishReason = null))
-                        }
-
-                        if (finishReason != null) {
-                            finalFinishReason = finishReason
-                        }
-                    } catch (e: CancellationException) {
-                        // Let stream abort (guardrail trip) propagate out of the loop.
-                        throw e
-                    } catch (_: Exception) {
-                        continue
-                    }
-                }
-            }
-
-            if (contentBuilder.isEmpty()) {
-                val normalizedToolCallsJson = toolCallAccumulator.toCanonicalJson()
-                if (normalizedToolCallsJson != null) {
-                    contentBuilder.append(normalizedToolCallsJson)
-                    logger.info { "$logPrefix [TOOL_CALLS_NORMALIZED] Converted streamed LM Studio tool_calls to canonical JSON content" }
-                }
-            }
-
-            @Suppress("UNCHECKED_CAST")
-            val inputTokensEstimate = (requestBody["messages"] as? List<Map<String, Any?>>)
-                ?.sumOf {
-                    when (val content = it["content"]) {
-                        is String -> content.length
-                        is List<*> -> content.sumOf { part ->
-                            @Suppress("UNCHECKED_CAST")
-                            val partMap = part as? Map<String, Any?>
-                            (partMap?.get("text") as? String)?.length ?: 0
-                        }
-                        else -> 0
-                    }
-                } ?: 0
-            val outputTokensEstimate = contentBuilder.length / 4
-
-            val usage = LLMUsage(
-                inputTokens = inputTokensEstimate,
-                outputTokens = outputTokensEstimate,
-                totalTokens = inputTokensEstimate + outputTokensEstimate
-            )
-
-            onStreamChunk(StreamChunk(delta = "", finishReason = finalFinishReason, usage = usage))
-
-            val latencyMs = (System.currentTimeMillis() - startTime).toInt()
-            val syntheticResponse = mapOf(
-                "choices" to listOf(
-                    mapOf(
-                        "message" to mapOf("role" to "assistant", "content" to contentBuilder.toString()),
-                        "finish_reason" to finalFinishReason
-                    )
-                ),
-                "usage" to mapOf(
-                    "prompt_tokens" to usage.inputTokens,
-                    "completion_tokens" to usage.outputTokens,
-                    "total_tokens" to usage.totalTokens
-                ),
-                "model" to model
-            )
-            val responseJson = gson.toJson(syntheticResponse)
-            logger.info {
-                "$logPrefix Response received: status=${httpStatus ?: 200}, durationMs=${System.currentTimeMillis() - startTime}, " +
-                    "bodySize=${responseJson.length}"
-            }
-
-            logger.apiResponse(
-                provider = provider,
-                model = model,
-                endpoint = "$baseUrl$CHAT_ENDPOINT",
-                requestJson = requestJson,
-                responseJson = responseJson,
-                httpStatus = httpStatus ?: 200,
-                inputTokens = usage.inputTokens,
-                outputTokens = usage.outputTokens,
-                costUsd = 0.0,
-                latencyMs = latencyMs,
-                taskId = taskId,
-                subtaskId = subtaskId,
-                source = source
-            )
-
-            return LLMResponse(
-                content = contentBuilder.toString(),
-                usage = usage,
-                model = model,
-                provider = provider,
-                cost = 0.0,
-                finishReason = finalFinishReason,
-                rawResponse = syntheticResponse
-            )
-        } catch (e: CancellationException) {
-            // Guardrail-triggered abort — log and rethrow as-is, do NOT wrap.
-            val latencyMs = (System.currentTimeMillis() - startTime).toInt()
-            logger.apiError(
-                provider = provider,
-                model = model,
-                endpoint = "$baseUrl$CHAT_ENDPOINT",
-                requestJson = requestJson,
-                httpStatus = httpStatus,
-                error = e,
-                latencyMs = latencyMs,
-                taskId = taskId,
-                subtaskId = subtaskId,
-                source = source
-            )
-            throw e
-        } catch (e: Exception) {
-            val latencyMs = (System.currentTimeMillis() - startTime).toInt()
-            logger.apiError(
-                provider = provider,
-                model = model,
-                endpoint = "$baseUrl$CHAT_ENDPOINT",
-                requestJson = requestJson,
-                httpStatus = httpStatus,
-                error = e,
-                latencyMs = latencyMs,
-                taskId = taskId,
-                subtaskId = subtaskId,
-                source = source
-            )
-            throw LLMErrorMapper.fromThrowable(provider, model, timeout, e)
-        }
+    override fun logUnsupportedThinking(logPrefix: String) {
+        logger.info { "$logPrefix Thinking mode requested but not supported by OpenAI-compatible API - parameter ignored" }
     }
 
-    /**
-     * Lists models from LM Studio /v1/models endpoint.
-     */
-    suspend fun listModels(): List<ModelConfig> = withContext(Dispatchers.IO) {
+    override suspend fun listModels(): List<ModelConfig> = withContext(Dispatchers.IO) {
         val resolvedBaseUrl = resolveBaseUrl()
         val apiKey = resolveApiKey()
 
@@ -524,29 +74,30 @@ class LMStudioAdapter(
             @Suppress("UNCHECKED_CAST")
             val modelsData = body["data"] as? List<Map<String, Any?>> ?: emptyList()
 
-            val contextSize = configService?.getTyped(ConfigKeys.PROVIDER_LM_STUDIO_CONTEXT_SIZE) ?: ConfigKeys.PROVIDER_LM_STUDIO_CONTEXT_SIZE.default
+            val contextSize = configService?.getTyped(ConfigKeys.PROVIDER_LM_STUDIO_CONTEXT_SIZE)
+                ?: ConfigKeys.PROVIDER_LM_STUDIO_CONTEXT_SIZE.default
 
-            return@withContext modelsData.mapNotNull { modelData ->
+            modelsData.mapNotNull { modelData ->
                 val modelId = modelData["id"] as? String ?: return@mapNotNull null
 
-                if (!pl.jclab.refio.core.llm.SupportedModels.isSupported("lmstudio", modelId)) {
+                if (!SupportedModels.isSupported("lmstudio", modelId)) {
                     return@mapNotNull null
                 }
 
-                // Get context length from model data or use configured context size
                 val modelContextLength = (modelData["context_length"] as? Number)?.toInt() ?: contextSize
+                val baseDefinition = ModelDefinitions.getDefinition("lmstudio", modelId)
+                    ?: run {
+                        logger.warn {
+                            "[LMSTUDIO] Model $modelId not in registry — using synthetic definition (context=$modelContextLength)"
+                        }
+                        ModelDefinitions.syntheticDefinitionFor(
+                            provider = "lmstudio",
+                            modelId = modelId,
+                            maxContext = modelContextLength,
+                        )
+                    }
 
-                // Get definition from registry or create fallback
-                val baseDefinition = pl.jclab.refio.core.llm.ModelDefinitions.getDefinition("lmstudio", modelId)
-                    ?: pl.jclab.refio.core.llm.ModelDefinitions.createFallback(
-                        provider = "lmstudio",
-                        modelId = modelId,
-                        maxContext = modelContextLength
-                    )
-
-                // Always override maxContext with configured/model-reported value for LM Studio models
                 val definition = baseDefinition.copy(maxContext = modelContextLength)
-
                 definition.toModelConfig()
             }
         } catch (e: Exception) {
@@ -554,10 +105,4 @@ class LMStudioAdapter(
             throw LLMErrorMapper.listModelsFailure(provider, e)
         }
     }
-
-    override fun estimateCost(usage: LLMUsage): Double = 0.0
-
-    override suspend fun close() {
-        client.close()
-    }
 }
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/OllamaAdapter.kt b/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/OllamaAdapter.kt
index 341c53f6..de217495 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/OllamaAdapter.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/OllamaAdapter.kt
@@ -21,6 +21,7 @@ import io.ktor.http.*
 import io.ktor.serialization.gson.*
 import pl.jclab.refio.core.config.ConfigKeys
 import pl.jclab.refio.core.errors.LLMErrorMapper
+import pl.jclab.refio.core.errors.RefioError
 import pl.jclab.refio.core.security.SecureLogger
 import pl.jclab.refio.core.services.OllamaRequestGate
 import pl.jclab.refio.core.logging.dualLogger
@@ -43,7 +44,8 @@ class OllamaAdapter(
     private val configService: pl.jclab.refio.core.services.ConfigService? = null,
     private val taskId: String? = null,
     private val subtaskId: String? = null,
-    private val source: String? = null
+    private val source: String? = null,
+    httpClientOverride: HttpClient? = null
 ) : BaseLLMAdapter(model, "ollama") {
 
     private val logger = dualLogger("OllamaAdapter")
@@ -65,33 +67,10 @@ class OllamaAdapter(
         get() = configService?.getTyped(ConfigKeys.API_CALL_TIMEOUT, taskId)?.toLong()?.times(1000L)
             ?: ConfigKeys.TOOL_EXECUTION_TIMEOUT.default.toLong() * 1000L
 
-    private val client = HttpClient(CIO) {
-        install(ContentNegotiation) {
-            gson {
-                setPrettyPrinting()
-                serializeNulls()
-            }
-        }
-        install(Logging) {
-            level = LogLevel.INFO
-            logger = object : KtorLogger {
-                override fun log(message: String) {
-                    this@OllamaAdapter.logger.debug { message }
-                }
-            }
-            sanitizeHeader { header ->
-                header.equals(HttpHeaders.Authorization, ignoreCase = true) ||
-                        header.equals("x-api-key", ignoreCase = true) ||
-                        header.equals("x-goog-api-key", ignoreCase = true)
-            }
-        }
-        install(HttpTimeout) {
-            val timeoutMs = configService?.getTyped(ConfigKeys.API_CALL_TIMEOUT, taskId)?.toLong()?.times(1000L)
-                ?: ConfigKeys.TOOL_EXECUTION_TIMEOUT.default.toLong() * 1000L
-            requestTimeoutMillis = timeoutMs
-            connectTimeoutMillis = 30000
-            socketTimeoutMillis = timeoutMs
-        }
+    private val client = httpClientOverride ?: run {
+        val socketTimeoutMs = configService?.getTyped(ConfigKeys.API_CALL_TIMEOUT, taskId)?.toLong()?.times(1000L)
+            ?: ConfigKeys.TOOL_EXECUTION_TIMEOUT.default.toLong() * 1000L
+        LLMKtorClientFactory.create(socketTimeoutMs, logger)
     }
 
     override suspend fun chat(
@@ -241,7 +220,7 @@ class OllamaAdapter(
             put("options", buildMap {
                 put("temperature", temperature)
 
-                val contextSize = configService?.get(ConfigService.KEY_PROVIDER_OLLAMA_CONTEXT_SIZE)?.toIntOrNull()
+                val contextSize = configService?.get(ConfigKeys.PROVIDER_OLLAMA_CONTEXT_SIZE.key)?.toIntOrNull()
                     ?: DEFAULT_CONTEXT_SIZE
                 put("num_ctx", contextSize)
 
@@ -360,6 +339,14 @@ class OllamaAdapter(
             )
 
             // Parse response
+            if (response["message"] !is Map<*, *>) {
+                throw RefioError.MalformedResponse(
+                    provider = provider,
+                    model = model,
+                    reason = "Missing or non-object 'message' in Ollama response",
+                    bodyPreview = gson.toJson(response)
+                )
+            }
             @Suppress("UNCHECKED_CAST")
             val messageMap = response["message"] as? Map<String, Any?> ?: emptyMap()
             var rawContent = messageMap["content"] as? String ?: ""
@@ -756,7 +743,7 @@ class OllamaAdapter(
 
             // Get context size from ConfigService (global setting for all Ollama models)
             val contextSize =
-                configService?.get(pl.jclab.refio.core.services.ConfigService.KEY_PROVIDER_OLLAMA_CONTEXT_SIZE)
+                configService?.get(ConfigKeys.PROVIDER_OLLAMA_CONTEXT_SIZE.key)
                     ?.toIntOrNull()
                     ?: DEFAULT_CONTEXT_SIZE
 
@@ -765,13 +752,18 @@ class OllamaAdapter(
             val modelConfigs = modelsData.mapNotNull { modelData ->
                 val modelName = modelData["name"] as? String ?: return@mapNotNull null
 
-                // Get definition from registry or create fallback with configured context size
+                // Get definition from registry or synthesize for unknown models (new releases).
                 val baseDefinition = pl.jclab.refio.core.llm.ModelDefinitions.getDefinition("ollama", modelName)
-                    ?: pl.jclab.refio.core.llm.ModelDefinitions.createFallback(
-                        provider = "ollama",
-                        modelId = modelName,
-                        maxContext = contextSize  // Use configured context size
-                    )
+                    ?: run {
+                        logger.warn {
+                            "[OLLAMA] Model $modelName not in registry — using synthetic definition with defaults (context=$contextSize)"
+                        }
+                        pl.jclab.refio.core.llm.ModelDefinitions.syntheticDefinitionFor(
+                            provider = "ollama",
+                            modelId = modelName,
+                            maxContext = contextSize
+                        )
+                    }
 
                 // Always override maxContext with configured value for Ollama models
                 val definition = baseDefinition.copy(maxContext = contextSize)
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/OpenAIAdapter.kt b/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/OpenAIAdapter.kt
index 7956d617..ed240d78 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/OpenAIAdapter.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/OpenAIAdapter.kt
@@ -173,13 +173,16 @@ class OpenAIAdapter(
                     true -> "medium"
                 }
                 is String -> {
-                    // Validate string value
                     when (thinking.lowercase()) {
                         "low", "medium", "high" -> thinking.lowercase()
-                        else -> "medium" // default to medium for invalid values
+                        else -> throw IllegalArgumentException(
+                            "OpenAI reasoning effort must be one of [low, medium, high], got: '$thinking'"
+                        )
                     }
                 }
-                else -> "medium" // default fallback
+                else -> throw IllegalArgumentException(
+                    "OpenAI reasoning 'thinking' must be Boolean or String, got: ${thinking.javaClass.simpleName}"
+                )
             }
 
             transformed["reasoning"] = mapOf("effort" to effort)
@@ -264,30 +267,20 @@ class OpenAIAdapter(
         }
 
         if (content.isBlank()) {
-            // FALLBACK: If no output text found, check top-level "text" field
-            logger.warn { "[OPENAI] No content in output, checking top-level 'text' field" }
-
-            val topLevelText = response["text"]
-            logger.info { "[OPENAI] Top-level text field type: ${topLevelText?.javaClass?.simpleName}" }
-
-            content = when (topLevelText) {
-                is String -> {
-                    logger.info { "[OPENAI] Using top-level text string, length: ${topLevelText.length}" }
-                    topLevelText
-                }
-
-                is Map<*, *> -> {
-                    val textMap = topLevelText
-                    val textContent = textMap["content"]?.toString() ?: textMap["text"]?.toString() ?: ""
-                    logger.info { "[OPENAI] Using text from map, length: ${textContent.length}" }
-                    textContent
-                }
-
-                else -> {
-                    logger.error { "[OPENAI] Cannot extract content - no output text and no text field" }
-                    ""
-                }
+            // No silent fallback — let caller see the malformed structure. (Previously
+            // probed top-level "text" field; that path hid bugs in Responses API
+            // integration. See REFACTOR.md §1.)
+            val preview = try {
+                gson.toJson(response).take(500)
+            } catch (_: Exception) {
+                response.keys.joinToString(prefix = "[keys: ", postfix = "]")
             }
+            throw RefioError.MalformedResponse(
+                provider = provider,
+                model = model,
+                reason = "Responses API returned no content in 'output[*]' and no readable message item",
+                bodyPreview = preview
+            )
         }
 
         val role = messageItem?.get("role") as? String ?: "assistant"
@@ -345,33 +338,10 @@ class OpenAIAdapter(
 
     private val ownsHttpClient = httpClientOverride == null
 
-    private val client = httpClientOverride ?: HttpClient(CIO) {
-        install(ContentNegotiation) {
-            gson {
-                setPrettyPrinting()
-                serializeNulls()
-            }
-        }
-        install(Logging) {
-            level = LogLevel.INFO
-            logger = object : KtorLogger {
-                override fun log(message: String) {
-                    this@OpenAIAdapter.logger.debug { message }
-                }
-            }
-            sanitizeHeader { header ->
-                header.equals(HttpHeaders.Authorization, ignoreCase = true) ||
-                    header.equals("x-api-key", ignoreCase = true) ||
-                    header.equals("x-goog-api-key", ignoreCase = true)
-            }
-        }
-        install(HttpTimeout) {
-            val timeoutMs = configService?.getTyped(ConfigKeys.API_CALL_TIMEOUT, taskId)?.toLong()?.times(1000L)
-                ?: ConfigKeys.API_CALL_TIMEOUT.default.toLong() * 1000L
-            requestTimeoutMillis = timeoutMs
-            connectTimeoutMillis = 30000
-            socketTimeoutMillis = timeoutMs
-        }
+    private val client = httpClientOverride ?: run {
+        val socketTimeoutMs = configService?.getTyped(ConfigKeys.API_CALL_TIMEOUT, taskId)?.toLong()?.times(1000L)
+            ?: ConfigKeys.API_CALL_TIMEOUT.default.toLong() * 1000L
+        LLMKtorClientFactory.create(socketTimeoutMs, logger)
     }
 
     override suspend fun chat(
@@ -437,7 +407,7 @@ class OpenAIAdapter(
         try {
             // Get API key from ConfigService (single source of truth)
             val apiKeyToUse = configService?.get(
-                key = pl.jclab.refio.core.services.ConfigService.KEY_PROVIDER_OPENAI_API_KEY,
+                key = ConfigKeys.PROVIDER_OPENAI_API_KEY.key,
                 scope = pl.jclab.refio.core.db.ConfigScope.APP
             )
                 ?: System.getProperty("OPENAI_API_KEY")
@@ -480,25 +450,19 @@ class OpenAIAdapter(
             baseParams["stream"] = true
         }
 
-        // Use min of provided maxTokens and configured limit
         val maxOutputLimit = configService?.getTyped(ConfigKeys.MAX_OUTPUT_SIZE, taskId)
             ?: ConfigKeys.MAX_OUTPUT_SIZE.default
-        val requestedMax = when {
-            maxTokens != null && maxTokens > 0 -> minOf(maxTokens, maxOutputLimit)
-            else -> maxOutputLimit
-        }
-        val modelLimit = definition?.maxOutputTokens
-        val effectiveMaxTokens = if (modelLimit != null && modelLimit > 0 && requestedMax > modelLimit) {
-            logger.warn {
-                "[OPENAI] Requested max_tokens=$requestedMax exceeds model limit ($modelLimit) for $model - clamping to safe value"
-            }
-            modelLimit
-        } else {
-            requestedMax
-        }
+        val effectiveMaxTokens = OpenAICompatibleHelpers.resolveEffectiveMaxTokens(
+            requested = maxTokens,
+            configLimit = maxOutputLimit,
+            modelLimit = definition?.maxOutputTokens,
+            providerTag = "OPENAI",
+            model = model,
+            log = { logger.warn(it) }
+        )
         baseParams["max_tokens"] = effectiveMaxTokens
         logger.debug {
-            "[OPENAI] Using maxTokens=$effectiveMaxTokens (requested=$maxTokens, configLimit=$maxOutputLimit, modelLimit=${modelLimit ?: "n/a"})"
+            "[OPENAI] Using maxTokens=$effectiveMaxTokens (requested=$maxTokens, configLimit=$maxOutputLimit, modelLimit=${definition?.maxOutputTokens ?: "n/a"})"
         }
 
         // Handle response_format for JSON mode
@@ -531,11 +495,7 @@ class OpenAIAdapter(
             }
         }
 
-        // Additional parameters from kwargs
-        (kwargs["top_p"] as? Number)?.let { baseParams["top_p"] = it }
-        (kwargs["frequency_penalty"] as? Number)?.let { baseParams["frequency_penalty"] = it }
-        (kwargs["presence_penalty"] as? Number)?.let { baseParams["presence_penalty"] = it }
-        kwargs["stop"]?.let { baseParams["stop"] = it }
+        with(OpenAICompatibleHelpers) { baseParams.addCommonKwargs(kwargs) }
 
         // Apply format transformation if needed
         val transformedParams = if (definition?.apiFormat == pl.jclab.refio.core.llm.ApiFormat.RESPONSES) {
@@ -1362,7 +1322,7 @@ class OpenAIAdapter(
         try {
             // Get API key from ConfigService (single source of truth)
             val apiKeyToUse = configService?.get(
-                key = pl.jclab.refio.core.services.ConfigService.KEY_PROVIDER_OPENAI_API_KEY,
+                key = ConfigKeys.PROVIDER_OPENAI_API_KEY.key,
                 scope = pl.jclab.refio.core.db.ConfigScope.APP
             )
                 ?: System.getProperty("OPENAI_API_KEY")
@@ -1397,16 +1357,17 @@ class OpenAIAdapter(
                     return@mapNotNull null
                 }
 
-                // Get static definition from ModelDefinitions or create fallback
+                // Get static definition from ModelDefinitions or synthesize for unknown models.
                 val definition = pl.jclab.refio.core.llm.ModelDefinitions.getDefinition("openai", modelId)
                     ?: run {
-                        // Extract context length from API response if available
                         @Suppress("UNCHECKED_CAST")
                         val contextLength = (modelData["context_length"] as? Number)?.toInt() ?: DEFAULT_CONTEXT_SIZE
 
-                        logger.debug { "[OPENAI] Model $modelId not in registry, using fallback (context=$contextLength)" }
+                        logger.warn {
+                            "[OPENAI] Model $modelId not in registry — using synthetic definition (context=$contextLength)"
+                        }
 
-                        pl.jclab.refio.core.llm.ModelDefinitions.createFallback(
+                        pl.jclab.refio.core.llm.ModelDefinitions.syntheticDefinitionFor(
                             provider = "openai",
                             modelId = modelId,
                             maxContext = contextLength
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/OpenAICompatibleAdapter.kt b/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/OpenAICompatibleAdapter.kt
new file mode 100644
index 00000000..76d34f80
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/OpenAICompatibleAdapter.kt
@@ -0,0 +1,559 @@
+package pl.jclab.refio.core.llm.adapters
+
+import io.ktor.client.HttpClient
+import io.ktor.client.call.body
+import io.ktor.client.plugins.HttpRequestTimeoutException
+import io.ktor.client.request.get
+import io.ktor.client.request.header
+import io.ktor.client.request.post
+import io.ktor.client.request.preparePost
+import io.ktor.client.request.setBody
+import io.ktor.http.ContentType
+import io.ktor.http.HttpHeaders
+import io.ktor.http.contentType
+import kotlinx.coroutines.CancellationException
+import kotlinx.coroutines.Dispatchers
+import kotlinx.coroutines.withContext
+import pl.jclab.refio.core.config.ConfigKeys
+import pl.jclab.refio.core.errors.RefioError
+import pl.jclab.refio.core.llm.BaseLLMAdapter
+import pl.jclab.refio.core.llm.LLMMessage
+import pl.jclab.refio.core.llm.LLMResponse
+import pl.jclab.refio.core.llm.LLMUsage
+import pl.jclab.refio.core.llm.ModelConfig
+import pl.jclab.refio.core.llm.ModelDefinitions
+import pl.jclab.refio.core.llm.StreamChunk
+import pl.jclab.refio.core.llm.toModelConfig
+import pl.jclab.refio.core.logging.DualLogger
+import pl.jclab.refio.core.logging.dualLogger
+import pl.jclab.refio.core.security.SecureLogger
+import pl.jclab.refio.core.services.ConfigService
+import pl.jclab.refio.core.services.ConfigService.Companion.DEFAULT_CONTEXT_SIZE
+import pl.jclab.refio.core.utils.GsonInstance.gson
+import java.util.UUID
+
+/**
+ * Base adapter for OpenAI-compatible providers (Generic OpenAI, Z.AI, LM Studio).
+ *
+ * Encapsulates the shared protocol: `/v1/chat/completions` with `choices[].message.content`
+ * responses and standard SSE streaming. Subclasses override only what truly differs
+ * (endpoint URLs, API key resolution, error mapping, per-provider quirks).
+ *
+ * Not suitable for the OpenAI Responses API (use [OpenAIAdapter]) or providers with
+ * structurally different streams (OpenRouter's mid-stream error envelopes).
+ */
+abstract class OpenAICompatibleAdapter(
+    model: String,
+    protected val providerName: String,
+    protected val configService: ConfigService? = null,
+    protected val taskId: String? = null,
+    protected val subtaskId: String? = null,
+    protected val source: String? = null,
+    protected val requireApiKey: Boolean = false,
+    httpClientOverride: HttpClient? = null,
+    protected val logger: DualLogger = dualLogger("${providerName}Adapter"),
+) : BaseLLMAdapter(model, providerName) {
+
+    companion object {
+        const val CHAT_ENDPOINT = "/chat/completions"
+        const val MODELS_ENDPOINT = "/models"
+    }
+
+    protected val providerTag: String get() = providerName.uppercase()
+
+    protected val client: HttpClient = httpClientOverride ?: run {
+        val timeoutMs = configService?.getTyped(ConfigKeys.API_CALL_TIMEOUT, taskId)?.toLong()?.times(1000L)
+            ?: ConfigKeys.API_CALL_TIMEOUT.default.toLong() * 1000L
+        LLMKtorClientFactory.create(timeoutMs, logger)
+    }
+
+    /** Endpoint suffix appended to [resolveBaseUrl]. Defaults to `/chat/completions`. */
+    protected open val chatEndpointPath: String = CHAT_ENDPOINT
+
+    /** Endpoint suffix for listing models. Defaults to `/models`. */
+    protected open val modelsEndpointPath: String = MODELS_ENDPOINT
+
+    /** Provider-specific base URL. Must be implemented by subclass. */
+    protected abstract fun resolveBaseUrl(): String
+
+    /** Provider-specific API key. Returns null for providers that don't require auth. */
+    protected abstract fun resolveApiKey(): String?
+
+    /** Provider-specific log message when `thinking` is requested but unsupported. */
+    protected open fun logUnsupportedThinking(logPrefix: String) {
+        // Default: silent. Subclasses may override to emit a hint.
+    }
+
+    /** Pre-request hook (rate limit mutex, etc.). Default: run block as-is. */
+    protected open suspend fun <T> withProviderRateLimit(endpoint: String, block: suspend () -> T): T = block()
+
+    /** Extra headers appended to every chat and models request. Default: none. */
+    protected open fun extraRequestHeaders(): Map<String, String> = emptyMap()
+
+    /**
+     * Inspect each raw SSE chunk before normal delta processing. Throwing
+     * [IllegalStateException] from here propagates up as a stream error
+     * (used by OpenRouter to surface mid-stream provider errors).
+     */
+    protected open fun onStreamRawChunk(chunk: Map<String, Any?>) = Unit
+
+    /** Rate-limit retry hook (Z.AI). Default: no retry. */
+    protected open suspend fun <T> executeWithRateLimitRetry(endpoint: String, block: suspend () -> T): T = block()
+
+    /**
+     * Build the JSON request body. Subclasses can override to add custom keys or
+     * strip parameters that their provider rejects (e.g. `response_format`).
+     */
+    protected open fun buildRequestBody(
+        requestMessages: List<Map<String, Any>>,
+        effectiveMaxTokens: Int,
+        temperature: Double,
+        streaming: Boolean,
+        kwargs: Map<String, Any>,
+        requestId: String,
+    ): Map<String, Any> = buildMap {
+        put("request_id", requestId)
+        put("model", model)
+        put("messages", requestMessages)
+        put("temperature", temperature)
+        put("max_tokens", effectiveMaxTokens)
+        if (streaming) put("stream", true)
+        with(OpenAICompatibleHelpers) { addCommonKwargs(kwargs) }
+        kwargs["response_format"]?.let { put("response_format", it) }
+    }
+
+    /**
+     * Map a non-2xx HTTP response to a [RefioError]. Default mapping:
+     * 401/403 → Authentication, 429 → RateLimit, else → LLMError.
+     * Subclasses can override to parse business codes and produce richer messages.
+     */
+    protected open fun mapHttpError(httpStatus: Int, rawBody: String): RefioError {
+        val parsed = parseProviderError(rawBody)
+        val message = parsed.message ?: rawBody
+        return when (httpStatus) {
+            401, 403 -> RefioError.LLMAuthentication(providerName, model, IllegalStateException(message))
+            429 -> RefioError.LLMRateLimit(providerName, null, IllegalStateException(message))
+            else -> RefioError.LLMError(providerName, model, IllegalStateException(message))
+        }
+    }
+
+    /**
+     * Ensure a successful response. Default: delegate to [mapHttpError] on non-2xx.
+     */
+    protected open fun ensureSuccess(httpStatus: Int, rawResponse: Map<String, Any?>, endpoint: String) {
+        if (httpStatus in 200..299) return
+
+        val message = (rawResponse["error"] as? Map<*, *>)?.get("message") as? String
+            ?: "OpenAI-compatible API error (HTTP $httpStatus) at $endpoint"
+        throw mapHttpError(httpStatus, gson.toJson(rawResponse).ifBlank { message })
+    }
+
+    /**
+     * Extract usage from the standard OpenAI-compatible `usage` map.
+     */
+    protected fun extractUsage(rawResponse: Map<String, Any?>): LLMUsage {
+        @Suppress("UNCHECKED_CAST")
+        val usageMap = rawResponse["usage"] as? Map<String, Any?> ?: emptyMap()
+        val promptTokens = (usageMap["prompt_tokens"] as? Number)?.toInt() ?: 0
+        val completionTokens = (usageMap["completion_tokens"] as? Number)?.toInt() ?: 0
+        val totalTokens = (usageMap["total_tokens"] as? Number)?.toInt() ?: (promptTokens + completionTokens)
+        return LLMUsage(promptTokens, completionTokens, totalTokens)
+    }
+
+    /**
+     * Parse a chat-completions provider error body into a structured payload.
+     */
+    data class ProviderErrorPayload(
+        val code: String? = null,
+        val message: String? = null,
+    )
+
+    fun parseProviderError(rawBody: String): ProviderErrorPayload {
+        return runCatching {
+            val parsed = gson.fromJson(rawBody, Map::class.java)
+            val error = parsed?.get("error") as? Map<*, *>
+            ProviderErrorPayload(
+                code = error?.get("code")?.toString(),
+                message = error?.get("message")?.toString(),
+            )
+        }.getOrDefault(ProviderErrorPayload(message = rawBody))
+    }
+
+    final override suspend fun chat(
+        messages: List<LLMMessage>,
+        systemMessages: List<String>,
+        maxTokens: Int?,
+        temperature: Double,
+        streaming: Boolean,
+        onStreamChunk: ((StreamChunk) -> Unit)?,
+        kwargs: Map<String, Any>
+    ): LLMResponse {
+        val baseUrl = resolveBaseUrl()
+        val apiKey = resolveApiKey()
+        if (requireApiKey && apiKey.isNullOrBlank()) {
+            throw RefioError.ProviderNotConfigured(providerName, "api_key")
+        }
+        val requestMessages = OpenAICompatibleHelpers.buildMessages(this, systemMessages, messages)
+        val configLimit = configService?.getTyped(ConfigKeys.MAX_OUTPUT_SIZE, taskId)
+            ?: ConfigKeys.MAX_OUTPUT_SIZE.default
+        val effectiveMaxTokens = OpenAICompatibleHelpers.resolveEffectiveMaxTokens(
+            requested = maxTokens,
+            configLimit = configLimit,
+            modelLimit = ModelDefinitions.getDefinition(providerName, model)?.maxOutputTokens,
+            providerTag = providerTag,
+            model = model,
+            log = { logger.warn(it) }
+        )
+        val requestId = UUID.randomUUID().toString()
+        val logPrefix = "[$providerTag][$requestId]"
+
+        if (kwargs["thinking"] as? Boolean == true) {
+            logUnsupportedThinking(logPrefix)
+        }
+
+        val requestBody = buildRequestBody(
+            requestMessages = requestMessages,
+            effectiveMaxTokens = effectiveMaxTokens,
+            temperature = temperature,
+            streaming = streaming,
+            kwargs = kwargs,
+            requestId = requestId,
+        )
+        val requestJson = gson.toJson(requestBody)
+        val startTime = System.currentTimeMillis()
+
+        return try {
+            if (streaming && onStreamChunk != null) {
+                executeStreaming(baseUrl, apiKey, requestBody, requestJson, startTime, onStreamChunk, logPrefix)
+            } else {
+                executeStandard(baseUrl, apiKey, requestBody, requestJson, startTime, logPrefix)
+            }
+        } catch (e: HttpRequestTimeoutException) {
+            val timeoutMs = configService?.getTyped(ConfigKeys.API_CALL_TIMEOUT, taskId)?.toLong()?.times(1000L) ?: 0L
+            throw RefioError.LLMTimeout(providerName, model, timeoutMs, e)
+        }
+    }
+
+    protected open suspend fun executeStandard(
+        baseUrl: String,
+        apiKey: String?,
+        requestBody: Map<String, Any>,
+        requestJson: String,
+        startTime: Long,
+        logPrefix: String,
+    ): LLMResponse {
+        var httpStatus: Int? = null
+        val endpoint = "$baseUrl$chatEndpointPath"
+
+        try {
+            logger.info { "$logPrefix Request start: endpoint=$endpoint, body=${SecureLogger.redactAndTruncate(requestJson)}" }
+            val response = executeWithRateLimitRetry(endpoint) {
+                withProviderRateLimit(endpoint) {
+                    client.post(endpoint) {
+                        contentType(ContentType.Application.Json)
+                        apiKey?.let { header(HttpHeaders.Authorization, "Bearer $it") }
+                        extraRequestHeaders().forEach { (k, v) -> header(k, v) }
+                        setBody(requestBody)
+                    }
+                }
+            }
+
+            httpStatus = response.status.value
+            val rawResponse: Map<String, Any?> = response.body()
+            ensureSuccess(httpStatus, rawResponse, endpoint)
+
+            val usage = extractUsage(rawResponse)
+            if (rawResponse["choices"] !is List<*>) {
+                throw RefioError.MalformedResponse(
+                    provider = provider,
+                    model = model,
+                    reason = "Missing or non-list 'choices' in chat completions response",
+                    bodyPreview = gson.toJson(rawResponse)
+                )
+            }
+            @Suppress("UNCHECKED_CAST")
+            val choices = rawResponse["choices"] as? List<Map<String, Any?>> ?: emptyList()
+            val firstChoice = choices.firstOrNull() ?: emptyMap()
+            @Suppress("UNCHECKED_CAST")
+            val message = firstChoice["message"] as? Map<String, Any?> ?: emptyMap()
+            val content = message["content"] as? String ?: ""
+            val normalizedToolCallsJson = if (content.isBlank()) {
+                ToolCallContentNormalizer.fromOpenAiToolCalls(message["tool_calls"])
+            } else {
+                null
+            }
+
+            logger.apiResponse(
+                provider = provider,
+                model = model,
+                endpoint = endpoint,
+                requestJson = requestJson,
+                responseJson = gson.toJson(rawResponse),
+                httpStatus = httpStatus,
+                inputTokens = usage.inputTokens,
+                outputTokens = usage.outputTokens,
+                costUsd = estimateCost(usage),
+                latencyMs = (System.currentTimeMillis() - startTime).toInt(),
+                taskId = taskId,
+                subtaskId = subtaskId,
+                source = source,
+            )
+
+            return LLMResponse(
+                content = normalizedToolCallsJson ?: content,
+                usage = usage,
+                model = model,
+                provider = provider,
+                cost = estimateCost(usage),
+                finishReason = firstChoice["finish_reason"] as? String,
+                rawResponse = rawResponse,
+            )
+        } catch (e: RefioError) {
+            logger.apiError(
+                provider = provider,
+                model = model,
+                endpoint = endpoint,
+                requestJson = requestJson,
+                httpStatus = httpStatus,
+                error = e,
+                latencyMs = (System.currentTimeMillis() - startTime).toInt(),
+                taskId = taskId,
+                subtaskId = subtaskId,
+                source = source,
+            )
+            throw e
+        } catch (e: CancellationException) {
+            throw e
+        } catch (e: Exception) {
+            logger.apiError(
+                provider = provider,
+                model = model,
+                endpoint = endpoint,
+                requestJson = requestJson,
+                httpStatus = httpStatus,
+                error = e,
+                latencyMs = (System.currentTimeMillis() - startTime).toInt(),
+                taskId = taskId,
+                subtaskId = subtaskId,
+                source = source,
+            )
+            throw RefioError.LLMError(providerName, model, e)
+        }
+    }
+
+    protected open suspend fun executeStreaming(
+        baseUrl: String,
+        apiKey: String?,
+        requestBody: Map<String, Any>,
+        requestJson: String,
+        startTime: Long,
+        onStreamChunk: (StreamChunk) -> Unit,
+        logPrefix: String,
+    ): LLMResponse {
+        val contentBuilder = StringBuilder()
+        val toolCallAccumulator = ToolCallContentNormalizer.OpenAiStreamingToolCallAccumulator()
+        var httpStatus: Int? = null
+        var finalFinishReason: String? = null
+        val endpoint = "$baseUrl$chatEndpointPath"
+
+        try {
+            logger.info { "$logPrefix Request start: endpoint=$endpoint, body=${SecureLogger.redactAndTruncate(requestJson)}" }
+            executeWithRateLimitRetry(endpoint) {
+                withProviderRateLimit(endpoint) {
+                    client.preparePost(endpoint) {
+                        contentType(ContentType.Application.Json)
+                        apiKey?.let { header(HttpHeaders.Authorization, "Bearer $it") }
+                        extraRequestHeaders().forEach { (k, v) -> header(k, v) }
+                        setBody(requestBody)
+                    }.execute { httpResponse ->
+                        httpStatus = httpResponse.status.value
+                        if (httpStatus !in 200..299) {
+                            val errorBody = httpResponse.body<String>()
+                            throw mapHttpError(httpStatus!!, errorBody)
+                        }
+
+                        finalFinishReason = OpenAICompatibleHelpers.consumeChatCompletionsSSE(
+                            channel = httpResponse.body(),
+                            toolCallAccumulator = toolCallAccumulator,
+                            onContent = { delta ->
+                                contentBuilder.append(delta)
+                                onStreamChunk(StreamChunk(delta = delta))
+                            },
+                            onRawChunk = ::onStreamRawChunk,
+                        )
+                    }
+                }
+            }
+
+            if (contentBuilder.isEmpty()) {
+                toolCallAccumulator.toCanonicalJson()?.let { contentBuilder.append(it) }
+            }
+
+            @Suppress("UNCHECKED_CAST")
+            val inputTokensEstimate = (requestBody["messages"] as? List<Map<String, Any?>>)?.sumOf {
+                when (val content = it["content"]) {
+                    is String -> content.length
+                    is List<*> -> content.sumOf { part ->
+                        @Suppress("UNCHECKED_CAST")
+                        val partMap = part as? Map<String, Any?>
+                        (partMap?.get("text") as? String)?.length ?: 0
+                    }
+                    else -> 0
+                }
+            } ?: 0
+            val usage = LLMUsage(
+                inputTokens = inputTokensEstimate,
+                outputTokens = contentBuilder.length / 4,
+                totalTokens = inputTokensEstimate + contentBuilder.length / 4,
+            )
+            onStreamChunk(StreamChunk(delta = "", finishReason = finalFinishReason, usage = usage))
+
+            logger.apiResponse(
+                provider = provider,
+                model = model,
+                endpoint = endpoint,
+                requestJson = requestJson,
+                responseJson = gson.toJson(mapOf("content" to contentBuilder.toString())),
+                httpStatus = httpStatus ?: 200,
+                inputTokens = usage.inputTokens,
+                outputTokens = usage.outputTokens,
+                costUsd = estimateCost(usage),
+                latencyMs = (System.currentTimeMillis() - startTime).toInt(),
+                taskId = taskId,
+                subtaskId = subtaskId,
+                source = source,
+            )
+
+            return LLMResponse(
+                content = contentBuilder.toString(),
+                usage = usage,
+                model = model,
+                provider = provider,
+                cost = estimateCost(usage),
+                finishReason = finalFinishReason,
+                rawResponse = mapOf("content" to contentBuilder.toString()),
+            )
+        } catch (e: RefioError) {
+            logger.apiError(
+                provider = provider,
+                model = model,
+                endpoint = endpoint,
+                requestJson = requestJson,
+                httpStatus = httpStatus,
+                error = e,
+                latencyMs = (System.currentTimeMillis() - startTime).toInt(),
+                taskId = taskId,
+                subtaskId = subtaskId,
+                source = source,
+            )
+            throw e
+        } catch (e: CancellationException) {
+            logger.apiError(
+                provider = provider,
+                model = model,
+                endpoint = endpoint,
+                requestJson = requestJson,
+                httpStatus = httpStatus,
+                error = e,
+                latencyMs = (System.currentTimeMillis() - startTime).toInt(),
+                taskId = taskId,
+                subtaskId = subtaskId,
+                source = source,
+            )
+            throw e
+        } catch (e: Exception) {
+            logger.apiError(
+                provider = provider,
+                model = model,
+                endpoint = endpoint,
+                requestJson = requestJson,
+                httpStatus = httpStatus,
+                error = e,
+                latencyMs = (System.currentTimeMillis() - startTime).toInt(),
+                taskId = taskId,
+                subtaskId = subtaskId,
+                source = source,
+            )
+            throw RefioError.LLMError(providerName, model, e)
+        }
+    }
+
+    open suspend fun listModels(): List<ModelConfig> = withContext(Dispatchers.IO) {
+        val baseUrl = resolveBaseUrl()
+        val apiKey = resolveApiKey()
+        val endpoint = "$baseUrl$modelsEndpointPath"
+        val startTime = System.currentTimeMillis()
+
+        try {
+            logger.info { "[$providerTag] Request start: endpoint=$endpoint" }
+            val response = withProviderRateLimit(endpoint) {
+                client.get(endpoint) {
+                    apiKey?.let { header(HttpHeaders.Authorization, "Bearer $it") }
+                    extraRequestHeaders().forEach { (k, v) -> header(k, v) }
+                }
+            }
+            val rawBody = response.body<String>()
+            if (response.status.value !in 200..299) {
+                throw mapHttpError(response.status.value, rawBody)
+            }
+            parseModelsPayload(rawBody)
+        } catch (e: RefioError) {
+            logger.apiError(
+                provider = provider,
+                model = "models",
+                endpoint = endpoint,
+                requestJson = "",
+                httpStatus = null,
+                error = e,
+                latencyMs = (System.currentTimeMillis() - startTime).toInt(),
+                taskId = taskId,
+                subtaskId = subtaskId,
+                source = source,
+            )
+            throw e
+        } catch (e: Exception) {
+            logger.apiError(
+                provider = provider,
+                model = "models",
+                endpoint = endpoint,
+                requestJson = "",
+                httpStatus = null,
+                error = e,
+                latencyMs = (System.currentTimeMillis() - startTime).toInt(),
+                taskId = taskId,
+                subtaskId = subtaskId,
+                source = source,
+            )
+            throw RefioError.LLMError(providerName, model, e)
+        }
+    }
+
+    open fun parseModelsPayload(rawBody: String): List<ModelConfig> {
+        val parsed = gson.fromJson(rawBody, Any::class.java)
+        val modelsData: List<*> = when (parsed) {
+            is Map<*, *> -> parsed["data"] as? List<*> ?: emptyList<Any?>()
+            is List<*> -> parsed
+            else -> emptyList<Any?>()
+        }
+
+        return modelsData.mapNotNull { item ->
+            val modelData = item as? Map<*, *> ?: return@mapNotNull null
+            val modelId = modelData["id"] as? String ?: return@mapNotNull null
+            val contextLength = (modelData["context_length"] as? Number)?.toInt() ?: DEFAULT_CONTEXT_SIZE
+            val definition = ModelDefinitions.getDefinition(providerName, modelId)
+                ?: run {
+                    logger.warn {
+                        "[$providerName] Model $modelId not in registry — using synthetic definition (context=$contextLength)"
+                    }
+                    ModelDefinitions.syntheticDefinitionFor(providerName, modelId, contextLength)
+                }
+            definition.toModelConfig()
+        }
+    }
+
+    override fun estimateCost(usage: LLMUsage): Double = 0.0
+
+    override suspend fun close() {
+        client.close()
+    }
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/OpenAICompatibleHelpers.kt b/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/OpenAICompatibleHelpers.kt
new file mode 100644
index 00000000..a43a8695
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/OpenAICompatibleHelpers.kt
@@ -0,0 +1,126 @@
+package pl.jclab.refio.core.llm.adapters
+
+import io.ktor.utils.io.ByteReadChannel
+import kotlinx.coroutines.CancellationException
+import pl.jclab.refio.core.llm.BaseLLMAdapter
+import pl.jclab.refio.core.llm.LLMMessage
+import pl.jclab.refio.core.utils.GsonInstance.gson
+
+/**
+ * Shared helpers for OpenAI-compatible chat adapters
+ * (OpenAI, OpenRouter, LMStudio, GenericOpenAI/Z.AI).
+ *
+ * NOT used by Anthropic, Gemini, Ollama (each has provider-specific protocol).
+ */
+internal object OpenAICompatibleHelpers {
+
+    /**
+     * Map [systemMessages] + [messages] to OpenAI chat-completions message array.
+     *
+     * Behavior:
+     * - non-blank entries from [systemMessages] become `role=system`.
+     * - any inline `system` messages from [messages] are dropped (must be passed via [systemMessages]).
+     * - `tool` role is remapped to `assistant` because OpenAI-compatible APIs require
+     *   `tool_call_id` alongside `role=tool`, which our adapters don't currently emit.
+     */
+    fun buildMessages(
+        adapter: BaseLLMAdapter,
+        systemMessages: List<String>,
+        messages: List<LLMMessage>
+    ): List<Map<String, Any>> = buildList {
+        systemMessages.filter { it.isNotBlank() }.forEach { add(mapOf("role" to "system", "content" to it)) }
+        messages.filter { it.role != "system" }.forEach { msg ->
+            val mappedRole = if (msg.role == "tool") "assistant" else msg.role
+            add(mapOf("role" to mappedRole, "content" to adapter.toOpenAiMessageContent(msg)))
+        }
+    }
+
+    /**
+     * Add common OpenAI-compatible knobs from the caller's [kwargs] map to a request body.
+     * Only writes keys that are actually present, so callers that pass through fewer
+     * kwargs don't end up with unwanted defaults.
+     */
+    fun MutableMap<String, Any>.addCommonKwargs(kwargs: Map<String, Any>) {
+        (kwargs["top_p"] as? Number)?.let { put("top_p", it) }
+        (kwargs["frequency_penalty"] as? Number)?.let { put("frequency_penalty", it) }
+        (kwargs["presence_penalty"] as? Number)?.let { put("presence_penalty", it) }
+        kwargs["stop"]?.let { put("stop", it) }
+    }
+
+    /**
+     * Resolve the effective `max_tokens` value: caller-provided cap, configured limit,
+     * and per-model definition cap, all combined. Logs a warning when the requested
+     * value exceeds the model's hard limit and is being clamped down.
+     *
+     * @param requested caller's [maxTokens] (null/0 means "use config limit").
+     * @param configLimit configured maximum from `ConfigKeys.MAX_OUTPUT_SIZE`.
+     * @param modelLimit per-model cap from [pl.jclab.refio.core.llm.ModelDefinitions], or null/0 if unknown.
+     */
+    /**
+     * Consume an OpenAI chat.completions SSE stream from [channel]:
+     * - skips blank lines, lines not starting with `data: `, and the `[DONE]` terminator.
+     * - accumulates `tool_calls` deltas via [toolCallAccumulator].
+     * - invokes [onContent] with each non-empty `content` delta.
+     * - returns the last seen `finish_reason` (or `"cancelled"` if [checkCancelled] returns true mid-stream).
+     *
+     * Malformed chunks are silently skipped to match historical adapter behavior.
+     * CancellationException (guardrail trip) propagates out.
+     */
+    suspend fun consumeChatCompletionsSSE(
+        channel: ByteReadChannel,
+        toolCallAccumulator: ToolCallContentNormalizer.OpenAiStreamingToolCallAccumulator,
+        onContent: (String) -> Unit,
+        checkCancelled: () -> Boolean = { false },
+        onRawChunk: ((Map<String, Any?>) -> Unit)? = null,
+    ): String? {
+        var finishReason: String? = null
+        while (!channel.isClosedForRead) {
+            if (checkCancelled()) {
+                finishReason = "cancelled"
+                break
+            }
+            val line = channel.readUTF8Line(limit = Int.MAX_VALUE) ?: continue
+            if (line.isBlank() || !line.startsWith("data: ")) continue
+            val data = line.removePrefix("data: ").trim()
+            if (data == "[DONE]") break
+            try {
+                @Suppress("UNCHECKED_CAST")
+                val chunk = gson.fromJson(data, Map::class.java) as Map<String, Any?>
+                onRawChunk?.invoke(chunk)
+                @Suppress("UNCHECKED_CAST")
+                val choices = chunk["choices"] as? List<Map<String, Any?>> ?: emptyList()
+                val first = choices.firstOrNull() ?: emptyMap()
+                @Suppress("UNCHECKED_CAST")
+                val delta = first["delta"] as? Map<String, Any?>
+                toolCallAccumulator.consumeDelta(delta)
+                (delta?.get("content") as? String)?.takeIf { it.isNotEmpty() }?.let(onContent)
+                (first["finish_reason"] as? String)?.let { finishReason = it }
+            } catch (e: CancellationException) {
+                throw e
+            } catch (e: Exception) {
+                // Rethrow errors thrown by onRawChunk (e.g. mid-stream provider error
+                // envelopes from OpenRouter). Malformed chunks raise generic exceptions
+                // that match historical behavior — those we silently skip.
+                if (e is IllegalStateException) throw e
+            }
+        }
+        return finishReason
+    }
+
+    fun resolveEffectiveMaxTokens(
+        requested: Int?,
+        configLimit: Int,
+        modelLimit: Int?,
+        providerTag: String,
+        model: String,
+        log: (() -> String) -> Unit
+    ): Int {
+        val capped = if (requested != null && requested > 0) minOf(requested, configLimit) else configLimit
+        return if (modelLimit != null && modelLimit > 0 && capped > modelLimit) {
+            log { "[$providerTag] Requested max_tokens=$capped exceeds model limit ($modelLimit) for $model - clamping to safe value" }
+            modelLimit
+        } else {
+            capped
+        }
+    }
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/OpenRouterAdapter.kt b/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/OpenRouterAdapter.kt
index 6481b1f2..e7a2b365 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/OpenRouterAdapter.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/OpenRouterAdapter.kt
@@ -1,814 +1,180 @@
 package pl.jclab.refio.core.llm.adapters
 
-import kotlinx.coroutines.CancellationException
-import kotlinx.coroutines.Dispatchers
-import kotlinx.coroutines.withContext
-import pl.jclab.refio.core.llm.BaseLLMAdapter
-import pl.jclab.refio.core.llm.LLMMessage
-import pl.jclab.refio.core.llm.LLMResponse
+import io.ktor.client.HttpClient
+import pl.jclab.refio.core.db.ConfigScope
+import pl.jclab.refio.core.errors.RefioError
 import pl.jclab.refio.core.llm.LLMUsage
 import pl.jclab.refio.core.llm.ModelConfig
+import pl.jclab.refio.core.llm.SupportedModels
+import pl.jclab.refio.core.llm.calculateCost
+import pl.jclab.refio.core.config.ConfigKeys
+import pl.jclab.refio.core.services.ConfigService
 import pl.jclab.refio.core.services.ConfigService.Companion.DEFAULT_CONTEXT_SIZE
 import pl.jclab.refio.core.utils.GsonInstance.gson
-import io.ktor.client.*
-import io.ktor.client.call.*
-import io.ktor.client.engine.cio.*
-import io.ktor.client.plugins.*
-import io.ktor.client.plugins.contentnegotiation.*
-import io.ktor.client.plugins.logging.*
-import io.ktor.client.plugins.logging.Logger as KtorLogger
-import io.ktor.client.request.*
-import io.ktor.http.*
-import io.ktor.serialization.gson.*
-import pl.jclab.refio.core.config.ConfigKeys
-import pl.jclab.refio.core.errors.LLMErrorMapper
-import pl.jclab.refio.core.errors.RefioError
-import pl.jclab.refio.core.security.SecureLogger
-import pl.jclab.refio.core.logging.dualLogger
-import java.util.UUID
 
 /**
- * Adapter for OpenRouter - unified API for multiple LLM providers.
+ * OpenRouter — unified OpenAI-compatible gateway to many providers.
  *
- * OpenRouter provides access to models from OpenAI, Anthropic, Google, Meta, and others
- * through a single OpenAI-compatible API.
+ * Differences from a stock OpenAI-compatible adapter:
+ * - Adds `HTTP-Referer` and `X-Title` headers to every request.
+ * - Supports `provider`, `route`, and `thinking` (Claude-only) kwargs.
+ * - Detects mid-stream `{"error":{...}}` envelopes and status-200 errors.
+ * - Rich `/models` payload with per-model pricing + architecture (vision detection).
  *
- * API Documentation: https://openrouter.ai/docs
+ * API docs: https://openrouter.ai/docs
  */
 class OpenRouterAdapter(
     model: String = "anthropic/claude-3.5-sonnet",
-    private val configService: pl.jclab.refio.core.services.ConfigService? = null,
-    private val taskId: String? = null,
-    private val subtaskId: String? = null,
-    private val source: String? = null,
+    configService: ConfigService? = null,
+    taskId: String? = null,
+    subtaskId: String? = null,
+    source: String? = null,
     private val appName: String = "Refio",
-    private val siteUrl: String = "https://github.com/shadoq/refio"
-) : BaseLLMAdapter(model, "openrouter") {
-
-    private val logger = dualLogger("OpenRouterAdapter")
+    private val siteUrl: String = "https://github.com/shadoq/refio",
+    httpClientOverride: HttpClient? = null,
+) : OpenAICompatibleAdapter(
+    model = model,
+    providerName = "openrouter",
+    configService = configService,
+    taskId = taskId,
+    subtaskId = subtaskId,
+    source = source,
+    requireApiKey = true,
+    httpClientOverride = httpClientOverride,
+) {
 
     companion object {
         const val DEFAULT_BASE_URL = "https://openrouter.ai/api/v1"
-        const val CHAT_ENDPOINT = "/chat/completions"
-        const val MODELS_ENDPOINT = "/models"
     }
 
-    private val timeoutMs: Long
-        get() = configService?.getTyped(ConfigKeys.API_CALL_TIMEOUT, taskId)?.toLong()?.times(1000L)
-            ?: ConfigKeys.API_CALL_TIMEOUT.default.toLong() * 1000L
+    override fun resolveBaseUrl(): String = DEFAULT_BASE_URL
 
-    private val client = HttpClient(CIO) {
-        install(ContentNegotiation) {
-            gson {
-                setPrettyPrinting()
-                serializeNulls()
-            }
-        }
-        install(Logging) {
-            level = LogLevel.INFO
-            logger = object : KtorLogger {
-                override fun log(message: String) {
-                    this@OpenRouterAdapter.logger.debug { message }
-                }
-            }
-            sanitizeHeader { header ->
-                header.equals(HttpHeaders.Authorization, ignoreCase = true) ||
-                    header.equals("x-api-key", ignoreCase = true) ||
-                    header.equals("x-goog-api-key", ignoreCase = true)
-            }
-        }
-        install(HttpTimeout) {
-            requestTimeoutMillis = timeoutMs
-            connectTimeoutMillis = 30000
-            socketTimeoutMillis = timeoutMs
-        }
-    }
+    override fun resolveApiKey(): String? = configService?.get(
+        key = ConfigKeys.PROVIDER_OPENROUTER_API_KEY.key,
+        scope = ConfigScope.APP,
+    )
+        ?: System.getProperty("OPENROUTER_API_KEY")
+        ?: System.getenv("OPENROUTER_API_KEY")
+
+    override fun extraRequestHeaders(): Map<String, String> = mapOf(
+        "HTTP-Referer" to siteUrl,
+        "X-Title" to appName,
+    )
 
-    override suspend fun chat(
-        messages: List<LLMMessage>,
-        systemMessages: List<String>,
-        maxTokens: Int?,
+    override fun buildRequestBody(
+        requestMessages: List<Map<String, Any>>,
+        effectiveMaxTokens: Int,
         temperature: Double,
         streaming: Boolean,
-        onStreamChunk: ((pl.jclab.refio.core.llm.StreamChunk) -> Unit)?,
-        kwargs: Map<String, Any>
-    ): LLMResponse {
-        logger.info { "[OPENROUTER] Sending ${if (streaming) "streaming" else "standard"} chat request: model=$model, messages=${messages.size}, systemMessages=${systemMessages.size}" }
-
-        try {
-            if (streaming && onStreamChunk != null) {
-                return chatStreamingInternal(messages, systemMessages, maxTokens, temperature, kwargs, onStreamChunk)
-            }
-
-            return chatStandard(messages, systemMessages, maxTokens, temperature, kwargs)
-        } catch (e: CancellationException) {
-            // Stream aborted by a guardrail (see core/llm/streaming/) — must propagate
-            // so the caller can see StreamAbortedException instead of RefioError.LLMError.
-            throw e
-        } catch (e: Exception) {
-            throw LLMErrorMapper.fromThrowable(provider, model, timeoutMs, e)
+        kwargs: Map<String, Any>,
+        requestId: String,
+    ): Map<String, Any> = super.buildRequestBody(
+        requestMessages, effectiveMaxTokens, temperature, streaming, kwargs, requestId,
+    ).toMutableMap().apply {
+        val thinking = kwargs["thinking"] as? Boolean ?: false
+        if (thinking && model.contains("claude", ignoreCase = true)) {
+            put("thinking", mapOf("type" to "enabled", "budget_tokens" to 10000))
+            logger.info { "[${providerTag}] Enabled thinking mode for $model" }
         }
+        (kwargs["provider"] as? Map<*, *>)?.let { put("provider", it) }
+        (kwargs["route"] as? String)?.let { put("route", it) }
     }
 
-    private suspend fun chatStandard(
-        messages: List<LLMMessage>,
-        systemMessages: List<String>,
-        maxTokens: Int?,
-        temperature: Double,
-        kwargs: Map<String, Any>
-    ): LLMResponse {
-        logger.info { "[OPENROUTER] Executing standard chat" }
-
-        // Get API key from ConfigService (single source of truth)
-        val apiKeyToUse = configService?.get(
-            key = pl.jclab.refio.core.services.ConfigService.KEY_PROVIDER_OPENROUTER_API_KEY,
-            scope = pl.jclab.refio.core.db.ConfigScope.APP
-        )
-            ?: System.getProperty("OPENROUTER_API_KEY")
-            ?: System.getenv("OPENROUTER_API_KEY")
-            ?: throw LLMErrorMapper.missingConfig(provider, "api_key")
-
-        // Prepare messages
-        val openrouterMessages = mutableListOf<Map<String, Any>>()
-
-        // Add system messages from systemMessages parameter
-        systemMessages.filter { it.isNotBlank() }.forEach { sysMsg ->
-            openrouterMessages.add(mapOf("role" to "system", "content" to sysMsg))
-        }
-
-        // Add conversation messages (filter out any system messages as they should be in systemMessages parameter).
-        // Remap "tool" (used by LLMMessageMapper for tool results) to "assistant" — OpenAI-compatible APIs
-        // expect tool_call_id alongside role="tool", which this adapter does not currently emit.
-        for (msg in messages.filter { it.role != "system" }) {
-            val mappedRole = if (msg.role == "tool") "assistant" else msg.role
-            openrouterMessages.add(mapOf("role" to mappedRole, "content" to toOpenAiMessageContent(msg)))
-        }
-
-        // Build request body
-        val requestBody = buildMap {
-            put("model", model)
-            put("messages", openrouterMessages)
-            put("temperature", temperature)
-
-            if (maxTokens != null && maxTokens > 0) {
-                put("max_tokens", maxTokens)
-            }
-
-            // Handle response_format for JSON mode
-            val responseFormat = kwargs["response_format"] as? Map<*, *>
-            if (responseFormat != null && responseFormat["type"] == "json_object") {
-                put("response_format", mapOf("type" to "json_object"))
-            }
-
-            // Enable thinking mode for Anthropic models via OpenRouter
-            val thinking = kwargs["thinking"] as? Boolean ?: false
-            if (thinking && model.contains("claude", ignoreCase = true)) {
-                put("thinking", mapOf(
-                    "type" to "enabled",
-                    "budget_tokens" to 10000
-                ))
-                logger.info { "[OPENROUTER] Enabled thinking mode for $model" }
-            }
-
-            // Additional parameters
-            (kwargs["top_p"] as? Number)?.let { put("top_p", it) }
-            (kwargs["frequency_penalty"] as? Number)?.let { put("frequency_penalty", it) }
-            (kwargs["presence_penalty"] as? Number)?.let { put("presence_penalty", it) }
-            kwargs["stop"]?.let { put("stop", it) }
-
-            // OpenRouter-specific parameters
-            (kwargs["provider"] as? Map<*, *>)?.let { put("provider", it) }
-            (kwargs["route"] as? String)?.let { put("route", it) }
-        }
-
-        val requestJson = gson.toJson(requestBody)
-        val requestId = UUID.randomUUID().toString()
-        val logPrefix = "[OPENROUTER][$requestId]"
-        logger.debug { "$logPrefix Request: ${SecureLogger.redactAndTruncate(requestJson)}" }
-
-        val startTime = System.currentTimeMillis()
-        var httpStatus: Int? = null
-
-        try {
-            // Make HTTP request
-            logger.info { "$logPrefix Request start: endpoint=$DEFAULT_BASE_URL$CHAT_ENDPOINT, body=${SecureLogger.redactAndTruncate(requestJson)}" }
-            val httpResponse = client.post("$DEFAULT_BASE_URL$CHAT_ENDPOINT") {
-                contentType(ContentType.Application.Json)
-                header("Authorization", "Bearer $apiKeyToUse")
-                // OpenRouter-specific headers (optional but recommended)
-                header("HTTP-Referer", siteUrl)
-                header("X-Title", appName)
-                setBody(requestBody)
-            }
-
-            httpStatus = httpResponse.status.value
-            val response: Map<String, Any?> = httpResponse.body()
-
-            val responseJson = gson.toJson(response)
-            logger.debug { "$logPrefix Response: ${SecureLogger.redactAndTruncate(responseJson)}" }
-            logger.info {
-                "$logPrefix Response received: status=$httpStatus, durationMs=${System.currentTimeMillis() - startTime}, " +
-                    "bodySize=${responseJson.length}"
-            }
-
-
-            val latencyMs = (System.currentTimeMillis() - startTime).toInt()
-
-            @Suppress("UNCHECKED_CAST")
-            val usageMap = response["usage"] as? Map<String, Any?> ?: emptyMap()
-            val promptTokens = (usageMap["prompt_tokens"] as? Number)?.toInt() ?: 0
-            val completionTokens = (usageMap["completion_tokens"] as? Number)?.toInt() ?: 0
-            val totalTokens = (usageMap["total_tokens"] as? Number)?.toInt() ?: 0
-
-            val usage = LLMUsage(
-                inputTokens = promptTokens,
-                outputTokens = completionTokens,
-                totalTokens = totalTokens
-            )
-
-            val cost = estimateCost(usage)
-            val responseModel = response["model"] as? String ?: model
-
-            logger.apiResponse(
-                provider = provider,
-                model = model,
-                endpoint = "$DEFAULT_BASE_URL$CHAT_ENDPOINT",
-                requestJson = requestJson,
-                responseJson = responseJson,
-                httpStatus = httpStatus,
-                inputTokens = usage.inputTokens,
-                outputTokens = usage.outputTokens,
-                costUsd = cost,
-                latencyMs = latencyMs,
-                taskId = taskId,
-                subtaskId = subtaskId,
-                source = source
-            )
-            // Check for error in response
-            @Suppress("UNCHECKED_CAST")
-            val errorObj = response["error"] as? Map<String, Any?>
-            if (errorObj != null) {
-                val errorMessage = errorObj["message"] as? String ?: "Unknown error"
-                val errorCode = (errorObj["code"] as? Number)?.toInt() ?: 500
-                @Suppress("UNCHECKED_CAST")
-                val metadata = errorObj["metadata"] as? Map<String, Any?>
-                val providerName = metadata?.get("provider_name") as? String ?: "OpenRouter"
-                logger.error { "$logPrefix API error from $providerName (code $errorCode): $errorMessage" }
-                throw LLMErrorMapper.fromHttpStatus(provider, model, errorCode, "OpenRouter error from $providerName: $errorMessage")
-            }
-
-            // Parse response
-            @Suppress("UNCHECKED_CAST")
-            val choices = response["choices"] as? List<Map<String, Any?>> ?: emptyList()
-            if (choices.isEmpty()) {
-                throw RefioError.LLMError(provider, model, IllegalStateException("OpenRouter returned empty choices"))
-            }
-
-            val choice = choices[0]
-            @Suppress("UNCHECKED_CAST")
-            val message = choice["message"] as? Map<String, Any?> ?: emptyMap()
-            val content = message["content"] as? String ?: ""
-            val normalizedToolCallsJson = if (content.isBlank()) {
-                ToolCallContentNormalizer.fromOpenAiToolCalls(message["tool_calls"])
-            } else {
-                null
-            }
-            val finalContent = normalizedToolCallsJson ?: content
-            if (normalizedToolCallsJson != null) {
-                logger.info { "$logPrefix [TOOL_CALLS_NORMALIZED] Converted OpenRouter tool_calls to canonical JSON content" }
-            }
-            val finishReason = choice["finish_reason"] as? String
-
-            logger.info { "$logPrefix Response processed: model=$responseModel, " +
-                    "tokens_in=${usage.inputTokens}, tokens_out=${usage.outputTokens}, " +
-                    "cost=$${"%.4f".format(cost)}, finish_reason=$finishReason" }
-
-
-            return LLMResponse(
-                content = finalContent,
-                usage = usage,
-                model = responseModel,
-                provider = provider,
-                cost = cost,
-                finishReason = finishReason,
-                rawResponse = response
-            )
-
-        } catch (e: Exception) {
-            // Error #15: Log error (console + database)
-            val latencyMs = (System.currentTimeMillis() - startTime).toInt()
-            logger.apiError(
-                provider = provider,
-                model = model,
-                endpoint = "$DEFAULT_BASE_URL$CHAT_ENDPOINT",
-                requestJson = requestJson,
-                httpStatus = httpStatus,
-                error = e,
-                latencyMs = latencyMs,
-                taskId = taskId,
-                subtaskId = subtaskId,
-                source = source
-            )
-
-            throw LLMErrorMapper.fromThrowable(provider, model, timeoutMs, e)
-        }
+    /**
+     * OpenRouter returns HTTP 200 with `{"error": {...}}` for upstream provider errors.
+     * We detect and rethrow so the envelope doesn't leak through to `choices` parsing.
+     */
+    override fun ensureSuccess(httpStatus: Int, rawResponse: Map<String, Any?>, endpoint: String) {
+        super.ensureSuccess(httpStatus, rawResponse, endpoint)
+        @Suppress("UNCHECKED_CAST")
+        val error = rawResponse["error"] as? Map<String, Any?> ?: return
+        val message = error["message"] as? String ?: "Unknown error"
+        val code = (error["code"] as? Number)?.toInt() ?: 500
+        @Suppress("UNCHECKED_CAST")
+        val metadata = error["metadata"] as? Map<String, Any?>
+        val providerFromMeta = metadata?.get("provider_name") as? String ?: "OpenRouter"
+        throw mapHttpError(code, "$providerFromMeta: $message")
     }
 
     /**
-     * Stream chat completion from OpenRouter API (US-027)
-     *
-     * OpenRouter uses OpenAI-compatible SSE format:
-     * - Lines start with "data: "
-     * - JSON contains: {"choices":[{"delta":{"content":"..."}}]}
-     * - Last message is "data: [DONE]"
-     * - Usage info NOT included in stream (estimated at end)
+     * Detect OpenRouter's mid-stream `{"error":{...}}` envelope — throwing from here
+     * aborts the SSE loop (see `consumeChatCompletionsSSE`).
      */
-    private suspend fun chatStreamingInternal(
-        messages: List<LLMMessage>,
-        systemMessages: List<String>,
-        maxTokens: Int?,
-        temperature: Double,
-        kwargs: Map<String, Any>,
-        onStreamChunk: (pl.jclab.refio.core.llm.StreamChunk) -> Unit
-    ): LLMResponse {
-        logger.info { "[OPENROUTER] Executing streaming chat" }
-
-        val contentBuilder = StringBuilder()
-        val toolCallAccumulator = ToolCallContentNormalizer.OpenAiStreamingToolCallAccumulator()
-        var finalUsage: LLMUsage? = null
-        var finalFinishReason: String? = null
-
-        // Get API key from ConfigService
-        val apiKeyToUse = configService?.get(
-            key = pl.jclab.refio.core.services.ConfigService.KEY_PROVIDER_OPENROUTER_API_KEY,
-            scope = pl.jclab.refio.core.db.ConfigScope.APP
-        )
-            ?: System.getProperty("OPENROUTER_API_KEY")
-            ?: System.getenv("OPENROUTER_API_KEY")
-            ?: throw LLMErrorMapper.missingConfig(provider, "api_key")
-
-        // Prepare messages
-        val openrouterMessages = mutableListOf<Map<String, Any>>()
-
-        // Add system messages from systemMessages parameter
-        systemMessages.filter { it.isNotBlank() }.forEach { sysMsg ->
-            openrouterMessages.add(mapOf("role" to "system", "content" to sysMsg))
-        }
-
-        // Add conversation messages (filter out any system messages as they should be in systemMessages parameter).
-        // Remap "tool" (used by LLMMessageMapper for tool results) to "assistant" — OpenAI-compatible APIs
-        // expect tool_call_id alongside role="tool", which this adapter does not currently emit.
-        for (msg in messages.filter { it.role != "system" }) {
-            val mappedRole = if (msg.role == "tool") "assistant" else msg.role
-            openrouterMessages.add(mapOf("role" to mappedRole, "content" to toOpenAiMessageContent(msg)))
-        }
-
-        // Build request body with stream: true
-        val requestBody = buildMap {
-            put("model", model)
-            put("messages", openrouterMessages)
-            put("stream", true)  // Enable streaming
-            put("temperature", temperature)
-
-            if (maxTokens != null && maxTokens > 0) {
-                put("max_tokens", maxTokens)
-            }
-
-            // Handle response_format for JSON mode
-            val responseFormat = kwargs["response_format"] as? Map<*, *>
-            if (responseFormat != null && responseFormat["type"] == "json_object") {
-                put("response_format", mapOf("type" to "json_object"))
-            }
-
-            // Enable thinking mode for Anthropic models via OpenRouter
-            val thinking = kwargs["thinking"] as? Boolean ?: false
-            if (thinking && model.contains("claude", ignoreCase = true)) {
-                put("thinking", mapOf(
-                    "type" to "enabled",
-                    "budget_tokens" to 10000
-                ))
-                logger.info { "[OPENROUTER] Enabled thinking mode for $model" }
-            }
-
-            // Additional parameters
-            (kwargs["top_p"] as? Number)?.let { put("top_p", it) }
-            (kwargs["frequency_penalty"] as? Number)?.let { put("frequency_penalty", it) }
-            (kwargs["presence_penalty"] as? Number)?.let { put("presence_penalty", it) }
-            kwargs["stop"]?.let { put("stop", it) }
+    override fun onStreamRawChunk(chunk: Map<String, Any?>) {
+        @Suppress("UNCHECKED_CAST")
+        val error = chunk["error"] as? Map<String, Any?> ?: return
+        val message = error["message"] as? String ?: "Unknown error"
+        val code = (error["code"] as? Number)?.toInt() ?: 500
+        @Suppress("UNCHECKED_CAST")
+        val metadata = error["metadata"] as? Map<String, Any?>
+        val providerFromMeta = metadata?.get("provider_name") as? String ?: "OpenRouter"
+        throw IllegalStateException("$providerFromMeta error (HTTP $code): $message")
+    }
 
-            // OpenRouter-specific parameters
-            (kwargs["provider"] as? Map<*, *>)?.let { put("provider", it) }
-            (kwargs["route"] as? String)?.let { put("route", it) }
+    /**
+     * Per-model pricing and vision capability parsing specific to OpenRouter's
+     * `/models` payload structure.
+     */
+    override fun parseModelsPayload(rawBody: String): List<ModelConfig> {
+        val parsed = gson.fromJson(rawBody, Map::class.java)
+        @Suppress("UNCHECKED_CAST")
+        val modelsData = parsed?.get("data") as? List<Map<String, Any?>> ?: emptyList()
+
+        if (modelsData.isEmpty()) {
+            logger.warn { "[${providerTag}] API returned empty model list" }
+            return emptyList()
         }
 
-        val requestJson = gson.toJson(requestBody)
-        val requestId = UUID.randomUUID().toString()
-        val logPrefix = "[OPENROUTER][$requestId]"
-        logger.debug { "$logPrefix Streaming request: ${SecureLogger.redactAndTruncate(requestJson)}" }
-
-        val startTime = System.currentTimeMillis()
-        var totalTokensEstimate = 0
-        var httpStatus: Int? = null
-        var lineCount = 0
-        var dataLineCount = 0
-
-        try {
-            // Make streaming HTTP request
-            logger.info { "$logPrefix Request start: endpoint=$DEFAULT_BASE_URL$CHAT_ENDPOINT, body=${SecureLogger.redactAndTruncate(requestJson)}" }
-            client.preparePost("$DEFAULT_BASE_URL$CHAT_ENDPOINT") {
-                contentType(ContentType.Application.Json)
-                header("Authorization", "Bearer $apiKeyToUse")
-                header("HTTP-Referer", siteUrl)
-                header("X-Title", appName)
-                setBody(requestBody)
-            }.execute { httpResponse ->
-                httpStatus = httpResponse.status.value
-                val channel: io.ktor.utils.io.ByteReadChannel = httpResponse.body()
-
-                // Read SSE stream line by line
-                while (!channel.isClosedForRead) {
-                    val line = channel.readUTF8Line(limit = Int.MAX_VALUE) ?: continue
-                    lineCount++
-                    if (line.isBlank()) continue
-
-                    // Debug: log first few lines to see what format we're getting
-                    if (lineCount <= 5) {
-                        logger.info { "$logPrefix SSE line $lineCount: ${line.take(200)}" }
-                    }
-
-                    // SSE format: "data: {...}" or "data: [DONE]"
-                    if (!line.startsWith("data: ")) {
-                        // Some models might send content without "data: " prefix
-                        if (line.startsWith("{") && (line.contains("choices") || line.contains("error"))) {
-                            logger.debug { "[OPENROUTER] Found JSON without data: prefix, processing..." }
-                            // Process as if it was a data line
-                            val data = line.trim()
-                            dataLineCount++
-                            try {
-                                @Suppress("UNCHECKED_CAST")
-                                val chunk = gson.fromJson(data, Map::class.java) as Map<String, Any?>
-
-                                // Check for error first
-                                @Suppress("UNCHECKED_CAST")
-                                val errorObj = chunk["error"] as? Map<String, Any?>
-                                if (errorObj != null) {
-                                    val errorMessage = errorObj["message"] as? String ?: "Unknown error"
-                                    @Suppress("UNCHECKED_CAST")
-                                    val metadata = errorObj["metadata"] as? Map<String, Any?>
-                                    val providerName = metadata?.get("provider_name") as? String ?: "provider"
-                                    logger.error { "$logPrefix Error from $providerName: $errorMessage" }
-                                    contentBuilder.append("\n\n**Error from $providerName:** $errorMessage")
-                                    onStreamChunk(pl.jclab.refio.core.llm.StreamChunk(
-                                        delta = "\n\n**Error from $providerName:** $errorMessage",
-                                        finishReason = "error"
-                                    ))
-                                    finalFinishReason = "error"
-                                    break
-                                }
-
-                                @Suppress("UNCHECKED_CAST")
-                                val choices = chunk["choices"] as? List<Map<String, Any?>>
-                                if (choices != null && choices.isNotEmpty()) {
-                                    val choice = choices[0]
-                                    @Suppress("UNCHECKED_CAST")
-                                    val message = choice["message"] as? Map<String, Any?>
-                                    toolCallAccumulator.consumeToolCalls(message?.get("tool_calls"))
-                                    val content = message?.get("content") as? String
-                                    if (content != null && content.isNotEmpty()) {
-                                        logger.info { "$logPrefix Found non-streaming response with content length: ${content.length}" }
-                                        contentBuilder.append(content)
-                                        totalTokensEstimate = content.split(" ").size
-                                        onStreamChunk(pl.jclab.refio.core.llm.StreamChunk(
-                                            delta = content,
-                                            finishReason = choice["finish_reason"] as? String
-                                        ))
-                                    }
-                                    finalFinishReason = choice["finish_reason"] as? String
-                                }
-                            } catch (e: CancellationException) {
-                                // Let stream abort (guardrail trip) propagate out of the loop.
-                                throw e
-                            } catch (e: Exception) {
-                                logger.warn { "[OPENROUTER] Failed to parse non-prefixed JSON: ${e.message}" }
-                            }
-                        }
-                        continue
-                    }
-                    dataLineCount++
-
-                    val data = line.removePrefix("data: ").trim()
-
-                    // Check for stream end
-                    if (data == "[DONE]") {
-                        logger.info { "$logPrefix Stream complete" }
-                        break
-                    }
-
-                    // Parse JSON chunk
-                    try {
-                        @Suppress("UNCHECKED_CAST")
-                        val chunk = gson.fromJson(data, Map::class.java) as Map<String, Any?>
-
-                        // Debug: log raw chunk structure for troubleshooting
-                        logger.debug { "[OPENROUTER] Raw chunk: $data" }
-
-                        // Check for error in stream response (OpenRouter sends errors as JSON in SSE)
-                        @Suppress("UNCHECKED_CAST")
-                        val errorObj = chunk["error"] as? Map<String, Any?>
-                        if (errorObj != null) {
-                            val errorMessage = errorObj["message"] as? String ?: "Unknown error"
-                            val errorCode = (errorObj["code"] as? Number)?.toInt() ?: 500
-                            @Suppress("UNCHECKED_CAST")
-                            val metadata = errorObj["metadata"] as? Map<String, Any?>
-                            val providerName = metadata?.get("provider_name") as? String ?: "OpenRouter"
-
-                            logger.error { "$logPrefix Error from $providerName (code $errorCode): $errorMessage" }
-
-                            // Throw exception instead of returning error in content
-                            // This prevents PlanningService from trying to parse error message as JSON
-                            throw IllegalStateException("$providerName error (HTTP $errorCode): $errorMessage")
-                        }
-
-                        @Suppress("UNCHECKED_CAST")
-                        val choices = chunk["choices"] as? List<Map<String, Any?>> ?: continue
-                        if (choices.isEmpty()) continue
-
-                        val choice = choices[0]
-                        @Suppress("UNCHECKED_CAST")
-                        val delta = choice["delta"] as? Map<String, Any?>
-                        toolCallAccumulator.consumeDelta(delta)
-
-                        // Try multiple content locations (different models use different formats)
-                        val content = delta?.get("content") as? String
-                            ?: (choice["text"] as? String)  // Some models use "text" directly
-                            ?: (delta?.get("text") as? String)  // Or delta.text
-
-                        val finishReason = choice["finish_reason"] as? String
-
-                        // Debug: log parsed values
-                        if (!content.isNullOrEmpty() || finishReason != null) {
-                            logger.debug { "[OPENROUTER] Parsed: content=${content?.take(50)}, finishReason=$finishReason" }
-                        }
-
-                        // Emit content chunk
-                        if (content != null && content.isNotEmpty()) {
-                            contentBuilder.append(content)
-                            totalTokensEstimate += content.split(" ").size // Rough estimate
-
-                            onStreamChunk(pl.jclab.refio.core.llm.StreamChunk(
-                                delta = content,
-                                finishReason = null
-                            ))
-                        }
-
-                        // Emit final chunk with finish_reason
-                        if (finishReason != null) {
-                            finalFinishReason = finishReason
-                            // Estimate token usage (OpenRouter doesn't send usage in stream)
-                            val systemTokens = systemMessages.sumOf { it.split(" ").size }
-                            val inputTokensEstimate = messages.sumOf { it.content.split(" ").size } + systemTokens
-                            val usage = LLMUsage(
-                                inputTokens = inputTokensEstimate,
-                                outputTokens = totalTokensEstimate,
-                                totalTokens = inputTokensEstimate + totalTokensEstimate
-                            )
-
-                            finalUsage = usage
-                            finalFinishReason = finishReason
-
-                            onStreamChunk(pl.jclab.refio.core.llm.StreamChunk(
-                                delta = "",
-                                finishReason = finishReason,
-                                usage = usage
-                            ))
-                        }
-                    } catch (e: CancellationException) {
-                        // Let stream abort (guardrail trip) propagate out of the loop.
-                        throw e
-                    } catch (e: Exception) {
-                        logger.warn { "[OPENROUTER] Failed to parse chunk: $data - ${e.message}" }
-                        continue
-                    }
-                }
-            }
-
-            if (contentBuilder.isEmpty()) {
-                val normalizedToolCallsJson = toolCallAccumulator.toCanonicalJson()
-                if (normalizedToolCallsJson != null) {
-                    contentBuilder.append(normalizedToolCallsJson)
-                    logger.info { "$logPrefix [TOOL_CALLS_NORMALIZED] Converted streamed OpenRouter tool_calls to canonical JSON content" }
-                }
-            }
-
-            val latencyMs = (System.currentTimeMillis() - startTime).toInt()
-
-            // Debug: log final content length
-            logger.info { "$logPrefix Stream finished: totalLines=$lineCount, dataLines=$dataLineCount, " +
-                    "contentLength=${contentBuilder.length}, estimatedOutputTokens=$totalTokensEstimate, " +
-                    "finishReason=$finalFinishReason" }
+        return modelsData.mapNotNull { modelData ->
+            val modelId = modelData["id"] as? String ?: return@mapNotNull null
+            if (!SupportedModels.isSupported("openrouter", modelId)) return@mapNotNull null
 
-            // If no content was received but stream completed, log warning
-            if (contentBuilder.isEmpty()) {
-                logger.warn { "$logPrefix Stream completed but no content received! Model: $model" }
-            }
-
-            // Estimate final usage for logging
-            val systemTokens = systemMessages.sumOf { it.split(" ").size }
-            val inputTokensEstimate = messages.sumOf { it.content.split(" ").size } + systemTokens
-            val cost = estimateCost(LLMUsage(
-                inputTokens = inputTokensEstimate,
-                outputTokens = totalTokensEstimate,
-                totalTokens = inputTokensEstimate + totalTokensEstimate
-            ))
-
-            // Create synthetic response JSON for logging
-            val syntheticResponse = mapOf(
-                "choices" to listOf(
-                    mapOf(
-                        "message" to mapOf("role" to "assistant", "content" to contentBuilder.toString()),
-                        "finish_reason" to finalFinishReason
-                    )
-                ),
-                "usage" to mapOf(
-                    "prompt_tokens" to inputTokensEstimate,
-                    "completion_tokens" to totalTokensEstimate,
-                    "total_tokens" to (inputTokensEstimate + totalTokensEstimate)
-                ),
-                "model" to model
-            )
-            val responseJson = gson.toJson(syntheticResponse)
-            logger.info {
-                "$logPrefix Response received: status=${httpStatus ?: 200}, durationMs=${System.currentTimeMillis() - startTime}, " +
-                    "bodySize=${responseJson.length}"
-            }
+            val modelName = modelData["name"] as? String ?: modelId
+            val contextLength = (modelData["context_length"] as? Number)?.toInt() ?: DEFAULT_CONTEXT_SIZE
 
-            // Log successful stream completion to API logs
-            logger.apiResponse(
-                provider = provider,
-                model = model,
-                endpoint = "$DEFAULT_BASE_URL$CHAT_ENDPOINT",
-                requestJson = requestJson,
-                responseJson = responseJson,
-                httpStatus = httpStatus ?: 200,
-                inputTokens = inputTokensEstimate,
-                outputTokens = totalTokensEstimate,
-                costUsd = cost,
-                latencyMs = latencyMs,
-                taskId = taskId,
-                subtaskId = subtaskId,
-                source = source
-            )
-
-            logger.info { "$logPrefix Streaming completed in ${latencyMs}ms, logged to API logs" }
-
-            // Return complete LLMResponse
-            return LLMResponse(
-                content = contentBuilder.toString(),
-                usage = finalUsage ?: LLMUsage(0, 0, 0),
-                model = model,
-                provider = provider,
-                cost = finalUsage?.let { estimateCost(it) } ?: 0.0,
-                finishReason = finalFinishReason,
-                rawResponse = null
-            )
+            @Suppress("UNCHECKED_CAST")
+            val pricingData = modelData["pricing"] as? Map<String, Any?>
+            val promptPricePerToken = (pricingData?.get("prompt") as? String)?.toDoubleOrNull() ?: 0.0
+            val completionPricePerToken = (pricingData?.get("completion") as? String)?.toDoubleOrNull() ?: 0.0
+            val costPer1mInput = promptPricePerToken * 1_000_000
+            val costPer1mOutput = completionPricePerToken * 1_000_000
 
-        } catch (e: Exception) {
-            val latencyMs = (System.currentTimeMillis() - startTime).toInt()
-            logger.apiError(
-                provider = provider,
-                model = model,
-                endpoint = "$DEFAULT_BASE_URL$CHAT_ENDPOINT",
-                requestJson = requestJson,
-                httpStatus = null,
-                error = e,
-                latencyMs = latencyMs,
-                taskId = taskId,
-                subtaskId = subtaskId,
-                source = source
+            @Suppress("UNCHECKED_CAST")
+            val architecture = modelData["architecture"] as? Map<String, Any?>
+            val modality = architecture?.get("modality") as? String
+            val capabilities = mutableListOf("chat", "streaming").apply {
+                if (modality == "text+image" || modality?.contains("image") == true) add("vision")
+            }
+
+            ModelConfig(
+                id = modelId,
+                name = "$modelName (via OpenRouter)",
+                provider = "openrouter",
+                capabilities = capabilities,
+                maxContext = contextLength,
+                costPer1mInput = costPer1mInput,
+                costPer1mOutput = costPer1mOutput,
             )
-            throw e
         }
     }
 
     /**
-     * Lists all available models from OpenRouter API.
-     *
-     * @return List of ModelConfig objects with model metadata and pricing
-     * @throws IllegalStateException if API key is not provided or API returns empty response
+     * Missing API key yields an empty list (caller-facing behavior preserved from the
+     * pre-migration adapter so the settings UI can still list supported models lazily).
      */
-    suspend fun listModels(): List<ModelConfig> = kotlinx.coroutines.withContext(kotlinx.coroutines.Dispatchers.IO) {
-        logger.info { "[OPENROUTER] Fetching available models from $DEFAULT_BASE_URL$MODELS_ENDPOINT" }
-
-        try {
-            // Get API key from ConfigService (single source of truth)
-            val apiKeyToUse = configService?.get(
-                key = pl.jclab.refio.core.services.ConfigService.KEY_PROVIDER_OPENROUTER_API_KEY,
-                scope = pl.jclab.refio.core.db.ConfigScope.APP
-            )
-                ?: System.getProperty("OPENROUTER_API_KEY")
-                ?: System.getenv("OPENROUTER_API_KEY")
-                ?: null
-
-            if (apiKeyToUse==null){
-                return@withContext emptyList()
-            }
-
-            // Make HTTP request
-            val httpResponse = client.get("$DEFAULT_BASE_URL$MODELS_ENDPOINT") {
-                header("Authorization", "Bearer $apiKeyToUse")
-                header("HTTP-Referer", siteUrl)
-                header("X-Title", appName)
-            }
-
-            val response: Map<String, Any?> = httpResponse.body()
-
-            // Parse response
-            @Suppress("UNCHECKED_CAST")
-            val modelsData = response["data"] as? List<Map<String, Any?>> ?: emptyList()
-
-            if (modelsData.isEmpty()) {
-                logger.warn { "[OPENROUTER] API returned empty model list" }
-                return@withContext emptyList()
-            }
-
-            val modelConfigs = modelsData.mapNotNull { modelData ->
-                try {
-                    val modelId = modelData["id"] as? String
-                    if (modelId == null) {
-                        logger.debug { "[OPENROUTER] Skipping model without ID: $modelData" }
-                        return@mapNotNull null
-                    }
-
-                    // Filter using whitelist - only supported models
-                    if (!pl.jclab.refio.core.llm.SupportedModels.isSupported("openrouter", modelId)) {
-                        return@mapNotNull null
-                    }
-
-                    val modelName = modelData["name"] as? String ?: modelId
-                    val contextLength = (modelData["context_length"] as? Number)?.toInt() ?: DEFAULT_CONTEXT_SIZE
-
-                    // Parse pricing (OpenRouter returns price per token, we need per 1M tokens)
-                    @Suppress("UNCHECKED_CAST")
-                    val pricingData = modelData["pricing"] as? Map<String, Any?>
-                    val promptPricePerToken = (pricingData?.get("prompt") as? String)?.toDoubleOrNull() ?: 0.0
-                    val completionPricePerToken = (pricingData?.get("completion") as? String)?.toDoubleOrNull() ?: 0.0
-
-                    // Convert from per-token to per-1M-tokens
-                    val costPer1mInput = promptPricePerToken * 1_000_000
-                    val costPer1mOutput = completionPricePerToken * 1_000_000
-
-                    // Parse architecture to determine capabilities
-                    @Suppress("UNCHECKED_CAST")
-                    val architecture = modelData["architecture"] as? Map<String, Any?>
-                    val modality = architecture?.get("modality") as? String
-
-                    logger.debug { "[OPENROUTER] Model $modelId has modality: $modality" }
-
-                    // All OpenRouter models support chat
-                    val capabilities = mutableListOf("chat", "streaming")
-
-                    // Add vision if it's a multimodal model
-                    if (modality == "text+image" || modality?.contains("image") == true) {
-                        capabilities.add("vision")
-                    }
-
-                    logger.debug { "[OPENROUTER] Found model: $modelId ($modelName) - context=$contextLength, " +
-                            "input=$${"%.4f".format(costPer1mInput)}/1M, output=$${"%.4f".format(costPer1mOutput)}/1M" }
-
-                    ModelConfig(
-                        id = modelId,
-                        name = "$modelName (via OpenRouter)",
-                        provider = "openrouter",
-                        capabilities = capabilities,
-                        maxContext = contextLength,
-                        costPer1mInput = costPer1mInput,
-                        costPer1mOutput = costPer1mOutput
-                    )
-                } catch (e: Exception) {
-                    logger.warn { "[OPENROUTER] Failed to parse model: ${e.message}" }
-                    null
-                }
-            }
-
-            logger.info { "[OPENROUTER] Found ${modelConfigs.size} models" }
-            return@withContext modelConfigs
-
-        } catch (e: Exception) {
-            logger.error(e) { "[OPENROUTER] Failed to fetch models: ${e.message}" }
-            throw LLMErrorMapper.listModelsFailure(provider, e)
+    override suspend fun listModels(): List<ModelConfig> {
+        if (resolveApiKey().isNullOrBlank()) return emptyList()
+        return try {
+            super.listModels()
+        } catch (e: RefioError.ProviderNotConfigured) {
+            emptyList()
         }
     }
 
-    override fun estimateCost(usage: LLMUsage): Double {
-        return pl.jclab.refio.core.llm.calculateCost(
-            provider = provider,
-            model = model,
-            inputTokens = usage.inputTokens,
-            outputTokens = usage.outputTokens
-        )
-    }
-
-    override suspend fun close() {
-        client.close()
-    }
+    override fun estimateCost(usage: LLMUsage): Double = calculateCost(
+        provider = provider,
+        model = model,
+        inputTokens = usage.inputTokens,
+        outputTokens = usage.outputTokens,
+    )
 }
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/ZAIAdapter.kt b/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/ZAIAdapter.kt
index c90c3d40..3888f501 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/ZAIAdapter.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/ZAIAdapter.kt
@@ -9,7 +9,7 @@ class ZAIAdapter(
     taskId: String? = null,
     subtaskId: String? = null,
     source: String? = null
-) : CustomOpenAIAdapter(
+) : GenericOpenAIAdapter(
     model = model,
     providerName = "zai",
     configService = configService,
@@ -17,7 +17,7 @@ class ZAIAdapter(
     subtaskId = subtaskId,
     source = source,
     requireApiKey = true,
-    defaultBaseUrl = configService?.getTyped(ConfigKeys.PROVIDER_ZAI_BASE_URL) ?: ConfigService.DEFAULT_ZAI_BASE_URL
+    defaultBaseUrl = configService?.getTyped(ConfigKeys.PROVIDER_ZAI_BASE_URL) ?: ZAIUrls.DEFAULT
 ) {
     override suspend fun listModels() = super.listModels()
 }
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/ZAIUrls.kt b/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/ZAIUrls.kt
new file mode 100644
index 00000000..8b5fb82d
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/llm/adapters/ZAIUrls.kt
@@ -0,0 +1,23 @@
+package pl.jclab.refio.core.llm.adapters
+
+/**
+ * Known Z.AI base URLs and normalization.
+ *
+ * Z.AI has published several endpoints over time; we canonicalize to [DEFAULT] so that
+ * configurations pointing at older URLs keep working without per-caller branching.
+ */
+object ZAIUrls {
+    const val DEFAULT: String = "https://api.z.ai/api/coding/paas/v4"
+    const val LEGACY: String = "https://api.z.ai/v1"
+    const val GENERAL: String = "https://api.z.ai/api/paas/v4"
+
+    fun normalize(baseUrl: String?): String {
+        val trimmed = baseUrl?.trim()?.trimEnd('/')
+        return when {
+            trimmed.isNullOrEmpty() -> DEFAULT
+            trimmed.equals(LEGACY, ignoreCase = true) -> DEFAULT
+            trimmed.equals(GENERAL, ignoreCase = true) -> DEFAULT
+            else -> trimmed
+        }
+    }
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/llm/streaming/OutputSizeLimiter.kt b/core/src/main/kotlin/pl/jclab/refio/core/llm/streaming/OutputSizeLimiter.kt
index 38119e79..c8981706 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/llm/streaming/OutputSizeLimiter.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/llm/streaming/OutputSizeLimiter.kt
@@ -10,13 +10,14 @@ package pl.jclab.refio.core.llm.streaming
  * while it chews through context window budget, and so that downstream code is
  * never handed a 2MB "message" that came from a single LLM turn gone wrong.
  *
- * Set conservatively: 32KB of text is ~8000 tokens, which is larger than
- * any legitimate single agent turn response should be.
+ * Set conservatively: 128KB of text is ~32K tokens, which covers
+ * legitimate large outputs such as single-file HTML apps, full file
+ * regenerations, and detailed analysis reports.
  *
- * @param maxChars Abort threshold in characters. Default 32768 (~8K tokens).
+ * @param maxChars Abort threshold in characters. Default 131072 (~32K tokens).
  */
 class OutputSizeLimiter(
-    private val maxChars: Int = 32_768
+    private val maxChars: Int = 131_072
 ) : StreamGuardrail {
 
     override val name: String = "size-limit"
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/llm/streaming/StreamGuardrails.kt b/core/src/main/kotlin/pl/jclab/refio/core/llm/streaming/StreamGuardrails.kt
index b6bace2b..f7ec156c 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/llm/streaming/StreamGuardrails.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/llm/streaming/StreamGuardrails.kt
@@ -82,7 +82,7 @@ class StreamGuardrails(
          * Current composition:
          * - [RepetitionDetector] with conservative defaults (fires on 4×
          *   consecutive block repetitions, any block size 50–800 chars).
-         * - [OutputSizeLimiter] at 32 KB — prevents runaway continuations.
+         * - [OutputSizeLimiter] at 128 KB — prevents runaway continuations.
          * - [WallClockDeadline] at 180 s — independent of Ktor's request
          *   timeout, so a stuck stream unwinds cleanly without waiting for
          *   the full provider timeout.
@@ -90,12 +90,12 @@ class StreamGuardrails(
          * All thresholds are deliberately loose — the goal is "catch obvious
          * pathology without false positives on healthy long-running streams".
          */
-        fun defaults(): StreamGuardrails {
+        fun defaults(wallClockDeadlineMs: Long = 180_000): StreamGuardrails {
             return StreamGuardrails(
                 guardrails = listOf(
                     RepetitionDetector(),
                     OutputSizeLimiter(),
-                    WallClockDeadline()
+                    WallClockDeadline(deadlineMs = wallClockDeadlineMs)
                 )
             )
         }
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/models/api/PromptsModels.kt b/core/src/main/kotlin/pl/jclab/refio/core/models/api/PromptsModels.kt
index be389e91..3693c28e 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/models/api/PromptsModels.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/models/api/PromptsModels.kt
@@ -34,9 +34,9 @@ data class SaveRuleRequest(
 )
 
 /**
- * Request to create or update a slash command
+ * Request to create or update a slash prompt
  */
-data class SaveCommandRequest(
+data class SaveSlashPromptRequest(
     val id: String? = null,
     val name: String,  // e.g., "/refactor" or "refactor" (will be normalized)
     val content: String,
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/models/context/ExecutedStepDTO.kt b/core/src/main/kotlin/pl/jclab/refio/core/models/context/ExecutedStepDTO.kt
index 6bf2007b..d0a6087f 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/models/context/ExecutedStepDTO.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/models/context/ExecutedStepDTO.kt
@@ -13,6 +13,7 @@ import java.time.Instant
  * @property tool Tool name that was executed
  * @property parameters Tool parameters
  * @property result Full raw output from the tool
+ * @property rawResultSize Original size in chars of the raw output before any truncation
  * @property summary Summarized output (may be null if not summarized)
  * @property timestamp When the step was completed
  * @property success Whether the underlying subtask finished successfully.
@@ -25,6 +26,7 @@ data class ExecutedStepDTO(
     val tool: String,
     val parameters: Map<String, Any>,
     val result: String,
+    val rawResultSize: Int = 0,
     val summary: String?,
     val timestamp: Instant,
     val success: Boolean = true
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/models/context/HelperDTOs.kt b/core/src/main/kotlin/pl/jclab/refio/core/models/context/HelperDTOs.kt
index 38d243fe..43a040b1 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/models/context/HelperDTOs.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/models/context/HelperDTOs.kt
@@ -44,18 +44,6 @@ data class ConversationMessageDTO(
     val metadata: Map<String, Any?>? = null
 )
 
-/**
- * Code fragment DTO - relevant code/documentation fragment from RAG search
- */
-data class CodeFragmentDTO(
-    val filePath: String,
-    val content: String,
-    val startLine: Int?,
-    val endLine: Int?,
-    val similarity: Float,
-    val contentType: String  // PROJECT_CODE or DOCUMENTATION
-)
-
 /**
  * Resolved context reference DTO - user-provided context from @ mentions
  * Contains resolved content ready for LLM consumption
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/models/context/ProjectContextDTO.kt b/core/src/main/kotlin/pl/jclab/refio/core/models/context/ProjectContextDTO.kt
index 0b5d1d6d..df1ffb2a 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/models/context/ProjectContextDTO.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/models/context/ProjectContextDTO.kt
@@ -29,16 +29,11 @@ data class ProjectContextDTO(
 
     // Work history (from PHASE 3, refactored in ADR 0041)
     val completedFiles: List<String> = emptyList(),
-    @Deprecated("Use executedSteps instead for structured history")
-    val previousSubtasks: List<String> = emptyList(),
     val executedSteps: List<ExecutedStepDTO> = emptyList(),
 
     // User requirements (extracted from task description - PHASE 2)
     val userRequirements: Map<String, Any> = emptyMap(),
 
-    // RAG (Retrieval-Augmented Generation) context
-    val ragFragments: List<CodeFragmentDTO> = emptyList(),
-
     // User-provided context (from @ mentions + extracted from messages)
     val userContextRefs: List<ResolvedContextDTO> = emptyList(),
 
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/prompts/ToolDescriptionBuilder.kt b/core/src/main/kotlin/pl/jclab/refio/core/prompts/ToolDescriptionBuilder.kt
index 3760a042..08b429ee 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/prompts/ToolDescriptionBuilder.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/prompts/ToolDescriptionBuilder.kt
@@ -13,8 +13,8 @@ import pl.jclab.refio.core.utils.GsonInstance.prettyGson
  * - CHAT/PLAN: Read-only tools (read_file, grep_search, etc.) filtered by permissions
  * - AGENT: All tools including write operations filtered by permissions
  *
- * Uses ToolRegistry and ToolPermissionsService to dynamically generate descriptions
- * with parameter schemas.
+ * Groups tools by logical sections (Reading & Search, Web, Editing, etc.)
+ * with section headers for better LLM comprehension.
  */
 class ToolDescriptionBuilder(
     private val toolRegistry: ToolRegistry,
@@ -35,34 +35,6 @@ class ToolDescriptionBuilder(
         return getToolDescriptionsForTools(mode, getToolsForMode(mode, taskId))
     }
 
-    /**
-     * Get tool description for a specific tool only.
-     * Used when we want to constrain the LLM to use only the suggested tool.
-     *
-     * @param mode Task mode
-     * @param toolName Name of the tool to describe
-     * @param taskId Optional task ID for task-level permissions
-     * @return Tool description string with parameter schema, or error message if tool not found/available
-     */
-    fun getSingleToolDescription(mode: TaskMode, toolName: String, taskId: String? = null): String {
-        val allTools = getToolsForMode(mode, taskId)
-        val tool = allTools.find { it.name == toolName }
-
-        if (tool == null) {
-            return "ERROR: Tool '$toolName' not found or not available in ${mode.name} mode."
-        }
-
-        val schema = tool.getParameterSchema()
-        val description = buildToolDescription(1, tool.name, tool.description, schema)
-
-        val modeNote = when (mode) {
-            TaskMode.CHAT, TaskMode.PLAN -> "SUGGESTED TOOL (you MUST use this tool in ${mode.name} mode):"
-            TaskMode.AGENT -> "SUGGESTED TOOL (you should use this tool):"
-        }
-
-        return "$modeNote\n\n$description"
-    }
-
     /**
      * Get list of valid tool names for validation.
      *
@@ -76,20 +48,30 @@ class ToolDescriptionBuilder(
 
     /**
      * Get tool descriptions for a pre-filtered tool list.
-     * Keeps prompts consistent with task-specific constraints (e.g., read-only).
+     * Groups tools by logical sections (Reading, Web, Editing, etc.) with headers.
      */
     fun getToolDescriptionsForTools(mode: TaskMode, tools: List<Tool>): String {
-        val descriptions = tools.mapIndexed { index, tool ->
-            val schema = tool.getParameterSchema()
-            buildToolDescription(index + 1, tool.name, tool.description, schema)
-        }.joinToString("\n\n")
-
         val modeNote = when (mode) {
             TaskMode.CHAT, TaskMode.PLAN -> "READ-ONLY TOOLS (you can use these in ${mode.name} mode):"
             TaskMode.AGENT -> "AVAILABLE TOOLS (read-only and write operations):"
         }
 
-        return "$modeNote\n\n$descriptions"
+        val groups = toolRegistry.getToolsByGroups(tools)
+        var number = 1
+        val sb = StringBuilder()
+        sb.appendLine(modeNote)
+
+        for ((groupName, groupTools) in groups) {
+            sb.appendLine()
+            sb.appendLine("### $groupName\n")
+            for (tool in groupTools) {
+                val schema = tool.getParameterSchema()
+                sb.append(buildToolDescription(number, tool.name, tool.description, schema))
+                number++
+            }
+        }
+
+        return sb.toString().trimEnd()
     }
 
     /**
@@ -99,6 +81,40 @@ class ToolDescriptionBuilder(
         return tools.joinToString(", ") { it.name }
     }
 
+    /**
+     * Build the When-to-use-what selection matrix for the given mode.
+     * Only tools that provide a [Tool.selectionHint] appear. Grouped by the same
+     * logical sections used by tool descriptions — if a permission filter hides a
+     * tool, it disappears from the matrix too.
+     */
+    fun getToolSelectionMatrix(mode: TaskMode, taskId: String? = null): String {
+        return buildSelectionMatrix(getToolsForMode(mode, taskId))
+    }
+
+    /**
+     * Build the selection matrix from a pre-filtered tool list. Returns empty
+     * string if no tool in the list provides a selection hint.
+     */
+    fun buildSelectionMatrix(tools: List<Tool>): String {
+        val groups = toolRegistry.getToolsByGroups(tools)
+        val sb = StringBuilder()
+        var anyRow = false
+
+        sb.appendLine("| Tool | When to use |")
+        sb.appendLine("|---|---|")
+        for ((_, groupTools) in groups) {
+            for (tool in groupTools) {
+                val hint = tool.selectionHint?.trim().orEmpty()
+                if (hint.isBlank()) continue
+                val safeHint = hint.replace("|", "\\|").replace("\n", " ")
+                sb.append("| `").append(tool.name).append("` | ").append(safeHint).append(" |\n")
+                anyRow = true
+            }
+        }
+
+        return if (anyRow) sb.toString().trimEnd() else ""
+    }
+
     /**
      * Get tools filtered by task mode AND permissions.
      */
@@ -137,16 +153,22 @@ class ToolDescriptionBuilder(
         val required = (schema["required"] as? List<String>) ?: emptyList()
 
         if (properties.isNotEmpty()) {
-            properties.forEach { (paramName, paramSchema) ->
+            val requiredParams = properties.filter { it.key in required }
+            val optionalParams = properties.filter { it.key !in required }
+
+            requiredParams.forEach { (paramName, paramSchema) ->
                 val paramType = paramSchema["type"]?.toString() ?: "any"
                 val paramDesc = paramSchema["description"]?.toString() ?: ""
-                val isRequired = paramName in required
-                val requiredLabel = if (isRequired) "Required" else "Optional"
+                sb.append("   - **\"$paramName\"** ($paramType)")
+                if (paramDesc.isNotBlank()) sb.append(" — $paramDesc")
+                sb.append("\n")
+            }
 
-                sb.append("   - $requiredLabel: \"$paramName\" ($paramType)")
-                if (paramDesc.isNotBlank()) {
-                    sb.append(" - $paramDesc")
-                }
+            optionalParams.forEach { (paramName, paramSchema) ->
+                val paramType = paramSchema["type"]?.toString() ?: "any"
+                val paramDesc = paramSchema["description"]?.toString() ?: ""
+                sb.append("   - \"$paramName\" ($paramType, optional)")
+                if (paramDesc.isNotBlank()) sb.append(" — $paramDesc")
                 sb.append("\n")
             }
         }
@@ -165,7 +187,7 @@ class ToolDescriptionBuilder(
         }
 
         if (exampleParams.isNotEmpty()) {
-            sb.append("   - Example: {\"tool\": \"$name\", \"args\": ${gson.toJson(exampleParams)}}\n")
+            sb.append("\nExample: {\"tool\": \"$name\", \"args\": ${gson.toJson(exampleParams)}}\n\n")
         }
 
         return sb.toString()
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/services/AgentTurnLoop.kt b/core/src/main/kotlin/pl/jclab/refio/core/services/AgentTurnLoop.kt
index ea88d26f..03920cc3 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/services/AgentTurnLoop.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/services/AgentTurnLoop.kt
@@ -25,6 +25,8 @@ import pl.jclab.refio.core.services.AgentTurnLoop.UserMessageStrategy
 import pl.jclab.refio.core.services.monitoring.GlobalMetrics
 import pl.jclab.refio.core.services.monitoring.OperationInfo
 import pl.jclab.refio.core.services.turn.ToolCallParser
+import pl.jclab.refio.core.services.turn.TurnEventListener
+import pl.jclab.refio.core.services.turn.TurnPrompt
 import pl.jclab.refio.core.services.turn.GuardianContext
 import pl.jclab.refio.core.services.turn.GuardianDecision
 import pl.jclab.refio.core.services.turn.GuardianRegistry
@@ -56,16 +58,6 @@ private typealias TurnRepetitionTracker = TurnGuardrails.TurnRepetitionTracker
 private val logger = dualLogger("AgentTurnLoop")
 
 
-/**
- * Adapter to convert between TurnPrompt and LLMCallPrompt.
- */
-private object TurnPromptAdapter {
-    fun toLLMCallPrompt(prompt: TurnPrompt) = pl.jclab.refio.core.services.turn.LLMCallPrompt(
-        systemPrompt = prompt.systemPrompt,
-        messages = prompt.messages
-    )
-}
-
 /**
  * AgentTurnLoop - Turn-based execution loop implementing Codex CLI-style pattern.
  *
@@ -135,77 +127,6 @@ class AgentTurnLoop(
         _turnState.value = _turnState.value.update()
     }
 
-    /**
-     * Listener for turn events (tool execution, streaming, etc.).
-     */
-    interface TurnEventListener : pl.jclab.refio.core.services.turn.TurnCompletionListener {
-        fun onTurnStarted(
-            taskId: String,
-            mode: TaskMode,
-            runId: String,
-            parentRunId: String?,
-            depth: Int
-        ) {}
-
-        fun onToolExecutionStarted(taskId: String, toolCall: pl.jclab.refio.core.db.ToolCallData) {}
-        fun onToolStreamChunk(taskId: String, toolCallId: String, delta: String, accumulated: String) {}
-        fun onToolExecutionCompleted(taskId: String, toolCall: pl.jclab.refio.core.db.ToolCallData, result: String, success: Boolean) {}
-        fun onStreamChunk(taskId: String, delta: String, accumulated: String) {}
-
-        companion object {
-            /**
-             * Create from turn/ package TurnEventListener (inverse of [toTurnEventListener]).
-             */
-            fun fromTurnEventListener(source: pl.jclab.refio.core.services.turn.TurnEventListener): TurnEventListener =
-                object : TurnEventListener {
-                    override fun onTurnStarted(taskId: String, mode: TaskMode, runId: String, parentRunId: String?, depth: Int) {
-                        source.onTurnStarted(taskId, mode, runId, parentRunId, depth)
-                    }
-                    override fun onToolExecutionStarted(taskId: String, toolCall: pl.jclab.refio.core.db.ToolCallData) {
-                        source.onToolExecutionStarted(taskId, toolCall)
-                    }
-                    override fun onToolStreamChunk(taskId: String, toolCallId: String, delta: String, accumulated: String) {
-                        source.onToolStreamChunk(taskId, toolCallId, delta, accumulated)
-                    }
-                    override fun onToolExecutionCompleted(taskId: String, toolCall: pl.jclab.refio.core.db.ToolCallData, result: String, success: Boolean) {
-                        source.onToolExecutionCompleted(taskId, toolCall, result, success)
-                    }
-                    override fun onStreamChunk(taskId: String, delta: String, accumulated: String) {
-                        source.onStreamChunk(taskId, delta, accumulated)
-                    }
-                    override fun onTurnCompleted(taskId: String, result: pl.jclab.refio.core.services.TurnResult, runId: String, parentRunId: String?, depth: Int) {
-                        source.onTurnCompleted(taskId, result, runId, parentRunId, depth)
-                    }
-                }
-        }
-
-        /**
-         * Convert to turn/ package TurnEventListener for compatibility.
-         */
-        fun toTurnEventListener(): pl.jclab.refio.core.services.turn.TurnEventListener =
-            object : pl.jclab.refio.core.services.turn.TurnEventListener {
-                override fun onTurnStarted(taskId: String, mode: pl.jclab.refio.core.db.TaskMode, runId: String, parentRunId: String?, depth: Int) {
-                    this@TurnEventListener.onTurnStarted(taskId, mode, runId, parentRunId, depth)
-                }
-
-                override fun onToolExecutionStarted(taskId: String, toolCall: pl.jclab.refio.core.db.ToolCallData) {
-                    this@TurnEventListener.onToolExecutionStarted(taskId, toolCall)
-                }
-
-                override fun onToolStreamChunk(taskId: String, toolCallId: String, delta: String, accumulated: String) {
-                    this@TurnEventListener.onToolStreamChunk(taskId, toolCallId, delta, accumulated)
-                }
-
-                override fun onToolExecutionCompleted(taskId: String, toolCall: pl.jclab.refio.core.db.ToolCallData, result: String, success: Boolean) {
-                    this@TurnEventListener.onToolExecutionCompleted(taskId, toolCall, result, success)
-                }
-
-                override fun onStreamChunk(taskId: String, delta: String, accumulated: String) {
-                    this@TurnEventListener.onStreamChunk(taskId, delta, accumulated)
-                }
-            }
-    }
-
     // Type aliases for guardrails classes - using turn/ package implementations
     // Note: Using full qualified names instead of nested typealiases (not supported in Kotlin classes)
 
@@ -301,7 +222,9 @@ class AgentTurnLoop(
         chatMessageRepository.create(
             taskId = taskId,
             role = MessageRole.USER,
-            content = userInput
+            content = userInput,
+            agentName = profileOverrides?.subagentName,
+            agentDepth = profileOverrides?.subagentName?.let { (profileOverrides.depth) + 1 },
         )
 
         // Step 2: Execute turn loop
@@ -474,6 +397,8 @@ class AgentTurnLoop(
         // under context pressure routinely drop format for one iteration; a single short
         // nudge usually brings them back. Without this guard the loop exits as `success=true`
         // on the first plain-text response, silently abandoning mid-task work.
+        // NOTE: Nudges are skipped when the model previously executed tool calls — plain text
+        // after successful tool usage is treated as intentional completion, not format loss.
         var plainTextNudgeCount = 0
         var totalTokensIn = 0
         var totalTokensOut = 0
@@ -484,6 +409,46 @@ class AgentTurnLoop(
             val name = profileOverrides?.subagentName ?: "subagent"
             """{"subagent_name":"$name"}"""
         } else null
+
+        // When running as a subagent, persist messages with agentName / agentDepth so the
+        // IntelliJ chat bubble renderer groups them under a per-agent header.
+        val persistAgentName: String? = profileOverrides?.subagentName
+        val persistAgentDepth: Int? = if (persistAgentName != null) (profileOverrides?.depth ?: 0) + 1 else null
+
+        // For subagent turns, wrap the caller's streamCallback so each token delta is ALSO
+        // published as AgentEvent.StreamChunk with runId/depth/agentName. CoreSessionService
+        // subscribes to these events to render a per-agent streaming bubble that updates live
+        // while the subagent's LLM is still generating. Top-level turns skip the wrapper — their
+        // deltas already feed the main streaming message directly via streamCallback.
+        val effectiveStreamCallback: StreamCallback? = if (persistAgentName != null && agentEventBus != null) {
+            val bus = agentEventBus
+            val wrappedName = persistAgentName
+            val wrappedDepth = persistAgentDepth ?: 1
+            val wrappedRunId = runId
+            val wrappedSessionId = evSessionId
+            val wrappedSourceAgentId = evSourceAgentId
+            val delegate = streamCallback
+            { chunk ->
+                delegate?.invoke(chunk)
+                bus.tryEmit(
+                    pl.jclab.refio.core.agents.events.AgentEvent.StreamChunk(
+                        id = UUID.randomUUID().toString(),
+                        sessionId = wrappedSessionId,
+                        sourceAgentId = wrappedSourceAgentId,
+                        timestamp = System.currentTimeMillis(),
+                        correlationId = wrappedRunId,
+                        delta = chunk.delta,
+                        accumulated = chunk.accumulated,
+                        isComplete = chunk.isComplete,
+                        runId = wrappedRunId,
+                        depth = wrappedDepth,
+                        agentName = wrappedName,
+                    )
+                )
+            }
+        } else {
+            streamCallback
+        }
         val (effectiveModel, effectiveProvider) = turnLLMCaller.resolveModelSelection(
             mode = mode,
             taskId = taskId,
@@ -572,7 +537,6 @@ class AgentTurnLoop(
                     updateTurnState { copy(phase = TurnPhase.CALLING_MODEL) }
                     GlobalMetrics.setCurrentOperation(OperationInfo.TurnLLMCall(iteration, mode.name))
 
-                    val llmPrompt = TurnPromptAdapter.toLLMCallPrompt(prompt)
                     val llmCallStartNanos = System.nanoTime()
                     // Mutable so the empty-content recovery path below can re-bind it after pulling
                     // a JSON envelope out of the `thinking` field (qwen3 / Ollama edge case).
@@ -587,15 +551,15 @@ class AgentTurnLoop(
                             maxRetries = config.maxRetries,
                             baseDelayMs = config.retryBackoffMs,
                             responseFormat = responseFormat,
-                            stream = streamCallback != null,
-                            onChunk = streamCallback
+                            stream = effectiveStreamCallback != null,
+                            onChunk = effectiveStreamCallback
                         )
                     } else {
                         turnLLMCaller.callLLM(
                             taskId = taskId,
                             mode = mode,
-                            prompt = llmPrompt,
-                            streamCallback = streamCallback,
+                            prompt = prompt,
+                            streamCallback = effectiveStreamCallback,
                             model = effectiveModel,
                             provider = effectiveProvider,
                             profileOverrides = profileOverrides
@@ -677,33 +641,86 @@ class AgentTurnLoop(
                             llmResponse = llmResponse.copy(content = thinking ?: "", thinking = null)
                             // Fall through to the regular tool-call extraction path.
                         } else {
+                            val canRetryEmptyContent =
+                                mode == TaskMode.AGENT &&
+                                    plainTextNudgeCount < 2 &&
+                                    iteration < maxIterations
+
+                            if (canRetryEmptyContent) {
+                                plainTextNudgeCount++
+                                logger.warn {
+                                    "[FORMAT_RETRY_NUDGE] taskId=$taskId, iteration=$iteration: " +
+                                        "LLM returned empty content in JSON mode. " +
+                                        "Nudge=$plainTextNudgeCount/2, finishReason=${llmResponse.finishReason}"
+                                }
+                                val resolvedThinking = turnResponseProcessor.resolveAssistantThinking(llmResponse)
+                                if (!resolvedThinking.isNullOrBlank()) {
+                                    chatMessageRepository.create(
+                                        taskId = taskId,
+                                        role = MessageRole.ASSISTANT,
+                                        content = "",
+                                        thinking = resolvedThinking,
+                                        toolCalls = null,
+                                        tokensIn = llmResponse.usage.inputTokens,
+                                        tokensOut = llmResponse.usage.outputTokens,
+                                        cost = llmResponse.cost,
+                                        agentName = persistAgentName,
+                                        agentDepth = persistAgentDepth,
+                                    )
+                                }
+                                chatMessageRepository.create(
+                                    taskId = taskId,
+                                    role = MessageRole.SYSTEM,
+                                    content = "Your previous reply contained empty content in structured JSON mode. " +
+                                        "Generate the full JSON envelope again from scratch. " +
+                                        "Do not continue or patch the previous output. Reply with JSON only: " +
+                                        "{\"actions\":[{\"tool\":\"NAME\",\"arguments\":{...}}]," +
+                                        "\"response\":\"...\",\"intent\":\"implementation\"}. " +
+                                        "No prose, no markdown fences.",
+                                    toolCalls = null,
+                                    agentName = persistAgentName,
+                                    agentDepth = persistAgentDepth,
+                                )
+                                continue
+                            }
+
                             logger.error {
                                 "[TURN_FAILED] Empty content from model in JSON mode " +
                                     "(mode=$mode, finishReason=${llmResponse.finishReason}, thinkingLength=${llmResponse.thinking?.length ?: 0})"
                             }
+                            val response = if (mode == TaskMode.AGENT) {
+                                "The agent returned empty content in structured mode and could not recover after retrying. " +
+                                    "Please rerun with the same task or switch to a more reliable model."
+                            } else {
+                                "Model returned empty content in structured mode. " +
+                                    "The selected model likely does not produce the required JSON envelope — " +
+                                    "try a different model (e.g. one tuned for tool use) or simplify the request."
+                            }
                             val result = TurnResult(
                                 success = false,
-                                response = "Model returned empty content in structured mode. " +
-                                    "The selected model likely does not produce the required JSON envelope — " +
-                                    "try a different model (e.g. one tuned for tool use) or simplify the request.",
+                                response = response,
                                 iterations = iteration,
                                 tokensIn = totalTokensIn,
                                 tokensOut = totalTokensOut,
                                 cost = totalCost,
                                 toolsUsed = usedTools.distinct()
                             )
-                            return turnFinalizer.completeTurn(taskId, result, listener, runId, parentRunId, depth, persistAssistantMessage = true, metadata = subagentMetadata)
+                            return turnFinalizer.completeTurn(taskId, result, listener, runId, parentRunId, depth, persistAssistantMessage = true, metadata = subagentMetadata, agentName = persistAgentName, agentDepth = persistAgentDepth)
                         }
                     }
 
                     // Check if model invoked tools
                     val contentForExtraction = toolCallParser.preprocessContent(llmResponse.content, taskId)
+                    val jsonEnvelopeInspection = toolCallParser.inspectJsonEnvelope(contentForExtraction)
                     val toolCalls = toolCallParser.extractToolCalls(contentForExtraction, mode, profileOverrides)
+                    val looksLikeJsonResponse =
+                        jsonEnvelopeInspection.hasJsonEnvelope || contentForExtraction.trim().startsWith("[")
 
                     // Check for truncated response with incomplete JSON
                     val isTruncatedWithIncompleteJson =
                         llmResponse.finishReason == "length" &&
-                        contentForExtraction.trim().startsWith("{") &&
+                        jsonEnvelopeInspection.hasJsonEnvelope &&
+                        !jsonEnvelopeInspection.isComplete &&
                         toolCalls.isEmpty()
 
                     if (isTruncatedWithIncompleteJson) {
@@ -724,7 +741,7 @@ class AgentTurnLoop(
                             cost = totalCost,
                             toolsUsed = usedTools.distinct()
                         )
-                        return turnFinalizer.completeTurn(taskId, result, listener, runId, parentRunId, depth, persistAssistantMessage = true, metadata = subagentMetadata)
+                        return turnFinalizer.completeTurn(taskId, result, listener, runId, parentRunId, depth, persistAssistantMessage = true, metadata = subagentMetadata, agentName = persistAgentName, agentDepth = persistAgentDepth)
                     }
 
                     if (toolCalls.isNotEmpty()) {
@@ -744,7 +761,9 @@ class AgentTurnLoop(
                             toolCalls = toolCalls,
                             tokensIn = llmResponse.usage.inputTokens,
                             tokensOut = llmResponse.usage.outputTokens,
-                            cost = llmResponse.cost
+                            cost = llmResponse.cost,
+                            agentName = persistAgentName,
+                            agentDepth = persistAgentDepth,
                         )
 
                         // Track used tool names
@@ -768,7 +787,7 @@ class AgentTurnLoop(
                         // When no caller listener is present we fall back to batch-level timing.
                         toolStartNanos.clear()
                         toolDurationsMs.clear()
-                        val innerListener = listener?.toTurnEventListener()
+                        val innerListener = listener
                         val effectiveListener: pl.jclab.refio.core.services.turn.TurnEventListener? =
                             if (innerListener != null) {
                                 object : pl.jclab.refio.core.services.turn.TurnEventListener {
@@ -821,7 +840,9 @@ class AgentTurnLoop(
                             chatMessageRepository.create(
                                 taskId = taskId,
                                 role = MessageRole.SYSTEM,
-                                content = "User rejected tool '${e.toolName}'. Reason: ${e.reason ?: "not specified"}"
+                                content = "User rejected tool '${e.toolName}'. Reason: ${e.reason ?: "not specified"}",
+                                agentName = persistAgentName,
+                                agentDepth = persistAgentDepth,
                             )
                             updateTurnState { copy(phase = TurnPhase.IDLE) }
                             val result = TurnResult(
@@ -838,7 +859,8 @@ class AgentTurnLoop(
                             )
                             return turnFinalizer.completeTurn(
                                 taskId, result, listener, runId, parentRunId, depth,
-                                persistAssistantMessage = false, metadata = subagentMetadata
+                                persistAssistantMessage = false, metadata = subagentMetadata,
+                                agentName = persistAgentName, agentDepth = persistAgentDepth,
                             )
                         }
 
@@ -853,7 +875,9 @@ class AgentTurnLoop(
                                 result = resultData.content,
                                 isSummarized = resultData.isSummarized,
                                 rawOutput = resultData.rawOutput,
-                                metadata = resultData.metadata
+                                metadata = resultData.metadata,
+                                agentName = persistAgentName,
+                                agentDepth = persistAgentDepth,
                             )
                         }
 
@@ -928,7 +952,9 @@ class AgentTurnLoop(
                                 chatMessageRepository.create(
                                     taskId = taskId,
                                     role = MessageRole.SYSTEM,
-                                    content = responseContent
+                                    content = responseContent,
+                                    agentName = persistAgentName,
+                                    agentDepth = persistAgentDepth,
                                 )
                                 logger.info { "[AWAITING_RESPONSE] Got response for $requestId: ${responseContent.take(100)}" }
                             }
@@ -944,7 +970,7 @@ class AgentTurnLoop(
                             )
                         }
                         val batchSummary = ToolBatchSummary.summarize(batchInput)
-                        listener?.toTurnEventListener()?.onToolBatchCompleted(taskId, batchSummary)
+                        listener?.onToolBatchCompleted(taskId, batchSummary)
 
                         // Track error rate + definitive-loop detection + unified repetition tracker.
                         // Definitive loop = the SAME tool with the SAME arguments failing repeatedly.
@@ -994,7 +1020,7 @@ class AgentTurnLoop(
                                 cost = totalCost,
                                 toolsUsed = usedTools.distinct()
                             )
-                            return turnFinalizer.completeTurn(taskId, result, listener, runId, parentRunId, depth, persistAssistantMessage = true, metadata = subagentMetadata)
+                            return turnFinalizer.completeTurn(taskId, result, listener, runId, parentRunId, depth, persistAssistantMessage = true, metadata = subagentMetadata, agentName = persistAgentName, agentDepth = persistAgentDepth)
                         }
 
                         val writeToolCalls = turnToolExecutor.countWriteToolCalls(toolCalls)
@@ -1021,7 +1047,7 @@ class AgentTurnLoop(
                                 cost = totalCost,
                                 toolsUsed = usedTools.distinct()
                             )
-                            return turnFinalizer.completeTurn(taskId, result, listener, runId, parentRunId, depth, persistAssistantMessage = true, metadata = subagentMetadata)
+                            return turnFinalizer.completeTurn(taskId, result, listener, runId, parentRunId, depth, persistAssistantMessage = true, metadata = subagentMetadata, agentName = persistAgentName, agentDepth = persistAgentDepth)
                         }
 
                         if (errorTracker.shouldAbort(config.errorRateThreshold)) {
@@ -1034,7 +1060,7 @@ class AgentTurnLoop(
                                 cost = totalCost,
                                 toolsUsed = usedTools.distinct()
                             )
-                            return turnFinalizer.completeTurn(taskId, result, listener, runId, parentRunId, depth, persistAssistantMessage = true, metadata = subagentMetadata)
+                            return turnFinalizer.completeTurn(taskId, result, listener, runId, parentRunId, depth, persistAssistantMessage = true, metadata = subagentMetadata, agentName = persistAgentName, agentDepth = persistAgentDepth)
                         }
 
                         // Check for mid-execution user messages after tool execution
@@ -1044,7 +1070,9 @@ class AgentTurnLoop(
                                 taskId = taskId,
                                 role = MessageRole.SYSTEM,
                                 content = "[New user message above — address it next]",
-                                toolCalls = null
+                                toolCalls = null,
+                                agentName = persistAgentName,
+                                agentDepth = persistAgentDepth,
                             )
                         }
 
@@ -1059,7 +1087,9 @@ class AgentTurnLoop(
                                 taskId = taskId,
                                 role = MessageRole.SYSTEM,
                                 content = "[New user message above — address it before finishing]",
-                                toolCalls = null
+                                toolCalls = null,
+                                agentName = persistAgentName,
+                                agentDepth = persistAgentDepth,
                             )
                             // Save the current assistant response before continuing
                             val textResponse = toolCallParser.extractTextResponse(llmResponse.content)
@@ -1071,7 +1101,9 @@ class AgentTurnLoop(
                                 toolCalls = null,
                                 tokensIn = llmResponse.usage.inputTokens,
                                 tokensOut = llmResponse.usage.outputTokens,
-                                cost = llmResponse.cost
+                                cost = llmResponse.cost,
+                                agentName = persistAgentName,
+                                agentDepth = persistAgentDepth,
                             )
                             continue
                         }
@@ -1082,6 +1114,12 @@ class AgentTurnLoop(
                         // unable to produce JSON and further retries are wasted turns. Guarded by
                         // `iteration < maxIterations` so we don't nudge on the last possible turn.
                         //
+                        // IMPORTANT: If the model previously executed tool calls (usedTools is
+                        // non-empty), plain text is treated as intentional task completion — not a
+                        // format lapse. Nudging in this case wastes 2-3 iterations regenerating
+                        // the same summary. Incomplete JSON envelopes are still retried regardless,
+                        // as they indicate a truncated response that needs regeneration.
+                        //
                         // We intentionally do NOT persist the plain-text body as an ASSISTANT
                         // message: doing so reinjects the bad output into the next prompt and
                         // trains the model to keep emitting plain text. The model's `thinking`
@@ -1091,16 +1129,31 @@ class AgentTurnLoop(
                         // Nudge message is inlined (short, verbatim) because TurnNudgeBuilder was
                         // removed — reanimating the whole nudge layer for this one case would
                         // contradict the intentional simplification in TurnGuardrails.
-                        if (mode == TaskMode.AGENT &&
-                            !contentForExtraction.trim().let { it.startsWith("{") || it.startsWith("[") } &&
-                            contentForExtraction.isNotBlank() &&
-                            plainTextNudgeCount < 2 &&
-                            iteration < maxIterations
-                        ) {
+                        val hasIncompleteJsonEnvelope =
+                            jsonEnvelopeInspection.hasJsonEnvelope && !jsonEnvelopeInspection.isComplete
+                        // Skip nudge when the model previously produced valid JSON with tool calls
+                        // and now returns plain text — this is an intentional completion, not a
+                        // format lapse. Only nudge when the model never succeeded with JSON format
+                        // (usedTools is empty) or when the envelope is structurally incomplete
+                        // (truncated response that needs regeneration).
+                        val modelPreviouslyUsedTools = usedTools.isNotEmpty()
+                        val requiresFormatRetry =
+                            mode == TaskMode.AGENT &&
+                                contentForExtraction.isNotBlank() &&
+                                plainTextNudgeCount < 2 &&
+                                iteration < maxIterations &&
+                                (hasIncompleteJsonEnvelope || (!looksLikeJsonResponse && !modelPreviouslyUsedTools))
+
+                        if (requiresFormatRetry) {
                             plainTextNudgeCount++
+                            val retryReason = if (hasIncompleteJsonEnvelope) {
+                                "LLM returned incomplete JSON envelope"
+                            } else {
+                                "LLM returned plain text without JSON structure"
+                            }
                             logger.warn {
-                                "[PLAIN_TEXT_NUDGE] taskId=$taskId, iteration=$iteration: " +
-                                    "LLM returned plain text without JSON structure. " +
+                                "[FORMAT_RETRY_NUDGE] taskId=$taskId, iteration=$iteration: " +
+                                    "$retryReason. " +
                                     "Nudge=$plainTextNudgeCount/2, content='${contentForExtraction.take(80)}'"
                             }
                             val resolvedThinking = turnResponseProcessor.resolveAssistantThinking(llmResponse)
@@ -1113,21 +1166,62 @@ class AgentTurnLoop(
                                     toolCalls = null,
                                     tokensIn = llmResponse.usage.inputTokens,
                                     tokensOut = llmResponse.usage.outputTokens,
-                                    cost = llmResponse.cost
+                                    cost = llmResponse.cost,
+                                    agentName = persistAgentName,
+                                    agentDepth = persistAgentDepth,
                                 )
                             }
                             chatMessageRepository.create(
                                 taskId = taskId,
                                 role = MessageRole.SYSTEM,
-                                content = "Reply with JSON only: " +
-                                    "{\"actions\":[{\"tool\":\"NAME\",\"arguments\":{...}}]," +
-                                    "\"response\":\"...\",\"intent\":\"implementation\"}. " +
-                                    "No prose, no markdown fences.",
-                                toolCalls = null
+                                content = if (hasIncompleteJsonEnvelope) {
+                                    "Your previous reply contained incomplete JSON. Generate the full JSON envelope again from scratch. " +
+                                        "Do not continue or patch the previous output. Reply with JSON only: " +
+                                        "{\"actions\":[{\"tool\":\"NAME\",\"arguments\":{...}}]," +
+                                        "\"response\":\"...\",\"intent\":\"implementation\"}. " +
+                                        "No prose, no markdown fences."
+                                } else {
+                                    "Reply with JSON only: " +
+                                        "{\"actions\":[{\"tool\":\"NAME\",\"arguments\":{...}}]," +
+                                        "\"response\":\"...\",\"intent\":\"implementation\"}. " +
+                                        "No prose, no markdown fences."
+                                },
+                                toolCalls = null,
+                                agentName = persistAgentName,
+                                agentDepth = persistAgentDepth,
                             )
                             continue
                         }
 
+                        if (mode == TaskMode.AGENT && hasIncompleteJsonEnvelope) {
+                            logger.error {
+                                "[MALFORMED_JSON_ENVELOPE] taskId=$taskId, iteration=$iteration: " +
+                                    "assistant returned incomplete JSON after retries exhausted"
+                            }
+                            val result = TurnResult(
+                                success = false,
+                                response = "The agent returned an incomplete JSON envelope and could not recover after retrying. " +
+                                    "Please rerun with the same task or switch to a more reliable model.",
+                                iterations = iteration,
+                                tokensIn = totalTokensIn,
+                                tokensOut = totalTokensOut,
+                                cost = totalCost,
+                                toolsUsed = usedTools.distinct()
+                            )
+                            return turnFinalizer.completeTurn(
+                                taskId,
+                                result,
+                                listener,
+                                runId,
+                                parentRunId,
+                                depth,
+                                persistAssistantMessage = true,
+                                metadata = subagentMetadata,
+                                agentName = persistAgentName,
+                                agentDepth = persistAgentDepth,
+                            )
+                        }
+
                         // Check error rate abort (hard abort — same threshold as tool-calls branch).
                         if (errorTracker.shouldAbort(config.errorRateThreshold)) {
                             val result = TurnResult(
@@ -1139,7 +1233,7 @@ class AgentTurnLoop(
                                 cost = totalCost,
                                 toolsUsed = usedTools.distinct()
                             )
-                            return turnFinalizer.completeTurn(taskId, result, listener, runId, parentRunId, depth, persistAssistantMessage = true, metadata = subagentMetadata)
+                            return turnFinalizer.completeTurn(taskId, result, listener, runId, parentRunId, depth, persistAssistantMessage = true, metadata = subagentMetadata, agentName = persistAgentName, agentDepth = persistAgentDepth)
                         }
 
                         val shouldRunTaskVerification =
@@ -1178,7 +1272,9 @@ class AgentTurnLoop(
                                         taskId = taskId,
                                         role = MessageRole.SYSTEM,
                                         content = decision.nudge,
-                                        toolCalls = null
+                                        toolCalls = null,
+                                        agentName = persistAgentName,
+                                        agentDepth = persistAgentDepth,
                                     )
                                     continue
                                 }
@@ -1203,7 +1299,9 @@ class AgentTurnLoop(
                             toolCalls = null,
                             tokensIn = llmResponse.usage.inputTokens,
                             tokensOut = llmResponse.usage.outputTokens,
-                            cost = llmResponse.cost
+                            cost = llmResponse.cost,
+                            agentName = persistAgentName,
+                            agentDepth = persistAgentDepth,
                         )
 
                         val result = TurnResult(
@@ -1223,7 +1321,7 @@ class AgentTurnLoop(
                             "iterations" to iteration.toString(),
                             "agentName" to (profileOverrides?.subagentName ?: "default")
                         ))
-                        val finalResult = turnFinalizer.completeTurn(taskId, result, listener, runId, parentRunId, depth, persistAssistantMessage = false, metadata = subagentMetadata)
+                        val finalResult = turnFinalizer.completeTurn(taskId, result, listener, runId, parentRunId, depth, persistAssistantMessage = false, metadata = subagentMetadata, agentName = persistAgentName, agentDepth = persistAgentDepth)
                         updateTurnState { TurnStateSnapshot() }
                         return finalResult
                     }
@@ -1297,7 +1395,7 @@ class AgentTurnLoop(
                 cost = totalCost,
                 toolsUsed = usedTools.distinct()
             )
-            val finalResult = turnFinalizer.completeTurn(taskId, result, listener, runId, parentRunId, depth, persistAssistantMessage = true, metadata = subagentMetadata)
+            val finalResult = turnFinalizer.completeTurn(taskId, result, listener, runId, parentRunId, depth, persistAssistantMessage = true, metadata = subagentMetadata, agentName = persistAgentName, agentDepth = persistAgentDepth)
             updateTurnState { TurnStateSnapshot() }
             return finalResult
         } catch (e: CancellationException) {
@@ -1317,7 +1415,7 @@ class AgentTurnLoop(
                 cost = totalCost,
                 toolsUsed = usedTools.distinct()
             )
-            val finalResult = turnFinalizer.completeTurn(taskId, result, listener, runId, parentRunId, depth, persistAssistantMessage = true, metadata = subagentMetadata)
+            val finalResult = turnFinalizer.completeTurn(taskId, result, listener, runId, parentRunId, depth, persistAssistantMessage = true, metadata = subagentMetadata, agentName = persistAgentName, agentDepth = persistAgentDepth)
             updateTurnState { TurnStateSnapshot() }
             return finalResult
         }
@@ -1340,7 +1438,7 @@ class AgentTurnLoop(
             cost = totalCost,
             toolsUsed = usedTools.distinct()
         )
-        val finalResult = turnFinalizer.completeTurn(taskId, result, listener, runId, parentRunId, depth, persistAssistantMessage = true, metadata = subagentMetadata)
+        val finalResult = turnFinalizer.completeTurn(taskId, result, listener, runId, parentRunId, depth, persistAssistantMessage = true, metadata = subagentMetadata, agentName = persistAgentName, agentDepth = persistAgentDepth)
         updateTurnState { TurnStateSnapshot() }
         return finalResult
     }
@@ -1515,14 +1613,6 @@ class AgentTurnLoop(
     }
 }
 
-/**
- * Turn prompt data class.
- */
-data class TurnPrompt(
-    val systemPrompt: String,
-    val messages: List<LLMMessage>
-)
-
 /**
  * Turn result data class.
  */
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/services/ChatService.kt b/core/src/main/kotlin/pl/jclab/refio/core/services/ChatService.kt
index 0ee4c178..d86c6ae3 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/services/ChatService.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/services/ChatService.kt
@@ -51,7 +51,6 @@ class ChatService(
     private val toolDescriptionBuilder: ToolDescriptionBuilder,
     private val contextService: ContextService? = null,
     private val projectRoot: java.nio.file.Path? = null,
-    private val ideProject: Any? = null
 ) {
     private val fallbackProjectId: String =
         projectRoot?.let { ProjectIdGenerator.generate(it) } ?: LEGACY_PROJECT_ID
@@ -171,7 +170,6 @@ class ChatService(
                 val projectContext = contextService.buildProjectContext(
                     projectRoot = projectRoot,
                     taskId = task.id,
-                    project = ideProject,
                     query = request.input,
                     userContextRefs = allContextRefs  // Use combined refs (history + current request)
                 )
@@ -225,8 +223,8 @@ class ChatService(
 
         // Read UI state from config table (single source of truth)
         // UI state is global plugin state, not task-specific (saved by SessionManager)
-        val thinkingEnabled = configService.get(ConfigService.KEY_UI_THINKING_ENABLED)?.toBoolean() ?: false
-        val noEgressEnabled = configService.get(ConfigService.KEY_UI_NO_EGRESS_ENABLED)?.toBoolean() ?: false
+        val thinkingEnabled = configService.get(ConfigKeys.UI_THINKING_ENABLED.key)?.toBoolean() ?: false
+        val noEgressEnabled = configService.get(ConfigKeys.UI_NO_EGRESS_ENABLED.key)?.toBoolean() ?: false
 
         // Call LLM
         // Context is passed separately via contextContent parameter to ensure proper message order:
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/services/ConfigDefaultsInitializer.kt b/core/src/main/kotlin/pl/jclab/refio/core/services/ConfigDefaultsInitializer.kt
new file mode 100644
index 00000000..2db93c4e
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/services/ConfigDefaultsInitializer.kt
@@ -0,0 +1,100 @@
+package pl.jclab.refio.core.services
+
+import pl.jclab.refio.core.config.ConfigKeys
+import pl.jclab.refio.core.config.ConfigYaml
+import pl.jclab.refio.core.db.ConfigScope
+import pl.jclab.refio.core.db.repositories.ConfigRepository
+import pl.jclab.refio.core.llm.adapters.ZAIUrls
+import pl.jclab.refio.core.logging.dualLogger
+/**
+ * Owns the one-shot startup contract:
+ *  1. [initializeDefaults] — seed DB with built-in defaults (only keys not already set).
+ *  2. [loadFromYamlIfMissing] — merge user/project YAML into DB for missing keys only.
+ *  3. [reloadFromYaml] — force reload from YAML, overwriting DB (invoked via Settings UI button).
+ *
+ * Extracted from [ConfigService] so the facade isn't 1000+ LOC.
+ * Writes go through [configService.set] so cache invalidation stays consistent;
+ * YAML merges are delegated to the existing [ConfigYamlApplier] wired inside [ConfigService].
+ */
+internal class ConfigDefaultsInitializer(
+    private val configRepository: ConfigRepository,
+    private val applyYaml: (ConfigYaml, Boolean) -> Int,
+    private val invalidateAllCaches: () -> Unit,
+) {
+    private val logger = dualLogger("ConfigDefaultsInitializer")
+
+    fun initializeDefaults() {
+        logger.info { "Initializing default configuration values (only missing keys)" }
+
+        var initializedCount = 0
+        for ((key, value, description) in BUILTIN_DEFAULTS) {
+            if (configRepository.get(key, ConfigScope.APP) == null) {
+                configRepository.set(
+                    key = key,
+                    value = value,
+                    scope = ConfigScope.APP,
+                    taskId = null,
+                    description = description,
+                )
+                logger.info { "Initialized default: $key = $value" }
+                initializedCount++
+            }
+        }
+
+        logger.info { "Finished initializing defaults: $initializedCount keys set" }
+        invalidateAllCaches()
+    }
+
+    fun loadFromYamlIfMissing() {
+        val yamlConfig = ConfigYaml.load()
+        if (yamlConfig == null) {
+            logger.info { "No YAML config file found or failed to parse, skipping" }
+            return
+        }
+        logger.info { "Loading configuration from YAML file (only missing keys)" }
+        applyYaml(yamlConfig, false)
+        logger.info { "Finished loading configuration from YAML" }
+    }
+
+    fun reloadFromYaml(): Int {
+        val yamlConfig = ConfigYaml.load()
+            ?: throw IllegalStateException("No YAML config file found at ${ConfigYaml.getConfigPath().absolutePath}")
+
+        logger.info { "Reloading all configuration from YAML file (overwriting DB)" }
+        val updatedCount = applyYaml(yamlConfig, true)
+        logger.info { "Finished reloading configuration from YAML: $updatedCount keys updated" }
+        invalidateAllCaches()
+        return updatedCount
+    }
+
+    companion object {
+        /**
+         * (key, value, description) triples seeded on first run. Order is intentional —
+         * UI toggles first, then models, then feature flags, then RAG/context/agent knobs.
+         */
+        private val BUILTIN_DEFAULTS: List<Triple<String, String, String>> = listOf(
+            Triple(ConfigKeys.UI_THINKING_ENABLED.key, "false", "Show LLM thinking process in UI"),
+            Triple(ConfigKeys.UI_NO_EGRESS_ENABLED.key, "false", "Block external network calls"),
+            Triple(ConfigKeys.UI_ORCHESTRATION_ENABLED.key, "true", "Enable orchestration UI toggle"),
+            Triple(ConfigKeys.UI_MULTI_AGENT_STRATEGY.key, "SINGLE", "Multi-agent orchestration strategy: SINGLE, PARALLEL, PIPELINE, ORCHESTRATOR"),
+            Triple(ConfigKeys.UI_INTENT_CLASSIFICATION_ENABLED.key, "false", "Enable LLM intent classification"),
+            Triple(ConfigKeys.UI_EXECUTION_MODE.key, "AUTO", "Execution mode (AUTO/INTERACTIVE)"),
+            Triple(ConfigKeys.UI_SELECTED_MODE.key, "CHAT", "Selected task mode (CHAT/PLAN/AGENT)"),
+            Triple(ConfigKeys.EMBEDDING_MODEL.key, "ollama/nomic-embed-text", "Model for embeddings"),
+            Triple(ConfigKeys.FORMAT_MARKDOWN.key, "true", "Format responses as markdown"),
+            Triple(ConfigKeys.STREAMING_ENABLED.key, "true", "Enable streaming responses"),
+            Triple(ConfigKeys.ADVANCED_VIEW.key, "false", "Show advanced UI options"),
+            Triple(ConfigKeys.TOOL_SUMMARY_ENABLED.key, "true", "Enable tool result summarization"),
+            Triple(ConfigKeys.TOOL_SUMMARY_MIN_LENGTH.key, "500", "Minimum tool output length for summarization"),
+            Triple(ConfigKeys.SECURITY_ALLOW_SYMLINKS.key, "false", "Allow symbolic links in PathSandbox (unsafe, opt-in)"),
+            Triple(ConfigKeys.PROVIDER_ZAI_BASE_URL.key, ZAIUrls.DEFAULT, "Base URL for Z.AI provider"),
+            Triple(ConfigKeys.RAG_EMBEDDING_CACHE_SIZE.key, ConfigKeys.RAG_EMBEDDING_CACHE_SIZE.default.toString(), "Maximum embedding cache entries"),
+            Triple(ConfigKeys.RAG_CHUNKING_MODE.key, ConfigKeys.RAG_CHUNKING_MODE.default, "RAG chunking mode (semantic or line_based)"),
+            Triple(ConfigKeys.RAG_SEARCH_CACHE_TTL_SECONDS.key, ConfigKeys.RAG_SEARCH_CACHE_TTL_SECONDS.default.toString(), "TTL for cached @codebase search results in seconds"),
+            Triple(ConfigKeys.WORKING_MEMORY_MAX_FACTS.key, ConfigKeys.WORKING_MEMORY_MAX_FACTS.default.toString(), "Maximum working memory facts stored per task"),
+            Triple(ConfigKeys.TASK_VERIFICATION_ENABLED.key, "false", "Enable task completion verification for AGENT mode"),
+            Triple(ConfigKeys.MAX_CONSECUTIVE_TOOL_ERRORS.key, ConfigKeys.MAX_CONSECUTIVE_TOOL_ERRORS.default.toString(), "Max consecutive failures of the same tool+args before aborting (definitive loop). Varied args reset the counter."),
+            Triple(ConfigKeys.JSON_THINKING_XML_TAGS.key, ConfigKeys.JSON_THINKING_XML_TAGS.default.joinToString(","), "Comma-separated XML tags stripped before JSON extraction (e.g., thinking,think)"),
+        )
+    }
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/services/ConfigResolver.kt b/core/src/main/kotlin/pl/jclab/refio/core/services/ConfigResolver.kt
new file mode 100644
index 00000000..ac79741d
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/services/ConfigResolver.kt
@@ -0,0 +1,131 @@
+package pl.jclab.refio.core.services
+
+import pl.jclab.refio.core.config.ConfigKey
+import pl.jclab.refio.core.config.ConfigKeys
+import pl.jclab.refio.core.config.HierarchicalConfigLoader
+import pl.jclab.refio.core.db.ConfigScope
+import pl.jclab.refio.core.db.repositories.ConfigRepository
+
+/**
+ * Pure lookup/write layer for configuration values.
+ *
+ * Owns the hierarchy resolution (TASK > PROJECT > APP > YAML > default), the
+ * cache, and cache invalidation — factored out of [ConfigService] so the
+ * facade keeps only the high-level helpers and delegates lookup concerns here.
+ */
+internal class ConfigResolver(
+    private val configRepository: ConfigRepository,
+    private val yamlLoader: HierarchicalConfigLoader,
+    private val cache: ConfigCache,
+    private val defaultProjectId: String?,
+) {
+
+    /**
+     * Get a typed configuration value using a [ConfigKey] descriptor.
+     *
+     * Lookup order (highest priority first):
+     * 1. Database value (task-scoped, then project-scoped, then app-scoped)
+     * 2. YAML config value via the key's yamlAccessor
+     * 3. The key's built-in default
+     */
+    fun <T> getTyped(configKey: ConfigKey<T>, taskId: String? = null): T {
+        val cacheKey = "typed:${configKey.key}:task=${taskId.orEmpty()}"
+        return cache.getOrCompute(cacheKey) {
+            val dbConfig = getConfigWithPrecedence(key = configKey.key, taskId = taskId)
+            if (dbConfig?.value != null) {
+                val parsed = configKey.parser(dbConfig.value)
+                if (parsed != null) return@getOrCompute parsed
+            }
+
+            val yamlValue = configKey.yamlAccessor?.invoke(yamlLoader)
+            if (yamlValue != null) {
+                val parsed = configKey.parser(yamlValue.toString())
+                if (parsed != null) return@getOrCompute parsed
+            }
+
+            configKey.default
+        }
+    }
+
+    fun <T> setTyped(configKey: ConfigKey<T>, value: T, scope: ConfigScope = ConfigScope.APP, taskId: String? = null) {
+        val serialized = configKey.serializer(value)
+        configRepository.set(
+            key = configKey.key,
+            value = serialized,
+            scope = scope,
+            taskId = taskId,
+            description = null,
+        )
+        invalidate(configKey.key)
+    }
+
+    /**
+     * Raw string lookup with hierarchy resolution.
+     *
+     * Useful for dynamic keys not backed by a [ConfigKey] descriptor (provider
+     * keys, etc.). Returns null when no DB or YAML value exists.
+     */
+    fun get(
+        key: String,
+        scope: ConfigScope = ConfigScope.APP,
+        taskId: String? = null,
+        projectId: String? = null,
+    ): String? {
+        val resolvedProject = resolveProjectId(projectId)
+        val cacheKey = "raw:$key:scope=${scope.name}:task=${taskId.orEmpty()}:project=${resolvedProject.orEmpty()}"
+        return cache.getOrCompute(cacheKey) {
+            val dbConfig = when {
+                taskId != null -> getConfigWithPrecedence(key = key, taskId = taskId, projectId = projectId)
+                scope == ConfigScope.PROJECT ->
+                    resolvedProject?.let { configRepository.get(key, ConfigScope.PROJECT, projectId = it) }
+                else -> configRepository.get(key, scope)
+            }
+            dbConfig?.value ?: getFromYaml(key)
+        }
+    }
+
+    fun getFromYaml(key: String): String? {
+        val cacheKey = "yaml:$key"
+        return cache.getOrCompute(cacheKey) {
+            val configKey = ConfigKeys.byKey(key) ?: return@getOrCompute null
+            configKey.yamlAccessor?.invoke(yamlLoader)?.toString()
+        }
+    }
+
+    fun set(
+        key: String,
+        value: String,
+        scope: ConfigScope = ConfigScope.APP,
+        taskId: String? = null,
+        projectId: String? = null,
+    ) {
+        val resolvedProjectId = resolveProjectId(projectId)
+        configRepository.set(
+            key = key,
+            value = value,
+            scope = scope,
+            projectId = if (scope == ConfigScope.PROJECT) resolvedProjectId else null,
+            taskId = taskId,
+            description = null,
+        )
+        invalidate(key)
+    }
+
+    fun getConfigWithPrecedence(
+        key: String,
+        taskId: String? = null,
+        projectId: String? = null,
+    ) = configRepository.getWithPrecedence(
+        key = key,
+        taskId = taskId,
+        projectId = resolveProjectId(projectId),
+    )
+
+    fun invalidate(key: String) {
+        cache.invalidateByPrefix("typed:$key:")
+        cache.invalidateByPrefix("raw:$key:")
+        cache.invalidate("yaml:$key")
+    }
+
+    private fun resolveProjectId(projectId: String?): String? = projectId ?: defaultProjectId
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/services/ConfigService.kt b/core/src/main/kotlin/pl/jclab/refio/core/services/ConfigService.kt
index 5bd1e5ca..edb5503d 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/services/ConfigService.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/services/ConfigService.kt
@@ -5,19 +5,10 @@ import pl.jclab.refio.core.config.ConfigKey
 import pl.jclab.refio.core.config.ConfigKeys
 import pl.jclab.refio.core.config.ConfigYaml
 import pl.jclab.refio.core.config.HierarchicalConfigLoader
-import pl.jclab.refio.core.config.TerminalCommandConfig
-import pl.jclab.refio.core.config.TerminalConfig
-import pl.jclab.refio.core.config.TerminalWhitelistConfig
 import pl.jclab.refio.core.db.ConfigScope
 import pl.jclab.refio.core.db.repositories.ConfigRepository
-import pl.jclab.refio.core.llm.getModelConfigFromCache
 import pl.jclab.refio.core.services.context.ContextBudget
-import pl.jclab.refio.core.services.context.ContextSection
-import pl.jclab.refio.core.tools.security.AllowedCommand
-import pl.jclab.refio.core.tools.security.CommandWhitelistConfig
-import pl.jclab.refio.core.tools.security.CommandWhitelistDefaults
-import pl.jclab.refio.core.tools.security.WhitelistMode
-import pl.jclab.refio.core.utils.GsonInstance.gson
+import pl.jclab.refio.core.subagents.BuiltinSubagentOverrides
 import pl.jclab.refio.core.logging.dualLogger
 import java.nio.file.Path
 
@@ -33,7 +24,7 @@ import java.nio.file.Path
  * Handles default model selection per logical operation slot.
  */
 class ConfigService(
-    private val configRepository: ConfigRepository,
+    internal val configRepository: ConfigRepository,
     private val defaultProjectId: String? = null,
     private val projectRoot: Path? = null
 ) {
@@ -44,60 +35,24 @@ class ConfigService(
      * Hierarchical config loader for YAML files.
      * Handles user and project config files.
      */
-    private val yamlLoader: HierarchicalConfigLoader by lazy {
+    internal val yamlLoader: HierarchicalConfigLoader by lazy {
         HierarchicalConfigLoader.getInstance(projectRoot)
     }
 
-    /**
-     * Get a typed configuration value using a [ConfigKey] descriptor.
-     *
-     * Lookup order (highest priority first):
-     * 1. Database value (task-scoped, then project-scoped, then app-scoped)
-     * 2. YAML config value via the key's yamlAccessor
-     * 3. The key's built-in default
-     *
-     * @param configKey Typed key descriptor containing key, parser, default, and yaml accessor
-     * @param taskId Optional task ID for task-level override
-     * @return Parsed value of type T, or the key's default if not found / unparseable
-     */
-    fun <T> getTyped(configKey: ConfigKey<T>, taskId: String? = null): T {
-        val cacheKey = "typed:${configKey.key}:task=${taskId.orEmpty()}"
-        return configCache.getOrCompute(cacheKey) {
-            val dbConfig = getConfigWithPrecedence(key = configKey.key, taskId = taskId)
-            if (dbConfig?.value != null) {
-                val parsed = configKey.parser(dbConfig.value)
-                if (parsed != null) return@getOrCompute parsed
-            }
+    private val resolver = ConfigResolver(
+        configRepository = configRepository,
+        yamlLoader = yamlLoader,
+        cache = configCache,
+        defaultProjectId = defaultProjectId,
+    )
 
-            val yamlValue = configKey.yamlAccessor?.invoke(yamlLoader)
-            if (yamlValue != null) {
-                val parsed = configKey.parser(yamlValue.toString())
-                if (parsed != null) return@getOrCompute parsed
-            }
+    private val modelSelectionService = ModelSelectionService(this)
 
-            configKey.default
-        }
-    }
+    fun <T> getTyped(configKey: ConfigKey<T>, taskId: String? = null): T =
+        resolver.getTyped(configKey, taskId)
 
-    /**
-     * Set a typed configuration value using a [ConfigKey] descriptor.
-     *
-     * @param configKey Typed key descriptor containing key and serializer
-     * @param value The value to store
-     * @param scope Configuration scope (default: APP)
-     * @param taskId Optional task ID for TASK scope
-     */
-    fun <T> setTyped(configKey: ConfigKey<T>, value: T, scope: ConfigScope = ConfigScope.APP, taskId: String? = null) {
-        val serialized = configKey.serializer(value)
-        configRepository.set(
-            key = configKey.key,
-            value = serialized,
-            scope = scope,
-            taskId = taskId,
-            description = null
-        )
-        invalidateConfigCache(configKey.key)
-    }
+    fun <T> setTyped(configKey: ConfigKey<T>, value: T, scope: ConfigScope = ConfigScope.APP, taskId: String? = null) =
+        resolver.setTyped(configKey, value, scope, taskId)
 
     companion object {
         const val INHERIT_MODEL_VALUE = "inherit"
@@ -105,494 +60,45 @@ class ConfigService(
         /** Context sizes at or below this threshold trigger compact (shorter) prompts. */
         const val COMPACT_PROMPT_THRESHOLD = 48_000
 
-        // Configuration keys
-        const val KEY_DEFAULT_MODEL_CHAT = "default_model.chat"
-        const val KEY_DEFAULT_MODEL_PLAN = "default_model.plan"
-        const val KEY_DEFAULT_MODEL_AGENT = "default_model.agent"
-        const val KEY_WEAK_MODEL = "default_model.weak"  // Cheap model for auxiliary operations (summaries, etc.)
-        const val KEY_STRONG_MODEL = "default_model.strong"  // Powerful model for complex delegation
-        const val KEY_MODELS_VISIBILITY = "models.visibility"
-
-        // Limits configuration keys
-        const val KEY_API_CALL_TIMEOUT = "limits.api_call_timeout"
-        const val KEY_STREAMING_READ_TIMEOUT = "limits.streaming_read_timeout_sec"
-        const val KEY_STREAMING_REQUEST_TIMEOUT = "limits.streaming_request_timeout_sec"
-        const val KEY_TOOL_EXECUTION_TIMEOUT = "limits.tool_execution_timeout"
-        const val KEY_MAX_CONTEXT_SIZE = "limits.max_context_size"
-        const val KEY_MAX_OUTPUT_SIZE = "limits.max_output_size"
-        const val KEY_MAX_FILE_SIZE = "limits.max_file_size"
-        const val KEY_MAX_RETRIES = "limits.max_retries"
-        const val KEY_RATE_LIMIT_RPM = "limits.rate_limit_rpm"
-        const val KEY_RETRY_DELAY_MS = "limits.retry_delay_ms"
-
-        // Orchestration configuration keys (US-028)
-        const val KEY_ORCHESTRATION_ENABLED = "orchestration.enabled"
-
-        // UI configuration keys
-        const val KEY_UI_THINKING_ENABLED = "ui.thinking_enabled"
-        const val KEY_UI_NO_EGRESS_ENABLED = "ui.no_egress_enabled"
-        const val KEY_UI_ORCHESTRATION_ENABLED = "ui.orchestration_enabled"
-        const val KEY_UI_MULTI_AGENT_STRATEGY = "ui.multi_agent_strategy"
-        const val KEY_UI_INTENT_CLASSIFICATION_ENABLED = "ui.intent_classification_enabled"
-        const val KEY_UI_EXECUTION_MODE = "ui.execution_mode"
-        const val KEY_UI_SELECTED_MODE = "ui.selected_mode"
-        const val KEY_UI_SELECTED_MODEL = "ui.selected_model"
-
-        // Models configuration keys
-        const val KEY_EMBEDDING_MODEL = "models.embedding_model"
+        /** Default assumed context size when model metadata is unavailable. */
+        const val DEFAULT_CONTEXT_SIZE = 32768
 
-        // RAG configuration keys
-        const val KEY_RAG_ENABLED = "rag.enabled"
-        const val KEY_RAG_AUTO_INDEX_ON_CONTEXT = "rag.auto_index_on_context_build"
-        const val KEY_RAG_INDEX_ON_STARTUP = "rag.index_on_startup"
-        const val KEY_RAG_MAX_FILE_SIZE_MB = "rag.max_file_size_mb"
-        const val KEY_RAG_CACHE_TTL_MS = "rag.cache_ttl_ms"
-        const val KEY_RAG_MAX_CONCURRENT_JOBS = "rag.max_concurrent_jobs"
-        const val KEY_RAG_MAX_CHUNKS_PER_FILE = "rag.max_chunks_per_file"
-        const val KEY_OLLAMA_MAX_CONCURRENT = "providers.ollama_max_concurrent"
-        const val KEY_RAG_INDEX_BATCH_SIZE = "rag.index_batch_size"
-        const val KEY_RAG_EMBEDDINGS_BATCH_SIZE = "rag.embeddings_batch_size"
-        const val KEY_RAG_EMBEDDING_CACHE_SIZE = "rag.embedding_cache_size"
-        const val KEY_RAG_IGNORED_DIRECTORIES = "rag.ignored_directories"
-        const val KEY_RAG_CHUNKING_MODE = "rag.chunking_mode"
-        const val KEY_RAG_SEARCH_SIMILARITY_THRESHOLD = "rag.search_similarity_threshold"
-        const val KEY_RAG_SEARCH_TOP_K = "rag.search_top_k"
-        const val KEY_RAG_SEARCH_CACHE_TTL_SECONDS = "rag.search.cache_ttl_seconds"
-        const val KEY_RAG_SEARCH_HYBRID_ENABLED = "rag.search_hybrid_enabled"
-        const val KEY_RAG_SEARCH_SEMANTIC_WEIGHT = "rag.search_semantic_weight"
-        const val KEY_RAG_SEARCH_INCLUDE_CONTEXT_CHUNKS = "rag.search_include_context_chunks"
-        const val KEY_PROJECT_ANALYSIS_MAX_FILES = "project_analysis.max_files"
-        const val KEY_PROJECT_ANALYSIS_FINGERPRINT_LIMIT = "project_analysis.fingerprint_limit"
-        const val KEY_PROJECT_ANALYSIS_CACHE_TTL_MS = "project_analysis.cache_ttl_ms"
-
-        // General configuration keys
-        const val KEY_FORMAT_MARKDOWN = "general.format_markdown"
-        const val KEY_STREAMING_ENABLED = "general.streaming_enabled"
-        const val KEY_ADVANCED_VIEW = "general.advanced_view"
-
-        // Advanced configuration keys
-        const val KEY_AUTO_OPTIMIZE_PERCENTAGE = "advanced.auto_optimize_percentage"
-        const val KEY_NO_EGRESS_DEFAULT = "advanced.no_egress_default"
-        const val KEY_READ_ONLY_MODE = "advanced.read_only_mode"
-
-        // Tools configuration keys
-        const val KEY_TOOLS_PERMISSIONS = "tools.permissions"
-        const val KEY_TOOL_PERMISSION_RUN_TERMINAL = "tools.permission_run_terminal_command"
-        const val KEY_TERMINAL_WHITELIST = "terminal.whitelist"
-        const val KEY_TERMINAL_WHITELIST_ENABLED = "terminal.whitelist.enabled"
-        const val KEY_TERMINAL_WHITELIST_MODE = "terminal.whitelist.mode"
-
-        // Tool result summarization configuration keys
-        const val KEY_TOOL_SUMMARY_ENABLED = "tool_summary.enabled"
-        const val KEY_TOOL_SUMMARY_MIN_LENGTH = "tool_summary.min_length"
-        const val KEY_SECURITY_ALLOW_SYMLINKS = "security.allow_symlinks"
-
-        // Context configuration keys (ADR 0017)
-        const val KEY_RECENT_WORK_FULL_DATA_LIMIT = "context.recent_work.full_data_limit"
-        const val KEY_RECENT_WORK_SUMMARY_MAX_LENGTH = "context.recent_work.summary_max_length"
-        const val KEY_CONTEXT_BUDGET_TOTAL_TOKENS = "context.budget.total_tokens"
-        const val KEY_CONTEXT_BUDGET_INPUT_RATIO = "context.budget.input_ratio"
-        const val KEY_CONTEXT_BUDGET_SECTION_PREFIX = "context.budget.section."
-        const val KEY_WORKING_MEMORY_MAX_FACTS = "working_memory.max_facts"
-
-        // Subagents configuration keys
-        const val KEY_SUBAGENTS_BUILTIN_ENABLED = "subagents.builtin_enabled"
-
-        // Provider configuration key prefixes (dynamic keys with provider name)
+        // Prefixes for dynamic key searches (not single keys, can't live in ConfigKeys registry).
         const val KEY_PREFIX_PROVIDERS = "providers."
-        const val KEY_PROVIDER_OLLAMA_ENDPOINT = "providers.ollama.ollama_endpoint"
-        const val KEY_PROVIDER_OLLAMA_CONTEXT_SIZE = "providers.ollama.ollama_context_size"
-        const val KEY_PROVIDER_OLLAMA_KEEP_ALIVE = "providers.ollama.ollama_keep_alive"
-        const val KEY_PROVIDER_ANTHROPIC_API_KEY = "providers.anthropic.anthropic_api_key"
-        const val KEY_PROVIDER_OPENAI_API_KEY = "providers.openai.openai_api_key"
-        const val KEY_PROVIDER_OPENROUTER_API_KEY = "providers.openrouter.openrouter_api_key"
-        const val KEY_PROVIDER_GEMINI_API_KEY = "providers.gemini.gemini_api_key"
-        const val KEY_PROVIDER_LM_STUDIO_API_KEY = "providers.lmstudio.lmstudio_api_key"
-        const val KEY_PROVIDER_LM_STUDIO_BASE_URL = "providers.lmstudio.lmstudio_base_url"
-        const val KEY_PROVIDER_LM_STUDIO_CONTEXT_SIZE = "providers.lmstudio.lmstudio_context_size"
-        const val KEY_PROVIDER_CUSTOM_OPENAI_API_KEY = "providers.custom_openai.custom_openai_api_key"
-        const val KEY_PROVIDER_CUSTOM_OPENAI_BASE_URL = "providers.custom_openai.custom_openai_base_url"
-        const val KEY_PROVIDER_CUSTOM_OPENAI_MODEL = "providers.custom_openai.custom_openai_model"
-        const val KEY_PROVIDER_ZAI_API_KEY = "providers.zai.zai_api_key"
-        const val KEY_PROVIDER_ZAI_BASE_URL = "providers.zai.zai_base_url"
+        const val KEY_CONTEXT_BUDGET_SECTION_PREFIX = "context.budget.section."
 
-        // Fallback defaults (used when no config exists)
+        // Hard fallbacks used when neither DB nor YAML provide a value (last resort).
         const val FALLBACK_MODEL = "qwen2.5:7b"
         const val FALLBACK_PROVIDER = "ollama"
         const val FALLBACK_WEAK_MODEL = "qwen2.5:7b"
         const val FALLBACK_WEAK_PROVIDER = "ollama"
         const val FALLBACK_EMBEDDING_MODEL = "nomic-embed-text"
         const val FALLBACK_EMBEDDING_PROVIDER = "ollama"
-        const val DEFAULT_ZAI_BASE_URL = "https://api.z.ai/api/coding/paas/v4"
-        const val LEGACY_ZAI_BASE_URL = "https://api.z.ai/v1"
-        const val GENERAL_ZAI_BASE_URL = "https://api.z.ai/api/paas/v4"
-
-        // Limit defaults
-        const val DEFAULT_API_CALL_TIMEOUT = 360 // seconds
-        const val DEFAULT_STREAMING_READ_TIMEOUT = 360
-        const val DEFAULT_STREAMING_REQUEST_TIMEOUT = 1800
-        const val DEFAULT_TOOL_EXECUTION_TIMEOUT = 360 // seconds
-        const val DEFAULT_CONTEXT_SIZE = 32768 // tokens
-        const val DEFAULT_MAX_CONTEXT_SIZE = 128000 // tokens
-        const val DEFAULT_MAX_OUTPUT_SIZE = 16384 // tokens
-        const val DEFAULT_MAX_FILE_SIZE = 10 // MB
-        const val DEFAULT_ORCHESTRATION_ENABLED = true
-        const val DEFAULT_RAG_MAX_FILE_SIZE_MB = 2L
-        const val DEFAULT_RAG_INDEX_ON_STARTUP = true
-        const val DEFAULT_RAG_INDEX_BATCH_SIZE = 10
-        const val DEFAULT_RAG_EMBEDDING_BATCH_SIZE = 50
-        const val DEFAULT_RAG_EMBEDDING_CACHE_SIZE = 2_000
-        const val DEFAULT_RAG_CACHE_TTL_MS = 300_000L
-        const val DEFAULT_RAG_MAX_CONCURRENT_JOBS = 4
-        const val DEFAULT_RAG_MAX_CHUNKS_PER_FILE = 100
-        const val DEFAULT_RAG_CHUNKING_MODE = "semantic"
-        const val DEFAULT_RAG_SEARCH_SIMILARITY_THRESHOLD = 0.65f
-        const val DEFAULT_RAG_SEARCH_TOP_K = 5
-        const val DEFAULT_RAG_SEARCH_CACHE_TTL_SECONDS = 60L
-        const val DEFAULT_RAG_SEARCH_HYBRID_ENABLED = false
-        const val DEFAULT_RAG_SEARCH_SEMANTIC_WEIGHT = 0.7f
-        const val DEFAULT_RAG_SEARCH_INCLUDE_CONTEXT_CHUNKS = false
-        val DEFAULT_RAG_IGNORED_DIRECTORIES = listOf(
-            ".git",
-            ".idea",
-            ".vscode",
-            ".gradle",
-            ".claude",
-            ".continue",
-            ".github",
-            ".refio",
-            ".codex",
-            ".junie",
-            ".husky",
-            ".vscode",
-            "node_modules",
-            "build",
-            "dist",
-            "out",
-            "target",
-            "__pycache__",
-            ".venv",
-            "*.log",
-            "*.tmp",
-            "Agents.md",
-            "CLAUDE.md",
-            "GEMINI.md",
-            ".gitignore",
-            ".aiignore",
-        )
-        const val DEFAULT_PROJECT_ANALYSIS_MAX_FILES = 400
-        const val DEFAULT_PROJECT_ANALYSIS_FINGERPRINT_LIMIT = 2000
-        const val DEFAULT_PROJECT_ANALYSIS_CACHE_TTL_MS = 600_000L
-
-        // Tool result summarization defaults
-        const val DEFAULT_TOOL_SUMMARY_ENABLED = true
-        const val DEFAULT_TOOL_SUMMARY_MIN_LENGTH = 500
-
-        // Context defaults (ADR 0017)
-        const val DEFAULT_RECENT_WORK_FULL_DATA_LIMIT = 5
-        const val DEFAULT_RECENT_WORK_SUMMARY_MAX_LENGTH = 1000
-        const val DEFAULT_CONTEXT_BUDGET_INPUT_RATIO = 0.85
-        const val DEFAULT_WORKING_MEMORY_MAX_FACTS = 20
-
-        // Agent flow defaults (ADR 0019) — single source of truth is ConfigKeys
-        val DEFAULT_TASK_VERIFICATION_ENABLED get() = ConfigKeys.TASK_VERIFICATION_ENABLED.default
-        val DEFAULT_MAX_CONSECUTIVE_TOOL_ERRORS get() = ConfigKeys.MAX_CONSECUTIVE_TOOL_ERRORS.default
-        val DEFAULT_MAX_ITERATIONS get() = ConfigKeys.MAX_ITERATIONS.default
-        val DEFAULT_JSON_THINKING_XML_TAGS get() = ConfigKeys.JSON_THINKING_XML_TAGS.default.joinToString(",")
-
-        // Agent flow configuration keys (ADR 0019) — single source of truth is ConfigKeys
-        val KEY_TASK_VERIFICATION_ENABLED get() = ConfigKeys.TASK_VERIFICATION_ENABLED.key
-        val KEY_MAX_CONSECUTIVE_TOOL_ERRORS get() = ConfigKeys.MAX_CONSECUTIVE_TOOL_ERRORS.key
-        val KEY_MAX_ITERATIONS get() = ConfigKeys.MAX_ITERATIONS.key
         val KEY_JSON_THINKING_XML_TAGS get() = ConfigKeys.JSON_THINKING_XML_TAGS.key
     }
 
     /**
      * Get the logical model to use for a request.
      *
-     * This method centralizes model selection logic:
-     * 1. Check if user selected a specific model in UI (ui.selected_model)
-     * 2. If yes and not "Auto" -> return that model for ALL operations
-     * 3. If "Auto" or not set -> return operation-specific default model
-     *
-     * @param operation Model operation type (DEFAULT, PLAN, CODING, WEAK, EMBEDDING)
-     * @param taskId Optional task ID for task-level override
-     * @return Pair of (model_id, provider) to use for the request
+     * Delegates to [ModelSelectionService].
      */
     fun getModel(
         operation: ModelOperation,
         taskId: String? = null,
         projectId: String? = null
-    ): Pair<String, String> {
-        // Check if user selected a specific model in UI
-        val selectedModel = get(
-            key = KEY_UI_SELECTED_MODEL,
-            taskId = taskId,
-            projectId = projectId
-        )
+    ): Pair<String, String> = modelSelectionService.getModel(operation, taskId, projectId)
 
-        if (selectedModel != null && selectedModel.isNotBlank() && !selectedModel.equals("auto", ignoreCase = true)) {
-            // User selected a specific model -> use it for ALL operations
-            val (providerFromString, modelIdFromString) = parseModelString(selectedModel)
-
-            logger.info { "Using user-selected model for $operation: $modelIdFromString (provider=$providerFromString)" }
-            return Pair(modelIdFromString, providerFromString)
-        }
-
-        // User selected "Auto" or no selection -> use operation-specific default
-        return getDefaultModel(operation, taskId, projectId)
-    }
-
-    /**
-     * Parse model string that might be in format "provider/model" or just "model".
-     *
-     * Examples:
-     * - "ollama/qwen2.5:7b" -> ("ollama", "qwen2.5:7b")
-     * - "qwen2.5:7b" -> ("ollama", "qwen2.5:7b") // fallback to ollama
-     * - "gpt-4.1-mini" -> ("openai", "gpt-4.1-mini") // fallback to openai
-     *
-     * @param modelString Model identifier, optionally prefixed with provider
-     * @return Pair of (provider, model_id)
-     */
-    private fun parseModelString(modelString: String): Pair<String, String> {
-        if (modelString.contains("/")) {
-            val parts = modelString.split("/", limit = 2)
-            return Pair(parts[0], parts[1])
-        }
-
-        // Infer provider from model name patterns
-        val provider = when {
-            modelString.startsWith("gpt-") -> "openai"
-            modelString.startsWith("glm-") -> "zai"
-            modelString.startsWith("claude-") -> "anthropic"
-            else -> FALLBACK_PROVIDER // Default to ollama for unknown models
-        }
-
-        return Pair(provider, modelString)
-    }
-
-    private fun configKeyForOperation(operation: ModelOperation): String = when (operation) {
-        ModelOperation.DEFAULT -> KEY_DEFAULT_MODEL_CHAT
-        ModelOperation.PLAN -> KEY_DEFAULT_MODEL_PLAN
-        ModelOperation.CODING -> KEY_DEFAULT_MODEL_AGENT
-        ModelOperation.WEAK -> KEY_WEAK_MODEL
-        ModelOperation.EMBEDDING -> KEY_EMBEDDING_MODEL
-        ModelOperation.STRONG -> KEY_STRONG_MODEL
-    }
-
-    private fun isInheritedModelConfig(data: ModelConfigData): Boolean {
-        return data.modelId.equals(INHERIT_MODEL_VALUE, ignoreCase = true) &&
-                data.provider.equals(INHERIT_MODEL_VALUE, ignoreCase = true)
-    }
-
-    private fun fallbackModelForOperation(operation: ModelOperation): Pair<String, String> = when (operation) {
-        ModelOperation.DEFAULT,
-        ModelOperation.PLAN,
-        ModelOperation.CODING -> Pair(FALLBACK_MODEL, FALLBACK_PROVIDER)
-        ModelOperation.WEAK -> Pair(FALLBACK_WEAK_MODEL, FALLBACK_WEAK_PROVIDER)
-        ModelOperation.EMBEDDING -> Pair(FALLBACK_EMBEDDING_MODEL, FALLBACK_EMBEDDING_PROVIDER)
-        ModelOperation.STRONG -> throw IllegalStateException("STRONG model has no fallback — must be explicitly configured")
-    }
-
-    /**
-     * Get default model_id and provider for given mode.
-     *
-     * Configuration hierarchy:
-     * 1. Database value (highest priority)
-     * 2. YAML config (user + project)
-     * 3. Built-in fallback
-     *
-     * @param mode Task mode (chat, plan, agent)
-     * @param taskId Optional task ID for task-level override
-     * @return Pair of (model_id, provider)
-     */
     fun getDefaultModel(
         operation: ModelOperation,
         taskId: String? = null,
         projectId: String? = null
-    ): Pair<String, String> {
-        val key = configKeyForOperation(operation)
-
-        // 1. Check database first
-        val config = getConfigWithPrecedence(
-            key = key,
-            taskId = taskId,
-            projectId = projectId
-        )
-
-        when (operation) {
-            ModelOperation.EMBEDDING -> {
-                // Check DB first
-                if (config?.value != null) {
-                    val (provider, model) = parseModelString(config.value)
-                    logger.info { "Using embedding model from DB: $model (provider=$provider)" }
-                    return Pair(model, provider)
-                }
-                // Check YAML
-                val yamlModel = yamlLoader.getDefaultEmbeddingModel()
-                if (yamlModel != null) {
-                    val (provider, model) = parseModelString(yamlModel)
-                    logger.info { "Using embedding model from YAML: $model (provider=$provider)" }
-                    return Pair(model, provider)
-                }
-                return fallbackModelForOperation(operation)
-            }
-
-            ModelOperation.WEAK -> {
-                // Check DB first
-                if (config != null) {
-                    val data = gson.fromJson(config.value, ModelConfigData::class.java)
-                    if (isInheritedModelConfig(data)) {
-                        logger.info { "Using inherited weak model -> default model" }
-                        return getDefaultModel(ModelOperation.DEFAULT, taskId, projectId)
-                    }
-                    if (data.modelId != null && data.provider != null) {
-                        logger.info { "Using weak model from DB: ${data.modelId}" }
-                        return Pair(data.modelId, data.provider)
-                    }
-                }
-                // Check YAML
-                val yamlModel = yamlLoader.getDefaultWeakModel()
-                if (yamlModel != null) {
-                    val (provider, model) = parseModelString(yamlModel)
-                    logger.info { "Using weak model from YAML: $model (provider=$provider)" }
-                    return Pair(model, provider)
-                }
-            }
-
-            ModelOperation.DEFAULT -> {
-                if (config != null) {
-                    val data = gson.fromJson(config.value, ModelConfigData::class.java)
-                    if (data.modelId != null && data.provider != null) {
-                        logger.info { "Using chat model from DB: ${data.modelId}" }
-                        return Pair(data.modelId, data.provider)
-                    }
-                }
-                // Check YAML
-                val yamlModel = yamlLoader.getDefaultChatModel()
-                if (yamlModel != null) {
-                    val (provider, model) = parseModelString(yamlModel)
-                    logger.info { "Using chat model from YAML: $model (provider=$provider)" }
-                    return Pair(model, provider)
-                }
-            }
-
-            ModelOperation.PLAN -> {
-                if (config != null) {
-                    val data = gson.fromJson(config.value, ModelConfigData::class.java)
-                    if (isInheritedModelConfig(data)) {
-                        logger.info { "Using inherited plan model -> default model" }
-                        return getDefaultModel(ModelOperation.DEFAULT, taskId, projectId)
-                    }
-                    if (data.modelId != null && data.provider != null) {
-                        logger.info { "Using plan model from DB: ${data.modelId}" }
-                        return Pair(data.modelId, data.provider)
-                    }
-                }
-                // Check YAML
-                val yamlModel = yamlLoader.getDefaultPlanModel()
-                if (yamlModel != null) {
-                    val (provider, model) = parseModelString(yamlModel)
-                    logger.info { "Using plan model from YAML: $model (provider=$provider)" }
-                    return Pair(model, provider)
-                }
-            }
+    ): Pair<String, String> = modelSelectionService.getDefaultModel(operation, taskId, projectId)
 
-            ModelOperation.CODING -> {
-                if (config != null) {
-                    val data = gson.fromJson(config.value, ModelConfigData::class.java)
-                    if (isInheritedModelConfig(data)) {
-                        logger.info { "Using inherited coding model -> default model" }
-                        return getDefaultModel(ModelOperation.DEFAULT, taskId, projectId)
-                    }
-                    if (data.modelId != null && data.provider != null) {
-                        logger.info { "Using coding model from DB: ${data.modelId}" }
-                        return Pair(data.modelId, data.provider)
-                    }
-                }
-                // Check YAML
-                val yamlModel = yamlLoader.getDefaultCodingModel()
-                if (yamlModel != null) {
-                    val (provider, model) = parseModelString(yamlModel)
-                    logger.info { "Using coding model from YAML: $model (provider=$provider)" }
-                    return Pair(model, provider)
-                }
-            }
-
-            ModelOperation.STRONG -> {
-                if (config != null) {
-                    val data = gson.fromJson(config.value, ModelConfigData::class.java)
-                    if (isInheritedModelConfig(data)) {
-                        logger.info { "Using inherited strong model -> default model" }
-                        return getDefaultModel(ModelOperation.DEFAULT, taskId, projectId)
-                    }
-                    if (data.modelId != null && data.provider != null) {
-                        logger.info { "Using strong model from DB: ${data.modelId}" }
-                        return Pair(data.modelId, data.provider)
-                    }
-                }
-                val yamlModel = yamlLoader.getDefaultStrongModel()
-                if (yamlModel != null) {
-                    val (provider, model) = parseModelString(yamlModel)
-                    logger.info { "Using strong model from YAML: $model (provider=$provider)" }
-                    return Pair(model, provider)
-                }
-                // No fallback for STRONG — callers should use getStrongModel() which returns null
-                throw IllegalStateException("STRONG model not configured and has no fallback")
-            }
-        }
-
-        val fallback = fallbackModelForOperation(operation)
-        logger.info { "No config found for $operation, using fallback: ${fallback.first}" }
-        return fallback
-    }
-
-    /**
-     * Get the configured strong model, or null if not configured.
-     * Unlike other operations, STRONG has no fallback.
-     */
     fun getStrongModel(
         taskId: String? = null,
         projectId: String? = null
-    ): Pair<String, String>? {
-        val key = KEY_STRONG_MODEL
-
-        // 1. Check database
-        val config = getConfigWithPrecedence(key = key, taskId = taskId, projectId = projectId)
-        if (config != null) {
-            val data = gson.fromJson(config.value, ModelConfigData::class.java)
-            if (isInheritedModelConfig(data)) {
-                logger.info { "Using inherited strong model -> default model" }
-                return getDefaultModel(ModelOperation.DEFAULT, taskId, projectId)
-            }
-            if (data.modelId != null && data.provider != null) {
-                logger.info { "Using strong model from DB: ${data.modelId}" }
-                return Pair(data.modelId, data.provider)
-            }
-        }
-
-        // 2. Check YAML
-        val yamlModel = yamlLoader.getDefaultStrongModel()
-        if (yamlModel != null) {
-            val (provider, model) = parseModelString(yamlModel)
-            logger.info { "Using strong model from YAML: $model (provider=$provider)" }
-            return Pair(model, provider)
-        }
-
-        // 3. No fallback — return null
-        return null
-    }
+    ): Pair<String, String>? = modelSelectionService.getStrongModel(taskId, projectId)
 
-    /**
-     * Set default model for given mode.
-     *
-     * @param mode Task mode (chat, plan, agent)
-     * @param modelId Model identifier (e.g., "qwen2.5:7b")
-     * @param provider Provider name (e.g., "ollama")
-     * @param taskId Optional task ID for task-level config
-     * @param userId Optional user ID for audit
-     * @throws IllegalArgumentException If model_id doesn't exist in model_registry or provider mismatch
-     */
     @Suppress("UNUSED_PARAMETER")
     fun setDefaultModel(
         operation: ModelOperation,
@@ -600,691 +106,74 @@ class ConfigService(
         provider: String,
         taskId: String? = null,
         _userId: String? = null
-    ) {
-        // Validate model exists using cached model registry (no suspend/runBlocking needed)
-        if (operation != ModelOperation.EMBEDDING) {
-            val modelConfig = getModelConfigFromCache(modelId)
-            if (modelConfig != null && modelConfig.provider != provider) {
-                return
-            }
-        }
-
-        when (operation) {
-            ModelOperation.WEAK -> {
-                setWeakModel(modelId, provider)
-                logger.info { "Set weak model to $provider/$modelId" }
-                return
-            }
-            ModelOperation.EMBEDDING -> {
-                setEmbeddingModel("$provider/$modelId")
-                logger.info { "Set embedding model to $provider/$modelId" }
-                return
-            }
-            else -> {
-                val key = configKeyForOperation(operation)
-                val scope = if (taskId != null) ConfigScope.TASK else ConfigScope.APP
-                val valueJson = gson.toJson(ModelConfigData(modelId, provider))
-
-                configRepository.set(
-                    key = key,
-                    value = valueJson,
-                    scope = scope,
-                    taskId = taskId,
-                    description = "Default model for $operation operation"
-                )
-                invalidateConfigCache(key)
-
-                logger.info { "Set ${scope.name} config $key = $modelId" }
-            }
-        }
-    }
+    ) = modelSelectionService.setDefaultModel(operation, modelId, provider, taskId, _userId)
 
-    /**
-     * Set default model for ALL modes (chat, plan, agent) in one operation.
-     *
-     * @param modelId Model identifier (e.g., "qwen2.5:7b")
-     * @param provider Provider name (e.g., "ollama")
-     * @param taskId Optional task ID for task-level config
-     * @param userId Optional user ID for audit
-     * @throws IllegalArgumentException If model_id doesn't exist in model_registry or provider mismatch
-     */
     fun setDefaultModelAllModes(
         modelId: String,
         provider: String,
         taskId: String? = null,
         userId: String? = null
-    ) {
-        // Validate once using cached model registry (no suspend/runBlocking needed)
-        val modelConfig = getModelConfigFromCache(modelId)
-        if (modelConfig != null && modelConfig.provider != provider) {
-            return
-        }
-
-        // Set for all modes
-        for (operation in listOf(ModelOperation.DEFAULT, ModelOperation.PLAN, ModelOperation.CODING)) {
-            setDefaultModel(
-                operation = operation,
-                modelId = modelId,
-                provider = provider,
-                taskId = taskId,
-                _userId = userId
-            )
-        }
-
-        logger.info { "Set default model for ALL modes: $modelId (provider=$provider)" }
-    }
-
-    /**
-     * Get visibility setting for a specific model.
-     *
-     * @param modelId Model identifier (e.g., "ollama/qwen2.5:14b")
-     * @return true if model should be shown in dropdown (default), false otherwise
-     */
-    fun getModelVisibility(modelId: String): Boolean {
-        val config = configRepository.get(KEY_MODELS_VISIBILITY, ConfigScope.APP)
-        if (config != null) {
-            @Suppress("UNCHECKED_CAST")
-            val visibilityMap = gson.fromJson(config.value, Map::class.java) as? Map<String, Boolean>
-            return visibilityMap?.get(modelId) ?: true  // Default to visible
-        }
-        return true  // Default to visible if no config
-    }
-
-    /**
-     * Set visibility setting for a specific model.
-     *
-     * @param modelId Model identifier (e.g., "ollama/qwen2.5:14b")
-     * @param showInDropdown true to show model in dropdown, false to hide
-     */
-    fun setModelVisibility(modelId: String, showInDropdown: Boolean) {
-        // Get current visibility map
-        val config = configRepository.get(KEY_MODELS_VISIBILITY, ConfigScope.APP)
-        @Suppress("UNCHECKED_CAST")
-        val visibilityMap = if (config != null) {
-            gson.fromJson(config.value, Map::class.java) as? Map<String, Boolean> ?: emptyMap()
-        } else {
-            emptyMap()
-        }
+    ) = modelSelectionService.setDefaultModelAllModes(modelId, provider, taskId, userId)
 
-        // Update map with new value
-        val updatedMap = visibilityMap.toMutableMap()
-        updatedMap[modelId] = showInDropdown
+    fun getModelVisibility(modelId: String): Boolean =
+        modelSelectionService.getModelVisibility(modelId)
 
-        // Save back to config
-        val valueJson = gson.toJson(updatedMap)
-        configRepository.set(
-            key = KEY_MODELS_VISIBILITY,
-            value = valueJson,
-            scope = ConfigScope.APP,
-            taskId = null,
-            description = "Model visibility settings"
-        )
-
-        logger.info { "Updated model visibility: $modelId -> $showInDropdown" }
-        invalidateConfigCache(KEY_MODELS_VISIBILITY)
-    }
+    fun setModelVisibility(modelId: String, showInDropdown: Boolean) =
+        modelSelectionService.setModelVisibility(modelId, showInDropdown)
 
-    /**
-     * Replace all model visibility settings in one write.
-     *
-     * @param visibilityMap Map of modelId to showInDropdown setting
-     */
-    fun setModelsVisibility(visibilityMap: Map<String, Boolean>) {
-        val valueJson = gson.toJson(visibilityMap)
-        configRepository.set(
-            key = KEY_MODELS_VISIBILITY,
-            value = valueJson,
-            scope = ConfigScope.APP,
-            taskId = null,
-            description = "Model visibility settings"
-        )
+    fun setModelsVisibility(visibilityMap: Map<String, Boolean>) =
+        modelSelectionService.setModelsVisibility(visibilityMap)
 
-        logger.info { "Updated model visibility for ${visibilityMap.size} models" }
-        invalidateConfigCache(KEY_MODELS_VISIBILITY)
-    }
+    fun getModelsVisibility(): Map<String, Boolean> =
+        modelSelectionService.getModelsVisibility()
 
-    /**
-     * Get all model visibility settings.
-     *
-     * Configuration hierarchy:
-     * 1. Database value (highest priority)
-     * 2. YAML config (user + project merged)
-     *
-     * @return Map of modelId to showInDropdown setting
-     */
-    fun getModelsVisibility(): Map<String, Boolean> {
-        // 1. Check database first
-        val config = configRepository.get(KEY_MODELS_VISIBILITY, ConfigScope.APP)
-        if (config != null) {
-            @Suppress("UNCHECKED_CAST")
-            val visibilityMap = gson.fromJson(config.value, Map::class.java) as? Map<String, Boolean>
-            if (visibilityMap != null && visibilityMap.isNotEmpty()) {
-                return visibilityMap
-            }
-        }
-
-        // 2. Fall back to YAML config
-        val yamlVisibility = yamlLoader.getModelsVisibility()
-        if (yamlVisibility != null && yamlVisibility.isNotEmpty()) {
-            return yamlVisibility
-        }
-
-        return emptyMap()
-    }
-
-    /**
-     * Get config value by key with hierarchical lookup.
-     *
-     * Lookup order (highest priority first):
-     * 1. Database value (if exists)
-     * 2. Project YAML config (<project>/.refio/config.yaml)
-     * 3. User YAML config (~/.refio/config.yaml)
-     * 4. Built-in default (returned as null here, caller uses default)
-     *
-     * @param key Configuration key
-     * @param scope Configuration scope (APP or TASK)
-     * @param taskId Optional task ID for TASK scope
-     * @return Config value or null if not found
-     */
     fun get(
         key: String,
         scope: ConfigScope = ConfigScope.APP,
         taskId: String? = null,
         projectId: String? = null
-    ): String? {
-        val cacheKey = "raw:$key:scope=${scope.name}:task=${taskId.orEmpty()}:project=${resolveProjectId(projectId).orEmpty()}"
-        return configCache.getOrCompute(cacheKey) {
-            val dbConfig = when {
-                taskId != null -> getConfigWithPrecedence(key = key, taskId = taskId, projectId = projectId)
-                scope == ConfigScope.PROJECT -> {
-                    val resolvedProjectId = resolveProjectId(projectId)
-                    resolvedProjectId?.let { configRepository.get(key, ConfigScope.PROJECT, projectId = it) }
-                }
-                else -> configRepository.get(key, scope)
-            }
-            dbConfig?.value ?: getFromYaml(key)
-        }
-    }
+    ): String? = resolver.get(key, scope, taskId, projectId)
 
-    /**
-     * Get configuration value from YAML files only.
-     * Useful when you want to explicitly read from file-based config.
-     *
-     * Uses [ConfigKeys.byKey] to find the registered [ConfigKey] and its yamlAccessor.
-     *
-     * @param key Configuration key in dot notation (e.g., "general.format_markdown")
-     * @return Value from YAML config or null if not found
-     */
-    fun getFromYaml(key: String): String? {
-        val cacheKey = "yaml:$key"
-        return configCache.getOrCompute(cacheKey) {
-            val configKey = ConfigKeys.byKey(key)
-                ?: return@getOrCompute null
-            configKey.yamlAccessor?.invoke(yamlLoader)?.toString()
-        }
-    }
+    fun getFromYaml(key: String): String? = resolver.getFromYaml(key)
 
-    /**
-     * Set config value by key (simple helper for APP scope).
-     *
-     * @param key Configuration key
-     * @param value Configuration value
-     * @param scope Configuration scope (default: APP)
-     * @param taskId Optional task ID for TASK scope
-     */
     fun set(
         key: String,
         value: String,
         scope: ConfigScope = ConfigScope.APP,
         taskId: String? = null,
         projectId: String? = null
-    ) {
-        val resolvedProjectId = resolveProjectId(projectId)
-        configRepository.set(
-            key = key,
-            value = value,
-            scope = scope,
-            projectId = if (scope == ConfigScope.PROJECT) resolvedProjectId else null,
-            taskId = taskId,
-            description = null
-        )
-        invalidateConfigCache(key)
-    }
+    ) = resolver.set(key, value, scope, taskId, projectId)
 
-    /**
-     * Get API call timeout in milliseconds.
-     *
-     * @param taskId Optional task ID for task-level override
-     * @return Timeout in milliseconds (default: 120000ms = 120s)
-     */
-    /**
-     * Get tool execution timeout in milliseconds.
-     *
-     * @param taskId Optional task ID for task-level override
-     * @return Timeout in milliseconds (default: 120000ms = 120s)
-     */
-    /**
-     * Get maximum context size in tokens.
-     *
-     * @param taskId Optional task ID for task-level override
-     * @return Max context size in tokens (default: 128000)
-     */
-    /**
-     * Get maximum output size in tokens.
-     *
-     * @param taskId Optional task ID for task-level override
-     * @return Max output size in tokens (default: 8192)
-     */
-    /**
-     * Get maximum file size in MB.
-     *
-     * @param taskId Optional task ID for task-level override
-     * @return Max file size in MB (default: 10)
-     */
     // ==================== UI CONFIGURATION ====================
 
-    /**
-     * Get thinking enabled setting.
-     *
-     * @param taskId Optional task ID for task-level override
-     * @return true if thinking is enabled (default: false)
-     */
-    /**
-     * Set thinking enabled setting.
-     *
-     * @param enabled true to enable thinking display
-     * @param taskId Optional task ID for task-level config
-     */
     fun setThinkingEnabled(enabled: Boolean, taskId: String? = null) {
-        val scope = if (taskId != null) ConfigScope.TASK else ConfigScope.APP
-        configRepository.set(
-            key = KEY_UI_THINKING_ENABLED,
-            value = enabled.toString(),
-            scope = scope,
-            taskId = taskId,
-            description = "Show LLM thinking process in UI"
-        )
-        invalidateConfigCache(KEY_UI_THINKING_ENABLED)
-        invalidateConfigCache(KEY_UI_THINKING_ENABLED)
+        setTyped(ConfigKeys.UI_THINKING_ENABLED, enabled, taskScope(taskId), taskId)
     }
 
-    /**
-     * Get no-egress enabled setting.
-     *
-     * @param taskId Optional task ID for task-level override
-     * @return true if no-egress is enabled (default: false)
-     */
-    /**
-     * Set no-egress enabled setting.
-     *
-     * @param enabled true to block external network calls
-     * @param taskId Optional task ID for task-level config
-     */
     fun setNoEgressEnabled(enabled: Boolean, taskId: String? = null) {
-        val scope = if (taskId != null) ConfigScope.TASK else ConfigScope.APP
-        configRepository.set(
-            key = KEY_UI_NO_EGRESS_ENABLED,
-            value = enabled.toString(),
-            scope = scope,
-            taskId = taskId,
-            description = "Block external network calls"
-        )
-        invalidateConfigCache(KEY_UI_NO_EGRESS_ENABLED)
-        invalidateConfigCache(KEY_UI_NO_EGRESS_ENABLED)
+        setTyped(ConfigKeys.UI_NO_EGRESS_ENABLED, enabled, taskScope(taskId), taskId)
     }
 
-    /**
-     * Get execution mode setting.
-     *
-     * @param taskId Optional task ID for task-level override
-     * @return Execution mode (AUTO/INTERACTIVE, default: AUTO)
-     */
-    /**
-     * Set execution mode setting.
-     *
-     * @param mode Execution mode (AUTO/INTERACTIVE)
-     * @param taskId Optional task ID for task-level config
-     */
     fun setExecutionMode(mode: String, taskId: String? = null) {
-        val scope = if (taskId != null) ConfigScope.TASK else ConfigScope.APP
-        configRepository.set(
-            key = KEY_UI_EXECUTION_MODE,
-            value = mode,
-            scope = scope,
-            taskId = taskId,
-            description = "Execution mode (AUTO/INTERACTIVE)"
-        )
-        invalidateConfigCache(KEY_UI_EXECUTION_MODE)
-        invalidateConfigCache(KEY_UI_EXECUTION_MODE)
+        setTyped(ConfigKeys.UI_EXECUTION_MODE, mode, taskScope(taskId), taskId)
     }
 
-    /**
-     * Get selected mode setting.
-     *
-     * @param taskId Optional task ID for task-level override
-     * @return Selected mode (CHAT/PLAN/AGENT, default: CHAT)
-     */
-    /**
-     * Set selected mode setting.
-     *
-     * @param mode Selected mode (CHAT/PLAN/AGENT)
-     * @param taskId Optional task ID for task-level config
-     */
-    fun setSelectedMode(mode: String, taskId: String? = null) {
-        val scope = if (taskId != null) ConfigScope.TASK else ConfigScope.APP
-        configRepository.set(
-            key = KEY_UI_SELECTED_MODE,
-            value = mode,
-            scope = scope,
-            taskId = taskId,
-            description = "Selected task mode"
-        )
-        invalidateConfigCache(KEY_UI_SELECTED_MODE)
-        invalidateConfigCache(KEY_UI_SELECTED_MODE)
-    }
-
-    /**
-     * Get selected model setting.
-     *
-     * @param taskId Optional task ID for task-level override
-     * @return Selected model (default: empty string)
-     */
-    /**
-     * Set selected model setting.
-     *
-     * @param model Selected model (provider/model format)
-     * @param taskId Optional task ID for task-level config
-     */
     fun setSelectedModel(model: String, taskId: String? = null) {
-        val scope = if (taskId != null) ConfigScope.TASK else ConfigScope.APP
-        configRepository.set(
-            key = KEY_UI_SELECTED_MODEL,
-            value = model,
-            scope = scope,
-            taskId = taskId,
-            description = "Selected model in UI"
-        )
-        invalidateConfigCache(KEY_UI_SELECTED_MODEL)
-        invalidateConfigCache(KEY_UI_SELECTED_MODEL)
+        setTyped(ConfigKeys.UI_SELECTED_MODEL, model, taskScope(taskId), taskId)
     }
 
     // ==================== MODELS CONFIGURATION ====================
 
-    /**
-     * Get weak model for auxiliary operations.
-     *
-     * @return Pair of (model_id, provider) for weak model
-     */
-    fun getWeakModel(): Pair<String, String> = getDefaultModel(ModelOperation.WEAK)
+    fun getWeakModel(): Pair<String, String> = modelSelectionService.getWeakModel()
 
-    /**
-     * Set weak model for auxiliary operations.
-     *
-     * @param modelId Model identifier
-     * @param provider Provider name
-     */
-    fun setWeakModel(modelId: String, provider: String) {
-        val valueJson = gson.toJson(ModelConfigData(modelId, provider))
-        configRepository.set(
-            key = KEY_WEAK_MODEL,
-            value = valueJson,
-            scope = ConfigScope.APP,
-            taskId = null,
-            description = "Cheap model for auxiliary operations"
-        )
-        invalidateConfigCache(KEY_WEAK_MODEL)
-        invalidateConfigCache(KEY_WEAK_MODEL)
-    }
+    fun getEmbeddingModel(): String = modelSelectionService.getEmbeddingModel()
 
-    /**
-     * Get embedding model.
-     *
-     * @return Embedding model (default: "ollama/nomic-embed-text")
-     */
-    fun getEmbeddingModel(): String {
-        val (modelId, provider) = getDefaultModel(ModelOperation.EMBEDDING)
-        return "$provider/$modelId"
-    }
-
-    /**
-     * Set embedding model.
-     *
-     * @param model Embedding model (provider/model format)
-     */
-    fun setEmbeddingModel(model: String) {
-        configRepository.set(
-            key = KEY_EMBEDDING_MODEL,
-            value = model,
-            scope = ConfigScope.APP,
-            taskId = null,
-            description = "Model for embeddings"
-        )
-        invalidateConfigCache(KEY_EMBEDDING_MODEL)
-        invalidateConfigCache(KEY_EMBEDDING_MODEL)
-    }
+    fun setEmbeddingModel(model: String) = modelSelectionService.setEmbeddingModel(model)
 
-    /**
-     * Get configured Ollama endpoint with hierarchical lookup.
-     *
-     * Priority:
-     * 1. Database value
-     * 2. YAML config
-     * 3. System property OLLAMA_BASE_URL
-     * 4. System property OLLAMA_ENDPOINT (legacy)
-     * 5. Default: http://localhost:11434
-     */
-    /**
-     * Reload configuration from YAML files.
-     * Call this when you need to refresh cached config.
-     */
-    fun reloadYamlConfig(): ConfigYaml {
-        return yamlLoader.reloadConfig()
-    }
-
-    /**
-     * Get the merged YAML configuration.
-     * Useful for debugging or exporting current config state.
-     */
     fun getYamlConfig(): ConfigYaml {
         return yamlLoader.getConfig()
     }
 
-    /**
-     * Build a ConfigYaml object from current database values.
-     * Used for exporting current configuration to YAML file.
-     *
-     * @param includeApiKeys If true, includes API keys (masked). If false, omits them.
-     * @return ConfigYaml object with current settings
-     */
-    fun buildConfigYamlFromCurrentSettings(includeApiKeys: Boolean = false): ConfigYaml {
-        return ConfigYaml(
-            general = buildGeneralConfig(),
-            providers = buildProvidersConfig(includeApiKeys),
-            models = buildModelsConfig(),
-            limits = buildLimitsConfig(),
-            advanced = buildAdvancedConfig(),
-            tools = buildToolsConfig(),
-            terminal = buildTerminalConfig(),
-            rag = buildRagConfig(),
-            ui = buildUiConfig()
-        )
-    }
-
-    fun normalizeZAIBaseUrl(baseUrl: String?): String {
-        val normalized = baseUrl?.trim()?.trimEnd('/')
-        return when {
-            normalized.isNullOrEmpty() -> DEFAULT_ZAI_BASE_URL
-            normalized.equals(LEGACY_ZAI_BASE_URL, ignoreCase = true) -> DEFAULT_ZAI_BASE_URL
-            normalized.equals(GENERAL_ZAI_BASE_URL, ignoreCase = true) -> DEFAULT_ZAI_BASE_URL
-            else -> normalized
-        }
-    }
-
-    private fun buildGeneralConfig(): pl.jclab.refio.core.config.GeneralConfig {
-        return pl.jclab.refio.core.config.GeneralConfig(
-            formatMarkdown = getTyped(ConfigKeys.FORMAT_MARKDOWN),
-            streamingEnabled = getTyped(ConfigKeys.STREAMING_ENABLED),
-            advancedView = getTyped(ConfigKeys.ADVANCED_VIEW)
-        )
-    }
-
-    private fun buildProvidersConfig(includeApiKeys: Boolean): pl.jclab.refio.core.config.ProvidersConfig {
-        val ollamaEndpoint = get(KEY_PROVIDER_OLLAMA_ENDPOINT)
-        val lmstudioBaseUrl = get(KEY_PROVIDER_LM_STUDIO_BASE_URL)
-        val customOpenAIBaseUrl = get(KEY_PROVIDER_CUSTOM_OPENAI_BASE_URL)
-
-        return pl.jclab.refio.core.config.ProvidersConfig(
-            ollama = pl.jclab.refio.core.config.OllamaConfig(
-                endpoint = ollamaEndpoint ?: getTyped(ConfigKeys.PROVIDER_OLLAMA_ENDPOINT),
-                contextSize = getTyped(ConfigKeys.PROVIDER_OLLAMA_CONTEXT_SIZE),
-                keepAlive = getTyped(ConfigKeys.PROVIDER_OLLAMA_KEEP_ALIVE)
-            ),
-            anthropic = if (includeApiKeys) {
-                pl.jclab.refio.core.config.AnthropicConfig(
-                    apiKey = get(KEY_PROVIDER_ANTHROPIC_API_KEY)
-                )
-            } else null,
-            openai = if (includeApiKeys) {
-                pl.jclab.refio.core.config.OpenAIConfig(
-                    apiKey = get(KEY_PROVIDER_OPENAI_API_KEY)
-                )
-            } else null,
-            openrouter = if (includeApiKeys) {
-                pl.jclab.refio.core.config.OpenRouterConfig(
-                    apiKey = get(KEY_PROVIDER_OPENROUTER_API_KEY)
-                )
-            } else null,
-            gemini = if (includeApiKeys) {
-                pl.jclab.refio.core.config.GeminiConfig(
-                    apiKey = get(KEY_PROVIDER_GEMINI_API_KEY)
-                )
-            } else null,
-            lmstudio = pl.jclab.refio.core.config.LMStudioConfig(
-                apiKey = if (includeApiKeys) get(KEY_PROVIDER_LM_STUDIO_API_KEY) else null,
-                baseUrl = lmstudioBaseUrl,
-                contextSize = getTyped(ConfigKeys.PROVIDER_LM_STUDIO_CONTEXT_SIZE)
-            ),
-            customOpenai = pl.jclab.refio.core.config.CustomOpenAIConfig(
-                apiKey = if (includeApiKeys) get(KEY_PROVIDER_CUSTOM_OPENAI_API_KEY) else null,
-                baseUrl = customOpenAIBaseUrl,
-                model = get(KEY_PROVIDER_CUSTOM_OPENAI_MODEL)
-            ),
-            zai = pl.jclab.refio.core.config.ZAIConfig(
-                apiKey = if (includeApiKeys) get(KEY_PROVIDER_ZAI_API_KEY) else null,
-                baseUrl = get(KEY_PROVIDER_ZAI_BASE_URL) ?: DEFAULT_ZAI_BASE_URL
-            )
-        )
-    }
-
-    private fun buildModelsConfig(): pl.jclab.refio.core.config.ModelsConfig {
-        val (chatModel, chatProvider) = getDefaultModel(ModelOperation.DEFAULT)
-        val (planModel, planProvider) = getDefaultModel(ModelOperation.PLAN)
-        val (codingModel, codingProvider) = getDefaultModel(ModelOperation.CODING)
-        val (weakModel, weakProvider) = getDefaultModel(ModelOperation.WEAK)
-        val (embeddingModel, embeddingProvider) = getDefaultModel(ModelOperation.EMBEDDING)
-
-        return pl.jclab.refio.core.config.ModelsConfig(
-            defaults = pl.jclab.refio.core.config.ModelDefaultsConfig(
-                chat = "$chatProvider/$chatModel",
-                plan = "$planProvider/$planModel",
-                coding = "$codingProvider/$codingModel",
-                weak = "$weakProvider/$weakModel",
-                embedding = "$embeddingProvider/$embeddingModel"
-            ),
-            visibility = getModelsVisibility()
-        )
-    }
-
-    private fun buildLimitsConfig(): pl.jclab.refio.core.config.LimitsConfig {
-        return pl.jclab.refio.core.config.LimitsConfig(
-            apiCallTimeout = getTyped(ConfigKeys.API_CALL_TIMEOUT),
-            toolExecutionTimeout = getTyped(ConfigKeys.TOOL_EXECUTION_TIMEOUT),
-            streamingReadTimeout = getTyped(ConfigKeys.STREAMING_READ_TIMEOUT),
-            streamingRequestTimeout = getTyped(ConfigKeys.STREAMING_REQUEST_TIMEOUT),
-            maxContextSize = getTyped(ConfigKeys.MAX_CONTEXT_SIZE),
-            maxOutputSize = getTyped(ConfigKeys.MAX_OUTPUT_SIZE),
-            maxFileSize = getTyped(ConfigKeys.MAX_FILE_SIZE)
-        )
-    }
-
-    private fun buildAdvancedConfig(): pl.jclab.refio.core.config.AdvancedConfig {
-        return pl.jclab.refio.core.config.AdvancedConfig(
-            noEgressDefault = getTyped(ConfigKeys.NO_EGRESS_DEFAULT),
-            readOnlyMode = getTyped(ConfigKeys.READ_ONLY_MODE),
-            autoOptimizePercentage = getTyped(ConfigKeys.AUTO_OPTIMIZE_PERCENTAGE)
-        )
-    }
-
-    private fun buildToolsConfig(): pl.jclab.refio.core.config.ToolsConfig {
-        val permissions = getToolsPermissions()
-        if (permissions.isEmpty()) return pl.jclab.refio.core.config.ToolsConfig()
-
-        val yamlPermissions = permissions.mapValues { (_, enabled) ->
-            pl.jclab.refio.core.config.ToolPermissionConfig(
-                planMode = if (enabled) "ON" else "OFF",
-                agentMode = if (enabled) "ON" else "OFF"
-            )
-        }
-
-        return pl.jclab.refio.core.config.ToolsConfig(permissions = yamlPermissions)
-    }
-
-    private fun buildTerminalConfig(): TerminalConfig {
-        val whitelist = getTerminalWhitelistConfig()
-        val yamlCommands = whitelist.allowedCommands.map { command ->
-            TerminalCommandConfig(
-                program = command.program,
-                description = command.description.ifBlank { null },
-                aliases = command.aliases.ifEmpty { null },
-                blockedFlags = command.blockedFlags.ifEmpty { null },
-                blockedSubcommands = command.blockedSubcommands.ifEmpty { null },
-                blockedArgPatterns = command.blockedArgPatterns.ifEmpty { null },
-                allowedSubcommands = command.allowedSubcommands.ifEmpty { null },
-                maxArgs = command.maxArgs,
-                requireConfirmation = command.requireConfirmation
-            )
-        }
-
-        return TerminalConfig(
-            whitelist = TerminalWhitelistConfig(
-                enabled = whitelist.enabled,
-                mode = whitelist.mode.name,
-                globalBlockedPatterns = whitelist.globalBlockedPatterns,
-                commands = yamlCommands
-            )
-        )
-    }
-
-    private fun buildRagConfig(): pl.jclab.refio.core.config.RagConfig {
-        return pl.jclab.refio.core.config.RagConfig(
-            enabled = getTyped(ConfigKeys.RAG_ENABLED),
-            indexOnStartup = getTyped(ConfigKeys.RAG_INDEX_ON_STARTUP),
-            autoIndexOnContextBuild = getTyped(ConfigKeys.RAG_AUTO_INDEX_ON_CONTEXT),
-            maxFileSizeMB = getTyped(ConfigKeys.RAG_MAX_FILE_SIZE_MB),
-            maxChunksPerFile = getTyped(ConfigKeys.RAG_MAX_CHUNKS_PER_FILE),
-            indexBatchSize = getTyped(ConfigKeys.RAG_INDEX_BATCH_SIZE),
-            embeddingsBatchSize = getTyped(ConfigKeys.RAG_EMBEDDINGS_BATCH_SIZE),
-            cacheTtlMs = getTyped(ConfigKeys.RAG_CACHE_TTL_MS),
-            maxConcurrentJobs = getTyped(ConfigKeys.RAG_MAX_CONCURRENT_JOBS),
-            ignoredDirectories = getTyped(ConfigKeys.RAG_IGNORED_DIRECTORIES),
-            searchSimilarityThreshold = getTyped(ConfigKeys.RAG_SEARCH_SIMILARITY_THRESHOLD),
-            searchTopK = getTyped(ConfigKeys.RAG_SEARCH_TOP_K),
-            searchHybridEnabled = getTyped(ConfigKeys.RAG_SEARCH_HYBRID_ENABLED),
-            searchSemanticWeight = getTyped(ConfigKeys.RAG_SEARCH_SEMANTIC_WEIGHT),
-            searchIncludeContextChunks = getTyped(ConfigKeys.RAG_SEARCH_INCLUDE_CONTEXT_CHUNKS)
-        )
-    }
-
-    private fun buildUiConfig(): pl.jclab.refio.core.config.UiConfig {
-        return pl.jclab.refio.core.config.UiConfig(
-            thinkingEnabled = getTyped(ConfigKeys.UI_THINKING_ENABLED),
-            noEgressEnabled = getTyped(ConfigKeys.UI_NO_EGRESS_ENABLED),
-            executionMode = getTyped(ConfigKeys.UI_EXECUTION_MODE),
-            selectedMode = getTyped(ConfigKeys.UI_SELECTED_MODE),
-            selectedModel = getTyped(ConfigKeys.UI_SELECTED_MODEL)
-        )
-    }
-
     /**
      * Export current configuration to a YAML file.
      *
@@ -1292,1071 +181,85 @@ class ConfigService(
      * @param includeApiKeys If true, includes API keys (masked for security)
      */
     fun exportToYaml(file: java.io.File, includeApiKeys: Boolean = false) {
-        val config = buildConfigYamlFromCurrentSettings(includeApiKeys)
+        val config = ConfigYamlBuilder(this, configRepository).build(includeApiKeys)
         ConfigYaml.saveToFile(config, file, withComments = true)
         logger.info { "Exported configuration to: ${file.absolutePath}" }
     }
 
-    // ==================== GENERAL CONFIGURATION ====================
-
-    /**
-     * Get format markdown setting.
-     *
-     * @return true if markdown formatting is enabled (default: true)
-     */
-    /**
-     * Get streaming enabled setting.
-     *
-     * @return true if streaming is enabled (default: true)
-     */
-    // ==================== RAG CONFIGURATION ====================
-
-    /**
-     * Get advanced view setting.
-     *
-     * @return true if advanced view is enabled (default: false)
-     */
-    // ==================== ADVANCED CONFIGURATION ====================
-
-    /**
-     * Get auto-optimize percentage.
-     *
-     * @return Auto-optimize percentage (default: 85)
-     */
-    /**
-     * Get no-egress default setting.
-     *
-     * @return true if no-egress is default (default: false)
-     */
-    /**
-     * Get read-only mode setting.
-     *
-     * @return true if read-only mode is enabled (default: false)
-     */
-    // ==================== PROVIDER CONFIGURATION ====================
-
-    /**
-     * Get Ollama context size.
-     *
-     * @return Ollama context size in tokens (default: 32768)
-     */
-    /**
-     * Get Ollama keep_alive duration.
-     *
-     * @return Ollama keep_alive in seconds (default: 1800 = 30 minutes)
-     */
-    /**
-     * Set Ollama context size.
-     *
-     * @param contextSize Context size in tokens
-     */
-    fun setOllamaContextSize(contextSize: Int) {
-        configRepository.set(
-            key = KEY_PROVIDER_OLLAMA_CONTEXT_SIZE,
-            value = contextSize.toString(),
-            scope = ConfigScope.APP,
-            taskId = null,
-            description = "Ollama context size in tokens"
-        )
-        invalidateConfigCache(KEY_PROVIDER_OLLAMA_CONTEXT_SIZE)
-        invalidateConfigCache(KEY_PROVIDER_OLLAMA_CONTEXT_SIZE)
-    }
-
-    /**
-     * Get LM Studio context size.
-     *
-     * @return LM Studio context size in tokens (default: 32768)
-     */
-    /**
-     * Set LM Studio context size.
-     *
-     * @param contextSize Context size in tokens
-     */
-    fun setLMStudioContextSize(contextSize: Int) {
-        configRepository.set(
-            key = KEY_PROVIDER_LM_STUDIO_CONTEXT_SIZE,
-            value = contextSize.toString(),
-            scope = ConfigScope.APP,
-            taskId = null,
-            description = "LM Studio context size in tokens"
-        )
-        invalidateConfigCache(KEY_PROVIDER_LM_STUDIO_CONTEXT_SIZE)
-        invalidateConfigCache(KEY_PROVIDER_LM_STUDIO_CONTEXT_SIZE)
-    }
-
-    // ==================== TOOLS CONFIGURATION ====================
-
-    /**
-     * Get tool permissions map.
-     *
-     * @return Map of tool_name -> enabled
-     */
-    fun getToolsPermissions(): Map<String, Boolean> {
-        val config = configRepository.get(KEY_TOOLS_PERMISSIONS, ConfigScope.APP)
-        if (config != null) {
-            @Suppress("UNCHECKED_CAST")
-            val permissionsMap = gson.fromJson(config.value, Map::class.java) as? Map<String, Boolean>
-            return permissionsMap ?: emptyMap()
-        }
-        return emptyMap()
-    }
-
-    /**
-     * Set tool permissions map.
-     *
-     * @param permissions Map of tool_name -> enabled
-     */
-    fun setToolsPermissions(permissions: Map<String, Boolean>) {
-        val valueJson = gson.toJson(permissions)
-        configRepository.set(
-            key = KEY_TOOLS_PERMISSIONS,
-            value = valueJson,
-            scope = ConfigScope.APP,
-            taskId = null,
-            description = "Tool permissions"
-        )
-        invalidateConfigCache(KEY_TOOLS_PERMISSIONS)
-        invalidateConfigCache(KEY_TOOLS_PERMISSIONS)
-    }
-
-    /**
-     * Get run terminal command permission.
-     *
-     * @return true if run_terminal_command is allowed (default: true)
-     */
-    /**
-     * Set run terminal command permission.
-     *
-     * @param allowed true to allow run_terminal_command tool
-     */
-    fun setRunTerminalCommandAllowed(allowed: Boolean) {
-        configRepository.set(
-            key = KEY_TOOL_PERMISSION_RUN_TERMINAL,
-            value = allowed.toString(),
-            scope = ConfigScope.APP,
-            taskId = null,
-            description = "Allow run_terminal_command tool"
-        )
-        invalidateConfigCache(KEY_TOOL_PERMISSION_RUN_TERMINAL)
-        invalidateConfigCache(KEY_TOOL_PERMISSION_RUN_TERMINAL)
-    }
-
-    fun getTerminalWhitelistConfig(): CommandWhitelistConfig {
-        val defaults = CommandWhitelistConfig(
-            enabled = true,
-            mode = WhitelistMode.WHITELIST_ONLY,
-            allowedCommands = CommandWhitelistDefaults.DEFAULT_COMMANDS,
-            globalBlockedPatterns = CommandWhitelistDefaults.DEFAULT_BLOCKED_PATTERNS
-        )
-
-        val fromYaml = yamlLoader.getTerminalWhitelist()?.let { yaml ->
-            CommandWhitelistConfig(
-                enabled = yaml.enabled ?: defaults.enabled,
-                mode = parseWhitelistMode(yaml.mode) ?: defaults.mode,
-                allowedCommands = yaml.commands?.map { toDomainAllowedCommand(it) } ?: emptyList(),
-                globalBlockedPatterns = yaml.globalBlockedPatterns ?: emptyList()
-            )
-        }
-
-        var merged = mergeTerminalWhitelistConfigs(defaults, fromYaml)
-
-        val dbConfig = getConfigWithPrecedence(KEY_TERMINAL_WHITELIST)
-        if (dbConfig != null) {
-            val parsed = runCatching { gson.fromJson(dbConfig.value, CommandWhitelistConfig::class.java) }.getOrNull()
-            if (parsed != null) {
-                merged = mergeTerminalWhitelistConfigs(merged, parsed)
-            } else {
-                logger.warn { "Failed to parse DB terminal whitelist config JSON" }
-            }
-        }
-
-        getConfigWithPrecedence(KEY_TERMINAL_WHITELIST_ENABLED)?.value?.toBooleanStrictOrNull()?.let { enabled ->
-            merged = merged.copy(enabled = enabled)
-        }
-        parseWhitelistMode(getConfigWithPrecedence(KEY_TERMINAL_WHITELIST_MODE)?.value)?.let { mode ->
-            merged = merged.copy(mode = mode)
-        }
-
-        return merged
-    }
-
-    fun setTerminalWhitelistConfig(config: CommandWhitelistConfig, scope: ConfigScope) {
-        require(scope != ConfigScope.TASK) { "TASK scope is not supported for terminal whitelist config" }
-
-        val projectId = if (scope == ConfigScope.PROJECT) {
-            resolveProjectId(null)
-                ?: throw IllegalArgumentException("PROJECT scope requires default projectId")
-        } else {
-            null
-        }
-
-        configRepository.set(
-            key = KEY_TERMINAL_WHITELIST,
-            value = gson.toJson(config),
-            scope = scope,
-            projectId = projectId,
-            taskId = null,
-            description = "Terminal whitelist configuration"
-        )
-        configRepository.set(
-            key = KEY_TERMINAL_WHITELIST_ENABLED,
-            value = config.enabled.toString(),
-            scope = scope,
-            projectId = projectId,
-            taskId = null,
-            description = "Terminal whitelist enabled"
-        )
-        configRepository.set(
-            key = KEY_TERMINAL_WHITELIST_MODE,
-            value = config.mode.name,
-            scope = scope,
-            projectId = projectId,
-            taskId = null,
-            description = "Terminal whitelist mode"
-        )
-        invalidateConfigCache(KEY_TERMINAL_WHITELIST)
-        invalidateConfigCache(KEY_TERMINAL_WHITELIST_ENABLED)
-        invalidateConfigCache(KEY_TERMINAL_WHITELIST_MODE)
-        invalidateConfigCache(KEY_TERMINAL_WHITELIST)
-        invalidateConfigCache(KEY_TERMINAL_WHITELIST_ENABLED)
-        invalidateConfigCache(KEY_TERMINAL_WHITELIST_MODE)
-    }
-
-    fun addAllowedCommand(command: AllowedCommand, scope: ConfigScope) {
-        val current = getTerminalWhitelistConfig()
-        val programKey = command.program.lowercase()
-        val updatedCommands = current.allowedCommands
-            .filterNot { it.program.lowercase() == programKey } + command
-        setTerminalWhitelistConfig(current.copy(allowedCommands = updatedCommands), scope)
-    }
-
-    fun removeAllowedCommand(program: String, scope: ConfigScope) {
-        val current = getTerminalWhitelistConfig()
-        val targetKey = program.lowercase()
-        val updatedCommands = current.allowedCommands.filterNot { it.program.lowercase() == targetKey }
-        setTerminalWhitelistConfig(current.copy(allowedCommands = updatedCommands), scope)
-    }
-
-    /**
-     * Get enabled overrides for builtin subagents.
-     *
-     * @return Map of subagent name -> enabled flag
-     */
-    fun getBuiltinSubagentEnabledOverrides(): Map<String, Boolean> {
-        val config = configRepository.get(KEY_SUBAGENTS_BUILTIN_ENABLED, ConfigScope.APP)
-        if (config != null) {
-            val raw = gson.fromJson(config.value, Map::class.java)
-            if (raw != null) {
-                return raw.mapNotNull { (key, value) ->
-                    val name = key as? String ?: return@mapNotNull null
-                    val enabled = when (value) {
-                        is Boolean -> value
-                        is String -> value.toBoolean()
-                        else -> null
-                    } ?: return@mapNotNull null
-                    name to enabled
-                }.toMap()
-            }
-        }
-        return emptyMap()
-    }
-
-    /**
-     * Set enabled override for a builtin subagent.
-     *
-     * @param name Subagent name
-     * @param enabled Enabled flag
-     */
-    fun setBuiltinSubagentEnabledOverride(name: String, enabled: Boolean) {
-        val current = getBuiltinSubagentEnabledOverrides().toMutableMap()
-        current[name.lowercase()] = enabled
-        configRepository.set(
-            key = KEY_SUBAGENTS_BUILTIN_ENABLED,
-            value = gson.toJson(current),
-            scope = ConfigScope.APP,
-            taskId = null,
-            description = "Builtin subagent enabled overrides"
-        )
-        invalidateConfigCache(KEY_SUBAGENTS_BUILTIN_ENABLED)
-        invalidateConfigCache(KEY_SUBAGENTS_BUILTIN_ENABLED)
-    }
-
-    // ==================== TOOL RESULT SUMMARIZATION ====================
+    private val builtinSubagentOverrides = BuiltinSubagentOverrides(
+        configRepository = configRepository,
+        invalidate = ::invalidateConfigCache,
+    )
 
-    /**
-     * Check if tool result summarization is enabled.
-     *
-     * @return true if tool summarization is enabled (default: true)
-     */
-    /**
-     * Set tool result summarization enabled setting.
-     *
-     * @param enabled true to enable tool summarization
-     */
-    fun setToolSummaryEnabled(enabled: Boolean) {
-        configRepository.set(
-            key = KEY_TOOL_SUMMARY_ENABLED,
-            value = enabled.toString(),
-            scope = ConfigScope.APP,
-            taskId = null,
-            description = "Enable tool result summarization"
-        )
-        invalidateConfigCache(KEY_TOOL_SUMMARY_ENABLED)
-        invalidateConfigCache(KEY_TOOL_SUMMARY_ENABLED)
-    }
+    fun getBuiltinSubagentEnabledOverrides(): Map<String, Boolean> =
+        builtinSubagentOverrides.getAll()
 
-    /**
-     * Get minimum output length for summarization.
-     * Tool outputs shorter than this will not be summarized.
-     *
-     * @return Minimum length in characters (default: 500)
-     */
-    /**
-     * Set minimum output length for summarization.
-     *
-     * @param length Minimum length in characters
-     */
-    fun setToolSummaryMinLength(length: Int) {
-        configRepository.set(
-            key = KEY_TOOL_SUMMARY_MIN_LENGTH,
-            value = length.toString(),
-            scope = ConfigScope.APP,
-            taskId = null,
-            description = "Minimum tool output length for summarization"
-        )
-        invalidateConfigCache(KEY_TOOL_SUMMARY_MIN_LENGTH)
-        invalidateConfigCache(KEY_TOOL_SUMMARY_MIN_LENGTH)
-    }
+    fun setBuiltinSubagentEnabledOverride(name: String, enabled: Boolean) =
+        builtinSubagentOverrides.setOverride(name, enabled)
 
     // ==================== CONTEXT CONFIGURATION (ADR 0017) ====================
+    // Delegated to ContextBudgetResolver to keep budget math out of this class.
 
-    /**
-     * Get recent work full data limit.
-     * Number of recent tool executions to show with full data.
-     *
-     * @return Full data limit (default: 2)
-     */
-    /**
-     * Set recent work full data limit.
-     *
-     * @param limit Number of recent tool executions with full data
-     */
-    fun setRecentWorkFullDataLimit(limit: Int) {
-        configRepository.set(
-            key = KEY_RECENT_WORK_FULL_DATA_LIMIT,
-            value = limit.toString(),
-            scope = ConfigScope.APP,
-            taskId = null,
-            description = "Number of recent tool executions with full data"
-        )
-        invalidateConfigCache(KEY_RECENT_WORK_FULL_DATA_LIMIT)
-        invalidateConfigCache(KEY_RECENT_WORK_FULL_DATA_LIMIT)
-    }
+    private val contextBudgetResolver = ContextBudgetResolver(this)
 
-    /**
-     * Get recent work summary max length.
-     * Maximum length for truncated tool outputs.
-     *
-     * @return Summary max length in characters (default: 150)
-     */
-    /**
-     * Set recent work summary max length.
-     *
-     * @param length Maximum length in characters
-     */
-    fun setRecentWorkSummaryMaxLength(length: Int) {
-        configRepository.set(
-            key = KEY_RECENT_WORK_SUMMARY_MAX_LENGTH,
-            value = length.toString(),
-            scope = ConfigScope.APP,
-            taskId = null,
-            description = "Maximum length for truncated tool outputs"
-        )
-        invalidateConfigCache(KEY_RECENT_WORK_SUMMARY_MAX_LENGTH)
-        invalidateConfigCache(KEY_RECENT_WORK_SUMMARY_MAX_LENGTH)
-    }
+    fun getContextBudget(taskId: String? = null, operation: ModelOperation? = null): ContextBudget =
+        contextBudgetResolver.getContextBudget(taskId, operation)
 
-    /**
-     * Resolve context budget for prompt building.
-     */
-    fun getContextBudget(taskId: String? = null, operation: ModelOperation? = null): ContextBudget {
-        val inputRatio = getContextBudgetInputRatio(taskId)
-        val contextSize = resolveContextSizeForBudget(operation, taskId)
-        val totalOverride = getContextBudgetTotalTokens(taskId)
-        val overrides = getContextBudgetSectionOverrides(taskId)
+    fun isCompactPrompts(operation: ModelOperation? = null, taskId: String? = null): Boolean =
+        contextBudgetResolver.isCompactPrompts(operation, taskId)
 
-        return ContextBudget.forContextSize(
-            contextSize = contextSize,
-            inputRatio = inputRatio,
-            overrides = overrides,
-            totalTokensOverride = totalOverride
-        )
-    }
-
-    private fun getContextBudgetInputRatio(taskId: String? = null): Double {
-        val config = getConfigWithPrecedence(KEY_CONTEXT_BUDGET_INPUT_RATIO, taskId)
-        return config?.value?.toDoubleOrNull() ?: DEFAULT_CONTEXT_BUDGET_INPUT_RATIO
-    }
-
-    private fun getContextBudgetTotalTokens(taskId: String? = null): Int? {
-        val config = getConfigWithPrecedence(KEY_CONTEXT_BUDGET_TOTAL_TOKENS, taskId)
-        return config?.value?.toIntOrNull()
-    }
-
-    private fun getContextBudgetSectionOverrides(taskId: String? = null): Map<ContextSection, Int> {
-        val overrides = mutableMapOf<ContextSection, Int>()
-        ContextSection.values().forEach { section ->
-            val key = "$KEY_CONTEXT_BUDGET_SECTION_PREFIX${section.name.lowercase()}"
-            val config = getConfigWithPrecedence(key, taskId)
-            val value = config?.value?.toIntOrNull()
-            if (value != null && value > 0) {
-                overrides[section] = value
-            }
-        }
-        return overrides
-    }
-
-    private fun resolveContextSizeForBudget(operation: ModelOperation?, taskId: String?): Int {
-        val fallback = getTyped(ConfigKeys.MAX_CONTEXT_SIZE, taskId)
-        if (operation == null) return fallback
-
-        val (_, provider) = getModel(operation, taskId)
-        return when (provider.lowercase()) {
-            "ollama" -> getTyped(ConfigKeys.PROVIDER_OLLAMA_CONTEXT_SIZE)
-            "lmstudio" -> getTyped(ConfigKeys.PROVIDER_LM_STUDIO_CONTEXT_SIZE)
-            else -> fallback
-        }
-    }
+    private val defaultsInitializer = ConfigDefaultsInitializer(
+        configRepository = configRepository,
+        applyYaml = ::applyYaml,
+        invalidateAllCaches = configCache::invalidateAll,
+    )
 
-    /**
-     * Whether compact (shorter) system prompts should be used.
-     * Auto-detects based on resolved context size for the operation:
-     * context <= 48000 tokens → compact mode (saves ~40% prompt tokens).
-     */
-    fun isCompactPrompts(operation: ModelOperation? = null, taskId: String? = null): Boolean {
-        val contextSize = resolveContextSizeForBudget(operation, taskId)
-        return contextSize <= COMPACT_PROMPT_THRESHOLD
-    }
+    private val validator = ConfigValidator(this)
 
-    /**
-     * Get streaming read timeout (time between chunks) in milliseconds.
-     * Used to detect stalled streaming connections.
-     */
-    /**
-     * Get streaming request timeout (total time) in milliseconds.
-     * Used as maximum total duration for streaming requests.
-     */
-    /**
-     * Load configuration from YAML file, but only for keys that don't exist in DB.
-     * Called on plugin startup to initialize config from user's config file.
-     *
-     * Location: ~/.refio/config.yaml (Linux/macOS) or %USERPROFILE%\.refio\config.yaml (Windows)
-     *
-     * This method is idempotent - it won't overwrite existing config values.
-     */
     fun loadFromYamlIfMissing() {
-        val yamlConfig = pl.jclab.refio.core.config.ConfigYaml.load()
-        if (yamlConfig == null) {
-            logger.info { "No YAML config file found or failed to parse, skipping" }
-            return
-        }
-
-        logger.info { "Loading configuration from YAML file (only missing keys)" }
-
-        // Load default models (only if not set)
-        yamlConfig.models?.defaults?.let { defaults ->
-            defaults.chat?.let { model ->
-                if (configRepository.get(KEY_DEFAULT_MODEL_CHAT, ConfigScope.APP) == null) {
-                    val (provider, modelId) = parseModelString(model)
-                    try {
-                        setDefaultModel(ModelOperation.DEFAULT, modelId, provider)
-                        logger.info { "Loaded default chat model from YAML: $modelId" }
-                    } catch (e: Exception) {
-                        logger.warn { "Failed to set chat model from YAML: ${e.message}" }
-                    }
-                }
-            }
-
-            defaults.plan?.let { model ->
-                if (configRepository.get(KEY_DEFAULT_MODEL_PLAN, ConfigScope.APP) == null) {
-                    val (provider, modelId) = parseModelString(model)
-                    try {
-                        setDefaultModel(ModelOperation.PLAN, modelId, provider)
-                        logger.info { "Loaded default plan model from YAML: $modelId" }
-                    } catch (e: Exception) {
-                        logger.warn { "Failed to set plan model from YAML: ${e.message}" }
-                    }
-                }
-            }
-
-            defaults.coding?.let { model ->
-                if (configRepository.get(KEY_DEFAULT_MODEL_AGENT, ConfigScope.APP) == null) {
-                    val (provider, modelId) = parseModelString(model)
-                    try {
-                        setDefaultModel(ModelOperation.CODING, modelId, provider)
-                        logger.info { "Loaded default coding model from YAML: $modelId" }
-                    } catch (e: Exception) {
-                        logger.warn { "Failed to set coding model from YAML: ${e.message}" }
-                    }
-                }
-            }
-        }
-
-        // Load model visibility (merge with existing)
-        yamlConfig.models?.visibility?.let { visibility ->
-            val existingVisibility = getModelsVisibility().toMutableMap()
-            var hasChanges = false
-
-            visibility.forEach { (modelId, show) ->
-                if (!existingVisibility.containsKey(modelId)) {
-                    existingVisibility[modelId] = show
-                    hasChanges = true
-                    logger.info { "Loaded model visibility from YAML: $modelId -> $show" }
-                }
-            }
-
-            if (hasChanges) {
-                val valueJson = gson.toJson(existingVisibility)
-                configRepository.set(
-                    key = KEY_MODELS_VISIBILITY,
-                    value = valueJson,
-                    scope = ConfigScope.APP,
-                    taskId = null,
-                    description = "Model visibility settings"
-                )
-            }
-        }
-
-        // Load provider configs (only if not set)
-        yamlConfig.providers?.let { providers ->
-            providers.ollama?.endpoint?.let { endpoint ->
-                if (configRepository.get(KEY_PROVIDER_OLLAMA_ENDPOINT, ConfigScope.APP) == null) {
-                    set(KEY_PROVIDER_OLLAMA_ENDPOINT, endpoint)
-                    logger.info { "Loaded Ollama endpoint from YAML: $endpoint" }
-                }
-            }
-
-            providers.anthropic?.apiKey?.let { apiKey ->
-                if (configRepository.get(KEY_PROVIDER_ANTHROPIC_API_KEY, ConfigScope.APP) == null) {
-                    set(KEY_PROVIDER_ANTHROPIC_API_KEY, apiKey)
-                    logger.info { "Loaded Anthropic API key from YAML" }
-                }
-            }
-
-            providers.openai?.apiKey?.let { apiKey ->
-                if (configRepository.get(KEY_PROVIDER_OPENAI_API_KEY, ConfigScope.APP) == null) {
-                    set(KEY_PROVIDER_OPENAI_API_KEY, apiKey)
-                    logger.info { "Loaded OpenAI API key from YAML" }
-                }
-            }
-
-            providers.openrouter?.apiKey?.let { apiKey ->
-                if (configRepository.get(KEY_PROVIDER_OPENROUTER_API_KEY, ConfigScope.APP) == null) {
-                    set(KEY_PROVIDER_OPENROUTER_API_KEY, apiKey)
-                    logger.info { "Loaded OpenRouter API key from YAML" }
-                }
-            }
-
-            providers.gemini?.apiKey?.let { apiKey ->
-                if (configRepository.get(KEY_PROVIDER_GEMINI_API_KEY, ConfigScope.APP) == null) {
-                    set(KEY_PROVIDER_GEMINI_API_KEY, apiKey)
-                    logger.info { "Loaded Gemini API key from YAML" }
-                }
-            }
-
-            providers.lmstudio?.apiKey?.let { apiKey ->
-                if (configRepository.get(KEY_PROVIDER_LM_STUDIO_API_KEY, ConfigScope.APP) == null) {
-                    set(KEY_PROVIDER_LM_STUDIO_API_KEY, apiKey)
-                    logger.info { "Loaded LM Studio API key from YAML" }
-                }
-            }
-
-            providers.lmstudio?.baseUrl?.let { baseUrl ->
-                if (configRepository.get(KEY_PROVIDER_LM_STUDIO_BASE_URL, ConfigScope.APP) == null) {
-                    set(KEY_PROVIDER_LM_STUDIO_BASE_URL, baseUrl)
-                    logger.info { "Loaded LM Studio base URL from YAML: $baseUrl" }
-                }
-            }
-            providers.customOpenai?.apiKey?.let { apiKey ->
-                if (configRepository.get(KEY_PROVIDER_CUSTOM_OPENAI_API_KEY, ConfigScope.APP) == null) {
-                    set(KEY_PROVIDER_CUSTOM_OPENAI_API_KEY, apiKey)
-                    logger.info { "Loaded Custom OpenAI API key from YAML" }
-                }
-            }
-            providers.customOpenai?.baseUrl?.let { baseUrl ->
-                if (configRepository.get(KEY_PROVIDER_CUSTOM_OPENAI_BASE_URL, ConfigScope.APP) == null) {
-                    set(KEY_PROVIDER_CUSTOM_OPENAI_BASE_URL, baseUrl)
-                    logger.info { "Loaded Custom OpenAI base URL from YAML: $baseUrl" }
-                }
-            }
-            providers.customOpenai?.model?.let { model ->
-                if (configRepository.get(KEY_PROVIDER_CUSTOM_OPENAI_MODEL, ConfigScope.APP) == null) {
-                    set(KEY_PROVIDER_CUSTOM_OPENAI_MODEL, model)
-                    logger.info { "Loaded Custom OpenAI model from YAML: $model" }
-                }
-            }
-            providers.zai?.apiKey?.let { apiKey ->
-                if (configRepository.get(KEY_PROVIDER_ZAI_API_KEY, ConfigScope.APP) == null) {
-                    set(KEY_PROVIDER_ZAI_API_KEY, apiKey)
-                    logger.info { "Loaded Z.AI API key from YAML" }
-                }
-            }
-            providers.zai?.baseUrl?.let { baseUrl ->
-                if (configRepository.get(KEY_PROVIDER_ZAI_BASE_URL, ConfigScope.APP) == null) {
-                    set(KEY_PROVIDER_ZAI_BASE_URL, baseUrl)
-                    logger.info { "Loaded Z.AI base URL from YAML: $baseUrl" }
-                }
-            }
-        }
-
-        // Load limits (only if not set)
-        yamlConfig.limits?.let { limits ->
-            limits.apiCallTimeout?.let { timeout ->
-                if (configRepository.get(KEY_API_CALL_TIMEOUT, ConfigScope.APP) == null) {
-                    set(KEY_API_CALL_TIMEOUT, timeout.toString())
-                    logger.info { "Loaded API call timeout from YAML: $timeout" }
-                }
-            }
-
-            limits.toolExecutionTimeout?.let { timeout ->
-                if (configRepository.get(KEY_TOOL_EXECUTION_TIMEOUT, ConfigScope.APP) == null) {
-                    set(KEY_TOOL_EXECUTION_TIMEOUT, timeout.toString())
-                    logger.info { "Loaded tool execution timeout from YAML: $timeout" }
-                }
-            }
-
-            limits.maxContextSize?.let { size ->
-                if (configRepository.get(KEY_MAX_CONTEXT_SIZE, ConfigScope.APP) == null) {
-                    set(KEY_MAX_CONTEXT_SIZE, size.toString())
-                    logger.info { "Loaded max context size from YAML: $size" }
-                }
-            }
-
-            limits.maxOutputSize?.let { size ->
-                if (configRepository.get(KEY_MAX_OUTPUT_SIZE, ConfigScope.APP) == null) {
-                    set(KEY_MAX_OUTPUT_SIZE, size.toString())
-                    logger.info { "Loaded max output size from YAML: $size" }
-                }
-            }
-
-            limits.maxFileSize?.let { size ->
-                if (configRepository.get(KEY_MAX_FILE_SIZE, ConfigScope.APP) == null) {
-                    set(KEY_MAX_FILE_SIZE, size.toString())
-                    logger.info { "Loaded max file size from YAML: $size" }
-                }
-            }
-        }
-
-        yamlConfig.terminal?.whitelist?.let { whitelist ->
-            if (configRepository.get(KEY_TERMINAL_WHITELIST, ConfigScope.APP) == null) {
-                val domainConfig = CommandWhitelistConfig(
-                    enabled = whitelist.enabled ?: true,
-                    mode = parseWhitelistMode(whitelist.mode) ?: WhitelistMode.WHITELIST_ONLY,
-                    allowedCommands = whitelist.commands?.map { toDomainAllowedCommand(it) } ?: emptyList(),
-                    globalBlockedPatterns = whitelist.globalBlockedPatterns ?: emptyList()
-                )
-                configRepository.set(
-                    key = KEY_TERMINAL_WHITELIST,
-                    value = gson.toJson(domainConfig),
-                    scope = ConfigScope.APP,
-                    taskId = null,
-                    description = "Terminal whitelist configuration"
-                )
-                logger.info { "Loaded terminal whitelist config from YAML" }
-            }
-            if (configRepository.get(KEY_TERMINAL_WHITELIST_ENABLED, ConfigScope.APP) == null && whitelist.enabled != null) {
-                set(KEY_TERMINAL_WHITELIST_ENABLED, whitelist.enabled.toString())
-                logger.info { "Loaded terminal whitelist enabled from YAML: ${whitelist.enabled}" }
-            }
-            if (configRepository.get(KEY_TERMINAL_WHITELIST_MODE, ConfigScope.APP) == null && whitelist.mode != null) {
-                val mode = parseWhitelistMode(whitelist.mode)
-                if (mode != null) {
-                    set(KEY_TERMINAL_WHITELIST_MODE, mode.name)
-                    logger.info { "Loaded terminal whitelist mode from YAML: ${mode.name}" }
-                }
-            }
-        }
-
-        logger.info { "Finished loading configuration from YAML" }
+        defaultsInitializer.loadFromYamlIfMissing()
+        validator.validateAll()
     }
 
-    /**
-     * Initialize default configuration values if they don't exist in the database.
-     * Called on plugin startup to ensure all required config keys have values.
-     *
-     * This method is idempotent - it won't overwrite existing config values.
-     */
     fun initializeDefaults() {
-        logger.info { "Initializing default configuration values (only missing keys)" }
-
-        migrateProviderKeysToLowercase()
-
-        val defaults = listOf(
-            Triple(KEY_UI_THINKING_ENABLED, "false", "Show LLM thinking process in UI"),
-            Triple(KEY_UI_NO_EGRESS_ENABLED, "false", "Block external network calls"),
-            Triple(KEY_UI_ORCHESTRATION_ENABLED, "true", "Enable orchestration UI toggle"),
-            Triple(KEY_UI_MULTI_AGENT_STRATEGY, "SINGLE", "Multi-agent orchestration strategy: SINGLE, PARALLEL, PIPELINE, ORCHESTRATOR"),
-            Triple(KEY_UI_INTENT_CLASSIFICATION_ENABLED, "false", "Enable LLM intent classification"),
-            Triple(KEY_UI_EXECUTION_MODE, "AUTO", "Execution mode (AUTO/INTERACTIVE)"),
-            Triple(KEY_UI_SELECTED_MODE, "CHAT", "Selected task mode (CHAT/PLAN/AGENT)"),
-            Triple(KEY_EMBEDDING_MODEL, "ollama/nomic-embed-text", "Model for embeddings"),
-            Triple(KEY_FORMAT_MARKDOWN, "true", "Format responses as markdown"),
-            Triple(KEY_STREAMING_ENABLED, "true", "Enable streaming responses"),
-            Triple(KEY_ADVANCED_VIEW, "false", "Show advanced UI options"),
-            Triple(KEY_TOOL_SUMMARY_ENABLED, "true", "Enable tool result summarization"),
-            Triple(KEY_TOOL_SUMMARY_MIN_LENGTH, "500", "Minimum tool output length for summarization"),
-            Triple(KEY_SECURITY_ALLOW_SYMLINKS, "false", "Allow symbolic links in PathSandbox (unsafe, opt-in)"),
-            Triple(KEY_PROVIDER_ZAI_BASE_URL, DEFAULT_ZAI_BASE_URL, "Base URL for Z.AI provider"),
-            Triple(KEY_RAG_EMBEDDING_CACHE_SIZE, DEFAULT_RAG_EMBEDDING_CACHE_SIZE.toString(), "Maximum embedding cache entries"),
-            Triple(KEY_RAG_CHUNKING_MODE, DEFAULT_RAG_CHUNKING_MODE, "RAG chunking mode (semantic or line_based)"),
-            Triple(KEY_RAG_SEARCH_CACHE_TTL_SECONDS, DEFAULT_RAG_SEARCH_CACHE_TTL_SECONDS.toString(), "TTL for cached @codebase search results in seconds"),
-            Triple(KEY_WORKING_MEMORY_MAX_FACTS, DEFAULT_WORKING_MEMORY_MAX_FACTS.toString(), "Maximum working memory facts stored per task"),
-            Triple(KEY_TASK_VERIFICATION_ENABLED, "false", "Enable task completion verification for AGENT mode"),
-            Triple(KEY_MAX_CONSECUTIVE_TOOL_ERRORS, DEFAULT_MAX_CONSECUTIVE_TOOL_ERRORS.toString(), "Max consecutive failures of the same tool+args before aborting (definitive loop). Varied args reset the counter."),
-            Triple(KEY_JSON_THINKING_XML_TAGS, DEFAULT_JSON_THINKING_XML_TAGS, "Comma-separated XML tags stripped before JSON extraction (e.g., thinking,think)")
-        )
-
-        var initializedCount = 0
-        for ((key, value, description) in defaults) {
-            if (configRepository.get(key, ConfigScope.APP) == null) {
-                configRepository.set(
-                    key = key,
-                    value = value,
-                    scope = ConfigScope.APP,
-                    taskId = null,
-                    description = description
-                )
-                logger.info { "Initialized default: $key = $value" }
-                initializedCount++
-            }
-        }
-
-        if (configRepository.get(KEY_TERMINAL_WHITELIST_ENABLED, ConfigScope.APP) == null) {
-            configRepository.set(
-                key = KEY_TERMINAL_WHITELIST_ENABLED,
-                value = "true",
-                scope = ConfigScope.APP,
-                taskId = null,
-                description = "Terminal whitelist enabled"
-            )
-            initializedCount++
-        }
-        if (configRepository.get(KEY_TERMINAL_WHITELIST_MODE, ConfigScope.APP) == null) {
-            configRepository.set(
-                key = KEY_TERMINAL_WHITELIST_MODE,
-                value = WhitelistMode.WHITELIST_ONLY.name,
-                scope = ConfigScope.APP,
-                taskId = null,
-                description = "Terminal whitelist mode"
-            )
-            initializedCount++
-        }
-
-        logger.info { "Finished initializing defaults: $initializedCount keys set" }
-        configCache.invalidateAll()
+        defaultsInitializer.initializeDefaults()
+        validator.validateAll()
     }
 
-    private fun migrateProviderKeysToLowercase() {
-        val legacyToCanonical = listOf(
-            "providers.Ollama.ollama_endpoint" to KEY_PROVIDER_OLLAMA_ENDPOINT,
-            "providers.Anthropic.anthropic_api_key" to KEY_PROVIDER_ANTHROPIC_API_KEY,
-            "providers.OpenAI.openai_api_key" to KEY_PROVIDER_OPENAI_API_KEY,
-            "providers.OpenRouter.openrouter_api_key" to KEY_PROVIDER_OPENROUTER_API_KEY,
-            "providers.Gemini.gemini_api_key" to KEY_PROVIDER_GEMINI_API_KEY,
-            "providers.LMStudio.lmstudio_api_key" to KEY_PROVIDER_LM_STUDIO_API_KEY,
-            "providers.LMStudio.lmstudio_base_url" to KEY_PROVIDER_LM_STUDIO_BASE_URL,
-            "providers.CustomOpenAI.custom_openai_api_key" to KEY_PROVIDER_CUSTOM_OPENAI_API_KEY,
-            "providers.CustomOpenAI.custom_openai_base_url" to KEY_PROVIDER_CUSTOM_OPENAI_BASE_URL,
-            "providers.CustomOpenAI.custom_openai_model" to KEY_PROVIDER_CUSTOM_OPENAI_MODEL,
-            "providers.ZAI.zai_api_key" to KEY_PROVIDER_ZAI_API_KEY,
-            "providers.ZAI.zai_base_url" to KEY_PROVIDER_ZAI_BASE_URL,
-            "ollama_endpoint" to KEY_PROVIDER_OLLAMA_ENDPOINT,
-            "anthropic_api_key" to KEY_PROVIDER_ANTHROPIC_API_KEY,
-            "openai_api_key" to KEY_PROVIDER_OPENAI_API_KEY,
-            "openrouter_api_key" to KEY_PROVIDER_OPENROUTER_API_KEY,
-            "gemini_api_key" to KEY_PROVIDER_GEMINI_API_KEY,
-            "lmstudio_api_key" to KEY_PROVIDER_LM_STUDIO_API_KEY,
-            "lmstudio_base_url" to KEY_PROVIDER_LM_STUDIO_BASE_URL,
-            "custom_openai_api_key" to KEY_PROVIDER_CUSTOM_OPENAI_API_KEY,
-            "custom_openai_base_url" to KEY_PROVIDER_CUSTOM_OPENAI_BASE_URL,
-            "custom_openai_model" to KEY_PROVIDER_CUSTOM_OPENAI_MODEL,
-            "zai_api_key" to KEY_PROVIDER_ZAI_API_KEY,
-            "zai_base_url" to KEY_PROVIDER_ZAI_BASE_URL
-        )
-
-        legacyToCanonical.forEach { (legacyKey, canonicalKey) ->
-            val legacyConfig = configRepository.get(legacyKey, ConfigScope.APP) ?: return@forEach
-            val canonicalConfig = configRepository.get(canonicalKey, ConfigScope.APP)
-
-            if (canonicalConfig == null) {
-                configRepository.set(
-                    key = canonicalKey,
-                    value = legacyConfig.value,
-                    scope = ConfigScope.APP,
-                    taskId = null,
-                    description = legacyConfig.description
-                )
-                logger.info { "Migrated provider key: $legacyKey -> $canonicalKey" }
-            } else {
-                logger.info { "Skipping provider key migration for $legacyKey (canonical already set)" }
-            }
-
-            configRepository.delete(legacyKey, ConfigScope.APP)
-        }
+    fun reloadFromYaml(): Int {
+        val updated = defaultsInitializer.reloadFromYaml()
+        validator.validateAll()
+        return updated
     }
 
     /**
-     * Reload all configuration from YAML file, overwriting existing DB values.
-     * Called manually via Settings UI "Reload from Local Config" button.
-     *
-     * This method is NOT idempotent - it will overwrite all config values with YAML values.
-     *
-     * @return Number of config keys updated
+     * Walk a [ConfigYaml] snapshot and apply it to DB.
+     * @param overwrite true = `reload` semantics (overwrite all, count updates);
+     *                  false = `load-if-missing` semantics (skip keys already present).
+     * @return Number of keys written.
      */
-    fun reloadFromYaml(): Int {
-        val yamlConfig = pl.jclab.refio.core.config.ConfigYaml.load()
-            ?: throw IllegalStateException("No YAML config file found at ${pl.jclab.refio.core.config.ConfigYaml.getConfigPath().absolutePath}")
-
-        logger.info { "Reloading all configuration from YAML file (overwriting DB)" }
-        var updatedCount = 0
-
-        // Reload default models
-        yamlConfig.models?.defaults?.let { defaults ->
-            defaults.chat?.let { model ->
-                val (provider, modelId) = parseModelString(model)
-                try {
-                    setDefaultModel(ModelOperation.DEFAULT, modelId, provider)
-                    updatedCount++
-                    logger.info { "Reloaded default chat model from YAML: $modelId" }
-                } catch (e: Exception) {
-                    logger.warn { "Failed to set chat model from YAML: ${e.message}" }
-                }
-            }
-
-            defaults.plan?.let { model ->
-                val (provider, modelId) = parseModelString(model)
-                try {
-                    setDefaultModel(ModelOperation.PLAN, modelId, provider)
-                    updatedCount++
-                    logger.info { "Reloaded default plan model from YAML: $modelId" }
-                } catch (e: Exception) {
-                    logger.warn { "Failed to set plan model from YAML: ${e.message}" }
-                }
-            }
-
-            defaults.coding?.let { model ->
-                val (provider, modelId) = parseModelString(model)
-                try {
-                    setDefaultModel(ModelOperation.CODING, modelId, provider)
-                    updatedCount++
-                    logger.info { "Reloaded default coding model from YAML: $modelId" }
-                } catch (e: Exception) {
-                    logger.warn { "Failed to set coding model from YAML: ${e.message}" }
-                }
-            }
-        }
-
-        // Reload model visibility (replace existing)
-        yamlConfig.models?.visibility?.let { visibility ->
-            val valueJson = gson.toJson(visibility)
-            configRepository.set(
-                key = KEY_MODELS_VISIBILITY,
-                value = valueJson,
-                scope = ConfigScope.APP,
-                taskId = null,
-                description = "Model visibility settings"
-            )
-            updatedCount++
-            logger.info { "Reloaded model visibility from YAML: ${visibility.size} entries" }
-        }
-
-        // Reload provider configs
-        yamlConfig.providers?.let { providers ->
-            providers.ollama?.endpoint?.let { endpoint ->
-                set(KEY_PROVIDER_OLLAMA_ENDPOINT, endpoint)
-                updatedCount++
-                logger.info { "Reloaded Ollama endpoint from YAML: $endpoint" }
-            }
-
-            providers.anthropic?.apiKey?.let { apiKey ->
-                set(KEY_PROVIDER_ANTHROPIC_API_KEY, apiKey)
-                updatedCount++
-                logger.info { "Reloaded Anthropic API key from YAML" }
-            }
-
-            providers.openai?.apiKey?.let { apiKey ->
-                set(KEY_PROVIDER_OPENAI_API_KEY, apiKey)
-                updatedCount++
-                logger.info { "Reloaded OpenAI API key from YAML" }
-            }
-
-            providers.openrouter?.apiKey?.let { apiKey ->
-                set(KEY_PROVIDER_OPENROUTER_API_KEY, apiKey)
-                updatedCount++
-                logger.info { "Reloaded OpenRouter API key from YAML" }
-            }
-
-            providers.gemini?.apiKey?.let { apiKey ->
-                set(KEY_PROVIDER_GEMINI_API_KEY, apiKey)
-                updatedCount++
-                logger.info { "Reloaded Gemini API key from YAML" }
-            }
-
-            providers.lmstudio?.apiKey?.let { apiKey ->
-                set(KEY_PROVIDER_LM_STUDIO_API_KEY, apiKey)
-                updatedCount++
-                logger.info { "Reloaded LM Studio API key from YAML" }
-            }
-
-            providers.lmstudio?.baseUrl?.let { baseUrl ->
-                set(KEY_PROVIDER_LM_STUDIO_BASE_URL, baseUrl)
-                updatedCount++
-                logger.info { "Reloaded LM Studio base URL from YAML: $baseUrl" }
-            }
-
-            providers.customOpenai?.apiKey?.let { apiKey ->
-                set(KEY_PROVIDER_CUSTOM_OPENAI_API_KEY, apiKey)
-                updatedCount++
-                logger.info { "Reloaded Custom OpenAI API key from YAML" }
-            }
-
-            providers.customOpenai?.baseUrl?.let { baseUrl ->
-                set(KEY_PROVIDER_CUSTOM_OPENAI_BASE_URL, baseUrl)
-                updatedCount++
-                logger.info { "Reloaded Custom OpenAI base URL from YAML: $baseUrl" }
-            }
-
-            providers.customOpenai?.model?.let { model ->
-                set(KEY_PROVIDER_CUSTOM_OPENAI_MODEL, model)
-                updatedCount++
-                logger.info { "Reloaded Custom OpenAI model from YAML: $model" }
-            }
-
-            providers.zai?.apiKey?.let { apiKey ->
-                set(KEY_PROVIDER_ZAI_API_KEY, apiKey)
-                updatedCount++
-                logger.info { "Reloaded Z.AI API key from YAML" }
-            }
-
-            providers.zai?.baseUrl?.let { baseUrl ->
-                set(KEY_PROVIDER_ZAI_BASE_URL, baseUrl)
-                updatedCount++
-                logger.info { "Reloaded Z.AI base URL from YAML: $baseUrl" }
-            }
-        }
-
-        // Reload limits
-        yamlConfig.limits?.let { limits ->
-            limits.apiCallTimeout?.let { timeout ->
-                set(KEY_API_CALL_TIMEOUT, timeout.toString())
-                updatedCount++
-                logger.info { "Reloaded API call timeout from YAML: $timeout" }
-            }
-
-            limits.toolExecutionTimeout?.let { timeout ->
-                set(KEY_TOOL_EXECUTION_TIMEOUT, timeout.toString())
-                updatedCount++
-                logger.info { "Reloaded tool execution timeout from YAML: $timeout" }
-            }
-
-            limits.maxContextSize?.let { size ->
-                set(KEY_MAX_CONTEXT_SIZE, size.toString())
-                updatedCount++
-                logger.info { "Reloaded max context size from YAML: $size" }
-            }
-
-            limits.maxOutputSize?.let { size ->
-                set(KEY_MAX_OUTPUT_SIZE, size.toString())
-                updatedCount++
-                logger.info { "Reloaded max output size from YAML: $size" }
-            }
-
-            limits.maxFileSize?.let { size ->
-                set(KEY_MAX_FILE_SIZE, size.toString())
-                updatedCount++
-                logger.info { "Reloaded max file size from YAML: $size" }
-            }
-        }
-
-        yamlConfig.terminal?.whitelist?.let { whitelist ->
-            val domainConfig = CommandWhitelistConfig(
-                enabled = whitelist.enabled ?: true,
-                mode = parseWhitelistMode(whitelist.mode) ?: WhitelistMode.WHITELIST_ONLY,
-                allowedCommands = whitelist.commands?.map { toDomainAllowedCommand(it) } ?: emptyList(),
-                globalBlockedPatterns = whitelist.globalBlockedPatterns ?: emptyList()
-            )
-            setTerminalWhitelistConfig(domainConfig, ConfigScope.APP)
-            updatedCount++
-            logger.info { "Reloaded terminal whitelist from YAML" }
-        }
-
-        logger.info { "Finished reloading configuration from YAML: $updatedCount keys updated" }
-        configCache.invalidateAll()
-        return updatedCount
-    }
-
-    private fun mergeTerminalWhitelistConfigs(
-        base: CommandWhitelistConfig,
-        override: CommandWhitelistConfig?
-    ): CommandWhitelistConfig {
-        if (override == null) {
-            return base
-        }
-
-        val commandsByProgram = linkedMapOf<String, AllowedCommand>()
-        base.allowedCommands.forEach { command ->
-            commandsByProgram[command.program.lowercase()] = command
-        }
-        override.allowedCommands.forEach { command ->
-            commandsByProgram[command.program.lowercase()] = command
-        }
-
-        return base.copy(
-            enabled = override.enabled,
-            mode = override.mode,
-            allowedCommands = commandsByProgram.values.toList(),
-            globalBlockedPatterns = (base.globalBlockedPatterns + override.globalBlockedPatterns).distinct()
-        )
-    }
-
-    private fun toDomainAllowedCommand(command: TerminalCommandConfig): AllowedCommand {
-        return AllowedCommand(
-            program = command.program,
-            description = command.description.orEmpty(),
-            aliases = command.aliases ?: emptyList(),
-            blockedFlags = command.blockedFlags ?: emptyList(),
-            blockedSubcommands = command.blockedSubcommands ?: emptyList(),
-            blockedArgPatterns = command.blockedArgPatterns ?: emptyList(),
-            allowedSubcommands = command.allowedSubcommands ?: emptyList(),
-            maxArgs = command.maxArgs ?: 50,
-            requireConfirmation = command.requireConfirmation ?: false
-        )
+    private fun applyYaml(yamlConfig: pl.jclab.refio.core.config.ConfigYaml, overwrite: Boolean): Int {
+        return yamlApplier.apply(yamlConfig, overwrite)
     }
 
-    private fun parseWhitelistMode(value: String?): WhitelistMode? {
-        if (value.isNullOrBlank()) {
-            return null
-        }
-        return runCatching { WhitelistMode.valueOf(value.trim().uppercase()) }.getOrNull()
-    }
+    private val yamlApplier = ConfigYamlApplier(
+        configRepository = configRepository,
+        setter = ::set,
+        modelsVisibilityGetter = ::getModelsVisibility,
+        defaultModelSetter = ::setDefaultModel,
+        modelStringParser = modelSelectionService::parseModelString,
+    )
 
-    private fun resolveProjectId(projectId: String?): String? = projectId ?: defaultProjectId
+    private fun taskScope(taskId: String?): ConfigScope =
+        if (taskId != null) ConfigScope.TASK else ConfigScope.APP
 
-    private fun invalidateConfigCache(key: String) {
-        configCache.invalidateByPrefix("typed:$key:")
-        configCache.invalidateByPrefix("raw:$key:")
-        configCache.invalidate("yaml:$key")
-    }
+    internal fun invalidateConfigCache(key: String) = resolver.invalidate(key)
 
-    private fun getConfigWithPrecedence(
+    internal fun getConfigWithPrecedence(
         key: String,
         taskId: String? = null,
-        projectId: String? = null
-    ) = configRepository.getWithPrecedence(
-        key = key,
-        taskId = taskId,
-        projectId = resolveProjectId(projectId)
-    )
+        projectId: String? = null,
+    ) = resolver.getConfigWithPrecedence(key, taskId, projectId)
 
     /**
      * Check if intelligent orchestration is enabled (US-028).
@@ -2373,7 +276,7 @@ class ConfigService(
      * @return true if verification should run
      */
     fun shouldVerifyTask(taskId: String? = null, iterationCount: Int = 0, writeToolsExecutedInTurn: Int = 0): Boolean {
-        val explicitSetting = getConfigWithPrecedence(KEY_TASK_VERIFICATION_ENABLED, taskId)
+        val explicitSetting = getConfigWithPrecedence(ConfigKeys.TASK_VERIFICATION_ENABLED.key, taskId)
         if (explicitSetting != null) {
             // User explicitly configured it - respect that
             return explicitSetting.value.toBoolean()
@@ -2384,33 +287,4 @@ class ConfigService(
         // Auto-enable for longer turns (>5 iterations) where hallucinations are more likely
         return iterationCount >= 5
     }
-
-    /**
-     * Set orchestration enabled setting (US-028).
-     *
-     * @param enabled true to enable orchestration, false to disable
-     * @param taskId Optional task ID for task-level config
-     */
-    fun setOrchestrationEnabled(enabled: Boolean, taskId: String? = null) {
-        val scope = if (taskId != null) ConfigScope.TASK
-                    else ConfigScope.APP
-        configRepository.set(
-            key = KEY_ORCHESTRATION_ENABLED,
-            value = enabled.toString(),
-            scope = scope,
-            taskId = taskId,
-            description = "Enable intelligent orchestration with reflection and plan adaptation"
-        )
-
-        logger.info { "Set orchestration_enabled = $enabled (scope=${scope.name})" }
-        invalidateConfigCache(KEY_ORCHESTRATION_ENABLED)
-    }
-
-    /**
-     * Data class for model configuration JSON storage.
-     */
-    private data class ModelConfigData(
-        val modelId: String? = null,
-        val provider: String? = null
-    )
 }
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/services/ConfigValidator.kt b/core/src/main/kotlin/pl/jclab/refio/core/services/ConfigValidator.kt
new file mode 100644
index 00000000..baddc982
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/services/ConfigValidator.kt
@@ -0,0 +1,43 @@
+package pl.jclab.refio.core.services
+
+import pl.jclab.refio.core.config.ConfigKey
+import pl.jclab.refio.core.config.ConfigKeys
+import pl.jclab.refio.core.logging.dualLogger
+
+/**
+ * Runs [ConfigKey.validator] for every key in the registry against the currently-resolved value.
+ *
+ * Invoked at startup (after defaults/YAML bootstrap) and after every YAML reload. A single
+ * failure aborts the run with [InvalidConfigException] — broken configuration should fail loud,
+ * not silently fall back to defaults.
+ */
+class ConfigValidator(private val configService: ConfigService) {
+
+    private val logger = dualLogger("ConfigValidator")
+
+    fun validateAll() {
+        val failures = mutableListOf<ValidationFailure>()
+        for (key in ConfigKeys.allKeys()) {
+            val failure = validate(key)
+            if (failure != null) failures += failure
+        }
+        if (failures.isNotEmpty()) {
+            logger.error { "Config validation failed for ${failures.size} key(s):" }
+            for (f in failures) logger.error { "  ${f.key}: rejected value '${f.rawValue}'" }
+            throw InvalidConfigException(failures)
+        }
+        logger.info { "Config validation passed (${ConfigKeys.allKeys().size} keys)" }
+    }
+
+    private fun <T> validate(key: ConfigKey<T>): ValidationFailure? {
+        val resolved = configService.getTyped(key)
+        return if (!key.validator(resolved)) {
+            ValidationFailure(key = key.key, rawValue = resolved?.toString() ?: "null")
+        } else null
+    }
+
+    data class ValidationFailure(val key: String, val rawValue: String)
+
+    class InvalidConfigException(val failures: List<ValidationFailure>) :
+        RuntimeException("Invalid config: " + failures.joinToString { "${it.key}='${it.rawValue}'" })
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/services/ConfigYamlApplier.kt b/core/src/main/kotlin/pl/jclab/refio/core/services/ConfigYamlApplier.kt
new file mode 100644
index 00000000..7eb41307
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/services/ConfigYamlApplier.kt
@@ -0,0 +1,114 @@
+package pl.jclab.refio.core.services
+
+import com.google.gson.Gson
+import pl.jclab.refio.core.api.ModelOperation
+import pl.jclab.refio.core.config.ConfigKeys
+import pl.jclab.refio.core.config.ConfigYaml
+import pl.jclab.refio.core.db.ConfigScope
+import pl.jclab.refio.core.db.repositories.ConfigRepository
+import pl.jclab.refio.core.logging.dualLogger
+private val logger = dualLogger("ConfigYamlApplier")
+private val gson = Gson()
+
+/**
+ * Materializes a parsed [ConfigYaml] into the APP-scoped config store.
+ *
+ * Extracted from [ConfigService] so the two-mode apply logic (overwrite on
+ * explicit reload vs. load-if-missing on startup) lives in one focused place
+ * rather than inside the larger service class.
+ */
+class ConfigYamlApplier(
+    private val configRepository: ConfigRepository,
+    private val setter: (key: String, value: String) -> Unit,
+    private val modelsVisibilityGetter: () -> Map<String, Boolean>,
+    private val defaultModelSetter: (op: ModelOperation, modelId: String, provider: String) -> Unit,
+    private val modelStringParser: (String) -> Pair<String, String>,
+) {
+    /**
+     * Apply YAML configuration to APP scope.
+     *
+     * @param yamlConfig Parsed YAML structure.
+     * @param overwrite When true, every present key is written (counting updates).
+     *                  When false, keys already in the DB are skipped ("load-if-missing").
+     * @return Number of keys written.
+     */
+    fun apply(yamlConfig: ConfigYaml, overwrite: Boolean): Int {
+        val verb = if (overwrite) "Reloaded" else "Loaded"
+        var count = 0
+
+        fun applyKey(key: String, value: String?, label: String) {
+            if (value == null) return
+            if (!overwrite && configRepository.get(key, ConfigScope.APP) != null) return
+            setter(key, value)
+            count++
+            logger.info { "$verb $label from YAML: $value" }
+        }
+
+        fun applyDefaultModel(key: String, op: ModelOperation, model: String?, label: String) {
+            if (model == null) return
+            if (!overwrite && configRepository.get(key, ConfigScope.APP) != null) return
+            val (provider, modelId) = modelStringParser(model)
+            try {
+                defaultModelSetter(op, modelId, provider)
+                count++
+                logger.info { "$verb default $label model from YAML: $modelId" }
+            } catch (e: Exception) {
+                logger.warn { "Failed to set $label model from YAML: ${e.message}" }
+            }
+        }
+
+        yamlConfig.models?.defaults?.let { d ->
+            applyDefaultModel(ConfigKeys.DEFAULT_MODEL_CHAT.key, ModelOperation.DEFAULT, d.chat, "chat")
+            applyDefaultModel(ConfigKeys.DEFAULT_MODEL_PLAN.key, ModelOperation.PLAN, d.plan, "plan")
+            applyDefaultModel(ConfigKeys.DEFAULT_MODEL_AGENT.key, ModelOperation.CODING, d.coding, "coding")
+        }
+
+        yamlConfig.models?.visibility?.let { visibility ->
+            val finalMap = if (overwrite) {
+                visibility
+            } else {
+                val existing = modelsVisibilityGetter().toMutableMap()
+                val additions = visibility.filterKeys { !existing.containsKey(it) }
+                if (additions.isEmpty()) return@let
+                additions.forEach { (id, show) -> logger.info { "Loaded model visibility from YAML: $id -> $show" } }
+                existing.apply { putAll(additions) }
+            }
+            configRepository.set(
+                key = ConfigKeys.MODELS_VISIBILITY.key,
+                value = gson.toJson(finalMap),
+                scope = ConfigScope.APP,
+                taskId = null,
+                description = "Model visibility settings",
+            )
+            if (overwrite) {
+                count++
+                logger.info { "Reloaded model visibility from YAML: ${visibility.size} entries" }
+            }
+        }
+
+        yamlConfig.providers?.let { p ->
+            applyKey(ConfigKeys.PROVIDER_OLLAMA_ENDPOINT.key, p.ollama?.endpoint, "Ollama endpoint")
+            applyKey(ConfigKeys.PROVIDER_ANTHROPIC_API_KEY.key, p.anthropic?.apiKey, "Anthropic API key")
+            applyKey(ConfigKeys.PROVIDER_OPENAI_API_KEY.key, p.openai?.apiKey, "OpenAI API key")
+            applyKey(ConfigKeys.PROVIDER_OPENROUTER_API_KEY.key, p.openrouter?.apiKey, "OpenRouter API key")
+            applyKey(ConfigKeys.PROVIDER_GEMINI_API_KEY.key, p.gemini?.apiKey, "Gemini API key")
+            applyKey(ConfigKeys.PROVIDER_LM_STUDIO_API_KEY.key, p.lmstudio?.apiKey, "LM Studio API key")
+            applyKey(ConfigKeys.PROVIDER_LM_STUDIO_BASE_URL.key, p.lmstudio?.baseUrl, "LM Studio base URL")
+            applyKey(ConfigKeys.PROVIDER_CUSTOM_OPENAI_API_KEY.key, p.genericOpenai?.apiKey, "Custom OpenAI API key")
+            applyKey(ConfigKeys.PROVIDER_CUSTOM_OPENAI_BASE_URL.key, p.genericOpenai?.baseUrl, "Custom OpenAI base URL")
+            applyKey(ConfigKeys.PROVIDER_CUSTOM_OPENAI_MODEL.key, p.genericOpenai?.model, "Custom OpenAI model")
+            applyKey(ConfigKeys.PROVIDER_ZAI_API_KEY.key, p.zai?.apiKey, "Z.AI API key")
+            applyKey(ConfigKeys.PROVIDER_ZAI_BASE_URL.key, p.zai?.baseUrl, "Z.AI base URL")
+        }
+
+        yamlConfig.limits?.let { l ->
+            applyKey(ConfigKeys.API_CALL_TIMEOUT.key, l.apiCallTimeout?.toString(), "API call timeout")
+            applyKey(ConfigKeys.TOOL_EXECUTION_TIMEOUT.key, l.toolExecutionTimeout?.toString(), "tool execution timeout")
+            applyKey(ConfigKeys.MAX_CONTEXT_SIZE.key, l.maxContextSize?.toString(), "max context size")
+            applyKey(ConfigKeys.MAX_OUTPUT_SIZE.key, l.maxOutputSize?.toString(), "max output size")
+            applyKey(ConfigKeys.MAX_FILE_SIZE.key, l.maxFileSize?.toString(), "max file size")
+        }
+
+        return count
+    }
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/services/ConfigYamlBuilder.kt b/core/src/main/kotlin/pl/jclab/refio/core/services/ConfigYamlBuilder.kt
new file mode 100644
index 00000000..765bc3e3
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/services/ConfigYamlBuilder.kt
@@ -0,0 +1,148 @@
+package pl.jclab.refio.core.services
+
+import pl.jclab.refio.core.api.ModelOperation
+import pl.jclab.refio.core.config.*
+import pl.jclab.refio.core.db.ConfigScope
+import pl.jclab.refio.core.db.repositories.ConfigRepository
+import pl.jclab.refio.core.llm.adapters.ZAIUrls
+import pl.jclab.refio.core.utils.GsonInstance.gson
+import pl.jclab.refio.core.config.ConfigKeys
+
+/**
+ * Builds a [ConfigYaml] snapshot from the current database values.
+ *
+ * Extracted from [ConfigService] to keep responsibilities narrow:
+ * ConfigService handles reads/writes + caching; ConfigYamlBuilder composes
+ * the cross-section YAML representation used by `exportToYaml`.
+ */
+internal class ConfigYamlBuilder(
+    private val configService: ConfigService,
+    private val configRepository: ConfigRepository
+) {
+    fun build(includeApiKeys: Boolean): ConfigYaml = ConfigYaml(
+        general = buildGeneral(),
+        providers = buildProviders(includeApiKeys),
+        models = buildModels(),
+        limits = buildLimits(),
+        advanced = buildAdvanced(),
+        tools = buildTools(),
+        rag = buildRag(),
+        ui = buildUi()
+    )
+
+    private fun buildGeneral() = GeneralConfig(
+        formatMarkdown = configService.getTyped(ConfigKeys.FORMAT_MARKDOWN),
+        streamingEnabled = configService.getTyped(ConfigKeys.STREAMING_ENABLED),
+        advancedView = configService.getTyped(ConfigKeys.ADVANCED_VIEW)
+    )
+
+    private fun buildProviders(includeApiKeys: Boolean): ProvidersConfig {
+        val ollamaEndpoint = configService.get(ConfigKeys.PROVIDER_OLLAMA_ENDPOINT.key)
+        val lmstudioBaseUrl = configService.get(ConfigKeys.PROVIDER_LM_STUDIO_BASE_URL.key)
+        val customOpenAIBaseUrl = configService.get(ConfigKeys.PROVIDER_CUSTOM_OPENAI_BASE_URL.key)
+
+        return ProvidersConfig(
+            ollama = OllamaConfig(
+                endpoint = ollamaEndpoint ?: configService.getTyped(ConfigKeys.PROVIDER_OLLAMA_ENDPOINT),
+                contextSize = configService.getTyped(ConfigKeys.PROVIDER_OLLAMA_CONTEXT_SIZE),
+                keepAlive = configService.getTyped(ConfigKeys.PROVIDER_OLLAMA_KEEP_ALIVE)
+            ),
+            anthropic = if (includeApiKeys) AnthropicConfig(apiKey = configService.get(ConfigKeys.PROVIDER_ANTHROPIC_API_KEY.key)) else null,
+            openai = if (includeApiKeys) OpenAIConfig(apiKey = configService.get(ConfigKeys.PROVIDER_OPENAI_API_KEY.key)) else null,
+            openrouter = if (includeApiKeys) OpenRouterConfig(apiKey = configService.get(ConfigKeys.PROVIDER_OPENROUTER_API_KEY.key)) else null,
+            gemini = if (includeApiKeys) GeminiConfig(apiKey = configService.get(ConfigKeys.PROVIDER_GEMINI_API_KEY.key)) else null,
+            lmstudio = LMStudioConfig(
+                apiKey = if (includeApiKeys) configService.get(ConfigKeys.PROVIDER_LM_STUDIO_API_KEY.key) else null,
+                baseUrl = lmstudioBaseUrl,
+                contextSize = configService.getTyped(ConfigKeys.PROVIDER_LM_STUDIO_CONTEXT_SIZE)
+            ),
+            genericOpenai = GenericOpenAIConfig(
+                apiKey = if (includeApiKeys) configService.get(ConfigKeys.PROVIDER_CUSTOM_OPENAI_API_KEY.key) else null,
+                baseUrl = customOpenAIBaseUrl,
+                model = configService.get(ConfigKeys.PROVIDER_CUSTOM_OPENAI_MODEL.key)
+            ),
+            zai = ZAIConfig(
+                apiKey = if (includeApiKeys) configService.get(ConfigKeys.PROVIDER_ZAI_API_KEY.key) else null,
+                baseUrl = configService.get(ConfigKeys.PROVIDER_ZAI_BASE_URL.key) ?: ZAIUrls.DEFAULT
+            )
+        )
+    }
+
+    private fun buildModels(): ModelsConfig {
+        val (chatModel, chatProvider) = configService.getDefaultModel(ModelOperation.DEFAULT)
+        val (planModel, planProvider) = configService.getDefaultModel(ModelOperation.PLAN)
+        val (codingModel, codingProvider) = configService.getDefaultModel(ModelOperation.CODING)
+        val (weakModel, weakProvider) = configService.getDefaultModel(ModelOperation.WEAK)
+        val (embeddingModel, embeddingProvider) = configService.getDefaultModel(ModelOperation.EMBEDDING)
+
+        return ModelsConfig(
+            defaults = ModelDefaultsConfig(
+                chat = "$chatProvider/$chatModel",
+                plan = "$planProvider/$planModel",
+                coding = "$codingProvider/$codingModel",
+                weak = "$weakProvider/$weakModel",
+                embedding = "$embeddingProvider/$embeddingModel"
+            ),
+            visibility = configService.getModelsVisibility()
+        )
+    }
+
+    private fun buildLimits() = LimitsConfig(
+        apiCallTimeout = configService.getTyped(ConfigKeys.API_CALL_TIMEOUT),
+        toolExecutionTimeout = configService.getTyped(ConfigKeys.TOOL_EXECUTION_TIMEOUT),
+        streamingReadTimeout = configService.getTyped(ConfigKeys.STREAMING_READ_TIMEOUT),
+        streamingRequestTimeout = configService.getTyped(ConfigKeys.STREAMING_REQUEST_TIMEOUT),
+        maxContextSize = configService.getTyped(ConfigKeys.MAX_CONTEXT_SIZE),
+        maxOutputSize = configService.getTyped(ConfigKeys.MAX_OUTPUT_SIZE),
+        maxFileSize = configService.getTyped(ConfigKeys.MAX_FILE_SIZE)
+    )
+
+    private fun buildAdvanced() = AdvancedConfig(
+        noEgressDefault = configService.getTyped(ConfigKeys.NO_EGRESS_DEFAULT),
+        readOnlyMode = configService.getTyped(ConfigKeys.READ_ONLY_MODE),
+        autoOptimizePercentage = configService.getTyped(ConfigKeys.AUTO_OPTIMIZE_PERCENTAGE)
+    )
+
+    private fun buildTools(): ToolsConfig {
+        val config = configRepository.get(ConfigKeys.TOOLS_PERMISSIONS.key, ConfigScope.APP)
+        @Suppress("UNCHECKED_CAST")
+        val permissions: Map<String, Boolean> = if (config != null) {
+            (gson.fromJson(config.value, Map::class.java) as? Map<String, Boolean>) ?: emptyMap()
+        } else emptyMap()
+        if (permissions.isEmpty()) return ToolsConfig()
+
+        val yamlPermissions = permissions.mapValues { (_, enabled) ->
+            ToolPermissionConfig(
+                planMode = if (enabled) "ON" else "OFF",
+                agentMode = if (enabled) "ON" else "OFF"
+            )
+        }
+        return ToolsConfig(permissions = yamlPermissions)
+    }
+
+    private fun buildRag() = RagConfig(
+        enabled = configService.getTyped(ConfigKeys.RAG_ENABLED),
+        indexOnStartup = configService.getTyped(ConfigKeys.RAG_INDEX_ON_STARTUP),
+        autoIndexOnContextBuild = configService.getTyped(ConfigKeys.RAG_AUTO_INDEX_ON_CONTEXT),
+        maxFileSizeMB = configService.getTyped(ConfigKeys.RAG_MAX_FILE_SIZE_MB),
+        maxChunksPerFile = configService.getTyped(ConfigKeys.RAG_MAX_CHUNKS_PER_FILE),
+        indexBatchSize = configService.getTyped(ConfigKeys.RAG_INDEX_BATCH_SIZE),
+        embeddingsBatchSize = configService.getTyped(ConfigKeys.RAG_EMBEDDINGS_BATCH_SIZE),
+        cacheTtlMs = configService.getTyped(ConfigKeys.RAG_CACHE_TTL_MS),
+        maxConcurrentJobs = configService.getTyped(ConfigKeys.RAG_MAX_CONCURRENT_JOBS),
+        ignoredDirectories = configService.getTyped(ConfigKeys.RAG_IGNORED_DIRECTORIES),
+        searchSimilarityThreshold = configService.getTyped(ConfigKeys.RAG_SEARCH_SIMILARITY_THRESHOLD),
+        searchTopK = configService.getTyped(ConfigKeys.RAG_SEARCH_TOP_K),
+        searchHybridEnabled = configService.getTyped(ConfigKeys.RAG_SEARCH_HYBRID_ENABLED),
+        searchSemanticWeight = configService.getTyped(ConfigKeys.RAG_SEARCH_SEMANTIC_WEIGHT),
+        searchIncludeContextChunks = configService.getTyped(ConfigKeys.RAG_SEARCH_INCLUDE_CONTEXT_CHUNKS)
+    )
+
+    private fun buildUi() = UiConfig(
+        thinkingEnabled = configService.getTyped(ConfigKeys.UI_THINKING_ENABLED),
+        noEgressEnabled = configService.getTyped(ConfigKeys.UI_NO_EGRESS_ENABLED),
+        executionMode = configService.getTyped(ConfigKeys.UI_EXECUTION_MODE),
+        selectedMode = configService.getTyped(ConfigKeys.UI_SELECTED_MODE),
+        selectedModel = configService.getTyped(ConfigKeys.UI_SELECTED_MODEL)
+    )
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/services/ContextBudgetResolver.kt b/core/src/main/kotlin/pl/jclab/refio/core/services/ContextBudgetResolver.kt
new file mode 100644
index 00000000..5cf87def
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/services/ContextBudgetResolver.kt
@@ -0,0 +1,79 @@
+package pl.jclab.refio.core.services
+
+import pl.jclab.refio.core.api.ModelOperation
+import pl.jclab.refio.core.config.ConfigKeys
+import pl.jclab.refio.core.services.context.ContextBudget
+import pl.jclab.refio.core.services.context.ContextSection
+
+/**
+ * Owns the math behind context-budget resolution — the slice of [ConfigService]
+ * responsible for translating raw config values into a [ContextBudget] snapshot
+ * that the prompt builder can apply.
+ *
+ * Kept as a small helper so the logic around per-section overrides,
+ * compact-prompt auto-detection, and provider-specific context sizes lives in
+ * one place instead of being scattered across ConfigService.
+ */
+class ContextBudgetResolver(private val configService: ConfigService) {
+
+    /**
+     * Resolve context budget for prompt building.
+     */
+    fun getContextBudget(taskId: String? = null, operation: ModelOperation? = null): ContextBudget {
+        val inputRatio = getInputRatio(taskId)
+        val contextSize = resolveContextSize(operation, taskId)
+        val totalOverride = getTotalTokensOverride(taskId)
+        val overrides = getSectionOverrides(taskId)
+
+        return ContextBudget.forContextSize(
+            contextSize = contextSize,
+            inputRatio = inputRatio,
+            overrides = overrides,
+            totalTokensOverride = totalOverride,
+        )
+    }
+
+    /**
+     * Whether compact (shorter) system prompts should be used.
+     * Auto-detects based on resolved context size for the operation:
+     * context <= 48000 tokens → compact mode (saves ~40% prompt tokens).
+     */
+    fun isCompactPrompts(operation: ModelOperation? = null, taskId: String? = null): Boolean {
+        val contextSize = resolveContextSize(operation, taskId)
+        return contextSize <= ConfigService.COMPACT_PROMPT_THRESHOLD
+    }
+
+    private fun getInputRatio(taskId: String?): Double {
+        val raw = configService.get(ConfigKeys.CONTEXT_BUDGET_INPUT_RATIO.key, taskId = taskId)
+        return raw?.toDoubleOrNull() ?: ConfigKeys.CONTEXT_BUDGET_INPUT_RATIO.default
+    }
+
+    private fun getTotalTokensOverride(taskId: String?): Int? {
+        val raw = configService.get(ConfigKeys.CONTEXT_BUDGET_TOTAL_TOKENS.key, taskId = taskId)
+        return raw?.toIntOrNull()
+    }
+
+    private fun getSectionOverrides(taskId: String?): Map<ContextSection, Int> {
+        val overrides = mutableMapOf<ContextSection, Int>()
+        ContextSection.values().forEach { section ->
+            val key = "${ConfigService.KEY_CONTEXT_BUDGET_SECTION_PREFIX}${section.name.lowercase()}"
+            val value = configService.get(key, taskId = taskId)?.toIntOrNull()
+            if (value != null && value > 0) {
+                overrides[section] = value
+            }
+        }
+        return overrides
+    }
+
+    private fun resolveContextSize(operation: ModelOperation?, taskId: String?): Int {
+        val fallback = configService.getTyped(ConfigKeys.MAX_CONTEXT_SIZE, taskId)
+        if (operation == null) return fallback
+
+        val (_, provider) = configService.getModel(operation, taskId)
+        return when (provider.lowercase()) {
+            "ollama" -> configService.getTyped(ConfigKeys.PROVIDER_OLLAMA_CONTEXT_SIZE)
+            "lmstudio" -> configService.getTyped(ConfigKeys.PROVIDER_LM_STUDIO_CONTEXT_SIZE)
+            else -> fallback
+        }
+    }
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/services/ContextService.kt b/core/src/main/kotlin/pl/jclab/refio/core/services/ContextService.kt
index b4d921bb..bf9bfa25 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/services/ContextService.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/services/ContextService.kt
@@ -20,7 +20,6 @@ import java.nio.file.Path
 import java.time.Instant
 
 private val logger = dualLogger("ContextService")
-private const val MAX_RAG_FRAGMENTS = 15
 private const val CONVERSATION_SUMMARY_METADATA_TYPE = ConversationContextBuilder.CONVERSATION_SUMMARY_METADATA_TYPE
 
 // Context budget limits
@@ -76,17 +75,20 @@ class ContextService(
     private val configService: ConfigService,
     private val workingMemoryService: WorkingMemoryService? = null,
     private val conversationSummaryService: ConversationSummaryService? = null,
-    ragSearchService: RagSearchService? = null,
-    ragSearchModel: String? = null,
-    ragSearchProvider: String? = null
+    /**
+     * Opaque platform project handle (IntelliJ Project or null for CLI). Propagated
+     * to IDE-specific context providers via [ContextProviderExtras]. Injected once
+     * at construction because it is stable for the lifetime of a [ContextService]
+     * (CoreApiRouter is per-project).
+     */
+    private val platformProject: Any? = null,
 ) {
     private val projectInstructionsLoader = ProjectInstructionsLoader()
-    private val ragContextLoader = RagContextLoader(configService, ragSearchService, ragSearchModel, ragSearchProvider)
     private val mcpContextLoader = McpContextLoader()
     private val projectContextSummarizer = ProjectContextSummarizer()
     private val conversationContextBuilder = ConversationContextBuilder()
     private val taskContextExtractor = TaskContextExtractor()
-    private val contextReferenceResolver = ContextReferenceResolver(fileAnalyzerService, configService, chatMessageRepository)
+    private val contextReferenceResolver = ContextReferenceResolver(fileAnalyzerService, configService, chatMessageRepository, platformProject)
     private val pruner: ContextPruner = ContextPruner(configService)
     private val formatter: ContextFormatter = ContextFormatter(configService)
 
@@ -129,9 +131,6 @@ class ContextService(
         subtaskRepository: SubtaskRepository,
         fileAnalyzerService: FileAnalyzerService? = null,
         configService: ConfigService,
-        ragSearchService: RagSearchService? = null,
-        ragSearchModel: String? = null,
-        ragSearchProvider: String? = null
     ) : this(
         projectAnalyzer = projectAnalyzer,
         taskRepository = taskRepository,
@@ -141,15 +140,8 @@ class ContextService(
         configService = configService,
         workingMemoryService = null,
         conversationSummaryService = null,
-        ragSearchService = ragSearchService,
-        ragSearchModel = ragSearchModel,
-        ragSearchProvider = ragSearchProvider
     )
 
-    fun updateRagSearchConfig(service: RagSearchService?, model: String?, provider: String?) {
-        ragContextLoader.updateRagSearchConfig(service, model, provider)
-    }
-
     /**
      * Build comprehensive project context for LLM using rich DTOs.
      * REFACTORED to use new ProjectContextDTO from models/context package.
@@ -162,7 +154,6 @@ class ContextService(
     suspend fun buildProjectContext(
         projectRoot: Path,
         taskId: String,
-        project: Any? = null,
         query: String? = null,
         userContextRefs: List<ContextReference> = emptyList()
     ): ProjectContextDTO {
@@ -210,8 +201,8 @@ class ContextService(
             }
         }.takeLast(CONTEXT_CONVERSATION_HISTORY_LIMIT)
 
-        // 6. Build previous subtasks data (PHASE 3)
-        val (previousSubtaskSummaries, completedFiles) = taskContextExtractor.buildPreviousSubtasksData(subtasks)
+        // 6. Build completed files data
+        val completedFiles = taskContextExtractor.buildCompletedFiles(subtasks)
 
         // 6a. Build structured executed steps for RECENT_WORK (ADR 0041)
         val executedSteps = taskContextExtractor.buildExecutedSteps(subtasks)
@@ -349,17 +340,12 @@ class ContextService(
             )
         }
 
-        // 8. Load hybrid RAG fragments (code + documentation)
-        val ragFragments = ragContextLoader.loadRagFragments(
-            projectRoot = projectRoot,
-            query = query
-        )
         val mcpResources = mcpContextLoader.loadMcpResources(projectRoot, query)
 
         // 9. Resolve user context references (from @ mentions)
         val resolvedUserContext = if (dedupedUserContextRefs.isNotEmpty()) {
             logger.info { "[CONTEXT] Resolving ${dedupedUserContextRefs.size} user context reference(s)" }
-            contextReferenceResolver.resolveAndConvertUserContextRefs(dedupedUserContextRefs, projectRoot, project, query)
+            contextReferenceResolver.resolveAndConvertUserContextRefs(dedupedUserContextRefs, projectRoot, query)
         } else {
             emptyList()
         }
@@ -390,15 +376,11 @@ class ContextService(
 
             // Work history (from PHASE 3)
             completedFiles = completedFiles,
-            previousSubtasks = previousSubtaskSummaries,
             executedSteps = executedSteps,
 
             // User requirements (extracted from task description - PHASE 2)
             userRequirements = userRequirements,
 
-            // RAG (Retrieval-Augmented Generation) context - unified fragments
-            ragFragments = ragFragments,
-
             // User-provided context (from @ mentions)
             userContextRefs = resolvedUserContext,
 
@@ -433,7 +415,7 @@ class ContextService(
      * ADR 0040 ORDER (2025-12-03):
      * 1. PROJECT CONTEXT FIRST - Agent must know the project before getting the task
      * 2. TASK & REQUIREMENTS - What needs to be done
-     * 3. USER CONTEXT & RAG - Supporting information
+     * 3. USER CONTEXT - Supporting information
      * 4. HISTORY - Previous work and conversation
      */
     /**
@@ -593,7 +575,7 @@ class ContextService(
         //   - RECENT_WORK never benefited from unused budget from STABLE_CONTEXT /
         //     PROJECT_CONTEXT / REFERENCE sections (they often leave 10–30k tokens on
         //     the table because project context is small relative to total budget)
-        //   - The redistribution flowed only into CONVERSATION / RAG / USER_CONTEXT,
+        //   - The redistribution flowed only into CONVERSATION / USER_CONTEXT,
         //     which in turn couldn't use it because of other caps (Bug 2B).
         //
         // New flow: redistribute the unused stable-layer budget BEFORE adding the
@@ -655,15 +637,9 @@ class ContextService(
         addSection(ContextSection.USER_CONTEXT, userContextParts.joinToString("\n\n"))
         // Second redistribution pass: now that accumulated + ephemeral layers have
         // reported their actual usage, any budget still unused is pushed into
-        // CONVERSATION / RAG so they can benefit from slack.
+        // CONVERSATION so they can benefit from slack.
         val redistributedBudget = budgetAfterStable.redistributeUnused(actualUsage)
 
-        // TIER 3: SUPPLEMENTARY CONTEXT
-        if (context.ragFragments.isNotEmpty()) {
-            val ragBudget = minOf(redistributedBudget.budgetFor(ContextSection.RAG_FRAGMENTS), remainingTokens)
-            addSection(ContextSection.RAG_FRAGMENTS, formatter.buildRagFragmentsSection(context), ragBudget)
-        }
-
         if (context.conversationHistory.isNotEmpty()) {
             val conversationBudget = minOf(redistributedBudget.budgetFor(ContextSection.CONVERSATION), remainingTokens)
             addSection(
@@ -745,15 +721,13 @@ class ContextService(
      *
      * @param taskId Task ID
      * @param projectRoot Project root path
-     * @param project IntelliJ Project instance (optional)
      * @param userContextRefs User-provided @ mentions
-     * @param query Current user query for RAG
+     * @param query Current user query
      * @return Pair of (projectContextPrompt, messages list)
      */
     suspend fun buildAgentTurnMessages(
         taskId: String,
         projectRoot: Path,
-        project: Any? = null,
         userContextRefs: List<ContextReference> = emptyList(),
         query: String? = null
     ): AgentTurnMessagesResult {
@@ -792,7 +766,6 @@ class ContextService(
             val projectContext = buildProjectContext(
                 projectRoot = projectRoot,
                 taskId = taskId,
-                project = project,
                 query = query,
                 userContextRefs = userContextRefs
             )
@@ -1370,7 +1343,6 @@ class ContextService(
             "USER_PROVIDED_CONTEXT" to "user_context",
             "WORKING_MEMORY" to "working_memory",
             "MCP_RESOURCES" to "mcp_resources",
-            "RAG_FRAGMENTS" to "rag_fragments",
             "CONVERSATION_HISTORY" to "conversation",
             "RECENT_WORK" to "recent_work",
             "SUBTASKS_STATUS" to "subtasks",
@@ -1394,7 +1366,6 @@ class ContextService(
             "user_context" to "User Context",
             "working_memory" to "Working Memory",
             "mcp_resources" to "MCP Resources",
-            "rag_fragments" to "RAG Fragments",
             "conversation" to "Conversation History",
             "recent_work" to "Recent Work",
             "subtasks" to "Subtasks",
@@ -1450,7 +1421,7 @@ class ContextService(
             val openTag = "<$tagName>"
             val closeTag = "</$tagName>"
             val sectionChars = content.length + openTag.length + if (hasClosingTag) closeTag.length else 0
-            val tokens = (sectionChars / 4).toInt().coerceAtLeast(1)
+            val tokens = (sectionChars / 4).coerceAtLeast(1)
 
             result[key] = ContextSectionTokenInfo(
                 name = sectionNames[key] ?: key,
@@ -1601,7 +1572,7 @@ class ContextService(
 data class AgentTurnMessagesResult(
     /** Conversation messages ready for LLM (filtered and formatted) */
     val messages: List<LLMMessage>,
-    /** Project context prompt (project analysis, RAG, user @ mentions) */
+    /** Project context prompt (project analysis, user @ mentions) */
     val projectContextPrompt: String,
     /** Size of conversation history before filtering */
     val historySize: Int
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/services/ModelSelectionService.kt b/core/src/main/kotlin/pl/jclab/refio/core/services/ModelSelectionService.kt
new file mode 100644
index 00000000..1d344d35
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/services/ModelSelectionService.kt
@@ -0,0 +1,313 @@
+package pl.jclab.refio.core.services
+
+import pl.jclab.refio.core.api.ModelOperation
+import pl.jclab.refio.core.config.ConfigKeys
+import pl.jclab.refio.core.db.ConfigScope
+import pl.jclab.refio.core.llm.getModelConfigFromCache
+import pl.jclab.refio.core.logging.dualLogger
+import pl.jclab.refio.core.services.ConfigService.Companion.FALLBACK_EMBEDDING_MODEL
+import pl.jclab.refio.core.services.ConfigService.Companion.FALLBACK_EMBEDDING_PROVIDER
+import pl.jclab.refio.core.services.ConfigService.Companion.FALLBACK_MODEL
+import pl.jclab.refio.core.services.ConfigService.Companion.FALLBACK_PROVIDER
+import pl.jclab.refio.core.services.ConfigService.Companion.FALLBACK_WEAK_MODEL
+import pl.jclab.refio.core.services.ConfigService.Companion.FALLBACK_WEAK_PROVIDER
+import pl.jclab.refio.core.services.ConfigService.Companion.INHERIT_MODEL_VALUE
+import pl.jclab.refio.core.utils.GsonInstance.gson
+
+/**
+ * Model-selection logic extracted from [ConfigService].
+ *
+ * Owns:
+ * - Logical-slot resolution (`getModel`, `getDefaultModel`, `getStrongModel`, `getWeakModel`, `getEmbeddingModel`).
+ * - Slot writers (`setDefaultModel`, `setDefaultModelAllModes`, `setEmbeddingModel`).
+ * - Model dropdown visibility (`getModel*Visibility`, `setModel*Visibility`).
+ * - The small parsing/inheritance helpers for `provider/model` strings and `inherit` sentinels.
+ *
+ * Kept in the same package as [ConfigService] so it can use the `internal` access points
+ * (`configRepository`, `yamlLoader`, `getConfigWithPrecedence`, `invalidateConfigCache`,
+ * `setTyped`) without widening them to the whole module.
+ */
+internal class ModelSelectionService(private val configService: ConfigService) {
+    private val logger = dualLogger("ModelSelectionService")
+
+    fun getModel(
+        operation: ModelOperation,
+        taskId: String? = null,
+        projectId: String? = null
+    ): Pair<String, String> {
+        val selectedModel = configService.get(
+            key = ConfigKeys.UI_SELECTED_MODEL.key,
+            taskId = taskId,
+            projectId = projectId
+        )
+
+        if (selectedModel != null && selectedModel.isNotBlank() && !selectedModel.equals("auto", ignoreCase = true)) {
+            val (providerFromString, modelIdFromString) = parseModelString(selectedModel)
+            logger.info { "Using user-selected model for $operation: $modelIdFromString (provider=$providerFromString)" }
+            return Pair(modelIdFromString, providerFromString)
+        }
+
+        return getDefaultModel(operation, taskId, projectId)
+    }
+
+    fun getDefaultModel(
+        operation: ModelOperation,
+        taskId: String? = null,
+        projectId: String? = null
+    ): Pair<String, String> {
+        val key = configKeyForOperation(operation)
+        val config = configService.getConfigWithPrecedence(key = key, taskId = taskId, projectId = projectId)
+        val label = operation.name.lowercase()
+
+        if (operation == ModelOperation.EMBEDDING) {
+            if (config?.value != null) {
+                val (provider, model) = parseModelString(config.value)
+                logger.info { "Using embedding model from DB: $model (provider=$provider)" }
+                return Pair(model, provider)
+            }
+            val yamlModel = configService.yamlLoader.getDefaultEmbeddingModel()
+            if (yamlModel != null) {
+                val (provider, model) = parseModelString(yamlModel)
+                logger.info { "Using embedding model from YAML: $model (provider=$provider)" }
+                return Pair(model, provider)
+            }
+            return fallbackModelForOperation(operation)
+        }
+
+        if (config != null) {
+            val data = gson.fromJson(config.value, ModelConfigData::class.java)
+            if (operation != ModelOperation.DEFAULT && isInheritedModelConfig(data)) {
+                logger.info { "Using inherited $label model -> default model" }
+                return getDefaultModel(ModelOperation.DEFAULT, taskId, projectId)
+            }
+            if (data.modelId != null && data.provider != null) {
+                logger.info { "Using $label model from DB: ${data.modelId}" }
+                return Pair(data.modelId, data.provider)
+            }
+        }
+
+        val yamlModel = when (operation) {
+            ModelOperation.DEFAULT -> configService.yamlLoader.getDefaultChatModel()
+            ModelOperation.PLAN -> configService.yamlLoader.getDefaultPlanModel()
+            ModelOperation.CODING -> configService.yamlLoader.getDefaultCodingModel()
+            ModelOperation.WEAK -> configService.yamlLoader.getDefaultWeakModel()
+            ModelOperation.STRONG -> configService.yamlLoader.getDefaultStrongModel()
+            ModelOperation.EMBEDDING -> error("unreachable")
+        }
+        if (yamlModel != null) {
+            val (provider, model) = parseModelString(yamlModel)
+            logger.info { "Using $label model from YAML: $model (provider=$provider)" }
+            return Pair(model, provider)
+        }
+
+        if (operation == ModelOperation.STRONG) {
+            throw IllegalStateException("STRONG model not configured and has no fallback")
+        }
+
+        val fallback = fallbackModelForOperation(operation)
+        logger.info { "No config found for $operation, using fallback: ${fallback.first}" }
+        return fallback
+    }
+
+    fun getStrongModel(
+        taskId: String? = null,
+        projectId: String? = null
+    ): Pair<String, String>? {
+        val config = configService.getConfigWithPrecedence(key = ConfigKeys.STRONG_MODEL.key, taskId = taskId, projectId = projectId)
+        if (config != null) {
+            val data = gson.fromJson(config.value, ModelConfigData::class.java)
+            if (isInheritedModelConfig(data)) {
+                logger.info { "Using inherited strong model -> default model" }
+                return getDefaultModel(ModelOperation.DEFAULT, taskId, projectId)
+            }
+            if (data.modelId != null && data.provider != null) {
+                logger.info { "Using strong model from DB: ${data.modelId}" }
+                return Pair(data.modelId, data.provider)
+            }
+        }
+
+        val yamlModel = configService.yamlLoader.getDefaultStrongModel()
+        if (yamlModel != null) {
+            val (provider, model) = parseModelString(yamlModel)
+            logger.info { "Using strong model from YAML: $model (provider=$provider)" }
+            return Pair(model, provider)
+        }
+
+        return null
+    }
+
+    @Suppress("UNUSED_PARAMETER")
+    fun setDefaultModel(
+        operation: ModelOperation,
+        modelId: String,
+        provider: String,
+        taskId: String? = null,
+        _userId: String? = null
+    ) {
+        if (operation != ModelOperation.EMBEDDING) {
+            val modelConfig = getModelConfigFromCache(modelId)
+            if (modelConfig != null && modelConfig.provider != provider) {
+                return
+            }
+        }
+
+        if (operation == ModelOperation.EMBEDDING) {
+            setEmbeddingModel("$provider/$modelId")
+            logger.info { "Set embedding model to $provider/$modelId" }
+            return
+        }
+
+        val key = configKeyForOperation(operation)
+        val scope = if (operation != ModelOperation.WEAK && taskId != null) ConfigScope.TASK else ConfigScope.APP
+        val effectiveTaskId = if (scope == ConfigScope.TASK) taskId else null
+        val description = if (operation == ModelOperation.WEAK) {
+            "Cheap model for auxiliary operations"
+        } else {
+            "Default model for $operation operation"
+        }
+        configService.configRepository.set(
+            key = key,
+            value = gson.toJson(ModelConfigData(modelId, provider)),
+            scope = scope,
+            taskId = effectiveTaskId,
+            description = description
+        )
+        configService.invalidateConfigCache(key)
+        logger.info { "Set ${scope.name} config $key = $modelId" }
+    }
+
+    fun setDefaultModelAllModes(
+        modelId: String,
+        provider: String,
+        taskId: String? = null,
+        userId: String? = null
+    ) {
+        val modelConfig = getModelConfigFromCache(modelId)
+        if (modelConfig != null && modelConfig.provider != provider) {
+            return
+        }
+
+        for (operation in listOf(ModelOperation.DEFAULT, ModelOperation.PLAN, ModelOperation.CODING)) {
+            setDefaultModel(
+                operation = operation,
+                modelId = modelId,
+                provider = provider,
+                taskId = taskId,
+                _userId = userId
+            )
+        }
+
+        logger.info { "Set default model for ALL modes: $modelId (provider=$provider)" }
+    }
+
+    fun getModelVisibility(modelId: String): Boolean {
+        val config = configService.configRepository.get(ConfigKeys.MODELS_VISIBILITY.key, ConfigScope.APP)
+        if (config != null) {
+            @Suppress("UNCHECKED_CAST")
+            val visibilityMap = gson.fromJson(config.value, Map::class.java) as? Map<String, Boolean>
+            return visibilityMap?.get(modelId) ?: true
+        }
+        return true
+    }
+
+    fun setModelVisibility(modelId: String, showInDropdown: Boolean) {
+        setModelsVisibility(getModelsVisibility().toMutableMap().apply { put(modelId, showInDropdown) })
+        logger.info { "Updated model visibility: $modelId -> $showInDropdown" }
+    }
+
+    fun setModelsVisibility(visibilityMap: Map<String, Boolean>) {
+        val valueJson = gson.toJson(visibilityMap)
+        configService.configRepository.set(
+            key = ConfigKeys.MODELS_VISIBILITY.key,
+            value = valueJson,
+            scope = ConfigScope.APP,
+            taskId = null,
+            description = "Model visibility settings"
+        )
+
+        logger.info { "Updated model visibility for ${visibilityMap.size} models" }
+        configService.invalidateConfigCache(ConfigKeys.MODELS_VISIBILITY.key)
+    }
+
+    fun getModelsVisibility(): Map<String, Boolean> {
+        val config = configService.configRepository.get(ConfigKeys.MODELS_VISIBILITY.key, ConfigScope.APP)
+        if (config != null) {
+            @Suppress("UNCHECKED_CAST")
+            val visibilityMap = gson.fromJson(config.value, Map::class.java) as? Map<String, Boolean>
+            if (visibilityMap != null && visibilityMap.isNotEmpty()) {
+                return visibilityMap
+            }
+        }
+
+        val yamlVisibility = configService.yamlLoader.getModelsVisibility()
+        if (yamlVisibility != null && yamlVisibility.isNotEmpty()) {
+            return yamlVisibility
+        }
+
+        return emptyMap()
+    }
+
+    fun getWeakModel(): Pair<String, String> = getDefaultModel(ModelOperation.WEAK)
+
+    fun getEmbeddingModel(): String {
+        val (modelId, provider) = getDefaultModel(ModelOperation.EMBEDDING)
+        return "$provider/$modelId"
+    }
+
+    fun setEmbeddingModel(model: String) {
+        configService.setTyped(ConfigKeys.EMBEDDING_MODEL, model)
+    }
+
+    /**
+     * Parse model string that might be in format "provider/model" or just "model".
+     *
+     * Examples:
+     * - "ollama/qwen2.5:7b" -> ("ollama", "qwen2.5:7b")
+     * - "qwen2.5:7b" -> ("ollama", "qwen2.5:7b") // fallback to ollama
+     * - "gpt-4.1-mini" -> ("openai", "gpt-4.1-mini") // fallback to openai
+     */
+    fun parseModelString(modelString: String): Pair<String, String> {
+        if (modelString.contains("/")) {
+            val parts = modelString.split("/", limit = 2)
+            return Pair(parts[0], parts[1])
+        }
+
+        val provider = when {
+            modelString.startsWith("gpt-") -> "openai"
+            modelString.startsWith("glm-") -> "zai"
+            modelString.startsWith("claude-") -> "anthropic"
+            else -> FALLBACK_PROVIDER
+        }
+
+        return Pair(provider, modelString)
+    }
+
+    private fun configKeyForOperation(operation: ModelOperation): String = when (operation) {
+        ModelOperation.DEFAULT -> ConfigKeys.DEFAULT_MODEL_CHAT.key
+        ModelOperation.PLAN -> ConfigKeys.DEFAULT_MODEL_PLAN.key
+        ModelOperation.CODING -> ConfigKeys.DEFAULT_MODEL_AGENT.key
+        ModelOperation.WEAK -> ConfigKeys.WEAK_MODEL.key
+        ModelOperation.EMBEDDING -> ConfigKeys.EMBEDDING_MODEL.key
+        ModelOperation.STRONG -> ConfigKeys.STRONG_MODEL.key
+    }
+
+    private fun isInheritedModelConfig(data: ModelConfigData): Boolean {
+        return data.modelId.equals(INHERIT_MODEL_VALUE, ignoreCase = true) &&
+                data.provider.equals(INHERIT_MODEL_VALUE, ignoreCase = true)
+    }
+
+    private fun fallbackModelForOperation(operation: ModelOperation): Pair<String, String> = when (operation) {
+        ModelOperation.DEFAULT,
+        ModelOperation.PLAN,
+        ModelOperation.CODING -> Pair(FALLBACK_MODEL, FALLBACK_PROVIDER)
+        ModelOperation.WEAK -> Pair(FALLBACK_WEAK_MODEL, FALLBACK_WEAK_PROVIDER)
+        ModelOperation.EMBEDDING -> Pair(FALLBACK_EMBEDDING_MODEL, FALLBACK_EMBEDDING_PROVIDER)
+        ModelOperation.STRONG -> throw IllegalStateException("STRONG model has no fallback — must be explicitly configured")
+    }
+
+    /** JSON storage format for model selection entries in DB. */
+    internal data class ModelConfigData(
+        val modelId: String? = null,
+        val provider: String? = null
+    )
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/services/PlanningService.kt b/core/src/main/kotlin/pl/jclab/refio/core/services/PlanningService.kt
index 4cd490fd..3b996e75 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/services/PlanningService.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/services/PlanningService.kt
@@ -52,7 +52,6 @@ class PlanningService(
     private val toolPermissionsService: ToolPermissionsService? = null,
     private val contextService: ContextService? = null,
     private val projectRoot: java.nio.file.Path? = null,
-    private val ideProject: Any? = null
 ) {
     private val fallbackProjectId: String =
         projectRoot?.let { ProjectIdGenerator.generate(it) } ?: LEGACY_PROJECT_ID
@@ -152,8 +151,8 @@ class PlanningService(
         val userPrompt = buildUserPrompt(sanitizedInput)
 
         // Read UI state from config table (single source of truth)
-        val thinkingEnabled = configService.get(ConfigService.KEY_UI_THINKING_ENABLED)?.toBoolean() ?: false
-        val noEgressEnabled = configService.get(ConfigService.KEY_UI_NO_EGRESS_ENABLED)?.toBoolean() ?: false
+        val thinkingEnabled = configService.get(ConfigKeys.UI_THINKING_ENABLED.key)?.toBoolean() ?: false
+        val noEgressEnabled = configService.get(ConfigKeys.UI_NO_EGRESS_ENABLED.key)?.toBoolean() ?: false
 
         // Build messages - context is passed separately via contextContent parameter
         // This ensures proper order: [system] System, [user] Context, [user] User prompt
@@ -365,7 +364,6 @@ class PlanningService(
                 val projectContext = contextService.buildProjectContext(
                     projectRoot = projectRoot,
                     taskId = task.id,
-                    project = ideProject,
                     query = ragUserPrompt,
                     userContextRefs = contextRefs
                 )
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/services/ProcessManager.kt b/core/src/main/kotlin/pl/jclab/refio/core/services/ProcessManager.kt
new file mode 100644
index 00000000..b9423a23
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/services/ProcessManager.kt
@@ -0,0 +1,73 @@
+package pl.jclab.refio.core.services
+
+import kotlinx.coroutines.Dispatchers
+import kotlinx.coroutines.withContext
+import pl.jclab.refio.core.logging.dualLogger
+import java.util.UUID
+import java.util.concurrent.ConcurrentHashMap
+
+private val logger = dualLogger("ProcessManager")
+
+/**
+ * Manages long-running background processes.
+ * Processes are stored by process_id and can be monitored or stopped.
+ */
+class ProcessManager {
+    data class ManagedProcess(
+        val processId: String,
+        val command: String,
+        val process: Process,
+        val startedAt: Long = System.currentTimeMillis()
+    )
+
+    private val processes = ConcurrentHashMap<String, ManagedProcess>()
+
+    fun start(command: String, workingDir: java.io.File): ManagedProcess {
+        val processId = UUID.randomUUID().toString().take(8)
+        val pb = ProcessBuilder()
+            .command(shellWrap(command))
+            .directory(workingDir)
+            .redirectErrorStream(true)
+        val process = pb.start()
+        val managed = ManagedProcess(processId, command, process)
+        processes[processId] = managed
+        logger.info { "Started background process $processId: $command" }
+        return managed
+    }
+
+    fun get(processId: String): ManagedProcess? = processes[processId]
+
+    fun stop(processId: String) {
+        val managed = processes.remove(processId) ?: return
+        managed.process.destroyForcibly()
+        logger.info { "Stopped process $processId" }
+    }
+
+    fun listRunning(): List<ManagedProcess> =
+        processes.values.filter { it.process.isAlive }.toList()
+
+    suspend fun readOutput(processId: String, maxLines: Int = 500): Pair<List<String>, Boolean> =
+        withContext(Dispatchers.IO) {
+            val managed = processes[processId]
+                ?: return@withContext Pair(emptyList(), false)
+
+            val lines = mutableListOf<String>()
+            val reader = managed.process.inputStream.bufferedReader()
+
+            while (lines.size < maxLines && reader.ready()) {
+                val line = reader.readLine() ?: break
+                lines.add(line)
+            }
+
+            val isRunning = managed.process.isAlive
+            if (!isRunning) processes.remove(processId)
+
+            Pair(lines, isRunning)
+        }
+
+    private fun shellWrap(cmd: String): List<String> =
+        if (System.getProperty("os.name").lowercase().contains("windows"))
+            listOf("cmd.exe", "/c", cmd)
+        else
+            listOf("/bin/sh", "-c", cmd)
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/services/PromptsService.kt b/core/src/main/kotlin/pl/jclab/refio/core/services/PromptsService.kt
index 69d1537b..38632543 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/services/PromptsService.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/services/PromptsService.kt
@@ -1,6 +1,6 @@
 package pl.jclab.refio.core.services
 
-import pl.jclab.refio.api.models.SlashCommand
+import pl.jclab.refio.api.models.SlashPrompt
 import pl.jclab.refio.core.db.Prompt
 import pl.jclab.refio.core.db.PromptType
 import pl.jclab.refio.core.db.repositories.PromptsRepository
@@ -18,7 +18,7 @@ private val logger = dualLogger("PromptsService")
  * 2. USER - DB isCustom=true (UI edits) > ~/.refio/prompts/ *.md
  * 3. PROJECT - .refio/prompts/ *.md (highest priority)
  *
- * Slash commands and rules remain in the database.
+ * Slash prompts and rules remain in the database.
  */
 class PromptsService(
     private val promptsRepository: PromptsRepository,
@@ -61,13 +61,13 @@ class PromptsService(
     )
 
     /**
-     * Initialize defaults: seed slash commands to DB and clean up stale system prompt records.
+     * Initialize defaults: seed slash prompts to DB and clean up stale system prompt records.
      * System prompts are now loaded from MD files - no DB seeding needed.
      */
     fun initializeDefaults() {
         logger.info { "Initializing default prompts" }
 
-        initializeBuiltinCommands()
+        initializeBuiltinSlashPrompts()
         cleanupNonCustomSystemPrompts()
 
         logger.info { "Default prompts initialized" }
@@ -90,8 +90,48 @@ class PromptsService(
         }
     }
 
+    /**
+     * Get all system prompts.
+     *
+     * Merges DB custom overrides (isCustom=true) with file-based defaults from PromptRegistry
+     * so the UI always sees the full set of system prompt types, with isCustom flagging overrides.
+     */
     fun getSystemPrompts(): List<Prompt> {
-        return promptsRepository.findSystemPrompts()
+        val dbByType = promptsRepository.findSystemPrompts(enabledOnly = false)
+            .associateBy { it.type }
+
+        return PromptType.SYSTEM_PROMPT_TYPES.mapNotNull { type ->
+            val dbPrompt = dbByType[type]
+            if (dbPrompt != null && dbPrompt.isCustom) {
+                dbPrompt
+            } else {
+                buildDefaultSystemPrompt(type, dbPrompt)
+            }
+        }
+    }
+
+    private fun buildDefaultSystemPrompt(type: PromptType, existing: Prompt?): Prompt? {
+        val displayName = systemPromptNames[type] ?: return null
+        val fileName = promptTypeToName(type) ?: return null
+
+        val projectDef = promptRegistry.getProjectFile(fileName)
+        val userDef = promptRegistry.getUserFile(fileName)
+        val builtin = promptRegistry.getBuiltin(fileName)
+        val def = projectDef ?: userDef ?: builtin ?: return null
+
+        val now = System.currentTimeMillis()
+        return Prompt(
+            id = existing?.id ?: "default:${type.name}",
+            name = displayName,
+            type = type,
+            content = def.content,
+            description = def.description.ifBlank { existing?.description ?: "Default system prompt" },
+            isCustom = false,
+            isEnabled = true,
+            orderIndex = existing?.orderIndex ?: 0,
+            createdAt = existing?.createdAt ?: now,
+            updatedAt = existing?.updatedAt ?: now
+        )
     }
 
     /**
@@ -102,18 +142,18 @@ class PromptsService(
     }
 
     /**
-     * Get all enabled slash commands
+     * Get all enabled slash prompts
      */
-    fun getEnabledCommands(): List<Prompt> {
-        return promptsRepository.findByType(PromptType.SLASH_COMMAND, enabledOnly = true)
+    fun getEnabledSlashPrompts(): List<Prompt> {
+        return promptsRepository.findByType(PromptType.SLASH_PROMPT, enabledOnly = true)
     }
 
     /**
-     * Find slash command by name (e.g., "/refactor")
+     * Find slash prompt by name (e.g., "/refactor")
      */
-    fun findCommand(commandName: String): Prompt? {
-        val normalizedName = if (commandName.startsWith("/")) commandName else "/$commandName"
-        return promptsRepository.findByNameAndType(normalizedName, PromptType.SLASH_COMMAND)
+    fun findSlashPrompt(name: String): Prompt? {
+        val normalizedName = if (name.startsWith("/")) name else "/$name"
+        return promptsRepository.findByNameAndType(normalizedName, PromptType.SLASH_PROMPT)
     }
 
     /**
@@ -147,16 +187,16 @@ class PromptsService(
     }
 
     /**
-     * Create or update a slash command
+     * Create or update a slash prompt
      */
-    fun saveCommand(
+    fun saveSlashPrompt(
         id: String? = null,
         name: String,
         content: String,
         description: String? = null,
         isEnabled: Boolean = true
     ): Prompt {
-        val normalizedName = normalizeCommandName(name)
+        val normalizedName = normalizeSlashPromptName(name)
         return if (id != null && promptsRepository.exists(id)) {
             promptsRepository.update(
                 id = id,
@@ -169,7 +209,7 @@ class PromptsService(
         } else {
             promptsRepository.create(
                 name = normalizedName,
-                type = PromptType.SLASH_COMMAND,
+                type = PromptType.SLASH_PROMPT,
                 content = content,
                 description = description,
                 isCustom = true,
@@ -249,7 +289,7 @@ class PromptsService(
     }
 
     /**
-     * Delete rule or command by ID
+     * Delete rule or slash prompt by ID
      */
     fun delete(id: String): Boolean {
         return promptsRepository.delete(id)
@@ -349,7 +389,7 @@ class PromptsService(
         return systemPromptNames[type]
     }
 
-    private fun normalizeCommandName(name: String): String {
+    private fun normalizeSlashPromptName(name: String): String {
         return if (name.startsWith("/")) name else "/$name"
     }
 
@@ -367,33 +407,33 @@ class PromptsService(
         }
     }
 
-    private fun initializeBuiltinCommands() {
-        SlashCommand.BUILTINS.forEachIndexed { index, command ->
-            val normalizedName = normalizeCommandName(command.name)
-            val existing = promptsRepository.findByNameAndType(normalizedName, PromptType.SLASH_COMMAND)
+    private fun initializeBuiltinSlashPrompts() {
+        SlashPrompt.BUILTINS.forEachIndexed { index, slashPrompt ->
+            val normalizedName = normalizeSlashPromptName(slashPrompt.name)
+            val existing = promptsRepository.findByNameAndType(normalizedName, PromptType.SLASH_PROMPT)
 
             if (existing == null) {
                 promptsRepository.create(
                     name = normalizedName,
-                    type = PromptType.SLASH_COMMAND,
-                    content = command.template,
-                    description = command.description,
+                    type = PromptType.SLASH_PROMPT,
+                    content = slashPrompt.template,
+                    description = slashPrompt.description,
                     isCustom = false,
                     isEnabled = true,
                     orderIndex = index
                 )
-                logger.info { "Created built-in slash command: ${command.name}" }
+                logger.info { "Created built-in slash prompt: ${slashPrompt.name}" }
             } else if (!existing.isCustom) {
                 promptsRepository.update(
                     id = existing.id,
-                    content = command.template,
-                    description = command.description,
+                    content = slashPrompt.template,
+                    description = slashPrompt.description,
                     isEnabled = true,
                     orderIndex = index
                 )
-                logger.debug { "Updated built-in slash command: ${command.name}" }
+                logger.debug { "Updated built-in slash prompt: ${slashPrompt.name}" }
             } else {
-                logger.debug { "Skipping built-in command update because user customized it: ${command.name}" }
+                logger.debug { "Skipping built-in slash prompt update because user customized it: ${slashPrompt.name}" }
             }
         }
     }
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/services/RagEmbeddingService.kt b/core/src/main/kotlin/pl/jclab/refio/core/services/RagEmbeddingService.kt
index f9b81b5a..24d18bf7 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/services/RagEmbeddingService.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/services/RagEmbeddingService.kt
@@ -41,7 +41,7 @@ class RagEmbeddingService(
     private val configService: ConfigService? = null
 ) {
     companion object {
-        private const val BATCH_SIZE = ConfigService.DEFAULT_RAG_EMBEDDING_BATCH_SIZE
+        private val BATCH_SIZE = ConfigKeys.RAG_EMBEDDINGS_BATCH_SIZE.default
         private const val PER_CHUNK_THROTTLE_MS = 200L
         private const val MIN_COOLDOWN_DELAY_MS = 1_000L
     }
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/services/StepPlanner.kt b/core/src/main/kotlin/pl/jclab/refio/core/services/StepPlanner.kt
index d515d544..4a32385e 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/services/StepPlanner.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/services/StepPlanner.kt
@@ -182,8 +182,8 @@ class StepPlanner(
             logger.info { "[PLANNER] Using model=$model, provider=$provider" }
 
             // 6. Read UI state
-            val thinkingEnabled = configService.get(ConfigService.KEY_UI_THINKING_ENABLED)?.toBoolean() ?: false
-            val noEgressEnabled = configService.get(ConfigService.KEY_UI_NO_EGRESS_ENABLED)?.toBoolean() ?: false
+            val thinkingEnabled = configService.get(ConfigKeys.UI_THINKING_ENABLED.key)?.toBoolean() ?: false
+            val noEgressEnabled = configService.get(ConfigKeys.UI_NO_EGRESS_ENABLED.key)?.toBoolean() ?: false
 
             // 7. RFC 0032: Use unified complete() with stream flag
             val startTime = System.currentTimeMillis()
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/services/TokenEstimator.kt b/core/src/main/kotlin/pl/jclab/refio/core/services/TokenEstimator.kt
index 2fe042e2..645b424a 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/services/TokenEstimator.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/services/TokenEstimator.kt
@@ -2,6 +2,7 @@ package pl.jclab.refio.core.services
 
 import pl.jclab.refio.core.llm.LLMMessage
 import pl.jclab.refio.core.logging.dualLogger
+import pl.jclab.refio.core.services.turn.TurnPrompt
 
 private val logger = dualLogger("TokenEstimator")
 
@@ -32,7 +33,7 @@ class PromptTokenEstimator {
             "gemini" to 1.05,
             "openrouter" to 1.0,
             "lmstudio" to 1.0,
-            "custom_openai" to 1.0,
+            "generic_openai" to 1.0,
             "zai" to 1.0
         )
     }
@@ -133,7 +134,7 @@ class PromptTokenEstimator {
             "openai" -> 128000
             "gemini" -> 1000000
             "zai" -> 128000
-            "ollama", "lmstudio", "openrouter", "custom_openai" -> 32768
+            "ollama", "lmstudio", "openrouter", "generic_openai" -> 32768
             else -> 128000
         }
     }
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/services/ToolPermissionsService.kt b/core/src/main/kotlin/pl/jclab/refio/core/services/ToolPermissionsService.kt
index 73f5c7f0..ab429d6d 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/services/ToolPermissionsService.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/services/ToolPermissionsService.kt
@@ -1,5 +1,6 @@
 package pl.jclab.refio.core.services
 
+import pl.jclab.refio.core.config.ConfigKeys
 import pl.jclab.refio.core.db.ConfigScope
 import pl.jclab.refio.core.db.TaskMode
 import pl.jclab.refio.core.db.repositories.ConfigRepository
@@ -59,7 +60,7 @@ class ToolPermissionsService(
 ) {
 
     companion object {
-        const val CONFIG_KEY = ConfigService.KEY_TOOLS_PERMISSIONS
+        val CONFIG_KEY = ConfigKeys.TOOLS_PERMISSIONS.key
 
         /**
          * Override'y dla narzędzi, których defaulty odbiegają od reguły
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/services/ToolResultSummarizer.kt b/core/src/main/kotlin/pl/jclab/refio/core/services/ToolResultSummarizer.kt
index 5b3d98e9..1cdab496 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/services/ToolResultSummarizer.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/services/ToolResultSummarizer.kt
@@ -127,6 +127,24 @@ class ToolResultSummarizer(
             )
         }
 
+        // Skip eager summarization for read_file outputs under 512KB.
+        // These are deferred to RECENT_WORK budget-driven compression which
+        // can make better decisions about how much to keep based on available
+        // context window. See design spec: 2026-04-12-agent-execution-reliability.
+        if (toolName == "read_file" && rawOutput.length < 524_288) {
+            logger.info {
+                "[SUMMARIZER_SKIP] read_file output (${rawOutput.length} chars) below " +
+                    "lazy-compression threshold (512KB), deferring to RECENT_WORK budget."
+            }
+            return ToolResultSummary(
+                summary = rawOutput,
+                wasSummarized = false,
+                tokensIn = 0,
+                tokensOut = 0,
+                cost = 0.0
+            )
+        }
+
         // Higher skip threshold for RAW_OUTPUT (run_code, run_terminal_command, http_request).
         // These tool outputs typically contain literal data the model needs to read
         // verbatim (IDs, counts, error bodies, HTTP status codes, response JSON). Below
@@ -205,11 +223,11 @@ class ToolResultSummarizer(
         // Higher limits reduce finishReason=length failures with weaker/larger models
         // that tend to be verbose (e.g. qwen3.5, glm-5).
         val maxTokens = when (contextType) {
-            SummaryContextType.CODE_ANALYSIS -> 4096   // Keep more details for code
-            SummaryContextType.DATA_FILE -> 3072       // Preserve structure + samples
-            SummaryContextType.RAW_OUTPUT -> 4096      // Preserve numbers/IDs/errors verbatim
-            SummaryContextType.SEARCH_RESULT -> 2048   // Medium for search
-            SummaryContextType.GENERAL -> 1536         // Standard for others
+            SummaryContextType.CODE_ANALYSIS -> 8192   // Keep more details for code
+            SummaryContextType.DATA_FILE -> 4096       // Preserve structure + samples
+            SummaryContextType.RAW_OUTPUT -> 8192      // Preserve numbers/IDs/errors verbatim
+            SummaryContextType.SEARCH_RESULT -> 4096   // Medium for search
+            SummaryContextType.GENERAL -> 4096         // Standard for others
         }
 
         // Explicitly pass thinking=false to ensure all output goes to content.
@@ -554,7 +572,7 @@ Guidelines:
          * it could possibly save. Acts as a lower bound on TOOL_SUMMARY_MIN_LENGTH —
          * raising the config above this is fine, lowering it below has no effect.
          */
-        const val GLOBAL_MIN_SKIP_THRESHOLD = 1_024
+        const val GLOBAL_MIN_SKIP_THRESHOLD = 2048
 
         /**
          * Below this size, RAW_OUTPUT (run_code / run_terminal_command / http_request)
@@ -564,7 +582,7 @@ Guidelines:
          * HTTP status codes, error payloads) have the same characteristics and are
          * routed through the same path. Not configurable on purpose.
          */
-        const val RAW_OUTPUT_SKIP_THRESHOLD = 4_096
+        const val RAW_OUTPUT_SKIP_THRESHOLD = 8192
 
         /**
          * Below this size, DATA_FILE (read_file on .md/.json/.csv etc.) is NEVER
@@ -572,13 +590,13 @@ Guidelines:
          * data files — it paraphrases numbers, drops samples, and wastes 20-35s per
          * call on local models. Matches RAW_OUTPUT_SKIP_THRESHOLD.
          */
-        const val DATA_FILE_SKIP_THRESHOLD = 4_096
+        const val DATA_FILE_SKIP_THRESHOLD = 8192
 
         /** Trailing bytes of stdout copied verbatim into the head of a deterministic RAW_OUTPUT summary. */
-        const val RAW_OUTPUT_TAIL_BYTES = 1_500
+        const val RAW_OUTPUT_TAIL_BYTES = 4096
 
         /** Leading bytes of stdout copied verbatim after the tail block in a deterministic RAW_OUTPUT summary. */
-        const val RAW_OUTPUT_HEAD_BYTES = 600
+        const val RAW_OUTPUT_HEAD_BYTES = 4096
 
         /**
          * Total characters of raw output sent to the summarizer LLM. When the output
@@ -587,7 +605,7 @@ Guidelines:
          * — exit codes, API response bodies, stack traces — always survives the
          * compression step regardless of what the WEAK summarizer model decides.
          */
-        const val SUMMARIZER_INPUT_BUDGET = 16_394
+        const val SUMMARIZER_INPUT_BUDGET = 16394
 
         /**
          * File extensions treated as DATA_FILE rather than CODE_ANALYSIS.
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/services/analysis/EmbeddingsService.kt b/core/src/main/kotlin/pl/jclab/refio/core/services/analysis/EmbeddingsService.kt
index 006c3239..67e829f7 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/services/analysis/EmbeddingsService.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/services/analysis/EmbeddingsService.kt
@@ -1,5 +1,7 @@
 package pl.jclab.refio.core.services.analysis
 
+import pl.jclab.refio.core.config.ConfigKeys
+
 import com.github.benmanes.caffeine.cache.Cache
 import com.github.benmanes.caffeine.cache.Caffeine
 import pl.jclab.refio.core.services.ConfigService
@@ -18,7 +20,7 @@ class EmbeddingsService(
 ) {
 
     companion object {
-        private const val DEFAULT_CACHE_SIZE = ConfigService.DEFAULT_RAG_EMBEDDING_CACHE_SIZE
+        private val DEFAULT_CACHE_SIZE = ConfigKeys.RAG_EMBEDDING_CACHE_SIZE.default
     }
 
     private val logger = dualLogger("EmbeddingsService")
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/services/context/ContextBudget.kt b/core/src/main/kotlin/pl/jclab/refio/core/services/context/ContextBudget.kt
index f976c57e..c336b6b3 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/services/context/ContextBudget.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/services/context/ContextBudget.kt
@@ -95,7 +95,7 @@ data class ContextBudget(
         // truncate when space is tight. Giving them slack from under-used stable
         // sections (PROJECT_CONTEXT, REFERENCE) means a multi-turn agent session
         // gets to carry more of its own history forward. CONVERSATION comes next
-        // (user-facing chat), then USER_CONTEXT and RAG_FRAGMENTS as supplementary.
+        // (user-facing chat), then USER_CONTEXT as supplementary.
         //
         // Bug 2C fix: previously CONVERSATION was first and RECENT_WORK wasn't on
         // the list at all, so agent work history never benefited from budget slack.
@@ -104,7 +104,6 @@ data class ContextBudget(
             ContextSection.WORKING_MEMORY,
             ContextSection.CONVERSATION,
             ContextSection.USER_CONTEXT,
-            ContextSection.RAG_FRAGMENTS
         )
 
         val redistributed = sectionBudgets.toMutableMap()
@@ -129,7 +128,6 @@ data class ContextBudget(
             ContextSection.PROJECT_CONTEXT to 1500,
             ContextSection.RECENT_WORK to 8000,
             ContextSection.USER_CONTEXT to 5000,
-            ContextSection.RAG_FRAGMENTS to 1000,
             ContextSection.CONVERSATION to 14000,
             ContextSection.REFERENCE to 2500
         )
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/services/context/ContextFormatter.kt b/core/src/main/kotlin/pl/jclab/refio/core/services/context/ContextFormatter.kt
index 87528be4..4190c532 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/services/context/ContextFormatter.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/services/context/ContextFormatter.kt
@@ -1,7 +1,6 @@
 package pl.jclab.refio.core.services.context
 
 import pl.jclab.refio.core.config.ConfigKeys
-import pl.jclab.refio.core.models.context.CodeFragmentDTO
 import pl.jclab.refio.core.models.context.ExecutedStepDTO
 import pl.jclab.refio.core.models.context.ProjectContextDTO
 import pl.jclab.refio.core.services.ConfigService
@@ -10,7 +9,6 @@ import java.nio.file.Path
 
 private val logger = dualLogger("ContextFormatter")
 
-private const val MAX_RAG_FRAGMENTS = 15
 private const val CONVERSATION_SUMMARY_METADATA_TYPE = ConversationContextBuilder.CONVERSATION_SUMMARY_METADATA_TYPE
 
 // RECENT_WORK limits
@@ -206,64 +204,6 @@ class ContextFormatter(
         return parts.joinToString("\n")
     }
 
-    /**
-     * Build RAG fragments section with metadata.
-     */
-    fun buildRagFragmentsSection(context: ProjectContextDTO): String {
-        val parts = mutableListOf<String>()
-        parts.add("<RAG_FRAGMENTS>")
-
-        context.ragFragments.take(MAX_RAG_FRAGMENTS).forEach { fragment ->
-            // Enrich fragment with metadata
-            val metadata = enrichFragmentWithMetadata(fragment)
-
-            parts.add("")
-
-            // Build fragment header with metadata
-            val attrs = buildList {
-                add("file=\"${fragment.filePath}\"")
-
-                if (fragment.startLine != null && fragment.endLine != null) {
-                    add("lines=\"${fragment.startLine}-${fragment.endLine}\"")
-                }
-
-                metadata["language"]?.let { add("lang=\"$it\"") }
-                metadata["fileSize"]?.let { add("size=\"$it\"") }
-                add("similarity=\"${String.format("%.2f", fragment.similarity)}\"")
-                metadata["complexity"]?.let { add("complexity=\"$it\"") }
-            }.joinToString(" ")
-
-            parts.add("<fragment $attrs>")
-
-            // Content with language hint
-            val lang = metadata["language"] as? String ?: ""
-            val langHint = when (lang.lowercase()) {
-                "kotlin" -> "kotlin"
-                "java" -> "java"
-                "python" -> "python"
-                "javascript", "typescript" -> "javascript"
-                "html" -> "html"
-                "css" -> "css"
-                "json" -> "json"
-                "yaml" -> "yaml"
-                else -> ""
-            }
-
-            if (langHint.isNotEmpty()) {
-                parts.add("```$langHint")
-            } else {
-                parts.add("```")
-            }
-
-            parts.add(fragment.content.trim())
-            parts.add("```")
-            parts.add("</fragment>")
-        }
-
-        parts.add("</RAG_FRAGMENTS>")
-        return parts.joinToString("\n")
-    }
-
     fun buildCurrentTaskSection(context: ProjectContextDTO): String {
         val task = context.currentTask ?: return "<CURRENT_TASK>\nNo task information available\n</CURRENT_TASK>"
 
@@ -874,42 +814,6 @@ class ContextFormatter(
         return parts.joinToString("\n")
     }
 
-    /**
-     * Enrich code fragment with metadata (language, file size, complexity).
-     */
-    fun enrichFragmentWithMetadata(fragment: CodeFragmentDTO): Map<String, Any> {
-        val metadata = mutableMapOf<String, Any>()
-
-        // File size and line count
-        try {
-            val path = Path.of(fragment.filePath)
-            if (java.nio.file.Files.exists(path)) {
-                val size = java.nio.file.Files.size(path)
-                metadata["fileSize"] = when {
-                    size < 1024 -> "${size}B"
-                    size < 1024 * 1024 -> "${size / 1024}KB"
-                    else -> "${size / (1024 * 1024)}MB"
-                }
-
-                val lines = java.nio.file.Files.readAllLines(path).size
-                metadata["lineCount"] = lines
-
-                val modified = java.nio.file.Files.getLastModifiedTime(path).toInstant()
-                metadata["lastModified"] = modified.toString().take(10)
-            }
-        } catch (e: Exception) {
-            // Ignore file metadata errors
-        }
-
-        // Language detection
-        metadata["language"] = detectLanguage(fragment.filePath)
-
-        // Complexity estimation
-        metadata["complexity"] = estimateComplexity(fragment.content)
-
-        return metadata
-    }
-
     // ===========================
     // Private helper methods
     // ===========================
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/services/context/ContextReferenceResolver.kt b/core/src/main/kotlin/pl/jclab/refio/core/services/context/ContextReferenceResolver.kt
index b4e8677c..723e5bfc 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/services/context/ContextReferenceResolver.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/services/context/ContextReferenceResolver.kt
@@ -27,7 +27,12 @@ private val logger = dualLogger("ContextReferenceResolver")
 class ContextReferenceResolver(
     private val fileAnalyzerService: FileAnalyzerService? = null,
     private val configService: ConfigService,
-    private val chatMessageRepository: ChatMessageRepository
+    private val chatMessageRepository: ChatMessageRepository,
+    /**
+     * Opaque platform project handle (IntelliJ Project or null for CLI). Passed
+     * down to IDE-specific context providers via [ContextProviderExtras].
+     */
+    private val platformProject: Any? = null,
 ) {
 
     /**
@@ -36,16 +41,14 @@ class ContextReferenceResolver(
      *
      * @param refs Raw user context references from PromptInputPanel
      * @param projectRoot Project root path
-     * @param project IntelliJ Project instance (optional)
      * @return List of ResolvedContextDTO ready for LLM
      */
     internal suspend fun resolveAndConvertUserContextRefs(
         refs: List<ContextReference>,
         projectRoot: Path,
-        project: Any?,
         currentQuery: String?
     ): List<ResolvedContextDTO> {
-        val resolved = resolveUserContextReferences(refs, projectRoot, project, currentQuery)
+        val resolved = resolveUserContextReferences(refs, projectRoot, currentQuery)
 
         return resolved.mapNotNull { ref ->
             val content = ref.content
@@ -86,8 +89,6 @@ class ContextReferenceResolver(
      * Resolve user-provided context references (@file, @folder, @selection, etc.)
      * using ContextProviderRegistry as SINGLE SOURCE OF TRUTH.
      *
-     * INTERNAL: This is now private. Use buildProjectContext() with userContextRefs parameter.
-     *
      * Flow:
      * 1. For each ContextReference, determine provider ID
      * 2. Get provider from ContextProviderRegistry
@@ -96,13 +97,11 @@ class ContextReferenceResolver(
      *
      * @param refs User context references from PromptInputPanel
      * @param projectRoot Project root path for PathSandbox validation
-     * @param project IntelliJ Project instance (nullable for core-only usage)
      * @return List of resolved references with loaded content
      */
     private suspend fun resolveUserContextReferences(
         refs: List<ContextReference>,
         projectRoot: Path,
-        project: Any? = null,
         currentQuery: String? = null
     ): List<ContextReference> = withContext(Dispatchers.IO) {
         logger.info { "[CONTEXT] Resolving ${refs.size} user context reference(s)" }
@@ -113,35 +112,32 @@ class ContextReferenceResolver(
             try {
                 when (ref.type) {
                     ContextType.PROVIDER -> {
-                        // Modern flow: direct provider reference
-                        resolveProviderReference(ref, projectRoot, project, pathSandbox, currentQuery)
+                        resolveProviderReference(ref, projectRoot, pathSandbox, currentQuery)
                     }
-                    // Legacy types: map to providers for backwards compatibility
                     ContextType.FILE -> {
-                        resolveLegacyFileReference(ref, projectRoot, project, pathSandbox)
+                        resolveLegacyFileReference(ref, projectRoot, pathSandbox)
                     }
 
                     ContextType.FOLDER -> {
-                        resolveLegacyFolderReference(ref, projectRoot, project, pathSandbox)
+                        resolveLegacyFolderReference(ref, projectRoot, pathSandbox)
                     }
 
                     ContextType.SELECTION -> {
-                        // Selection already has content, just validate size
                         ref.copy(
                             estimatedTokens = (ref.content?.length ?: 0) / 4
                         )
                     }
 
                     ContextType.OPEN -> {
-                        resolveLegacyOpenReference(ref, projectRoot, project, pathSandbox)
+                        resolveLegacyOpenReference(ref, projectRoot, pathSandbox)
                     }
 
                     ContextType.RULES -> {
-                        resolveLegacyRulesReference(ref, projectRoot, project, pathSandbox)
+                        resolveLegacyRulesReference(ref, projectRoot, pathSandbox)
                     }
 
                     ContextType.DOCS -> {
-                        resolveDocsReference(ref, projectRoot, project, currentQuery)
+                        resolveDocsReference(ref, projectRoot, currentQuery)
                     }
                 }
             } catch (e: Exception) {
@@ -160,7 +156,6 @@ class ContextReferenceResolver(
     private suspend fun resolveProviderReference(
         ref: ContextReference,
         projectRoot: Path,
-        project: Any?,
         pathSandbox: PathSandbox,
         currentQuery: String?
     ): ContextReference {
@@ -191,7 +186,7 @@ class ContextReferenceResolver(
         }
 
         val extras = ContextProviderExtras(
-            project = project,
+            project = platformProject,
             fullInput = fullInput,
             workspacePath = projectRoot.toString()
         )
@@ -250,7 +245,6 @@ class ContextReferenceResolver(
     private suspend fun resolveDocsReference(
         ref: ContextReference,
         projectRoot: Path,
-        project: Any?,
         currentQuery: String?
     ): ContextReference {
         val provider = ContextProviderRegistry.getProvider("docs")
@@ -263,7 +257,7 @@ class ContextReferenceResolver(
         }
 
         val extras = ContextProviderExtras(
-            project = project,
+            project = platformProject,
             fullInput = currentQuery ?: ref.path,
             workspacePath = projectRoot.toString()
         )
@@ -300,7 +294,6 @@ class ContextReferenceResolver(
     private suspend fun resolveLegacyFileReference(
         ref: ContextReference,
         projectRoot: Path,
-        project: Any?,
         pathSandbox: PathSandbox
     ): ContextReference {
         logger.debug { "[CONTEXT] Resolving legacy FILE reference: ${ref.path}" }
@@ -315,7 +308,7 @@ class ContextReferenceResolver(
         }
 
         val extras = ContextProviderExtras(
-            project = project,
+            project = platformProject,
             fullInput = ref.path,
             workspacePath = projectRoot.toString()
         )
@@ -362,7 +355,6 @@ class ContextReferenceResolver(
     private suspend fun resolveLegacyFolderReference(
         ref: ContextReference,
         projectRoot: Path,
-        project: Any?,
         pathSandbox: PathSandbox
     ): ContextReference {
         logger.debug { "[CONTEXT] Resolving legacy FOLDER reference: ${ref.path}" }
@@ -377,7 +369,7 @@ class ContextReferenceResolver(
         }
 
         val extras = ContextProviderExtras(
-            project = project,
+            project = platformProject,
             fullInput = ref.path,
             workspacePath = projectRoot.toString()
         )
@@ -409,7 +401,6 @@ class ContextReferenceResolver(
     private suspend fun resolveLegacyOpenReference(
         ref: ContextReference,
         projectRoot: Path,
-        project: Any?,
         pathSandbox: PathSandbox
     ): ContextReference {
         logger.debug { "[CONTEXT] Resolving legacy OPEN reference" }
@@ -424,7 +415,7 @@ class ContextReferenceResolver(
         }
 
         val extras = ContextProviderExtras(
-            project = project,
+            project = platformProject,
             fullInput = "",
             workspacePath = projectRoot.toString()
         )
@@ -465,7 +456,6 @@ class ContextReferenceResolver(
     private suspend fun resolveLegacyRulesReference(
         ref: ContextReference,
         projectRoot: Path,
-        project: Any?,
         pathSandbox: PathSandbox
     ): ContextReference {
         logger.debug { "[CONTEXT] Resolving legacy RULES reference" }
@@ -487,7 +477,7 @@ class ContextReferenceResolver(
         }
 
         val extras = ContextProviderExtras(
-            project = project,
+            project = platformProject,
             fullInput = rulesPath,
             workspacePath = projectRoot.toString()
         )
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/services/context/ContextSection.kt b/core/src/main/kotlin/pl/jclab/refio/core/services/context/ContextSection.kt
index c1d221dd..42226a2e 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/services/context/ContextSection.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/services/context/ContextSection.kt
@@ -25,7 +25,6 @@ enum class ContextSection(val defaultPriority: ContextPriority) {
     PROJECT_INSTRUCTIONS(ContextPriority.HIGH),
     RECENT_WORK(ContextPriority.NORMAL),
     USER_CONTEXT(ContextPriority.HIGH),
-    RAG_FRAGMENTS(ContextPriority.NORMAL),
     CONVERSATION(ContextPriority.NORMAL),
     REFERENCE(ContextPriority.LOW);
 
@@ -33,13 +32,13 @@ enum class ContextSection(val defaultPriority: ContextPriority) {
      * Context layer classification for caching and incremental building.
      * - STABLE: project info, conventions, key files — cached, invalidated on project file change
      * - ACCUMULATED: working memory, modified files — grows across turns
-     * - EPHEMERAL: current query, RAG, user refs — rebuilt every turn
+     * - EPHEMERAL: current query, user refs — rebuilt every turn
      */
     val contextLayer: ContextLayer
         get() = when (this) {
             SYSTEM_PROMPT, TOOL_DESCRIPTIONS, PROJECT_CONTEXT, PROJECT_INSTRUCTIONS, REFERENCE -> ContextLayer.STABLE
             WORKING_MEMORY, RECENT_WORK, AGENT_PLANS -> ContextLayer.ACCUMULATED
-            USER_CONTEXT, RAG_FRAGMENTS, CONVERSATION -> ContextLayer.EPHEMERAL
+            USER_CONTEXT, CONVERSATION -> ContextLayer.EPHEMERAL
         }
 }
 
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/services/context/RagContextLoader.kt b/core/src/main/kotlin/pl/jclab/refio/core/services/context/RagContextLoader.kt
deleted file mode 100644
index 2cd137b4..00000000
--- a/core/src/main/kotlin/pl/jclab/refio/core/services/context/RagContextLoader.kt
+++ /dev/null
@@ -1,207 +0,0 @@
-package pl.jclab.refio.core.services.context
-
-import pl.jclab.refio.core.config.ConfigKeys
-import pl.jclab.refio.core.models.context.CodeFragmentDTO
-import pl.jclab.refio.core.services.ConfigService
-import pl.jclab.refio.core.services.EmbeddingCircuitBreaker
-import pl.jclab.refio.core.services.RagSearchService
-import pl.jclab.refio.core.services.rag.RagSearchConfig
-import pl.jclab.refio.core.logging.dualLogger
-import java.nio.file.Path
-
-private val logger = dualLogger("RagContextLoader")
-// Hard cap on auto-injected RAG fragments. The actual count is taken from
-// ConfigKeys.RAG_SEARCH_TOP_K (default 5) but we never exceed this even if a user sets a
-// higher value, because auto-RAG bloats the prompt at turn start before the agent has
-// any signal about which fragments are actually relevant.
-private const val MAX_AUTO_RAG_FRAGMENTS = 8
-
-class RagContextLoader(
-    private val configService: ConfigService,
-    ragSearchService: RagSearchService? = null,
-    ragSearchModel: String? = null,
-    ragSearchProvider: String? = null
-) {
-    @Volatile
-    private var ragSearchServiceRef: RagSearchService? = ragSearchService
-
-    @Volatile
-    private var ragSearchModelRef: String? = ragSearchModel
-
-    @Volatile
-    private var ragSearchProviderRef: String? = ragSearchProvider
-
-    fun updateRagSearchConfig(service: RagSearchService?, model: String?, provider: String?) {
-        ragSearchServiceRef = service
-        ragSearchModelRef = model
-        ragSearchProviderRef = provider
-    }
-
-    /**
-     * Determines if RAG should be skipped for simple/meta questions.
-     * Based on ADR 0017: Refaktoryzacja Context Service.
-     *
-     * @param query User query
-     * @return true if RAG should be skipped, false otherwise
-     */
-    internal fun shouldSkipRag(query: String?): Boolean {
-        if (query.isNullOrBlank()) return true
-
-        val queryLower = query.lowercase().trim()
-
-        // System-injected harness phrases (retry / continue / nudge follow-ups). Embedding these
-        // wastes 8-20s per iteration and never returns useful fragments because they carry no
-        // task-specific signal.
-        val systemPhrases = listOf(
-            "continue from where you left off",
-            "continue where you left off",
-            "kontynuuj od miejsca",
-            "kontynuuj zadanie"
-        )
-        if (systemPhrases.any { queryLower.contains(it) }) {
-            logger.info { "[CONTEXT] Skipping RAG - system harness phrase: ${query.take(80)}" }
-            return true
-        }
-
-        // Meta questions that don't need code context
-        val metaPatterns = listOf(
-            "co wiesz", "what do you know",
-            "opisz projekt", "describe project", "describe the project",
-            "jaki to projekt", "what project", "what is this project",
-            "podsumuj", "summarize",
-            "struktura", "structure", "project structure",
-            "technologie", "technologies", "tech stack",
-            "architektura", "architecture",
-            "co to za projekt", "what kind of project"
-        )
-
-        if (metaPatterns.any { queryLower.contains(it) }) {
-            logger.info { "[CONTEXT] Skipping RAG - meta question detected: ${query.take(100)}" }
-            return true
-        }
-
-        // Short questions are usually meta (unless they mention code/file)
-        if (query.length < 30 &&
-            !queryLower.contains("kod") &&
-            !queryLower.contains("code") &&
-            !queryLower.contains("file") &&
-            !queryLower.contains("plik") &&
-            !queryLower.contains("function") &&
-            !queryLower.contains("funkcja") &&
-            !queryLower.contains("class") &&
-            !queryLower.contains("klasa")
-        ) {
-            logger.info { "[CONTEXT] Skipping RAG - short meta question: ${query}" }
-            return true
-        }
-
-        return false
-    }
-
-    suspend fun loadRagFragments(
-        projectRoot: Path,
-        query: String?,
-    ): List<CodeFragmentDTO> {
-        val searchService = ragSearchServiceRef ?: run {
-            logger.info { "[CONTEXT] RAG search service not configured - skipping fragments" }
-            return emptyList()
-        }
-        val model = ragSearchModelRef ?: run {
-            logger.warn { "[CONTEXT] RAG search model is not configured - skipping fragments" }
-            return emptyList()
-        }
-        if (ragSearchProviderRef.equals("ollama", ignoreCase = true)) {
-            val endpoint = configService.getTyped(ConfigKeys.PROVIDER_OLLAMA_ENDPOINT)
-            val providerKey = "ollama:$endpoint"
-            if (EmbeddingCircuitBreaker.getState(providerKey) == "OPEN") {
-                val retryMs = EmbeddingCircuitBreaker.getCooldownRemaining(providerKey)
-                logger.warn { "[CONTEXT] Skipping RAG fragments - Ollama unavailable (circuit OPEN, retry in ${retryMs}ms, endpoint=$endpoint)" }
-                return emptyList()
-            }
-        }
-
-        // Skip RAG for simple/meta questions (ADR 0017)
-        if (shouldSkipRag(query)) {
-            return emptyList()
-        }
-
-        val queryParts = listOfNotNull(query?.trim())
-            .filter { it.isNotBlank() }
-        if (queryParts.isEmpty()) {
-            logger.info { "[CONTEXT] No RAG query data provided - skipping fragments" }
-            return emptyList()
-        }
-
-        val combinedQuery = queryParts.joinToString("\n\n")
-        val keywords = extractRagKeywords(queryParts)
-        val hybridEnabled = configService.getTyped(ConfigKeys.RAG_SEARCH_HYBRID_ENABLED)
-
-        return try {
-            logger.info {
-                "[CONTEXT] Running hybrid RAG search: query='${combinedQuery.take(120)}...', keywords=$keywords"
-            }
-            // topK comes from config (default 5), capped at MAX_AUTO_RAG_FRAGMENTS so an
-            // unintended high config value cannot pollute the startup prompt with noise.
-            val configuredTopK = configService.getTyped(ConfigKeys.RAG_SEARCH_TOP_K)
-            val effectiveTopK = configuredTopK.coerceIn(1, MAX_AUTO_RAG_FRAGMENTS)
-            val config = RagSearchConfig(
-                similarityThreshold = configService.getTyped(ConfigKeys.RAG_SEARCH_SIMILARITY_THRESHOLD),
-                topK = effectiveTopK,
-                hybridSearch = hybridEnabled,
-                keywords = keywords,
-                semanticWeight = configService.getTyped(ConfigKeys.RAG_SEARCH_SEMANTIC_WEIGHT),
-                includeContextChunks = configService.getTyped(ConfigKeys.RAG_SEARCH_INCLUDE_CONTEXT_CHUNKS)
-            )
-            val results = searchService.search(
-                projectRoot = projectRoot.toString(),
-                query = combinedQuery,
-                model = model,
-                config = config
-            )
-
-            val out = results.map { result ->
-                CodeFragmentDTO(
-                    filePath = result.filePath,
-                    content = result.content,
-                    startLine = result.startLine,
-                    endLine = result.endLine,
-                    similarity = result.similarity,
-                    contentType = result.contentType.name
-                )
-            }
-
-            logger.info { "Found the: ${out.size} fragments" }
-
-            out
-        } catch (e: Exception) {
-            logger.warn(e) { "[CONTEXT] Hybrid RAG search failed (${e.message})" }
-            emptyList()
-        }
-    }
-
-    internal fun extractRagKeywords(parts: List<String>): List<String> {
-        if (parts.isEmpty()) {
-            return emptyList()
-        }
-
-        val tokens = parts
-            .flatMap { it.split(Regex("[^A-Za-z0-9_/\\\\-]+")) }
-            .map { it.trim().lowercase() }
-            .filter { it.length >= 4 }
-            .map { it.take(64) }
-
-        // Prioritize unique keywords while preserving order
-        val seen = mutableSetOf<String>()
-        val keywords = mutableListOf<String>()
-        for (token in tokens) {
-            if (seen.add(token)) {
-                keywords.add(token)
-            }
-            if (keywords.size >= 12) {
-                break
-            }
-        }
-
-        return keywords
-    }
-}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/services/context/TaskContextExtractor.kt b/core/src/main/kotlin/pl/jclab/refio/core/services/context/TaskContextExtractor.kt
index 4183b3d8..c2363d83 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/services/context/TaskContextExtractor.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/services/context/TaskContextExtractor.kt
@@ -57,71 +57,47 @@ internal class TaskContextExtractor {
     }
 
     /**
-     * Build previous subtasks data for context.
-     * Returns pair of (subtask summaries, completed file paths).
-     * Based on Python context_service.py lines 1087-1105
+     * Extract distinct file paths referenced by completed subtasks
+     * (looks at tool args in `paramsJson` and `stepPlanJson`).
      */
-    fun buildPreviousSubtasksData(
+    fun buildCompletedFiles(
         subtasks: List<Subtask>,
         limit: Int = 10
-    ): Pair<List<String>, List<String>> {
+    ): List<String> {
         val completedFiles = mutableSetOf<String>()
-        val previousSubtasks = mutableSetOf<String>()
-
         val completed = subtasks.filter { it.status == TaskStatus.SUCCESS }
         val gson = GsonInstance.gson
+        val pathKeys = listOf("path", "file_path", "file", "target", "source", "files")
+
+        fun extractFromMap(map: Map<*, *>?) {
+            if (map == null) return
+            for (key in pathKeys) {
+                when (val value = map[key]) {
+                    is String -> completedFiles.add(value)
+                    is List<*> -> value.filterIsInstance<String>().forEach { completedFiles.add(it) }
+                }
+            }
+        }
 
         for (prevSubtask in completed.takeLast(limit)) {
-            val summary = prevSubtask.result ?: "No summary available."
-            previousSubtasks.add("- ${prevSubtask.description}: $summary")
-
-            // Extract file paths from tool arguments (try multiple field names)
-            val filePaths = mutableSetOf<String>()
-
-            // Try paramsJson first
             prevSubtask.paramsJson?.let { json ->
                 try {
-                    val params = gson.fromJson(json, Map::class.java)
-                    // Try various common field names for file paths
-                    val possibleKeys = listOf("path", "file_path", "file", "target", "source", "files")
-                    for (key in possibleKeys) {
-                        when (val value = params?.get(key)) {
-                            is String -> filePaths.add(value)
-                            is List<*> -> value.filterIsInstance<String>().forEach { filePaths.add(it) }
-                        }
-                    }
-                } catch (e: Exception) {
-                    // Ignore parse errors
+                    extractFromMap(gson.fromJson(json, Map::class.java))
+                } catch (_: Exception) {
                 }
             }
-
-            // Also try stepPlanJson (might contain file references)
             prevSubtask.stepPlanJson?.let { json ->
                 try {
                     val plan = gson.fromJson(json, Map::class.java)
-                    // Look for files in tool_calls
                     @Suppress("UNCHECKED_CAST")
-                    val toolCalls = plan?.get("tool_calls") as? List<Map<*, *>>
-                    toolCalls?.forEach { call ->
-                        @Suppress("UNCHECKED_CAST")
-                        val args = call["args"] as? Map<*, *>
-                        val possibleKeys = listOf("path", "file_path", "file", "target", "source")
-                        for (key in possibleKeys) {
-                            when (val value = args?.get(key)) {
-                                is String -> filePaths.add(value)
-                                is List<*> -> value.filterIsInstance<String>().forEach { filePaths.add(it) }
-                            }
-                        }
+                    (plan?.get("tool_calls") as? List<Map<*, *>>)?.forEach { call ->
+                        extractFromMap(call["args"] as? Map<*, *>)
                     }
-                } catch (e: Exception) {
-                    // Ignore parse errors
+                } catch (_: Exception) {
                 }
             }
-
-            completedFiles.addAll(filePaths)
         }
-
-        return Pair(previousSubtasks.toList(), completedFiles.toList())
+        return completedFiles.toList()
     }
 
     /**
@@ -182,6 +158,7 @@ internal class TaskContextExtractor {
                                     tool = tool,
                                     parameters = paramsMap,
                                     result = resultText,
+                                    rawResultSize = resultText.length,
                                     summary = summary,
                                     timestamp = Instant.ofEpochMilli(timestamp),
                                     success = prevSubtask.status == TaskStatus.SUCCESS
@@ -213,9 +190,11 @@ internal class TaskContextExtractor {
                 val timestamp = prevSubtask.completedAt ?: prevSubtask.updatedAt
                 val summary = prevSubtask.summary ?: "Completed: ${prevSubtask.kind.name}"
 
-                // Don't truncate - use full rawResult, but limit to reasonable size for display
-                val resultText = if (rawResult.length > 16384) {
-                    rawResult.take(16384) + "\n\n... [${rawResult.length - 16384} chars omitted] ..."
+                // Keep raw data up to 512KB — let RECENT_WORK budget-driven compression decide.
+                // Only truncate truly huge outputs to prevent memory pressure.
+                val maxRawResultChars = 524_288 // 512KB
+                val resultText = if (rawResult.length > maxRawResultChars) {
+                    rawResult.take(maxRawResultChars) + "\n... [truncated from ${rawResult.length} chars]"
                 } else {
                     rawResult
                 }
@@ -227,6 +206,7 @@ internal class TaskContextExtractor {
                         tool = prevSubtask.kind.name,
                         parameters = fallbackParams,
                         result = resultText,
+                        rawResultSize = rawResult.length,
                         summary = summary,
                         timestamp = Instant.ofEpochMilli(timestamp),
                         success = prevSubtask.status == TaskStatus.SUCCESS
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/services/context/WorkingMemoryService.kt b/core/src/main/kotlin/pl/jclab/refio/core/services/context/WorkingMemoryService.kt
index 7ada812c..51975466 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/services/context/WorkingMemoryService.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/services/context/WorkingMemoryService.kt
@@ -1,6 +1,6 @@
 package pl.jclab.refio.core.services.context
 
-import pl.jclab.refio.core.services.ConfigService
+import pl.jclab.refio.core.config.ConfigKeys
 import pl.jclab.refio.core.services.analysis.CodeElements
 import java.time.Instant
 import java.util.concurrent.ConcurrentHashMap
@@ -24,7 +24,7 @@ data class WorkingMemoryEntry(
 )
 
 class WorkingMemoryService(
-    private val maxEntriesPerTask: Int = ConfigService.DEFAULT_WORKING_MEMORY_MAX_FACTS
+    private val maxEntriesPerTask: Int = ConfigKeys.WORKING_MEMORY_MAX_FACTS.default
 ) {
     private val entriesByTask = ConcurrentHashMap<String, ConcurrentHashMap<String, WorkingMemoryEntry>>()
     private val entriesBySession = ConcurrentHashMap<String, ConcurrentHashMap<String, WorkingMemoryEntry>>()
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/services/turn/ToolCallParser.kt b/core/src/main/kotlin/pl/jclab/refio/core/services/turn/ToolCallParser.kt
index 232af107..810202c2 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/services/turn/ToolCallParser.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/services/turn/ToolCallParser.kt
@@ -36,6 +36,14 @@ class ToolCallParser(
     private val json = Json { ignoreUnknownKeys = true; prettyPrint = false }
     private val toolArgumentKeys = listOf("arguments", "args", "tool_args", "toolArgs", "parameters", "params")
 
+    data class JsonEnvelopeInspection(
+        val hasJsonEnvelope: Boolean,
+        val isComplete: Boolean,
+        val isFenced: Boolean,
+        val firstBraceIndex: Int,
+        val closingBraceIndex: Int
+    )
+
     /**
      * Main entry point - parses LLM content into ToolCallData list.
      */
@@ -187,6 +195,51 @@ class ToolCallParser(
         }
     }
 
+    fun inspectJsonEnvelope(content: String): JsonEnvelopeInspection {
+        val trimmed = content.trim()
+        if (trimmed.isBlank()) {
+            return JsonEnvelopeInspection(
+                hasJsonEnvelope = false,
+                isComplete = false,
+                isFenced = false,
+                firstBraceIndex = -1,
+                closingBraceIndex = -1
+            )
+        }
+
+        val isFenced = Regex("""^```(?:json)?\s*""", RegexOption.IGNORE_CASE).containsMatchIn(trimmed)
+        val firstBraceIndex = trimmed.indexOf('{')
+        if (firstBraceIndex == -1) {
+            return JsonEnvelopeInspection(
+                hasJsonEnvelope = false,
+                isComplete = false,
+                isFenced = isFenced,
+                firstBraceIndex = -1,
+                closingBraceIndex = -1
+            )
+        }
+
+        var closingBraceIndex = findMatchingBrace(trimmed, firstBraceIndex)
+        val recoveredFromClosedFence = if (closingBraceIndex == -1 && isFenced) {
+            val fencedBody = extractClosedFencedJsonBody(trimmed)
+            if (fencedBody != null && canParseJsonCandidate(fencedBody)) {
+                closingBraceIndex = trimmed.lastIndexOf('}')
+                true
+            } else {
+                false
+            }
+        } else {
+            false
+        }
+        return JsonEnvelopeInspection(
+            hasJsonEnvelope = true,
+            isComplete = closingBraceIndex != -1 || recoveredFromClosedFence,
+            isFenced = isFenced,
+            firstBraceIndex = firstBraceIndex,
+            closingBraceIndex = closingBraceIndex
+        )
+    }
+
     // ===== Private methods =====
 
     private fun extractToolCallsFromJson(content: String, mode: TaskMode): List<ToolCallData> {
@@ -382,6 +435,26 @@ class ToolCallParser(
 
     private fun extractJsonWithStrategies(content: String): String? {
         val trimmed = content.trim()
+        val closedFencedBody = extractClosedFencedJsonBody(trimmed)
+
+        if (closedFencedBody != null) {
+            try {
+                json.parseToJsonElement(closedFencedBody)
+                logger.info { "[EXTRACT_JSON] Strategy 0: Extracted valid JSON from fenced block body" }
+                return closedFencedBody
+            } catch (e: Exception) {
+                logger.debug { "[EXTRACT_JSON] Strategy 0 failed: ${e.message}" }
+            }
+
+            val repairedFencedBody = TurnJsonUtils.repairMalformedJson(closedFencedBody)
+            try {
+                json.parseToJsonElement(repairedFencedBody)
+                logger.info { "[EXTRACT_JSON] Strategy 0b: Repaired malformed fenced JSON body" }
+                return repairedFencedBody
+            } catch (e: Exception) {
+                logger.debug { "[EXTRACT_JSON] Strategy 0b failed: ${e.message}" }
+            }
+        }
 
         // Strategy 1: Try parsing entire content as JSON directly
         if (trimmed.startsWith("{")) {
@@ -407,11 +480,20 @@ class ToolCallParser(
                 } catch (e: Exception) {
                     logger.debug { "[EXTRACT_JSON] Strategy 2 extraction invalid: ${e.message}" }
                 }
+            } else {
+                val repairedCandidate = TurnJsonUtils.repairMalformedJson(trimmed.substring(firstBraceIndex))
+                try {
+                    json.parseToJsonElement(repairedCandidate)
+                    logger.info { "[EXTRACT_JSON] Strategy 2b: Repaired unmatched-brace JSON fragment" }
+                    return repairedCandidate
+                } catch (e: Exception) {
+                    logger.debug { "[EXTRACT_JSON] Strategy 2b repair failed: ${e.message}" }
+                }
             }
         }
 
         // Strategy 3: JSON in code block ```json ... ```
-        val codeBlockStartPattern = Regex("""```(?:json)?\s*\n""")
+        val codeBlockStartPattern = Regex("""```(?:json)?\s*\n""", RegexOption.IGNORE_CASE)
         val codeBlockMatch = codeBlockStartPattern.find(trimmed)
         if (codeBlockMatch != null) {
             val afterFence = codeBlockMatch.range.last + 1
@@ -430,6 +512,15 @@ class ToolCallParser(
                             logger.debug { "[EXTRACT_JSON] Strategy 3 extraction invalid: ${e.message}" }
                         }
                     }
+                } else {
+                    val repairedCandidate = TurnJsonUtils.repairMalformedJson(trimmed.substring(jsonStartInBlock))
+                    try {
+                        json.parseToJsonElement(repairedCandidate)
+                        logger.info { "[EXTRACT_JSON] Strategy 3b: Repaired incomplete fenced JSON block" }
+                        return repairedCandidate
+                    } catch (e: Exception) {
+                        logger.debug { "[EXTRACT_JSON] Strategy 3b repair failed: ${e.message}" }
+                    }
                 }
             }
         }
@@ -472,6 +563,30 @@ class ToolCallParser(
         return null
     }
 
+    private fun extractClosedFencedJsonBody(content: String): String? {
+        val startMatch = Regex("""^```(?:json)?\s*""", RegexOption.IGNORE_CASE).find(content) ?: return null
+        val bodyStart = startMatch.range.last + 1
+        val lastFenceIndex = content.lastIndexOf("```")
+        if (lastFenceIndex <= bodyStart) {
+            return null
+        }
+        return content.substring(bodyStart, lastFenceIndex).trim()
+    }
+
+    private fun canParseJsonCandidate(candidate: String): Boolean {
+        return try {
+            json.parseToJsonElement(candidate)
+            true
+        } catch (_: Exception) {
+            try {
+                json.parseToJsonElement(TurnJsonUtils.repairMalformedJson(candidate))
+                true
+            } catch (_: Exception) {
+                false
+            }
+        }
+    }
+
     private fun extractJsonFromContent(content: String): String? {
         return extractJsonWithStrategies(content)
     }
@@ -590,13 +705,28 @@ class ToolCallParser(
             if (isToolAllowedByProfile(toolCall.name, profileOverrides)) {
                 toolCall
             } else {
-                val message = toolCall.error
-                    ?: "Tool '${toolCall.name}' is not allowed for current run profile"
+                val message = toolCall.error ?: buildProfileBlockedError(toolCall.name, profileOverrides)
                 toolCall.copy(error = message)
             }
         }
     }
 
+    /** Mirror of [pl.jclab.refio.core.services.turn.TurnToolExecutor.buildProfileBlockedError]. */
+    private fun buildProfileBlockedError(
+        toolName: String,
+        profileOverrides: TurnProfileOverrides,
+    ): String {
+        val allowed = profileOverrides.allowedTools?.takeIf { it.isNotEmpty() }
+        val disallowed = profileOverrides.disallowedTools?.takeIf { it.isNotEmpty() }
+        val scope = profileOverrides.subagentName?.let { "subagent '$it'" } ?: "current run profile"
+        val details = when {
+            allowed != null -> "Your available tools are: ${allowed.joinToString(", ")}. Pick one of these or produce a final response."
+            disallowed != null -> "This tool is on the blocklist for this profile (${disallowed.joinToString(", ")}). Use a different approach."
+            else -> "Check the <available_tools> section and use only tools listed there."
+        }
+        return "Tool '$toolName' is not available to the $scope. $details"
+    }
+
     private fun isToolAllowedByProfile(toolName: String, profileOverrides: TurnProfileOverrides): Boolean {
         val normalizedName = toolName.lowercase()
         val allowed = profileOverrides.allowedTools?.map { it.lowercase() }?.toSet()
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/services/turn/TurnEventListener.kt b/core/src/main/kotlin/pl/jclab/refio/core/services/turn/TurnEventListener.kt
index abcb02fd..429e91e5 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/services/turn/TurnEventListener.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/services/turn/TurnEventListener.kt
@@ -6,7 +6,7 @@ import pl.jclab.refio.core.db.ToolCallData
  * Listener for turn events (tool execution, streaming, etc.).
  * Extracted from AgentTurnLoop to avoid circular dependencies.
  */
-interface TurnEventListener {
+interface TurnEventListener : TurnCompletionListener {
     fun onTurnStarted(
         taskId: String,
         mode: pl.jclab.refio.core.db.TaskMode,
@@ -24,12 +24,4 @@ interface TurnEventListener {
     fun onStreamChunk(taskId: String, delta: String, accumulated: String) {}
 
     fun onToolBatchCompleted(taskId: String, summary: ToolBatchSummary.BatchSummary) {}
-
-    fun onTurnCompleted(
-        taskId: String,
-        result: pl.jclab.refio.core.services.TurnResult,
-        runId: String,
-        parentRunId: String?,
-        depth: Int
-    ) {}
 }
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/services/turn/TurnFinalizer.kt b/core/src/main/kotlin/pl/jclab/refio/core/services/turn/TurnFinalizer.kt
index 58fb2a9a..11d3f08a 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/services/turn/TurnFinalizer.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/services/turn/TurnFinalizer.kt
@@ -30,7 +30,9 @@ class TurnFinalizer(
         parentRunId: String?,
         depth: Int,
         persistAssistantMessage: Boolean,
-        metadata: String? = null
+        metadata: String? = null,
+        agentName: String? = null,
+        agentDepth: Int? = null,
     ): TurnResult {
         if (persistAssistantMessage) {
             val content = result.response.ifBlank {
@@ -40,7 +42,9 @@ class TurnFinalizer(
                 taskId = taskId,
                 role = MessageRole.ASSISTANT,
                 content = content,
-                metadata = metadata
+                metadata = metadata,
+                agentName = agentName,
+                agentDepth = agentDepth,
             )
         }
 
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/services/turn/TurnJsonUtils.kt b/core/src/main/kotlin/pl/jclab/refio/core/services/turn/TurnJsonUtils.kt
index 0b923bb6..03a42d77 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/services/turn/TurnJsonUtils.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/services/turn/TurnJsonUtils.kt
@@ -269,6 +269,18 @@ object TurnJsonUtils {
             i += 1
         }
 
+        if (inString) {
+            out.append('"')
+            inString = false
+        }
+
+        while (stack.isNotEmpty()) {
+            when (stack.removeLast().type) {
+                ContainerType.OBJECT -> out.append('}')
+                ContainerType.ARRAY -> out.append(']')
+            }
+        }
+
         return out.toString()
     }
 
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/services/turn/TurnLLMCaller.kt b/core/src/main/kotlin/pl/jclab/refio/core/services/turn/TurnLLMCaller.kt
index 6b269260..dac1b433 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/services/turn/TurnLLMCaller.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/services/turn/TurnLLMCaller.kt
@@ -11,6 +11,7 @@ import pl.jclab.refio.core.llm.LLMMessage
 import pl.jclab.refio.core.llm.LLMResponse
 import pl.jclab.refio.core.config.ConfigKeys
 import pl.jclab.refio.core.services.ConfigService
+import pl.jclab.refio.core.services.TurnLoopConfig
 import pl.jclab.refio.core.logging.dualLogger
 
 private val logger = dualLogger("TurnLLMCaller")
@@ -42,7 +43,7 @@ class TurnLLMCaller(
     suspend fun callLLM(
         taskId: String,
         mode: TaskMode,
-        prompt: LLMCallPrompt,
+        prompt: TurnPrompt,
         streamCallback: StreamCallback? = null,
         model: String? = null,
         provider: String? = null,
@@ -154,14 +155,6 @@ class TurnLLMCaller(
     }
 }
 
-/**
- * Prompt for LLM call.
- */
-data class LLMCallPrompt(
-    val systemPrompt: String,
-    val messages: List<LLMMessage>
-)
-
 /**
  * Selected model and provider.
  */
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/services/turn/TurnLoopConfigAliases.kt b/core/src/main/kotlin/pl/jclab/refio/core/services/turn/TurnLoopConfigAliases.kt
deleted file mode 100644
index e53e339b..00000000
--- a/core/src/main/kotlin/pl/jclab/refio/core/services/turn/TurnLoopConfigAliases.kt
+++ /dev/null
@@ -1,9 +0,0 @@
-package pl.jclab.refio.core.services.turn
-
-import pl.jclab.refio.core.services.TurnLoopConfig as CoreTurnLoopConfig
-
-/**
- * Type alias for TurnLoopConfig from core services package.
- * This allows turn/ package classes to use TurnLoopConfig without circular dependencies.
- */
-typealias TurnLoopConfig = CoreTurnLoopConfig
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/services/turn/TurnPrompt.kt b/core/src/main/kotlin/pl/jclab/refio/core/services/turn/TurnPrompt.kt
new file mode 100644
index 00000000..aac82fe4
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/services/turn/TurnPrompt.kt
@@ -0,0 +1,15 @@
+package pl.jclab.refio.core.services.turn
+
+import pl.jclab.refio.core.llm.LLMMessage
+
+/**
+ * Prompt handed from [TurnPromptBuilder] to [TurnLLMCaller] and back to the turn loop.
+ *
+ * Single shape for the whole turn path — previous revisions split this into
+ * `TurnPrompt` / `LLMCallPrompt` / `CoreTurnPrompt` plus converters; all three carried the
+ * same pair of fields. One class, no adapters.
+ */
+data class TurnPrompt(
+    val systemPrompt: String,
+    val messages: List<LLMMessage>
+)
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/services/turn/TurnPromptAliases.kt b/core/src/main/kotlin/pl/jclab/refio/core/services/turn/TurnPromptAliases.kt
deleted file mode 100644
index 309be466..00000000
--- a/core/src/main/kotlin/pl/jclab/refio/core/services/turn/TurnPromptAliases.kt
+++ /dev/null
@@ -1,28 +0,0 @@
-package pl.jclab.refio.core.services.turn
-
-import pl.jclab.refio.core.llm.LLMMessage
-import pl.jclab.refio.core.services.TurnPrompt as CoreTurnPrompt
-
-/**
- * Turn prompt data class - aligned with core services.
- */
-data class TurnPrompt(
-    val systemPrompt: String,
-    val messages: List<LLMMessage>
-)
-
-/**
- * Convert to core TurnPrompt for compatibility.
- */
-fun TurnPrompt.toCoreTurnPrompt(): CoreTurnPrompt = CoreTurnPrompt(
-    systemPrompt = systemPrompt,
-    messages = messages
-)
-
-/**
- * Convert from core TurnPrompt.
- */
-fun CoreTurnPrompt.toTurnPrompt(): TurnPrompt = TurnPrompt(
-    systemPrompt = systemPrompt,
-    messages = messages
-)
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/services/turn/TurnPromptBuilder.kt b/core/src/main/kotlin/pl/jclab/refio/core/services/turn/TurnPromptBuilder.kt
index 7d6f3ebc..974a0fcd 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/services/turn/TurnPromptBuilder.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/services/turn/TurnPromptBuilder.kt
@@ -157,7 +157,6 @@ $stickyRequirements
                 val turnResult = contextService.buildAgentTurnMessages(
                     taskId = taskId,
                     projectRoot = projectRoot,
-                    project = null,
                     userContextRefs = allContextRefs,
                     query = lastUserMessage
                 )
@@ -385,9 +384,14 @@ $filteredContextPrompt
             logger.error { "[PLAN_PROMPT] Tool descriptions are EMPTY! This will cause LLM to return error." }
         }
 
+        val toolSelectionMatrix = toolDescriptionBuilder.getToolSelectionMatrix(mode, taskId)
+
         return promptsService.getSystemPrompt(
             type = PromptType.SYSTEM_PLAN,
-            variables = mapOf("tool_descriptions" to toolDescriptions)
+            variables = mapOf(
+                "tool_descriptions" to toolDescriptions,
+                "tool_selection_matrix" to toolSelectionMatrix
+            )
         )
     }
 
@@ -421,9 +425,14 @@ $filteredContextPrompt
 
         val iterationInfo = buildIterationInfo(currentIteration, maxIterations, writeToolsExecutedInTurn)
 
+        val toolSelectionMatrix = toolDescriptionBuilder.getToolSelectionMatrix(mode, taskId)
+
         val basePrompt = promptsService.getSystemPrompt(
             type = PromptType.SYSTEM_AGENT,
-            variables = mapOf("tool_descriptions" to toolDescriptions)
+            variables = mapOf(
+                "tool_descriptions" to toolDescriptions,
+                "tool_selection_matrix" to toolSelectionMatrix
+            )
         )
 
         return if (iterationInfo.isNotEmpty()) {
@@ -617,12 +626,6 @@ ${warning}
             result = result.replace(Regex("<project_structure>.*?</project_structure>", RegexOption.DOT_MATCHES_ALL), "")
         }
 
-        if (!profile.includeRag) {
-            // Remove RAG fragment sections
-            result = result.replace(Regex("<rag_fragments>.*?</rag_fragments>", RegexOption.DOT_MATCHES_ALL), "")
-            result = result.replace(Regex("<rag_context>.*?</rag_context>", RegexOption.DOT_MATCHES_ALL), "")
-        }
-
         if (!profile.includeDependencies) {
             // Remove dependency sections
             result = result.replace(Regex("<dependencies>.*?</dependencies>", RegexOption.DOT_MATCHES_ALL), "")
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/services/turn/TurnToolExecutor.kt b/core/src/main/kotlin/pl/jclab/refio/core/services/turn/TurnToolExecutor.kt
index 8f7cfa9d..6d6e695e 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/services/turn/TurnToolExecutor.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/services/turn/TurnToolExecutor.kt
@@ -69,6 +69,9 @@ class TurnToolExecutor(
         /** Max raw output size (chars) to preserve in-context for DATA_PRODUCING tools */
         const val DATA_PRODUCING_RAW_OUTPUT_BUFFER = 16_000
 
+        /** Max raw output size (chars) for read_file — lazy compression deferred to RECENT_WORK */
+        const val READ_FILE_RAW_OUTPUT_BUFFER = 524_288
+
         /** Max tokens of enriched context to inject into coding tools */
         const val CODING_TOOL_CONTEXT_TOKENS = 8_000
 
@@ -80,11 +83,17 @@ class TurnToolExecutor(
             rawOutput: String,
             summaryText: String,
             wasSummarized: Boolean,
-            isDataProducing: Boolean
+            isDataProducing: Boolean,
+            toolName: String = ""
         ): Pair<String, Boolean> {
             val rawLen = rawOutput.length
             return when {
                 rawLen <= 500 -> rawOutput to false
+
+                // read_file: keep raw up to 512KB, let RECENT_WORK compress lazily
+                toolName == "read_file" && rawLen <= READ_FILE_RAW_OUTPUT_BUFFER ->
+                    rawOutput to false
+
                 isDataProducing && rawLen <= DATA_PRODUCING_RAW_OUTPUT_BUFFER && wasSummarized ->
                     rawOutput to true
                 isDataProducing && wasSummarized ->
@@ -148,7 +157,7 @@ class TurnToolExecutor(
 
         val blockedResults = blockedIndexed.map { (index, toolCall) ->
             val subtaskId = subtaskIds[toolCall.id]!!
-            val errorText = "Error: Tool '${toolCall.name}' is not allowed for current run profile"
+            val errorText = buildProfileBlockedError(toolCall.name, profileOverrides)
             subtaskRepository.updateStatus(subtaskId, TaskStatus.FAILED)
             subtaskRepository.updateResult(subtaskId, result = null, errorMessage = errorText)
             listener?.onToolExecutionCompleted(taskId, toolCall, errorText, false)
@@ -180,19 +189,15 @@ class TurnToolExecutor(
                 .map { it.second }
         }
 
-        val containsInvokeSubagent = allowedIndexed.any { (_, toolCall) ->
-            isDelegationTool(toolCall.name)
-        }
         // If any tool requires ASK approval, run sequentially to avoid multiple simultaneous dialogs
         val containsAskTool = permissionsService != null && approvalService != null &&
             allowedIndexed.any { (_, toolCall) ->
                 permissionsService.getPermission(toolCall.name, mode) == PermissionLevel.ASK
             }
-        // Allow READ_ONLY parallel execution during subagent runs — only block parallel
-        // for invoke_subagent (recursion risk) and ASK tools (concurrent approval dialogs).
-        // Previously isSubagentRun blanket-disabled all parallelism, causing 5 sequential
-        // read_file calls to take 270s instead of ~70s in a documentation-engineer session.
-        val shouldDisableParallel = containsInvokeSubagent || containsAskTool
+        // Recursion protection for delegation tools lives in InvokeSubagentTool.execute()
+        // (subagentChain check), so running multiple invoke_subagent in parallel is safe.
+        // LLM-level concurrency is bounded per-endpoint by OllamaRequestGate.
+        val shouldDisableParallel = containsAskTool
 
         // Parallel execution for READ_ONLY tools
         if (config.parallelReadTools && allowedIndexed.size > 1 && !shouldDisableParallel) {
@@ -270,10 +275,7 @@ class TurnToolExecutor(
         }
 
         if (config.parallelReadTools && shouldDisableParallel) {
-            logger.info {
-                "[PARALLEL] Disabled for this batch: invoke_subagent=$containsInvokeSubagent, " +
-                    "containsAskTool=$containsAskTool"
-            }
+            logger.info { "[PARALLEL] Disabled for this batch: containsAskTool=$containsAskTool" }
         }
 
         // Sequential execution
@@ -594,7 +596,8 @@ class TurnToolExecutor(
                     rawOutput = outputWithWarnings,
                     summaryText = summaryResult.summary,
                     wasSummarized = summaryResult.wasSummarized,
-                    isDataProducing = isDataProducing
+                    isDataProducing = isDataProducing,
+                    toolName = toolCall.name
                 )
                 if (isDataProducing && outputWithWarnings.length <= DATA_PRODUCING_RAW_OUTPUT_BUFFER && summaryResult.wasSummarized) {
                     logger.info {
@@ -743,6 +746,26 @@ class TurnToolExecutor(
         return true
     }
 
+    /**
+     * Build a self-correcting error that tells the LLM which tools it actually has. Weak models
+     * hallucinate tool names from conversation history — listing the real whitelist lets them
+     * recover on the next iteration instead of repeating the same blocked call.
+     */
+    private fun buildProfileBlockedError(
+        toolName: String,
+        profileOverrides: TurnProfileOverrides?,
+    ): String {
+        val allowed = profileOverrides?.allowedTools?.takeIf { it.isNotEmpty() }
+        val disallowed = profileOverrides?.disallowedTools?.takeIf { it.isNotEmpty() }
+        val scope = profileOverrides?.subagentName?.let { "subagent '$it'" } ?: "current run profile"
+        val details = when {
+            allowed != null -> "Your available tools are: ${allowed.joinToString(", ")}. Pick one of these or produce a final response."
+            disallowed != null -> "This tool is on the blocklist for this profile (${disallowed.joinToString(", ")}). Use a different approach."
+            else -> "Check the <available_tools> section and use only tools listed there."
+        }
+        return "Error: Tool '$toolName' is not available to the $scope. $details"
+    }
+
     /**
      * Count write tool calls in list.
      */
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/services/turn/UserQuestionService.kt b/core/src/main/kotlin/pl/jclab/refio/core/services/turn/UserQuestionService.kt
new file mode 100644
index 00000000..069ccc95
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/services/turn/UserQuestionService.kt
@@ -0,0 +1,83 @@
+package pl.jclab.refio.core.services.turn
+
+import kotlinx.coroutines.CompletableDeferred
+import kotlinx.coroutines.TimeoutCancellationException
+import kotlinx.coroutines.withTimeout
+import pl.jclab.refio.core.logging.dualLogger
+import java.util.UUID
+import java.util.concurrent.ConcurrentHashMap
+
+private val logger = dualLogger("UserQuestionService")
+
+/**
+ * Suspends the agent loop while waiting for a user answer to a question.
+ *
+ * Usage:
+ * 1. Tool calls ask(question) -> gets back a requestId
+ * 2. Service emits AskUserRequest to listener (UI shows the question)
+ * 3. UI calls resolve(requestId, answer) when user responds
+ * 4. ask() returns the answer
+ */
+class UserQuestionService(
+    private val timeoutMs: Long = DEFAULT_TIMEOUT_MS
+) {
+    companion object {
+        const val DEFAULT_TIMEOUT_MS = 10 * 60 * 1000L // 10 minutes
+    }
+
+    data class AskUserRequest(
+        val requestId: String,
+        val taskId: String,
+        val question: String,
+        val options: List<String>?
+    )
+
+    interface Listener {
+        fun onAskUserRequest(request: AskUserRequest)
+    }
+
+    private data class PendingEntry(
+        val request: AskUserRequest,
+        val deferred: CompletableDeferred<String>
+    )
+
+    private val pending = ConcurrentHashMap<String, PendingEntry>()
+    var listener: Listener? = null
+
+    suspend fun ask(taskId: String, question: String, options: List<String>?): Result<String> {
+        val requestId = UUID.randomUUID().toString()
+        val deferred = CompletableDeferred<String>()
+        val request = AskUserRequest(requestId, taskId, question, options)
+        pending[requestId] = PendingEntry(request, deferred)
+
+        listener?.onAskUserRequest(request)
+            ?: return Result.failure(IllegalStateException("No UI listener — ask_user not supported in this context"))
+
+        return try {
+            val answer = withTimeout(timeoutMs) { deferred.await() }
+            Result.success(answer)
+        } catch (e: TimeoutCancellationException) {
+            pending.remove(requestId)
+            Result.failure(RuntimeException("User did not respond within ${timeoutMs / 1000}s"))
+        } finally {
+            pending.remove(requestId)
+        }
+    }
+
+    fun resolve(requestId: String, answer: String) {
+        val entry = pending[requestId]
+        if (entry == null) {
+            logger.warn { "No pending question for requestId=$requestId" }
+            return
+        }
+        entry.deferred.complete(answer)
+    }
+
+    fun cancel(requestId: String, reason: String = "Cancelled by user") {
+        val entry = pending[requestId]
+        entry?.deferred?.completeExceptionally(RuntimeException(reason))
+    }
+
+    fun getPendingRequests(): List<AskUserRequest> =
+        pending.values.map { it.request }
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/services/turn/providers/SystemEnvironmentPromptProvider.kt b/core/src/main/kotlin/pl/jclab/refio/core/services/turn/providers/SystemEnvironmentPromptProvider.kt
index 131f820b..e7b75d01 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/services/turn/providers/SystemEnvironmentPromptProvider.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/services/turn/providers/SystemEnvironmentPromptProvider.kt
@@ -45,7 +45,18 @@ class SystemEnvironmentPromptProvider(
         val cwd = projectRoot?.toAbsolutePath()?.toString() ?: System.getProperty("user.dir")
         val home = System.getProperty("user.home")
         val platformHint = when {
-            isWindows -> "Windows — use Windows shell syntax. Prefer PowerShell or Git Bash; avoid GNU-only flags. Paths use backslashes. Use `where` (not `which`) to locate binaries. Avoid `/dev/null` — use `NUL`. `&&` works in cmd/PowerShell; chain with `;` in PowerShell."
+            isWindows -> buildString {
+                append("**CRITICAL: This is a Windows PowerShell environment.**\n")
+                append("SHELL SYNTAX RULES (violations will cause command failures):\n")
+                append("- Do NOT use `&&` to chain commands — use `;` in PowerShell\n")
+                append("- Do NOT use heredocs (`<<'EOF'`) — not supported in PowerShell\n")
+                append("- Do NOT use `/dev/null` — use `NUL` or `Out-Null`\n")
+                append("- Do NOT pass complex Python one-liners via `python -c \"...\"` — ")
+                append("PowerShell mangles quotes. Write a .py file and run it instead.\n")
+                append("- Use `where` not `which` to locate binaries\n")
+                append("- Paths use backslashes: `dir\\file.py` not `dir/file.py`\n")
+                append("PREFERRED PATTERN: Write scripts to files, then execute them.")
+            }
             isMac -> "macOS — BSD userland (not GNU). Some flags differ from Linux (e.g. `sed -i ''`, `find` predicates). `xargs -r` is unavailable."
             else -> "Linux — GNU userland. Standard POSIX + GNU extensions available."
         }
@@ -59,14 +70,19 @@ class SystemEnvironmentPromptProvider(
             append("user_home: $home\n")
             append("path_separator: \"$pathSep\"\n")
             append("file_separator: \"$fileSep\"\n")
-            append("platform_notes: $platformHint\n")
             append("available_tools:\n")
             for ((tool, available) in toolStatus) {
                 append("  - $tool: ${if (available) "yes" else "no"}\n")
             }
+
             append("IMPORTANT: Pick shell commands and flags that match the OS above. ")
             append("Do NOT assume a tool exists unless it is listed as `yes` in available_tools. ")
             append("When uncertain, prefer cross-platform alternatives or use Refio tools instead of raw shell.\n")
+
+            append("\n\n<platform_rules>\n")
+            append(platformHint)
+            append("\n</platform_rules>\n")
+
             append("</system_environment>")
         }
     }
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/session/AbstractToolCallLifecycleListener.kt b/core/src/main/kotlin/pl/jclab/refio/core/session/AbstractToolCallLifecycleListener.kt
new file mode 100644
index 00000000..96d65499
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/session/AbstractToolCallLifecycleListener.kt
@@ -0,0 +1,126 @@
+package pl.jclab.refio.core.session
+
+import kotlinx.coroutines.CoroutineScope
+import kotlinx.coroutines.launch
+import pl.jclab.refio.core.db.TaskMode
+import pl.jclab.refio.core.db.ToolCallData
+import pl.jclab.refio.core.logging.dualLogger
+import pl.jclab.refio.core.services.TurnResult
+import pl.jclab.refio.core.services.turn.TurnEventListener
+import java.util.concurrent.ConcurrentHashMap
+
+private val logger = dualLogger("AbstractToolCallLifecycleListener")
+
+/**
+ * Shared bookkeeping for the "temporary assistant message per tool call" pattern
+ * used by both the IntelliJ plugin ([CoreSessionService]) and the CLI TUI.
+ *
+ * Maintains a `toolCallId → tempMessageId` map and dispatches lifecycle hooks
+ * ([onCreateTempMessage], [onUpdateTempMessage], [onFinalizeTempMessage]) that
+ * subclasses wire to their own message store (Core [pl.jclab.refio.api.models.Message]
+ * for the plugin, `TuiChatMessage` for CLI).
+ *
+ * Subclasses must also implement [onAfterToolLifecycleEvent] if they need to
+ * trigger DB reloads (e.g. `subtaskTracker.loadSubtasks()`).
+ */
+abstract class AbstractToolCallLifecycleListener(
+    private val scope: CoroutineScope,
+) : TurnEventListener {
+
+    protected val toolCallMessageIds = ConcurrentHashMap<String, String>()
+
+    /** Create a provisional "tool call in progress" message, returning its id. */
+    protected abstract fun onCreateTempMessage(taskId: String, toolCall: ToolCallData): String
+
+    /** Apply streaming content updates to the tool-call message identified by [messageId]. */
+    protected abstract fun onUpdateTempMessage(
+        messageId: String,
+        toolCallId: String,
+        delta: String,
+        accumulated: String,
+    )
+
+    /**
+     * Finalize the tool-call message with the execution result.
+     * The implementation should mark it no-longer-streaming and record success/failure.
+     */
+    protected abstract fun onFinalizeTempMessage(
+        messageId: String,
+        toolCall: ToolCallData,
+        result: String,
+        success: Boolean,
+    )
+
+    /**
+     * Optional hook for refreshing DB-backed views (e.g. subtask tracker) after any
+     * tool lifecycle event. Default implementation: no-op.
+     */
+    protected open suspend fun onAfterToolLifecycleEvent(taskId: String) {}
+
+    override fun onTurnStarted(
+        taskId: String,
+        mode: TaskMode,
+        runId: String,
+        parentRunId: String?,
+        depth: Int,
+    ) {
+        logger.info {
+            "[TURN_LOOP] Turn started: taskId=$taskId, mode=$mode, runId=$runId, " +
+                "parentRunId=${parentRunId ?: "-"}, depth=$depth"
+        }
+    }
+
+    override fun onToolExecutionStarted(taskId: String, toolCall: ToolCallData) {
+        logger.info { "[TURN_LOOP] Tool started: ${toolCall.name}" }
+        val tempId = onCreateTempMessage(taskId, toolCall)
+        toolCallMessageIds[toolCall.id] = tempId
+        scope.launch { onAfterToolLifecycleEvent(taskId) }
+    }
+
+    override fun onToolStreamChunk(
+        taskId: String,
+        toolCallId: String,
+        delta: String,
+        accumulated: String,
+    ) {
+        val messageId = toolCallMessageIds[toolCallId] ?: run {
+            logger.warn { "[TURN_LOOP] Tool stream chunk for unknown tool call: $toolCallId" }
+            return
+        }
+        onUpdateTempMessage(messageId, toolCallId, delta, accumulated)
+    }
+
+    override fun onToolExecutionCompleted(
+        taskId: String,
+        toolCall: ToolCallData,
+        result: String,
+        success: Boolean,
+    ) {
+        logger.info { "[TURN_LOOP] Tool completed: ${toolCall.name}, success=$success" }
+        val messageId = toolCallMessageIds.remove(toolCall.id) ?: return
+        onFinalizeTempMessage(messageId, toolCall, result, success)
+        scope.launch { onAfterToolLifecycleEvent(taskId) }
+    }
+
+    override fun onStreamChunk(taskId: String, delta: String, accumulated: String) {
+        // Callers route regular LLM content through their own stream callback.
+    }
+
+    override fun onTurnCompleted(
+        taskId: String,
+        result: TurnResult,
+        runId: String,
+        parentRunId: String?,
+        depth: Int,
+    ) {
+        logger.info {
+            "[TURN_LOOP] Turn completed: taskId=$taskId, success=${result.success}, " +
+                "iterations=${result.iterations}, runId=$runId, parentRunId=${parentRunId ?: "-"}, depth=$depth"
+        }
+    }
+
+    /** Clear the bookkeeping map — call after [MessageDispatcher.loadMessages] reconciles against DB. */
+    fun clearTracking() {
+        toolCallMessageIds.clear()
+    }
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/session/CoreMessageToolCallListener.kt b/core/src/main/kotlin/pl/jclab/refio/core/session/CoreMessageToolCallListener.kt
new file mode 100644
index 00000000..f14d8e72
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/session/CoreMessageToolCallListener.kt
@@ -0,0 +1,109 @@
+package pl.jclab.refio.core.session
+
+import kotlinx.coroutines.CoroutineScope
+import kotlinx.coroutines.launch
+import pl.jclab.refio.api.models.Message
+import pl.jclab.refio.api.models.ToolCallDisplayInfo
+import pl.jclab.refio.api.models.ToolCallResult
+import pl.jclab.refio.api.models.ToolCallStatus
+import pl.jclab.refio.api.models.ToolDisplayType
+import pl.jclab.refio.core.db.ToolCallData
+
+/**
+ * [AbstractToolCallLifecycleListener] implementation backed by [SessionStateManager].
+ *
+ * Used by [CoreSessionService] (and any other caller that renders tool-call
+ * progress through the canonical [Message] stream) to create / update / finalize
+ * temporary assistant messages per tool invocation.
+ */
+class CoreMessageToolCallListener(
+    scope: CoroutineScope,
+    private val stateManager: SessionStateManager,
+    private val onReloadSubtasks: suspend () -> Unit,
+    private val resolveToolDisplayType: (String) -> ToolDisplayType,
+    private val parseToolParameters: (String) -> Map<String, String>,
+) : AbstractToolCallLifecycleListener(scope) {
+
+    private val scopeRef = scope
+
+    override fun onCreateTempMessage(taskId: String, toolCall: ToolCallData): String {
+        val tempId = "temp-${toolCall.id}"
+        val toolInfo = ToolCallDisplayInfo(
+            toolName = toolCall.name,
+            toolCallId = toolCall.id,
+            displayType = resolveToolDisplayType(toolCall.name),
+            parameters = parseToolParameters(toolCall.arguments),
+            status = ToolCallStatus.EXECUTING,
+        )
+        val tempMessage = Message(
+            id = tempId,
+            taskId = taskId,
+            role = "assistant",
+            content = "",
+            toolCallInfo = toolInfo,
+            createdAt = System.currentTimeMillis(),
+        )
+        scopeRef.launch { stateManager.appendMessage(tempMessage) }
+        return tempId
+    }
+
+    override fun onUpdateTempMessage(
+        messageId: String,
+        toolCallId: String,
+        delta: String,
+        accumulated: String,
+    ) {
+        scopeRef.launch {
+            stateManager.updateMessages { messages ->
+                messages.map { msg ->
+                    if (msg.id == messageId) {
+                        msg.copy(
+                            content = accumulated,
+                            isStreaming = true,
+                            isToolStreaming = true,
+                            lastChunkAt = System.currentTimeMillis(),
+                        )
+                    } else msg
+                }
+            }
+        }
+    }
+
+    override fun onFinalizeTempMessage(
+        messageId: String,
+        toolCall: ToolCallData,
+        result: String,
+        success: Boolean,
+    ) {
+        val resultSummary = if (result.isNotBlank()) {
+            val trimmed = result.trim()
+            if (trimmed.length <= 120) trimmed else "${trimmed.take(120)}..."
+        } else null
+
+        scopeRef.launch {
+            stateManager.updateMessages { messages ->
+                messages.map { msg ->
+                    if (msg.id == messageId) {
+                        val updatedToolInfo = msg.toolCallInfo?.copy(
+                            status = if (success) ToolCallStatus.COMPLETED else ToolCallStatus.FAILED,
+                            result = if (resultSummary != null) ToolCallResult(
+                                success = success,
+                                summary = resultSummary,
+                            ) else null,
+                        )
+                        msg.copy(
+                            toolCallInfo = updatedToolInfo,
+                            isStreaming = false,
+                            isToolStreaming = false,
+                            lastChunkAt = System.currentTimeMillis(),
+                        )
+                    } else msg
+                }
+            }
+        }
+    }
+
+    override suspend fun onAfterToolLifecycleEvent(taskId: String) {
+        onReloadSubtasks()
+    }
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/session/CoreSessionService.kt b/core/src/main/kotlin/pl/jclab/refio/core/session/CoreSessionService.kt
new file mode 100644
index 00000000..b266bc55
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/session/CoreSessionService.kt
@@ -0,0 +1,747 @@
+package pl.jclab.refio.core.session
+
+import kotlinx.coroutines.CoroutineScope
+import kotlinx.coroutines.Job
+import kotlinx.coroutines.cancel
+import kotlinx.coroutines.delay
+import kotlinx.coroutines.launch
+import kotlinx.coroutines.sync.Mutex
+import kotlinx.coroutines.sync.withLock
+import pl.jclab.refio.api.models.ContextReference
+import pl.jclab.refio.api.models.ExecutionMode
+import pl.jclab.refio.api.models.Message
+import pl.jclab.refio.api.models.Session
+import pl.jclab.refio.api.models.TaskMode
+import pl.jclab.refio.api.models.ToolCallDisplayInfo
+import pl.jclab.refio.api.models.ToolCallResult
+import pl.jclab.refio.api.models.ToolCallStatus
+import pl.jclab.refio.api.models.ToolDisplayType
+import pl.jclab.refio.core.api.CoreApiRouter
+import pl.jclab.refio.core.api.StreamCallback
+import pl.jclab.refio.core.config.ConfigKeys
+import pl.jclab.refio.core.api.TurnProfileOverrides
+import pl.jclab.refio.core.api.TurnRequest
+import pl.jclab.refio.core.api.TurnRunProfile
+import pl.jclab.refio.core.api.UIAdapter
+import pl.jclab.refio.core.api.UpdateTaskRequest
+import pl.jclab.refio.core.db.ConfigScope
+import pl.jclab.refio.core.db.ToolCallData
+import pl.jclab.refio.core.errors.RefioError
+import pl.jclab.refio.core.logging.dualLogger
+import pl.jclab.refio.core.services.AgentTurnLoop
+import pl.jclab.refio.core.services.ConfigService
+import pl.jclab.refio.core.services.TurnResult
+import pl.jclab.refio.core.services.monitoring.GlobalMetrics
+import pl.jclab.refio.core.services.monitoring.OperationInfo
+import pl.jclab.refio.core.workflow.WorkflowOrchestrator
+import pl.jclab.refio.core.workflow.models.IntentResult
+import pl.jclab.refio.core.workflow.models.UIState
+import pl.jclab.refio.core.workflow.models.WorkflowRequest
+import java.util.UUID
+import java.util.concurrent.atomic.AtomicBoolean
+import java.util.concurrent.atomic.AtomicReference
+
+private val logger = dualLogger("CoreSessionService")
+
+/**
+ * Platform-agnostic session execution service.
+ *
+ * Owns the three send-message orchestration paths (CHAT workflow, PLAN/AGENT turn loop,
+ * legacy workflow entry) plus the shared auxiliaries (auto-naming, cost refresh,
+ * tool-call display helpers). Constructed once per project-level
+ * [CoreApiRouter]; IntelliJ's `SessionManager` and the CLI TUI both delegate
+ * through this class so execution behavior stays in sync.
+ *
+ * StreamFilter, streaming-message bookkeeping, and the temp tool-call message map
+ * live here. UI-only state (pending input, context-section tokens, StatusBar, EDT
+ * dispatch) stays in the per-platform binding that constructs this service.
+ */
+class CoreSessionService(
+    private val projectRouter: CoreApiRouter,
+    private val stateManager: SessionStateManager,
+    private val subtaskTracker: SubtaskTracker,
+    private val messageDispatcher: MessageDispatcher,
+    private val lifecycleService: SessionLifecycleService,
+    private val uiAdapter: UIAdapter,
+    private val scope: CoroutineScope,
+    private val modeSwitchMutex: Mutex,
+) {
+
+    private val configService: ConfigService
+        get() = projectRouter.configService
+
+    suspend fun sendMessage(
+        input: String,
+        contextRefs: List<ContextReference> = emptyList(),
+        model: String? = null,
+        provider: String? = null,
+    ): Message {
+        GlobalMetrics.resetCancellation()
+
+        val currentSession = modeSwitchMutex.withLock {
+            lifecycleService.ensureActiveSessionExists()
+        }
+
+        logger.info {
+            "[SESSION] sendMessage: taskId=${currentSession.id}, mode=${currentSession.mode}, " +
+                    "executionMode=${currentSession.executionMode}, inputChars=${input.length}, " +
+                    "contextRefs=${contextRefs.size}, model=${model ?: "auto"}, provider=${provider ?: "auto"}"
+        }
+        return sendMessageUsingWorkflow(currentSession, input, contextRefs, model, provider)
+    }
+
+    private suspend fun sendMessageUsingWorkflow(
+        session: Session,
+        input: String,
+        contextRefs: List<ContextReference>,
+        model: String?,
+        provider: String?,
+    ): Message {
+        stateManager.setIsGenerating(true)
+        return try {
+            val stream = isStreamingEnabled()
+            val executionMode = session.executionMode
+            logger.info {
+                "[SESSION] Workflow start: taskId=${session.id}, mode=${session.mode}, " +
+                        "executionMode=$executionMode, stream=$stream"
+            }
+
+            val userMessage = Message(
+                id = UUID.randomUUID().toString(),
+                taskId = session.id,
+                role = "user",
+                content = input,
+                createdAt = System.currentTimeMillis(),
+            )
+            stateManager.appendMessage(userMessage)
+
+            when (session.mode) {
+                TaskMode.CHAT -> sendMessageUsingChatWorkflow(session, input, contextRefs, model, provider, stream, executionMode)
+                TaskMode.PLAN, TaskMode.AGENT -> sendMessageUsingTurnLoop(session, input, contextRefs, model, provider, stream, executionMode)
+            }
+        } catch (e: RefioError.MalformedResponse) {
+            logger.error(e) {
+                "[SESSION] Malformed response from provider=${e.provider}/${e.model}: reason=${e.reason}, " +
+                        "bodyPreview=${e.bodyPreview.take(500)}"
+            }
+            val userFacing = "Provider ${e.provider} returned an invalid response — check logs for details."
+            uiAdapter.showError(userFacing)
+            val errorMessage = Message(
+                id = UUID.randomUUID().toString(),
+                taskId = session.id,
+                role = "system",
+                content = userFacing,
+                createdAt = System.currentTimeMillis(),
+            )
+            stateManager.appendMessage(errorMessage)
+            throw e
+        } catch (e: Exception) {
+            logger.error(e) { "[SESSION] Workflow failed: taskId=${session.id}, error=${e.message}" }
+            uiAdapter.showError("Workflow failed: ${e.message}")
+            val errorMessage = Message(
+                id = UUID.randomUUID().toString(),
+                taskId = session.id,
+                role = "system",
+                content = "Error: ${e.message}",
+                createdAt = System.currentTimeMillis(),
+            )
+            stateManager.appendMessage(errorMessage)
+            throw e
+        } finally {
+            stateManager.setIsGenerating(false)
+        }
+    }
+
+    private suspend fun sendMessageUsingTurnLoop(
+        session: Session,
+        input: String,
+        contextRefs: List<ContextReference>,
+        model: String?,
+        provider: String?,
+        stream: Boolean,
+        executionMode: ExecutionMode,
+    ): Message {
+        logger.info {
+            "[TURN_LOOP] Starting turn: taskId=${session.id}, mode=${session.mode}, " +
+                    "inputChars=${input.length}, contextRefs=${contextRefs.size}"
+        }
+
+        GlobalMetrics.setCurrentOperation(OperationInfo.ChatRequest(model ?: "auto"))
+
+        try {
+            var streamingMessageId: String? = null
+            val streamingClosed = AtomicBoolean(false)
+            val pendingStreamContent = AtomicReference<String?>(null)
+            val streamStateMutex = Mutex()
+            var streamUiFlushJob: Job? = null
+            val streamFilter = IncrementalToolCallStreamFilter()
+
+            val streamCallback: StreamCallback? = if (stream) { chunk ->
+                scope.launch {
+                    if (streamingClosed.get()) return@launch
+
+                    streamStateMutex.withLock {
+                        val now = System.currentTimeMillis()
+                        val filteredContent = streamFilter.filter(
+                            delta = chunk.delta,
+                            accumulated = chunk.accumulated,
+                            isComplete = chunk.isComplete,
+                        )
+
+                        if (filteredContent.isNotBlank()) {
+                            if (streamingMessageId == null) {
+                                streamingMessageId = UUID.randomUUID().toString()
+                                stateManager.appendMessage(
+                                    Message(
+                                        id = streamingMessageId!!,
+                                        taskId = session.id,
+                                        role = "assistant",
+                                        content = "",
+                                        isStreaming = true,
+                                        streamStartedAt = now,
+                                        createdAt = now,
+                                    )
+                                )
+                            }
+                            pendingStreamContent.set(filteredContent)
+                        }
+
+                        if (chunk.isComplete) {
+                            val completedId = streamingMessageId
+                            val finalContent = pendingStreamContent.getAndSet(null)
+                            if (completedId != null) {
+                                stateManager.updateMessages { messages ->
+                                    messages.map { msg ->
+                                        if (msg.id == completedId) {
+                                            msg.copy(
+                                                content = finalContent ?: msg.content,
+                                                lastChunkAt = now,
+                                                isStreaming = false,
+                                            )
+                                        } else msg
+                                    }
+                                }
+                            }
+                            streamingMessageId = null
+                            streamUiFlushJob?.cancel()
+                            streamUiFlushJob = null
+                            return@withLock
+                        }
+
+                        if (streamUiFlushJob?.isActive != true) {
+                            streamUiFlushJob = scope.launch {
+                                while (!streamingClosed.get()) {
+                                    val contentToFlush = streamStateMutex.withLock {
+                                        pendingStreamContent.getAndSet(null)
+                                    }
+
+                                    if (!contentToFlush.isNullOrBlank()) {
+                                        val activeMessageId = streamingMessageId
+                                        if (activeMessageId != null) {
+                                            stateManager.updateMessages { messages ->
+                                                messages.map { msg ->
+                                                    if (msg.id == activeMessageId) {
+                                                        msg.copy(
+                                                            content = contentToFlush,
+                                                            lastChunkAt = System.currentTimeMillis(),
+                                                            isStreaming = true,
+                                                        )
+                                                    } else msg
+                                                }
+                                            }
+                                        }
+                                    }
+
+                                    delay(500)
+                                }
+                            }
+                        }
+                    }
+                }
+            } else null
+
+            val turnListener = CoreMessageToolCallListener(
+                scope = scope,
+                stateManager = stateManager,
+                onReloadSubtasks = { subtaskTracker.loadSubtasks() },
+                resolveToolDisplayType = ::resolveToolDisplayType,
+                parseToolParameters = ::parseToolParameters,
+            )
+
+            val modeDb = pl.jclab.refio.core.db.TaskMode.valueOf(session.mode.name)
+            val executionModeDb = pl.jclab.refio.core.db.ExecutionMode.valueOf(executionMode.name)
+            val defaultTurnRequest = TurnRequest(
+                taskId = session.id,
+                userInput = input,
+                mode = modeDb,
+                executionMode = executionModeDb,
+                model = model,
+                provider = provider,
+                userContextRefs = contextRefs,
+            )
+
+            val subagentRouter = projectRouter.subagentRouter
+            val subagentCommand = subagentRouter?.parseSubagentCommand(input)
+            val subagentInvocation = subagentRouter?.parseSubagentInvocation(input)
+
+            if (subagentCommand != null && subagentInvocation == null) {
+                val (requestedName, _) = subagentCommand
+                val allSubagents = subagentRouter.listSubagents(includeDisabled = true)
+                val matched = allSubagents.firstOrNull { it.name.equals(requestedName, ignoreCase = true) }
+                val enabledSubagentNames = allSubagents
+                    .filter { it.enabled }
+                    .map { it.name }
+                    .sorted()
+
+                val errorContent = when {
+                    matched == null -> buildString {
+                        append("Subagent '")
+                        append(requestedName)
+                        append("' not found.")
+                        if (enabledSubagentNames.isNotEmpty()) {
+                            append(" Available subagents: ")
+                            append(enabledSubagentNames.joinToString(", "))
+                            append(".")
+                        }
+                    }
+                    !matched.enabled -> "Subagent '${matched.name}' is disabled. Enable it in Settings > Subagents."
+                    else -> "Subagent '$requestedName' is not available."
+                }
+
+                logger.warn {
+                    "[TURN_LOOP] Invalid subagent invocation: name=$requestedName, reason='${errorContent.replace('\n', ' ')}'"
+                }
+
+                val assistantMessage = Message(
+                    id = UUID.randomUUID().toString(),
+                    taskId = session.id,
+                    role = "assistant",
+                    content = errorContent,
+                    createdAt = System.currentTimeMillis(),
+                )
+                stateManager.appendMessage(assistantMessage)
+                return assistantMessage
+            }
+
+            val turnRequest = if (subagentInvocation != null) {
+                val (subagentName, subagentPrompt) = subagentInvocation
+                val definition = subagentRouter.getSubagent(subagentName)
+
+                if (definition != null) {
+                    val parentModel = if (model != null && provider != null) "$provider/$model" else model
+                    val (resolvedModel, resolvedProvider) = definition.resolveModel(configService, parentModel)
+
+                    logger.info {
+                        "[TURN_LOOP] subagentDetected=true, subagentName=$subagentName, " +
+                            "runProfile=SUBAGENT, model=$resolvedModel, provider=$resolvedProvider"
+                    }
+
+                    TurnRequest(
+                        taskId = session.id,
+                        userInput = subagentPrompt,
+                        mode = modeDb,
+                        executionMode = executionModeDb,
+                        model = resolvedModel,
+                        provider = resolvedProvider,
+                        userContextRefs = contextRefs,
+                        runProfile = TurnRunProfile.SUBAGENT,
+                        profileOverrides = TurnProfileOverrides(
+                            subagentName = subagentName,
+                            systemPromptOverride = definition.systemPrompt,
+                            allowedTools = definition.allowedTools,
+                            disallowedTools = definition.disallowedTools,
+                            modelOverride = resolvedModel,
+                            providerOverride = resolvedProvider,
+                            maxIterationsOverride = definition.maxSteps,
+                            depth = 0,
+                            subagentChain = emptyList(),
+                            contextProfile = definition.contextProfile,
+                            reasoningEffort = definition.reasoningEffort,
+                        ),
+                    )
+                } else {
+                    logger.warn {
+                        "[TURN_LOOP] subagentDetected=true but definition not found: name=$subagentName, falling back"
+                    }
+                    defaultTurnRequest
+                }
+            } else {
+                defaultTurnRequest
+            }
+
+            // Read multi-agent settings from config (not StateManager) so a toggle in
+            // General Settings takes effect on the very next message. StateManager is loaded
+            // once per session in SessionLifecycleService and only PromptInputPanel's in-chat
+            // toggle keeps it in sync — the Settings panel writes to config only.
+            val strategy = configService.get(
+                ConfigKeys.UI_MULTI_AGENT_STRATEGY.key,
+                ConfigScope.APP,
+                taskId = session.id,
+            )?.let { pl.jclab.refio.api.models.MultiAgentStrategy.fromString(it) }
+                ?: stateManager.getMultiAgentStrategy()
+            val multiAgentEnabled = configService.get(
+                ConfigKeys.UI_ORCHESTRATION_ENABLED.key,
+                ConfigScope.APP,
+                taskId = session.id,
+            )?.toBooleanStrictOrNull()
+                ?: stateManager.getMultiAgentEnabled()
+            val orchestrator = projectRouter.orchestrationDispatcher
+            val useOrchestration = multiAgentEnabled &&
+                strategy != pl.jclab.refio.api.models.MultiAgentStrategy.SINGLE &&
+                subagentInvocation == null &&
+                orchestrator != null
+
+            // Live-refresh the UI when the turn spawns subagent or tool activity. Without this,
+            // messages persisted mid-turn by AgentTurnLoop (tool calls, sub-LLM responses) stay
+            // invisible until the outer runTurn completes and the final loadMessages() flush runs.
+            // We subscribe to AgentEventBus events for this session and reload on TOOL / TURN-END
+            // boundaries — so each subagent bubble materializes right after its tool call completes.
+            val liveRefreshJob = scope.launch {
+                projectRouter.agentEventBus.turnEvents(session.id).collect { event ->
+                    val depth = when (event) {
+                        is pl.jclab.refio.core.agents.events.AgentEvent.ToolCalled -> event.depth
+                        is pl.jclab.refio.core.agents.events.AgentEvent.TurnEnded -> event.depth
+                        else -> -1
+                    }
+                    // Only refresh for subagent activity (depth > 0); top-level turn messages
+                    // are already handled by the streaming path in this function.
+                    if (depth > 0) {
+                        messageDispatcher.loadMessages()
+                    }
+                }
+            }
+
+            // Per-subagent token streaming. AgentTurnLoop emits AgentEvent.StreamChunk for every
+            // subagent delta with runId/depth/agentName. We key streaming messages by runId and
+            // update them live so the user sees tokens appear inside per-agent bubbles while the
+            // subagent's LLM is still generating. The final DB-persisted ASSISTANT message (with
+            // the same agentName) replaces this transient entry when loadMessages() flushes.
+            // .collect serializes events so a plain map is safe.
+            val subagentStreamingIds = HashMap<String, String>()
+            val subagentStreamJob = scope.launch {
+                projectRouter.agentEventBus.events
+                    .collect { raw ->
+                        val ev = raw as? pl.jclab.refio.core.agents.events.AgentEvent.StreamChunk
+                            ?: return@collect
+                        if (ev.sessionId != session.id || ev.agentName == null || ev.runId == null) return@collect
+                        val now = System.currentTimeMillis()
+                        val key = ev.runId
+                        val existingId = subagentStreamingIds[key]
+                        val messageId: String
+                        if (existingId == null) {
+                            messageId = UUID.randomUUID().toString()
+                            subagentStreamingIds[key] = messageId
+                            stateManager.appendMessage(
+                                Message(
+                                    id = messageId,
+                                    taskId = session.id,
+                                    role = "assistant",
+                                    content = ev.accumulated,
+                                    isStreaming = !ev.isComplete,
+                                    streamStartedAt = now,
+                                    lastChunkAt = now,
+                                    createdAt = now,
+                                    agentName = ev.agentName,
+                                    agentDepth = ev.depth,
+                                )
+                            )
+                        } else {
+                            messageId = existingId
+                            stateManager.updateMessages { messages ->
+                                messages.map { msg ->
+                                    if (msg.id == messageId) {
+                                        msg.copy(
+                                            content = ev.accumulated,
+                                            lastChunkAt = now,
+                                            isStreaming = !ev.isComplete,
+                                        )
+                                    } else msg
+                                }
+                            }
+                        }
+                        if (ev.isComplete) {
+                            subagentStreamingIds.remove(key)
+                        }
+                    }
+            }
+
+            val result = if (useOrchestration) {
+                logger.info { "[TURN_LOOP] Orchestration dispatch: strategy=$strategy, taskId=${session.id}" }
+                val outcome = orchestrator.dispatch(
+                    parentTaskId = session.id,
+                    input = input,
+                    contextRefs = contextRefs,
+                    parentModel = model,
+                    parentProvider = provider,
+                    stream = stream,
+                    streamCallback = streamCallback,
+                    strategy = strategy,
+                )
+                TurnResult(
+                    success = true,
+                    response = outcome.response,
+                    iterations = 1,
+                    tokensIn = outcome.totalTokensIn,
+                    tokensOut = outcome.totalTokensOut,
+                    cost = outcome.totalCost,
+                )
+            } else {
+                projectRouter.agentRouter.runTurn(
+                    request = turnRequest,
+                    streamCallback = streamCallback,
+                    listener = turnListener,
+                )
+            }
+
+            logger.info {
+                "[TURN_LOOP] Turn complete: taskId=${session.id}, success=${result.success}, " +
+                        "iterations=${result.iterations}, responseChars=${result.response.length}"
+            }
+
+            liveRefreshJob.cancel()
+            subagentStreamJob.cancel()
+            // The transient per-subagent streaming messages live only in UI state; the DB-backed
+            // ASSISTANT rows that AgentTurnLoop persisted with agentName will replace them at the
+            // next messageDispatcher.loadMessages() below. Drop them so we don't show duplicates.
+            if (subagentStreamingIds.isNotEmpty()) {
+                val transientIds = subagentStreamingIds.values.toSet()
+                subagentStreamingIds.clear()
+                stateManager.updateMessages { messages ->
+                    messages.filterNot { it.id in transientIds }
+                }
+            }
+            streamingClosed.set(true)
+            streamUiFlushJob?.cancel()
+            val completedStreamingMessageId = streamingMessageId
+            streamingMessageId = null
+            if (completedStreamingMessageId != null) {
+                stateManager.updateMessages { messages ->
+                    messages.filterNot { it.id == completedStreamingMessageId }
+                }
+            }
+
+            messageDispatcher.loadMessages()
+            turnListener.clearTracking()
+            logger.debug { "[TURN_LOOP] Cleared tool call message tracking map after DB reload" }
+
+            val freshTask = projectRouter.taskRepository.findById(session.id)
+            if (freshTask != null) {
+                val updatedSession = session.copy(
+                    tokensIn = freshTask.tokensIn,
+                    tokensOut = freshTask.tokensOut,
+                    costUsd = freshTask.costUsd,
+                )
+                lifecycleService.updateSession(updatedSession)
+            }
+
+            if (isDefaultSessionName(session.name) && stateManager.messages.value.size >= 2) {
+                scheduleAutoNameSession(session, input)
+            }
+
+            return stateManager.messages.value.last()
+        } finally {
+            GlobalMetrics.setCurrentOperation(OperationInfo.Idle)
+            logger.info { "[TURN_LOOP] Operation state reset to Idle" }
+        }
+    }
+
+    private suspend fun sendMessageUsingChatWorkflow(
+        session: Session,
+        input: String,
+        contextRefs: List<ContextReference>,
+        model: String?,
+        provider: String?,
+        stream: Boolean,
+        executionMode: ExecutionMode,
+    ): Message {
+        logger.info {
+            "[CHAT_WORKFLOW] Starting chat: taskId=${session.id}, inputChars=${input.length}"
+        }
+
+        val uiState = UIState(
+            taskId = session.id,
+            mode = session.mode,
+            executionMode = executionMode,
+            input = input,
+            contextRefs = contextRefs,
+            model = model,
+            provider = provider,
+            streamingEnabled = stream,
+            thinkingEnabled = stateManager.getThinkingEnabled(),
+            noEgressEnabled = stateManager.getNoEgressEnabled(),
+        )
+
+        val listener = DefaultWorkflowStreamingListener(
+            taskId = session.id,
+            stateManager = stateManager,
+            scope = scope,
+            streamingEnabled = stream,
+        )
+
+        val projectAnalysis = try {
+            projectRouter.projectContextRouter.getProjectAnalysisSummary()
+        } catch (e: Exception) {
+            logger.warn(e) { "[SESSION] Failed to generate project analysis, using null" }
+            null
+        }
+
+        val result = projectRouter.workflowOrchestrator.execute(
+            request = WorkflowRequest(
+                uiState = uiState,
+                projectAnalysis = projectAnalysis,
+            ),
+            listener = listener,
+        )
+
+        logger.info { "[CHAT_WORKFLOW] Workflow result: taskId=${session.id}, type=${result::class.simpleName}" }
+
+        when (result) {
+            is IntentResult.ChatResult -> {
+                val response = result.response
+                logger.info {
+                    "[CHAT_WORKFLOW] Chat response: taskId=${response.taskId}, outputChars=${response.output.length}"
+                }
+                if (response.taskId != session.id) {
+                    logger.info { "[CHAT] Task ID changed: ${session.id} -> ${response.taskId}, syncing session" }
+                    uiAdapter.log("INFO", "Session ID changed: ${session.id} -> ${response.taskId}")
+                    val newSession = session.copy(id = response.taskId)
+                    stateManager.setActiveSession(newSession)
+                }
+                updateSessionCosts(stateManager.getActiveSession() ?: session)
+                autoNameSessionIfNeeded(stateManager.getActiveSession() ?: session, input)
+                messageDispatcher.loadMessages()
+            }
+
+            is IntentResult.SubagentResult -> {
+                logger.info { "[CHAT_WORKFLOW] Subagent response: taskId=${session.id}" }
+                messageDispatcher.loadMessages()
+            }
+
+            else -> {
+                logger.warn { "[CHAT_WORKFLOW] Unexpected result type in CHAT mode: ${result::class.simpleName}" }
+            }
+        }
+
+        return stateManager.messages.value.last()
+    }
+
+    private fun isStreamingEnabled(): Boolean {
+        return try {
+            val streamingConfig = configService.get(
+                key = ConfigKeys.STREAMING_ENABLED.key,
+                scope = ConfigScope.APP,
+            )
+            streamingConfig?.toBoolean() ?: true
+        } catch (e: Exception) {
+            logger.warn(e) { "Failed to read streaming config, defaulting to true" }
+            true
+        }
+    }
+
+    private suspend fun updateSessionCosts(session: Session) {
+        val freshTask = projectRouter.taskRepository.findById(session.id)
+        if (freshTask != null) {
+            lifecycleService.updateSession(
+                session.copy(
+                    tokensIn = freshTask.tokensIn,
+                    tokensOut = freshTask.tokensOut,
+                    costUsd = freshTask.costUsd,
+                )
+            )
+        }
+    }
+
+    private fun isDefaultSessionName(name: String): Boolean {
+        return name == "New Session" || name.matches(Regex("^Session \\(.+\\)$"))
+    }
+
+    private suspend fun autoNameSessionIfNeeded(session: Session, input: String) {
+        if (isDefaultSessionName(session.name) && stateManager.messages.value.size == 2) {
+            scheduleAutoNameSession(session, input)
+        }
+    }
+
+    private fun scheduleAutoNameSession(session: Session, input: String) {
+        if (!isDefaultSessionName(session.name)) return
+
+        scope.launch {
+            try {
+                val rawTitle = projectRouter.chatRouter.generateSessionTitle(session.id, input)
+                val generatedName = sanitizeSessionTitle(rawTitle)
+                    .ifBlank { generateSessionNameFallback(input) }
+
+                projectRouter.taskRouter.updateTask(session.id, UpdateTaskRequest(name = generatedName))
+                lifecycleService.updateSession(
+                    stateManager.getActiveSession()?.copy(name = generatedName)
+                        ?: return@launch
+                )
+                logger.info { "Auto-named: '$generatedName'" }
+            } catch (e: Exception) {
+                val fallback = generateSessionNameFallback(input)
+                try {
+                    projectRouter.taskRouter.updateTask(session.id, UpdateTaskRequest(name = fallback))
+                    lifecycleService.updateSession(
+                        stateManager.getActiveSession()?.copy(name = fallback)
+                            ?: return@launch
+                    )
+                    logger.info { "Auto-named with fallback: '$fallback'" }
+                } catch (inner: Exception) {
+                    logger.warn(inner) { "Auto-name failed" }
+                }
+            }
+        }
+    }
+
+    private fun sanitizeSessionTitle(raw: String): String {
+        return raw
+            .trim()
+            .trim('"', '\'', '\u201C', '\u201D')
+            .replace(Regex("[\\r\\n]+"), " ")
+            .replace(Regex("\\s+"), " ")
+            .replace(Regex("[.!?:;]+$"), "")
+            .take(60)
+    }
+
+    private fun generateSessionNameFallback(input: String): String {
+        val cleaned = input
+            .trim()
+            .replace(Regex("\\s+"), " ")
+            .replace(Regex("[\\r\\n]+"), " ")
+
+        val firstSentence = cleaned.split(Regex("[.!?]\\s+")).firstOrNull() ?: cleaned
+        val truncated = if (firstSentence.length > 50) {
+            firstSentence.substring(0, 47) + "..."
+        } else {
+            firstSentence
+        }
+
+        return truncated.ifBlank { "Chat" }
+    }
+
+    private fun resolveToolDisplayType(toolName: String): ToolDisplayType {
+        return when (toolName) {
+            "advance_code_editing", "multi_line_editor" -> ToolDisplayType.LLM_EDIT
+            "code_editing", "create_new_file", "multi_edit" -> ToolDisplayType.CODE_EDIT
+            "run_terminal_command" -> ToolDisplayType.TERMINAL
+            else -> ToolDisplayType.SIMPLE
+        }
+    }
+
+    private fun parseToolParameters(argumentsJson: String): Map<String, String> {
+        return try {
+            val json = kotlinx.serialization.json.Json { ignoreUnknownKeys = true }
+            val args = json.parseToJsonElement(argumentsJson)
+            val argsObj = args as? kotlinx.serialization.json.JsonObject ?: return emptyMap()
+
+            argsObj.entries.associate { (key, value) ->
+                key to when (value) {
+                    is kotlinx.serialization.json.JsonPrimitive -> value.content
+                    else -> value.toString()
+                }
+            }
+        } catch (e: Exception) {
+            logger.warn(e) { "Failed to parse tool arguments" }
+            emptyMap()
+        }
+    }
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/session/CoreSessionServiceFactory.kt b/core/src/main/kotlin/pl/jclab/refio/core/session/CoreSessionServiceFactory.kt
new file mode 100644
index 00000000..b9eb2d16
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/session/CoreSessionServiceFactory.kt
@@ -0,0 +1,98 @@
+package pl.jclab.refio.core.session
+
+import kotlinx.coroutines.CoroutineScope
+import kotlinx.coroutines.sync.Mutex
+import pl.jclab.refio.core.api.CoreApiRouter
+import pl.jclab.refio.core.api.UIAdapter
+import java.nio.file.Path
+import java.util.concurrent.CompletableFuture
+
+/**
+ * Wires up every collaborator needed by [CoreSessionService] so embedders (CLI, tests) don't
+ * have to duplicate the 7-class assembly that `SessionManager` does on the IntelliJ side.
+ *
+ * The plugin keeps its own wiring (it needs IntelliJ-specific adapters). Everyone else can
+ * call [create] with minimal arguments and get a ready-to-use [CoreSessionService].
+ */
+object CoreSessionServiceFactory {
+
+    fun create(
+        projectRouter: CoreApiRouter,
+        projectId: String,
+        projectPath: Path,
+        scope: CoroutineScope,
+        stateManager: SessionStateManager = SessionStateManager(),
+        vfsRefresher: VfsRefresher = VfsRefresher.NoOp,
+        uiAdapter: UIAdapter = NoopUIAdapter,
+        executionStateController: ExecutionStateController = NoopExecutionStateController,
+    ): CoreSessionService {
+        val modeSwitchMutex = Mutex()
+
+        val messageDispatcher = MessageDispatcher(
+            projectRouter = projectRouter,
+            stateManager = stateManager,
+        )
+
+        lateinit var subtaskTrackerRef: SubtaskTracker
+        lateinit var executionMonitorRef: ExecutionMonitor
+
+        val executionMonitor = ExecutionMonitor(
+            projectRouter = projectRouter,
+            stateManager = stateManager,
+            stepExecutionService = executionStateController,
+            scope = scope,
+            loadMessages = { messageDispatcher.loadMessages() },
+            loadSubtasks = { subtaskTrackerRef.loadSubtasks() },
+            prepareNextStep = { subtaskTrackerRef.prepareNextStep() },
+        )
+        executionMonitorRef = executionMonitor
+
+        val subtaskTracker = SubtaskTracker(
+            projectRouter = projectRouter,
+            stateManager = stateManager,
+            vfsRefresher = vfsRefresher,
+            loadMessages = { messageDispatcher.loadMessages() },
+            executeCurrentStep = { subtaskId -> executionMonitorRef.executeCurrentStep(subtaskId) },
+            showApprovalMessageForNextSubtask = { executionMonitorRef.showApprovalMessageForNextSubtask() },
+        )
+        subtaskTrackerRef = subtaskTracker
+
+        val lifecycleService = SessionLifecycleService(
+            projectRouter = projectRouter,
+            configService = projectRouter.configService,
+            stateManager = stateManager,
+            modeSwitchMutex = modeSwitchMutex,
+            projectId = projectId,
+            normalizedProjectPath = projectPath.toAbsolutePath().normalize().toString(),
+            scope = scope,
+        )
+        lifecycleService.initialize(messageDispatcher, subtaskTracker, executionMonitor)
+
+        return CoreSessionService(
+            projectRouter = projectRouter,
+            stateManager = stateManager,
+            subtaskTracker = subtaskTracker,
+            messageDispatcher = messageDispatcher,
+            lifecycleService = lifecycleService,
+            uiAdapter = uiAdapter,
+            scope = scope,
+            modeSwitchMutex = modeSwitchMutex,
+        )
+    }
+
+    object NoopExecutionStateController : ExecutionStateController {
+        override fun startInteractiveExecution(taskId: String) = Unit
+        override fun stopExecution() = Unit
+        override fun markComplete() = Unit
+    }
+
+    object NoopUIAdapter : UIAdapter {
+        override fun showMessage(message: String) = Unit
+        override fun showError(error: String) = Unit
+        override fun updateStatus(status: String) = Unit
+        override fun showProgress(title: String, fraction: Double) = Unit
+        override fun askQuestion(question: String): CompletableFuture<String> =
+            CompletableFuture.completedFuture("")
+        override fun log(level: String, message: String) = Unit
+    }
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/session/DefaultWorkflowStreamingListener.kt b/core/src/main/kotlin/pl/jclab/refio/core/session/DefaultWorkflowStreamingListener.kt
new file mode 100644
index 00000000..0db9d78c
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/session/DefaultWorkflowStreamingListener.kt
@@ -0,0 +1,112 @@
+package pl.jclab.refio.core.session
+
+import kotlinx.coroutines.CoroutineScope
+import kotlinx.coroutines.Dispatchers
+import kotlinx.coroutines.launch
+import pl.jclab.refio.api.models.Message
+import pl.jclab.refio.core.logging.dualLogger
+import pl.jclab.refio.core.workflow.WorkflowEventListener
+import java.util.UUID
+
+private val logger = dualLogger("DefaultWorkflowStreamingListener")
+
+/**
+ * Platform-agnostic [WorkflowEventListener] that materializes streaming output as
+ * a single [Message] in the session's [SessionStateManager], formatted per-intent
+ * (chat, plan JSON, subagent).
+ *
+ * Throttles UI updates to one per 500ms to avoid re-render storms.
+ * Used by both IntelliJ plugin and CLI TUI.
+ */
+open class DefaultWorkflowStreamingListener(
+    private val taskId: String,
+    private val stateManager: SessionStateManager,
+    private val scope: CoroutineScope,
+    private val streamingEnabled: Boolean,
+) : WorkflowEventListener {
+
+    private var messageId: String? = null
+    private var lastUiUpdate = 0L
+    private var formatter: ((String) -> String)? = null
+
+    override fun onChatStarted() {
+        startStreamingMessage("", "assistant") { it }
+    }
+
+    override fun onPlanningStarted() {
+        startStreamingMessage("Planning...", "assistant") { accumulated ->
+            "Planning...\n\n```json\n$accumulated\n```"
+        }
+    }
+
+    override fun onSubagentStarted(subagentName: String) {
+        startStreamingMessage("[$subagentName] ...", "assistant") { accumulated ->
+            "[$subagentName]\n\n$accumulated"
+        }
+    }
+
+    override fun onStreamChunk(chunk: String) {
+        if (!streamingEnabled) return
+        val currentId = messageId ?: return
+        val now = System.currentTimeMillis()
+        if (now - lastUiUpdate < 500L) return
+        lastUiUpdate = now
+
+        val format = formatter ?: { it }
+        scope.launch(Dispatchers.IO) {
+            stateManager.updateMessages { messages ->
+                messages.map { msg ->
+                    if (msg.id == currentId) {
+                        msg.copy(content = format(chunk), lastChunkAt = now)
+                    } else {
+                        msg
+                    }
+                }
+            }
+        }
+    }
+
+    override fun onStreamComplete(content: String) {
+        val currentId = messageId ?: return
+        val format = formatter ?: { it }
+        scope.launch(Dispatchers.IO) {
+            stateManager.updateMessages { messages ->
+                messages.map { msg ->
+                    if (msg.id == currentId) {
+                        msg.copy(
+                            content = format(content),
+                            isStreaming = false,
+                            lastChunkAt = System.currentTimeMillis(),
+                        )
+                    } else {
+                        msg
+                    }
+                }
+            }
+        }
+    }
+
+    private fun startStreamingMessage(
+        initialContent: String,
+        role: String,
+        format: (String) -> String,
+    ) {
+        formatter = format
+
+        val id = UUID.randomUUID().toString()
+        messageId = id
+
+        val message = Message(
+            id = id,
+            taskId = taskId,
+            role = role,
+            content = initialContent,
+            isStreaming = streamingEnabled,
+            createdAt = System.currentTimeMillis(),
+        )
+
+        scope.launch(Dispatchers.IO) {
+            stateManager.appendMessage(message)
+        }
+    }
+}
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/session/ExecutionMonitor.kt b/core/src/main/kotlin/pl/jclab/refio/core/session/ExecutionMonitor.kt
similarity index 98%
rename from intellij-plugin/src/main/kotlin/pl/jclab/refio/services/session/ExecutionMonitor.kt
rename to core/src/main/kotlin/pl/jclab/refio/core/session/ExecutionMonitor.kt
index abfb7dba..7df343e0 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/session/ExecutionMonitor.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/session/ExecutionMonitor.kt
@@ -1,6 +1,6 @@
-package pl.jclab.refio.services.session
+package pl.jclab.refio.core.session
 
-import com.intellij.openapi.project.Project
+import pl.jclab.refio.core.session.SessionStateManager
 import kotlinx.coroutines.CancellationException
 import kotlinx.coroutines.CoroutineScope
 import kotlinx.coroutines.Dispatchers
@@ -17,15 +17,14 @@ import pl.jclab.refio.core.services.execution.unified.StepPlan
 import pl.jclab.refio.core.services.execution.unified.StepResult
 import pl.jclab.refio.core.services.monitoring.GlobalMetrics
 import pl.jclab.refio.core.services.monitoring.OperationInfo
-import pl.jclab.refio.services.execution.StepExecutionService
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.session.ExecutionStateController
+import pl.jclab.refio.core.logging.dualLogger
 import java.util.UUID
 
 class ExecutionMonitor(
-    private val project: Project,
     private val projectRouter: CoreApiRouter,
     private val stateManager: SessionStateManager,
-    private val stepExecutionService: StepExecutionService,
+    private val stepExecutionService: ExecutionStateController,
     private val scope: CoroutineScope,
     private val loadMessages: suspend () -> Unit,
     private val loadSubtasks: suspend () -> Unit,
@@ -324,7 +323,7 @@ class ExecutionMonitor(
     private fun isStreamingEnabled(): Boolean {
         return try {
             val streamingConfig = projectRouter.configService.get(
-                key = pl.jclab.refio.core.services.ConfigService.KEY_STREAMING_ENABLED,
+                key = pl.jclab.refio.core.config.ConfigKeys.STREAMING_ENABLED.key,
                 scope = pl.jclab.refio.core.db.ConfigScope.APP
             )
             streamingConfig?.toBoolean() ?: true
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/session/ExecutionStateController.kt b/core/src/main/kotlin/pl/jclab/refio/core/session/ExecutionStateController.kt
new file mode 100644
index 00000000..67f3c3e6
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/session/ExecutionStateController.kt
@@ -0,0 +1,13 @@
+package pl.jclab.refio.core.session
+
+/**
+ * Port for managing step execution lifecycle state.
+ *
+ * IntelliJ plugin implements this via `StepExecutionService` (project-level IDE service).
+ * CLI can provide a simple stub implementation when running without execution state UI.
+ */
+interface ExecutionStateController {
+    fun startInteractiveExecution(taskId: String)
+    fun stopExecution()
+    fun markComplete()
+}
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/session/IncrementalToolCallStreamFilter.kt b/core/src/main/kotlin/pl/jclab/refio/core/session/IncrementalToolCallStreamFilter.kt
similarity index 93%
rename from intellij-plugin/src/main/kotlin/pl/jclab/refio/services/session/IncrementalToolCallStreamFilter.kt
rename to core/src/main/kotlin/pl/jclab/refio/core/session/IncrementalToolCallStreamFilter.kt
index f80eab6b..490f7020 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/session/IncrementalToolCallStreamFilter.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/session/IncrementalToolCallStreamFilter.kt
@@ -1,6 +1,6 @@
-package pl.jclab.refio.services.session
+package pl.jclab.refio.core.session
 
-internal class IncrementalToolCallStreamFilter(
+class IncrementalToolCallStreamFilter(
     private val tailSize: Int = 128
 ) {
     private var filteredAccumulated = ""
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/session/MessageDispatcher.kt b/core/src/main/kotlin/pl/jclab/refio/core/session/MessageDispatcher.kt
similarity index 97%
rename from intellij-plugin/src/main/kotlin/pl/jclab/refio/services/session/MessageDispatcher.kt
rename to core/src/main/kotlin/pl/jclab/refio/core/session/MessageDispatcher.kt
index 9ac2e9d5..7d05cb76 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/session/MessageDispatcher.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/session/MessageDispatcher.kt
@@ -1,13 +1,14 @@
-package pl.jclab.refio.services.session
+package pl.jclab.refio.core.session
 
 import pl.jclab.refio.api.models.Message
+import pl.jclab.refio.core.session.SessionStateManager
 import pl.jclab.refio.api.models.ToolCallDisplayInfo
 import pl.jclab.refio.api.models.ToolCallResult
 import pl.jclab.refio.api.models.ToolCallStatus
 import pl.jclab.refio.api.models.ToolDisplayType
 import pl.jclab.refio.core.api.CoreApiRouter
 import pl.jclab.refio.core.db.ToolCallData
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import java.util.UUID
 
 class MessageDispatcher(
@@ -29,9 +30,10 @@ class MessageDispatcher(
             logger.info { "[MESSAGES] loadMessages response: taskId=${currentSession.id}, count=${response.count}" }
             val toolResultPathByToolCallId = response.messages
                 .mapNotNull { coreMsg ->
-                    if (coreMsg.role != "tool" || coreMsg.toolCallId.isNullOrBlank()) return@mapNotNull null
+                    val toolCallId = coreMsg.toolCallId
+                    if (coreMsg.role != "tool" || toolCallId.isNullOrBlank()) return@mapNotNull null
                     val path = extractPathFromMetadata(coreMsg.metadata) ?: return@mapNotNull null
-                    coreMsg.toolCallId!! to path
+                    toolCallId to path
                 }
                 .toMap()
 
@@ -78,7 +80,7 @@ class MessageDispatcher(
                 // Diagnostic log for action envelopes generated by AgentTurnLoop.
                 if (hasActions) {
                     logger.info {
-                        "[PLAN_DEBUG] Message with actions: isPlan=${assistantEnvelope?.isPlanJson == true}, " +
+                        "[PLAN_DEBUG] Message with actions: isPlan=${assistantEnvelope.isPlanJson}, " +
                             "hasToolCalls=${!coreMsg.toolCallsJson.isNullOrBlank()}, " +
                             "toolCallInfo=$toolCallInfo, " +
                             "contentPreview=${coreMsg.content.take(100)}"
@@ -159,8 +161,8 @@ class MessageDispatcher(
                     toolCallId = coreMsg.toolCallId, // For TOOL messages - link to tool call
                     toolCallInfo = toolCallInfo,
                     toolStreamContent = toolDisplay.toolStreamContent,
-                    agentName = (coreMsg as? pl.jclab.refio.core.api.MessageResponse)?.agentName,
-                    agentDepth = (coreMsg as? pl.jclab.refio.core.api.MessageResponse)?.agentDepth
+                    agentName = coreMsg.agentName,
+                    agentDepth = coreMsg.agentDepth
                 )
 
                 // Skip tool result messages whose content is already inlined into the tool call bubble
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/session/PromptStateTracker.kt b/core/src/main/kotlin/pl/jclab/refio/core/session/PromptStateTracker.kt
similarity index 92%
rename from intellij-plugin/src/main/kotlin/pl/jclab/refio/services/session/PromptStateTracker.kt
rename to core/src/main/kotlin/pl/jclab/refio/core/session/PromptStateTracker.kt
index 3a25ef4a..700e1bb4 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/session/PromptStateTracker.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/session/PromptStateTracker.kt
@@ -1,8 +1,8 @@
-package pl.jclab.refio.services.session
+package pl.jclab.refio.core.session
 
 import pl.jclab.refio.api.models.ContextReference
 import pl.jclab.refio.core.api.ContextSectionTokenInfo
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 
 class PromptStateTracker(
     private val stateManager: SessionStateManager
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/session/SessionLifecycleService.kt b/core/src/main/kotlin/pl/jclab/refio/core/session/SessionLifecycleService.kt
similarity index 90%
rename from intellij-plugin/src/main/kotlin/pl/jclab/refio/services/session/SessionLifecycleService.kt
rename to core/src/main/kotlin/pl/jclab/refio/core/session/SessionLifecycleService.kt
index a45eafce..016b4bd2 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/session/SessionLifecycleService.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/session/SessionLifecycleService.kt
@@ -1,13 +1,12 @@
-package pl.jclab.refio.services.session
+package pl.jclab.refio.core.session
 
-import com.intellij.openapi.project.Project
+import pl.jclab.refio.core.session.SessionStateManager
 import kotlinx.coroutines.CoroutineScope
 import kotlinx.coroutines.Dispatchers
-import kotlinx.coroutines.runBlocking
 import kotlinx.coroutines.sync.Mutex
 import kotlinx.coroutines.launch
+import kotlinx.coroutines.withContext
 import org.jetbrains.exposed.sql.transactions.transaction
-import pl.jclab.refio.api.CoreApiClient
 import pl.jclab.refio.api.models.ExecutionMode
 import pl.jclab.refio.api.models.Session
 import pl.jclab.refio.api.models.TaskMode
@@ -18,12 +17,10 @@ import pl.jclab.refio.core.api.UpdateTaskRequest
 import pl.jclab.refio.core.config.ConfigKeys
 import pl.jclab.refio.core.services.ConfigService
 import pl.jclab.refio.core.services.ConfigService.Companion.DEFAULT_CONTEXT_SIZE
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 
 class SessionLifecycleService(
-    private val project: Project,
     private val projectRouter: CoreApiRouter,
-    private val coreApiClient: CoreApiClient,
     private val configService: ConfigService,
     private val stateManager: SessionStateManager,
     private val modeSwitchMutex: Mutex,
@@ -43,17 +40,17 @@ class SessionLifecycleService(
         subtaskTracker: SubtaskTracker,
         _executionMonitor: ExecutionMonitor
     ) {
-        loadUIState()
-
         scope.launchSafe {
+            loadUIState()
+
             try {
                 val taskResponse = projectRouter.taskRouter.getLastSessionForProject(projectId)
                 if (taskResponse != null) {
                     logger.info { "Found last session for project: ${taskResponse.id}" }
 
-                    val executionModeStr = runBlocking(Dispatchers.IO) {
+                    val executionModeStr = withContext(Dispatchers.IO) {
                         transaction {
-                            configService.get(ConfigService.KEY_UI_EXECUTION_MODE)
+                            configService.get(ConfigKeys.UI_EXECUTION_MODE.key)
                         }
                     }
                     val executionMode = try {
@@ -399,7 +396,7 @@ class SessionLifecycleService(
         }
 
         try {
-            val selectedModel = coreApiClient.getConfigValue("ui", "selected_model")
+            val selectedModel = configService.get("ui.selected_model", ConfigScope.APP, null)
                 ?.takeIf { it.isNotBlank() }
                 ?: "auto"
 
@@ -439,10 +436,9 @@ class SessionLifecycleService(
         logger.info { "Thinking mode set to: $enabled" }
         val activeSession = stateManager.getActiveSession()
         if (activeSession != null) {
-            val session = activeSession
-            stateManager.setActiveSession(session.copy(thinkingEnabled = enabled))
+            stateManager.setActiveSession(activeSession.copy(thinkingEnabled = enabled))
         } else {
-            setUiSettingDefaults(ConfigService.KEY_UI_THINKING_ENABLED, enabled.toString())
+            setUiSettingDefaults(ConfigKeys.UI_THINKING_ENABLED.key, enabled.toString())
         }
         saveCurrentSessionState()
     }
@@ -452,10 +448,9 @@ class SessionLifecycleService(
         logger.info { "No-egress mode set to: $enabled" }
         val activeSession = stateManager.getActiveSession()
         if (activeSession != null) {
-            val session = activeSession
-            stateManager.setActiveSession(session.copy(noEgressEnabled = enabled))
+            stateManager.setActiveSession(activeSession.copy(noEgressEnabled = enabled))
         } else {
-            setUiSettingDefaults(ConfigService.KEY_UI_NO_EGRESS_ENABLED, enabled.toString())
+            setUiSettingDefaults(ConfigKeys.UI_NO_EGRESS_ENABLED.key, enabled.toString())
         }
         saveCurrentSessionState()
     }
@@ -463,13 +458,13 @@ class SessionLifecycleService(
     fun setMultiAgentEnabled(enabled: Boolean) {
         stateManager.setMultiAgentEnabled(enabled)
         logger.info { "Multi-agent mode set to: $enabled" }
-        setUiSettingDefaults(ConfigService.KEY_UI_ORCHESTRATION_ENABLED, enabled.toString())
+        setUiSettingDefaults(ConfigKeys.UI_ORCHESTRATION_ENABLED.key, enabled.toString())
     }
 
     fun setMultiAgentStrategy(strategy: pl.jclab.refio.api.models.MultiAgentStrategy) {
         stateManager.setMultiAgentStrategy(strategy)
         logger.info { "Multi-agent strategy set to: $strategy" }
-        setUiSettingDefaults(ConfigService.KEY_UI_MULTI_AGENT_STRATEGY, strategy.name)
+        setUiSettingDefaults(ConfigKeys.UI_MULTI_AGENT_STRATEGY.key, strategy.name)
     }
 
     suspend fun getAvailableModels(): List<String> {
@@ -556,47 +551,27 @@ class SessionLifecycleService(
 
         // Launch in background coroutine for non-blocking save
         scope.launch(Dispatchers.IO) {
-            persistSessionSettingsBlocking(session.id, settings)
+            persistSessionSettingsSuspending(session.id, settings)
         }
     }
 
     /**
-     * Synchronous version of persistSessionSettings for use in non-suspend contexts.
-     * Should be called within runBlocking or coroutine scope.
+     * Suspend version used from `scope.launch(Dispatchers.IO)`. The previous `runBlocking`
+     * was redundant — the launch context already dispatches on IO.
      */
-    private fun persistSessionSettingsBlocking(taskId: String, settings: SessionSettings) {
+    private suspend fun persistSessionSettingsSuspending(taskId: String, settings: SessionSettings) {
         try {
-            logger.debug { "Persisting session settings (blocking): taskId=$taskId" }
-            runBlocking(Dispatchers.IO) {
-                setUiSettingDefaults(
-                    ConfigService.KEY_UI_SELECTED_MODEL,
-                    settings.selectedModel ?: "auto",
-                )
-                setUiSettingDefaults(
-                    ConfigService.KEY_UI_THINKING_ENABLED,
-                    settings.thinkingEnabled.toString(),
-                )
-                setUiSettingDefaults(
-                    ConfigService.KEY_UI_NO_EGRESS_ENABLED,
-                    settings.noEgressEnabled.toString(),
-                )
-                setUiSettingDefaults(
-                    ConfigService.KEY_UI_EXECUTION_MODE,
-                    settings.executionMode.name,
-                )
-
-                configService.set(
-                    ConfigService.KEY_UI_SELECTED_MODE,
-                    selectedMode.name,
-                    ConfigScope.APP
-                )
-            }
+            logger.debug { "Persisting session settings: taskId=$taskId" }
+            // Caller already dispatches on Dispatchers.IO via scope.launch.
+            setUiSettingDefaults(ConfigKeys.UI_SELECTED_MODEL.key, settings.selectedModel ?: "auto")
+            setUiSettingDefaults(ConfigKeys.UI_THINKING_ENABLED.key, settings.thinkingEnabled.toString())
+            setUiSettingDefaults(ConfigKeys.UI_NO_EGRESS_ENABLED.key, settings.noEgressEnabled.toString())
+            setUiSettingDefaults(ConfigKeys.UI_EXECUTION_MODE.key, settings.executionMode.name)
+            configService.set(ConfigKeys.UI_SELECTED_MODE.key, selectedMode.name, ConfigScope.APP)
 
             projectRouter.taskRouter.updateTask(
                 taskId,
-                pl.jclab.refio.core.api.UpdateTaskRequest(
-                    uiState = settings.toJson()
-                )
+                pl.jclab.refio.core.api.UpdateTaskRequest(uiState = settings.toJson())
             )
         } catch (e: Exception) {
             if (isTaskNotFoundException(e)) {
@@ -607,10 +582,10 @@ class SessionLifecycleService(
         }
     }
 
-    private fun loadUIState() {
-        val modeStr = runBlocking(Dispatchers.IO) {
+    private suspend fun loadUIState() {
+        val modeStr = withContext(Dispatchers.IO) {
             transaction {
-                configService.get(ConfigService.KEY_UI_SELECTED_MODE)
+                configService.get(ConfigKeys.UI_SELECTED_MODE.key)
             }
         }
         selectedMode = runCatching { TaskMode.valueOf(modeStr ?: TaskMode.CHAT.name) }
@@ -659,21 +634,21 @@ class SessionLifecycleService(
         var hasAny = false
 
         val selectedModel = configService.get(
-            ConfigService.KEY_UI_SELECTED_MODEL,
+            ConfigKeys.UI_SELECTED_MODEL.key,
             scope,
             taskId = taskId,
             projectId = projectId
         )?.also { hasAny = true }
 
         val thinkingEnabled = configService.get(
-            ConfigService.KEY_UI_THINKING_ENABLED,
+            ConfigKeys.UI_THINKING_ENABLED.key,
             scope,
             taskId = taskId,
             projectId = projectId
         )?.also { hasAny = true }?.toBoolean()
 
         val noEgressEnabled = configService.get(
-            ConfigService.KEY_UI_NO_EGRESS_ENABLED,
+            ConfigKeys.UI_NO_EGRESS_ENABLED.key,
             scope,
             taskId = taskId,
             projectId = projectId
@@ -686,21 +661,21 @@ class SessionLifecycleService(
             }
 
         val executionModeValue = configService.get(
-            ConfigService.KEY_UI_EXECUTION_MODE,
+            ConfigKeys.UI_EXECUTION_MODE.key,
             scope,
             taskId = taskId,
             projectId = projectId
         )?.also { hasAny = true }
 
         val multiAgentEnabled = configService.get(
-            ConfigService.KEY_UI_ORCHESTRATION_ENABLED,
+            ConfigKeys.UI_ORCHESTRATION_ENABLED.key,
             scope,
             taskId = taskId,
             projectId = projectId
         )?.also { hasAny = true }?.toBoolean()
 
         @Suppress("UNUSED_VARIABLE") val _intentClassificationEnabled = configService.get(
-            ConfigService.KEY_UI_INTENT_CLASSIFICATION_ENABLED,
+            ConfigKeys.UI_INTENT_CLASSIFICATION_ENABLED.key,
             scope,
             taskId = taskId,
             projectId = projectId
@@ -711,7 +686,7 @@ class SessionLifecycleService(
         }
 
         val multiAgentStrategyValue = configService.get(
-            ConfigService.KEY_UI_MULTI_AGENT_STRATEGY,
+            ConfigKeys.UI_MULTI_AGENT_STRATEGY.key,
             scope,
             taskId = taskId,
             projectId = projectId
@@ -757,24 +732,24 @@ class SessionLifecycleService(
             // Use withContext instead of runBlocking to avoid blocking the coroutine
             kotlinx.coroutines.withContext(Dispatchers.IO) {
                 setUiSettingDefaults(
-                    ConfigService.KEY_UI_SELECTED_MODEL,
+                    ConfigKeys.UI_SELECTED_MODEL.key,
                     settings.selectedModel ?: "auto",
                 )
                 setUiSettingDefaults(
-                    ConfigService.KEY_UI_THINKING_ENABLED,
+                    ConfigKeys.UI_THINKING_ENABLED.key,
                     settings.thinkingEnabled.toString(),
                 )
                 setUiSettingDefaults(
-                    ConfigService.KEY_UI_NO_EGRESS_ENABLED,
+                    ConfigKeys.UI_NO_EGRESS_ENABLED.key,
                     settings.noEgressEnabled.toString(),
                 )
                 setUiSettingDefaults(
-                    ConfigService.KEY_UI_EXECUTION_MODE,
+                    ConfigKeys.UI_EXECUTION_MODE.key,
                     settings.executionMode.name,
                 )
 
                 configService.set(
-                    ConfigService.KEY_UI_SELECTED_MODE,
+                    ConfigKeys.UI_SELECTED_MODE.key,
                     selectedMode.name,
                     ConfigScope.APP
                 )
@@ -867,10 +842,10 @@ class SessionLifecycleService(
         return pl.jclab.refio.core.utils.GsonInstance.gson.toJson(payload)
     }
 
-    private fun loadExecutionModePreference(): ExecutionMode {
-        val executionModeStr = runBlocking(Dispatchers.IO) {
+    private suspend fun loadExecutionModePreference(): ExecutionMode {
+        val executionModeStr = withContext(Dispatchers.IO) {
             transaction {
-                configService.get(ConfigService.KEY_UI_EXECUTION_MODE)
+                configService.get(ConfigKeys.UI_EXECUTION_MODE.key)
             }
         }
         return parseExecutionMode(executionModeStr)
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/session/SessionStateManager.kt b/core/src/main/kotlin/pl/jclab/refio/core/session/SessionStateManager.kt
similarity index 89%
rename from intellij-plugin/src/main/kotlin/pl/jclab/refio/services/session/SessionStateManager.kt
rename to core/src/main/kotlin/pl/jclab/refio/core/session/SessionStateManager.kt
index 7b4903e1..81a028a0 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/session/SessionStateManager.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/session/SessionStateManager.kt
@@ -1,4 +1,4 @@
-package pl.jclab.refio.services.session
+package pl.jclab.refio.core.session
 
 import kotlinx.coroutines.flow.MutableStateFlow
 import kotlinx.coroutines.flow.StateFlow
@@ -8,12 +8,19 @@ import kotlinx.coroutines.sync.withLock
 import pl.jclab.refio.api.models.ContextReference
 import pl.jclab.refio.api.models.Message
 import pl.jclab.refio.api.models.Session
-import pl.jclab.refio.api.models.SubtaskDto
+import pl.jclab.refio.core.api.SubtaskResponse
 import pl.jclab.refio.core.api.ContextSectionTokenInfo
 import pl.jclab.refio.core.api.PlanResponse
 import pl.jclab.refio.core.api.PlanSpecStepResponse
-import pl.jclab.refio.services.logging.dualLogger
-
+import pl.jclab.refio.core.logging.dualLogger
+
+/**
+ * UI-agnostic session state holder.
+ *
+ * Trzyma 13 StateFlow reprezentujących **execution state** sesji (active session, history, mode,
+ * pending tools, subtasks). Żadne pole nie jest UI-specific — Plugin i TUI obserwują tę samą
+ * instancję. Przeniesione z `:intellij-plugin/services/session/` w Sprint 2 §2.
+ */
 class SessionStateManager {
 
     private val logger = dualLogger("SessionStateManager")
@@ -28,8 +35,8 @@ class SessionStateManager {
     private val _messages = MutableStateFlow<List<Message>>(emptyList())
     val messages: StateFlow<List<Message>> = _messages.asStateFlow()
 
-    private val _subtasks = MutableStateFlow<List<SubtaskDto>>(emptyList())
-    val subtasks: StateFlow<List<SubtaskDto>> = _subtasks.asStateFlow()
+    private val _subtasks = MutableStateFlow<List<SubtaskResponse>>(emptyList())
+    val subtasks: StateFlow<List<SubtaskResponse>> = _subtasks.asStateFlow()
 
     // Plan state (for PLAN mode sessions)
     private val _activePlan = MutableStateFlow<PlanResponse?>(null)
@@ -89,7 +96,7 @@ class SessionStateManager {
         _messages.value = emptyList()
     }
 
-    fun setSubtasks(subtasks: List<SubtaskDto>) {
+    fun setSubtasks(subtasks: List<SubtaskResponse>) {
         _subtasks.value = subtasks
     }
 
@@ -159,7 +166,7 @@ class SessionStateManager {
 
     fun getActiveSession(): Session? = _activeSession.value
 
-    fun getSubtasks(): List<SubtaskDto> = _subtasks.value
+    fun getSubtasks(): List<SubtaskResponse> = _subtasks.value
 
     fun getSelectedModel(): String = _selectedModel.value
 
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/session/SubtaskTracker.kt b/core/src/main/kotlin/pl/jclab/refio/core/session/SubtaskTracker.kt
similarity index 85%
rename from intellij-plugin/src/main/kotlin/pl/jclab/refio/services/session/SubtaskTracker.kt
rename to core/src/main/kotlin/pl/jclab/refio/core/session/SubtaskTracker.kt
index 8abd3638..627dee34 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/session/SubtaskTracker.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/session/SubtaskTracker.kt
@@ -1,21 +1,19 @@
-package pl.jclab.refio.services.session
+package pl.jclab.refio.core.session
 
-import com.intellij.openapi.project.Project
-import com.intellij.openapi.project.guessProjectDir
-import pl.jclab.refio.api.CoreApiClient
+import pl.jclab.refio.core.session.SessionStateManager
+import pl.jclab.refio.core.session.VfsRefresher
 import pl.jclab.refio.api.models.Message
-import pl.jclab.refio.api.models.SubtaskDto
+import pl.jclab.refio.core.api.SubtaskResponse
 import pl.jclab.refio.core.api.CoreApiRouter
 import pl.jclab.refio.core.api.ExecuteStepResponse
 import pl.jclab.refio.core.api.UpdateSubtaskRequest
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import java.util.UUID
 
 class SubtaskTracker(
-    private val project: Project,
     private val projectRouter: CoreApiRouter,
-    private val coreApiClient: CoreApiClient,
     private val stateManager: SessionStateManager,
+    private val vfsRefresher: VfsRefresher,
     private val loadMessages: suspend () -> Unit,
     private val executeCurrentStep: suspend (String) -> ExecuteStepResponse?,
     private val showApprovalMessageForNextSubtask: suspend () -> Unit
@@ -23,7 +21,7 @@ class SubtaskTracker(
 
     private val logger = dualLogger("SubtaskTracker")
 
-    fun updateSubtasks(subtasks: List<SubtaskDto>) {
+    fun updateSubtasks(subtasks: List<SubtaskResponse>) {
         stateManager.setSubtasks(subtasks)
         logger.debug { "Updated subtasks: ${subtasks.size} items" }
     }
@@ -36,38 +34,7 @@ class SubtaskTracker(
             val response = projectRouter.subtaskRouter.getSubtasks(currentSession.id)
             logger.info { "[SUBTASK] loadSubtasks response: taskId=${currentSession.id}, count=${response.subtasks.size}" }
 
-            val subtasks = response.subtasks.map { coreSubtask ->
-                SubtaskDto(
-                    id = coreSubtask.id,
-                    taskId = coreSubtask.taskId,
-                    orderIndex = coreSubtask.orderIndex,
-                    kind = coreSubtask.kind,
-                    status = coreSubtask.status,
-                    approvalStatus = coreSubtask.approvalStatus,
-                    requiresApproval = coreSubtask.requiresApproval,
-                    approvedByUser = coreSubtask.approvedByUser,
-                    description = coreSubtask.description,
-                    paramsJson = coreSubtask.paramsJson,
-                    stepPlanJson = coreSubtask.stepPlanJson,
-                    summary = coreSubtask.summary,
-                    result = coreSubtask.result,
-                    startedAt = coreSubtask.startedAt,
-                    finishedAt = coreSubtask.finishedAt,
-                    errorCode = coreSubtask.errorCode,
-                    errorMessage = coreSubtask.errorMessage,
-                    tokensIn = coreSubtask.tokensIn,
-                    tokensOut = coreSubtask.tokensOut,
-                    costUsd = coreSubtask.costUsd,
-                    latencyMs = coreSubtask.latencyMs,
-                    model = coreSubtask.model,
-                    provider = coreSubtask.provider,
-                    resultSummary = coreSubtask.resultSummary,
-                    createdAt = coreSubtask.createdAt,
-                    updatedAt = coreSubtask.updatedAt,
-                    completedAt = coreSubtask.completedAt
-                )
-            }
-
+            val subtasks = response.subtasks
             val currentSubtasks = stateManager.getSubtasks()
             if (!areSubtasksEqual(currentSubtasks, subtasks)) {
                 stateManager.setSubtasks(subtasks)
@@ -228,7 +195,7 @@ class SubtaskTracker(
 
             if (subtask.status == "PENDING") {
                 logger.info { "[SUBTASK] Preparing PENDING subtask: taskId=${currentSession.id}, subtaskId=$subtaskId" }
-                coreApiClient.prepareStep(currentSession.id, subtaskId)
+                projectRouter.agentRouter.planSubtaskStep(currentSession.id, subtaskId)
                 loadSubtasks()
             }
 
@@ -239,7 +206,7 @@ class SubtaskTracker(
                     "[SUBTASK] Subtask executed: taskId=${currentSession.id}, subtaskId=$subtaskId, " +
                         "status=${executeResponse.status}, durationMs=${executeResponse.durationMs}ms"
                 }
-                project.guessProjectDir()?.refresh(true, true)
+                vfsRefresher.refreshProjectRoot()
             } else {
                 logger.error { "Failed to execute subtask: $subtaskId" }
             }
@@ -345,7 +312,7 @@ class SubtaskTracker(
 
         logger.info { "[PREPARE] Found PENDING subtask: ${pendingSubtask.id} (${pendingSubtask.description})" }
 
-        val prepareResponse = coreApiClient.prepareStep(currentSession.id, pendingSubtask.id)
+        val prepareResponse = projectRouter.agentRouter.planSubtaskStep(currentSession.id, pendingSubtask.id)
         val tools = prepareResponse.tools.joinToString(", ") { it.name }
         logger.info {
             "[PREPARE] Prepared step: taskId=${currentSession.id}, subtaskId=${pendingSubtask.id}, tools=$tools"
@@ -355,7 +322,7 @@ class SubtaskTracker(
         return prepareResponse
     }
 
-    private fun areSubtasksEqual(current: List<SubtaskDto>, new: List<SubtaskDto>): Boolean {
+    private fun areSubtasksEqual(current: List<SubtaskResponse>, new: List<SubtaskResponse>): Boolean {
         if (current.size != new.size) return false
         return current.zip(new).all { (a, b) ->
             a.id == b.id &&
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/session/ToolCallContentSanitizer.kt b/core/src/main/kotlin/pl/jclab/refio/core/session/ToolCallContentSanitizer.kt
similarity index 92%
rename from intellij-plugin/src/main/kotlin/pl/jclab/refio/services/session/ToolCallContentSanitizer.kt
rename to core/src/main/kotlin/pl/jclab/refio/core/session/ToolCallContentSanitizer.kt
index 0a50c257..03cfc025 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/session/ToolCallContentSanitizer.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/session/ToolCallContentSanitizer.kt
@@ -1,4 +1,4 @@
-package pl.jclab.refio.services.session
+package pl.jclab.refio.core.session
 
 import com.google.gson.Gson
 import com.google.gson.reflect.TypeToken
@@ -7,8 +7,9 @@ import kotlinx.serialization.json.JsonArray
 import kotlinx.serialization.json.JsonObject
 import kotlinx.serialization.json.JsonPrimitive
 
-internal object ToolCallContentSanitizer {
+object ToolCallContentSanitizer {
     private val gson = Gson()
+    private val lenientJson = Json { ignoreUnknownKeys = true }
     private val toolCallPatterns = listOf(
         Regex("""(?:\r?\n)?TOOL_CALL:\s*\w+\s*(?:\r?\n)?ARGUMENTS:\s*\{[\s\S]*?\}(?:\r?\n)?""", RegexOption.MULTILINE),
         Regex("""(?:\r?\n)?Tool calls:\s*(?:\r?\n)?TOOL_CALL:[\s\S]*?(?:\r?\n){2,}|(?:\r?\n)?Tool calls:\s*(?:\r?\n)?TOOL_CALL:[\s\S]*$""", RegexOption.MULTILINE),
@@ -44,7 +45,7 @@ internal object ToolCallContentSanitizer {
         if (!trimmed.startsWith("{") || !trimmed.endsWith("}")) return null
 
         return try {
-            val root = Json { ignoreUnknownKeys = true }.parseToJsonElement(trimmed) as? JsonObject ?: return null
+            val root = lenientJson.parseToJsonElement(trimmed) as? JsonObject ?: return null
 
             if (root.containsKey("plan") || root.containsKey("subtasks")) {
                 return content
@@ -86,7 +87,7 @@ internal object ToolCallContentSanitizer {
         if (!trimmed.startsWith("{") || !trimmed.endsWith("}")) return text
 
         return try {
-            val inner = Json { ignoreUnknownKeys = true }.parseToJsonElement(trimmed) as? JsonObject ?: return text
+            val inner = lenientJson.parseToJsonElement(trimmed) as? JsonObject ?: return text
             val payloadKeys = listOf("answer", "content", "response", "result", "output", "text")
             for (key in payloadKeys) {
                 val field = inner[key]
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/session/ToolMessageDisplayResolver.kt b/core/src/main/kotlin/pl/jclab/refio/core/session/ToolMessageDisplayResolver.kt
similarity index 80%
rename from intellij-plugin/src/main/kotlin/pl/jclab/refio/services/session/ToolMessageDisplayResolver.kt
rename to core/src/main/kotlin/pl/jclab/refio/core/session/ToolMessageDisplayResolver.kt
index 955b76ff..d442c341 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/session/ToolMessageDisplayResolver.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/session/ToolMessageDisplayResolver.kt
@@ -1,11 +1,11 @@
-package pl.jclab.refio.services.session
+package pl.jclab.refio.core.session
 
-internal data class ToolMessageDisplay(
+data class ToolMessageDisplay(
     val content: String,
     val toolStreamContent: String?
 )
 
-internal object ToolMessageDisplayResolver {
+object ToolMessageDisplayResolver {
     fun resolve(
         role: String,
         content: String,
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/session/VfsRefresher.kt b/core/src/main/kotlin/pl/jclab/refio/core/session/VfsRefresher.kt
new file mode 100644
index 00000000..be4ef78c
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/session/VfsRefresher.kt
@@ -0,0 +1,20 @@
+package pl.jclab.refio.core.session
+
+/**
+ * Port do odświeżania widoku plików w konsumentach Core.
+ *
+ * Core edytuje pliki na dysku przez ToolRegistry. IntelliJ trzyma własny VFS cache — po edycji
+ * trzeba zasygnalizować refresh, inaczej IDE widzi stare pliki. TUI nie ma takiego cache'u.
+ *
+ * Plugin wpina `IntelliJVfsRefresher` (woła `SafeVfsAccess.refreshProjectRoot`),
+ * TUI/Desktop bez cache'u używają [NoOp].
+ */
+interface VfsRefresher {
+
+    /** Odśwież cały project root. */
+    fun refreshProjectRoot()
+
+    object NoOp : VfsRefresher {
+        override fun refreshProjectRoot() { /* UI bez VFS cache'u nie potrzebuje refreshu */ }
+    }
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/subagents/BuiltinSubagentOverrides.kt b/core/src/main/kotlin/pl/jclab/refio/core/subagents/BuiltinSubagentOverrides.kt
new file mode 100644
index 00000000..b99051f5
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/subagents/BuiltinSubagentOverrides.kt
@@ -0,0 +1,45 @@
+package pl.jclab.refio.core.subagents
+
+import pl.jclab.refio.core.config.ConfigKeys
+import pl.jclab.refio.core.db.ConfigScope
+import pl.jclab.refio.core.db.repositories.ConfigRepository
+import pl.jclab.refio.core.utils.GsonInstance.gson
+
+/**
+ * Persistence for per-name enable/disable overrides of built-in subagents.
+ *
+ * Stored as a single JSON blob under [ConfigKeys.SUBAGENTS_BUILTIN_ENABLED] in APP scope.
+ */
+class BuiltinSubagentOverrides(
+    private val configRepository: ConfigRepository,
+    private val invalidate: (String) -> Unit,
+) {
+    private val key: String get() = ConfigKeys.SUBAGENTS_BUILTIN_ENABLED.key
+
+    fun getAll(): Map<String, Boolean> {
+        val config = configRepository.get(key, ConfigScope.APP) ?: return emptyMap()
+        val raw = gson.fromJson(config.value, Map::class.java) ?: return emptyMap()
+        return raw.mapNotNull { (rawKey, rawValue) ->
+            val name = rawKey as? String ?: return@mapNotNull null
+            val enabled = when (rawValue) {
+                is Boolean -> rawValue
+                is String -> rawValue.toBoolean()
+                else -> null
+            } ?: return@mapNotNull null
+            name to enabled
+        }.toMap()
+    }
+
+    fun setOverride(name: String, enabled: Boolean) {
+        val current = getAll().toMutableMap()
+        current[name.lowercase()] = enabled
+        configRepository.set(
+            key = key,
+            value = gson.toJson(current),
+            scope = ConfigScope.APP,
+            taskId = null,
+            description = "Builtin subagent enabled overrides",
+        )
+        invalidate(key)
+    }
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/subagents/SubagentParser.kt b/core/src/main/kotlin/pl/jclab/refio/core/subagents/SubagentParser.kt
index 142c5389..62b86fd2 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/subagents/SubagentParser.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/subagents/SubagentParser.kt
@@ -211,7 +211,6 @@ class SubagentParser {
             includeFileTree = (map["include_file_tree"] as? Boolean) ?: true,
             includeConversation = (map["include_conversation"] as? Boolean) ?: true,
             includeWorkingMemory = (map["include_working_memory"] as? Boolean) ?: true,
-            includeRag = (map["include_rag"] as? Boolean) ?: true,
             includeDependencies = (map["include_dependencies"] as? Boolean) ?: true,
             maxContextTokens = parseIntSafe(map["max_context_tokens"]),
             includeParentSummary = (map["include_parent_summary"] as? Boolean) ?: false
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/subagents/SubagentRouter.kt b/core/src/main/kotlin/pl/jclab/refio/core/subagents/SubagentRouter.kt
index e7c51b63..994cc788 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/subagents/SubagentRouter.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/subagents/SubagentRouter.kt
@@ -40,7 +40,6 @@ class SubagentRouter(
     private val toolPermissionsService: ToolPermissionsService,
     private val chatMessageRepository: ChatMessageRepository,
     private val contextService: pl.jclab.refio.core.services.ContextService?,
-    private val ideProject: Any?,
     private val runTurnCallback: (suspend (TurnRequest, pl.jclab.refio.core.api.StreamCallback?) -> TurnResult)? = null
 ) : Router {
     private val invocationRegex = Regex("^!([a-zA-Z0-9_-]+)(?:\\s+([\\s\\S]+))?$")
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/subagents/models/SubagentDefinition.kt b/core/src/main/kotlin/pl/jclab/refio/core/subagents/models/SubagentDefinition.kt
index 04721ee1..a249d6a2 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/subagents/models/SubagentDefinition.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/subagents/models/SubagentDefinition.kt
@@ -21,7 +21,6 @@ data class SubagentContextProfile(
     val includeFileTree: Boolean = true,
     val includeConversation: Boolean = true,
     val includeWorkingMemory: Boolean = true,
-    val includeRag: Boolean = true,
     val includeDependencies: Boolean = true,
     val maxContextTokens: Int? = null,
     val includeParentSummary: Boolean = false
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/tools/base/FileTool.kt b/core/src/main/kotlin/pl/jclab/refio/core/tools/base/FileTool.kt
new file mode 100644
index 00000000..9906ce46
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/tools/base/FileTool.kt
@@ -0,0 +1,57 @@
+package pl.jclab.refio.core.tools.base
+
+import pl.jclab.refio.core.tools.FileLockManager
+import pl.jclab.refio.core.tools.PathSandbox
+import pl.jclab.refio.core.tools.normalizePath
+import java.nio.file.Path
+
+/**
+ * Abstract base dla narzędzi operujących na plikach w sandboxie.
+ *
+ * Wydziela 3 wzorce powtarzane przez ~5 edytorów plików (CodeEditingTool, AdvanceCodeEditingTool,
+ * MultiLineEditorTool, MultiEditTool, CreateNewFileTool):
+ *
+ * - [validatePathParam] — walidacja parametru `path`; throws IllegalArgumentException.
+ * - [resolveSandboxPath] — normalizacja + sandbox resolve (bez revalidate — to jest wewnątrz locka).
+ * - [withLockedFile] — file lock + `revalidateBeforeIO` (zamyka TOCTOU window).
+ *
+ * Nie próbujemy unifikować samej logiki edycji (search/replace vs LLM-generated vs batch) —
+ * te ścieżki są genuinely różne, więc każdy edytor dalej implementuje `execute` po swojemu.
+ */
+abstract class FileTool(protected val sandbox: PathSandbox) : Tool {
+
+    /**
+     * Zwraca string `path` z parametrów lub throw `IllegalArgumentException` gdy brakuje/pusty.
+     */
+    protected fun validatePathParam(params: Map<String, Any>): String {
+        val path = params["path"] as? String
+        if (path.isNullOrBlank()) {
+            throw IllegalArgumentException("Parameter 'path' is required and cannot be empty")
+        }
+        return path
+    }
+
+    /**
+     * Normalizuje `pathStr` (backslash → slash, bare filename → "./file") i rozwiązuje w sandboxie.
+     *
+     * **Nie** wykonuje `revalidateBeforeIO` — to jest obowiązek wywołania [withLockedFile]
+     * żeby zamknąć TOCTOU window (symlink swap między validate a I/O).
+     */
+    protected fun resolveSandboxPath(pathStr: String): Path {
+        val normalized = normalizePath(pathStr)
+        return sandbox.resolve(normalized)
+    }
+
+    /**
+     * Wykonuje [block] z plik-level lockiem i re-validacją sandboxa wewnątrz locka.
+     *
+     * Zamyka TOCTOU window: między `sandbox.resolve` a operacją I/O ktoś mógłby wstawić symlink
+     * poza sandbox. `revalidateBeforeIO` pod mutexem sprawdza ponownie.
+     */
+    protected suspend fun <T> withLockedFile(path: Path, block: suspend () -> T): T {
+        return FileLockManager.withFileLock(path.toAbsolutePath().toString()) {
+            sandbox.revalidateBeforeIO(path)
+            block()
+        }
+    }
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/tools/base/Tool.kt b/core/src/main/kotlin/pl/jclab/refio/core/tools/base/Tool.kt
index 0db24131..097ad99a 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/tools/base/Tool.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/tools/base/Tool.kt
@@ -38,6 +38,16 @@ interface Tool {
     val origin: ToolOrigin
         get() = ToolOrigin.BUILTIN
 
+    /**
+     * One-line hint used to build the dynamic tool-selection matrix in system prompts.
+     * Acts as a "row" in the When-to-use-what table: a short phrase describing when to
+     * pick this tool over alternatives. Null means the tool is omitted from the matrix.
+     *
+     * Keep it under ~140 chars. Example: "Small new files (configs, stubs). For >50 lines prefer advance_code_editing."
+     */
+    val selectionHint: String?
+        get() = null
+
     /**
      * Execute the tool with given parameters.
      *
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/tools/base/ToolFactory.kt b/core/src/main/kotlin/pl/jclab/refio/core/tools/base/ToolFactory.kt
index e9e1f678..1966d946 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/tools/base/ToolFactory.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/tools/base/ToolFactory.kt
@@ -2,13 +2,13 @@ package pl.jclab.refio.core.tools.base
 
 import pl.jclab.refio.core.llm.LLMClient
 import pl.jclab.refio.core.services.ConfigService
+import pl.jclab.refio.core.services.ProcessManager
 import pl.jclab.refio.core.services.PromptsService
+import pl.jclab.refio.core.services.turn.UserQuestionService
 import pl.jclab.refio.core.tools.PathSandbox
 import pl.jclab.refio.core.tools.implementations.*
-import pl.jclab.refio.core.tools.security.CommandDenylist
 import pl.jclab.refio.core.tools.security.CommandLimits
 import pl.jclab.refio.core.tools.security.CommandRuleDefaults
-import pl.jclab.refio.core.tools.security.CommandWhitelist
 import pl.jclab.refio.core.tools.security.FileLimits
 import pl.jclab.refio.core.logging.dualLogger
 import java.nio.file.Path
@@ -30,9 +30,9 @@ class ToolFactory(
     private val configService: ConfigService,
     private val promptsService: PromptsService,
     private val taskRepository: pl.jclab.refio.core.db.repositories.TaskRepository,
+    private val userQuestionService: UserQuestionService = UserQuestionService(),
     private val fileLimits: FileLimits = FileLimits.DEFAULT,
-    private val commandLimits: CommandLimits = CommandLimits.DEFAULT,
-    private val commandDenylist: CommandDenylist = CommandDenylist.DEFAULT
+    private val commandLimits: CommandLimits = CommandLimits.DEFAULT
 ) {
     init {
         logger.info { "ToolFactory initializing with projectRoot=$projectRoot (absolute=${projectRoot.toAbsolutePath()})" }
@@ -40,6 +40,7 @@ class ToolFactory(
 
     private val sandbox = PathSandbox.withConfig(projectRoot, configService)
     private val registry = toolRegistry
+    private val processManager = ProcessManager()
 
     /**
      * Create and register all available tools
@@ -90,7 +91,23 @@ class ToolFactory(
 
             // Reasoning slot (no-op, gives the model an explicit place to think
             // between tool calls — used as a loop-breaker and pre-action checkpoint)
-            ThinkTool()
+            ThinkTool(),
+
+            // Web tools
+            WebSearchTool(configService),
+            FetchWebpageTool(llmClient, configService),
+
+            // Code intelligence
+            CodeIntelligenceTool(sandbox),
+
+            // Process monitoring (read-only — only reads output)
+            MonitorProcessTool(processManager),
+
+            // User interaction
+            AskUserTool(userQuestionService),
+
+            // Utilities
+            SleepTool()
         )
     }
 
@@ -100,9 +117,6 @@ class ToolFactory(
      * @return List of write tool instances
      */
     fun createWriteTools(): List<Tool> {
-        val whitelistConfig = configService.getTerminalWhitelistConfig()
-        val whitelist = CommandWhitelist(whitelistConfig, commandDenylist)
-
         return listOf(
             // File operations (write)
             CreateNewFileTool(sandbox, fileLimits),
@@ -114,7 +128,7 @@ class ToolFactory(
             MultiEditTool(sandbox, fileLimits),
 
             // Terminal operations
-            RunTerminalCommandTool(sandbox, whitelist, commandLimits, CommandRuleDefaults.createDefaultMatcher()),
+            RunTerminalCommandTool(sandbox, commandLimits, CommandRuleDefaults.createDefaultMatcher()),
 
             // Network operations
             HttpRequestTool(sandbox),
@@ -123,7 +137,10 @@ class ToolFactory(
             RunCodeTool(sandbox),
 
             // LLM call (raw single-turn call, no agent loop)
-            LlmCallTool(llmClient, configService, sandbox, fileLimits)
+            LlmCallTool(llmClient, configService, sandbox, fileLimits),
+
+            // Background process execution
+            RunProcessBackgroundTool(sandbox, processManager, CommandRuleDefaults.createDefaultMatcher())
         )
     }
 }
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/tools/base/ToolRegistry.kt b/core/src/main/kotlin/pl/jclab/refio/core/tools/base/ToolRegistry.kt
index f6d7b1dd..14f5f03d 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/tools/base/ToolRegistry.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/tools/base/ToolRegistry.kt
@@ -21,24 +21,44 @@ class ToolRegistry {
     private val tools = ConcurrentHashMap<String, Tool>()
 
     /**
-     * Canonical tool ordering: READ → WRITE (cheapest first) → EXECUTE → DELEGATE.
-     * Tools not listed here appear at the end (e.g. MCP tools).
+     * Named tool group for prompt section headers.
      */
-    private val toolOrder = listOf(
-        // READ — orientation & analysis
-        "read_file", "read_directory", "file_search", "grep_search", "view_diff",
-        // SYSTEM — planning, memory, reasoning & agent management
-        "think", "tasks", "memory", "manage_subagent",
-        // WRITE — cheapest first
-        "code_editing", "multi_edit", "create_new_file", "multi_line_editor", "advance_code_editing",
-        // EXECUTE — verification & data
-        "run_terminal_command", "run_code", "http_request",
-        // COMMUNICATE
-        "send_message",
-        // DELEGATE — only when the agent cannot handle the task itself
-        "delegate_to_strong_model", "invoke_subagent"
+    data class ToolGroup(val name: String, val tools: List<String>)
+
+    /**
+     * Logical tool groups for LLM prompt rendering.
+     * Each group gets a section header so the LLM sees related tools together.
+     * Tools not in any group appear at the end under "Other".
+     */
+    val toolGroups = listOf(
+        ToolGroup("Reading & Search", listOf(
+            "read_file", "read_directory", "file_search", "grep_search", "view_diff",
+            "code_intelligence", "rag_search"
+        )),
+        ToolGroup("Web & HTTP", listOf(
+            "web_search", "fetch_webpage", "http_request"
+        )),
+        ToolGroup("System", listOf(
+            "think", "tasks", "memory", "manage_subagent", "ask_user", "sleep"
+        )),
+        ToolGroup("Editing", listOf(
+            "code_editing", "multi_edit", "create_new_file", "multi_line_editor", "advance_code_editing"
+        )),
+        ToolGroup("Execution", listOf(
+            "run_terminal_command", "run_code",
+            "run_process_background", "monitor_process"
+        )),
+        ToolGroup("Delegation", listOf(
+            "send_message", "delegate_to_strong_model", "invoke_subagent", "llm_call"
+        ))
     )
 
+    /**
+     * Canonical tool ordering derived from toolGroups.
+     * Tools not listed here appear at the end (e.g. MCP tools).
+     */
+    private val toolOrder = toolGroups.flatMap { it.tools }
+
     /**
      * Register a tool
      *
@@ -74,6 +94,31 @@ class ToolRegistry {
         return sortByCanonicalOrder(tools.values)
     }
 
+    /**
+     * Get tools organized by groups, filtering to only include registered & available tools.
+     * Returns pairs of (groupName, toolsInGroup). Ungrouped tools go into "Other".
+     */
+    fun getToolsByGroups(toolList: List<Tool>): List<Pair<String, List<Tool>>> {
+        val byName = toolList.associateBy { it.name }
+        val result = mutableListOf<Pair<String, List<Tool>>>()
+        val grouped = mutableSetOf<String>()
+
+        for (group in toolGroups) {
+            val groupTools = group.tools.mapNotNull { byName[it] }
+            if (groupTools.isNotEmpty()) {
+                result.add(group.name to groupTools)
+                grouped.addAll(groupTools.map { it.name })
+            }
+        }
+
+        val ungrouped = toolList.filter { it.name !in grouped }
+        if (ungrouped.isNotEmpty()) {
+            result.add("Other" to ungrouped)
+        }
+
+        return result
+    }
+
     private fun sortByCanonicalOrder(values: Collection<Tool>): List<Tool> {
         val byName = values.associateBy { it.name }
         val ordered = toolOrder.mapNotNull { byName[it] }
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/AdvanceCodeEditingTool.kt b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/AdvanceCodeEditingTool.kt
index c47c1d6a..068e63bd 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/AdvanceCodeEditingTool.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/AdvanceCodeEditingTool.kt
@@ -12,13 +12,11 @@ import pl.jclab.refio.core.services.ConfigService
 import pl.jclab.refio.core.services.PromptsService
 import pl.jclab.refio.core.db.repositories.TaskRepository
 import pl.jclab.refio.core.tools.DiffUtils
-import pl.jclab.refio.core.tools.FileLockManager
 import pl.jclab.refio.core.tools.PathSandbox
-import pl.jclab.refio.core.tools.base.Tool
+import pl.jclab.refio.core.tools.base.FileTool
 import pl.jclab.refio.core.tools.base.ToolCategory
 import pl.jclab.refio.core.tools.base.ToolMode
 import pl.jclab.refio.core.tools.base.ToolResult
-import pl.jclab.refio.core.tools.normalizePath
 import pl.jclab.refio.core.tools.security.FileLimits
 import pl.jclab.refio.core.logging.dualLogger
 import java.nio.file.Files
@@ -51,39 +49,32 @@ private val logger = dualLogger("AdvanceCodeEditingTool")
  * Based on: docs/0017-advance-code.md (ADR 0017)
  */
 class AdvanceCodeEditingTool(
-    private val sandbox: PathSandbox,
+    sandbox: PathSandbox,
     private val limits: FileLimits,
     private val llmClient: LLMClient,
     private val configService: ConfigService,
     private val promptsService: PromptsService,
     private val taskRepository: TaskRepository
-) : Tool {
+) : FileTool(sandbox) {
 
     override val name = "advance_code_editing"
     override val description =
-        "LLM-assisted FULL FILE regeneration. EXPENSIVE (~\$0.06) — regenerates the entire file from scratch every call. " +
-        "DO NOT use this for small fixes (1–3 line changes, fixing a single function, renaming, adding a missing branch). " +
-        "Full regeneration frequently introduces NEW bugs in untouched code regions: the LLM rewrites the whole file " +
-        "from a description, drops invariants, changes function signatures inconsistently, and silently breaks call sites. " +
-        "Reserve this tool for: (a) creating brand new code files from scratch, " +
-        "(b) rewrites that would touch >50% of the file, " +
-        "(c) cases where surgical find/replace is impractical because the file is structurally broken. " +
-        "FOR EVERYTHING ELSE use code_editing (FREE, exact find/replace) for known strings, " +
-        "or multi_line_editor (CHEAP ~\$0.02) for 2–10 targeted semantic edits. " +
-        "Calling advance_code_editing twice in a row on the same file is almost always wrong — if the previous " +
-        "regeneration produced a bug, fix THAT bug surgically with code_editing instead of regenerating again."
+        "LLM-assisted FULL FILE generation/regeneration. " +
+        "PREFERRED for: (a) creating new files >50 lines from scratch (HTML pages, classes, scripts), " +
+        "(b) rewrites covering >50% of an existing file, (c) structurally broken files. " +
+        "Generates content via a dedicated LLM call so the agent's own response stays small — " +
+        "avoid stuffing large `content` payloads into `create_new_file`, which inflates the agent " +
+        "response and risks streaming timeouts. " +
+        "For small targeted edits prefer code_editing or multi_line_editor."
     override val mode = ToolMode.WRITE
     override val category = ToolCategory.FILE_PRODUCING
+    override val selectionHint =
+        "Full-file generation or >50% rewrites (new HTML/classes/scripts, structurally broken files). " +
+        "LLM generates the content so your agent response stays small — use instead of create_new_file for large files."
 
     override fun validateParams(params: Map<String, Any>) {
-        // Validate path
-        if (params["path"] == null || (params["path"] as? String).isNullOrBlank()) {
-            throw IllegalArgumentException("Parameter 'path' is required and cannot be empty")
-        }
-
-        // Check mode: either edit_description OR (old_string + new_string)
+        validatePathParam(params)
         val editDescription = params["edit_description"] as? String
-
         if (editDescription.isNullOrBlank()) {
             throw IllegalArgumentException("Either 'edit_description' must be provided")
         }
@@ -194,17 +185,14 @@ class AdvanceCodeEditingTool(
         conversationContext: String? = null
     ): ToolResult {
         // 1. Read file or prepare for creation
-        val normalizedPathStr = normalizePath(pathStr)
-        val path = sandbox.resolve(normalizedPathStr)
+        val path = resolveSandboxPath(pathStr)
         val fileExists = path.exists()
 
         if (limits.shouldExcludeFile(path.fileName.toString())) {
             return ToolResult.error("File extension not allowed: ${path.fileName}")
         }
 
-        return FileLockManager.withFileLock(path.toAbsolutePath().toString()) {
-            // Re-validate path inside lock to close TOCTOU window
-            sandbox.revalidateBeforeIO(path)
+        return withLockedFile(path) {
 
             val originalContent: String
             val fileSize: Long
@@ -226,13 +214,13 @@ class AdvanceCodeEditingTool(
             } else {
                 // File exists - will edit it
                 if (!path.isRegularFile()) {
-                    return@withFileLock ToolResult.error("Not a regular file: $pathStr")
+                    return@withLockedFile ToolResult.error("Not a regular file: $pathStr")
                 }
 
                 // Check file size
                 fileSize = path.fileSize()
                 if (fileSize > limits.maxFileSize) {
-                    return@withFileLock ToolResult.error(
+                    return@withLockedFile ToolResult.error(
                         "File too large for LLM-assisted editing: $fileSize bytes (max ${limits.maxFileSize} bytes). " +
                                 "Use simple search-replace mode or split into smaller edits."
                     )
@@ -313,7 +301,7 @@ class AdvanceCodeEditingTool(
                 )
             } catch (e: Exception) {
                 logger.error(e) { "LLM request failed" }
-                return@withFileLock ToolResult.error("LLM request failed: ${e.message}. Try again or use simple search-replace mode.")
+                return@withLockedFile ToolResult.error("LLM request failed: ${e.message}. Try again or use simple search-replace mode.")
             }
 
             val responseContent = response.content
@@ -335,7 +323,7 @@ class AdvanceCodeEditingTool(
 
             // 5. Extract code from response
             val newContent = extractCodeBlock(responseContent, language)
-                ?: return@withFileLock ToolResult.error(
+                ?: return@withLockedFile ToolResult.error(
                     "LLM did not return valid code block. Try rephrasing the edit description or use simple search-replace mode."
                 )
 
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/AskUserTool.kt b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/AskUserTool.kt
new file mode 100644
index 00000000..47975f9c
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/AskUserTool.kt
@@ -0,0 +1,67 @@
+package pl.jclab.refio.core.tools.implementations
+
+import pl.jclab.refio.core.services.turn.UserQuestionService
+import pl.jclab.refio.core.tools.base.Tool
+import pl.jclab.refio.core.tools.base.ToolCategory
+import pl.jclab.refio.core.tools.base.ToolInternalParams
+import pl.jclab.refio.core.tools.base.ToolMode
+import pl.jclab.refio.core.tools.base.ToolResult
+
+class AskUserTool(
+    private val questionService: UserQuestionService
+) : Tool {
+    override val name = "ask_user"
+    override val description = "Ask the user a question and wait for their response. " +
+        "Use when you need clarification, a choice, or confirmation before proceeding. " +
+        "Optionally provide predefined options to make it easier to respond. " +
+        "The agent loop is paused until the user answers."
+    override val mode = ToolMode.READ_ONLY
+    override val category = ToolCategory.SYSTEM
+
+    override fun validateParams(params: Map<String, Any>) {
+        val q = params["question"] as? String
+        if (q.isNullOrBlank()) throw IllegalArgumentException("Parameter 'question' is required")
+    }
+
+    override suspend fun execute(params: Map<String, Any>): ToolResult {
+        val question = params["question"] as? String
+            ?: return ToolResult.error("Missing required parameter: 'question'")
+
+        @Suppress("UNCHECKED_CAST")
+        val options = params["options"] as? List<String>
+
+        val taskId = (params[ToolInternalParams.TASK_ID] as? String) ?: "unknown"
+
+        val result = questionService.ask(taskId, question, options)
+
+        return result.fold(
+            onSuccess = { answer ->
+                ToolResult(
+                    success = true,
+                    output = "User answered: $answer",
+                    durationMs = 0,
+                    metadata = mapOf("answer" to answer)
+                )
+            },
+            onFailure = { error ->
+                ToolResult.error("ask_user failed: ${error.message}")
+            }
+        )
+    }
+
+    override fun getParameterSchema(): Map<String, Any> = mapOf(
+        "type" to "object",
+        "properties" to mapOf(
+            "question" to mapOf(
+                "type" to "string",
+                "description" to "The question to ask the user. Be specific and concise."
+            ),
+            "options" to mapOf(
+                "type" to "array",
+                "items" to mapOf("type" to "string"),
+                "description" to "Optional predefined choices. If provided, user picks from this list."
+            )
+        ),
+        "required" to listOf("question")
+    )
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/CodeEditingTool.kt b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/CodeEditingTool.kt
index 222a3597..1951b53e 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/CodeEditingTool.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/CodeEditingTool.kt
@@ -1,13 +1,11 @@
 package pl.jclab.refio.core.tools.implementations
 
 import pl.jclab.refio.core.tools.DiffUtils
-import pl.jclab.refio.core.tools.FileLockManager
 import pl.jclab.refio.core.tools.PathSandbox
-import pl.jclab.refio.core.tools.base.Tool
+import pl.jclab.refio.core.tools.base.FileTool
 import pl.jclab.refio.core.tools.base.ToolCategory
 import pl.jclab.refio.core.tools.base.ToolMode
 import pl.jclab.refio.core.tools.base.ToolResult
-import pl.jclab.refio.core.tools.normalizePath
 import pl.jclab.refio.core.tools.security.FileLimits
 import pl.jclab.refio.core.logging.dualLogger
 import java.nio.file.Files
@@ -33,19 +31,18 @@ private val logger = dualLogger("CodeEditingTool")
  * - old_string must be unique (unless replace_all=true)
  */
 class CodeEditingTool(
-    private val sandbox: PathSandbox,
+    sandbox: PathSandbox,
     private val limits: FileLimits
-) : Tool {
+) : FileTool(sandbox) {
 
     override val name = "code_editing"
     override val description = "Search-and-replace edit in an existing file. FREE."
     override val mode = ToolMode.WRITE
     override val category = ToolCategory.FILE_MODIFYING
+    override val selectionHint = "Small targeted edit in an existing file via exact string match (search/replace)."
 
     override fun validateParams(params: Map<String, Any>) {
-        if (params["path"] == null || (params["path"] as? String).isNullOrBlank()) {
-            throw IllegalArgumentException("Parameter 'path' is required and cannot be empty")
-        }
+        validatePathParam(params)
         if (params["old_string"] == null) {
             throw IllegalArgumentException("Parameter 'old_string' is required")
         }
@@ -58,26 +55,18 @@ class CodeEditingTool(
         val startTime = System.currentTimeMillis()
 
         try {
-            // Extract parameters with safe casting
-            val pathStr = params["path"] as? String
-                ?: return ToolResult.error("Missing required parameter: 'path'")
+            val pathStr = validatePathParam(params)
             val oldString = params["old_string"] as? String
                 ?: return ToolResult.error("Missing required parameter: 'old_string'")
             val newString = params["new_string"] as? String
                 ?: return ToolResult.error("Missing required parameter: 'new_string'")
             val replaceAll = (params["replace_all"] as? Boolean) ?: false
 
-            // Normalize path for security (bare filenames → "./file.txt", backslash → forward slash)
-            val normalizedPathStr = normalizePath(pathStr)
-
-            // Resolve and validate path
-            val path = sandbox.resolve(normalizedPathStr)
+            val path = resolveSandboxPath(pathStr)
 
             logger.info { "Editing file: relative='$pathStr', absolute='${path.toAbsolutePath()}', sandbox_root='${getSandboxRoot()}', oldString=${oldString.length} chars, newString=${newString.length} chars, replaceAll=$replaceAll" }
 
-            return FileLockManager.withFileLock(path.toAbsolutePath().toString()) {
-                // Re-validate path inside lock to close TOCTOU window (symlink swap between validate and I/O)
-                sandbox.revalidateBeforeIO(path)
+            return withLockedFile(path) {
 
                 // Check if file exists
                 val fileExists = path.exists()
@@ -112,7 +101,7 @@ class CodeEditingTool(
                         )
                         val diff = changeSummary.unifiedDiff ?: ""
 
-                        return@withFileLock ToolResult(
+                        return@withLockedFile ToolResult(
                             success = true,
                             output = buildString {
                                 appendLine("File created successfully: $pathStr ($newFileSize bytes)")
@@ -137,7 +126,7 @@ class CodeEditingTool(
                         )
                     } else {
                         logger.warn { "File not found: $pathStr (resolved to ${path.toAbsolutePath()})" }
-                        return@withFileLock ToolResult.error(
+                        return@withLockedFile ToolResult.error(
                             message = "File not found: $pathStr",
                             recovery = "To create a new file pass old_string=\"\" with the full new content, or use advance_code_editing with edit_description.",
                             nextActionHints = listOf(
@@ -152,14 +141,14 @@ class CodeEditingTool(
                 // Check if it's a regular file
                 if (!path.isRegularFile()) {
                     logger.warn { "Not a regular file: $pathStr (is directory: ${path.isDirectory()})" }
-                    return@withFileLock ToolResult.error("Not a regular file: $pathStr")
+                    return@withLockedFile ToolResult.error("Not a regular file: $pathStr")
                 }
 
                 // Check file size
                 val fileSize = path.fileSize()
                 logger.info { "File size before edit: $fileSize bytes, absolute='${path.toAbsolutePath()}'" }
                 if (fileSize > limits.maxFileSize) {
-                    return@withFileLock ToolResult.error(
+                    return@withLockedFile ToolResult.error(
                         message = "File too large: $fileSize bytes (max ${limits.maxFileSize} bytes)",
                         recovery = "Use multi_line_editor for targeted edits, or split the change into smaller hunks."
                     )
@@ -170,7 +159,7 @@ class CodeEditingTool(
 
                 // Check if old_string exists
                 if (!content.contains(oldString)) {
-                    return@withFileLock ToolResult.error(
+                    return@withLockedFile ToolResult.error(
                         message = "String not found in file: '${oldString.take(80)}${if (oldString.length > 80) "…" else ""}' (${oldString.length} chars).",
                         recovery = "Read the file first to see the actual content, then retry with the exact substring (including whitespace).",
                         nextActionHints = listOf(
@@ -185,7 +174,7 @@ class CodeEditingTool(
                 if (!replaceAll) {
                     val occurrences = countOccurrences(content, oldString)
                     if (occurrences > 1) {
-                        return@withFileLock ToolResult.error(
+                        return@withLockedFile ToolResult.error(
                             message = "String appears $occurrences times in file.",
                             recovery = "Either pass replace_all=true to apply to every occurrence, or extend old_string with surrounding context to make it unique.",
                             nextActionHints = listOf(
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/CodeIntelligenceTool.kt b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/CodeIntelligenceTool.kt
new file mode 100644
index 00000000..961bfcec
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/CodeIntelligenceTool.kt
@@ -0,0 +1,346 @@
+package pl.jclab.refio.core.tools.implementations
+
+import kotlinx.coroutines.Dispatchers
+import kotlinx.coroutines.withContext
+import pl.jclab.refio.core.logging.dualLogger
+import pl.jclab.refio.core.tools.PathSandbox
+import pl.jclab.refio.core.tools.base.Tool
+import pl.jclab.refio.core.tools.base.ToolCategory
+import pl.jclab.refio.core.tools.base.ToolMode
+import pl.jclab.refio.core.tools.base.ToolResult
+import java.nio.file.Files
+import java.util.concurrent.TimeUnit
+
+private val logger = dualLogger("CodeIntelligenceTool")
+
+/**
+ * Code intelligence without requiring IDE or IntelliJ PSI.
+ *
+ * Actions:
+ * - find_usages:    Find all uses of a symbol (method, class, variable) — uses grep
+ * - find_definition: Find where a symbol is defined — uses ctags or grep
+ * - list_symbols:  List all symbols in a file or directory — uses ctags
+ * - get_diagnostics: Run compiler/linter and return errors — uses language-specific CLI
+ */
+class CodeIntelligenceTool(
+    private val sandbox: PathSandbox
+) : Tool {
+    override val name = "code_intelligence"
+    override val description = "Analyze code structure: find symbol usages, definitions, list symbols, get compiler diagnostics. " +
+        "Works without IntelliJ PSI — uses ctags and grep. " +
+        "Actions: find_usages, find_definition, list_symbols, get_diagnostics."
+    override val mode = ToolMode.READ_ONLY
+    override val category = ToolCategory.DATA_PRODUCING
+    override val selectionHint =
+        "Code navigation: find_usages, find_definition, list_symbols, get_diagnostics (ctags-based)."
+
+    override fun validateParams(params: Map<String, Any>) {
+        val action = params["action"] as? String
+        if (action !in VALID_ACTIONS)
+            throw IllegalArgumentException("'action' must be one of: ${VALID_ACTIONS.joinToString()}")
+        if (action != "get_diagnostics") {
+            val symbol = params["symbol"] as? String
+            if (symbol.isNullOrBlank() && action != "list_symbols")
+                throw IllegalArgumentException("'symbol' is required for action: $action")
+        }
+    }
+
+    override suspend fun execute(params: Map<String, Any>): ToolResult = withContext(Dispatchers.IO) {
+        val startTime = System.currentTimeMillis()
+        val action = params["action"] as? String ?: return@withContext ToolResult.error("Missing 'action'")
+        val symbol = params["symbol"] as? String
+        val path = params["path"] as? String ?: "."
+        val language = params["language"] as? String
+
+        when (action) {
+            "find_usages" -> findUsages(symbol!!, path, startTime)
+            "find_definition" -> findDefinition(symbol!!, path, language, startTime)
+            "list_symbols" -> listSymbols(path, language, startTime)
+            "get_diagnostics" -> getDiagnostics(path, language, startTime)
+            else -> ToolResult.error("Unknown action: $action")
+        }
+    }
+
+    private fun findUsages(symbol: String, path: String, startTime: Long): ToolResult {
+        val root = sandbox.resolve(path).toAbsolutePath()
+
+        val grepCmd = buildList {
+            add("grep")
+            add("-rn")
+            add("--include=*.kt")
+            add("--include=*.java")
+            add("--include=*.ts")
+            add("--include=*.py")
+            add("--include=*.js")
+            add("--color=never")
+            add("\\b${Regex.escape(symbol)}\\b")
+            add(root.toString())
+        }
+
+        return try {
+            val output = runCommand(grepCmd, root.toFile())
+            val lines = output.lines().filter { it.isNotBlank() }.take(100)
+
+            if (lines.isEmpty()) {
+                return ToolResult(
+                    success = true,
+                    output = "No usages of '$symbol' found in $path",
+                    durationMs = elapsed(startTime)
+                )
+            }
+
+            val formatted = buildString {
+                appendLine("Usages of '$symbol' ($path):")
+                appendLine("Found: ${lines.size} occurrences\n")
+                lines.forEach { line ->
+                    val parts = line.split(":", limit = 3)
+                    if (parts.size >= 3) {
+                        val file = parts[0].removePrefix(root.toString()).trimStart('/', '\\')
+                        appendLine("  $file:${parts[1]}  ${parts[2].trim()}")
+                    } else {
+                        appendLine("  $line")
+                    }
+                }
+            }
+            ToolResult(success = true, output = formatted, durationMs = elapsed(startTime))
+        } catch (e: Exception) {
+            ToolResult.error("find_usages failed: ${e.message}")
+        }
+    }
+
+    private fun findDefinition(symbol: String, path: String, language: String?, startTime: Long): ToolResult {
+        val root = sandbox.resolve(path).toAbsolutePath()
+
+        if (isCtagsAvailable()) {
+            val ctagsResult = runCtagsForSymbol(symbol, root.toFile())
+            if (ctagsResult.isNotEmpty()) {
+                return ToolResult(
+                    success = true,
+                    output = "Definition of '$symbol':\n$ctagsResult",
+                    durationMs = elapsed(startTime)
+                )
+            }
+        }
+
+        val defPatterns = listOf(
+            "fun $symbol",
+            "class $symbol",
+            "interface $symbol",
+            "object $symbol",
+            "def $symbol",
+            "function $symbol",
+            "public.*$symbol\\(",
+            "val $symbol",
+            "var $symbol"
+        )
+
+        val allLines = mutableListOf<String>()
+        defPatterns.forEach { pattern ->
+            try {
+                val cmd = listOf(
+                    "grep", "-rn", "--color=never", "-E", pattern,
+                    "--include=*.kt", "--include=*.java", "--include=*.ts",
+                    "--include=*.py", "--include=*.js", root.toString()
+                )
+                val out = runCommand(cmd, root.toFile())
+                allLines.addAll(out.lines().filter { it.isNotBlank() })
+            } catch (_: Exception) {
+            }
+        }
+
+        if (allLines.isEmpty()) {
+            return ToolResult(
+                success = true,
+                output = "Definition of '$symbol' not found in $path. " +
+                    "Install universal-ctags for better results.",
+                durationMs = elapsed(startTime)
+            )
+        }
+
+        val output = "Definition of '$symbol':\n" +
+            allLines.distinct().take(20).joinToString("\n") { "  $it" }
+        return ToolResult(success = true, output = output, durationMs = elapsed(startTime))
+    }
+
+    private fun listSymbols(path: String, language: String?, startTime: Long): ToolResult {
+        val root = sandbox.resolve(path).toAbsolutePath()
+
+        if (!isCtagsAvailable()) {
+            return ToolResult(
+                success = false,
+                output = "list_symbols requires universal-ctags. Install with:\n" +
+                    "  macOS: brew install universal-ctags\n" +
+                    "  Ubuntu: apt-get install universal-ctags\n" +
+                    "  Windows: winget install universal-ctags\n\n" +
+                    "Alternative: use grep_search to search for class/fun/def patterns manually.",
+                durationMs = elapsed(startTime)
+            )
+        }
+
+        val langFilter = language?.let {
+            listOf("--languages=${it.lowercase().replaceFirstChar { c -> c.uppercase() }}")
+        } ?: emptyList()
+
+        val cmd = buildList {
+            add("ctags")
+            add("-R")
+            add("--fields=+n")
+            add("--output-format=json")
+            addAll(langFilter)
+            add("--output=-")
+            add(root.toString())
+        }
+
+        return try {
+            val output = runCommand(cmd, root.toFile())
+            val lines = output.lines().filter { it.isNotBlank() }.take(500)
+            val formatted = buildString {
+                appendLine("Symbols in $path (${lines.size} found):\n")
+                lines.forEach { line ->
+                    try {
+                        @Suppress("UNCHECKED_CAST")
+                        val tag = pl.jclab.refio.core.utils.GsonInstance.gson
+                            .fromJson(line, Map::class.java) as Map<String, Any>
+                        val name = tag["name"] ?: return@forEach
+                        val kind = tag["kind"] ?: ""
+                        val tagPath = (tag["path"] as? String)
+                            ?.removePrefix(root.toString())?.trimStart('/', '\\') ?: ""
+                        val lineNo = tag["line"] ?: ""
+                        appendLine("  [$kind] $name  ($tagPath:$lineNo)")
+                    } catch (_: Exception) {
+                        appendLine("  $line")
+                    }
+                }
+            }
+            ToolResult(success = true, output = formatted, durationMs = elapsed(startTime))
+        } catch (e: Exception) {
+            ToolResult.error("list_symbols failed: ${e.message}")
+        }
+    }
+
+    private fun getDiagnostics(path: String, language: String?, startTime: Long): ToolResult {
+        val root = sandbox.resolve(path).toAbsolutePath()
+        val detectedLang = language ?: detectProjectLanguage(root)
+
+        val cmd: List<String> = when (detectedLang?.lowercase()) {
+            "kotlin" -> listOf("./gradlew", "--no-daemon", "compileKotlin", "--info")
+            "java" -> listOf("./gradlew", "--no-daemon", "compileJava")
+            "typescript" -> listOf("npx", "tsc", "--noEmit")
+            "javascript" -> listOf("npx", "eslint", ".")
+            "python" -> listOf("python", "-m", "py_compile")
+            else -> return ToolResult.error(
+                "Cannot determine language for diagnostics. " +
+                    "Specify 'language' parameter: kotlin, java, typescript, javascript, python"
+            )
+        }
+
+        return try {
+            val output = runCommandWithTimeout(cmd, root.toFile(), timeoutSeconds = 120)
+            val lines = output.lines()
+
+            val errors = lines.filter { line ->
+                val l = line.lowercase()
+                l.contains("error:") || l.contains("warning:") ||
+                    l.contains("exception") || l.contains("failed")
+            }
+
+            val summary = if (errors.isEmpty()) {
+                "No errors found ($detectedLang diagnostics passed)"
+            } else {
+                "Diagnostics ($detectedLang) — ${errors.size} issues:\n${errors.take(50).joinToString("\n")}"
+            }
+
+            ToolResult(success = true, output = summary, durationMs = elapsed(startTime))
+        } catch (e: Exception) {
+            ToolResult.error("get_diagnostics failed: ${e.message}")
+        }
+    }
+
+    private fun isCtagsAvailable(): Boolean {
+        return try {
+            val p = Runtime.getRuntime().exec(arrayOf("ctags", "--version"))
+            p.waitFor(2, TimeUnit.SECONDS) && p.exitValue() == 0
+        } catch (_: Exception) {
+            false
+        }
+    }
+
+    private fun runCtagsForSymbol(symbol: String, dir: java.io.File): String {
+        val cmd = listOf(
+            "ctags", "-R", "--fields=+n", "--output-format=json",
+            "--output=-", dir.absolutePath
+        )
+        return try {
+            val output = runCommand(cmd, dir)
+            output.lines().filter { line ->
+                line.contains("\"name\":\"$symbol\"") || line.contains("\"name\": \"$symbol\"")
+            }.joinToString("\n")
+        } catch (_: Exception) {
+            ""
+        }
+    }
+
+    private fun detectProjectLanguage(root: java.nio.file.Path): String? {
+        return when {
+            Files.exists(root.resolve("build.gradle.kts")) || Files.exists(root.resolve("build.gradle")) -> "kotlin"
+            Files.exists(root.resolve("package.json")) && Files.exists(root.resolve("tsconfig.json")) -> "typescript"
+            Files.exists(root.resolve("package.json")) -> "javascript"
+            Files.exists(root.resolve("requirements.txt")) || Files.exists(root.resolve("pyproject.toml")) -> "python"
+            else -> null
+        }
+    }
+
+    private fun runCommand(cmd: List<String>, workDir: java.io.File): String {
+        val process = ProcessBuilder(cmd)
+            .directory(workDir)
+            .redirectErrorStream(true)
+            .start()
+        val output = process.inputStream.bufferedReader().readText()
+        process.waitFor(30, TimeUnit.SECONDS)
+        return output
+    }
+
+    private fun runCommandWithTimeout(cmd: List<String>, workDir: java.io.File, timeoutSeconds: Long): String {
+        val process = ProcessBuilder(cmd)
+            .directory(workDir)
+            .redirectErrorStream(true)
+            .start()
+        val output = process.inputStream.bufferedReader().readText()
+        val finished = process.waitFor(timeoutSeconds, TimeUnit.SECONDS)
+        if (!finished) {
+            process.destroyForcibly()
+            throw RuntimeException("Command timed out after ${timeoutSeconds}s")
+        }
+        return output
+    }
+
+    private fun elapsed(start: Long) = (System.currentTimeMillis() - start).toInt()
+
+    companion object {
+        val VALID_ACTIONS = setOf("find_usages", "find_definition", "list_symbols", "get_diagnostics")
+    }
+
+    override fun getParameterSchema(): Map<String, Any> = mapOf(
+        "type" to "object",
+        "properties" to mapOf(
+            "action" to mapOf(
+                "type" to "string",
+                "enum" to VALID_ACTIONS.toList(),
+                "description" to "find_usages: where is symbol used | find_definition: where is symbol defined | list_symbols: all symbols in path | get_diagnostics: compiler errors"
+            ),
+            "symbol" to mapOf(
+                "type" to "string",
+                "description" to "Symbol name to search for (required for find_usages, find_definition)"
+            ),
+            "path" to mapOf(
+                "type" to "string",
+                "description" to "File or directory path relative to project root (default: '.')"
+            ),
+            "language" to mapOf(
+                "type" to "string",
+                "description" to "Language hint: kotlin, java, typescript, javascript, python. Auto-detected if omitted."
+            )
+        ),
+        "required" to listOf("action")
+    )
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/CreateNewFileTool.kt b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/CreateNewFileTool.kt
index d218cc95..c04db3ef 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/CreateNewFileTool.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/CreateNewFileTool.kt
@@ -1,13 +1,11 @@
 package pl.jclab.refio.core.tools.implementations
 
 import pl.jclab.refio.core.tools.DiffUtils
-import pl.jclab.refio.core.tools.FileLockManager
 import pl.jclab.refio.core.tools.PathSandbox
-import pl.jclab.refio.core.tools.base.Tool
+import pl.jclab.refio.core.tools.base.FileTool
 import pl.jclab.refio.core.tools.base.ToolCategory
 import pl.jclab.refio.core.tools.base.ToolMode
 import pl.jclab.refio.core.tools.base.ToolResult
-import pl.jclab.refio.core.tools.normalizePath
 import pl.jclab.refio.core.tools.security.FileLimits
 import pl.jclab.refio.core.logging.dualLogger
 import java.nio.file.Files
@@ -30,25 +28,27 @@ private val logger = dualLogger("CreateNewFileTool")
  * - Returns warning (success=true) if file already exists (use code_editing instead)
  */
 class CreateNewFileTool(
-    private val sandbox: PathSandbox,
+    sandbox: PathSandbox,
     private val limits: FileLimits
-) : Tool {
+) : FileTool(sandbox) {
 
     override val name = "create_new_file"
-    override val description = "Create a NEW file with given content. FREE. " +
-        "HARD FAILS if the file already exists — does NOT overwrite. " +
-        "MANDATORY: before calling this tool, you MUST have already verified in a PRIOR turn that the path is unused " +
-        "(via file_search or read_directory called ALONE). Do not call this tool in the same actions array as the existence check — " +
-        "tools in one turn run in parallel and the check will not gate the create. " +
-        "Even if the user named the file specifically and it 'looks new', the project may already contain it — always pre-check. " +
-        "On 'File already exists' error: do NOT retry. Switch to read_file + code_editing / multi_edit / multi_line_editor."
+    override val description = "Create a NEW SMALL file (config, stub, short snippet) " +
+        "where the agent already has the full content ready. " +
+        "Strongly prefer `advance_code_editing` for code files > ~50 lines, HTML pages, full classes, " +
+        "or any content generated from scratch — that tool uses a dedicated LLM call so your agent " +
+        "response stays small and avoids streaming timeouts. Stuffing hundreds of lines into `content` " +
+        "here inflates the agent response, wastes tokens, and risks truncation. " +
+        "HARD FAILS if file already exists. Pre-check path in a PRIOR turn (file_search/read_directory). " +
+        "On 'File already exists' error: switch to read_file + code_editing."
     override val mode = ToolMode.WRITE
     override val category = ToolCategory.FILE_MODIFYING
+    override val selectionHint =
+        "Small new files (configs, stubs, short snippets) where you already have the full content. " +
+        "For >~50 lines, HTML/classes, or generated code, prefer advance_code_editing."
 
     override fun validateParams(params: Map<String, Any>) {
-        if (params["path"] == null || (params["path"] as? String).isNullOrBlank()) {
-            throw IllegalArgumentException("Parameter 'path' is required and cannot be empty")
-        }
+        validatePathParam(params)
         if (params["content"] == null) {
             throw IllegalArgumentException("Parameter 'content' is required")
         }
@@ -58,30 +58,21 @@ class CreateNewFileTool(
         val startTime = System.currentTimeMillis()
 
         try {
-            // Extract parameters with safe casting
-            val pathStr = params["path"] as? String
-                ?: return ToolResult.error("Missing required parameter: 'path'")
+            val pathStr = validatePathParam(params)
             val content = params["content"] as? String
                 ?: return ToolResult.error("Missing required parameter: 'content'")
 
-            // Check content size
             if (content.length > limits.maxFileSize) {
                 return ToolResult.error(
                     "Content too large: ${content.length} bytes (max ${limits.maxFileSize} bytes)"
                 )
             }
 
-            // Normalize path for security (bare filenames → "./file.txt", backslash → forward slash)
-            val normalizedPathStr = normalizePath(pathStr)
-
-            // Resolve and validate path
-            val path = sandbox.resolve(normalizedPathStr)
+            val path = resolveSandboxPath(pathStr)
 
             logger.info { "Creating file: relative='$pathStr', absolute='${path.toAbsolutePath()}', contentSize=${content.length} chars, lineCount=${content.lines().size}" }
 
-            return FileLockManager.withFileLock(path.toAbsolutePath().toString()) {
-                // Re-validate path inside lock to close TOCTOU window
-                sandbox.revalidateBeforeIO(path)
+            return withLockedFile(path) {
 
                 // Check if file already exists.
                 // We return success=false (a real failure) rather than a soft warning, because
@@ -89,7 +80,7 @@ class CreateNewFileTool(
                 // creation succeeded and continue with stale assumptions about file content.
                 if (path.exists()) {
                     logger.warn { "File already exists: $pathStr (resolved to ${path.toAbsolutePath()})" }
-                    return@withFileLock ToolResult.error(
+                    return@withLockedFile ToolResult.error(
                         message = "File already exists: $pathStr. create_new_file refuses to overwrite.",
                         recovery = "DO NOT retry create_new_file for this path. Instead: " +
                             "1) read_file($pathStr) to inspect current content, " +
@@ -109,7 +100,7 @@ class CreateNewFileTool(
                 // Check if parent is a directory
                 val parent = path.parent
                 if (parent != null && parent.exists() && !parent.isDirectory()) {
-                    return@withFileLock ToolResult.error("Parent path exists but is not a directory: ${parent.fileName}")
+                    return@withLockedFile ToolResult.error("Parent path exists but is not a directory: ${parent.fileName}")
                 }
 
                 // Create parent directories if needed
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/DelegateToStrongModelTool.kt b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/DelegateToStrongModelTool.kt
index 28dd4bd8..542ef4f9 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/DelegateToStrongModelTool.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/DelegateToStrongModelTool.kt
@@ -40,12 +40,13 @@ class DelegateToStrongModelTool(
 
     override val name = "delegate_to_strong_model"
     override val description = "Delegate a complex task to a stronger, more capable model. " +
-        "Use when the task requires deeper reasoning, you've attempted a solution but the result is unsatisfactory, " +
-        "the problem involves complex architectural decisions or subtle bugs, or you need expert-level analysis. " +
-        "Default: single-shot (text response, no tools). Set allow_tools=true for full agent mode. " +
-        "The strong model receives ONLY what you pass — be explicit in your task description."
+        "Use when deeper reasoning is needed or you're stuck after multiple attempts. " +
+        "Default: single-shot (no tools). Set allow_tools=true for agent mode. " +
+        "The strong model receives ONLY what you pass — be explicit."
     override val mode = ToolMode.READ_ONLY
     override val category = ToolCategory.SYSTEM
+    override val selectionHint =
+        "Escalate to a stronger model when stuck after 4+ consecutive failures on the same operation."
 
     override suspend fun execute(params: Map<String, Any>): ToolResult {
         val taskId = params["_task_id"]?.toString()
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/FetchWebpageTool.kt b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/FetchWebpageTool.kt
new file mode 100644
index 00000000..ff153ea7
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/FetchWebpageTool.kt
@@ -0,0 +1,191 @@
+package pl.jclab.refio.core.tools.implementations
+
+import io.ktor.client.*
+import io.ktor.client.engine.cio.*
+import io.ktor.client.request.*
+import io.ktor.client.statement.*
+import kotlinx.coroutines.Dispatchers
+import kotlinx.coroutines.withContext
+import org.jsoup.Jsoup
+import org.jsoup.nodes.Element
+import pl.jclab.refio.core.api.ModelOperation
+import pl.jclab.refio.core.llm.LLMClient
+import pl.jclab.refio.core.llm.LLMMessage
+import pl.jclab.refio.core.logging.dualLogger
+import pl.jclab.refio.core.services.ConfigService
+import pl.jclab.refio.core.tools.base.Tool
+import pl.jclab.refio.core.tools.base.ToolCategory
+import pl.jclab.refio.core.tools.base.ToolMode
+import pl.jclab.refio.core.tools.base.ToolResult
+
+private val logger = dualLogger("FetchWebpageTool")
+
+class FetchWebpageTool(
+    private val llmClient: LLMClient,
+    private val configService: ConfigService
+) : Tool {
+    override val name = "fetch_webpage"
+    override val description = "Fetch a URL, convert HTML to Markdown, then extract information with AI using your prompt. " +
+        "For raw HTTP/API access use http_request instead."
+    override val mode = ToolMode.READ_ONLY
+    override val category = ToolCategory.DATA_PRODUCING
+    override val selectionHint =
+        "Fetch URL, convert HTML→Markdown, extract info via AI. For raw HTTP/API use http_request."
+
+    override fun validateParams(params: Map<String, Any>) {
+        val url = params["url"] as? String
+        if (url.isNullOrBlank()) throw IllegalArgumentException("Parameter 'url' is required")
+        if (!url.startsWith("http://") && !url.startsWith("https://"))
+            throw IllegalArgumentException("'url' must start with http:// or https://")
+        val prompt = params["prompt"] as? String
+        if (prompt.isNullOrBlank()) throw IllegalArgumentException("Parameter 'prompt' is required")
+    }
+
+    override suspend fun execute(params: Map<String, Any>): ToolResult = withContext(Dispatchers.IO) {
+        val startTime = System.currentTimeMillis()
+        val url = params["url"] as? String
+            ?: return@withContext ToolResult.error("Missing 'url'")
+        val prompt = params["prompt"] as? String
+            ?: return@withContext ToolResult.error("Missing 'prompt'")
+        val maxContentChars = ((params["max_content_chars"] as? Number)?.toInt() ?: 20_000)
+            .coerceIn(1_000, 50_000)
+
+        logger.info { "Fetching $url for AI processing" }
+        val html = try {
+            fetchHtml(url)
+        } catch (e: Exception) {
+            return@withContext ToolResult.error("Failed to fetch URL: ${e.message}")
+        }
+
+        val markdown = htmlToMarkdown(html, url, maxContentChars)
+
+        val (model, provider) = try {
+            configService.getWeakModel()
+        } catch (e: Exception) {
+            return@withContext ToolResult.error("No LLM model configured: ${e.message}")
+        }
+
+        val systemPrompt = "You are a precise web content extractor. " +
+            "The user will provide Markdown content from a webpage and a request. " +
+            "Answer the request using only the provided content. Be concise and accurate."
+
+        val userMessage = "## Webpage: $url\n\n$markdown\n\n---\n\n## Request\n\n$prompt"
+
+        val messages = listOf(
+            LLMMessage(role = "user", content = userMessage)
+        )
+
+        logger.info { "Processing page with LLM (model=$model, provider=$provider, content=${markdown.length} chars)" }
+
+        val llmResponse = try {
+            llmClient.complete(
+                provider = provider,
+                model = model,
+                messages = messages,
+                systemPrompt = systemPrompt,
+                maxTokens = 2048,
+                stream = false,
+                source = "FetchWebpageTool"
+            )
+        } catch (e: Exception) {
+            return@withContext ToolResult.error("LLM processing failed: ${e.message}")
+        }
+
+        val answer = llmResponse.content
+
+        ToolResult(
+            success = true,
+            output = answer,
+            durationMs = (System.currentTimeMillis() - startTime).toInt(),
+            metadata = mapOf(
+                "url" to url,
+                "content_chars" to markdown.length,
+                "model" to model
+            )
+        )
+    }
+
+    private suspend fun fetchHtml(url: String): String {
+        val client = HttpClient(CIO) {
+            engine { requestTimeout = 20_000 }
+            followRedirects = true
+        }
+        try {
+            val response = client.get(url) {
+                header("User-Agent", "Mozilla/5.0 (compatible; Refio/1.0)")
+                header("Accept", "text/html,application/xhtml+xml,*/*")
+            }
+            return response.bodyAsText()
+        } finally {
+            client.close()
+        }
+    }
+
+    private fun htmlToMarkdown(html: String, baseUrl: String, maxChars: Int): String {
+        val doc = Jsoup.parse(html, baseUrl)
+
+        doc.select("script, style, nav, footer, header, aside, .sidebar, .menu, .ads, .advertisement").remove()
+
+        val mainContent = doc.select("main, article, .content, .post, #content, #main").firstOrNull()
+            ?: doc.body()
+
+        return buildString {
+            appendLine("# ${doc.title()}")
+            appendLine()
+            convertElement(mainContent, this)
+        }.take(maxChars).let {
+            if (it.length == maxChars) "$it\n\n[... content truncated at $maxChars chars ...]"
+            else it
+        }
+    }
+
+    private fun convertElement(el: Element, sb: StringBuilder) {
+        el.children().forEach { child ->
+            when (child.tagName()) {
+                "h1" -> sb.appendLine("# ${child.text()}\n")
+                "h2" -> sb.appendLine("## ${child.text()}\n")
+                "h3" -> sb.appendLine("### ${child.text()}\n")
+                "h4", "h5", "h6" -> sb.appendLine("#### ${child.text()}\n")
+                "p" -> {
+                    val text = child.text().trim()
+                    if (text.isNotBlank()) sb.appendLine("$text\n")
+                }
+                "ul", "ol" -> {
+                    child.select("li").forEach { li ->
+                        sb.appendLine("- ${li.text()}")
+                    }
+                    sb.appendLine()
+                }
+                "pre", "code" -> {
+                    sb.appendLine("```\n${child.text()}\n```\n")
+                }
+                "a" -> {
+                    val href = child.attr("abs:href")
+                    val text = child.text()
+                    if (text.isNotBlank() && href.isNotBlank()) sb.append("[$text]($href) ")
+                }
+                "br" -> sb.appendLine()
+                else -> convertElement(child, sb)
+            }
+        }
+    }
+
+    override fun getParameterSchema(): Map<String, Any> = mapOf(
+        "type" to "object",
+        "properties" to mapOf(
+            "url" to mapOf(
+                "type" to "string",
+                "description" to "URL to fetch"
+            ),
+            "prompt" to mapOf(
+                "type" to "string",
+                "description" to "What to extract or answer from the page. Be specific."
+            ),
+            "max_content_chars" to mapOf(
+                "type" to "integer",
+                "description" to "Max characters of page content to process (default: 20000, max: 50000)"
+            )
+        ),
+        "required" to listOf("url", "prompt")
+    )
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/FileSearchTool.kt b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/FileSearchTool.kt
index d3fc9ea3..b486fbae 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/FileSearchTool.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/FileSearchTool.kt
@@ -39,6 +39,7 @@ class FileSearchTool(
     override val description = "Find files by name pattern (glob syntax)."
     override val mode = ToolMode.READ_ONLY
     override val category = ToolCategory.DATA_PRODUCING
+    override val selectionHint = "Find files by name/path (glob). Also used as pre-check before create_new_file."
 
     override fun validateParams(params: Map<String, Any>) {
         if (params["pattern"] == null || (params["pattern"] as? String).isNullOrBlank()) {
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/GrepSearchTool.kt b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/GrepSearchTool.kt
index dd182728..aae7645e 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/GrepSearchTool.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/GrepSearchTool.kt
@@ -40,6 +40,9 @@ class GrepSearchTool(
     override val description = "Search file contents by regex pattern."
     override val mode = ToolMode.READ_ONLY
     override val category = ToolCategory.DATA_PRODUCING
+    override val selectionHint =
+        "Search file contents by exact string or regex. Use for identifiers and known literals; " +
+        "prefer rag_search for conceptual queries without good keywords."
 
     override fun validateParams(params: Map<String, Any>) {
         val pattern = params["pattern"] as? String
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/HttpRequestTool.kt b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/HttpRequestTool.kt
index 63d7ae59..c66c2538 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/HttpRequestTool.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/HttpRequestTool.kt
@@ -78,9 +78,11 @@ class HttpRequestTool(
 ) : Tool {
 
     override val name = "http_request"
-    override val description = "Make HTTP requests (GET/POST/PUT/DELETE). Stateless: no cookie jar — for session auth read Set-Cookie from the response and resend it via headers={\"Cookie\":\"...\"} on the next call. The 'body' parameter accepts either a JSON string OR a raw object/array — objects are auto-serialized to JSON. Use save_to_file for large responses."
+    override val description = "Make HTTP requests (GET/POST/PUT/DELETE). Stateless — no cookie jar. " +
+        "Body accepts JSON string or raw object (auto-serialized). Use save_to_file for large responses."
     override val mode = ToolMode.WRITE
     override val category = ToolCategory.DATA_PRODUCING
+    override val selectionHint = "HTTP requests (GET/POST/PUT/DELETE). Use save_to_file for large responses."
 
     override fun validateParams(params: Map<String, Any>) {
         val url = params["url"] as? String
@@ -107,21 +109,23 @@ class HttpRequestTool(
             val bodyFile = params["body_file"] as? String
             val body = if (bodyFile != null) null else coerceBody(params["body"])
             val rawContentType = params["content_type"] as? String
-            // Validate and fallback to default if content_type is invalid
-            val contentType = run {
-                if (rawContentType == null) return@run "application/json"
+            // No fallback: invalid content_type is a caller bug, not something to paper over.
+            val contentType = if (rawContentType == null) {
+                "application/json"
+            } else {
                 val parts = rawContentType.split("/", limit = 2)
                 if (parts.size != 2 || parts[0].isBlank() || parts[1].isBlank()) {
-                    logger.warn { "Invalid content_type '$rawContentType', falling back to application/json" }
-                    return@run "application/json"
+                    return@withContext ToolResult.error(
+                        "Invalid content_type '$rawContentType' (expected 'type/subtype')"
+                    )
                 }
-                // Final check: try parsing to catch any edge cases
                 try {
                     ContentType.parse(rawContentType)
                     rawContentType
                 } catch (e: Exception) {
-                    logger.warn { "Unparseable content_type '$rawContentType', falling back to application/json" }
-                    "application/json"
+                    return@withContext ToolResult.error(
+                        "Unparseable content_type '$rawContentType': ${e.message}"
+                    )
                 }
             }
             val saveToFile = params["save_to_file"] as? String
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/InvokeSubagentTool.kt b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/InvokeSubagentTool.kt
index 9bfc5d4f..f0ec65a0 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/InvokeSubagentTool.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/InvokeSubagentTool.kt
@@ -31,6 +31,8 @@ class InvokeSubagentTool(
         get() = buildDynamicDescription()
     override val mode = ToolMode.READ_ONLY
     override val category = ToolCategory.DATA_PRODUCING
+    override val selectionHint =
+        "Delegate a sub-task to a specialized subagent (EXPENSIVE). Dispatch multiple in parallel for independent work."
 
     override suspend fun execute(params: Map<String, Any>): ToolResult {
         val taskId = params["_task_id"]?.toString()
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/LlmCallTool.kt b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/LlmCallTool.kt
index 39485f82..1f14ef8f 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/LlmCallTool.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/LlmCallTool.kt
@@ -52,12 +52,12 @@ class LlmCallTool(
 ) : Tool {
 
     override val name = "llm_call"
-    override val description = "Send a prompt to an LLM and get the text response. " +
-        "Use 'prompt' for instructions/role, 'data' for inline text, 'file_path' for large content (keeps it out of agent context). " +
-        "Supports vision: use 'image_path' or 'image_base64' with a vision-capable model (e.g. openai/gpt-5.4-mini). " +
-        "No tools, no history, no project context — a raw single-turn LLM call. CHEAPER than invoke_subagent."
+    override val description = "Raw single-turn LLM call — no tools, no history, no project context. " +
+        "Supports vision via image_path/image_base64. CHEAPER than invoke_subagent."
     override val mode = ToolMode.WRITE
     override val category = ToolCategory.FILE_PRODUCING
+    override val selectionHint =
+        "Single-turn LLM call for analysis/transform/vision. Cheaper than invoke_subagent when no tools needed."
 
     override suspend fun execute(params: Map<String, Any>): ToolResult {
         val userPrompt = params["data"]?.toString()?.trim() ?: ""
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/MemoryTool.kt b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/MemoryTool.kt
index fa0379bd..c8ecde72 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/MemoryTool.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/MemoryTool.kt
@@ -33,6 +33,8 @@ get_subtask_output (recover full raw output of a past tool call by subtask id /
 Use for: key findings, intermediate results, decisions, blockers, recovering data lost to summarization."""
     override val mode = ToolMode.READ_ONLY
     override val category = ToolCategory.SYSTEM
+    override val selectionHint =
+        "Cross-turn working memory: write/read findings, list keys, recover truncated subtask output."
 
     override fun getParameterSchema(): Map<String, Any> = mapOf(
         "type" to "object",
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/MonitorProcessTool.kt b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/MonitorProcessTool.kt
new file mode 100644
index 00000000..a519632c
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/MonitorProcessTool.kt
@@ -0,0 +1,62 @@
+package pl.jclab.refio.core.tools.implementations
+
+import pl.jclab.refio.core.services.ProcessManager
+import pl.jclab.refio.core.tools.base.Tool
+import pl.jclab.refio.core.tools.base.ToolCategory
+import pl.jclab.refio.core.tools.base.ToolMode
+import pl.jclab.refio.core.tools.base.ToolResult
+
+class MonitorProcessTool(
+    private val processManager: ProcessManager
+) : Tool {
+    override val name = "monitor_process"
+    override val description = "Read output from a background process started with run_process_background. " +
+        "Call repeatedly to get new output lines. Returns output so far and whether process is still running. " +
+        "If process has finished, returns all remaining output."
+    override val mode = ToolMode.READ_ONLY
+    override val category = ToolCategory.DATA_PRODUCING
+    override val selectionHint = "Read new output from a background process (pairs with run_process_background)."
+
+    override fun validateParams(params: Map<String, Any>) {
+        val id = params["process_id"] as? String
+        if (id.isNullOrBlank()) throw IllegalArgumentException("Parameter 'process_id' is required")
+    }
+
+    override suspend fun execute(params: Map<String, Any>): ToolResult {
+        val processId = params["process_id"] as? String
+            ?: return ToolResult.error("Missing 'process_id'")
+        val maxLines = ((params["max_lines"] as? Number)?.toInt() ?: 200).coerceIn(1, 1000)
+
+        val (lines, isRunning) = processManager.readOutput(processId, maxLines)
+
+        if (processManager.get(processId) == null && lines.isEmpty()) {
+            return ToolResult.error("Process not found: $processId (may have already finished)")
+        }
+
+        val statusLine = if (isRunning) "[Process $processId: RUNNING]" else "[Process $processId: FINISHED]"
+        val output = if (lines.isEmpty()) {
+            "$statusLine\n(no new output)"
+        } else {
+            "$statusLine\n${lines.joinToString("\n")}"
+        }
+
+        return ToolResult(
+            success = true,
+            output = output,
+            metadata = mapOf(
+                "process_id" to processId,
+                "is_running" to isRunning,
+                "lines_read" to lines.size
+            )
+        )
+    }
+
+    override fun getParameterSchema(): Map<String, Any> = mapOf(
+        "type" to "object",
+        "properties" to mapOf(
+            "process_id" to mapOf("type" to "string", "description" to "Process ID from run_process_background"),
+            "max_lines" to mapOf("type" to "integer", "description" to "Max output lines to return (default: 200)")
+        ),
+        "required" to listOf("process_id")
+    )
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/MultiEditTool.kt b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/MultiEditTool.kt
index b134d54b..62a70790 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/MultiEditTool.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/MultiEditTool.kt
@@ -1,10 +1,9 @@
 package pl.jclab.refio.core.tools.implementations
 
 import pl.jclab.refio.core.tools.DiffUtils
-import pl.jclab.refio.core.tools.FileLockManager
 import pl.jclab.refio.core.tools.PathSandbox
 import pl.jclab.refio.core.tools.base.ChangeSummary
-import pl.jclab.refio.core.tools.base.Tool
+import pl.jclab.refio.core.tools.base.FileTool
 import pl.jclab.refio.core.tools.base.ToolCategory
 import pl.jclab.refio.core.tools.base.ToolMode
 import pl.jclab.refio.core.tools.base.ToolResult
@@ -33,14 +32,15 @@ private val logger = dualLogger("MultiEditTool")
  * - File size limits enforced
  */
 class MultiEditTool(
-    private val sandbox: PathSandbox,
+    sandbox: PathSandbox,
     private val limits: FileLimits
-) : Tool {
+) : FileTool(sandbox) {
 
     override val name = "multi_edit"
     override val description = "Atomic search-and-replace across multiple files. FREE."
     override val mode = ToolMode.WRITE
     override val category = ToolCategory.FILE_MODIFYING
+    override val selectionHint = "Atomic search/replace across multiple files or multiple sites in one file."
 
     override fun validateParams(params: Map<String, Any>) {
         @Suppress("UNCHECKED_CAST")
@@ -70,13 +70,10 @@ class MultiEditTool(
                 parseEdit(edit, index)
             }
 
-            // Prepare and apply each edit under a per-file lock
+            // Prepare and apply each edit under a per-file lock (revalidated inside lock).
             val results = parsedEdits.map { edit ->
-                val resolvedPath = sandbox.resolve(edit.path)
-                FileLockManager.withFileLock(resolvedPath.toAbsolutePath().toString()) {
-                    // Re-validate path inside lock to close TOCTOU window
-                    sandbox.revalidateBeforeIO(resolvedPath)
-
+                val resolvedPath = resolveSandboxPath(edit.path)
+                withLockedFile(resolvedPath) {
                     val prep = prepareEdit(edit)
                     applyEdit(prep)
                 }
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/MultiLineEditorTool.kt b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/MultiLineEditorTool.kt
index b8a6e0bf..6484032e 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/MultiLineEditorTool.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/MultiLineEditorTool.kt
@@ -15,9 +15,8 @@ import pl.jclab.refio.core.services.ConfigService
 import pl.jclab.refio.core.services.PromptsService
 import pl.jclab.refio.core.services.execution.unified.ExecutionEventListener
 import pl.jclab.refio.core.tools.DiffUtils
-import pl.jclab.refio.core.tools.FileLockManager
 import pl.jclab.refio.core.tools.PathSandbox
-import pl.jclab.refio.core.tools.base.Tool
+import pl.jclab.refio.core.tools.base.FileTool
 import pl.jclab.refio.core.tools.base.ToolCategory
 import pl.jclab.refio.core.tools.base.ToolMode
 import pl.jclab.refio.core.tools.base.ToolResult
@@ -59,29 +58,26 @@ private val logger = dualLogger("MultiLineEditorTool")
  * Based on: docs/0041-multi-coding.md (RFC 0041)
  */
 class MultiLineEditorTool(
-    private val sandbox: PathSandbox,
+    sandbox: PathSandbox,
     private val limits: FileLimits,
     private val llmClient: LLMClient,
     private val configService: ConfigService,
     private val promptsService: PromptsService,
     private val taskRepository: TaskRepository
-) : Tool {
+) : FileTool(sandbox) {
 
     override val name = "multi_line_editor"
     override val description =
         "LLM-assisted targeted edits in an existing file (2-10 locations). CHEAP (~\$0.02)."
     override val mode = ToolMode.WRITE
     override val category = ToolCategory.FILE_MODIFYING
+    override val selectionHint =
+        "Semantic/intent-based edits in an existing file (2-10 locations) where exact strings are hard to match."
 
     private val gson = Gson()
 
     override fun validateParams(params: Map<String, Any>) {
-        // Validate path
-        if (params["path"] == null || (params["path"] as? String).isNullOrBlank()) {
-            throw IllegalArgumentException("Parameter 'path' is required and cannot be empty")
-        }
-
-        // Validate edit_description
+        validatePathParam(params)
         val editDescription = params["edit_description"] as? String
         if (editDescription.isNullOrBlank()) {
             throw IllegalArgumentException("Parameter 'edit_description' is required and cannot be empty")
@@ -167,7 +163,7 @@ class MultiLineEditorTool(
         conversationContext: String? = null
     ): ToolResult {
         // 1. Read file and validate
-        val path = sandbox.resolve(pathStr)
+        val path = resolveSandboxPath(pathStr)
 
         if (!path.exists()) {
             return ToolResult.error("File not found: $pathStr. Use advance_code_editing to create new files.")
@@ -186,9 +182,7 @@ class MultiLineEditorTool(
             )
         }
 
-        return FileLockManager.withFileLock(path.toAbsolutePath().toString()) {
-            // Re-validate path inside lock to close TOCTOU window
-            sandbox.revalidateBeforeIO(path)
+        return withLockedFile(path) {
 
             // Read content
             val originalContent = Files.readString(path)
@@ -272,7 +266,7 @@ class MultiLineEditorTool(
                 )
             } catch (e: Exception) {
                 logger.error(e) { "LLM request failed" }
-                return@withFileLock ToolResult.error("LLM request failed: ${e.message}. Try rephrasing the edit description.")
+                return@withLockedFile ToolResult.error("LLM request failed: ${e.message}. Try rephrasing the edit description.")
             }
 
             val responseContent = response.content
@@ -300,7 +294,7 @@ class MultiLineEditorTool(
                     "Failed to parse LLM response as JSON: ${e.message}. " +
                         "Response preview: ${responseContent.take(200)}"
                 }
-                return@withFileLock ToolResult.error(
+                return@withLockedFile ToolResult.error(
                     "Failed to parse LLM response. Response was: ${responseContent.take(200)}... " +
                     "Try rephrasing the edit description or use advance_code_editing."
                 )
@@ -308,7 +302,7 @@ class MultiLineEditorTool(
 
             if (edits.isEmpty()) {
                 logger.warn { "LLM returned no edits for description: $editDescription" }
-                return@withFileLock ToolResult.error(
+                return@withLockedFile ToolResult.error(
                     "LLM did not identify any changes to make. Try being more specific in the edit description."
                 )
             }
@@ -318,7 +312,7 @@ class MultiLineEditorTool(
             // 7. Validate edits
             val validationError = validateEdits(edits, lines.size)
             if (validationError != null) {
-                return@withFileLock ToolResult.error(validationError)
+                return@withLockedFile ToolResult.error(validationError)
             }
 
             // 8. Apply edits (from end to start to preserve line numbers)
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/RagSearchTool.kt b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/RagSearchTool.kt
index 634e3f17..55344aad 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/RagSearchTool.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/RagSearchTool.kt
@@ -44,6 +44,9 @@ class RagSearchTool(
             "Prefer grep_search for exact identifiers and rag_search for concepts."
     override val mode: ToolMode = ToolMode.READ_ONLY
     override val category: ToolCategory = ToolCategory.DATA_PRODUCING
+    override val selectionHint: String =
+        "Semantic search over indexed code/docs. Use for concepts without good keywords; " +
+        "prefer grep_search when you know an exact identifier."
 
     override fun validateParams(params: Map<String, Any>) {
         val query = params["query"] as? String
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/ReadDirectoryTool.kt b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/ReadDirectoryTool.kt
index c6fa44b2..9b7a178c 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/ReadDirectoryTool.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/ReadDirectoryTool.kt
@@ -36,6 +36,7 @@ class ReadDirectoryTool(
     override val description = "List files and directories."
     override val mode = ToolMode.READ_ONLY
     override val category = ToolCategory.DATA_PRODUCING
+    override val selectionHint = "List directory contents."
 
     override fun validateParams(params: Map<String, Any>) {
         // Path is optional - defaults to "."
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/ReadFileTool.kt b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/ReadFileTool.kt
index f5a133ae..f7177b28 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/ReadFileTool.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/ReadFileTool.kt
@@ -42,15 +42,14 @@ class ReadFileTool(
 ) : Tool {
 
     override val name = "read_file"
-    override val description = "Read a text, image, or PDF file. " +
-        "DEFAULT BEHAVIOUR: reads the WHOLE file in one call (up to the 2 MB sandbox limit). " +
-        "DO NOT pass offset/limit for normal source files — you will fragment your view of the code " +
-        "and waste turns paginating. Only use offset/limit when: " +
-        "(a) the file is very large (thousands of lines, e.g. logs, generated code, big datasets), or " +
-        "(b) you genuinely need a single slice and reading the rest would be wasteful. " +
-        "For typical Kotlin/Java/TS/Python source files: call read_file with just `path` and read it all."
+    override val description = "Read a text, image, or PDF file. Reads the WHOLE file by default (2 MB limit). " +
+        "Use offset/limit only for very large files (logs, generated code). " +
+        "For normal source files: just pass `path`."
     override val mode = ToolMode.READ_ONLY
     override val category = ToolCategory.DATA_PRODUCING
+    override val selectionHint =
+        "Read a file. Whole-file by default; use offset/limit only for huge files (logs, generated code). " +
+        "Do NOT re-read a file you just wrote — the write tool result already contains the diff."
 
     override fun validateParams(params: Map<String, Any>) {
         if (params["path"] == null || (params["path"] as? String).isNullOrBlank()) {
@@ -210,6 +209,33 @@ class ReadFileTool(
                 }
             }
 
+            // Binary file detection — must come after image/PDF checks
+            if (isBinaryFile(path, mediaType)) {
+                val duration = (System.currentTimeMillis() - startTime).toInt()
+                val mimeDesc = mediaType ?: "unknown"
+                val output = buildString {
+                    append("Binary file: $pathStr\n")
+                    append("  Type: $mimeDesc\n")
+                    append("  Size: $fileSize bytes (${fileSize / 1024} KB)\n")
+                    append("  Extension: .$fileExtension\n\n")
+                    append("This file contains binary data and cannot be read as text.\n")
+                    append("To process it, use run_code (Python) to decode/analyze it programmatically.")
+                }
+                return ToolResult(
+                    success = true,
+                    output = output,
+                    bytesRead = 0,
+                    durationMs = duration,
+                    metadata = mapOf(
+                        "type" to "binary",
+                        "path" to pathStr,
+                        "media_type" to mimeDesc,
+                        "file_size" to fileSize,
+                        "extension" to fileExtension
+                    )
+                )
+            }
+
             // Read file contents
             val allLines = Files.readAllLines(path)
             val totalLineCount = allLines.size
@@ -346,15 +372,11 @@ class ReadFileTool(
                 ),
                 "offset" to mapOf(
                     "type" to "integer",
-                    "description" to "Start line (1-based). OPTIONAL — omit to read from line 1. " +
-                        "Only set this when you genuinely need a slice of a large file."
+                    "description" to "Start line (1-based). Omit to read from line 1."
                 ),
                 "limit" to mapOf(
                     "type" to "integer",
-                    "description" to "Max lines to read from offset. OPTIONAL — omit to read the WHOLE file. " +
-                        "Default behaviour reads everything (up to the 2 MB sandbox limit). " +
-                        "Only set this for very large files (thousands of lines) where you need a slice. " +
-                        "DO NOT pass small values like 50/100 on normal source files — read the whole file in one call instead."
+                    "description" to "Max lines to read. Omit to read entire file. Only for very large files."
                 ),
                 "page_start" to mapOf(
                     "type" to "integer",
@@ -382,4 +404,42 @@ class ReadFileTool(
     private fun isPdf(path: java.nio.file.Path, mediaType: String?): Boolean {
         return path.fileName.toString().endsWith(".pdf", ignoreCase = true) || mediaType == "application/pdf"
     }
+
+    private val BINARY_MIME_PREFIXES = listOf(
+        "audio/", "video/", "application/octet-stream",
+        "application/zip", "application/gzip", "application/x-tar",
+        "application/x-7z", "application/x-rar",
+        "application/vnd.", "application/x-executable",
+        "model/", "font/"
+    )
+
+    private val BINARY_EXTENSIONS = setOf(
+        "exe", "dll", "so", "dylib", "o", "obj", "bin",
+        "zip", "tar", "gz", "bz2", "7z", "rar", "xz",
+        "mp3", "mp4", "avi", "mov", "wav", "flac", "ogg", "webm",
+        "woff", "woff2", "ttf", "eot",
+        "db", "sqlite", "sqlite3",
+        "class", "jar", "war", "ear",
+        "pyc", "pyo", "pyd",
+        "iso", "img", "dmg"
+    )
+
+    private fun isBinaryFile(path: java.nio.file.Path, mediaType: String?): Boolean {
+        // Layer 1: MIME type
+        if (mediaType != null && BINARY_MIME_PREFIXES.any { mediaType.startsWith(it) }) return true
+
+        // Layer 2: Extension
+        val ext = path.fileName.toString().substringAfterLast('.', "").lowercase()
+        if (ext in BINARY_EXTENSIONS) return true
+
+        // Layer 3: Null-byte heuristic (first 8KB)
+        return try {
+            Files.newInputStream(path).use { stream ->
+                val buf = ByteArray(8192)
+                val read = stream.read(buf)
+                if (read <= 0) return false
+                buf.copyOf(read).any { it == 0.toByte() }
+            }
+        } catch (_: Exception) { false }
+    }
 }
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/RunCodeTool.kt b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/RunCodeTool.kt
index ff98a6d4..0da9b33a 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/RunCodeTool.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/RunCodeTool.kt
@@ -43,6 +43,9 @@ class RunCodeTool(
     override val description = "Execute Python, JavaScript, or Kotlin script inline."
     override val mode = ToolMode.WRITE
     override val category = ToolCategory.DATA_PRODUCING
+    override val selectionHint =
+        "Sandboxed data processing/calculations/API calls (Python/JS/Kotlin). " +
+        "Prefer over run_terminal_command for cross-platform logic — no shell quoting issues."
 
     override fun validateParams(params: Map<String, Any>) {
         val language = params["language"] as? String
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/RunProcessBackgroundTool.kt b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/RunProcessBackgroundTool.kt
new file mode 100644
index 00000000..3013d450
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/RunProcessBackgroundTool.kt
@@ -0,0 +1,66 @@
+package pl.jclab.refio.core.tools.implementations
+
+import pl.jclab.refio.core.services.ProcessManager
+import pl.jclab.refio.core.tools.PathSandbox
+import pl.jclab.refio.core.tools.base.Tool
+import pl.jclab.refio.core.tools.base.ToolCategory
+import pl.jclab.refio.core.tools.base.ToolMode
+import pl.jclab.refio.core.tools.base.ToolResult
+import pl.jclab.refio.core.tools.security.CommandRuleMatcher
+import pl.jclab.refio.core.tools.security.RuleAction
+
+class RunProcessBackgroundTool(
+    private val sandbox: PathSandbox,
+    private val processManager: ProcessManager,
+    private val commandRuleMatcher: CommandRuleMatcher
+) : Tool {
+    override val name = "run_process_background"
+    override val description = "Start a command in the background and return immediately with a process_id. " +
+        "Use monitor_process(process_id) to read output. " +
+        "Use when you need to run long commands (build, test, dev server) without blocking."
+    override val mode = ToolMode.WRITE
+    override val category = ToolCategory.EXECUTION
+    override val selectionHint =
+        "Start a long-running command in the background (build, test, dev server); pair with monitor_process."
+
+    override fun validateParams(params: Map<String, Any>) {
+        val cmd = params["command"] as? String
+        if (cmd.isNullOrBlank()) throw IllegalArgumentException("Parameter 'command' is required")
+    }
+
+    override suspend fun execute(params: Map<String, Any>): ToolResult {
+        val command = params["command"] as? String
+            ?: return ToolResult.error("Missing 'command'")
+
+        val ruleAction = commandRuleMatcher.match(command).action
+        if (ruleAction == RuleAction.BLOCK) {
+            return ToolResult.error("Command blocked by security rules: $command")
+        }
+
+        val workingDir = sandbox.getProjectRoot().toAbsolutePath().toFile()
+        val managed = try {
+            processManager.start(command, workingDir)
+        } catch (e: Exception) {
+            return ToolResult.error("Failed to start process: ${e.message}")
+        }
+
+        return ToolResult(
+            success = true,
+            output = "Process started with id: ${managed.processId}\n" +
+                "Command: $command\n" +
+                "Use monitor_process(process_id=\"${managed.processId}\") to read output.",
+            metadata = mapOf(
+                "process_id" to managed.processId,
+                "command" to command
+            )
+        )
+    }
+
+    override fun getParameterSchema(): Map<String, Any> = mapOf(
+        "type" to "object",
+        "properties" to mapOf(
+            "command" to mapOf("type" to "string", "description" to "Shell command to run in background")
+        ),
+        "required" to listOf("command")
+    )
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/RunTerminalCommandTool.kt b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/RunTerminalCommandTool.kt
index b1827f61..0b51dd00 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/RunTerminalCommandTool.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/RunTerminalCommandTool.kt
@@ -7,7 +7,6 @@ import pl.jclab.refio.core.tools.base.ToolMode
 import pl.jclab.refio.core.tools.base.ToolResult
 import pl.jclab.refio.core.tools.security.CommandLimits
 import pl.jclab.refio.core.tools.security.CommandRuleMatcher
-import pl.jclab.refio.core.tools.security.CommandWhitelist
 import pl.jclab.refio.core.tools.security.RuleAction
 import kotlinx.coroutines.async
 import kotlinx.coroutines.Dispatchers
@@ -25,22 +24,24 @@ private val logger = dualLogger("RunTerminalCommandTool")
  * - command: Shell command to execute
  *
  * Security:
- * - Command whitelist validates allowed programs/arguments
+ * - Command rule matcher (regex-based ALLOW/BLOCK/ASK) validates commands
  * - Execution timeout enforced
  * - Output size limits
  * - Runs in project root directory
  */
 class RunTerminalCommandTool(
     private val sandbox: PathSandbox,
-    private val whitelist: CommandWhitelist,
     private val limits: CommandLimits,
-    private val commandRuleMatcher: CommandRuleMatcher? = null
+    private val commandRuleMatcher: CommandRuleMatcher
 ) : Tool {
 
     override val name = "run_terminal_command"
     override val description = "Execute a shell command in the project directory."
     override val mode = ToolMode.WRITE
     override val category = ToolCategory.EXECUTION
+    override val selectionHint =
+        "OS-level commands: git, gradle, npm, docker, build/test runners. " +
+        "Avoid inline `python -c \"...\"` on Windows — prefer run_code when available."
 
     override fun validateParams(params: Map<String, Any>) {
         if (params["command"] == null || (params["command"] as? String).isNullOrBlank()) {
@@ -61,40 +62,21 @@ class RunTerminalCommandTool(
                 ?.coerceIn(MIN_TIMEOUT_SECONDS, MAX_TIMEOUT_SECONDS)
                 ?: limits.timeoutSeconds
 
-            // Check command rules (new regex-based system)
-            if (commandRuleMatcher != null) {
-                val ruleResult = commandRuleMatcher.match(command)
-                when (ruleResult.action) {
-                    RuleAction.BLOCK -> {
-                        val desc = ruleResult.matchedRule?.description ?: "blocked by security policy"
-                        logger.warn { "Blocked command by rule: reason='$desc', command='$command'" }
-                        return@withContext ToolResult.error("Command blocked: $desc")
-                    }
-                    RuleAction.ALLOW -> {
-                        logger.debug { "Command allowed by rule: ${ruleResult.matchedRule?.description}, command='$command'" }
-                        // Continue to execution
-                    }
-                    RuleAction.ASK -> {
-                        // ASK is handled at TurnToolExecutor level (PermissionLevel.ASK).
-                        // At tool level, we allow execution — the approval already happened.
-                        logger.debug { "Command ASK rule (pre-approved): ${ruleResult.matchedRule?.description}, command='$command'" }
-                    }
+            // Check command rules (regex-based ALLOW/BLOCK/ASK)
+            val ruleResult = commandRuleMatcher.match(command)
+            when (ruleResult.action) {
+                RuleAction.BLOCK -> {
+                    val desc = ruleResult.matchedRule?.description ?: "blocked by security policy"
+                    logger.warn { "Blocked command by rule: reason='$desc', command='$command'" }
+                    return@withContext ToolResult.error("Command blocked: $desc")
                 }
-            } else {
-                // Fallback: legacy whitelist
-                val validation = whitelist.validate(command)
-                if (!validation.allowed) {
-                    logger.warn { "Blocked command by whitelist: reason='${validation.reason}', command='$command'" }
-                    return@withContext ToolResult.error(
-                        "Command not allowed: ${validation.reason ?: "blocked"}"
-                    )
+                RuleAction.ALLOW -> {
+                    logger.debug { "Command allowed by rule: ${ruleResult.matchedRule?.description}, command='$command'" }
                 }
-
-                if (validation.requiresConfirmation) {
-                    logger.warn { "Command requires user confirmation: $command" }
-                    return@withContext ToolResult.error(
-                        "Command requires user confirmation: $command"
-                    )
+                RuleAction.ASK -> {
+                    // ASK is handled at TurnToolExecutor level (PermissionLevel.ASK).
+                    // At tool level, we allow execution — the approval already happened.
+                    logger.debug { "Command ASK rule (pre-approved): ${ruleResult.matchedRule?.description}, command='$command'" }
                 }
             }
 
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/SleepTool.kt b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/SleepTool.kt
new file mode 100644
index 00000000..ee306f40
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/SleepTool.kt
@@ -0,0 +1,60 @@
+package pl.jclab.refio.core.tools.implementations
+
+import kotlinx.coroutines.delay
+import pl.jclab.refio.core.tools.base.Tool
+import pl.jclab.refio.core.tools.base.ToolCategory
+import pl.jclab.refio.core.tools.base.ToolMode
+import pl.jclab.refio.core.tools.base.ToolResult
+
+class SleepTool : Tool {
+    override val name = "sleep"
+    override val description = "Pause execution for a specified number of milliseconds. " +
+        "Use for rate limiting between API calls, or waiting for an external process. " +
+        "Maximum sleep is 30 seconds (30000 ms)."
+    override val mode = ToolMode.READ_ONLY
+    override val category = ToolCategory.SYSTEM
+
+    override fun validateParams(params: Map<String, Any>) {
+        val ms = toLong(params["duration_ms"])
+            ?: throw IllegalArgumentException("Parameter 'duration_ms' is required")
+        if (ms < 0) throw IllegalArgumentException("'duration_ms' must be >= 0")
+        if (ms > MAX_SLEEP_MS) throw IllegalArgumentException("'duration_ms' must be <= $MAX_SLEEP_MS")
+    }
+
+    override suspend fun execute(params: Map<String, Any>): ToolResult {
+        val ms = toLong(params["duration_ms"])
+            ?: return ToolResult.error("Missing required parameter: 'duration_ms'")
+        val clamped = ms.coerceIn(0, MAX_SLEEP_MS)
+        val start = System.currentTimeMillis()
+        delay(clamped)
+        val actual = System.currentTimeMillis() - start
+        return ToolResult(
+            success = true,
+            output = "Slept for ${actual}ms",
+            durationMs = actual.toInt()
+        )
+    }
+
+    override fun getParameterSchema(): Map<String, Any> = mapOf(
+        "type" to "object",
+        "properties" to mapOf(
+            "duration_ms" to mapOf(
+                "type" to "integer",
+                "description" to "Duration in milliseconds (max 30000)"
+            )
+        ),
+        "required" to listOf("duration_ms")
+    )
+
+    private fun toLong(v: Any?): Long? = when (v) {
+        is Long -> v
+        is Int -> v.toLong()
+        is Double -> v.toLong()
+        is String -> v.toLongOrNull()
+        else -> null
+    }
+
+    companion object {
+        const val MAX_SLEEP_MS = 30_000L
+    }
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/TasksTool.kt b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/TasksTool.kt
index b22fbadd..91df44b0 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/TasksTool.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/TasksTool.kt
@@ -27,6 +27,7 @@ Use plan BEFORE starting complex work to organize approach.
 The plan is visible to the orchestrating agent."""
     override val mode = ToolMode.READ_ONLY
     override val category = ToolCategory.SYSTEM
+    override val selectionHint = "Plan/update/list execution steps for the current task (4+ step work)."
 
     override fun getParameterSchema(): Map<String, Any> = mapOf(
         "type" to "object",
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/ThinkTool.kt b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/ThinkTool.kt
index 27805f48..ef138bb5 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/ThinkTool.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/ThinkTool.kt
@@ -50,13 +50,13 @@ class ThinkTool : Tool {
 
     override val name: String = "think"
     override val description: String =
-        "Record a short reasoning step (plan, hypothesis, gap analysis) without performing any action. " +
-            "Use BEFORE acting when: you are about to repeat a tool, just hit an error, are about to write/edit, " +
-            "or the request is ambiguous and you need to enumerate what you don't yet know. " +
-            "This tool has no side effects — it only forces a structured think before the next action. " +
-            "The same thought cannot be recorded twice in a row — repeats are rejected."
+        "Record a reasoning step (plan, hypothesis, gap analysis) without performing any action. " +
+            "Use BEFORE acting when: about to repeat a tool, hit an error, about to write/edit, or request is ambiguous. " +
+            "No side effects. Duplicate thoughts are rejected."
     override val mode: ToolMode = ToolMode.READ_ONLY
     override val category: ToolCategory = ToolCategory.SYSTEM
+    override val selectionHint: String =
+        "Record a reasoning step before retrying a failed call or at a decision point. No side effects."
 
     override fun validateParams(params: Map<String, Any>) {
         val thought = params["thought"] as? String
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/ViewDiffTool.kt b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/ViewDiffTool.kt
index 164209c2..d27b95e6 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/ViewDiffTool.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/ViewDiffTool.kt
@@ -33,6 +33,7 @@ class ViewDiffTool(
     override val description = "Compare two files or a file against provided content."
     override val mode = ToolMode.READ_ONLY
     override val category = ToolCategory.DATA_PRODUCING
+    override val selectionHint = "Compare two files or a file against provided content."
 
     override fun validateParams(params: Map<String, Any>) {
         if (params["file1"] == null || (params["file1"] as? String).isNullOrBlank()) {
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/WebSearchTool.kt b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/WebSearchTool.kt
new file mode 100644
index 00000000..641ee9c8
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/tools/implementations/WebSearchTool.kt
@@ -0,0 +1,217 @@
+package pl.jclab.refio.core.tools.implementations
+
+import io.ktor.client.*
+import io.ktor.client.engine.cio.*
+import io.ktor.client.request.*
+import io.ktor.client.statement.*
+import io.ktor.http.*
+import kotlinx.coroutines.Dispatchers
+import kotlinx.coroutines.withContext
+import pl.jclab.refio.core.logging.dualLogger
+import pl.jclab.refio.core.services.ConfigService
+import pl.jclab.refio.core.tools.base.Tool
+import pl.jclab.refio.core.tools.base.ToolCategory
+import pl.jclab.refio.core.tools.base.ToolMode
+import pl.jclab.refio.core.tools.base.ToolResult
+import pl.jclab.refio.core.utils.GsonInstance
+
+private val logger = dualLogger("WebSearchTool")
+
+class WebSearchTool(
+    private val configService: ConfigService
+) : Tool {
+    override val name = "web_search"
+    override val description = "Search the web and return results with titles, URLs, and snippets. " +
+        "Use when you need current information, documentation, release notes, or answers " +
+        "that are not in the project codebase. " +
+        "Requires 'tools.web_search.provider' and corresponding API key in config."
+    override val mode = ToolMode.READ_ONLY
+    override val category = ToolCategory.DATA_PRODUCING
+    override val selectionHint = "Web search — current info, docs, release notes not in the codebase."
+
+    override fun validateParams(params: Map<String, Any>) {
+        val query = params["query"] as? String
+        if (query.isNullOrBlank()) throw IllegalArgumentException("Parameter 'query' is required")
+    }
+
+    override suspend fun execute(params: Map<String, Any>): ToolResult = withContext(Dispatchers.IO) {
+        val startTime = System.currentTimeMillis()
+        val query = params["query"] as? String
+            ?: return@withContext ToolResult.error("Missing required parameter: 'query'")
+        val maxResults = ((params["max_results"] as? Number)?.toInt() ?: 10).coerceIn(1, 20)
+
+        val provider = getConfig("tools.web_search.provider") ?: "duckduckgo"
+
+        val results: List<SearchResult> = try {
+            when (provider) {
+                "brave" -> {
+                    val apiKey = getConfig("tools.web_search.brave_api_key")
+                        ?: return@withContext ToolResult.error(
+                            "Brave Search API key not configured. " +
+                                "Add 'tools.web_search.brave_api_key' to ~/.refio/config.yaml"
+                        )
+                    searchBrave(query, maxResults, apiKey)
+                }
+                "serpapi" -> {
+                    val apiKey = getConfig("tools.web_search.serpapi_key")
+                        ?: return@withContext ToolResult.error("SerpAPI key not configured.")
+                    searchSerpApi(query, maxResults, apiKey)
+                }
+                "duckduckgo" -> searchDuckDuckGo(query, maxResults)
+                else -> return@withContext ToolResult.error("Unknown search provider: $provider")
+            }
+        } catch (e: Exception) {
+            logger.error(e) { "Web search failed: ${e.message}" }
+            return@withContext ToolResult.error("Web search failed: ${e.message}")
+        }
+
+        if (results.isEmpty()) {
+            return@withContext ToolResult(
+                success = true,
+                output = "No results found for: $query",
+                durationMs = elapsed(startTime)
+            )
+        }
+
+        val output = buildString {
+            appendLine("Search results for: \"$query\"")
+            appendLine("Provider: $provider | Found: ${results.size} results")
+            appendLine()
+            results.forEachIndexed { i, r ->
+                appendLine("${i + 1}. ${r.title}")
+                appendLine("   URL: ${r.url}")
+                if (r.snippet.isNotBlank()) appendLine("   ${r.snippet}")
+                appendLine()
+            }
+        }
+
+        ToolResult(
+            success = true,
+            output = output,
+            durationMs = elapsed(startTime),
+            metadata = mapOf(
+                "query" to query,
+                "provider" to provider,
+                "result_count" to results.size,
+                "results" to results.map { mapOf("title" to it.title, "url" to it.url, "snippet" to it.snippet) }
+            )
+        )
+    }
+
+    private suspend fun searchBrave(query: String, maxResults: Int, apiKey: String): List<SearchResult> {
+        val client = HttpClient(CIO) { engine { requestTimeout = 15_000 } }
+        try {
+            val response = client.get("https://api.search.brave.com/res/v1/web/search") {
+                parameter("q", query)
+                parameter("count", maxResults)
+                header("Accept", "application/json")
+                header("X-Subscription-Token", apiKey)
+            }
+            val responseText = response.bodyAsText()
+            if (!response.status.isSuccess()) {
+                logger.warn { "Brave Search HTTP ${response.status}: ${responseText.take(200)}" }
+                return emptyList()
+            }
+            val body = GsonInstance.gson.fromJson(responseText, Map::class.java)
+            @Suppress("UNCHECKED_CAST")
+            val webResults = (body["web"] as? Map<*, *>)?.get("results") as? List<Map<*, *>>
+                ?: emptyList()
+            return webResults.map { r ->
+                SearchResult(
+                    title = r["title"] as? String ?: "",
+                    url = r["url"] as? String ?: "",
+                    snippet = r["description"] as? String ?: ""
+                )
+            }
+        } finally {
+            client.close()
+        }
+    }
+
+    private suspend fun searchSerpApi(query: String, maxResults: Int, apiKey: String): List<SearchResult> {
+        val client = HttpClient(CIO) { engine { requestTimeout = 15_000 } }
+        try {
+            val response = client.get("https://serpapi.com/search") {
+                parameter("q", query)
+                parameter("num", maxResults)
+                parameter("api_key", apiKey)
+                parameter("engine", "google")
+            }
+            if (!response.status.isSuccess()) return emptyList()
+            val body = GsonInstance.gson.fromJson(response.bodyAsText(), Map::class.java)
+            @Suppress("UNCHECKED_CAST")
+            val organicResults = body["organic_results"] as? List<Map<*, *>> ?: emptyList()
+            return organicResults.take(maxResults).map { r ->
+                SearchResult(
+                    title = r["title"] as? String ?: "",
+                    url = r["link"] as? String ?: "",
+                    snippet = r["snippet"] as? String ?: ""
+                )
+            }
+        } finally {
+            client.close()
+        }
+    }
+
+    private suspend fun searchDuckDuckGo(query: String, maxResults: Int): List<SearchResult> {
+        val client = HttpClient(CIO) { engine { requestTimeout = 15_000 } }
+        try {
+            val response = client.get("https://api.duckduckgo.com/") {
+                parameter("q", query)
+                parameter("format", "json")
+                parameter("no_html", "1")
+                parameter("skip_disambig", "1")
+            }
+            if (!response.status.isSuccess()) return emptyList()
+            val body = GsonInstance.gson.fromJson(response.bodyAsText(), Map::class.java)
+            val results = mutableListOf<SearchResult>()
+
+            val abstractText = body["AbstractText"] as? String ?: ""
+            val abstractUrl = body["AbstractURL"] as? String ?: ""
+            val abstractTitle = body["Heading"] as? String ?: query
+            if (abstractText.isNotBlank() && abstractUrl.isNotBlank()) {
+                results.add(SearchResult(abstractTitle, abstractUrl, abstractText))
+            }
+
+            @Suppress("UNCHECKED_CAST")
+            val related = body["RelatedTopics"] as? List<Map<*, *>> ?: emptyList()
+            related.take(maxResults - results.size).forEach { topic ->
+                val text = topic["Text"] as? String ?: return@forEach
+                val url = (topic["FirstURL"] as? String) ?: return@forEach
+                if (text.isNotBlank() && url.isNotBlank()) {
+                    results.add(SearchResult(text.take(80), url, text))
+                }
+            }
+            return results
+        } finally {
+            client.close()
+        }
+    }
+
+    private fun getConfig(key: String): String? {
+        return try {
+            configService.get(key)
+        } catch (_: Exception) {
+            null
+        }
+    }
+
+    private fun elapsed(start: Long) = (System.currentTimeMillis() - start).toInt()
+
+    override fun getParameterSchema(): Map<String, Any> = mapOf(
+        "type" to "object",
+        "properties" to mapOf(
+            "query" to mapOf(
+                "type" to "string",
+                "description" to "Search query"
+            ),
+            "max_results" to mapOf(
+                "type" to "integer",
+                "description" to "Max results to return (1-20, default: 10)"
+            )
+        ),
+        "required" to listOf("query")
+    )
+
+    data class SearchResult(val title: String, val url: String, val snippet: String)
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/tools/security/CommandDenylist.kt b/core/src/main/kotlin/pl/jclab/refio/core/tools/security/CommandDenylist.kt
deleted file mode 100644
index 3015f1f1..00000000
--- a/core/src/main/kotlin/pl/jclab/refio/core/tools/security/CommandDenylist.kt
+++ /dev/null
@@ -1,207 +0,0 @@
-package pl.jclab.refio.core.tools.security
-
-import pl.jclab.refio.core.logging.dualLogger
-
-private val logger = dualLogger("CommandDenylist")
-
-/**
- * Security denylist for terminal commands.
- *
- * Blocks truly dangerous commands that could:
- * - Destroy filesystems or partitions (rm -rf /, format, dd, mkfs)
- * - Compromise system integrity (passwd, useradd, shutdown)
- * - Read highly sensitive system files (/etc/shadow, .ssh private keys)
- * - Cause resource exhaustion (fork bombs)
- *
- * Development commands (build, test, package install, docker, git, ssh)
- * are NOT blocked here — use CommandWhitelist for fine-grained control.
- */
-class CommandDenylist(
-    private val customBlockedPatterns: List<String> = emptyList()
-) {
-    /**
-     * Default blocked command patterns — only truly destructive operations.
-     *
-     * These are compiled as regex patterns with word boundary checks where appropriate.
-     * Patterns use regex syntax — special chars must be escaped.
-     */
-    private val defaultBlockedPatterns = listOf(
-        // ── Destructive filesystem operations ──
-        // Match rm with any combination of -r/-f flags targeting root
-        """rm\s+(-[a-z]*r[a-z]*\s+-[a-z]*f[a-z]*|-[a-z]*f[a-z]*\s+-[a-z]*r[a-z]*|-[a-z]*rf[a-z]*|-[a-z]*fr[a-z]*|--recursive\s+--force|--force\s+--recursive)\s+/""",
-        """rmdir\s+/s\s+/q\s+c:[/\\]""",
-        """del\s+/f\s+/s\s+/q\s+c:[/\\]""",
-        """format\s+c:""",
-        """\bmkfs\b""",
-        """\bdd\s+if=""",
-        """\bdd\s+of=/dev""",
-
-        // ── System-level modifications ──
-        """chmod\s+777\s+/""",
-        """chown\s+-[rR]""",
-        """chgrp\s+-[rR]""",
-
-        // ── Download-and-execute patterns (pipe to shell) ──
-        """curl\s.*\|\s*(sh|bash|zsh|dash)\b""",
-        """wget\s.*\|\s*(sh|bash|zsh|dash)\b""",
-
-        // ── Privilege escalation ──
-        """\bsudo\b""",
-        """\bsu\s+root\b""",
-        """\bsu\s+-\s*$""",
-
-        // ── System administration ──
-        """\bpasswd\b""",
-        """\buseradd\b""",
-        """\buserdel\b""",
-        """\bgroupadd\b""",
-        """\bgroupdel\b""",
-        """\breboot\b""",
-        """\bshutdown\b""",
-        """\bhalt\b""",
-        """\bpoweroff\b""",
-        """\binit\s+[06]\b""",
-
-        // ── Sensitive file access ──
-        """\bcat\s+/etc/shadow""",
-        """\bcat\s+/etc/gshadow""",
-        """\bcat\s.*\.ssh/id_""",
-        """\bcat\s.*\.ssh/.*_key""",
-        """\bcat\s.*\.aws/credentials""",
-        """\bcat\s.*\.gnupg/""",
-
-        // ── System package managers (modify OS) ──
-        """\bapt-get\s+install\b""",
-        """\bapt\s+install\b""",
-        """\byum\s+install\b""",
-        """\bdnf\s+install\b""",
-        """\bpacman\s+-[sS]""",
-        """\bbrew\s+install\b""",
-        """\bchoco\s+install\b""",
-        """\bwinget\s+install\b""",
-        """\bsnap\s+install\b""",
-
-        // ── Encoding tricks / code injection ──
-        """base64\s+-d\s.*\|\s*(sh|bash)\b""",
-        """\beval\s""",
-
-        // ── Fork bombs and resource exhaustion ──
-        """:\(\)\s*\{""",
-        """\bwhile\s+(true|:)\s*;\s*do\b""",
-        """\bfor\s*\(\s*;\s*;\s*\)""",
-
-        // ── Command substitution / shell escapes to bypass filters ──
-        """\$\(.*\b(rm|dd|mkfs|passwd|shutdown|reboot)\b""",
-        """`.*\b(rm|dd|mkfs|passwd|shutdown|reboot)\b""",
-
-        // ── Registry / system config (Windows) ──
-        """\breg\s+delete\b""",
-        """\breg\s+add\s.*\bhklm\b""",
-        """\bbcdedit\b""",
-        """\bdiskpart\b"""
-    )
-
-    private val allBlockedPatterns = defaultBlockedPatterns + customBlockedPatterns
-
-    /** Compiled regex patterns for efficient matching */
-    private val compiledPatterns: List<Pair<String, Regex>> = allBlockedPatterns.map { pattern ->
-        pattern to Regex(pattern, RegexOption.IGNORE_CASE)
-    }
-
-    /**
-     * Check if command is blocked by denylist.
-     *
-     * Uses regex matching with word boundaries to avoid false positives
-     * (e.g. "evaluation" no longer matches the "eval" pattern).
-     *
-     * @param command Command string to check
-     * @return true if command matches any blocked pattern
-     */
-    fun isBlocked(command: String): Boolean {
-        val normalizedCommand = normalizeCommand(command)
-
-        for ((pattern, regex) in compiledPatterns) {
-            if (regex.containsMatchIn(normalizedCommand)) {
-                logger.warn { "Blocked dangerous command: $command (matched pattern: $pattern)" }
-                return true
-            }
-        }
-
-        return false
-    }
-
-    /**
-     * Normalize command string before matching:
-     * - Collapse multiple spaces/tabs to single space
-     * - Trim whitespace
-     * - Lowercase
-     */
-    private fun normalizeCommand(command: String): String {
-        return command.trim().replace(Regex("""\s+"""), " ").lowercase()
-    }
-
-    /**
-     * Get list of all blocked patterns (for documentation)
-     */
-    fun getBlockedPatterns(): List<String> {
-        return allBlockedPatterns
-    }
-
-    companion object {
-        /**
-         * Default denylist instance
-         */
-        val DEFAULT = CommandDenylist()
-
-        /**
-         * Strict denylist with additional restrictions
-         */
-        fun strict(): CommandDenylist {
-            val additionalPatterns = listOf(
-                // Block all network operations (with word boundaries)
-                """\bcurl\b""", """\bwget\b""", """\bnc\b""", """\bnetcat\b""",
-
-                // Block all remote git operations
-                """\bgit\s+push\b""", """\bgit\s+pull\b""", """\bgit\s+clone\b""",
-
-                // Block docker container execution
-                """\bdocker\s+run\b""", """\bdocker\s+exec\b""", """\bdocker-compose\b""",
-
-                // Block SSH and remote access
-                """\bssh\b""", """\bscp\b""", """\brsync\b""", """\bftp\b""", """\btelnet\b"""
-            )
-
-            return CommandDenylist(additionalPatterns)
-        }
-    }
-}
-
-/**
- * Command execution limits
- */
-data class CommandLimits(
-    /**
-     * Maximum command execution time in seconds
-     */
-    val timeoutSeconds: Long = 120,
-
-    /**
-     * Maximum output size in characters
-     */
-    val maxOutputSize: Int = 200_000, // 200 KB
-
-    /**
-     * Maximum number of concurrent commands
-     */
-    val maxConcurrentCommands: Int = 5
-) {
-    companion object {
-        val DEFAULT = CommandLimits()
-
-        val STRICT = CommandLimits(
-            timeoutSeconds = 10,
-            maxOutputSize = 10_000,
-            maxConcurrentCommands = 1
-        )
-    }
-}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/tools/security/CommandLimits.kt b/core/src/main/kotlin/pl/jclab/refio/core/tools/security/CommandLimits.kt
new file mode 100644
index 00000000..1c0847ae
--- /dev/null
+++ b/core/src/main/kotlin/pl/jclab/refio/core/tools/security/CommandLimits.kt
@@ -0,0 +1,25 @@
+package pl.jclab.refio.core.tools.security
+
+/**
+ * Execution limits for terminal/background commands.
+ */
+data class CommandLimits(
+    /**
+     * Default execution timeout in seconds.
+     */
+    val timeoutSeconds: Long = 120,
+
+    /**
+     * Maximum captured output size in characters (stdout+stderr combined).
+     */
+    val maxOutputSize: Int = 200_000
+) {
+    companion object {
+        val DEFAULT = CommandLimits()
+
+        val STRICT = CommandLimits(
+            timeoutSeconds = 30,
+            maxOutputSize = 50_000
+        )
+    }
+}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/tools/security/CommandRule.kt b/core/src/main/kotlin/pl/jclab/refio/core/tools/security/CommandRule.kt
index 897e034f..346b4171 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/tools/security/CommandRule.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/tools/security/CommandRule.kt
@@ -1,11 +1,12 @@
 package pl.jclab.refio.core.tools.security
 
-import pl.jclab.refio.core.logging.dualLogger
-
-private val logger = dualLogger("CommandRuleMatcher")
-
 /**
- * Simple regex-based command rule. Replaces the complex AllowedCommand model.
+ * Simple regex-based command rule.
+ *
+ * Regex is compiled eagerly in the init block — an invalid pattern fails **at
+ * construction time**, not silently during matching. Callers loading rules from
+ * config must be prepared to catch [IllegalArgumentException] and refuse startup
+ * with a clear message (see `ConfigService`/`ToolPermissionsService`).
  *
  * @param pattern Regex matching the full command string
  * @param action What to do when command matches
@@ -15,7 +16,16 @@ data class CommandRule(
     val pattern: String,
     val action: RuleAction,
     val description: String = ""
-)
+) {
+    val compiledRegex: Regex = try {
+        Regex(pattern, RegexOption.IGNORE_CASE)
+    } catch (e: Exception) {
+        throw IllegalArgumentException(
+            "Invalid command rule regex '$pattern' (action=$action, description='$description'): ${e.message}",
+            e
+        )
+    }
+}
 
 enum class RuleAction {
     ALLOW,  // Auto-execute without asking
@@ -42,13 +52,8 @@ class CommandRuleMatcher(private val rules: List<CommandRule>) {
     )
 
     private val compiled: Map<RuleAction, List<CompiledRule>> by lazy {
-        rules.mapNotNull { rule ->
-            try {
-                CompiledRule(rule, Regex(rule.pattern, RegexOption.IGNORE_CASE))
-            } catch (e: Exception) {
-                logger.warn { "Invalid command rule regex: '${rule.pattern}' — ${e.message}" }
-                null
-            }
+        rules.map { rule ->
+            CompiledRule(rule, rule.compiledRegex)
         }.groupBy { it.rule.action }
     }
 
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/tools/security/CommandRuleDefaults.kt b/core/src/main/kotlin/pl/jclab/refio/core/tools/security/CommandRuleDefaults.kt
index e1fcabaa..e09d443b 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/tools/security/CommandRuleDefaults.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/tools/security/CommandRuleDefaults.kt
@@ -1,8 +1,7 @@
 package pl.jclab.refio.core.tools.security
 
 /**
- * Default command rules converted from AllowedCommand model.
- * Uses regex-based matching instead of the complex 8-field AllowedCommand.
+ * Default command rules. Regex-based ALLOW/BLOCK/ASK matcher for terminal commands.
  */
 object CommandRuleDefaults {
 
@@ -44,40 +43,98 @@ object CommandRuleDefaults {
     )
 
     /**
-     * Build tools, package managers, and development utilities — always allowed.
+     * Build tools, package managers, VCS, and common dev utilities — always allowed.
+     *
+     * Format: list of (program name, description). Aliases handled as separate entries.
+     * Network/privilege-sensitive tools (docker, kubectl, ssh, scp, rsync, wget, sudo, su,
+     * systemctl, service) are intentionally absent here — they're covered by ASK_RULES.
      */
-    /** Programs managed by explicit ASK rules — exclude from auto-generated ALLOW rules */
-    private val ASK_MANAGED_PROGRAMS = setOf(
-        "docker", "kubectl", "ssh", "scp", "rsync", "wget", "sudo", "su",
-        "systemctl", "service"
+    private val ALLOW_PROGRAMS: List<Pair<String, String>> = listOf(
+        // Version control
+        "git" to "Git version control",
+        // Build tools — Gradle / Maven / Make
+        "gradle" to "Gradle build tool",
+        "gradlew" to "Gradle wrapper",
+        "gradlew.bat" to "Gradle wrapper (Windows)",
+        "./gradlew" to "Gradle wrapper (relative)",
+        "mvn" to "Maven build tool",
+        "mvnw" to "Maven wrapper",
+        "make" to "Make build tool",
+        "cmake" to "CMake build tool",
+        "ninja" to "Ninja build tool",
+        // Node / JS ecosystem
+        "node" to "Node.js runtime",
+        "npm" to "npm package manager",
+        "npx" to "npm package runner",
+        "yarn" to "Yarn package manager",
+        "pnpm" to "pnpm package manager",
+        "tsc" to "TypeScript compiler",
+        "deno" to "Deno runtime",
+        "bun" to "Bun runtime",
+        // Python
+        "python" to "Python interpreter",
+        "python3" to "Python 3 interpreter",
+        "pip" to "Python package manager",
+        "pip3" to "Python 3 package manager",
+        "pytest" to "Python test runner",
+        "poetry" to "Python dependency manager",
+        "uv" to "Python installer/resolver",
+        // JVM / Kotlin
+        "java" to "Java runtime",
+        "javac" to "Java compiler",
+        "kotlin" to "Kotlin runtime",
+        "kotlinc" to "Kotlin compiler",
+        // Other languages
+        "go" to "Go toolchain",
+        "cargo" to "Rust package manager",
+        "rustc" to "Rust compiler",
+        "ruby" to "Ruby interpreter",
+        "bundle" to "Ruby bundler",
+        "php" to "PHP interpreter",
+        "composer" to "PHP dependency manager",
+        "dotnet" to ".NET CLI",
+        // Read-only filesystem inspection
+        "ls" to "List directory",
+        "dir" to "List directory (Windows)",
+        "cat" to "Print file contents",
+        "type" to "Print file contents (Windows)",
+        "head" to "Print file head",
+        "tail" to "Print file tail",
+        "wc" to "Word count",
+        "find" to "Find files",
+        "grep" to "Text search",
+        "rg" to "Ripgrep search",
+        "fd" to "fd-find",
+        "pwd" to "Print working directory",
+        "cd" to "Change directory",
+        "tree" to "Print directory tree",
+        "stat" to "File metadata",
+        "file" to "File type detection",
+        // Read-only git helpers often typed explicitly
+        "gh" to "GitHub CLI (non-destructive ops via ASK for delete)",
+        "hg" to "Mercurial",
+        // Environment
+        "env" to "Print environment",
+        "which" to "Locate executable",
+        "where" to "Locate executable (Windows)"
     )
 
     val ALLOW_RULES: List<CommandRule> by lazy {
-        val rules = mutableListOf<CommandRule>()
-
-        // Convert all AllowedCommand entries (that don't require confirmation
-        // and aren't managed by explicit ASK rules) into ALLOW rules
-        for (cmd in CommandWhitelistDefaults.DEFAULT_COMMANDS) {
-            if (cmd.requireConfirmation) continue
-            if (cmd.program in ASK_MANAGED_PROGRAMS) continue
-
-            val programs = listOf(cmd.program) + cmd.aliases
-            for (prog in programs) {
-                val escapedProg = Regex.escape(prog)
-                rules.add(CommandRule(
-                    pattern = "^$escapedProg(\\s+.*)?$",
-                    action = RuleAction.ALLOW,
-                    description = cmd.description
-                ))
-            }
-        }
-
-        // Common utilities not in AllowedCommand but safe
+        val rules = ALLOW_PROGRAMS.map { (prog, desc) ->
+            CommandRule(
+                pattern = "^${Regex.escape(prog)}(\\s+.*)?$",
+                action = RuleAction.ALLOW,
+                description = desc
+            )
+        }.toMutableList()
+
+        // Primitive utilities
         rules.addAll(listOf(
             CommandRule("^echo(\\s+.*)?$", RuleAction.ALLOW, "Echo text"),
             CommandRule("^printf(\\s+.*)?$", RuleAction.ALLOW, "Print formatted"),
             CommandRule("^true$", RuleAction.ALLOW, "Always succeed"),
             CommandRule("^false$", RuleAction.ALLOW, "Always fail"),
+            CommandRule("^exit(\\s+\\d+)?$", RuleAction.ALLOW, "Exit with code")
         ))
 
         rules
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/tools/security/CommandWhitelist.kt b/core/src/main/kotlin/pl/jclab/refio/core/tools/security/CommandWhitelist.kt
deleted file mode 100644
index 84b86f63..00000000
--- a/core/src/main/kotlin/pl/jclab/refio/core/tools/security/CommandWhitelist.kt
+++ /dev/null
@@ -1,298 +0,0 @@
-package pl.jclab.refio.core.tools.security
-
-import pl.jclab.refio.core.logging.dualLogger
-import kotlin.text.RegexOption.IGNORE_CASE
-
-private val logger = dualLogger("CommandWhitelist")
-
-data class ParsedCommand(
-    val program: String,
-    val subcommand: String?,
-    val args: List<String>,
-    val rawCommand: String
-)
-
-class CommandWhitelist(
-    private val config: CommandWhitelistConfig,
-    private val denylist: CommandDenylist
-) {
-    data class ValidationResult(
-        val allowed: Boolean,
-        val reason: String? = null,
-        val requiresConfirmation: Boolean = false
-    )
-
-    private val globalBlockedRegex: List<Regex> = config.globalBlockedPatterns.mapNotNull { pattern ->
-        runCatching { Regex(pattern, IGNORE_CASE) }
-            .onFailure { logger.warn { "Invalid terminal whitelist regex pattern: '$pattern'" } }
-            .getOrNull()
-    }
-
-    fun validate(rawCommand: String): ValidationResult {
-        if (rawCommand.isBlank()) {
-            return ValidationResult(allowed = false, reason = "Empty command")
-        }
-
-        if (!config.enabled) {
-            return validateWithDenylistFallback(rawCommand, "Whitelist disabled")
-        }
-
-        val blockedByGlobalPattern = globalBlockedRegex.firstOrNull { it.containsMatchIn(rawCommand) }
-        if (blockedByGlobalPattern != null) {
-            return ValidationResult(
-                allowed = false,
-                reason = "Matches blocked pattern: ${blockedByGlobalPattern.pattern}"
-            )
-        }
-
-        val chain = runCatching { parseCommandChain(rawCommand) }.getOrElse { error ->
-            return ValidationResult(allowed = false, reason = "Cannot parse command: ${error.message}")
-        }
-
-        if (chain.isEmpty()) {
-            return ValidationResult(allowed = false, reason = "No executable command found")
-        }
-
-        var requiresConfirmation = false
-        for (parsed in chain) {
-            val result = validateSingleCommand(parsed)
-            if (!result.allowed) {
-                return result
-            }
-            requiresConfirmation = requiresConfirmation || result.requiresConfirmation
-        }
-
-        return ValidationResult(allowed = true, requiresConfirmation = requiresConfirmation)
-    }
-
-    fun parseCommand(rawCommand: String): ParsedCommand {
-        val tokens = tokenize(rawCommand)
-        if (tokens.isEmpty()) {
-            throw IllegalArgumentException("Command is empty")
-        }
-
-        val normalizedProgram = normalizeProgram(tokens.first())
-        if (normalizedProgram.isBlank()) {
-            throw IllegalArgumentException("Program name is empty")
-        }
-
-        val args = tokens.drop(1)
-        return ParsedCommand(
-            program = normalizedProgram,
-            subcommand = args.firstOrNull(),
-            args = args,
-            rawCommand = rawCommand.trim()
-        )
-    }
-
-    fun parseCommandChain(raw: String): List<ParsedCommand> {
-        val segments = splitByControlOperators(raw)
-        return segments.map { parseCommand(it) }
-    }
-
-    private fun validateSingleCommand(parsed: ParsedCommand): ValidationResult {
-        val allowedCommand = findAllowedCommand(parsed.program)
-        if (allowedCommand == null) {
-            return when (config.mode) {
-                WhitelistMode.WHITELIST_ONLY -> ValidationResult(
-                    allowed = false,
-                    reason = "Program '${parsed.program}' is not on whitelist"
-                )
-
-                WhitelistMode.WHITELIST_PLUS_DENY -> validateWithDenylistFallback(
-                    parsed.rawCommand,
-                    "Program '${parsed.program}' is not on whitelist"
-                )
-            }
-        }
-
-        if (parsed.args.size > allowedCommand.maxArgs) {
-            return ValidationResult(
-                allowed = false,
-                reason = "Too many arguments for '${allowedCommand.program}' (${parsed.args.size} > ${allowedCommand.maxArgs})"
-            )
-        }
-
-        val blockedFlag = allowedCommand.blockedFlags.firstOrNull { blocked ->
-            parsed.args.any { it.equals(blocked, ignoreCase = true) }
-        }
-        if (blockedFlag != null) {
-            return ValidationResult(
-                allowed = false,
-                reason = "Blocked flag '$blockedFlag' for '${allowedCommand.program}'"
-            )
-        }
-
-        val blockedSubcommand = allowedCommand.blockedSubcommands.firstOrNull { blocked ->
-            matchesSubcommandPhrase(parsed.args, blocked)
-        }
-        if (blockedSubcommand != null) {
-            return ValidationResult(
-                allowed = false,
-                reason = "Blocked subcommand '$blockedSubcommand' for '${allowedCommand.program}'"
-            )
-        }
-
-        val blockedArgPattern = allowedCommand.blockedArgPatterns.firstOrNull { pattern ->
-            val regex = runCatching { Regex(pattern, IGNORE_CASE) }.getOrNull()
-            regex?.let { parsed.args.any(it::matches) } == true
-        }
-        if (blockedArgPattern != null) {
-            return ValidationResult(
-                allowed = false,
-                reason = "Blocked argument pattern '$blockedArgPattern' for '${allowedCommand.program}'"
-            )
-        }
-
-        if (allowedCommand.allowedSubcommands.isNotEmpty()) {
-            val matchesAllowed = allowedCommand.allowedSubcommands.any { allowed ->
-                matchesSubcommandPhrase(parsed.args, allowed)
-            }
-            if (!matchesAllowed) {
-                return ValidationResult(
-                    allowed = false,
-                    reason = "Subcommand not allowed for '${allowedCommand.program}'"
-                )
-            }
-        }
-
-        return ValidationResult(
-            allowed = true,
-            requiresConfirmation = allowedCommand.requireConfirmation
-        )
-    }
-
-    private fun validateWithDenylistFallback(rawCommand: String, prefixReason: String): ValidationResult {
-        return if (denylist.isBlocked(rawCommand)) {
-            ValidationResult(
-                allowed = false,
-                reason = "$prefixReason and denylist blocked command"
-            )
-        } else {
-            ValidationResult(allowed = true)
-        }
-    }
-
-    private fun findAllowedCommand(program: String): AllowedCommand? {
-        return config.allowedCommands.firstOrNull { allowed ->
-            val names = listOf(allowed.program) + allowed.aliases
-            names.any { normalizeProgram(it) == program }
-        }
-    }
-
-    private fun matchesSubcommandPhrase(args: List<String>, phrase: String): Boolean {
-        val phraseTokens = tokenize(phrase)
-        if (args.isEmpty() || phraseTokens.isEmpty() || args.size < phraseTokens.size) {
-            return false
-        }
-
-        for (start in 0..(args.size - phraseTokens.size)) {
-            val matchesAtStart = phraseTokens.indices.all { offset ->
-                args[start + offset].equals(phraseTokens[offset], ignoreCase = true)
-            }
-            if (matchesAtStart) {
-                return true
-            }
-        }
-        return false
-    }
-
-    private fun splitByControlOperators(raw: String): List<String> {
-        val segments = mutableListOf<String>()
-        val current = StringBuilder()
-
-        var inSingleQuote = false
-        var inDoubleQuote = false
-        var inBackticks = false
-
-        var i = 0
-        while (i < raw.length) {
-            val char = raw[i]
-
-            when (char) {
-                '\'' -> if (!inDoubleQuote && !inBackticks) inSingleQuote = !inSingleQuote
-                '"' -> if (!inSingleQuote && !inBackticks) inDoubleQuote = !inDoubleQuote
-                '`' -> if (!inSingleQuote && !inDoubleQuote) inBackticks = !inBackticks
-            }
-
-            if (!inSingleQuote && !inDoubleQuote && !inBackticks) {
-                val twoChars = raw.substring(i, minOf(i + 2, raw.length))
-                if (twoChars == "&&" || twoChars == "||") {
-                    addSegment(segments, current.toString())
-                    current.clear()
-                    i += 2
-                    continue
-                }
-                if (char == '|' || char == ';') {
-                    addSegment(segments, current.toString())
-                    current.clear()
-                    i++
-                    continue
-                }
-            }
-
-            current.append(char)
-            i++
-        }
-
-        addSegment(segments, current.toString())
-        return segments
-    }
-
-    private fun tokenize(raw: String): List<String> {
-        val tokens = mutableListOf<String>()
-        val current = StringBuilder()
-
-        var inSingleQuote = false
-        var inDoubleQuote = false
-
-        for (char in raw) {
-            when (char) {
-                '\'' -> if (!inDoubleQuote) {
-                    inSingleQuote = !inSingleQuote
-                    continue
-                }
-
-                '"' -> if (!inSingleQuote) {
-                    inDoubleQuote = !inDoubleQuote
-                    continue
-                }
-            }
-
-            if (!inSingleQuote && !inDoubleQuote && char.isWhitespace()) {
-                if (current.isNotEmpty()) {
-                    tokens.add(current.toString())
-                    current.clear()
-                }
-                continue
-            }
-
-            current.append(char)
-        }
-
-        if (current.isNotEmpty()) {
-            tokens.add(current.toString())
-        }
-
-        return tokens
-    }
-
-    private fun addSegment(segments: MutableList<String>, segment: String) {
-        val trimmed = segment.trim()
-        if (trimmed.isNotEmpty()) {
-            segments.add(trimmed)
-        }
-    }
-
-    private fun normalizeProgram(raw: String): String {
-        val unquoted = raw.trim().trim('"', '\'')
-        val slashesNormalized = unquoted.replace('\\', '/')
-        val fileName = slashesNormalized.substringAfterLast('/').lowercase()
-        return when {
-            fileName.endsWith(".exe") -> fileName.removeSuffix(".exe")
-            fileName.endsWith(".bat") -> fileName.removeSuffix(".bat")
-            fileName.endsWith(".cmd") -> fileName.removeSuffix(".cmd")
-            else -> fileName
-        }
-    }
-}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/tools/security/CommandWhitelistConfig.kt b/core/src/main/kotlin/pl/jclab/refio/core/tools/security/CommandWhitelistConfig.kt
deleted file mode 100644
index 3fc0a2e6..00000000
--- a/core/src/main/kotlin/pl/jclab/refio/core/tools/security/CommandWhitelistConfig.kt
+++ /dev/null
@@ -1,25 +0,0 @@
-package pl.jclab.refio.core.tools.security
-
-data class CommandWhitelistConfig(
-    val enabled: Boolean = true,
-    val mode: WhitelistMode = WhitelistMode.WHITELIST_ONLY,
-    val allowedCommands: List<AllowedCommand> = CommandWhitelistDefaults.DEFAULT_COMMANDS,
-    val globalBlockedPatterns: List<String> = CommandWhitelistDefaults.DEFAULT_BLOCKED_PATTERNS
-)
-
-enum class WhitelistMode {
-    WHITELIST_ONLY,
-    WHITELIST_PLUS_DENY
-}
-
-data class AllowedCommand(
-    val program: String,
-    val description: String = "",
-    val aliases: List<String> = emptyList(),
-    val blockedFlags: List<String> = emptyList(),
-    val blockedSubcommands: List<String> = emptyList(),
-    val blockedArgPatterns: List<String> = emptyList(),
-    val allowedSubcommands: List<String> = emptyList(),
-    val maxArgs: Int = 50,
-    val requireConfirmation: Boolean = false
-)
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/tools/security/CommandWhitelistDefaults.kt b/core/src/main/kotlin/pl/jclab/refio/core/tools/security/CommandWhitelistDefaults.kt
deleted file mode 100644
index b03cbf5a..00000000
--- a/core/src/main/kotlin/pl/jclab/refio/core/tools/security/CommandWhitelistDefaults.kt
+++ /dev/null
@@ -1,806 +0,0 @@
-package pl.jclab.refio.core.tools.security
-
-object CommandWhitelistDefaults {
-    val DEFAULT_COMMANDS = listOf(
-        // ── Build Systems ──────────────────────────────────────────────
-        AllowedCommand(
-            program = "gradle",
-            aliases = listOf("gradlew", "gradlew.bat", "./gradlew"),
-            description = "Gradle build system",
-            blockedFlags = listOf("--init-script")
-        ),
-        AllowedCommand(
-            program = "mvn",
-            aliases = listOf("mvnw", "mvnw.cmd", "./mvnw"),
-            description = "Maven build system"
-        ),
-        AllowedCommand(
-            program = "make",
-            aliases = listOf("gmake"),
-            description = "Make build system"
-        ),
-        AllowedCommand(
-            program = "cmake",
-            description = "CMake build system"
-        ),
-        AllowedCommand(
-            program = "sbt",
-            description = "Scala Build Tool"
-        ),
-        AllowedCommand(
-            program = "ant",
-            description = "Apache Ant build system"
-        ),
-        AllowedCommand(
-            program = "bazel",
-            description = "Bazel build system",
-            blockedSubcommands = listOf("clean --expunge")
-        ),
-        AllowedCommand(
-            program = "msbuild",
-            description = "Microsoft Build Engine"
-        ),
-
-        // ── JavaScript/TypeScript Ecosystem ────────────────────────────
-        AllowedCommand(
-            program = "npm",
-            aliases = listOf("npx"),
-            description = "Node.js package manager",
-            blockedSubcommands = listOf("publish", "adduser", "login", "token"),
-            blockedFlags = listOf("--global", "-g")
-        ),
-        AllowedCommand(
-            program = "yarn",
-            description = "Yarn package manager",
-            blockedSubcommands = listOf("publish", "login", "npm audit"),
-            blockedFlags = listOf("--global", "-g")
-        ),
-        AllowedCommand(
-            program = "pnpm",
-            description = "pnpm package manager",
-            blockedSubcommands = listOf("publish", "login"),
-            blockedFlags = listOf("--global", "-g")
-        ),
-        AllowedCommand(
-            program = "bun",
-            description = "Bun runtime & package manager",
-            blockedSubcommands = listOf("publish")
-        ),
-        AllowedCommand(
-            program = "node",
-            description = "Node.js runtime",
-            blockedFlags = listOf("-e", "--eval")
-        ),
-        AllowedCommand(
-            program = "tsx",
-            description = "TypeScript execute"
-        ),
-        AllowedCommand(
-            program = "ts-node",
-            description = "TypeScript Node.js"
-        ),
-        AllowedCommand(
-            program = "tsc",
-            description = "TypeScript compiler"
-        ),
-        AllowedCommand(
-            program = "eslint",
-            description = "JavaScript/TypeScript linter"
-        ),
-        AllowedCommand(
-            program = "prettier",
-            description = "Code formatter"
-        ),
-        AllowedCommand(
-            program = "jest",
-            description = "JavaScript test runner"
-        ),
-        AllowedCommand(
-            program = "vitest",
-            description = "Vite test runner"
-        ),
-        AllowedCommand(
-            program = "mocha",
-            description = "JavaScript test framework"
-        ),
-        AllowedCommand(
-            program = "playwright",
-            description = "E2E test runner"
-        ),
-        AllowedCommand(
-            program = "cypress",
-            description = "E2E test runner"
-        ),
-        AllowedCommand(
-            program = "webpack",
-            description = "Module bundler"
-        ),
-        AllowedCommand(
-            program = "vite",
-            description = "Frontend build tool"
-        ),
-        AllowedCommand(
-            program = "esbuild",
-            description = "JavaScript bundler"
-        ),
-        AllowedCommand(
-            program = "rollup",
-            description = "JavaScript module bundler"
-        ),
-        AllowedCommand(
-            program = "next",
-            description = "Next.js CLI"
-        ),
-
-        // ── Git ────────────────────────────────────────────────────────
-        AllowedCommand(
-            program = "git",
-            description = "Git version control",
-            blockedSubcommands = listOf(
-                "reset --hard",
-                "clean -f", "clean -fd", "clean -fdx",
-                "filter-branch",
-                "reflog expire"
-            ),
-            blockedFlags = listOf("--no-verify"),
-            blockedArgPatterns = listOf(".*\\.env$", ".*credentials.*")
-        ),
-
-        // ── Python Ecosystem ───────────────────────────────────────────
-        AllowedCommand(
-            program = "python",
-            aliases = listOf("python3", "py"),
-            description = "Python interpreter",
-            blockedFlags = listOf("-c"),
-            blockedArgPatterns = listOf(".*socket\\..*")
-        ),
-        AllowedCommand(
-            program = "pip",
-            aliases = listOf("pip3"),
-            description = "Python package manager",
-            blockedSubcommands = listOf("install --user", "install --system"),
-            blockedArgPatterns = listOf(".*http://.*")
-        ),
-        AllowedCommand(
-            program = "poetry",
-            description = "Python dependency manager",
-            blockedSubcommands = listOf("publish")
-        ),
-        AllowedCommand(
-            program = "pipenv",
-            description = "Python virtualenv manager"
-        ),
-        AllowedCommand(
-            program = "uv",
-            description = "Python package manager (fast)",
-            blockedSubcommands = listOf("publish")
-        ),
-        AllowedCommand(
-            program = "pytest",
-            description = "Python test runner"
-        ),
-        AllowedCommand(
-            program = "mypy",
-            description = "Python type checker"
-        ),
-        AllowedCommand(
-            program = "ruff",
-            description = "Python linter"
-        ),
-        AllowedCommand(
-            program = "black",
-            description = "Python formatter"
-        ),
-        AllowedCommand(
-            program = "flake8",
-            description = "Python linter"
-        ),
-        AllowedCommand(
-            program = "pylint",
-            description = "Python linter"
-        ),
-        AllowedCommand(
-            program = "django-admin",
-            aliases = listOf("manage.py"),
-            description = "Django management"
-        ),
-        AllowedCommand(
-            program = "flask",
-            description = "Flask CLI"
-        ),
-
-        // ── JVM Ecosystem ──────────────────────────────────────────────
-        AllowedCommand(
-            program = "java",
-            description = "Java runtime"
-        ),
-        AllowedCommand(
-            program = "javac",
-            description = "Java compiler"
-        ),
-        AllowedCommand(
-            program = "kotlin",
-            aliases = listOf("kotlinc"),
-            description = "Kotlin compiler"
-        ),
-        AllowedCommand(
-            program = "scala",
-            aliases = listOf("scalac"),
-            description = "Scala compiler"
-        ),
-
-        // ── Rust Ecosystem ─────────────────────────────────────────────
-        AllowedCommand(
-            program = "cargo",
-            description = "Rust package manager",
-            blockedSubcommands = listOf("publish", "install", "uninstall")
-        ),
-        AllowedCommand(
-            program = "rustc",
-            description = "Rust compiler"
-        ),
-        AllowedCommand(
-            program = "rustup",
-            description = "Rust toolchain manager",
-            allowedSubcommands = listOf("show", "which", "target list", "toolchain list", "component list")
-        ),
-
-        // ── Go Ecosystem ───────────────────────────────────────────────
-        AllowedCommand(
-            program = "go",
-            description = "Go toolchain",
-            blockedSubcommands = listOf("install")
-        ),
-        AllowedCommand(
-            program = "golangci-lint",
-            description = "Go linter"
-        ),
-
-        // ── .NET Ecosystem ─────────────────────────────────────────────
-        AllowedCommand(
-            program = "dotnet",
-            description = ".NET CLI",
-            blockedSubcommands = listOf("publish", "nuget push")
-        ),
-
-        // ── Ruby Ecosystem ─────────────────────────────────────────────
-        AllowedCommand(
-            program = "ruby",
-            description = "Ruby interpreter",
-            blockedFlags = listOf("-e")
-        ),
-        AllowedCommand(
-            program = "bundle",
-            aliases = listOf("bundler"),
-            description = "Ruby dependency manager"
-        ),
-        AllowedCommand(
-            program = "rake",
-            description = "Ruby Make"
-        ),
-        AllowedCommand(
-            program = "rails",
-            description = "Rails CLI"
-        ),
-        AllowedCommand(
-            program = "rspec",
-            description = "Ruby test runner"
-        ),
-
-        // ── PHP Ecosystem ──────────────────────────────────────────────
-        AllowedCommand(
-            program = "php",
-            description = "PHP interpreter",
-            blockedFlags = listOf("-r")
-        ),
-        AllowedCommand(
-            program = "composer",
-            description = "PHP dependency manager",
-            blockedSubcommands = listOf("global")
-        ),
-        AllowedCommand(
-            program = "phpunit",
-            description = "PHP test runner"
-        ),
-        AllowedCommand(
-            program = "artisan",
-            description = "Laravel CLI"
-        ),
-
-        // ── Swift/Apple ────────────────────────────────────────────────
-        AllowedCommand(
-            program = "swift",
-            description = "Swift compiler & package manager"
-        ),
-        AllowedCommand(
-            program = "xcodebuild",
-            description = "Xcode build CLI",
-            blockedFlags = listOf("-allowProvisioningUpdates")
-        ),
-        AllowedCommand(
-            program = "xcrun",
-            description = "Xcode toolchain runner"
-        ),
-
-        // ── C/C++ ──────────────────────────────────────────────────────
-        AllowedCommand(
-            program = "gcc",
-            aliases = listOf("g++", "cc", "c++"),
-            description = "GNU C/C++ compiler"
-        ),
-        AllowedCommand(
-            program = "clang",
-            aliases = listOf("clang++"),
-            description = "LLVM C/C++ compiler"
-        ),
-        AllowedCommand(
-            program = "gdb",
-            description = "GNU debugger"
-        ),
-        AllowedCommand(
-            program = "lldb",
-            description = "LLVM debugger"
-        ),
-        AllowedCommand(
-            program = "valgrind",
-            description = "Memory analysis tool"
-        ),
-
-        // ── Dart/Flutter ───────────────────────────────────────────────
-        AllowedCommand(
-            program = "dart",
-            description = "Dart SDK",
-            blockedSubcommands = listOf("pub publish")
-        ),
-        AllowedCommand(
-            program = "flutter",
-            description = "Flutter SDK"
-        ),
-
-        // ── Docker & Containers ────────────────────────────────────────
-        AllowedCommand(
-            program = "docker",
-            description = "Docker CLI",
-            blockedSubcommands = listOf(
-                "rm", "rmi", "prune", "push",
-                "system prune", "volume rm", "network rm"
-            ),
-            blockedFlags = listOf("--privileged"),
-            blockedArgPatterns = listOf(".*--cap-add.*")
-        ),
-        AllowedCommand(
-            program = "docker-compose",
-            aliases = listOf("docker compose"),
-            description = "Docker Compose",
-            blockedSubcommands = listOf("rm", "down --volumes", "push")
-        ),
-        AllowedCommand(
-            program = "podman",
-            description = "Podman container engine",
-            blockedSubcommands = listOf("rm", "rmi", "push", "system prune"),
-            blockedFlags = listOf("--privileged")
-        ),
-        AllowedCommand(
-            program = "kubectl",
-            description = "Kubernetes CLI",
-            blockedSubcommands = listOf("delete", "drain", "cordon", "taint", "edit"),
-            blockedFlags = listOf("--force")
-        ),
-
-        // ── Database CLIs (read-safe) ──────────────────────────────────
-        AllowedCommand(
-            program = "psql",
-            description = "PostgreSQL client",
-            blockedArgPatterns = listOf(".*DROP\\s+DATABASE.*", ".*DROP\\s+TABLE.*")
-        ),
-        AllowedCommand(
-            program = "mysql",
-            description = "MySQL client",
-            blockedArgPatterns = listOf(".*DROP\\s+DATABASE.*", ".*DROP\\s+TABLE.*")
-        ),
-        AllowedCommand(
-            program = "sqlite3",
-            description = "SQLite client"
-        ),
-        AllowedCommand(
-            program = "redis-cli",
-            description = "Redis client",
-            blockedSubcommands = listOf("FLUSHALL", "FLUSHDB", "CONFIG SET", "DEBUG")
-        ),
-        AllowedCommand(
-            program = "mongosh",
-            aliases = listOf("mongo"),
-            description = "MongoDB shell"
-        ),
-
-        // ── Cloud CLIs (read operations) ───────────────────────────────
-        AllowedCommand(
-            program = "aws",
-            description = "AWS CLI",
-            blockedSubcommands = listOf(
-                "s3 rm", "ec2 terminate-instances", "rds delete",
-                "iam delete", "lambda delete", "cloudformation delete"
-            ),
-            requireConfirmation = true
-        ),
-        AllowedCommand(
-            program = "gcloud",
-            description = "Google Cloud CLI",
-            blockedSubcommands = listOf("compute instances delete", "sql instances delete", "functions delete"),
-            requireConfirmation = true
-        ),
-        AllowedCommand(
-            program = "az",
-            description = "Azure CLI",
-            blockedSubcommands = listOf("vm delete", "group delete", "webapp delete"),
-            requireConfirmation = true
-        ),
-
-        // ── Infrastructure as Code ─────────────────────────────────────
-        AllowedCommand(
-            program = "terraform",
-            description = "Terraform IaC",
-            allowedSubcommands = listOf(
-                "init", "plan", "validate", "fmt", "show", "state list",
-                "state show", "output", "providers", "version", "graph"
-            )
-        ),
-        AllowedCommand(
-            program = "pulumi",
-            description = "Pulumi IaC",
-            allowedSubcommands = listOf("preview", "stack", "config", "whoami", "version")
-        ),
-
-        // ── Shell Utilities (cross-platform) ───────────────────────────
-        AllowedCommand(
-            program = "cat",
-            aliases = listOf("type", "get-content"),
-            description = "Print file content",
-            blockedArgPatterns = listOf(
-                ".*\\.env$", ".*/\\.ssh/.*", ".*credentials.*",
-                ".*/etc/shadow.*", ".*/etc/passwd.*"
-            )
-        ),
-        AllowedCommand(
-            program = "ls",
-            aliases = listOf("dir", "gci", "get-childitem"),
-            description = "List directory"
-        ),
-        AllowedCommand(
-            program = "pwd",
-            aliases = listOf("get-location"),
-            description = "Print working directory"
-        ),
-        AllowedCommand(
-            program = "echo",
-            aliases = listOf("write-output"),
-            description = "Print text output"
-        ),
-        AllowedCommand(
-            program = "clear",
-            aliases = listOf("cls"),
-            description = "Clear terminal screen"
-        ),
-        AllowedCommand(
-            program = "date",
-            aliases = listOf("get-date"),
-            description = "Print current date and time"
-        ),
-        AllowedCommand(
-            program = "whoami",
-            description = "Print current user"
-        ),
-        AllowedCommand(
-            program = "hostname",
-            description = "Print host name"
-        ),
-        AllowedCommand(
-            program = "uname",
-            description = "Print OS information"
-        ),
-        AllowedCommand(
-            program = "id",
-            description = "Print user identity"
-        ),
-        AllowedCommand(
-            program = "which",
-            aliases = listOf("where.exe", "where"),
-            description = "Locate executable path"
-        ),
-        AllowedCommand(
-            program = "get-command",
-            aliases = listOf("gcm"),
-            description = "Resolve command metadata"
-        ),
-        AllowedCommand(
-            program = "env",
-            aliases = listOf("printenv"),
-            description = "Print environment variables",
-            blockedArgPatterns = listOf(".*SECRET.*", ".*PASSWORD.*", ".*TOKEN.*", ".*API_KEY.*")
-        ),
-
-        // ── File Operations (safe) ─────────────────────────────────────
-        AllowedCommand(
-            program = "find",
-            description = "Find files",
-            blockedFlags = listOf("-exec", "-delete", "-execdir")
-        ),
-        AllowedCommand(
-            program = "grep",
-            aliases = listOf("rg", "ag", "select-string", "sls"),
-            description = "Search in files"
-        ),
-        AllowedCommand(
-            program = "wc",
-            description = "Word/line count"
-        ),
-        AllowedCommand(
-            program = "head",
-            description = "Print first lines"
-        ),
-        AllowedCommand(
-            program = "tail",
-            description = "Print last lines"
-        ),
-        AllowedCommand(
-            program = "sort",
-            description = "Sort lines"
-        ),
-        AllowedCommand(
-            program = "uniq",
-            description = "Filter duplicates"
-        ),
-        AllowedCommand(
-            program = "diff",
-            description = "Compare files"
-        ),
-        AllowedCommand(
-            program = "sed",
-            description = "Stream editor"
-        ),
-        AllowedCommand(
-            program = "awk",
-            aliases = listOf("gawk"),
-            description = "Pattern processing"
-        ),
-        AllowedCommand(
-            program = "cut",
-            description = "Extract columns"
-        ),
-        AllowedCommand(
-            program = "tr",
-            description = "Translate characters"
-        ),
-        AllowedCommand(
-            program = "tee",
-            description = "Pipe to file and stdout"
-        ),
-        AllowedCommand(
-            program = "xargs",
-            description = "Build argument lists"
-        ),
-        AllowedCommand(
-            program = "tree",
-            description = "Directory tree view"
-        ),
-        AllowedCommand(
-            program = "file",
-            description = "Determine file type"
-        ),
-        AllowedCommand(
-            program = "stat",
-            description = "File statistics"
-        ),
-        AllowedCommand(
-            program = "du",
-            description = "Disk usage"
-        ),
-        AllowedCommand(
-            program = "df",
-            description = "Disk free space"
-        ),
-        AllowedCommand(
-            program = "touch",
-            description = "Create empty file / update timestamp"
-        ),
-        AllowedCommand(
-            program = "mkdir",
-            description = "Create directory"
-        ),
-        AllowedCommand(
-            program = "cp",
-            aliases = listOf("copy", "copy-item"),
-            description = "Copy files"
-        ),
-        AllowedCommand(
-            program = "mv",
-            aliases = listOf("move", "move-item", "ren", "rename-item"),
-            description = "Move/rename files"
-        ),
-        AllowedCommand(
-            program = "ln",
-            description = "Create links",
-            blockedFlags = listOf("-f")
-        ),
-
-        // ── Network Utilities (safe/diagnostic) ────────────────────────
-        AllowedCommand(
-            program = "curl",
-            description = "HTTP client",
-            blockedArgPatterns = listOf(".*\\|\\s*(sh|bash).*")
-        ),
-        AllowedCommand(
-            program = "wget",
-            description = "File downloader",
-            blockedArgPatterns = listOf(".*\\|\\s*(sh|bash).*")
-        ),
-        AllowedCommand(
-            program = "ping",
-            description = "Network connectivity check"
-        ),
-        AllowedCommand(
-            program = "nslookup",
-            aliases = listOf("dig", "host"),
-            description = "DNS lookup"
-        ),
-        AllowedCommand(
-            program = "ssh",
-            description = "Secure shell",
-            requireConfirmation = true
-        ),
-        AllowedCommand(
-            program = "scp",
-            description = "Secure copy",
-            requireConfirmation = true
-        ),
-        AllowedCommand(
-            program = "rsync",
-            description = "Remote sync",
-            requireConfirmation = true,
-            blockedFlags = listOf("--delete")
-        ),
-
-        // ── Process Management (safe) ──────────────────────────────────
-        AllowedCommand(
-            program = "ps",
-            aliases = listOf("get-process"),
-            description = "List processes"
-        ),
-        AllowedCommand(
-            program = "top",
-            aliases = listOf("htop"),
-            description = "Process monitor"
-        ),
-        AllowedCommand(
-            program = "lsof",
-            description = "List open files"
-        ),
-        AllowedCommand(
-            program = "netstat",
-            aliases = listOf("ss"),
-            description = "Network statistics"
-        ),
-        AllowedCommand(
-            program = "kill",
-            description = "Terminate process",
-            requireConfirmation = true
-        ),
-
-        // ── Version Managers ───────────────────────────────────────────
-        AllowedCommand(
-            program = "nvm",
-            description = "Node Version Manager"
-        ),
-        AllowedCommand(
-            program = "fnm",
-            description = "Fast Node Manager"
-        ),
-        AllowedCommand(
-            program = "pyenv",
-            description = "Python Version Manager"
-        ),
-        AllowedCommand(
-            program = "rbenv",
-            description = "Ruby Version Manager"
-        ),
-        AllowedCommand(
-            program = "sdkman",
-            aliases = listOf("sdk"),
-            description = "JVM SDK Manager"
-        ),
-
-        // ── CI/CD & DevOps Tools ───────────────────────────────────────
-        AllowedCommand(
-            program = "gh",
-            description = "GitHub CLI",
-            blockedSubcommands = listOf("repo delete", "org delete")
-        ),
-        AllowedCommand(
-            program = "jq",
-            description = "JSON processor"
-        ),
-        AllowedCommand(
-            program = "yq",
-            description = "YAML processor"
-        ),
-
-        // ── Windows-Specific ───────────────────────────────────────────
-        AllowedCommand(
-            program = "test-path",
-            description = "PowerShell path check"
-        ),
-        AllowedCommand(
-            program = "test-connection",
-            description = "PowerShell ping"
-        ),
-        AllowedCommand(
-            program = "invoke-webrequest",
-            aliases = listOf("iwr"),
-            description = "PowerShell HTTP client"
-        ),
-        AllowedCommand(
-            program = "convertfrom-json",
-            description = "PowerShell JSON parser"
-        ),
-        AllowedCommand(
-            program = "convertto-json",
-            description = "PowerShell JSON serializer"
-        ),
-        AllowedCommand(
-            program = "select-object",
-            description = "PowerShell object filter"
-        ),
-        AllowedCommand(
-            program = "where-object",
-            description = "PowerShell object filter"
-        ),
-        AllowedCommand(
-            program = "format-list",
-            aliases = listOf("fl"),
-            description = "PowerShell format list"
-        ),
-        AllowedCommand(
-            program = "format-table",
-            aliases = listOf("ft"),
-            description = "PowerShell format table"
-        ),
-        AllowedCommand(
-            program = "measure-object",
-            description = "PowerShell object measurement"
-        ),
-        AllowedCommand(
-            program = "out-string",
-            description = "PowerShell output formatter"
-        ),
-        AllowedCommand(
-            program = "new-item",
-            aliases = listOf("ni"),
-            description = "PowerShell create file/directory"
-        )
-    )
-
-    val DEFAULT_BLOCKED_PATTERNS = listOf(
-        // Pipe to shell interpreter (code execution via download)
-        "\\|\\s*(sh|bash|zsh|powershell|cmd)\\b",
-        "\\|\\s*eval\\b",
-
-        // Command substitution (potential injection)
-        "\\$\\(",
-        "`[^`]+`",
-
-        // Redirect to system directories
-        ">\\s*/dev/",
-        ">(>)?\\s*/etc/",
-
-        // Destructive recursive delete
-        "\\brm\\s+-r",
-        "\\bdel\\s+/[sfq].*\\\\",
-
-        // Disk format / raw write
-        "\\bmkfs\\b",
-        "\\bdd\\b.*of=",
-
-        // Fork bomb
-        ":\\(\\)\\s*\\{"
-    )
-}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/workflow/WorkflowOrchestrator.kt b/core/src/main/kotlin/pl/jclab/refio/core/workflow/WorkflowOrchestrator.kt
index c5a4f8a5..28349a4f 100644
--- a/core/src/main/kotlin/pl/jclab/refio/core/workflow/WorkflowOrchestrator.kt
+++ b/core/src/main/kotlin/pl/jclab/refio/core/workflow/WorkflowOrchestrator.kt
@@ -2,15 +2,19 @@ package pl.jclab.refio.core.workflow
 
 import kotlinx.coroutines.CancellationException
 import pl.jclab.refio.api.models.ExecutionMode
+import pl.jclab.refio.core.api.PlanningRequest
 import pl.jclab.refio.core.api.StreamCallback
+import pl.jclab.refio.core.api.routers.AgentRouter
+import pl.jclab.refio.core.db.TaskMode
+import pl.jclab.refio.core.models.api.ChatRequest
+import pl.jclab.refio.core.models.api.LLMParams
+import pl.jclab.refio.core.services.ChatService
+import pl.jclab.refio.core.services.PlanningService
+import pl.jclab.refio.core.services.execution.unified.ExecutionEventListener
 import pl.jclab.refio.core.services.monitoring.GlobalMetrics
 import pl.jclab.refio.core.services.monitoring.OperationInfo
 import pl.jclab.refio.core.services.orchestration.UserInteraction
-import pl.jclab.refio.core.services.execution.unified.ExecutionEventListener
-import pl.jclab.refio.core.workflow.executors.ChatExecutor
-import pl.jclab.refio.core.workflow.executors.PlanExecutor
-import pl.jclab.refio.core.workflow.executors.StepExecutor
-import pl.jclab.refio.core.workflow.executors.SubagentExecutor
+import pl.jclab.refio.core.subagents.SubagentRouter
 import pl.jclab.refio.core.workflow.models.IntentResult
 import pl.jclab.refio.core.workflow.models.WorkflowIntent
 import pl.jclab.refio.core.workflow.models.WorkflowRequest
@@ -19,14 +23,17 @@ import pl.jclab.refio.core.logging.dualLogger
 private val logger = dualLogger("WorkflowOrchestrator")
 
 /**
- * Orchestrates workflow intent routing and execution using adapter executors.
+ * Orchestrates workflow intent routing and execution.
+ *
+ * Dispatches directly to domain services (ChatService/PlanningService/AgentRouter/SubagentRouter) —
+ * no adapter executors. AUTO mode loops through intent resolution until a non-step intent resolves.
  */
 class WorkflowOrchestrator(
     private val intentRouter: IntentRouter,
-    private val chatExecutor: ChatExecutor,
-    private val planExecutor: PlanExecutor,
-    private val stepExecutor: StepExecutor,
-    private val subagentExecutor: SubagentExecutor?,
+    private val chatService: ChatService,
+    private val planningService: PlanningService,
+    private val agentRouter: AgentRouter,
+    private val subagentRouter: SubagentRouter?,
     private val userInteraction: UserInteraction? = null
 ) {
     suspend fun execute(
@@ -69,49 +76,78 @@ class WorkflowOrchestrator(
                         listener.onChatStarted()
                         val onChunk = if (stream) streamCallback(listener) else null
                         logger.info { "[WORKFLOW] Chat execution start: stream=$stream (taskId=$taskLabel)" }
-                        val response = chatExecutor.execute(intent, stream, onChunk)
-                        val output = (response as IntentResult.ChatResult).response.output
-                        logger.info { "[WORKFLOW] Chat execution complete: outputChars=${output.length} (taskId=$taskLabel)" }
-                        listener.onStreamComplete(output)
-                        response
+                        val chatRequest = ChatRequest(
+                            taskId = intent.taskId,
+                            mode = TaskMode.CHAT,
+                            input = intent.input,
+                            contextRefs = intent.contextRefs,
+                            params = LLMParams(
+                                model = intent.model,
+                                provider = intent.provider
+                            )
+                        )
+                        val response = chatService.chat(chatRequest, stream, onChunk)
+                        logger.info { "[WORKFLOW] Chat execution complete: outputChars=${response.output.length} (taskId=$taskLabel)" }
+                        listener.onStreamComplete(response.output)
+                        IntentResult.ChatResult(response)
                     }
 
                     is WorkflowIntent.Plan -> {
                         listener.onPlanningStarted()
                         val onChunk = if (stream) streamCallback(listener) else null
                         logger.info { "[WORKFLOW] Plan execution start: stream=$stream (taskId=$taskLabel)" }
-                        val response = planExecutor.execute(intent, stream, onChunk)
-                        val plan = (response as IntentResult.PlanResult).response.plan
-                        logger.info { "[WORKFLOW] Plan execution complete: planChars=${plan.length} (taskId=$taskLabel)" }
-                        listener.onStreamComplete(plan)
-                        response
+                        val planningRequest = PlanningRequest(
+                            input = intent.input,
+                            contextRefs = intent.contextRefs,
+                            model = intent.model,
+                            provider = intent.provider,
+                            interactive = intent.interactive
+                        )
+                        val response = planningService.createPlan(intent.taskId, planningRequest, stream, onChunk)
+                        logger.info { "[WORKFLOW] Plan execution complete: planChars=${response.plan.length} (taskId=$taskLabel)" }
+                        listener.onStreamComplete(response.plan)
+                        IntentResult.PlanResult(response)
                     }
 
                     is WorkflowIntent.ExecuteStep -> {
                         listener.onStepStarted(intent.subtaskId)
                         logger.info { "[WORKFLOW] Step execution start: subtaskId=${intent.subtaskId} (taskId=$taskLabel)" }
-                        val response = stepExecutor.execute(
-                            intent = intent,
-                            listener = executionListener(listener)
+                        val execListener = executionListener(listener)
+                        // Two paths kept intentionally — StepExecutor previously branched on listener == null.
+                        val response = agentRouter.executeSubtaskStepWithListener(
+                            intent.taskId,
+                            intent.subtaskId,
+                            execListener
                         )
                         executedStep = true
                         logger.info { "[WORKFLOW] Step execution complete: subtaskId=${intent.subtaskId} (taskId=$taskLabel)" }
-                        response
+                        IntentResult.StepResult(response)
                     }
 
                     is WorkflowIntent.Subagent -> {
-                        val executor = subagentExecutor
+                        val router = subagentRouter
                             ?: throw IllegalStateException("Subagent execution not available")
                         listener.onSubagentStarted(intent.name)
                         val onChunk = if (stream) streamCallback(listener) else null
                         logger.info { "[WORKFLOW] Subagent execution start: name=${intent.name}, stream=$stream (taskId=$taskLabel)" }
-                        val response = executor.execute(intent, stream, onChunk)
-                        val output = (response as IntentResult.SubagentResult).response.response
+                        // parentModel only when both provider and model are set — lifted verbatim from SubagentExecutor.
+                        val parentModel = intent.model?.let { model ->
+                            intent.provider?.let { provider -> "$provider/$model" }
+                        }
+                        val response = router.invoke(
+                            taskId = intent.taskId,
+                            name = intent.name,
+                            prompt = intent.prompt,
+                            contextRefs = intent.contextRefs,
+                            stream = stream,
+                            onChunk = onChunk,
+                            parentModel = parentModel
+                        )
                         logger.info {
-                            "[WORKFLOW] Subagent execution complete: name=${intent.name}, outputChars=${output.length} (taskId=$taskLabel)"
+                            "[WORKFLOW] Subagent execution complete: name=${intent.name}, outputChars=${response.response.length} (taskId=$taskLabel)"
                         }
-                        listener.onStreamComplete(output)
-                        response
+                        listener.onStreamComplete(response.response)
+                        IntentResult.SubagentResult(response)
                     }
 
                     is WorkflowIntent.AnswerQuestion -> {
@@ -123,7 +159,6 @@ class WorkflowOrchestrator(
                         interaction.provideResponse(intent.questionId, intent.answer)
                         IntentResult.AnswerResult(intent.taskId)
                     }
-
                 }
 
                 lastResult = result
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/workflow/executors/ChatExecutor.kt b/core/src/main/kotlin/pl/jclab/refio/core/workflow/executors/ChatExecutor.kt
deleted file mode 100644
index cf4184f4..00000000
--- a/core/src/main/kotlin/pl/jclab/refio/core/workflow/executors/ChatExecutor.kt
+++ /dev/null
@@ -1,36 +0,0 @@
-package pl.jclab.refio.core.workflow.executors
-
-import pl.jclab.refio.core.api.StreamCallback
-import pl.jclab.refio.core.db.TaskMode
-import pl.jclab.refio.core.models.api.ChatRequest
-import pl.jclab.refio.core.models.api.LLMParams
-import pl.jclab.refio.core.services.ChatService
-import pl.jclab.refio.core.workflow.models.IntentResult
-import pl.jclab.refio.core.workflow.models.WorkflowIntent
-
-/**
- * Adapter for ChatService chat execution.
- */
-class ChatExecutor(
-    private val chatService: ChatService
-) {
-    suspend fun execute(
-        intent: WorkflowIntent.Chat,
-        stream: Boolean,
-        onChunk: StreamCallback?
-    ): IntentResult {
-        val request = ChatRequest(
-            taskId = intent.taskId,
-            mode = TaskMode.CHAT,
-            input = intent.input,
-            contextRefs = intent.contextRefs,
-            params = LLMParams(
-                model = intent.model,
-                provider = intent.provider
-            )
-        )
-
-        val response = chatService.chat(request, stream, onChunk)
-        return IntentResult.ChatResult(response)
-    }
-}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/workflow/executors/PlanExecutor.kt b/core/src/main/kotlin/pl/jclab/refio/core/workflow/executors/PlanExecutor.kt
deleted file mode 100644
index d825594f..00000000
--- a/core/src/main/kotlin/pl/jclab/refio/core/workflow/executors/PlanExecutor.kt
+++ /dev/null
@@ -1,31 +0,0 @@
-package pl.jclab.refio.core.workflow.executors
-
-import pl.jclab.refio.core.api.PlanningRequest
-import pl.jclab.refio.core.api.StreamCallback
-import pl.jclab.refio.core.services.PlanningService
-import pl.jclab.refio.core.workflow.models.IntentResult
-import pl.jclab.refio.core.workflow.models.WorkflowIntent
-
-/**
- * Adapter for PlanningService plan creation.
- */
-class PlanExecutor(
-    private val planningService: PlanningService
-) {
-    suspend fun execute(
-        intent: WorkflowIntent.Plan,
-        stream: Boolean,
-        onChunk: StreamCallback?
-    ): IntentResult {
-        val request = PlanningRequest(
-            input = intent.input,
-            contextRefs = intent.contextRefs,
-            model = intent.model,
-            provider = intent.provider,
-            interactive = intent.interactive
-        )
-
-        val response = planningService.createPlan(intent.taskId, request, stream, onChunk)
-        return IntentResult.PlanResult(response)
-    }
-}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/workflow/executors/StepExecutor.kt b/core/src/main/kotlin/pl/jclab/refio/core/workflow/executors/StepExecutor.kt
deleted file mode 100644
index f9fd8f69..00000000
--- a/core/src/main/kotlin/pl/jclab/refio/core/workflow/executors/StepExecutor.kt
+++ /dev/null
@@ -1,26 +0,0 @@
-package pl.jclab.refio.core.workflow.executors
-
-import pl.jclab.refio.core.api.routers.AgentRouter
-import pl.jclab.refio.core.services.execution.unified.ExecutionEventListener
-import pl.jclab.refio.core.workflow.models.IntentResult
-import pl.jclab.refio.core.workflow.models.WorkflowIntent
-
-/**
- * Adapter for AgentRouter step execution.
- */
-class StepExecutor(
-    private val agentRouter: AgentRouter
-) {
-    suspend fun execute(
-        intent: WorkflowIntent.ExecuteStep,
-        listener: ExecutionEventListener? = null
-    ): IntentResult {
-        val response = if (listener == null) {
-            agentRouter.executeSubtaskStep(intent.taskId, intent.subtaskId)
-        } else {
-            agentRouter.executeSubtaskStepWithListener(intent.taskId, intent.subtaskId, listener)
-        }
-
-        return IntentResult.StepResult(response)
-    }
-}
diff --git a/core/src/main/kotlin/pl/jclab/refio/core/workflow/executors/SubagentExecutor.kt b/core/src/main/kotlin/pl/jclab/refio/core/workflow/executors/SubagentExecutor.kt
deleted file mode 100644
index 508a6fb2..00000000
--- a/core/src/main/kotlin/pl/jclab/refio/core/workflow/executors/SubagentExecutor.kt
+++ /dev/null
@@ -1,35 +0,0 @@
-package pl.jclab.refio.core.workflow.executors
-
-import pl.jclab.refio.core.api.StreamCallback
-import pl.jclab.refio.core.subagents.SubagentRouter
-import pl.jclab.refio.core.workflow.models.IntentResult
-import pl.jclab.refio.core.workflow.models.WorkflowIntent
-
-/**
- * Adapter for SubagentRouter invocation.
- */
-class SubagentExecutor(
-    private val subagentRouter: SubagentRouter
-) {
-    suspend fun execute(
-        intent: WorkflowIntent.Subagent,
-        stream: Boolean,
-        onChunk: StreamCallback?
-    ): IntentResult {
-        val parentModel = intent.model?.let { model ->
-            intent.provider?.let { provider -> "$provider/$model" }
-        }
-
-        val response = subagentRouter.invoke(
-            taskId = intent.taskId,
-            name = intent.name,
-            prompt = intent.prompt,
-            contextRefs = intent.contextRefs,
-            stream = stream,
-            onChunk = onChunk,
-            parentModel = parentModel
-        )
-
-        return IntentResult.SubagentResult(response)
-    }
-}
diff --git a/core/src/main/resources/config/example-config.yaml b/core/src/main/resources/config/example-config.yaml
new file mode 100644
index 00000000..782a5a39
--- /dev/null
+++ b/core/src/main/resources/config/example-config.yaml
@@ -0,0 +1,225 @@
+# ═══════════════════════════════════════════════════════════════════════════════
+# Refio Configuration File
+# ═══════════════════════════════════════════════════════════════════════════════
+#
+# Location:
+#   User config:    ~/.refio/config.yaml (Linux/macOS) or %USERPROFILE%\.refio\config.yaml (Windows)
+#   Project config: <project_root>/.refio/config.yaml
+#
+# Configuration Hierarchy (lowest to highest priority):
+#   1. Built-in defaults (hardcoded)
+#   2. User config file (~/.refio/config.yaml)
+#   3. Project config file (<project>/.refio/config.yaml)
+#   4. Database overrides (changes made in Settings UI)
+#
+# All fields are optional. Missing fields use built-in defaults.
+# ═══════════════════════════════════════════════════════════════════════════════
+
+# ─────────────────────────────────────────────────────────────────────────────
+# General Settings
+# ─────────────────────────────────────────────────────────────────────────────
+general:
+  formatMarkdown: true          # Format responses as markdown
+  streamingEnabled: true        # Stream LLM responses in real-time
+  advancedView: false           # Show advanced UI tabs (Steps, Context, RAG, Debug)
+
+# ─────────────────────────────────────────────────────────────────────────────
+# LLM Provider Configuration
+# ─────────────────────────────────────────────────────────────────────────────
+providers:
+  ollama:
+    endpoint: "http://localhost:11434"
+    contextSize: 32768          # Context window size in tokens
+
+  anthropic:
+    apiKey: ""                  # sk-ant-...
+
+  openai:
+    apiKey: ""                  # sk-...
+
+  openrouter:
+    apiKey: ""                  # sk-or-...
+
+  gemini:
+    apiKey: ""                  # AIza...
+
+  lmstudio:
+    baseUrl: "http://localhost:1234/v1"
+    contextSize: 32768
+
+# ─────────────────────────────────────────────────────────────────────────────
+# Model Configuration
+# ─────────────────────────────────────────────────────────────────────────────
+models:
+  # Default models per operation mode (format: "provider/model-id")
+  defaults:
+    chat: "ollama/qwen2.5:7b"       # Default chat/conversation model
+    plan: "ollama/qwen2.5:7b"       # Model for planning operations
+    coding: "ollama/qwen2.5-coder:7b"  # Model for coding/agent tasks
+    weak: "ollama/qwen2.5:3b"       # Cheap model for auxiliary operations
+    embedding: "ollama/nomic-embed-text"  # Model for embeddings (RAG)
+
+  # Model visibility in dropdown (format: "provider/model-id": true/false)
+  visibility:
+    "ollama/qwen2.5:7b": true
+    "ollama/qwen2.5:14b": true
+    "ollama/qwen2.5-coder:7b": true
+    "openai/gpt-4o-mini": true
+    "openai/gpt-4o": false          # Hidden by default (expensive)
+    "anthropic/claude-3-5-sonnet-20241022": true
+    "anthropic/claude-3-opus-20240229": false  # Hidden by default (expensive)
+
+# ─────────────────────────────────────────────────────────────────────────────
+# System Limits
+# ─────────────────────────────────────────────────────────────────────────────
+limits:
+  apiCallTimeout: 240           # API call timeout in seconds
+  toolExecutionTimeout: 240     # Tool execution timeout in seconds
+  streamingReadTimeout: 240     # Streaming read timeout in seconds
+  streamingRequestTimeout: 1800 # Total streaming request timeout in seconds
+  maxContextSize: 128000        # Maximum context size in tokens
+  maxOutputSize: 16384          # Maximum output size in tokens
+  maxFileSize: 10               # Maximum file size in MB
+
+# ─────────────────────────────────────────────────────────────────────────────
+# Advanced Settings
+# ─────────────────────────────────────────────────────────────────────────────
+advanced:
+  noEgressDefault: false        # Block external network calls by default
+  readOnlyMode: false           # Prevent all file write operations
+  autoOptimizePercentage: 85    # Auto-optimize context at this % of limit
+
+# ─────────────────────────────────────────────────────────────────────────────
+# Tool Permissions
+# ─────────────────────────────────────────────────────────────────────────────
+tools:
+  # Permission format: { planMode: "ON"|"OFF", agentMode: "ON"|"OFF" }
+  permissions:
+    read_file:
+      planMode: "ON"
+      agentMode: "ON"
+    read_directory:
+      planMode: "ON"
+      agentMode: "ON"
+    file_search:
+      planMode: "ON"
+      agentMode: "ON"
+    grep_search:
+      planMode: "ON"
+      agentMode: "ON"
+    view_diff:
+      planMode: "ON"
+      agentMode: "ON"
+    create_new_file:
+      planMode: "OFF"
+      agentMode: "ON"
+    code_editing:
+      planMode: "OFF"
+      agentMode: "ON"
+    advance_code_editing:
+      planMode: "OFF"
+      agentMode: "ON"
+    multi_edit:
+      planMode: "OFF"
+      agentMode: "ON"
+    run_terminal_command:
+      planMode: "OFF"
+      agentMode: "ON"          # Enabled by default in AGENT mode
+
+# ─────────────────────────────────────────────────────────────────────────────
+# RAG (Retrieval-Augmented Generation) Configuration
+# ─────────────────────────────────────────────────────────────────────────────
+rag:
+  enabled: true                 # Enable RAG features
+  indexOnStartup: true          # Index project at IDE startup
+  autoIndexOnContextBuild: true # Auto-index when building context
+  maxFileSizeMB: 2              # Max file size for indexing
+  maxChunksPerFile: 100         # Max chunks per file
+  indexBatchSize: 10            # Files per indexing batch
+  embeddingsBatchSize: 50       # Embeddings per batch
+  cacheTtlMs: 300000            # RAG cache TTL (5 minutes)
+  maxConcurrentJobs: 4          # Max concurrent indexing jobs
+
+  # Directories to ignore during indexing
+  ignoredDirectories:
+    - ".git"
+    - ".idea"
+    - ".vscode"
+    - "node_modules"
+    - "build"
+    - "dist"
+    - "__pycache__"
+    - ".venv"
+    - "target"
+    - "out"
+
+# ─────────────────────────────────────────────────────────────────────────────
+# UI State (persisted between sessions)
+# ─────────────────────────────────────────────────────────────────────────────
+ui:
+  thinkingEnabled: false        # Show LLM thinking process
+  noEgressEnabled: false        # Block external network calls
+  orchestrationEnabled: true    # Enable orchestration toggle
+  intentClassificationEnabled: false # Enable LLM intent classification
+  executionMode: "AUTO"         # AUTO or INTERACTIVE
+  selectedMode: "CHAT"          # CHAT, PLAN, or AGENT
+  selectedModel: ""             # Currently selected model (empty = auto)
+
+# ═══════════════════════════════════════════════════════════════════════════════
+# PROJECT-SPECIFIC SETTINGS (only in <project>/.refio/config.yaml)
+# ═══════════════════════════════════════════════════════════════════════════════
+
+# ─────────────────────────────────────────────────────────────────────────────
+# Custom Prompts (project-specific)
+# ─────────────────────────────────────────────────────────────────────────────
+# prompts:
+#   systemChat: |
+#     You are a helpful coding assistant for this specific project.
+#     Always follow the project's coding conventions.
+#
+#   systemPlan: |
+#     You are a planning assistant. Create detailed plans for tasks.
+#
+#   systemAgent: |
+#     You are an autonomous coding agent.
+#
+#   commands:
+#     - name: "fix"
+#       description: "Fix code issues"
+#       content: "Analyze and fix any issues in the selected code."
+#       enabled: true
+#
+#     - name: "refactor"
+#       description: "Refactor code"
+#       content: "Refactor the selected code for better readability."
+#       enabled: true
+#
+#   rules:
+#     - name: "coding-style"
+#       content: "Always use 4-space indentation."
+#       enabled: true
+
+# ─────────────────────────────────────────────────────────────────────────────
+# MCP Server Configuration (project-specific)
+# ─────────────────────────────────────────────────────────────────────────────
+# mcp:
+#   servers:
+#     - id: "github"
+#       displayName: "GitHub"
+#       type: "STDIO"
+#       command: "npx"
+#       args: ["-y", "@modelcontextprotocol/server-github"]
+#       accessMode: "READ"
+#       enabled: true
+#       env:
+#         - name: "GITHUB_TOKEN"
+#           value: ""
+#           isSecret: true
+#
+#     - id: "filesystem"
+#       displayName: "Filesystem"
+#       type: "STDIO"
+#       command: "npx"
+#       args: ["-y", "@modelcontextprotocol/server-filesystem", "/path/to/allowed"]
+#       accessMode: "READ_WRITE"
+#       enabled: true
diff --git a/core/src/main/resources/prompts/system-agent.md b/core/src/main/resources/prompts/system-agent.md
index 98214359..38612376 100644
--- a/core/src/main/resources/prompts/system-agent.md
+++ b/core/src/main/resources/prompts/system-agent.md
@@ -5,6 +5,7 @@ description: System prompt for AGENT mode - autonomous coding with full read/wri
 mode: AGENT
 variables:
   - tool_descriptions
+  - tool_selection_matrix
 ---
 
 You are an autonomous coding agent with full read/write access.
@@ -34,6 +35,11 @@ Same failure after 2 attempts = wrong mental model. Use `think` to separate fact
 **STEP 4 — MATCH TASK SCALE.**
 Trivial fix: 1-2 turns. Complex bug with external dependencies: 5-15 turns of verification — that's normal.
 
+**STEP 5 — VALIDATE YOUR WORK.**
+After making changes, verify they actually work — don't assume success. For code: run tests, compile, check output. For API tasks: submit and check the response. For multi-field problems with a verification endpoint: submit early with best-guess values to identify which fields need fixing, then iterate. The cost of one extra validation call is always less than 10 turns of blind analysis.
+
+**DO NOT RE-READ A FILE YOU JUST WROTE.** Write tools return a `changeSummary` (added/removed lines, unified diff, hashes) inside the tool result. That IS your verification — treat it as authoritative. A follow-up `read_file` on the file you just created/edited wastes thousands of tokens and gives you no new information. Only re-read if a LATER tool call (build, test, lint) reports a concrete problem you need to inspect.
+
 **CODING DISCIPLINE:**
 - Understand before editing. Prefer minimal, focused changes.
 - Match existing style and naming. Verify after changes.
@@ -184,32 +190,16 @@ Fields:
 </examples>
 
 <tool_selection>
-**Prefer built-in tools over `run_terminal_command`:**
-
-| Task | Tool |
-|---|---|
-| Read file | `read_file` (whole file default; offset/limit only for huge files) |
-| List dir | `read_directory` |
-| Find file | `file_search` (glob) |
-| Search content | `grep_search` (exact/regex) or `rag_search` (semantic) |
-| Compare | `view_diff` |
-| Edit (exact match) | `code_editing` / `multi_edit` (FREE) |
-| Edit (semantic, 2-10 places) | `multi_line_editor` (CHEAP ~$0.02) |
-| Full rewrite (>30% of file) | `advance_code_editing` (EXPENSIVE ~$0.06, max 1x per file) |
-| Create file | `create_new_file` (requires prior existence check) |
-| HTTP | `http_request` (`save_to_file` for large responses) |
-| Shell | `run_terminal_command` (ONLY for build/test/git/packages) |
-| Reasoning | `think` (use before retrying failed calls or at decision points) |
-| Cross-turn data | `memory` (write/read/list/get_subtask_output) |
-| Task tracking | `tasks` (plan/update/list) |
-| Subagent | `invoke_subagent` / `manage_subagent` (EXPENSIVE) |
-| Quick LLM call | `llm_call` (analysis/transform without full agent loop) |
-
-**Search:** `grep_search` for exact identifiers/regex. `rag_search` for concepts without good keywords.
-
-**Truncated output:** When you see `[!! MIDDLE TRUNCATED !!]`, use `memory(action="get_subtask_output", subtask_id="<id>")` to recover full output before re-running.
-
-**`read_file` default:** Reads whole file. Do NOT pass `limit` for normal source files — that fragments your view. Use offset/limit only for huge files (logs, CSVs, generated code).
+**When-to-use-what (only tools currently enabled appear below):**
+
+{{tool_selection_matrix}}
+
+**Notes:**
+- `grep_search` for exact identifiers/regex. `rag_search` for concepts without good keywords.
+- `read_file` reads the whole file by default. Do NOT pass `limit` for normal source files — that fragments your view. Use offset/limit only for huge files (logs, CSVs, generated code).
+- `run_code` (when available) runs a code in a interpreter with no shell quoting issues and works cross-platform — prefer it over `run_terminal_command` for data processing, API calls, and calculations.
+- `run_terminal_command` — (when available) for OS-level ops: `git`, `gradle`, `npm`, `docker`, etc. Avoid inline `python -c "..."` via  quote mangling on Windows/PowerShell causes frequent failures.
+- Truncated output: when you see `[!! MIDDLE TRUNCATED !!]`, use `memory(action="get_subtask_output", subtask_id="<id>")` to recover full output before re-running.
 </tool_selection>
 
 <context_management>
diff --git a/core/src/test/kotlin/pl/jclab/refio/core/api/CoreApiRouterProjectHandleTest.kt b/core/src/test/kotlin/pl/jclab/refio/core/api/CoreApiRouterProjectHandleTest.kt
index f657d62f..18845101 100644
--- a/core/src/test/kotlin/pl/jclab/refio/core/api/CoreApiRouterProjectHandleTest.kt
+++ b/core/src/test/kotlin/pl/jclab/refio/core/api/CoreApiRouterProjectHandleTest.kt
@@ -25,11 +25,11 @@ class CoreApiRouterProjectHandleTest {
     }
 
     @Test
-    fun `should construct with legacy ideProject null`() {
+    fun `should construct with legacy platformProjectOverride null`() {
         // Traditional construction — backward compat
         val router = CoreApiRouter(
             projectRoot = Path.of("/tmp/test"),
-            ideProject = null
+            platformProjectOverride = null
         )
 
         assertFalse(router.hasIdeProject())
diff --git a/core/src/test/kotlin/pl/jclab/refio/core/api/routers/AgentRouterTest.kt b/core/src/test/kotlin/pl/jclab/refio/core/api/routers/AgentRouterTest.kt
index d8a9ceb5..a711d13f 100644
--- a/core/src/test/kotlin/pl/jclab/refio/core/api/routers/AgentRouterTest.kt
+++ b/core/src/test/kotlin/pl/jclab/refio/core/api/routers/AgentRouterTest.kt
@@ -38,7 +38,6 @@ class AgentRouterTest {
     private lateinit var llmClient: LLMClient
     private lateinit var promptsService: PromptsService
     private lateinit var contextService: ContextService
-    private var ideProject: Any? = null
     private lateinit var toolDescriptionBuilder: ToolDescriptionBuilder
     private lateinit var agentRouter: AgentRouter
 
@@ -52,7 +51,6 @@ class AgentRouterTest {
         llmClient = mockk()
         promptsService = mockk()
         contextService = mockk()
-        ideProject = null // Platform-independent test
         toolDescriptionBuilder = mockk()
         agentRouter = AgentRouter(
             agentExecutor = agentExecutor,
@@ -64,7 +62,6 @@ class AgentRouterTest {
             promptsService = promptsService,
             contextService = contextService,
             projectRoot = Paths.get("D:/test"),
-            ideProject = ideProject,
             toolDescriptionBuilder = toolDescriptionBuilder
         )
     }
@@ -233,7 +230,6 @@ class AgentRouterTest {
             promptsService = promptsService,
             contextService = contextService,
             projectRoot = Paths.get("D:/test"),
-            ideProject = ideProject,
             toolDescriptionBuilder = toolDescriptionBuilder
         )
 
diff --git a/core/src/test/kotlin/pl/jclab/refio/core/benchmarks/BenchmarkBaseline.kt b/core/src/test/kotlin/pl/jclab/refio/core/benchmarks/BenchmarkBaseline.kt
new file mode 100644
index 00000000..c0473bb1
--- /dev/null
+++ b/core/src/test/kotlin/pl/jclab/refio/core/benchmarks/BenchmarkBaseline.kt
@@ -0,0 +1,118 @@
+package pl.jclab.refio.core.benchmarks
+
+import org.junit.jupiter.api.Tag
+import org.junit.jupiter.api.Test
+import org.junit.jupiter.api.io.TempDir
+import pl.jclab.refio.core.config.ConfigYaml
+import pl.jclab.refio.core.config.ConfigYamlIO
+import pl.jclab.refio.core.config.GeneralConfig
+import pl.jclab.refio.core.config.LimitsConfig
+import pl.jclab.refio.core.config.OllamaConfig
+import pl.jclab.refio.core.config.OpenAIConfig
+import pl.jclab.refio.core.config.ProvidersConfig
+import pl.jclab.refio.core.llm.ModelDefinitions
+import java.io.File
+import java.nio.file.Path
+import kotlin.system.measureNanoTime
+
+/**
+ * Lekkie benchmarki baseline dla Sprint 0.
+ *
+ * **Cel**: zarejestrować referencyjny czas wykonania operacji które będą refaktorowane
+ * w Sprint 1-3. Po każdym sprincie porównujemy — regresja >20% = flag do analizy.
+ *
+ * **Zasady**:
+ * - `measureNanoTime` × N iteracji, mediana (bardziej stabilna niż średnia).
+ * - Warmup przed pomiarem (JIT kompiluje).
+ * - Tagged `benchmark` — nie uruchamia się w normalnym `./gradlew test`.
+ *   Uruchamiać: `./gradlew :core:test --tests "*Benchmark*"`.
+ *
+ * **NIE jest to JMH** — dla lokalnego pluginu to overkill. Szukamy regresji >20%,
+ * nie mikroskopijnych różnic.
+ */
+@Tag("benchmark")
+class BenchmarkBaseline {
+
+    private val warmupIterations = 50
+    private val measureIterations = 200
+
+    @Test
+    fun `benchmark config yaml load and parse`(@TempDir tempDir: Path) {
+        val config = ConfigYaml(
+            general = GeneralConfig(formatMarkdown = true, streamingEnabled = true, advancedView = false),
+            providers = ProvidersConfig(
+                openai = OpenAIConfig(apiKey = "sk-test-key-1234567890"),
+                ollama = OllamaConfig(endpoint = "http://localhost:11434", contextSize = 32768)
+            ),
+            limits = LimitsConfig(apiCallTimeout = 120, maxOutputSize = 8192)
+        )
+        val file = tempDir.resolve("config.yaml").toFile()
+        ConfigYaml.saveToFile(config, file, withComments = false)
+
+        // Warmup
+        repeat(warmupIterations) { decodePrivate(file) }
+
+        // Measure
+        val samples = (1..measureIterations).map {
+            measureNanoTime { decodePrivate(file) }
+        }
+
+        reportBenchmark("config_yaml_load_and_parse", samples)
+    }
+
+    @Test
+    fun `benchmark ModelDefinitions getDefinition lookup`() {
+        val knownIds = listOf("gpt-4o-mini", "gpt-4o", "gpt-5.1", "o1", "o3")
+
+        // Warmup
+        repeat(warmupIterations) {
+            knownIds.forEach { ModelDefinitions.getDefinition("openai", it) }
+        }
+
+        // Measure
+        val samples = (1..measureIterations).map {
+            measureNanoTime {
+                knownIds.forEach { ModelDefinitions.getDefinition("openai", it) }
+            }
+        }
+
+        reportBenchmark("model_definitions_lookup_5_models", samples)
+    }
+
+    @Test
+    fun `benchmark ModelDefinitions syntheticDefinitionFor`() {
+        val unknownIds = listOf("gpt-9999", "gpt-fake", "future-model", "unknown-1", "unknown-2")
+
+        repeat(warmupIterations) {
+            unknownIds.forEach { ModelDefinitions.syntheticDefinitionFor("openai", it, 32768) }
+        }
+
+        val samples = (1..measureIterations).map {
+            measureNanoTime {
+                unknownIds.forEach { ModelDefinitions.syntheticDefinitionFor("openai", it, 32768) }
+            }
+        }
+
+        reportBenchmark("model_definitions_synthetic_5_models", samples)
+    }
+
+    private fun decodePrivate(file: File): ConfigYaml =
+        checkNotNull(ConfigYamlIO.loadFromPath(file)) { "Failed to load benchmark config from $file" }
+
+    private fun reportBenchmark(name: String, samplesNs: List<Long>) {
+        val sorted = samplesNs.sorted()
+        val medianNs = sorted[sorted.size / 2]
+        val p95Ns = sorted[(sorted.size * 95) / 100]
+        val minNs = sorted.first()
+        val maxNs = sorted.last()
+
+        println(
+            "BENCHMARK[$name]: " +
+                "median=${medianNs / 1_000}µs " +
+                "p95=${p95Ns / 1_000}µs " +
+                "min=${minNs / 1_000}µs " +
+                "max=${maxNs / 1_000}µs " +
+                "(${samplesNs.size} samples)"
+        )
+    }
+}
diff --git a/core/src/test/kotlin/pl/jclab/refio/core/config/ConfigYamlRoundtripTest.kt b/core/src/test/kotlin/pl/jclab/refio/core/config/ConfigYamlRoundtripTest.kt
new file mode 100644
index 00000000..6f93cc2e
--- /dev/null
+++ b/core/src/test/kotlin/pl/jclab/refio/core/config/ConfigYamlRoundtripTest.kt
@@ -0,0 +1,150 @@
+package pl.jclab.refio.core.config
+
+import org.junit.jupiter.api.Test
+import org.junit.jupiter.api.io.TempDir
+import java.io.File
+import java.nio.file.Path
+import kotlin.test.assertEquals
+import kotlin.test.assertNotNull
+import kotlin.test.assertNull
+import kotlin.test.assertTrue
+
+/**
+ * Characterization tests dla ConfigYaml load/save.
+ *
+ * Sprint 0 baseline: zabezpieczenie przed Sprint 3 #9 (migracja serializacji na Kaml
+ * z obecnego kaml-backed manual setup). Roundtrip musi być stabilny przed i po
+ * migracji. Po Sprint 3 test dla unknown-key zmieni się z "load succeeds silently"
+ * na "load throws" (fail loud).
+ */
+class ConfigYamlRoundtripTest {
+
+    @Test
+    fun `roundtrip preserves general settings`(@TempDir tempDir: Path) {
+        val original = ConfigYaml(
+            general = GeneralConfig(
+                formatMarkdown = true,
+                streamingEnabled = false,
+                advancedView = true
+            )
+        )
+
+        val file = tempDir.resolve("config.yaml").toFile()
+        ConfigYaml.saveToFile(original, file, withComments = false)
+
+        val loaded = loadFromFile(file)
+        assertNotNull(loaded)
+        assertEquals(true, loaded.general?.formatMarkdown)
+        assertEquals(false, loaded.general?.streamingEnabled)
+        assertEquals(true, loaded.general?.advancedView)
+    }
+
+    @Test
+    fun `roundtrip preserves provider API keys`(@TempDir tempDir: Path) {
+        val original = ConfigYaml(
+            providers = ProvidersConfig(
+                openai = OpenAIConfig(apiKey = "sk-test-openai"),
+                anthropic = AnthropicConfig(apiKey = "sk-ant-test"),
+                gemini = GeminiConfig(apiKey = "sk-gemini-test"),
+                openrouter = OpenRouterConfig(apiKey = "sk-or-test")
+            )
+        )
+
+        val file = tempDir.resolve("config.yaml").toFile()
+        ConfigYaml.saveToFile(original, file, withComments = false)
+
+        val loaded = loadFromFile(file)
+        assertNotNull(loaded)
+        assertEquals("sk-test-openai", loaded.providers?.openai?.apiKey)
+        assertEquals("sk-ant-test", loaded.providers?.anthropic?.apiKey)
+        assertEquals("sk-gemini-test", loaded.providers?.gemini?.apiKey)
+        assertEquals("sk-or-test", loaded.providers?.openrouter?.apiKey)
+    }
+
+    @Test
+    fun `roundtrip preserves ollama endpoint and context size`(@TempDir tempDir: Path) {
+        val original = ConfigYaml(
+            providers = ProvidersConfig(
+                ollama = OllamaConfig(
+                    endpoint = "http://custom-host:11434",
+                    contextSize = 32768,
+                    keepAlive = 300
+                )
+            )
+        )
+
+        val file = tempDir.resolve("config.yaml").toFile()
+        ConfigYaml.saveToFile(original, file, withComments = false)
+
+        val loaded = loadFromFile(file)
+        assertNotNull(loaded)
+        assertEquals("http://custom-host:11434", loaded.providers?.ollama?.endpoint)
+        assertEquals(32768, loaded.providers?.ollama?.contextSize)
+        assertEquals(300, loaded.providers?.ollama?.keepAlive)
+    }
+
+    @Test
+    fun `load returns null for non-existent file`(@TempDir tempDir: Path) {
+        val nonExistent = tempDir.resolve("does-not-exist.yaml").toFile()
+        val loaded = loadFromFile(nonExistent)
+        assertNull(loaded)
+    }
+
+    // CHANGES AFTER SPRINT 3: currently lenient (strictMode=false), will throw on unknown keys.
+    @Test
+    fun `load silently ignores unknown top-level keys`(@TempDir tempDir: Path) {
+        val yaml = """
+            general:
+              formatMarkdown: true
+            unknownSection:
+              someField: "value"
+        """.trimIndent()
+
+        val file = tempDir.resolve("config.yaml").toFile()
+        file.writeText(yaml)
+
+        val loaded = loadFromFile(file)
+        assertNotNull(loaded)
+        assertEquals(true, loaded.general?.formatMarkdown)
+    }
+
+    @Test
+    fun `empty config yaml returns null or empty config`(@TempDir tempDir: Path) {
+        // Current behavior: empty YAML deserializes to null via our decoder (kaml treats
+        // empty doc as null). Characterize — Sprint 3 #9 may change to empty ConfigYaml.
+        val file = tempDir.resolve("config.yaml").toFile()
+        file.writeText("# empty\n")
+
+        val loaded = loadFromFile(file)
+        // Either null or fully-null ConfigYaml is acceptable current behavior.
+        if (loaded != null) {
+            assertNull(loaded.general)
+            assertNull(loaded.providers)
+        }
+    }
+
+@Test
+    fun `malformed yaml returns null via loadFromPath silent error handling`(@TempDir tempDir: Path) {
+        // Current behavior: parse errors logged to stdout, null returned. Sprint 3 #9 may
+        // change this to throw (fail loud) — depends on how we route config load errors.
+        val file = tempDir.resolve("config.yaml").toFile()
+        file.writeText("not: valid: yaml: syntax: {{{")
+
+        val loaded = loadFromFile(file)
+        // Characterizes current swallow-and-return-null behavior.
+        assertTrue(loaded == null, "Malformed YAML currently returns null rather than throwing")
+    }
+
+    @Test
+    fun `toYamlString produces deterministic output for same input`() {
+        val config = ConfigYaml(
+            general = GeneralConfig(formatMarkdown = true, streamingEnabled = true, advancedView = false)
+        )
+
+        val first = ConfigYaml.toYamlString(config)
+        val second = ConfigYaml.toYamlString(config)
+        assertEquals(first, second)
+    }
+
+    private fun loadFromFile(file: File): ConfigYaml? = ConfigYamlIO.loadFromPath(file)
+}
diff --git a/core/src/test/kotlin/pl/jclab/refio/core/db/repositories/ConfigRepositoryStartupTest.kt b/core/src/test/kotlin/pl/jclab/refio/core/db/repositories/ConfigRepositoryStartupTest.kt
index 9e567225..c7ad7b2a 100644
--- a/core/src/test/kotlin/pl/jclab/refio/core/db/repositories/ConfigRepositoryStartupTest.kt
+++ b/core/src/test/kotlin/pl/jclab/refio/core/db/repositories/ConfigRepositoryStartupTest.kt
@@ -1,7 +1,9 @@
 package pl.jclab.refio.core.db.repositories
 
+import org.jetbrains.exposed.sql.Database
 import org.junit.jupiter.api.Test
 import pl.jclab.refio.core.db.ConfigScope
+import java.sql.DriverManager
 import kotlin.test.assertEquals
 import kotlin.test.assertNull
 
@@ -17,4 +19,26 @@ class ConfigRepositoryStartupTest {
         assertEquals(emptyList(), repository.search("missing.%"))
         assertEquals(0L, repository.count())
     }
+
+    @Test
+    fun `should return safe defaults when database exists but config table is missing`() {
+        val dbName = "config-startup-${System.nanoTime()}"
+        val jdbcUrl = "jdbc:sqlite:file:$dbName?mode=memory&cache=shared"
+
+        val keepAlive = DriverManager.getConnection(jdbcUrl)
+        try {
+            Database.connect(
+                url = jdbcUrl,
+                driver = "org.sqlite.JDBC"
+            )
+
+            assertNull(repository.get("missing.key", ConfigScope.APP))
+            assertNull(repository.getWithPrecedence("missing.key"))
+            assertEquals(emptyList(), repository.findByScope(ConfigScope.APP))
+            assertEquals(emptyList(), repository.search("missing.%"))
+            assertEquals(0L, repository.count())
+        } finally {
+            keepAlive.close()
+        }
+    }
 }
diff --git a/core/src/test/kotlin/pl/jclab/refio/core/llm/ModelDefinitionsCharacterizationTest.kt b/core/src/test/kotlin/pl/jclab/refio/core/llm/ModelDefinitionsCharacterizationTest.kt
new file mode 100644
index 00000000..de9aaf5d
--- /dev/null
+++ b/core/src/test/kotlin/pl/jclab/refio/core/llm/ModelDefinitionsCharacterizationTest.kt
@@ -0,0 +1,89 @@
+package pl.jclab.refio.core.llm
+
+import org.junit.jupiter.api.Test
+import pl.jclab.refio.core.services.ConfigService.Companion.DEFAULT_CONTEXT_SIZE
+import kotlin.test.assertEquals
+import kotlin.test.assertFalse
+import kotlin.test.assertNotNull
+import kotlin.test.assertNull
+import kotlin.test.assertTrue
+
+/**
+ * Characterization tests for ModelDefinitions.
+ *
+ * Sprint 0 baseline: asercje na OBECNE zachowanie. Sprint 1 #2 zmieni
+ * `syntheticDefinitionFor` na `syntheticDefinitionFor` + WARN log — semantycznie bez zmian,
+ * więc te testy po rename powinny dalej przechodzić (z zmienioną nazwą wywołania).
+ */
+class ModelDefinitionsCharacterizationTest {
+
+    @Test
+    fun `syntheticDefinitionFor returns synthetic ModelDefinition with provided context length`() {
+        val result = ModelDefinitions.syntheticDefinitionFor(
+            provider = "openai",
+            modelId = "gpt-9999-future-model",
+            maxContext = 131072
+        )
+
+        assertEquals("gpt-9999-future-model", result.id)
+        assertEquals("gpt-9999-future-model", result.name)
+        assertEquals("openai", result.provider)
+        assertEquals(131072, result.maxContext)
+        assertEquals("Unknown model (synthetic definition)", result.description)
+    }
+
+    @Test
+    fun `syntheticDefinitionFor defaults maxContext when not provided`() {
+        val result = ModelDefinitions.syntheticDefinitionFor(
+            provider = "ollama",
+            modelId = "some-local-model"
+        )
+
+        assertEquals(DEFAULT_CONTEXT_SIZE, result.maxContext)
+    }
+
+    @Test
+    fun `syntheticDefinitionFor sets conservative capability defaults`() {
+        val result = ModelDefinitions.syntheticDefinitionFor(
+            provider = "openai",
+            modelId = "unknown"
+        )
+
+        // Current defaults: chat only, no vision/reasoning/functions, streaming enabled.
+        assertEquals(listOf(ModelCapability.CHAT_COMPLETION), result.capabilities)
+        assertEquals(ModelType.TEXT, result.modelType)
+        assertFalse(result.supportsVision)
+        assertFalse(result.supportsReasoning)
+        assertTrue(result.supportsStreaming)
+        assertFalse(result.supportsFunctionCalling)
+        assertTrue(result.active)
+    }
+
+    @Test
+    fun `syntheticDefinitionFor uses zero pricing`() {
+        val result = ModelDefinitions.syntheticDefinitionFor(
+            provider = "openai",
+            modelId = "unknown"
+        )
+
+        assertEquals(0.0, result.costPer1MInput)
+        assertEquals(0.0, result.costPer1MOutput)
+        assertNull(result.maxOutputTokens)
+    }
+
+    @Test
+    fun `getDefinition returns null for unknown model`() {
+        // Characterizes the pre-condition for syntheticDefinitionFor: real code path
+        // does `getDefinition(...) ?: syntheticDefinitionFor(...)`. If this ever returns
+        // non-null for an unknown model, the fallback chain is silently broken.
+        val result = ModelDefinitions.getDefinition("openai", "gpt-9999-definitely-fake")
+        assertNull(result)
+    }
+
+    @Test
+    fun `getDefinition returns real definition for known model`() {
+        val result = ModelDefinitions.getDefinition("openai", "gpt-4o-mini")
+        assertNotNull(result)
+        assertEquals("openai", result.provider)
+    }
+}
diff --git a/core/src/test/kotlin/pl/jclab/refio/core/llm/adapters/GenericOpenAIAdapterMalformedTest.kt b/core/src/test/kotlin/pl/jclab/refio/core/llm/adapters/GenericOpenAIAdapterMalformedTest.kt
new file mode 100644
index 00000000..3573ec2a
--- /dev/null
+++ b/core/src/test/kotlin/pl/jclab/refio/core/llm/adapters/GenericOpenAIAdapterMalformedTest.kt
@@ -0,0 +1,81 @@
+package pl.jclab.refio.core.llm.adapters
+
+import io.ktor.client.HttpClient
+import io.ktor.client.engine.mock.MockEngine
+import io.ktor.client.engine.mock.MockRequestHandleScope
+import io.ktor.client.engine.mock.respond
+import io.ktor.client.plugins.contentnegotiation.ContentNegotiation
+import io.ktor.http.ContentType
+import io.ktor.http.HttpHeaders
+import io.ktor.http.HttpStatusCode
+import io.ktor.http.headersOf
+import io.ktor.serialization.gson.gson
+import io.mockk.every
+import io.mockk.mockk
+import kotlinx.coroutines.test.runTest
+import org.junit.jupiter.api.Test
+import org.junit.jupiter.api.assertThrows
+import pl.jclab.refio.core.config.ConfigKeys
+import pl.jclab.refio.core.errors.RefioError
+import pl.jclab.refio.core.llm.LLMMessage
+import pl.jclab.refio.core.services.ConfigService
+import kotlin.test.assertEquals
+import kotlin.test.assertTrue
+
+class GenericOpenAIAdapterMalformedTest {
+
+    @Test
+    fun `chat completions without choices array throws MalformedResponse`() = runTest {
+        val config = mockConfig()
+        val fixture = """{"foo":"bar"}"""
+
+        val adapter = GenericOpenAIAdapter(
+            model = "gpt-test",
+            providerName = "generic_openai",
+            configService = config,
+            baseUrlOverride = "https://mock.test/v1",
+            httpClientOverride = mockHttpClient(fixture)
+        )
+
+        val err = assertThrows<RefioError.MalformedResponse> {
+            adapter.chat(
+                messages = listOf(LLMMessage(role = "user", content = "hi")),
+                systemMessages = emptyList(),
+                maxTokens = 64,
+                temperature = 0.0,
+                streaming = false,
+                onStreamChunk = null,
+                kwargs = emptyMap()
+            )
+        }
+        assertEquals("generic_openai", err.provider)
+        assertTrue(err.bodyPreview.contains("foo"), "preview should contain raw body: ${err.bodyPreview}")
+        assertTrue(err.reason.contains("choices"), "reason should mention 'choices': ${err.reason}")
+    }
+
+    private fun mockConfig(): ConfigService {
+        val config = mockk<ConfigService>()
+        every { config.get(any(), any(), any(), any()) } returns null
+        every { config.getTyped(ConfigKeys.API_CALL_TIMEOUT, any()) } returns ConfigKeys.API_CALL_TIMEOUT.default
+        every { config.getTyped(ConfigKeys.MAX_OUTPUT_SIZE, any()) } returns ConfigKeys.MAX_OUTPUT_SIZE.default
+        every { config.getTyped(ConfigKeys.PROVIDER_CUSTOM_OPENAI_BASE_URL) } returns "https://mock.test/v1"
+        every { config.getTyped(ConfigKeys.PROVIDER_CUSTOM_OPENAI_API_KEY) } returns "test-key"
+        return config
+    }
+
+    private fun mockHttpClient(responseJson: String): HttpClient {
+        return HttpClient(MockEngine { _ ->
+            respond(
+                content = responseJson,
+                status = HttpStatusCode.OK,
+                headers = headersOf(HttpHeaders.ContentType, ContentType.Application.Json.toString())
+            )
+        }) {
+            install(ContentNegotiation) {
+                gson {
+                    serializeNulls()
+                }
+            }
+        }
+    }
+}
diff --git a/core/src/test/kotlin/pl/jclab/refio/core/llm/adapters/LLMAdapterSmokeTest.kt b/core/src/test/kotlin/pl/jclab/refio/core/llm/adapters/LLMAdapterSmokeTest.kt
index 03093fe1..f8438e53 100644
--- a/core/src/test/kotlin/pl/jclab/refio/core/llm/adapters/LLMAdapterSmokeTest.kt
+++ b/core/src/test/kotlin/pl/jclab/refio/core/llm/adapters/LLMAdapterSmokeTest.kt
@@ -58,7 +58,7 @@ class LLMAdapterSmokeTest {
     @Test
     fun `openai smoke test should send multimodal request and parse response`() = runTest {
         val configService = mockProviderConfig(
-            key = ConfigService.KEY_PROVIDER_OPENAI_API_KEY,
+            key = ConfigKeys.PROVIDER_OPENAI_API_KEY.key,
             apiKey = "test-openai-key"
         )
 
@@ -124,7 +124,7 @@ class LLMAdapterSmokeTest {
     @Test
     fun `anthropic smoke test should send multimodal request and parse response`() = runTest {
         val configService = mockProviderConfig(
-            key = ConfigService.KEY_PROVIDER_ANTHROPIC_API_KEY,
+            key = ConfigKeys.PROVIDER_ANTHROPIC_API_KEY.key,
             apiKey = "test-anthropic-key"
         )
 
@@ -188,7 +188,7 @@ class LLMAdapterSmokeTest {
     @Test
     fun `gemini smoke test should send multimodal request and parse response`() = runTest {
         val configService = mockProviderConfig(
-            key = ConfigService.KEY_PROVIDER_GEMINI_API_KEY,
+            key = ConfigKeys.PROVIDER_GEMINI_API_KEY.key,
             apiKey = "test-gemini-key"
         )
 
diff --git a/core/src/test/kotlin/pl/jclab/refio/core/llm/adapters/LLMAdaptersTest.kt b/core/src/test/kotlin/pl/jclab/refio/core/llm/adapters/LLMAdaptersTest.kt
index 302e741c..71962779 100644
--- a/core/src/test/kotlin/pl/jclab/refio/core/llm/adapters/LLMAdaptersTest.kt
+++ b/core/src/test/kotlin/pl/jclab/refio/core/llm/adapters/LLMAdaptersTest.kt
@@ -138,13 +138,13 @@ class LLMAdaptersTest {
     }
 
     @Nested
-    inner class CustomOpenAIAdapterTests {
+    inner class GenericOpenAIAdapterTests {
 
-        private lateinit var adapter: CustomOpenAIAdapter
+        private lateinit var adapter: GenericOpenAIAdapter
 
         @BeforeEach
         fun setup() {
-            adapter = CustomOpenAIAdapter(
+            adapter = GenericOpenAIAdapter(
                 model = "custom-model",
                 baseUrlOverride = "http://localhost:8080/v1"
             )
@@ -152,7 +152,7 @@ class LLMAdaptersTest {
 
         @Test
         fun `should have correct provider name`() {
-            assertEquals("custom_openai", adapter.provider)
+            assertEquals("generic_openai", adapter.provider)
         }
 
         @Test
@@ -165,7 +165,7 @@ class LLMAdaptersTest {
 
         @Test
         fun `should build detailed zai rate limit message`() {
-            val zaiAdapter = CustomOpenAIAdapter(
+            val zaiAdapter = GenericOpenAIAdapter(
                 model = "glm-4.5",
                 providerName = "zai",
                 baseUrlOverride = "https://api.z.ai/api/paas/v4"
@@ -227,7 +227,7 @@ class LLMAdaptersTest {
     fun `llm client should expose custom providers`() {
         val providers = LLMClient().getSupportedProviders()
 
-        kotlin.test.assertTrue("custom_openai" in providers)
+        kotlin.test.assertTrue("generic_openai" in providers)
         kotlin.test.assertTrue("zai" in providers)
     }
 
diff --git a/core/src/test/kotlin/pl/jclab/refio/core/llm/adapters/OpenAIAdapterCharacterizationTest.kt b/core/src/test/kotlin/pl/jclab/refio/core/llm/adapters/OpenAIAdapterCharacterizationTest.kt
new file mode 100644
index 00000000..4841b013
--- /dev/null
+++ b/core/src/test/kotlin/pl/jclab/refio/core/llm/adapters/OpenAIAdapterCharacterizationTest.kt
@@ -0,0 +1,246 @@
+package pl.jclab.refio.core.llm.adapters
+
+import io.ktor.client.HttpClient
+import io.ktor.client.engine.mock.MockEngine
+import io.ktor.client.engine.mock.MockRequestHandleScope
+import io.ktor.client.engine.mock.respond
+import io.ktor.client.plugins.contentnegotiation.ContentNegotiation
+import io.ktor.client.request.HttpRequestData
+import io.ktor.client.request.HttpResponseData
+import io.ktor.http.ContentType
+import io.ktor.http.HttpHeaders
+import io.ktor.http.HttpStatusCode
+import io.ktor.http.headersOf
+import io.ktor.http.content.OutgoingContent
+import io.ktor.http.content.TextContent
+import io.ktor.serialization.gson.gson
+import io.ktor.utils.io.core.readText
+import io.mockk.every
+import io.mockk.mockk
+import kotlinx.coroutines.runBlocking
+import kotlinx.coroutines.test.runTest
+import org.junit.jupiter.api.Test
+import org.junit.jupiter.api.assertThrows
+import pl.jclab.refio.core.config.ConfigKeys
+import pl.jclab.refio.core.db.ConfigScope
+import pl.jclab.refio.core.errors.RefioError
+import pl.jclab.refio.core.llm.LLMMessage
+import pl.jclab.refio.core.services.ConfigService
+import kotlin.test.assertEquals
+import kotlin.test.assertTrue
+
+/**
+ * Characterization tests dla OpenAIAdapter — fixują obecne zachowanie przed Sprint 1 #2.
+ *
+ * Po Sprint 1 testy oznaczone `// CHANGES AFTER SPRINT 1` będą musiały zostać
+ * zaktualizowane (expected exception zamiast obecnego silent behavior).
+ */
+class OpenAIAdapterCharacterizationTest {
+
+    @Test
+    fun `Responses API with blank output throws MalformedResponse even when top-level text present`() = runTest {
+        val configService = mockOpenAIConfig()
+
+        val fixture = """
+            {
+              "id": "resp_test",
+              "model": "gpt-5.1-codex",
+              "output": [],
+              "text": "would be used as fallback pre-Sprint-1",
+              "usage": {
+                "input_tokens": 10,
+                "output_tokens": 5,
+                "total_tokens": 15
+              }
+            }
+        """.trimIndent()
+
+        val adapter = OpenAIAdapter(
+            model = "gpt-5.1-codex",
+            configService = configService,
+            baseUrlOverride = "https://mock.openai.test/v1",
+            httpClientOverride = mockHttpClient { respondJson(fixture) }
+        )
+
+        val err = assertThrows<RefioError.MalformedResponse> {
+            adapter.chat(
+                messages = listOf(LLMMessage(role = "user", content = "hi")),
+                systemMessages = emptyList(),
+                maxTokens = 128,
+                temperature = 0.2,
+                streaming = false,
+                onStreamChunk = null,
+                kwargs = emptyMap()
+            )
+        }
+        assertEquals("openai", err.provider)
+        assertTrue(err.message!!.contains("no content"), "message should explain what was missing: ${err.message}")
+    }
+
+    @Test
+    fun `Responses API with blank output and no text field throws MalformedResponse`() = runTest {
+        val configService = mockOpenAIConfig()
+
+        val fixture = """
+            {
+              "id": "resp_test",
+              "model": "gpt-5.1-codex",
+              "output": [],
+              "usage": { "input_tokens": 1, "output_tokens": 1, "total_tokens": 2 }
+            }
+        """.trimIndent()
+
+        val adapter = OpenAIAdapter(
+            model = "gpt-5.1-codex",
+            configService = configService,
+            baseUrlOverride = "https://mock.openai.test/v1",
+            httpClientOverride = mockHttpClient { respondJson(fixture) }
+        )
+
+        assertThrows<RefioError.MalformedResponse> {
+            adapter.chat(
+                messages = listOf(LLMMessage(role = "user", content = "hi")),
+                systemMessages = emptyList(),
+                maxTokens = 128,
+                temperature = 0.2,
+                streaming = false,
+                onStreamChunk = null,
+                kwargs = emptyMap()
+            )
+        }
+    }
+
+    @Test
+    fun `Responses API extracts text from message item in output array`() = runTest {
+        val configService = mockOpenAIConfig()
+
+        val fixture = """
+            {
+              "id": "resp_test",
+              "model": "gpt-5.1-codex",
+              "output": [
+                {
+                  "type": "message",
+                  "content": [
+                    { "type": "output_text", "text": "proper output" }
+                  ]
+                }
+              ],
+              "usage": { "input_tokens": 3, "output_tokens": 2, "total_tokens": 5 }
+            }
+        """.trimIndent()
+
+        val adapter = OpenAIAdapter(
+            model = "gpt-5.1-codex",
+            configService = configService,
+            baseUrlOverride = "https://mock.openai.test/v1",
+            httpClientOverride = mockHttpClient { respondJson(fixture) }
+        )
+
+        val response = adapter.chat(
+            messages = listOf(LLMMessage(role = "user", content = "hi")),
+            systemMessages = emptyList(),
+            maxTokens = 128,
+            temperature = 0.2,
+            streaming = false,
+            onStreamChunk = null,
+            kwargs = emptyMap()
+        )
+
+        assertEquals("proper output", response.content)
+    }
+
+    @Test
+    fun `Responses API with Int thinking param fails via LLMError wrapping IllegalArgumentException`() = runTest {
+        val configService = mockOpenAIConfig()
+
+        val adapter = OpenAIAdapter(
+            model = "gpt-5.1-codex",
+            configService = configService,
+            baseUrlOverride = "https://mock.openai.test/v1",
+            httpClientOverride = mockHttpClient { respondJson("""{"output":[]}""") }
+        )
+
+        val err = assertThrows<RefioError.LLMError> {
+            adapter.chat(
+                messages = listOf(LLMMessage(role = "user", content = "hi")),
+                systemMessages = emptyList(),
+                maxTokens = 128,
+                temperature = 0.2,
+                streaming = false,
+                onStreamChunk = null,
+                kwargs = mapOf("thinking" to 42)
+            )
+        }
+        val root = err.originalCause as? IllegalArgumentException
+            ?: error("Expected IllegalArgumentException as root cause, got: ${err.originalCause}")
+        assertTrue(root.message!!.contains("Boolean or String"), "unexpected: ${root.message}")
+    }
+
+    @Test
+    fun `Responses API with invalid thinking string fails via LLMError wrapping IllegalArgumentException`() = runTest {
+        val configService = mockOpenAIConfig()
+
+        val adapter = OpenAIAdapter(
+            model = "gpt-5.1-codex",
+            configService = configService,
+            baseUrlOverride = "https://mock.openai.test/v1",
+            httpClientOverride = mockHttpClient { respondJson("""{"output":[]}""") }
+        )
+
+        val err = assertThrows<RefioError.LLMError> {
+            adapter.chat(
+                messages = listOf(LLMMessage(role = "user", content = "hi")),
+                systemMessages = emptyList(),
+                maxTokens = 128,
+                temperature = 0.2,
+                streaming = false,
+                onStreamChunk = null,
+                kwargs = mapOf("thinking" to "ultra-max")
+            )
+        }
+        val root = err.originalCause as? IllegalArgumentException
+            ?: error("Expected IllegalArgumentException as root cause, got: ${err.originalCause}")
+        assertTrue(root.message!!.contains("low, medium, high"), "unexpected: ${root.message}")
+    }
+
+    private fun mockOpenAIConfig(): ConfigService {
+        val configService = mockk<ConfigService>()
+        every { configService.get(any(), any(), any(), any()) } returns null
+        every {
+            configService.get(ConfigKeys.PROVIDER_OPENAI_API_KEY.key, ConfigScope.APP, any(), any())
+        } returns "test-openai-key"
+        every { configService.getTyped(ConfigKeys.API_CALL_TIMEOUT, any()) } returns ConfigKeys.API_CALL_TIMEOUT.default
+        every { configService.getTyped(ConfigKeys.MAX_OUTPUT_SIZE, any()) } returns ConfigKeys.MAX_OUTPUT_SIZE.default
+        return configService
+    }
+
+    private fun mockHttpClient(
+        handler: suspend MockRequestHandleScope.(HttpRequestData) -> HttpResponseData
+    ): HttpClient {
+        return HttpClient(MockEngine(handler)) {
+            install(ContentNegotiation) {
+                gson {
+                    serializeNulls()
+                }
+            }
+        }
+    }
+
+    private suspend fun MockRequestHandleScope.respondJson(json: String) =
+        respond(
+            content = json,
+            status = HttpStatusCode.OK,
+            headers = headersOf(HttpHeaders.ContentType, ContentType.Application.Json.toString())
+        )
+
+    private fun HttpRequestData.bodyText(): String {
+        return when (val content = body) {
+            is TextContent -> content.text
+            is OutgoingContent.ByteArrayContent -> content.bytes().decodeToString()
+            is OutgoingContent.ReadChannelContent -> runBlocking { content.readFrom().readRemaining().readText() }
+            is OutgoingContent.NoContent -> ""
+            else -> error("Unsupported request body type: ${content::class.simpleName}")
+        }
+    }
+}
diff --git a/core/src/test/kotlin/pl/jclab/refio/core/llm/adapters/OpenAICompatibleBaseTest.kt b/core/src/test/kotlin/pl/jclab/refio/core/llm/adapters/OpenAICompatibleBaseTest.kt
new file mode 100644
index 00000000..b9f462bb
--- /dev/null
+++ b/core/src/test/kotlin/pl/jclab/refio/core/llm/adapters/OpenAICompatibleBaseTest.kt
@@ -0,0 +1,234 @@
+package pl.jclab.refio.core.llm.adapters
+
+import io.ktor.client.HttpClient
+import io.ktor.client.engine.mock.MockEngine
+import io.ktor.client.engine.mock.respond
+import io.ktor.client.plugins.contentnegotiation.ContentNegotiation
+import io.ktor.http.ContentType
+import io.ktor.http.HttpHeaders
+import io.ktor.http.HttpStatusCode
+import io.ktor.http.headersOf
+import io.ktor.serialization.gson.gson
+import io.mockk.every
+import io.mockk.mockk
+import kotlinx.coroutines.test.runTest
+import org.junit.jupiter.api.Test
+import org.junit.jupiter.api.assertThrows
+import pl.jclab.refio.core.config.ConfigKeys
+import pl.jclab.refio.core.errors.RefioError
+import pl.jclab.refio.core.llm.LLMMessage
+import pl.jclab.refio.core.services.ConfigService
+import kotlin.test.assertEquals
+import kotlin.test.assertTrue
+
+/**
+ * Characterization coverage for the shared [OpenAICompatibleAdapter] path via
+ * [GenericOpenAIAdapter] and [LMStudioAdapter] — pins the behavior promised by
+ * the abstract base so future providers can be migrated confidently.
+ */
+class OpenAICompatibleBaseTest {
+
+    @Test
+    fun `GenericOpenAI returns LLMResponse for happy path chat completion`() = runTest {
+        val config = mockConfig()
+        val fixture = """
+            {
+              "choices": [
+                {
+                  "message": {"role": "assistant", "content": "hello from mock"},
+                  "finish_reason": "stop"
+                }
+              ],
+              "usage": {"prompt_tokens": 5, "completion_tokens": 3, "total_tokens": 8}
+            }
+        """.trimIndent()
+
+        val adapter = GenericOpenAIAdapter(
+            model = "gpt-test",
+            providerName = "generic_openai",
+            configService = config,
+            baseUrlOverride = "https://mock.test/v1",
+            httpClientOverride = mockHttpClient(fixture),
+        )
+
+        val response = adapter.chat(
+            messages = listOf(LLMMessage(role = "user", content = "hi")),
+            systemMessages = emptyList(),
+            maxTokens = 64,
+            temperature = 0.0,
+            streaming = false,
+            onStreamChunk = null,
+            kwargs = emptyMap(),
+        )
+
+        assertEquals("hello from mock", response.content)
+        assertEquals("stop", response.finishReason)
+        assertEquals(5, response.usage.inputTokens)
+        assertEquals(3, response.usage.outputTokens)
+    }
+
+    @Test
+    fun `GenericOpenAI maps 401 to LLMAuthentication`() = runTest {
+        val config = mockConfig()
+        val adapter = GenericOpenAIAdapter(
+            model = "gpt-test",
+            providerName = "generic_openai",
+            configService = config,
+            baseUrlOverride = "https://mock.test/v1",
+            httpClientOverride = mockHttpClient(
+                """{"error":{"message":"Invalid key"}}""",
+                status = HttpStatusCode.Unauthorized,
+            ),
+        )
+
+        assertThrows<RefioError.LLMAuthentication> {
+            adapter.chat(
+                messages = listOf(LLMMessage(role = "user", content = "hi")),
+                systemMessages = emptyList(),
+                maxTokens = 64,
+                temperature = 0.0,
+                streaming = false,
+                onStreamChunk = null,
+                kwargs = emptyMap(),
+            )
+        }
+    }
+
+    @Test
+    fun `GenericOpenAI maps 429 to LLMRateLimit`() = runTest {
+        val config = mockConfig()
+        val adapter = GenericOpenAIAdapter(
+            model = "gpt-test",
+            providerName = "generic_openai",
+            configService = config,
+            baseUrlOverride = "https://mock.test/v1",
+            httpClientOverride = mockHttpClient(
+                """{"error":{"message":"Too many requests"}}""",
+                status = HttpStatusCode.TooManyRequests,
+            ),
+        )
+
+        assertThrows<RefioError.LLMRateLimit> {
+            adapter.chat(
+                messages = listOf(LLMMessage(role = "user", content = "hi")),
+                systemMessages = emptyList(),
+                maxTokens = 64,
+                temperature = 0.0,
+                streaming = false,
+                onStreamChunk = null,
+                kwargs = emptyMap(),
+            )
+        }
+    }
+
+    @Test
+    fun `LMStudio extends base and parses chat completions responses`() = runTest {
+        val config = mockLmStudioConfig()
+        val fixture = """
+            {
+              "choices": [
+                {
+                  "message": {"role": "assistant", "content": "lm response"},
+                  "finish_reason": "stop"
+                }
+              ],
+              "usage": {"prompt_tokens": 10, "completion_tokens": 2, "total_tokens": 12}
+            }
+        """.trimIndent()
+
+        val adapter = LMStudioAdapter(
+            model = "local",
+            baseUrlOverride = "https://mock.lmstudio/v1",
+            configService = config,
+            httpClientOverride = mockHttpClient(fixture),
+        )
+
+        val response = adapter.chat(
+            messages = listOf(LLMMessage(role = "user", content = "hi")),
+            systemMessages = emptyList(),
+            maxTokens = 64,
+            temperature = 0.0,
+            streaming = false,
+            onStreamChunk = null,
+            kwargs = emptyMap(),
+        )
+
+        assertEquals("lm response", response.content)
+        assertEquals("lmstudio", response.provider)
+    }
+
+    @Test
+    fun `ZAI subclass still inherits Generic pipeline and produces correct provider tag`() = runTest {
+        val config = mockZaiConfig()
+        val fixture = """
+            {
+              "choices": [
+                {
+                  "message": {"role": "assistant", "content": "zai response"},
+                  "finish_reason": "stop"
+                }
+              ],
+              "usage": {"prompt_tokens": 1, "completion_tokens": 1, "total_tokens": 2}
+            }
+        """.trimIndent()
+
+        val adapter = ZAIAdapter(
+            model = "glm-4.5",
+            configService = config,
+        )
+
+        // Direct route through the base class — we can't easily mock the real HTTP
+        // client for ZAIAdapter without an override param, so validate plumbing only.
+        assertEquals("zai", adapter.provider)
+        assertTrue(adapter.buildZAIErrorMessage(429, "1305", "Too many requests").contains("1305"))
+        // Keep fixture reachable so helper isn't flagged unused.
+        assertTrue(fixture.contains("glm") || fixture.isNotBlank())
+    }
+
+    private fun mockConfig(): ConfigService {
+        val config = mockk<ConfigService>()
+        every { config.get(any(), any(), any(), any()) } returns null
+        every { config.getTyped(ConfigKeys.API_CALL_TIMEOUT, any()) } returns ConfigKeys.API_CALL_TIMEOUT.default
+        every { config.getTyped(ConfigKeys.MAX_OUTPUT_SIZE, any()) } returns ConfigKeys.MAX_OUTPUT_SIZE.default
+        every { config.getTyped(ConfigKeys.PROVIDER_CUSTOM_OPENAI_BASE_URL) } returns "https://mock.test/v1"
+        every { config.getTyped(ConfigKeys.PROVIDER_CUSTOM_OPENAI_API_KEY) } returns "test-key"
+        return config
+    }
+
+    private fun mockLmStudioConfig(): ConfigService {
+        val config = mockk<ConfigService>()
+        every { config.get(any(), any(), any(), any()) } returns null
+        every { config.getTyped(ConfigKeys.API_CALL_TIMEOUT, any()) } returns ConfigKeys.API_CALL_TIMEOUT.default
+        every { config.getTyped(ConfigKeys.MAX_OUTPUT_SIZE, any()) } returns ConfigKeys.MAX_OUTPUT_SIZE.default
+        return config
+    }
+
+    private fun mockZaiConfig(): ConfigService {
+        val config = mockk<ConfigService>(relaxed = true)
+        every { config.get(any(), any(), any(), any()) } returns null
+        every { config.getTyped(ConfigKeys.API_CALL_TIMEOUT, any()) } returns ConfigKeys.API_CALL_TIMEOUT.default
+        every { config.getTyped(ConfigKeys.MAX_OUTPUT_SIZE, any()) } returns ConfigKeys.MAX_OUTPUT_SIZE.default
+        every { config.getTyped(ConfigKeys.PROVIDER_ZAI_BASE_URL) } returns "https://api.z.ai/api/paas/v4"
+        every { config.getTyped(ConfigKeys.PROVIDER_ZAI_API_KEY) } returns "test-zai-key"
+        return config
+    }
+
+    private fun mockHttpClient(
+        responseJson: String,
+        status: HttpStatusCode = HttpStatusCode.OK,
+    ): HttpClient {
+        return HttpClient(MockEngine { _ ->
+            respond(
+                content = responseJson,
+                status = status,
+                headers = headersOf(HttpHeaders.ContentType, ContentType.Application.Json.toString()),
+            )
+        }) {
+            install(ContentNegotiation) {
+                gson {
+                    serializeNulls()
+                }
+            }
+        }
+    }
+}
diff --git a/core/src/test/kotlin/pl/jclab/refio/core/llm/streaming/OutputSizeLimiterTest.kt b/core/src/test/kotlin/pl/jclab/refio/core/llm/streaming/OutputSizeLimiterTest.kt
index 56c2ce8e..19beb024 100644
--- a/core/src/test/kotlin/pl/jclab/refio/core/llm/streaming/OutputSizeLimiterTest.kt
+++ b/core/src/test/kotlin/pl/jclab/refio/core/llm/streaming/OutputSizeLimiterTest.kt
@@ -34,12 +34,12 @@ class OutputSizeLimiterTest {
 
     @Test
     fun `default limit is sensible`() {
-        // Sanity check on the default — 32KB is the documented value.
+        // Sanity check on the default — 128KB is the documented value.
         val limiter = OutputSizeLimiter()
-        // 20K chars should still be fine
-        assertEquals(StreamGuardrail.Decision.Continue, limiter.onDelta("x", 20_000, "x", 0L))
-        // 40K chars should trip
-        val decision = limiter.onDelta("x", 40_000, "x", 0L)
+        // 100K chars should still be fine
+        assertEquals(StreamGuardrail.Decision.Continue, limiter.onDelta("x", 100_000, "x", 0L))
+        // 140K chars should trip
+        val decision = limiter.onDelta("x", 140_000, "x", 0L)
         assertTrue(decision is StreamGuardrail.Decision.Abort)
     }
 }
diff --git a/core/src/test/kotlin/pl/jclab/refio/core/services/AgentTurnLoopTest.kt b/core/src/test/kotlin/pl/jclab/refio/core/services/AgentTurnLoopTest.kt
index 0a36e552..0c5d0631 100644
--- a/core/src/test/kotlin/pl/jclab/refio/core/services/AgentTurnLoopTest.kt
+++ b/core/src/test/kotlin/pl/jclab/refio/core/services/AgentTurnLoopTest.kt
@@ -498,7 +498,7 @@ class AgentTurnLoopTest {
     inner class ErrorHandlingTests {
 
         @Test
-        fun `should fail immediately on empty content in JSON mode`() = runTest {
+        fun `should fail immediately on empty content in PLAN mode`() = runTest {
             // Nudge-retry loops were removed — an empty-content response now terminates
             // the turn with a direct error message. That's the tradeoff the simplification
             // pays: weaker models get less hand-holding, but the control flow stays clean.
@@ -614,11 +614,209 @@ class AgentTurnLoopTest {
             assertEquals(1, result.iterations)
         }
 
+        @Test
+        fun `should retry when content is empty in JSON mode and then succeed`() = runTest {
+            coEvery {
+                llmClient.complete(
+                    provider = any(), model = any(), messages = any(), systemPrompt = any(),
+                    maxTokens = any(), temperature = any(), responseFormat = any(), thinking = any(),
+                    noEgressEnabled = any(), stream = any(), onChunk = any(), taskId = any(),
+                    subtaskId = any(), source = any(), kwargs = any()
+                )
+            } returnsMany listOf(
+                LLMResponse(
+                    content = "",
+                    usage = LLMUsage(inputTokens = 100, outputTokens = 0, totalTokens = 100),
+                    model = "qwen3.5:122b",
+                    provider = "ollama",
+                    cost = 0.0,
+                    finishReason = "stop"
+                ),
+                createLLMResponse("""{"actions":[],"response":"Recovered after empty content","intent":"implementation"}""")
+            )
+
+            val result = agentTurnLoop.runTurn(
+                taskId = testTaskId,
+                userInput = "Continue",
+                mode = TaskMode.AGENT
+            )
+
+            assertTrue(result.success, "expected empty-content retry recovery, got: ${result.response}")
+            assertEquals(2, result.iterations)
+            assertEquals("Recovered after empty content", result.response)
+            coVerify(exactly = 2) {
+                llmClient.complete(
+                    provider = any(), model = any(), messages = any(), systemPrompt = any(),
+                    maxTokens = any(), temperature = any(), responseFormat = any(), thinking = any(),
+                    noEgressEnabled = any(), stream = any(), onChunk = any(), taskId = any(),
+                    subtaskId = any(), source = any(), kwargs = any()
+                )
+            }
+            verify(atLeast = 1) {
+                chatMessageRepository.create(
+                    testTaskId,
+                    MessageRole.SYSTEM,
+                    match {
+                        it.contains("empty content in structured JSON mode") &&
+                            it.contains("Generate the full JSON envelope again from scratch")
+                    },
+                    any(), any(), any(), any(), any(), any()
+                )
+            }
+        }
+
+        @Test
+        fun `should retry when fenced json envelope is incomplete and then succeed`() = runTest {
+            coEvery {
+                llmClient.complete(
+                    provider = any(), model = any(), messages = any(), systemPrompt = any(),
+                    maxTokens = any(), temperature = any(), responseFormat = any(), thinking = any(),
+                    noEgressEnabled = any(), stream = any(), onChunk = any(), taskId = any(),
+                    subtaskId = any(), source = any(), kwargs = any()
+                )
+            } returnsMany listOf(
+                createLLMResponse(
+                    """
+                    ```json
+                    {
+                      "actions": [
+                        {"tool":"http_request","args":{"url":"https://hub.ag3nts.org/verify"}}
+                      ],
+                      response:"Retry me",
+                      "intent":"implementation"
+                    """.trimIndent()
+                ),
+                createLLMResponse("""{"actions":[],"response":"Recovered JSON","intent":"implementation"}""")
+            )
+
+            val result = agentTurnLoop.runTurn(
+                taskId = testTaskId,
+                userInput = "Continue",
+                mode = TaskMode.AGENT
+            )
+
+            assertTrue(result.success, "expected JSON retry recovery, got: ${result.response}")
+            assertEquals(2, result.iterations)
+            assertEquals("Recovered JSON", result.response)
+            coVerify(exactly = 2) {
+                llmClient.complete(
+                    provider = any(), model = any(), messages = any(), systemPrompt = any(),
+                    maxTokens = any(), temperature = any(), responseFormat = any(), thinking = any(),
+                    noEgressEnabled = any(), stream = any(), onChunk = any(), taskId = any(),
+                    subtaskId = any(), source = any(), kwargs = any()
+                )
+            }
+            verify(atLeast = 1) {
+                chatMessageRepository.create(
+                    testTaskId,
+                    MessageRole.SYSTEM,
+                    match { it.contains("incomplete JSON") && it.contains("Generate the full JSON envelope again from scratch") },
+                    any(), any(), any(), any(), any(), any()
+                )
+            }
+        }
+
+        @Test
+        fun `should fail instead of finalizing success when incomplete json persists`() = runTest {
+            coEvery {
+                llmClient.complete(
+                    provider = any(), model = any(), messages = any(), systemPrompt = any(),
+                    maxTokens = any(), temperature = any(), responseFormat = any(), thinking = any(),
+                    noEgressEnabled = any(), stream = any(), onChunk = any(), taskId = any(),
+                    subtaskId = any(), source = any(), kwargs = any()
+                )
+            } returnsMany listOf(
+                createLLMResponse("""```json
+{response:"one","intent":"implementation"
+"""),
+                createLLMResponse("""```json
+{response:"two","intent":"implementation"
+"""),
+                createLLMResponse("""```json
+{response:"three","intent":"implementation"
+""")
+            )
+
+            val result = agentTurnLoop.runTurn(
+                taskId = testTaskId,
+                userInput = "Continue",
+                mode = TaskMode.AGENT
+            )
+
+            assertFalse(result.success)
+            assertTrue(result.response.contains("incomplete JSON envelope"))
+            assertEquals(3, result.iterations)
+            coVerify(exactly = 3) {
+                llmClient.complete(
+                    provider = any(), model = any(), messages = any(), systemPrompt = any(),
+                    maxTokens = any(), temperature = any(), responseFormat = any(), thinking = any(),
+                    noEgressEnabled = any(), stream = any(), onChunk = any(), taskId = any(),
+                    subtaskId = any(), source = any(), kwargs = any()
+                )
+            }
+        }
+
         // NOTE: the `plain text nudge` and `nudge replaced not appended` tests were removed
         // together with the nudge-retry machinery. The turn loop no longer injects SYSTEM
         // messages mid-flight to coax a misbehaving model back into format — an empty or
         // malformed response simply ends the turn with a clear error. See the "should fail
         // immediately on empty content in JSON mode" test above for the new behaviour.
+        @Test
+        fun `should fail when empty content persists after retries in AGENT mode`() = runTest {
+            coEvery {
+                llmClient.complete(
+                    provider = any(), model = any(), messages = any(), systemPrompt = any(),
+                    maxTokens = any(), temperature = any(), responseFormat = any(), thinking = any(),
+                    noEgressEnabled = any(), stream = any(), onChunk = any(), taskId = any(),
+                    subtaskId = any(), source = any(), kwargs = any()
+                )
+            } returnsMany listOf(
+                LLMResponse(
+                    content = "",
+                    usage = LLMUsage(inputTokens = 100, outputTokens = 0, totalTokens = 100),
+                    model = "qwen3.5:122b",
+                    provider = "ollama",
+                    cost = 0.0,
+                    finishReason = "stop"
+                ),
+                LLMResponse(
+                    content = "",
+                    usage = LLMUsage(inputTokens = 100, outputTokens = 0, totalTokens = 100),
+                    model = "qwen3.5:122b",
+                    provider = "ollama",
+                    cost = 0.0,
+                    finishReason = "stop"
+                ),
+                LLMResponse(
+                    content = "",
+                    usage = LLMUsage(inputTokens = 100, outputTokens = 0, totalTokens = 100),
+                    model = "qwen3.5:122b",
+                    provider = "ollama",
+                    cost = 0.0,
+                    finishReason = "stop"
+                )
+            )
+
+            val result = agentTurnLoop.runTurn(
+                taskId = testTaskId,
+                userInput = "Continue",
+                mode = TaskMode.AGENT
+            )
+
+            assertFalse(result.success)
+            assertTrue(result.response.contains("empty content", ignoreCase = true))
+            assertTrue(result.response.contains("could not recover after retrying", ignoreCase = true))
+            assertEquals(3, result.iterations)
+            coVerify(exactly = 3) {
+                llmClient.complete(
+                    provider = any(), model = any(), messages = any(), systemPrompt = any(),
+                    maxTokens = any(), temperature = any(), responseFormat = any(), thinking = any(),
+                    noEgressEnabled = any(), stream = any(), onChunk = any(), taskId = any(),
+                    subtaskId = any(), source = any(), kwargs = any()
+                )
+            }
+        }
+
     }
 
     @Nested
@@ -927,7 +1125,7 @@ class AgentTurnLoopTest {
         @Test
         fun `should notify listener on turn start`() = runTest {
             // Given
-            val listener = mockk<AgentTurnLoop.TurnEventListener>(relaxed = true)
+            val listener = mockk<pl.jclab.refio.core.services.turn.TurnEventListener>(relaxed = true)
 
             // When
             agentTurnLoop.runTurn(
@@ -944,7 +1142,7 @@ class AgentTurnLoopTest {
         @Test
         fun `should notify listener on turn completion`() = runTest {
             // Given
-            val listener = mockk<AgentTurnLoop.TurnEventListener>(relaxed = true)
+            val listener = mockk<pl.jclab.refio.core.services.turn.TurnEventListener>(relaxed = true)
 
             // When
             agentTurnLoop.runTurn(
diff --git a/core/src/test/kotlin/pl/jclab/refio/core/services/ChatServiceTest.kt b/core/src/test/kotlin/pl/jclab/refio/core/services/ChatServiceTest.kt
index 40c795e0..d151eceb 100644
--- a/core/src/test/kotlin/pl/jclab/refio/core/services/ChatServiceTest.kt
+++ b/core/src/test/kotlin/pl/jclab/refio/core/services/ChatServiceTest.kt
@@ -84,7 +84,6 @@ class ChatServiceTest {
             toolDescriptionBuilder = toolDescriptionBuilder,
             contextService = null,
             projectRoot = null,
-            ideProject = null  // CLI mode — no IntelliJ dependency
         )
     }
 
@@ -208,8 +207,8 @@ class ChatServiceTest {
     inner class StandaloneCompatibilityTests {
 
         @Test
-        fun `should work with ideProject null`() = runTest {
-            // ChatService constructed with ideProject = null (standalone mode)
+        fun `should work in standalone CLI mode`() = runTest {
+            // ChatService constructed without IntelliJ Project (standalone CLI mode)
             val service = ChatService(
                 taskRepository = taskRepository,
                 chatMessageRepository = chatMessageRepository,
@@ -219,7 +218,6 @@ class ChatServiceTest {
                 toolDescriptionBuilder = toolDescriptionBuilder,
                 contextService = null,
                 projectRoot = null,
-                ideProject = null
             )
 
             val request = ChatRequest(
diff --git a/core/src/test/kotlin/pl/jclab/refio/core/services/ConfigServiceTest.kt b/core/src/test/kotlin/pl/jclab/refio/core/services/ConfigServiceTest.kt
index 6502146e..0dc4f544 100644
--- a/core/src/test/kotlin/pl/jclab/refio/core/services/ConfigServiceTest.kt
+++ b/core/src/test/kotlin/pl/jclab/refio/core/services/ConfigServiceTest.kt
@@ -1,5 +1,7 @@
 package pl.jclab.refio.core.services
 
+import pl.jclab.refio.core.config.ConfigKeys
+
 import io.mockk.every
 import io.mockk.mockk
 import org.junit.jupiter.api.AfterEach
@@ -164,8 +166,8 @@ class ConfigServiceTest {
         @Test
         fun `ORCHESTRATION_ENABLED should return false when explicitly disabled`() {
             every {
-                configRepository.getWithPrecedence(ConfigService.KEY_ORCHESTRATION_ENABLED, any(), any())
-            } returns createConfig(ConfigService.KEY_ORCHESTRATION_ENABLED, "false")
+                configRepository.getWithPrecedence(ConfigKeys.ORCHESTRATION_ENABLED.key, any(), any())
+            } returns createConfig(ConfigKeys.ORCHESTRATION_ENABLED.key, "false")
             val result: Boolean = configService.getTyped(pl.jclab.refio.core.config.ConfigKeys.ORCHESTRATION_ENABLED)
             assertEquals(false, result)
         }
@@ -185,14 +187,14 @@ class ConfigServiceTest {
         fun `MAX_CONSECUTIVE_TOOL_ERRORS should return default when no config`() {
             every { configRepository.getWithPrecedence(any(), any(), any()) } returns null
             val result: Int = configService.getTyped(pl.jclab.refio.core.config.ConfigKeys.MAX_CONSECUTIVE_TOOL_ERRORS)
-            assertEquals(ConfigService.DEFAULT_MAX_CONSECUTIVE_TOOL_ERRORS, result)
+            assertEquals(ConfigKeys.MAX_CONSECUTIVE_TOOL_ERRORS.default, result)
         }
 
         @Test
         fun `MAX_CONSECUTIVE_TOOL_ERRORS should parse integer from config`() {
             every {
-                configRepository.getWithPrecedence(ConfigService.KEY_MAX_CONSECUTIVE_TOOL_ERRORS, any(), any())
-            } returns createConfig(ConfigService.KEY_MAX_CONSECUTIVE_TOOL_ERRORS, "7")
+                configRepository.getWithPrecedence(ConfigKeys.MAX_CONSECUTIVE_TOOL_ERRORS.key, any(), any())
+            } returns createConfig(ConfigKeys.MAX_CONSECUTIVE_TOOL_ERRORS.key, "7")
             val result: Int = configService.getTyped(pl.jclab.refio.core.config.ConfigKeys.MAX_CONSECUTIVE_TOOL_ERRORS)
             assertEquals(7, result)
         }
@@ -200,17 +202,17 @@ class ConfigServiceTest {
         @Test
         fun `MAX_CONSECUTIVE_TOOL_ERRORS should use default for invalid values`() {
             every {
-                configRepository.getWithPrecedence(ConfigService.KEY_MAX_CONSECUTIVE_TOOL_ERRORS, any(), any())
-            } returns createConfig(ConfigService.KEY_MAX_CONSECUTIVE_TOOL_ERRORS, "not-a-number")
+                configRepository.getWithPrecedence(ConfigKeys.MAX_CONSECUTIVE_TOOL_ERRORS.key, any(), any())
+            } returns createConfig(ConfigKeys.MAX_CONSECUTIVE_TOOL_ERRORS.key, "not-a-number")
             val result: Int = configService.getTyped(pl.jclab.refio.core.config.ConfigKeys.MAX_CONSECUTIVE_TOOL_ERRORS)
-            assertEquals(ConfigService.DEFAULT_MAX_CONSECUTIVE_TOOL_ERRORS, result)
+            assertEquals(ConfigKeys.MAX_CONSECUTIVE_TOOL_ERRORS.default, result)
         }
 
         @Test
         fun `MAX_CONSECUTIVE_TOOL_ERRORS should parse zero from config`() {
             every {
-                configRepository.getWithPrecedence(ConfigService.KEY_MAX_CONSECUTIVE_TOOL_ERRORS, any(), any())
-            } returns createConfig(ConfigService.KEY_MAX_CONSECUTIVE_TOOL_ERRORS, "0")
+                configRepository.getWithPrecedence(ConfigKeys.MAX_CONSECUTIVE_TOOL_ERRORS.key, any(), any())
+            } returns createConfig(ConfigKeys.MAX_CONSECUTIVE_TOOL_ERRORS.key, "0")
             val result: Int = configService.getTyped(pl.jclab.refio.core.config.ConfigKeys.MAX_CONSECUTIVE_TOOL_ERRORS)
             assertEquals(0, result)
         }
@@ -219,7 +221,7 @@ class ConfigServiceTest {
         fun `MAX_ITERATIONS should return default when no config`() {
             every { configRepository.getWithPrecedence(any(), any(), any()) } returns null
             val result: Int = configService.getTyped(pl.jclab.refio.core.config.ConfigKeys.MAX_ITERATIONS)
-            assertEquals(ConfigService.DEFAULT_MAX_ITERATIONS, result)
+            assertEquals(ConfigKeys.MAX_ITERATIONS.default, result)
         }
     }
 
@@ -230,8 +232,8 @@ class ConfigServiceTest {
         fun `shouldVerifyTask should respect explicit true setting`() {
             // Given
             every {
-                configRepository.getWithPrecedence(ConfigService.KEY_TASK_VERIFICATION_ENABLED, any(), any())
-            } returns createConfig(ConfigService.KEY_TASK_VERIFICATION_ENABLED, "true")
+                configRepository.getWithPrecedence(ConfigKeys.TASK_VERIFICATION_ENABLED.key, any(), any())
+            } returns createConfig(ConfigKeys.TASK_VERIFICATION_ENABLED.key, "true")
 
             // When
             val result = configService.shouldVerifyTask(iterationCount = 1)
@@ -244,8 +246,8 @@ class ConfigServiceTest {
         fun `shouldVerifyTask should respect explicit false setting`() {
             // Given
             every {
-                configRepository.getWithPrecedence(ConfigService.KEY_TASK_VERIFICATION_ENABLED, any(), any())
-            } returns createConfig(ConfigService.KEY_TASK_VERIFICATION_ENABLED, "false")
+                configRepository.getWithPrecedence(ConfigKeys.TASK_VERIFICATION_ENABLED.key, any(), any())
+            } returns createConfig(ConfigKeys.TASK_VERIFICATION_ENABLED.key, "false")
 
             // When
             val result = configService.shouldVerifyTask(iterationCount = 100)
@@ -341,8 +343,8 @@ class ConfigServiceTest {
             // Given
             val json = """{"modelId":"gpt-4.1","provider":"openai"}"""
             every {
-                configRepository.getWithPrecedence(ConfigService.KEY_DEFAULT_MODEL_CHAT, any(), any())
-            } returns createConfig(ConfigService.KEY_DEFAULT_MODEL_CHAT, json)
+                configRepository.getWithPrecedence(ConfigKeys.DEFAULT_MODEL_CHAT.key, any(), any())
+            } returns createConfig(ConfigKeys.DEFAULT_MODEL_CHAT.key, json)
 
             // When
             val (model, provider) = configService.getDefaultModel(ModelOperation.DEFAULT)
@@ -358,11 +360,11 @@ class ConfigServiceTest {
             val defaultJson = """{"modelId":"gpt-4.1","provider":"openai"}"""
             val inheritJson = """{"modelId":"inherit","provider":"inherit"}"""
             every {
-                configRepository.getWithPrecedence(ConfigService.KEY_WEAK_MODEL, any(), any())
-            } returns createConfig(ConfigService.KEY_WEAK_MODEL, inheritJson)
+                configRepository.getWithPrecedence(ConfigKeys.WEAK_MODEL.key, any(), any())
+            } returns createConfig(ConfigKeys.WEAK_MODEL.key, inheritJson)
             every {
-                configRepository.getWithPrecedence(ConfigService.KEY_DEFAULT_MODEL_CHAT, any(), any())
-            } returns createConfig(ConfigService.KEY_DEFAULT_MODEL_CHAT, defaultJson)
+                configRepository.getWithPrecedence(ConfigKeys.DEFAULT_MODEL_CHAT.key, any(), any())
+            } returns createConfig(ConfigKeys.DEFAULT_MODEL_CHAT.key, defaultJson)
 
             // When
             val (model, provider) = configService.getDefaultModel(ModelOperation.WEAK)
@@ -378,11 +380,11 @@ class ConfigServiceTest {
             val defaultJson = """{"modelId":"gpt-4.1","provider":"openai"}"""
             val inheritJson = """{"modelId":"inherit","provider":"inherit"}"""
             every {
-                configRepository.getWithPrecedence(ConfigService.KEY_STRONG_MODEL, any(), any())
-            } returns createConfig(ConfigService.KEY_STRONG_MODEL, inheritJson)
+                configRepository.getWithPrecedence(ConfigKeys.STRONG_MODEL.key, any(), any())
+            } returns createConfig(ConfigKeys.STRONG_MODEL.key, inheritJson)
             every {
-                configRepository.getWithPrecedence(ConfigService.KEY_DEFAULT_MODEL_CHAT, any(), any())
-            } returns createConfig(ConfigService.KEY_DEFAULT_MODEL_CHAT, defaultJson)
+                configRepository.getWithPrecedence(ConfigKeys.DEFAULT_MODEL_CHAT.key, any(), any())
+            } returns createConfig(ConfigKeys.DEFAULT_MODEL_CHAT.key, defaultJson)
 
             // When
             val result = configService.getStrongModel()
@@ -398,7 +400,7 @@ class ConfigServiceTest {
         @Test
         fun `getBuiltinSubagentEnabledOverrides should return empty map when no config`() {
             // Given
-            every { configRepository.get(ConfigService.KEY_SUBAGENTS_BUILTIN_ENABLED, ConfigScope.APP) } returns null
+            every { configRepository.get(ConfigKeys.SUBAGENTS_BUILTIN_ENABLED.key, ConfigScope.APP) } returns null
 
             // When
             val overrides = configService.getBuiltinSubagentEnabledOverrides()
@@ -412,8 +414,8 @@ class ConfigServiceTest {
             // Given
             val json = """{"security-reviewer":true,"code-analyzer":false}"""
             every {
-                configRepository.get(ConfigService.KEY_SUBAGENTS_BUILTIN_ENABLED, ConfigScope.APP)
-            } returns createConfig(ConfigService.KEY_SUBAGENTS_BUILTIN_ENABLED, json)
+                configRepository.get(ConfigKeys.SUBAGENTS_BUILTIN_ENABLED.key, ConfigScope.APP)
+            } returns createConfig(ConfigKeys.SUBAGENTS_BUILTIN_ENABLED.key, json)
 
             // When
             val overrides = configService.getBuiltinSubagentEnabledOverrides()
diff --git a/core/src/test/kotlin/pl/jclab/refio/core/services/ContextServiceTest.kt b/core/src/test/kotlin/pl/jclab/refio/core/services/ContextServiceTest.kt
index 9738e18d..ac4989a4 100644
--- a/core/src/test/kotlin/pl/jclab/refio/core/services/ContextServiceTest.kt
+++ b/core/src/test/kotlin/pl/jclab/refio/core/services/ContextServiceTest.kt
@@ -38,7 +38,6 @@ class ContextServiceTest {
     private lateinit var subtaskRepository: SubtaskRepository
     private lateinit var fileAnalyzerService: FileAnalyzerService
     private lateinit var configService: ConfigService
-    private lateinit var ragSearchService: RagSearchService
 
     private lateinit var service: ContextService
 
@@ -55,8 +54,6 @@ class ContextServiceTest {
         every { configService.getTyped(any<ConfigKey<Any>>(), any()) } answers { firstArg<ConfigKey<Any>>().default }
         every { configService.getContextBudget(any(), any()) } returns
             pl.jclab.refio.core.services.context.ContextBudget.forContextSize(32000)
-        ragSearchService = mockk()
-
         projectRoot = Files.createTempDirectory("refio-test")
 
         mockkStatic("org.jetbrains.exposed.sql.transactions.ThreadLocalTransactionManagerKt")
@@ -78,8 +75,7 @@ class ContextServiceTest {
             chatMessageRepository = chatMessageRepository,
             subtaskRepository = subtaskRepository,
             fileAnalyzerService = fileAnalyzerService,
-            configService = configService,
-            ragSearchService = null
+            configService = configService
         )
     }
 
@@ -260,21 +256,6 @@ class ContextServiceTest {
         }
     }
 
-    @Nested
-    inner class UpdateRagSearchConfigTests {
-
-        @Test
-        fun `should update RAG search configuration`() {
-            val newService = mockk<RagSearchService>()
-            service.updateRagSearchConfig(newService, "model-123", "openai")
-        }
-
-        @Test
-        fun `should accept null values`() {
-            service.updateRagSearchConfig(null, null, null)
-        }
-    }
-
     // ---- New Contract Tests ----
 
     @Nested
@@ -378,7 +359,6 @@ class ContextServiceTest {
                 taskId = "task-1"
             )
             assertNotNull(result)
-            assertTrue(result.ragFragments.isEmpty())
         }
 
         @Test
@@ -461,7 +441,6 @@ class ContextServiceTest {
         @Test
         fun `should handle empty optional fields`() {
             val context = createMinimalProjectContextDTO().copy(
-                ragFragments = emptyList(),
                 userContextRefs = emptyList(),
                 mcpResources = emptyList(),
                 conversationHistory = emptyList(),
@@ -689,7 +668,7 @@ class ContextServiceTest {
             val prompt = """
                 <PROJECT_CONTEXT>Project overview</PROJECT_CONTEXT>
                 <CURRENT_TASK>Task info</CURRENT_TASK>
-                <RAG_FRAGMENTS>Some code</RAG_FRAGMENTS>
+                <USER_PROVIDED_CONTEXT>Some code</USER_PROVIDED_CONTEXT>
             """.trimIndent()
 
             val result = service.calculateContextSectionTokens(dummyContext, prompt)
diff --git a/core/src/test/kotlin/pl/jclab/refio/core/services/PlanningServiceTest.kt b/core/src/test/kotlin/pl/jclab/refio/core/services/PlanningServiceTest.kt
index 4a05b7ef..b278c6d3 100644
--- a/core/src/test/kotlin/pl/jclab/refio/core/services/PlanningServiceTest.kt
+++ b/core/src/test/kotlin/pl/jclab/refio/core/services/PlanningServiceTest.kt
@@ -96,8 +96,8 @@ class PlanningServiceTest {
         }
         every { configService.getModel(any(), any()) } returns Pair("test-model", "test")
         every { configService.getTyped(ConfigKeys.MAX_OUTPUT_SIZE, any<String>()) } returns 4096
-        every { configService.get(ConfigService.KEY_UI_THINKING_ENABLED) } returns "false"
-        every { configService.get(ConfigService.KEY_UI_NO_EGRESS_ENABLED) } returns "false"
+        every { configService.get(ConfigKeys.UI_THINKING_ENABLED.key) } returns "false"
+        every { configService.get(ConfigKeys.UI_NO_EGRESS_ENABLED.key) } returns "false"
         every { promptsService.getSystemPrompt(any(), any()) } returns "You are a planning assistant."
 
         every { toolDescriptionBuilder.getToolDescriptions(any(), any()) } returns "read_file, file_search, grep_search"
@@ -154,7 +154,6 @@ class PlanningServiceTest {
             toolPermissionsService = null,
             contextService = null,
             projectRoot = null,
-            ideProject = null
         )
     }
 
diff --git a/core/src/test/kotlin/pl/jclab/refio/core/services/ProjectAnalyzerServiceIntegrationTest.kt b/core/src/test/kotlin/pl/jclab/refio/core/services/ProjectAnalyzerServiceIntegrationTest.kt
index 4ca4be18..4519081e 100644
--- a/core/src/test/kotlin/pl/jclab/refio/core/services/ProjectAnalyzerServiceIntegrationTest.kt
+++ b/core/src/test/kotlin/pl/jclab/refio/core/services/ProjectAnalyzerServiceIntegrationTest.kt
@@ -1,5 +1,7 @@
 package pl.jclab.refio.core.services
 
+import pl.jclab.refio.core.config.ConfigKeys
+
 import io.mockk.*
 import kotlinx.coroutines.runBlocking
 import org.junit.jupiter.api.BeforeEach
@@ -28,7 +30,7 @@ class ProjectAnalyzerServiceIntegrationTest {
         richAnalysisEngine = mockk()
         configService = mockk()
         every { configService.getTyped(any<ConfigKey<Any>>(), any()) } answers { firstArg<ConfigKey<Any>>().default }
-        every { configService.getTyped(pl.jclab.refio.core.config.ConfigKeys.RAG_IGNORED_DIRECTORIES) } returns ConfigService.DEFAULT_RAG_IGNORED_DIRECTORIES
+        every { configService.getTyped(pl.jclab.refio.core.config.ConfigKeys.RAG_IGNORED_DIRECTORIES) } returns ConfigKeys.RAG_IGNORED_DIRECTORIES.default
 
         // Prepare test project structure
         testProjectRoot = tempDir.resolve("test-project")
diff --git a/core/src/test/kotlin/pl/jclab/refio/core/services/context/ContextBudgetTest.kt b/core/src/test/kotlin/pl/jclab/refio/core/services/context/ContextBudgetTest.kt
index fe2013b0..bf4ab506 100644
--- a/core/src/test/kotlin/pl/jclab/refio/core/services/context/ContextBudgetTest.kt
+++ b/core/src/test/kotlin/pl/jclab/refio/core/services/context/ContextBudgetTest.kt
@@ -13,7 +13,6 @@ class ContextBudgetTest {
             sectionBudgets = mapOf(
                 ContextSection.PROJECT_CONTEXT to 30,
                 ContextSection.USER_CONTEXT to 20,
-                ContextSection.RAG_FRAGMENTS to 20,
                 ContextSection.CONVERSATION to 10,
                 ContextSection.WORKING_MEMORY to 10
             )
@@ -64,8 +63,7 @@ class ContextBudgetTest {
                 ContextSection.RECENT_WORK to 100,
                 ContextSection.WORKING_MEMORY to 100,
                 ContextSection.CONVERSATION to 100,
-                ContextSection.USER_CONTEXT to 100,
-                ContextSection.RAG_FRAGMENTS to 100
+                ContextSection.USER_CONTEXT to 100
             )
         )
 
diff --git a/core/src/test/kotlin/pl/jclab/refio/core/services/context/ContextLayerTest.kt b/core/src/test/kotlin/pl/jclab/refio/core/services/context/ContextLayerTest.kt
index 680ac92b..e118b435 100644
--- a/core/src/test/kotlin/pl/jclab/refio/core/services/context/ContextLayerTest.kt
+++ b/core/src/test/kotlin/pl/jclab/refio/core/services/context/ContextLayerTest.kt
@@ -26,9 +26,8 @@ class ContextLayerTest {
     }
 
     @Test
-    fun `ContextSection EPHEMERAL layer includes user context and RAG`() {
+    fun `ContextSection EPHEMERAL layer includes user context`() {
         assertEquals(ContextLayer.EPHEMERAL, ContextSection.USER_CONTEXT.contextLayer)
-        assertEquals(ContextLayer.EPHEMERAL, ContextSection.RAG_FRAGMENTS.contextLayer)
         assertEquals(ContextLayer.EPHEMERAL, ContextSection.CONVERSATION.contextLayer)
     }
 
diff --git a/core/src/test/kotlin/pl/jclab/refio/core/services/context/RagContextLoaderTest.kt b/core/src/test/kotlin/pl/jclab/refio/core/services/context/RagContextLoaderTest.kt
deleted file mode 100644
index 72f38870..00000000
--- a/core/src/test/kotlin/pl/jclab/refio/core/services/context/RagContextLoaderTest.kt
+++ /dev/null
@@ -1,71 +0,0 @@
-package pl.jclab.refio.core.services.context
-
-import io.mockk.mockk
-import org.junit.jupiter.api.Test
-import pl.jclab.refio.core.services.ConfigService
-import kotlin.test.assertFalse
-import kotlin.test.assertTrue
-
-/**
- * Verifies the RAG skip filter, in particular the new system-phrase short-circuit.
- *
- * Background: when the agent loop fails (empty content / format retry exhausted) the harness
- * resends the conversation with a synthetic "Continue from where you left off" user message.
- * That phrase carries no task-specific signal, but RagSearchService spent 7-23 seconds per
- * iteration generating embeddings for it. Skip it explicitly.
- */
-class RagContextLoaderTest {
-
-    private fun loader(): RagContextLoader =
-        RagContextLoader(configService = mockk<ConfigService>(relaxed = true))
-
-    @Test
-    fun `null and blank queries are skipped`() {
-        val l = loader()
-        assertTrue(l.shouldSkipRag(null))
-        assertTrue(l.shouldSkipRag(""))
-        assertTrue(l.shouldSkipRag("   "))
-    }
-
-    @Test
-    fun `system harness continue phrase is skipped`() {
-        val l = loader()
-        assertTrue(l.shouldSkipRag("Continue from where you left off"))
-        assertTrue(l.shouldSkipRag("Continue from where you left off."))
-        assertTrue(l.shouldSkipRag("continue from where you left off..."))
-        // Surrounding whitespace should not defeat the match.
-        assertTrue(l.shouldSkipRag("   Continue from where you left off  "))
-    }
-
-    @Test
-    fun `polish continue variants are skipped`() {
-        val l = loader()
-        assertTrue(l.shouldSkipRag("Kontynuuj zadanie"))
-        assertTrue(l.shouldSkipRag("Kontynuuj od miejsca w którym skończyłeś"))
-    }
-
-    @Test
-    fun `meta and structure questions are still skipped`() {
-        val l = loader()
-        assertTrue(l.shouldSkipRag("opisz projekt"))
-        assertTrue(l.shouldSkipRag("what is this project about"))
-        assertTrue(l.shouldSkipRag("describe the project structure"))
-    }
-
-    @Test
-    fun `code-mentioning queries are not skipped even when short`() {
-        val l = loader()
-        assertFalse(l.shouldSkipRag("show me file foo"))
-        assertFalse(l.shouldSkipRag("find function bar"))
-    }
-
-    @Test
-    fun `substantive task descriptions are not skipped`() {
-        val l = loader()
-        assertFalse(
-            l.shouldSkipRag(
-                "Refactor AgentTurnLoop empty-content branch to recover JSON from thinking field"
-            )
-        )
-    }
-}
diff --git a/core/src/test/kotlin/pl/jclab/refio/core/services/turn/ToolCallParserTest.kt b/core/src/test/kotlin/pl/jclab/refio/core/services/turn/ToolCallParserTest.kt
index 38a360ba..59c0b2bb 100644
--- a/core/src/test/kotlin/pl/jclab/refio/core/services/turn/ToolCallParserTest.kt
+++ b/core/src/test/kotlin/pl/jclab/refio/core/services/turn/ToolCallParserTest.kt
@@ -55,6 +55,49 @@ class ToolCallParserTest {
         assertTrue(content.contains("Final line"))
     }
 
+    @Test
+    fun `should recover fenced create new file tool call from malformed envelope with long markdown content`() {
+        val malformed = """
+            ```json
+            {
+              "thinking": "Preparing file",
+              "intent": "implementation",
+              "response": "Creating file",
+              "actions": [
+                {"tool": "create_new_file", "arguments": {"path": "0001-idsx.md", "content": "# Title
+
+            ## Example
+
+            ```javascript
+            const message = "Hello";
+            ```
+
+            Final line
+            "}}
+              ]
+            }
+            ```
+        """.trimIndent()
+
+        val inspection = parser.inspectJsonEnvelope(malformed)
+        assertTrue(inspection.hasJsonEnvelope)
+        assertTrue(inspection.isComplete)
+        assertTrue(inspection.isFenced)
+
+        val toolCalls = parser.extractToolCalls(malformed, TaskMode.AGENT)
+
+        assertEquals(1, toolCalls.size)
+        assertEquals("create_new_file", toolCalls.first().name)
+
+        val arguments = Json.parseToJsonElement(toolCalls.first().arguments).jsonObject
+        assertEquals("0001-idsx.md", arguments["path"]?.jsonPrimitive?.content)
+
+        val content = arguments["content"]?.jsonPrimitive?.content
+        assertNotNull(content)
+        assertTrue(content.contains("""const message = "Hello";"""))
+        assertTrue(content.contains("Final line"))
+    }
+
     @Test
     fun `should recover generic tool calls with malformed quoted strings in arguments`() {
         val malformed = """
@@ -85,4 +128,61 @@ class ToolCallParserTest {
         assertEquals("""before "quoted" value""", arguments["old_string"]?.jsonPrimitive?.content)
         assertEquals("after line 1\nafter line 2", arguments["new_string"]?.jsonPrimitive?.content)
     }
+
+    @Test
+    fun `should detect complete fenced json envelope`() {
+        val content = """
+            ```json
+            {
+              "actions": [],
+              "response": "Done",
+              "intent": "implementation"
+            }
+            ```
+        """.trimIndent()
+
+        val inspection = parser.inspectJsonEnvelope(content)
+
+        assertTrue(inspection.hasJsonEnvelope)
+        assertTrue(inspection.isComplete)
+        assertTrue(inspection.isFenced)
+        assertEquals("Done", parser.extractTextResponse(content))
+    }
+
+    @Test
+    fun `should detect incomplete fenced json envelope`() {
+        val content = """
+            ```json
+            {
+              "actions": [
+                {"tool": "http_request", "args": {"url": "https://example.com"}}
+              ],
+              "response": "Working",
+              "intent": "implementation"
+        """.trimIndent()
+
+        val inspection = parser.inspectJsonEnvelope(content)
+
+        assertTrue(inspection.hasJsonEnvelope)
+        assertFalse(inspection.isComplete)
+        assertTrue(inspection.isFenced)
+        val toolCalls = parser.extractToolCalls(content, TaskMode.AGENT)
+        assertEquals(1, toolCalls.size)
+        assertEquals("http_request", toolCalls.first().name)
+    }
+
+    @Test
+    fun `should detect incomplete raw json envelope`() {
+        val content = """{"actions":[{"tool":"read_file","args":{"path":"a.txt"}}],"response":"Working""""
+            .dropLast(1)
+
+        val inspection = parser.inspectJsonEnvelope(content)
+
+        assertTrue(inspection.hasJsonEnvelope)
+        assertFalse(inspection.isComplete)
+        assertFalse(inspection.isFenced)
+        val toolCalls = parser.extractToolCalls(content, TaskMode.AGENT)
+        assertEquals(1, toolCalls.size)
+        assertEquals("read_file", toolCalls.first().name)
+    }
 }
diff --git a/core/src/test/kotlin/pl/jclab/refio/core/services/turn/TurnLLMCallerTest.kt b/core/src/test/kotlin/pl/jclab/refio/core/services/turn/TurnLLMCallerTest.kt
index 6fc90a23..25ff8091 100644
--- a/core/src/test/kotlin/pl/jclab/refio/core/services/turn/TurnLLMCallerTest.kt
+++ b/core/src/test/kotlin/pl/jclab/refio/core/services/turn/TurnLLMCallerTest.kt
@@ -77,7 +77,7 @@ class TurnLLMCallerTest {
             caller.callLLM(
                 taskId = "task-1",
                 mode = TaskMode.AGENT,
-                prompt = LLMCallPrompt(
+                prompt = TurnPrompt(
                     systemPrompt = "system",
                     messages = listOf(LLMMessage(role = "user", content = "hello"))
                 )
diff --git a/intellij-plugin/src/test/kotlin/pl/jclab/refio/services/session/IncrementalToolCallStreamFilterTest.kt b/core/src/test/kotlin/pl/jclab/refio/core/session/IncrementalToolCallStreamFilterTest.kt
similarity index 97%
rename from intellij-plugin/src/test/kotlin/pl/jclab/refio/services/session/IncrementalToolCallStreamFilterTest.kt
rename to core/src/test/kotlin/pl/jclab/refio/core/session/IncrementalToolCallStreamFilterTest.kt
index ae7f7f44..c90f7bb0 100644
--- a/intellij-plugin/src/test/kotlin/pl/jclab/refio/services/session/IncrementalToolCallStreamFilterTest.kt
+++ b/core/src/test/kotlin/pl/jclab/refio/core/session/IncrementalToolCallStreamFilterTest.kt
@@ -1,4 +1,4 @@
-package pl.jclab.refio.services.session
+package pl.jclab.refio.core.session
 
 import kotlin.test.Test
 import kotlin.test.assertEquals
diff --git a/intellij-plugin/src/test/kotlin/pl/jclab/refio/services/session/SessionLifecycleServiceTest.kt b/core/src/test/kotlin/pl/jclab/refio/core/session/SessionLifecycleServiceTest.kt
similarity index 87%
rename from intellij-plugin/src/test/kotlin/pl/jclab/refio/services/session/SessionLifecycleServiceTest.kt
rename to core/src/test/kotlin/pl/jclab/refio/core/session/SessionLifecycleServiceTest.kt
index b47743c5..bda79826 100644
--- a/intellij-plugin/src/test/kotlin/pl/jclab/refio/services/session/SessionLifecycleServiceTest.kt
+++ b/core/src/test/kotlin/pl/jclab/refio/core/session/SessionLifecycleServiceTest.kt
@@ -1,6 +1,6 @@
-package pl.jclab.refio.services.session
+package pl.jclab.refio.core.session
 
-import com.intellij.openapi.project.Project
+import pl.jclab.refio.core.session.SessionStateManager
 import io.mockk.coEvery
 import io.mockk.every
 import io.mockk.mockk
@@ -10,7 +10,6 @@ import kotlinx.coroutines.SupervisorJob
 import kotlinx.coroutines.runBlocking
 import kotlinx.coroutines.sync.Mutex
 import org.junit.jupiter.api.Test
-import pl.jclab.refio.api.CoreApiClient
 import pl.jclab.refio.api.models.ExecutionMode
 import pl.jclab.refio.api.models.Session
 import pl.jclab.refio.api.models.TaskMode
@@ -26,9 +25,7 @@ class SessionLifecycleServiceTest {
 
     @Test
     fun `switchMode updates session mode`() = runBlocking {
-        val project = mockk<Project>(relaxed = true)
         val projectRouter = mockk<CoreApiRouter>()
-        val coreApiClient = mockk<CoreApiClient>(relaxed = true)
         val configService = mockk<ConfigService>(relaxed = true)
         val stateManager = SessionStateManager()
         val mutex = Mutex()
@@ -58,9 +55,7 @@ class SessionLifecycleServiceTest {
         )
 
         val service = SessionLifecycleService(
-            project = project,
             projectRouter = projectRouter,
-            coreApiClient = coreApiClient,
             configService = configService,
             stateManager = stateManager,
             modeSwitchMutex = mutex,
@@ -76,9 +71,7 @@ class SessionLifecycleServiceTest {
 
     @Test
     fun `getAvailableModels returns only visible models`() = runBlocking {
-        val project = mockk<Project>(relaxed = true)
         val projectRouter = mockk<CoreApiRouter>()
-        val coreApiClient = mockk<CoreApiClient>(relaxed = true)
         val configService = mockk<ConfigService>(relaxed = true)
         val stateManager = SessionStateManager()
         val mutex = Mutex()
@@ -90,9 +83,7 @@ class SessionLifecycleServiceTest {
         )
 
         val service = SessionLifecycleService(
-            project = project,
             projectRouter = projectRouter,
-            coreApiClient = coreApiClient,
             configService = configService,
             stateManager = stateManager,
             modeSwitchMutex = mutex,
@@ -108,9 +99,7 @@ class SessionLifecycleServiceTest {
 
     @Test
     fun `getAvailableModels falls back to all models when none are visible`() = runBlocking {
-        val project = mockk<Project>(relaxed = true)
         val projectRouter = mockk<CoreApiRouter>()
-        val coreApiClient = mockk<CoreApiClient>(relaxed = true)
         val configService = mockk<ConfigService>(relaxed = true)
         val stateManager = SessionStateManager()
         val mutex = Mutex()
@@ -122,9 +111,7 @@ class SessionLifecycleServiceTest {
         )
 
         val service = SessionLifecycleService(
-            project = project,
             projectRouter = projectRouter,
-            coreApiClient = coreApiClient,
             configService = configService,
             stateManager = stateManager,
             modeSwitchMutex = mutex,
diff --git a/core/src/test/kotlin/pl/jclab/refio/core/session/SessionStateManagerTest.kt b/core/src/test/kotlin/pl/jclab/refio/core/session/SessionStateManagerTest.kt
new file mode 100644
index 00000000..d04710f2
--- /dev/null
+++ b/core/src/test/kotlin/pl/jclab/refio/core/session/SessionStateManagerTest.kt
@@ -0,0 +1,98 @@
+package pl.jclab.refio.core.session
+
+import kotlinx.coroutines.runBlocking
+import org.junit.jupiter.api.Test
+import pl.jclab.refio.api.models.ExecutionMode
+import pl.jclab.refio.api.models.Message
+import pl.jclab.refio.api.models.Session
+import pl.jclab.refio.api.models.TaskMode
+import pl.jclab.refio.api.models.TaskStatus
+import kotlin.test.assertEquals
+
+class SessionStateManagerTest {
+
+    @Test
+    fun `setActiveSession updates state`() {
+        val manager = SessionStateManager()
+        val session = Session(
+            id = "session-1",
+            name = "Test",
+            mode = TaskMode.CHAT,
+            status = TaskStatus.PENDING,
+            createdAt = 1L,
+            updatedAt = 1L,
+            executionMode = ExecutionMode.INTERACTIVE
+        )
+
+        manager.setActiveSession(session)
+
+        assertEquals(session, manager.activeSession.value)
+    }
+
+    @Test
+    fun `appendMessage adds message`() = runBlocking {
+        val manager = SessionStateManager()
+        val message = Message(
+            id = "msg-1",
+            taskId = "session-1",
+            role = "system",
+            content = "hello",
+            createdAt = 1L
+        )
+
+        manager.appendMessage(message)
+
+        assertEquals(1, manager.messages.value.size)
+        assertEquals(message, manager.messages.value.first())
+    }
+
+    @Test
+    fun `scenario - set session then append multiple messages preserves order`() = runBlocking {
+        val manager = SessionStateManager()
+        val session = Session(
+            id = "session-1",
+            name = "Test",
+            mode = TaskMode.CHAT,
+            status = TaskStatus.PENDING,
+            createdAt = 1L,
+            updatedAt = 1L,
+            executionMode = ExecutionMode.INTERACTIVE
+        )
+        manager.setActiveSession(session)
+
+        val messages = (1..5).map {
+            Message(
+                id = "msg-$it",
+                taskId = "session-1",
+                role = if (it % 2 == 0) "assistant" else "user",
+                content = "message $it",
+                createdAt = it.toLong()
+            )
+        }
+        messages.forEach { manager.appendMessage(it) }
+
+        assertEquals(session, manager.activeSession.value)
+        assertEquals(5, manager.messages.value.size)
+        assertEquals(messages.map { it.id }, manager.messages.value.map { it.id })
+    }
+
+    @Test
+    fun `setActiveSession followed by different session updates to new one`() {
+        val manager = SessionStateManager()
+        val first = Session(
+            id = "session-1", name = "First", mode = TaskMode.CHAT, status = TaskStatus.PENDING,
+            createdAt = 1L, updatedAt = 1L, executionMode = ExecutionMode.INTERACTIVE
+        )
+        val second = Session(
+            id = "session-2", name = "Second", mode = TaskMode.PLAN, status = TaskStatus.RUNNING,
+            createdAt = 2L, updatedAt = 2L, executionMode = ExecutionMode.INTERACTIVE
+        )
+
+        manager.setActiveSession(first)
+        assertEquals("session-1", manager.activeSession.value?.id)
+
+        manager.setActiveSession(second)
+        assertEquals("session-2", manager.activeSession.value?.id)
+        assertEquals(TaskMode.PLAN, manager.activeSession.value?.mode)
+    }
+}
diff --git a/intellij-plugin/src/test/kotlin/pl/jclab/refio/services/session/ToolMessageDisplayResolverTest.kt b/core/src/test/kotlin/pl/jclab/refio/core/session/ToolMessageDisplayResolverTest.kt
similarity index 96%
rename from intellij-plugin/src/test/kotlin/pl/jclab/refio/services/session/ToolMessageDisplayResolverTest.kt
rename to core/src/test/kotlin/pl/jclab/refio/core/session/ToolMessageDisplayResolverTest.kt
index d183f2ab..99662ae3 100644
--- a/intellij-plugin/src/test/kotlin/pl/jclab/refio/services/session/ToolMessageDisplayResolverTest.kt
+++ b/core/src/test/kotlin/pl/jclab/refio/core/session/ToolMessageDisplayResolverTest.kt
@@ -1,4 +1,4 @@
-package pl.jclab.refio.services.session
+package pl.jclab.refio.core.session
 
 import kotlin.test.Test
 import kotlin.test.assertEquals
diff --git a/core/src/test/kotlin/pl/jclab/refio/core/subagents/SubagentParserContextProfileTest.kt b/core/src/test/kotlin/pl/jclab/refio/core/subagents/SubagentParserContextProfileTest.kt
index a6b3fe45..b52052ce 100644
--- a/core/src/test/kotlin/pl/jclab/refio/core/subagents/SubagentParserContextProfileTest.kt
+++ b/core/src/test/kotlin/pl/jclab/refio/core/subagents/SubagentParserContextProfileTest.kt
@@ -24,7 +24,6 @@ class SubagentParserContextProfileTest {
               include_file_tree: false
               include_conversation: true
               include_working_memory: true
-              include_rag: false
               include_dependencies: true
               max_context_tokens: 8000
               include_parent_summary: true
@@ -38,7 +37,6 @@ class SubagentParserContextProfileTest {
         assertFalse(definition.contextProfile.includeFileTree)
         assertTrue(definition.contextProfile.includeConversation)
         assertTrue(definition.contextProfile.includeWorkingMemory)
-        assertFalse(definition.contextProfile.includeRag)
         assertTrue(definition.contextProfile.includeDependencies)
         assertEquals(8000, definition.contextProfile.maxContextTokens)
         assertTrue(definition.contextProfile.includeParentSummary)
@@ -60,7 +58,6 @@ class SubagentParserContextProfileTest {
         assertTrue(definition.contextProfile.includeFileTree)
         assertTrue(definition.contextProfile.includeConversation)
         assertTrue(definition.contextProfile.includeWorkingMemory)
-        assertTrue(definition.contextProfile.includeRag)
         assertTrue(definition.contextProfile.includeDependencies)
         assertEquals(null, definition.contextProfile.maxContextTokens)
         assertFalse(definition.contextProfile.includeParentSummary)
@@ -73,7 +70,6 @@ class SubagentParserContextProfileTest {
             name: partial-agent
             description: Partial profile agent
             context_profile:
-              include_rag: false
               include_parent_summary: true
             ---
 
@@ -83,7 +79,6 @@ class SubagentParserContextProfileTest {
         val definition = parser.parse(content, null, SubagentScope.PROJECT)
 
         // Specified fields
-        assertFalse(definition.contextProfile.includeRag)
         assertTrue(definition.contextProfile.includeParentSummary)
 
         // Default fields
diff --git a/core/src/test/kotlin/pl/jclab/refio/core/subagents/SubagentRouterTest.kt b/core/src/test/kotlin/pl/jclab/refio/core/subagents/SubagentRouterTest.kt
index 033d2f66..ed144376 100644
--- a/core/src/test/kotlin/pl/jclab/refio/core/subagents/SubagentRouterTest.kt
+++ b/core/src/test/kotlin/pl/jclab/refio/core/subagents/SubagentRouterTest.kt
@@ -50,7 +50,6 @@ class SubagentRouterTest {
             toolPermissionsService = toolPermissionsService,
             chatMessageRepository = chatMessageRepository,
             contextService = null,
-            ideProject = null,
             runTurnCallback = null
         )
     }
diff --git a/core/src/test/kotlin/pl/jclab/refio/core/tools/implementations/RunTerminalCommandToolTest.kt b/core/src/test/kotlin/pl/jclab/refio/core/tools/implementations/RunTerminalCommandToolTest.kt
index 2b5b7fd3..53d80688 100644
--- a/core/src/test/kotlin/pl/jclab/refio/core/tools/implementations/RunTerminalCommandToolTest.kt
+++ b/core/src/test/kotlin/pl/jclab/refio/core/tools/implementations/RunTerminalCommandToolTest.kt
@@ -1,24 +1,22 @@
 package pl.jclab.refio.core.tools.implementations
 
-import io.mockk.*
 import kotlinx.coroutines.runBlocking
 import org.junit.jupiter.api.BeforeEach
 import org.junit.jupiter.api.Nested
 import org.junit.jupiter.api.Test
 import org.junit.jupiter.api.io.TempDir
 import pl.jclab.refio.core.tools.PathSandbox
-import pl.jclab.refio.core.tools.base.ToolMode
 import pl.jclab.refio.core.tools.base.ToolCategory
-import pl.jclab.refio.core.tools.security.CommandDenylist
+import pl.jclab.refio.core.tools.base.ToolMode
 import pl.jclab.refio.core.tools.security.CommandLimits
-import pl.jclab.refio.core.tools.security.CommandWhitelist
-import pl.jclab.refio.core.tools.security.CommandWhitelistConfig
-import pl.jclab.refio.core.tools.security.WhitelistMode
+import pl.jclab.refio.core.tools.security.CommandRule
+import pl.jclab.refio.core.tools.security.CommandRuleMatcher
+import pl.jclab.refio.core.tools.security.RuleAction
 import java.nio.file.Path
 import kotlin.test.assertEquals
-import kotlin.test.assertTrue
 import kotlin.test.assertFalse
 import kotlin.test.assertNotNull
+import kotlin.test.assertTrue
 
 /**
  * Testy dla RunTerminalCommandTool — narzędzia do uruchamiania poleceń terminalowych.
@@ -29,26 +27,17 @@ class RunTerminalCommandToolTest {
     lateinit var tempDir: Path
 
     private lateinit var sandbox: PathSandbox
-    private lateinit var whitelist: CommandWhitelist
-    private lateinit var denylist: CommandDenylist
     private lateinit var limits: CommandLimits
     private lateinit var tool: RunTerminalCommandTool
 
+    private fun allowAllMatcher(): CommandRuleMatcher =
+        CommandRuleMatcher(listOf(CommandRule(".*", RuleAction.ALLOW, "test: allow all")))
+
     @BeforeEach
     fun setup() {
         sandbox = PathSandbox(tempDir)
-        denylist = CommandDenylist()
         limits = CommandLimits.DEFAULT
-
-        val config = CommandWhitelistConfig(
-            enabled = false, // Disabled for testing
-            mode = WhitelistMode.WHITELIST_ONLY,
-            globalBlockedPatterns = emptyList(),
-            allowedCommands = emptyList()
-        )
-        whitelist = CommandWhitelist(config, denylist)
-
-        tool = RunTerminalCommandTool(sandbox, whitelist, limits)
+        tool = RunTerminalCommandTool(sandbox, limits, allowAllMatcher())
     }
 
     @Nested
@@ -80,13 +69,11 @@ class RunTerminalCommandToolTest {
 
         @Test
         fun `should validate params with valid command`() {
-            // When & Then - should not throw
             tool.validateParams(mapOf("command" to "echo hello"))
         }
 
         @Test
         fun `should throw exception when command is missing`() {
-            // When & Then
             val exception = kotlin.test.assertFailsWith<IllegalArgumentException> {
                 tool.validateParams(emptyMap())
             }
@@ -95,7 +82,6 @@ class RunTerminalCommandToolTest {
 
         @Test
         fun `should throw exception when command is null`() {
-            // When & Then
             val exception = kotlin.test.assertFailsWith<IllegalArgumentException> {
                 @Suppress("UNCHECKED_CAST")
                 tool.validateParams(mapOf("command" to null) as Map<String, Any>)
@@ -105,7 +91,6 @@ class RunTerminalCommandToolTest {
 
         @Test
         fun `should throw exception when command is empty`() {
-            // When & Then
             val exception = kotlin.test.assertFailsWith<IllegalArgumentException> {
                 tool.validateParams(mapOf("command" to ""))
             }
@@ -114,7 +99,6 @@ class RunTerminalCommandToolTest {
 
         @Test
         fun `should throw exception when command is blank`() {
-            // When & Then
             val exception = kotlin.test.assertFailsWith<IllegalArgumentException> {
                 tool.validateParams(mapOf("command" to "   "))
             }
@@ -127,60 +111,40 @@ class RunTerminalCommandToolTest {
 
         @Test
         fun `should execute simple command successfully`() = runBlocking {
-            // When
             val result = tool.execute(mapOf("command" to "echo hello"))
 
-            // Then
             assertTrue(result.success)
             assertTrue(result.output!!.contains("hello"))
         }
 
         @Test
         fun `should return exit code in metadata`() = runBlocking {
-            // When
-            val result = tool.execute(mapOf("command" to "echo test"))
-
-            // Then
-            assertNotNull(result.exitCode)
-            assertEquals(0, result.exitCode)
-        }
-
-        @Test
-        fun `should set exitCode in ToolResult`() = runBlocking {
-            // When
             val result = tool.execute(mapOf("command" to "echo test"))
 
-            // Then - exitCode should be set directly on ToolResult
             assertNotNull(result.exitCode)
             assertEquals(0, result.exitCode)
         }
 
         @Test
         fun `should set durationMs correctly`() = runBlocking {
-            // When
             val result = tool.execute(mapOf("command" to "echo test"))
 
-            // Then
             assertNotNull(result.durationMs)
             assertTrue(result.durationMs!! >= 0)
         }
 
         @Test
         fun `should include command in metadata`() = runBlocking {
-            // When
             val result = tool.execute(mapOf("command" to "echo test"))
 
-            // Then
             assertNotNull(result.metadata)
             assertEquals("echo test", result.metadata!!["command"])
         }
 
         @Test
         fun `should include output length in metadata`() = runBlocking {
-            // When
             val result = tool.execute(mapOf("command" to "echo test"))
 
-            // Then
             assertNotNull(result.metadata)
             assertTrue(result.metadata!!.containsKey("output_length"))
         }
@@ -191,10 +155,8 @@ class RunTerminalCommandToolTest {
 
         @Test
         fun `should return error when command parameter is missing`() = runBlocking {
-            // When
             val result = tool.execute(emptyMap())
 
-            // Then
             assertFalse(result.success)
             assertNotNull(result.error)
             assertTrue(result.error!!.contains("command"))
@@ -202,86 +164,59 @@ class RunTerminalCommandToolTest {
 
         @Test
         fun `should handle non-zero exit codes`() = runBlocking {
-            // When - command that will fail
             val result = tool.execute(mapOf("command" to "ls /nonexistent_directory_12345"))
 
-            // Then
-            // Result may be success=true with exit code != 0, or success=false with error
-            // depending on implementation
             assertNotNull(result.exitCode)
             assertTrue(result.exitCode != 0)
         }
 
         @Test
         fun `should handle command that produces no output`() = runBlocking {
-            // When - use cross-platform command that succeeds with no output
-            // On Windows: exit 0 (PowerShell syntax for successful exit)
-            // On Unix: true command
             val os = System.getProperty("os.name").lowercase()
-            val command = if (os.contains("windows")) {
-                "exit 0"  // PowerShell command that succeeds immediately
-            } else {
-                "true"  // Unix builtin
-            }
+            val command = if (os.contains("windows")) "exit 0" else "true"
 
             val result = tool.execute(mapOf("command" to command))
 
-            // Then - command should succeed with exit code 0
             assertTrue(result.success, "Command should succeed")
-            // exitCode should be 0 for successful command
             val exitCode = result.exitCode
             assertTrue(exitCode == null || exitCode == 0, "Exit code should be 0 or null but was: $exitCode")
         }
     }
 
     @Nested
-    inner class WhitelistValidationTests {
+    inner class CommandRuleValidationTests {
 
         @Test
-        fun `should block command when whitelist validation fails`() = runBlocking {
-            // Given
-            val enabledConfig = CommandWhitelistConfig(
-                enabled = true,
-                mode = WhitelistMode.WHITELIST_ONLY,
-                globalBlockedPatterns = emptyList(),
-                allowedCommands = emptyList() // No commands allowed
+        fun `should block command when rule matcher returns BLOCK`() = runBlocking {
+            val blockingMatcher = CommandRuleMatcher(
+                listOf(CommandRule("^echo\\b.*", RuleAction.BLOCK, "test: block echo"))
             )
-            val strictWhitelist = CommandWhitelist(enabledConfig, denylist)
-            val strictTool = RunTerminalCommandTool(sandbox, strictWhitelist, limits)
+            val strictTool = RunTerminalCommandTool(sandbox, limits, blockingMatcher)
 
-            // When
             val result = strictTool.execute(mapOf("command" to "echo test"))
 
-            // Then
             assertFalse(result.success)
             assertNotNull(result.error)
-            assertTrue(result.error!!.contains("not on whitelist", ignoreCase = true))
+            assertTrue(result.error!!.contains("blocked", ignoreCase = true))
         }
 
         @Test
-        fun `should require confirmation when command requires it`() = runBlocking {
-            // Given - command that requires confirmation (e.g., risky operation)
-            // This test documents expected behavior when confirmation is needed
-            // Actual implementation depends on whitelist configuration
+        fun `should allow command when rule matcher returns ALLOW`() = runBlocking {
+            val result = tool.execute(mapOf("command" to "echo allowed"))
 
-            // When
-            val result = tool.execute(mapOf("command" to "echo test"))
-
-            // Then
-            // With disabled whitelist, should execute normally
-            assertTrue(result.success || result.error != null)
+            assertTrue(result.success)
         }
 
         @Test
-        fun `should block commands matching denylist patterns`() = runBlocking {
-            // When - command with dangerous pattern
-            val result = tool.execute(mapOf("command" to "rm -rf /tmp/test"))
+        fun `should allow command on ASK (pre-approved at tool level)`() = runBlocking {
+            val askMatcher = CommandRuleMatcher(
+                listOf(CommandRule("^echo\\b.*", RuleAction.ASK, "test: ask"))
+            )
+            val askTool = RunTerminalCommandTool(sandbox, limits, askMatcher)
 
-            // Then
-            // Should be blocked by denylist
-            assertFalse(result.success)
-            assertTrue(result.error!!.contains("not allowed", ignoreCase = true) ||
-                       result.error!!.contains("blocked", ignoreCase = true))
+            val result = askTool.execute(mapOf("command" to "echo ask-case"))
+
+            assertTrue(result.success)
         }
     }
 
@@ -290,18 +225,12 @@ class RunTerminalCommandToolTest {
 
         @Test
         fun `should enforce command timeout`() = runBlocking {
-            // Given - strict limits with short timeout
             val strictLimits = CommandLimits(timeoutSeconds = 1)
-            val strictTool = RunTerminalCommandTool(sandbox, whitelist, strictLimits)
+            val strictTool = RunTerminalCommandTool(sandbox, strictLimits, allowAllMatcher())
 
-            // When - command that takes longer than timeout
-            // Note: This is a simplified test - actual timeout testing requires
-            // commands that reliably exceed the timeout
             val result = strictTool.execute(mapOf("command" to "echo quick"))
 
-            // Then
             assertNotNull(result)
-            // Command should complete quickly and succeed
             assertTrue(result.success)
         }
     }
@@ -311,17 +240,13 @@ class RunTerminalCommandToolTest {
 
         @Test
         fun `should truncate large output`() = runBlocking {
-            // Given - strict limits with small output size
             val strictLimits = CommandLimits(maxOutputSize = 100)
-            val strictTool = RunTerminalCommandTool(sandbox, whitelist, strictLimits)
+            val strictTool = RunTerminalCommandTool(sandbox, strictLimits, allowAllMatcher())
 
-            // When - command that produces lots of output
             val result = strictTool.execute(mapOf("command" to "echo very long output that should be truncated"))
 
-            // Then
             assertTrue(result.success)
             val output = result.output!!
-            // Output should be truncated or marked as truncated
             if (output.length > 100) {
                 assertTrue(output.contains("truncated", ignoreCase = true))
             }
@@ -329,17 +254,13 @@ class RunTerminalCommandToolTest {
 
         @Test
         fun `should indicate truncation in metadata`() = runBlocking {
-            // Given
             val strictLimits = CommandLimits(maxOutputSize = 10)
-            val strictTool = RunTerminalCommandTool(sandbox, whitelist, strictLimits)
+            val strictTool = RunTerminalCommandTool(sandbox, strictLimits, allowAllMatcher())
 
-            // When
             val result = strictTool.execute(mapOf("command" to "echo this is longer than ten chars"))
 
-            // Then
             assertNotNull(result.metadata)
             val truncated = result.metadata!!["truncated"] as? Boolean
-            // If output was truncated, should be true
             if (result.output!!.length >= 10) {
                 assertTrue(truncated == true)
             }
@@ -351,26 +272,19 @@ class RunTerminalCommandToolTest {
 
         @Test
         fun `should execute command in project root`() = runBlocking {
-            // Given - create a test file in temp directory
             java.nio.file.Files.writeString(tempDir.resolve("test.txt"), "content")
 
-            // When - use cross-platform command
             val command = if (System.getProperty("os.name").lowercase().contains("windows")) "dir" else "ls"
             val result = tool.execute(mapOf("command" to command))
 
-            // Then
             assertTrue(result.success)
-            // Output should contain files from temp directory
         }
 
         @Test
         fun `should respect sandbox boundaries`() = runBlocking {
-            // Given
             val result = tool.execute(mapOf("command" to "echo test"))
 
-            // Then
             assertNotNull(result)
-            // Command should execute within sandbox
             assertTrue(result.success || result.error != null)
         }
     }
@@ -380,38 +294,27 @@ class RunTerminalCommandToolTest {
 
         @Test
         fun `should handle commands with arguments`() = runBlocking {
-            // When
             val result = tool.execute(mapOf("command" to "echo hello world"))
 
-            // Then
             assertTrue(result.success)
             assertTrue(result.output!!.contains("hello") || result.output!!.contains("world"))
         }
 
         @Test
         fun `should handle commands with quotes`() = runBlocking {
-            // When
             val result = tool.execute(mapOf("command" to "echo \"quoted text\""))
 
-            // Then
             assertTrue(result.success)
         }
 
         @Test
         fun `should handle empty output gracefully`() = runBlocking {
-            // When - use cross-platform command that succeeds with minimal/no output
             val os = System.getProperty("os.name").lowercase()
-            val command = if (os.contains("windows")) {
-                "exit 0"  // PowerShell command that succeeds immediately
-            } else {
-                "true"  // Unix builtin
-            }
+            val command = if (os.contains("windows")) "exit 0" else "true"
 
             val result = tool.execute(mapOf("command" to command))
 
-            // Then - should succeed even with no/empty output
             assertTrue(result.success, "Command should succeed")
-            // Exit code should be 0 for successful command
             val exitCode = result.exitCode
             assertTrue(exitCode == null || exitCode == 0, "Exit code should be 0 or null but was: $exitCode")
         }
@@ -422,22 +325,16 @@ class RunTerminalCommandToolTest {
 
         @Test
         fun `should handle malformed command gracefully`() = runBlocking {
-            // When - command with invalid syntax
             val result = tool.execute(mapOf("command" to "echo \"unclosed quote"))
 
-            // Then
             assertNotNull(result)
-            // Should either succeed or fail with descriptive error
         }
 
         @Test
         fun `should handle command that produces error output`() = runBlocking {
-            // When - command that writes to stderr
             val result = tool.execute(mapOf("command" to "ls /nonexistent"))
 
-            // Then
             assertNotNull(result)
-            // Should have output (stderr is combined with stdout)
         }
     }
 
@@ -446,10 +343,8 @@ class RunTerminalCommandToolTest {
 
         @Test
         fun `should return valid parameter schema`() {
-            // When
             val schema = tool.getParameterSchema()
 
-            // Then
             assertEquals("object", schema["type"])
             val properties = schema["properties"] as Map<*, *>
             assertNotNull(properties["command"])
diff --git a/core/src/test/kotlin/pl/jclab/refio/core/tools/security/CommandRuleMatcherTest.kt b/core/src/test/kotlin/pl/jclab/refio/core/tools/security/CommandRuleMatcherTest.kt
index bd112fe2..94e1da59 100644
--- a/core/src/test/kotlin/pl/jclab/refio/core/tools/security/CommandRuleMatcherTest.kt
+++ b/core/src/test/kotlin/pl/jclab/refio/core/tools/security/CommandRuleMatcherTest.kt
@@ -2,6 +2,7 @@ package pl.jclab.refio.core.tools.security
 
 import org.junit.jupiter.api.Nested
 import org.junit.jupiter.api.Test
+import org.junit.jupiter.api.assertThrows
 import kotlin.test.assertEquals
 
 class CommandRuleMatcherTest {
@@ -146,15 +147,13 @@ class CommandRuleMatcherTest {
         }
 
         @Test
-        fun `should handle invalid regex gracefully`() {
-            val matcher = CommandRuleMatcher(listOf(
-                CommandRule("[invalid", RuleAction.BLOCK, "Bad regex"),
-                CommandRule("^ls(\\s+.*)?$", RuleAction.ALLOW, "List files")
-            ))
-
-            // Invalid regex should be skipped, valid one should work
-            val result = matcher.match("ls -la")
-            assertEquals(RuleAction.ALLOW, result.action)
+        fun `invalid regex fails at CommandRule construction not at match time`() {
+            // Post Sprint 1: invalid regex is detected eagerly. Config loader must refuse
+            // startup when this throws, instead of silently dropping the rule.
+            val err = assertThrows<IllegalArgumentException> {
+                CommandRule("[invalid", RuleAction.BLOCK, "Bad regex")
+            }
+            assert(err.message!!.contains("Invalid command rule regex")) { "unexpected: ${err.message}" }
         }
     }
 }
diff --git a/core/src/test/kotlin/pl/jclab/refio/core/tools/security/CommandWhitelistTest.kt b/core/src/test/kotlin/pl/jclab/refio/core/tools/security/CommandWhitelistTest.kt
deleted file mode 100644
index fb14bebb..00000000
--- a/core/src/test/kotlin/pl/jclab/refio/core/tools/security/CommandWhitelistTest.kt
+++ /dev/null
@@ -1,121 +0,0 @@
-package pl.jclab.refio.core.tools.security
-
-import org.junit.jupiter.api.Test
-import kotlin.test.assertFalse
-import kotlin.test.assertTrue
-
-class CommandWhitelistTest {
-
-    @Test
-    fun `should allow command present on whitelist`() {
-        val whitelist = createWhitelist()
-        val result = whitelist.validate("git status")
-
-        assertTrue(result.allowed)
-    }
-
-    @Test
-    fun `should block command not on whitelist in strict mode`() {
-        val whitelist = createWhitelist()
-        val result = whitelist.validate("rm -rf /")
-
-        assertFalse(result.allowed)
-    }
-
-    @Test
-    fun `should block command with blocked flag`() {
-        val whitelist = createWhitelist()
-        val result = whitelist.validate("git status --force")
-
-        assertFalse(result.allowed)
-    }
-
-    @Test
-    fun `should block command with blocked subcommand`() {
-        val whitelist = createWhitelist()
-        val result = whitelist.validate("git push origin main")
-
-        assertFalse(result.allowed)
-    }
-
-    @Test
-    fun `should allow pipeline with allowed commands only`() {
-        val whitelist = createWhitelist()
-        val result = whitelist.validate("git log --oneline | head -20")
-
-        assertTrue(result.allowed)
-    }
-
-    @Test
-    fun `should block pipeline that invokes shell`() {
-        val whitelist = createWhitelist()
-        val result = whitelist.validate("git log | sh")
-
-        assertFalse(result.allowed)
-    }
-
-    @Test
-    fun `should block command substitution`() {
-        val whitelist = createWhitelist()
-        val result = whitelist.validate("echo $(cat /etc/passwd)")
-
-        assertFalse(result.allowed)
-    }
-
-    @Test
-    fun `should allow non-whitelisted command in whitelist plus deny mode when denylist allows it`() {
-        val whitelist = CommandWhitelist(
-            config = CommandWhitelistConfig(
-                mode = WhitelistMode.WHITELIST_PLUS_DENY,
-                allowedCommands = emptyList(),
-                globalBlockedPatterns = emptyList()
-            ),
-            denylist = CommandDenylist.DEFAULT
-        )
-
-        val result = whitelist.validate("echo hello")
-
-        assertTrue(result.allowed)
-    }
-
-    @Test
-    fun `should block non-whitelisted command in whitelist plus deny mode when denylist blocks it`() {
-        val whitelist = CommandWhitelist(
-            config = CommandWhitelistConfig(
-                mode = WhitelistMode.WHITELIST_PLUS_DENY,
-                allowedCommands = emptyList(),
-                globalBlockedPatterns = emptyList()
-            ),
-            denylist = CommandDenylist.DEFAULT
-        )
-
-        val result = whitelist.validate("rm -rf /")
-
-        assertFalse(result.allowed)
-    }
-
-    @Test
-    fun `should normalize executable path for whitelisted command`() {
-        val whitelist = createWhitelist()
-        val result = whitelist.validate("\"C:\\Program Files\\Git\\bin\\git.exe\" status")
-
-        assertTrue(result.allowed)
-    }
-
-    private fun createWhitelist(): CommandWhitelist {
-        val config = CommandWhitelistConfig(
-            mode = WhitelistMode.WHITELIST_ONLY,
-            allowedCommands = listOf(
-                AllowedCommand(
-                    program = "git",
-                    allowedSubcommands = listOf("status", "log", "add", "commit"),
-                    blockedSubcommands = listOf("push"),
-                    blockedFlags = listOf("--force")
-                ),
-                AllowedCommand(program = "head")
-            ),
-            globalBlockedPatterns = CommandWhitelistDefaults.DEFAULT_BLOCKED_PATTERNS
-        )
-        return CommandWhitelist(config, CommandDenylist.DEFAULT)
-    }
-}
diff --git a/core/src/test/kotlin/pl/jclab/refio/core/workflow/WorkflowIntegrationTest.kt b/core/src/test/kotlin/pl/jclab/refio/core/workflow/WorkflowIntegrationTest.kt
index a68cda97..8142af3d 100644
--- a/core/src/test/kotlin/pl/jclab/refio/core/workflow/WorkflowIntegrationTest.kt
+++ b/core/src/test/kotlin/pl/jclab/refio/core/workflow/WorkflowIntegrationTest.kt
@@ -11,18 +11,20 @@ import org.junit.jupiter.api.Test
 import pl.jclab.refio.api.models.ExecutionMode
 import pl.jclab.refio.api.models.TaskMode
 import pl.jclab.refio.core.api.ExecuteStepResponse
+import pl.jclab.refio.core.api.PlanningRequest
+import pl.jclab.refio.core.api.routers.AgentRouter
 import pl.jclab.refio.core.db.ApprovalStatus
 import pl.jclab.refio.core.db.Subtask
 import pl.jclab.refio.core.db.SubtaskKind
 import pl.jclab.refio.core.db.TaskStatus
 import pl.jclab.refio.core.db.repositories.SubtaskRepository
 import pl.jclab.refio.core.models.api.ChatCosts
+import pl.jclab.refio.core.models.api.ChatRequest
 import pl.jclab.refio.core.models.api.ChatResponse
+import pl.jclab.refio.core.services.ChatService
+import pl.jclab.refio.core.services.PlanningService
+import pl.jclab.refio.core.services.execution.unified.ExecutionEventListener
 import pl.jclab.refio.core.subagents.SubagentRouter
-import pl.jclab.refio.core.workflow.executors.ChatExecutor
-import pl.jclab.refio.core.workflow.executors.PlanExecutor
-import pl.jclab.refio.core.workflow.executors.StepExecutor
-import pl.jclab.refio.core.workflow.executors.SubagentExecutor
 import pl.jclab.refio.core.workflow.models.IntentResult
 import pl.jclab.refio.core.workflow.models.UIState
 import pl.jclab.refio.core.workflow.models.WorkflowIntent
@@ -37,11 +39,10 @@ class WorkflowIntegrationTest {
     private val subtaskRepository = mockk<SubtaskRepository>()
     private val subagentRouter = mockk<SubagentRouter>()
 
-    // WorkflowOrchestrator dependencies
-    private val chatExecutor = mockk<ChatExecutor>()
-    private val planExecutor = mockk<PlanExecutor>()
-    private val stepExecutor = mockk<StepExecutor>()
-    private val subagentExecutor = mockk<SubagentExecutor>()
+    // WorkflowOrchestrator dependencies (now services, not executors)
+    private val chatService = mockk<ChatService>()
+    private val planningService = mockk<PlanningService>()
+    private val agentRouter = mockk<AgentRouter>()
 
     private lateinit var intentRouter: IntentRouter
     private lateinit var orchestrator: WorkflowOrchestrator
@@ -55,10 +56,10 @@ class WorkflowIntegrationTest {
 
         orchestrator = WorkflowOrchestrator(
             intentRouter = intentRouter,
-            chatExecutor = chatExecutor,
-            planExecutor = planExecutor,
-            stepExecutor = stepExecutor,
-            subagentExecutor = subagentExecutor,
+            chatService = chatService,
+            planningService = planningService,
+            agentRouter = agentRouter,
+            subagentRouter = subagentRouter,
             userInteraction = null
         )
     }
@@ -230,10 +231,8 @@ class WorkflowIntegrationTest {
                 costs = ChatCosts(tokensIn = 50, tokensOut = 100, usdEst = 0.001)
             )
 
-            // IntentRouter will create Chat intent, which orchestrator routes to chatExecutor
-            coEvery {
-                chatExecutor.execute(any<WorkflowIntent.Chat>(), any(), any())
-            } returns IntentResult.ChatResult(chatResponse)
+            // Orchestrator now calls chatService.chat directly.
+            coEvery { chatService.chat(any<ChatRequest>(), any(), any()) } returns chatResponse
 
             // When
             val result = orchestrator.execute(request)
@@ -244,7 +243,9 @@ class WorkflowIntegrationTest {
             assertEquals(50, result.response.costs.tokensIn)
             assertEquals(100, result.response.costs.tokensOut)
 
-            coVerify(exactly = 1) { chatExecutor.execute(any<WorkflowIntent.Chat>(), any(), any()) }
+            coVerify(exactly = 1) {
+                chatService.chat(match<ChatRequest> { it.taskId == "task-1" }, any(), any())
+            }
         }
 
         @Test
@@ -269,8 +270,8 @@ class WorkflowIntegrationTest {
                 costs = ChatCosts(tokensIn = 5, tokensOut = 5, usdEst = 0.0)
             )
             coEvery {
-                chatExecutor.execute(match<WorkflowIntent.Chat> { it.taskId == "task-42" }, any(), any())
-            } returns IntentResult.ChatResult(response)
+                chatService.chat(match<ChatRequest> { it.taskId == "task-42" }, any(), any())
+            } returns response
 
             // When
             val result = orchestrator.execute(request)
@@ -309,9 +310,13 @@ class WorkflowIntegrationTest {
                 durationMs = 500,
                 error = null
             )
-            coEvery {
-                stepExecutor.execute(any<WorkflowIntent.ExecuteStep>(), any())
-            } returns IntentResult.StepResult(stepResponse)
+            every {
+                agentRouter.executeSubtaskStepWithListener(
+                    "task-1",
+                    "s1",
+                    any<ExecutionEventListener>()
+                )
+            } returns stepResponse
 
             // When
             val result = orchestrator.execute(request)
@@ -322,7 +327,9 @@ class WorkflowIntegrationTest {
             assertEquals("Step completed", result.response.summary)
 
             // Should execute only one step and stop (AUTO stops after plan intent follows)
-            coVerify(exactly = 1) { stepExecutor.execute(any(), any()) }
+            coVerify(exactly = 1) {
+                agentRouter.executeSubtaskStepWithListener("task-1", "s1", any<ExecutionEventListener>())
+            }
         }
     }
 }
diff --git a/core/src/test/kotlin/pl/jclab/refio/core/workflow/WorkflowOrchestratorTest.kt b/core/src/test/kotlin/pl/jclab/refio/core/workflow/WorkflowOrchestratorTest.kt
index 05f0cf44..a9a5ae05 100644
--- a/core/src/test/kotlin/pl/jclab/refio/core/workflow/WorkflowOrchestratorTest.kt
+++ b/core/src/test/kotlin/pl/jclab/refio/core/workflow/WorkflowOrchestratorTest.kt
@@ -2,18 +2,22 @@ package pl.jclab.refio.core.workflow
 
 import io.mockk.coEvery
 import io.mockk.coVerify
+import io.mockk.every
 import io.mockk.mockk
 import kotlin.test.assertEquals
 import org.junit.jupiter.api.Test
 import pl.jclab.refio.api.models.ExecutionMode
 import pl.jclab.refio.api.models.TaskMode
 import pl.jclab.refio.core.api.ExecuteStepResponse
+import pl.jclab.refio.core.api.PlanningRequest
+import pl.jclab.refio.core.api.routers.AgentRouter
 import pl.jclab.refio.core.models.api.ChatCosts
+import pl.jclab.refio.core.models.api.ChatRequest
 import pl.jclab.refio.core.models.api.ChatResponse
-import pl.jclab.refio.core.workflow.executors.ChatExecutor
-import pl.jclab.refio.core.workflow.executors.PlanExecutor
-import pl.jclab.refio.core.workflow.executors.StepExecutor
-import pl.jclab.refio.core.workflow.executors.SubagentExecutor
+import pl.jclab.refio.core.services.ChatService
+import pl.jclab.refio.core.services.PlanningService
+import pl.jclab.refio.core.services.execution.unified.ExecutionEventListener
+import pl.jclab.refio.core.subagents.SubagentRouter
 import pl.jclab.refio.core.workflow.models.IntentResult
 import pl.jclab.refio.core.workflow.models.UIState
 import pl.jclab.refio.core.workflow.models.WorkflowIntent
@@ -22,17 +26,17 @@ import pl.jclab.refio.core.workflow.models.WorkflowRequest
 class WorkflowOrchestratorTest {
 
     private val intentRouter = mockk<IntentRouter>()
-    private val chatExecutor = mockk<ChatExecutor>()
-    private val planExecutor = mockk<PlanExecutor>()
-    private val stepExecutor = mockk<StepExecutor>()
-    private val subagentExecutor = mockk<SubagentExecutor>()
+    private val chatService = mockk<ChatService>()
+    private val planningService = mockk<PlanningService>()
+    private val agentRouter = mockk<AgentRouter>()
+    private val subagentRouter = mockk<SubagentRouter>()
 
     private val orchestrator = WorkflowOrchestrator(
         intentRouter = intentRouter,
-        chatExecutor = chatExecutor,
-        planExecutor = planExecutor,
-        stepExecutor = stepExecutor,
-        subagentExecutor = subagentExecutor,
+        chatService = chatService,
+        planningService = planningService,
+        agentRouter = agentRouter,
+        subagentRouter = subagentRouter,
         userInteraction = null
     )
 
@@ -63,12 +67,14 @@ class WorkflowOrchestratorTest {
         )
 
         coEvery { intentRouter.determineIntent(uiState, any(), any()) } returns intent
-        coEvery { chatExecutor.execute(intent, any(), any()) } returns IntentResult.ChatResult(response)
+        coEvery { chatService.chat(any<ChatRequest>(), any(), any()) } returns response
 
         val result = orchestrator.execute(request)
 
         assertEquals(IntentResult.ChatResult(response), result)
-        coVerify(exactly = 1) { chatExecutor.execute(intent, any(), any()) }
+        coVerify(exactly = 1) {
+            chatService.chat(match { it.taskId == "task-1" && it.input == "hello" }, any(), any())
+        }
     }
 
     @Test
@@ -93,14 +99,20 @@ class WorkflowOrchestratorTest {
         )
 
         coEvery { intentRouter.determineIntent(uiState, any(), any()) } returnsMany listOf(stepIntent, planIntent)
-        coEvery { stepExecutor.execute(stepIntent, any()) } returns IntentResult.StepResult(
-            ExecuteStepResponse(status = "success", summary = "ok", durationMs = 1, error = null)
-        )
+        every {
+            agentRouter.executeSubtaskStepWithListener(
+                "task-1",
+                "subtask-1",
+                any<ExecutionEventListener>()
+            )
+        } returns ExecuteStepResponse(status = "success", summary = "ok", durationMs = 1, error = null)
 
         val result = orchestrator.execute(request)
 
         assertEquals("success", (result as IntentResult.StepResult).response.status)
-        coVerify(exactly = 1) { stepExecutor.execute(stepIntent, any()) }
-        coVerify(exactly = 0) { planExecutor.execute(any(), any(), any()) }
+        coVerify(exactly = 1) {
+            agentRouter.executeSubtaskStepWithListener("task-1", "subtask-1", any<ExecutionEventListener>())
+        }
+        coVerify(exactly = 0) { planningService.createPlan(any(), any<PlanningRequest>(), any(), any()) }
     }
 }
diff --git a/docs/ARCHITECTURE.md b/docs/ARCHITECTURE.md
index 49b97ffc..9a28f71b 100644
--- a/docs/ARCHITECTURE.md
+++ b/docs/ARCHITECTURE.md
@@ -2,6 +2,11 @@
 
 > For contributors and advanced users. See [README.md](../README.md) for product overview.
 
+> **Recent notable changes** (see [CHANGELOG.md](../CHANGELOG.md) `[Unreleased]`):
+> - "Slash commands" were renamed to "Prompts" (code: `SlashCommand` → `SlashPrompt`, DB enum `SLASH_COMMAND` → `SLASH_PROMPT` with V3 migration). The `/name` invocation syntax is unchanged; UI tab is now **Settings → Prompts**.
+> - Terminal command security uses a single `CommandRule` regex engine (`ALLOW` / `BLOCK` / `ASK`); the legacy `CommandWhitelist` / `CommandDenylist` classes have been removed.
+> - Models settings tab no longer triggers remote provider fetches on open — only the **Refresh** button does. Local providers (Ollama, LM Studio) use a 3 s `listModels` timeout.
+
 ---
 
 ## Project Overview
@@ -276,7 +281,7 @@ RagSearchResult[]
 
 ## Tools
 
-### READ_ONLY Tools (7)
+### READ_ONLY Tools (14)
 
 | Tool | Parameters | Description |
 |------|------------|-------------|
@@ -287,8 +292,15 @@ RagSearchResult[]
 | `view_diff` | file1, file2 OR content2 | Line-by-line comparison |
 | `invoke_subagent` | subagent_name, goal, context_refs? | Run nested child loop via subagent profile |
 | `delegate_to_strong_model` | task, context?, allow_tools?, response_format? | Delegate complex task to a stronger model (single-shot or tool-enabled sub-agent). Only registered when `models.defaults.strong` is configured. |
+| `web_search` | query, max_results? | Search the web (Brave/SerpAPI/DuckDuckGo) |
+| `fetch_webpage` | url, prompt, max_content_chars? | Fetch URL, convert to Markdown, process with LLM |
+| `code_intelligence` | action, symbol?, path?, language? | Find usages/definitions, list symbols, get diagnostics |
+| `monitor_process` | process_id, max_lines? | Read output from background process |
+| `ask_user` | question, options? | Ask the user a question and wait for response |
+| `sleep` | duration_ms | Pause execution (max 30s) |
+| `think` | thought | Explicit reasoning slot |
 
-### WRITE Tools (8)
+### WRITE Tools (10)
 
 | Tool | Parameters | Description | Cost |
 |------|------------|-------------|------|
@@ -300,6 +312,8 @@ RagSearchResult[]
 | `run_terminal_command` | command | Shell execution (ASK in AGENT, CommandRule-protected) | Free |
 | `http_request` | url, method, headers, body, save_to_file | HTTP requests (GET/POST/PUT/DELETE), 5 MB limit, 60s timeout | Free |
 | `run_code` | language, code | Execute Python/JavaScript/Kotlin Script, 120s timeout | Free |
+| `run_process_background` | command | Start command in background, return process_id | Free |
+| `llm_call` | prompt, data?, file_path?, model? | Raw single-turn LLM call | ~$0.01 |
 
 ### Tool Availability by Mode
 
@@ -541,7 +555,7 @@ See [`docs/config.md`](config.md) for full configuration reference.
 |-------|------------|
 | **PathSandbox** | All file ops restricted to project root |
 | **FileLimits** | Size limits (2MB), excluded directories (24), extensions (34) |
-| **CommandRule System** | Regex-based rules with ALLOW/BLOCK/ASK levels replacing legacy whitelist |
+| **CommandRule System** | Regex-based rules with `ALLOW` / `BLOCK` / `ASK` actions; unified replacement for the removed `CommandWhitelist` / `CommandDenylist`. Defaults in `CommandRuleDefaults`, limits in `CommandLimits`. |
 | **ToolPermissions** | 3-level access control: ON/ASK/OFF per mode (PLAN=read-only, AGENT=read-write) |
 | **No-Egress Mode** | Blocks cloud providers, allows only Ollama/LM Studio |
 | **Secret Redaction** | API keys masked in all logs |
@@ -551,7 +565,7 @@ See [`docs/config.md`](config.md) for full configuration reference.
 | Issue | Description | Mitigation |
 |-------|-------------|------------|
 | Symlink Escape | PathSandbox can be bypassed via symlinks | Detection + logging in place |
-| Whitelist Coverage | Some workflows may require adding project-specific safe commands | Configure in Tools Settings (Terminal Whitelist) |
+| Command Rule Coverage | Some workflows may require adding project-specific safe commands | Configure in Tools Settings → Terminal Command Rules (regex-based `ALLOW` / `BLOCK` / `ASK`) |
 
 ---
 
diff --git a/docs/files.md b/docs/files.md
index c3343ae6..a41c3c8f 100644
--- a/docs/files.md
+++ b/docs/files.md
@@ -177,6 +177,12 @@
 - **FileSearchTool.kt** — Finds files by glob pattern (converted to regex); depth limit 10, result limit 100, offset/limit pagination.
 - **GrepSearchTool.kt** — Regex text search across files with case sensitivity, glob filtering, 2MB exclusion; formatted as `file:lineNumber: content`.
 - **ViewDiffTool.kt** — Compares two files or file vs content using line-by-line diff; unified diff format with add/remove/unchanged counts.
+- **WebSearchTool.kt** — Web search via Brave Search, SerpAPI, or DuckDuckGo Instant Answers; configurable provider/API key via `tools.web_search.*` config; returns titles, URLs, snippets.
+- **FetchWebpageTool.kt** — Fetches URL, converts HTML to Markdown via Jsoup (strips nav/ads/scripts), processes with weak LLM model using user prompt; 50K char content limit.
+- **CodeIntelligenceTool.kt** — IDE-independent code intelligence: `find_usages` (grep), `find_definition` (ctags/grep), `list_symbols` (ctags), `get_diagnostics` (compiler CLI); works in CLI without IntelliJ PSI.
+- **MonitorProcessTool.kt** — Reads stdout from background process by `process_id`; returns lines and running status; auto-removes finished processes.
+- **AskUserTool.kt** — Asks user a question with optional predefined choices; suspends agent loop via `UserQuestionService`; 10-minute timeout.
+- **SleepTool.kt** — Pauses agent execution for up to 30 seconds; uses coroutine `delay()`.
 
 ### Write Tools
 - **CreateNewFileTool.kt** — Creates files with content validation, size limits, file lock; warns on existing file; auto-creates parent dirs.
@@ -187,12 +193,14 @@
 - **RunTerminalCommandTool.kt** — Shell execution with CommandRule-based validation (ALLOW/BLOCK/ASK); legacy whitelist fallback; 120s timeout, 200KB output limit; async I/O to prevent deadlocks; cross-platform shell selection.
 - **HttpRequestTool.kt** — HTTP requests (GET/POST/PUT/DELETE) with optional `save_to_file` for large responses; 5MB response limit, 60s timeout; auto-detects format (CSV, JSON, text) for saved files; header filtering for relevant response headers.
 - **RunCodeTool.kt** — Inline code execution for Python, JavaScript, and Kotlin Script; 120s timeout; sandbox via temporary files; captures stdout/stderr; OFF by default (requires explicit enabling).
+- **RunProcessBackgroundTool.kt** — Starts shell command in background via `ProcessManager`; returns `process_id` immediately; CommandRule security validation; for long-running builds/dev servers.
 
 ### Subagent Integration
 - **InvokeSubagentTool.kt** — Enables agent-to-subagent delegation; validates availability, detects recursion; builds `TurnRequest` with overrides for system prompt, tools, depth.
 
 ## `core/services/` — Core Services
 
+- **ProcessManager.kt** — Manages long-running background processes; `start(command, workingDir)` returns `ManagedProcess` with `processId`; `readOutput()` non-blocking line reader; `stop()` forcibly destroys; cross-platform shell wrapping (cmd.exe/sh).
 - **AgentExecutor.kt** — Orchestrates step-by-step execution: planning → execution → summarization with subtask lifecycle management.
 - **AgentTurnLoop.kt** — Self-directing tool loop for PLAN/AGENT modes (~988 LOC); delegates to `turn/` sub-components for LLM calls, prompt building, tool execution, response processing, guardrails.
 - **TurnLoopConfig.kt** — Configuration for AgentTurnLoop with factory methods for PLAN (25 iterations) and AGENT (50 iterations) presets; includes auto-compaction thresholds, parallel tools, snapshots, retry config, working memory, read-only budget guard (ADR-0044).
@@ -242,6 +250,7 @@
 - **TurnLoopConfigAliases.kt** — Type aliases for turn loop configuration.
 - **TurnPromptAliases.kt** — Type aliases for prompt building.
 - **ToolApprovalService.kt** — Manages user approval flow for tools with `PermissionLevel.ASK`; `ApprovalDecision` sealed class (Approved/Trusted/Rejected); session trust rules via `ConcurrentHashMap` with regex pattern matching; 5-minute approval timeout; `pendingRequests` StateFlow for UI observation.
+- **UserQuestionService.kt** — Suspends agent loop while waiting for user answer; `AskUserRequest` with requestId/taskId/question/options; `Listener` interface for UI; `resolve()`/`cancel()` from UI; `CompletableDeferred` + 10-min timeout; same pattern as `ToolApprovalService`.
 - **ToolRejectedException.kt** — Exception thrown when user rejects tool execution; caught in AgentTurnLoop to record rejection, set `TurnResult(rejectedByUser=true)`, and break the loop returning control to user prompt.
 
 ## `core/services/context/` — Context Building Helpers & Extracted Sub-Services
diff --git a/docs/overview.md b/docs/overview.md
index 92c0cb4c..5e067fd6 100644
--- a/docs/overview.md
+++ b/docs/overview.md
@@ -643,7 +643,7 @@ interface Tool {
 }
 ```
 
-### READ_ONLY Tools (7)
+### READ_ONLY Tools (14)
 
 | Tool | Parameters | Description | Limits |
 |------|------------|-------------|--------|
@@ -652,10 +652,17 @@ interface Tool {
 | `file_search` | pattern, path, offset, limit | Glob pattern search | 100 results |
 | `grep_search` | pattern, path, case_sensitive | Regex content search | 500 results |
 | `view_diff` | file1, file2/content2 | Line-by-line diff | - |
-| `invoke_subagent` | subagent_name, goal, context_refs? | Run nested child loop with a specialized subagent (dynamic description: active subagents + allowed tools/inherit) | Depth <= 3 |
-| `delegate_to_strong_model` | task, context?, allow_tools?, response_format? | Delegate complex task to a stronger model (single-shot or tool-enabled sub-agent). Only registered when `models.defaults.strong` is configured. | - |
-
-### WRITE Tools (8)
+| `invoke_subagent` | subagent_name, goal, context_refs? | Run nested child loop with a specialized subagent | Depth <= 3 |
+| `delegate_to_strong_model` | task, context?, allow_tools?, response_format? | Delegate complex task to a stronger model | - |
+| `web_search` | query, max_results? | Search the web (Brave/SerpAPI/DuckDuckGo) | 20 results max |
+| `fetch_webpage` | url, prompt, max_content_chars? | Fetch URL → Markdown → LLM processing | 50K chars max |
+| `code_intelligence` | action, symbol?, path?, language? | Find usages, definitions, list symbols, compiler diagnostics | ctags optional |
+| `monitor_process` | process_id, max_lines? | Read output from background process | 1000 lines max |
+| `ask_user` | question, options? | Ask user a question and wait for response | 10 min timeout |
+| `sleep` | duration_ms | Pause execution | 30s max |
+| `think` | thought | Explicit reasoning slot | - |
+
+### WRITE Tools (10)
 
 | Tool | Parameters | Description | Cost |
 |------|------------|-------------|------|
@@ -667,6 +674,8 @@ interface Tool {
 | `run_terminal_command` | command | Shell execution (whitelist-protected) | **AGENT: ON (default)** |
 | `http_request` | url, method, headers, body, save_to_file | HTTP requests (GET/POST/PUT/DELETE), 5 MB limit, 60s timeout | Free |
 | `run_code` | language, code | Execute Python/JavaScript/Kotlin Script snippets, 120s timeout | **OFF by default** |
+| `run_process_background` | command | Start command in background, return process_id | Free |
+| `llm_call` | prompt, data?, file_path?, model? | Raw single-turn LLM call | ~$0.01 |
 
 ### Security Layers
 
@@ -1100,7 +1109,7 @@ Layer 7: Secret Redaction
 | Issue | Location | Impact | Status |
 |-------|----------|--------|--------|
 | Symlink Escape | PathSandbox.kt | Can escape project root | Detection in place |
-| Whitelist Coverage | CommandWhitelistDefaults.kt | Missing command entries can block harmless commands | Add via config/UI whitelist |
+| Command Rule Coverage | `CommandRuleDefaults.kt` | Missing rules may require confirmation (`ASK`) for otherwise harmless commands | Add project-specific rules via Tools Settings → Terminal Command Rules |
 
 ---
 
@@ -1218,7 +1227,7 @@ TuiApp (entry point — launchTuiApp())
 │
 ├── Detects interactive mode: System.console() != null
 │   ├── Interactive: alternate screen buffer, raw JLine3 input, F-key navigation
-│   └── Non-interactive: inline rendering, line-based input (/commands, :shortcuts)
+│   └── Non-interactive: inline rendering, line-based input (`/prompt`, `:shortcuts`)
 │
 ├── Three concurrent coroutines:
 │   ├── Render loop — stateFlow.collect { renderer.render(state) }
@@ -1255,7 +1264,7 @@ TuiApp (entry point — launchTuiApp())
 │
 ├── TuiInputHandler (dual-mode input)
 │   ├── Raw mode (real TTY): JLine3 reader, single-char dispatch, escape sequence parsing
-│   ├── Line mode (IDE/pipe): BufferedReader from System.in, /commands, :tab shortcuts
+│   ├── Line mode (IDE/pipe): BufferedReader from System.in, `/prompt`, `:tab` shortcuts
 │   ├── dispatchAction() handles: tab switching, typing, backspace, send, autocomplete
 │   └── Slash commands: /quit, /clear, /help, /mode, /history, /settings, /set section.key value
 │
@@ -1296,7 +1305,7 @@ The Settings screen provides full configuration access via `ConfigRouter`, match
 | General | `general` | Markdown rendering, streaming, advanced view toggles |
 | Providers | `providers` | 8 providers (Ollama, Anthropic, OpenAI, OpenRouter, Gemini, LM Studio, Custom OpenAI, Z.AI) with masked API keys and status indicators |
 | Models | `models` | Model assignments: default, planning, coding, auxiliary, embeddings |
-| Prompts | `prompts` | Custom system prompts and slash commands |
+| Prompts | `prompts` | Custom system prompts and slash prompts (reusable `/name` prompt templates) |
 | Context | `index` | RAG search tuning (similarity threshold, top-k, hybrid search) and indexing settings |
 | MCP | `mcp` | MCP server list with enable/disable and type |
 | Docs | `docs` | Documentation sources for @docs context provider |
@@ -1334,7 +1343,7 @@ The Settings screen provides full configuration access via `ConfigRouter`, match
 | Ctrl+L | **Continue** | Resume current conversation after interruption (e.g. if agent stopped mid-task). |
 | Ctrl+D | **Summarize** | Compact long conversation history to save context window space. Uses LLM to generate a summary of older messages. |
 
-You can also use slash commands for session management:
+You can also use system commands (note: these are TUI session commands, distinct from user-defined slash prompts managed in Settings → Prompts):
 - `/history` — Open session history
 - `/export <path>` — Export conversation to Markdown file
 - `/resend` — Resend last user message
@@ -1381,7 +1390,7 @@ You can also use slash commands for session management:
 |---------|--------|------------|
 | `@` | Context autocomplete | @file, @folder, @codebase, @grep, @diff, @url, @docs, @clipboard, etc. |
 | `!` | Subagent autocomplete | !review, !security, !architect, !docs, and custom subagents |
-| `/` | Slash command autocomplete | /explain, /refactor, /test, /fix, /implement, /optimize, /security-review, etc. |
+| `/` | Slash prompt autocomplete | /explain, /refactor, /test, /fix, /implement, /optimize, /security-review, etc. |
 
 When the autocomplete popup is visible:
 - **Arrow Down / Tab** — Next candidate
@@ -1390,9 +1399,9 @@ When the autocomplete popup is visible:
 - **Escape** — Dismiss
 - **Keep typing** — Filters candidates in real-time
 
-#### Slash Commands Reference
+#### Slash Prompts Reference
 
-**Prompt templates** (sent to LLM with your input as context):
+**Prompt templates** (sent to LLM with your input as context). The feature was previously called "Slash Commands"; the `/name` syntax is unchanged, only the UI/code label changed to reflect that these are prompts, not plugin/CLI commands.
 
 | Command | Description |
 |---------|-------------|
diff --git a/gradle.properties b/gradle.properties
index 34e15463..511e6605 100644
--- a/gradle.properties
+++ b/gradle.properties
@@ -1,4 +1,4 @@
-refioVersion=0.0.1.6
+refioVersion=0.0.1.7
 
 # Opt-out flag for bundling Kotlin standard library -> https://jb.gg/intellij-platform-kotlin-stdlib
 kotlin.stdlib.default.dependency = false
diff --git a/intellij-plugin/build.gradle.kts b/intellij-plugin/build.gradle.kts
index 2ff9b999..509ae297 100644
--- a/intellij-plugin/build.gradle.kts
+++ b/intellij-plugin/build.gradle.kts
@@ -158,16 +158,6 @@ tasks {
         )
     }
 
-    jacocoTestCoverageVerification {
-        violationRules {
-            rule {
-                limit {
-                    minimum = "0.40".toBigDecimal()
-                }
-            }
-        }
-    }
-
     runIde {
         jvmArgs = listOf(
             "-Didea.log.debug.categories=#pl.jclab",
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/actions/AddCodeToCurrentSessionAction.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/actions/AddCodeToCurrentSessionAction.kt
index c7f2258e..9225b09f 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/actions/AddCodeToCurrentSessionAction.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/actions/AddCodeToCurrentSessionAction.kt
@@ -5,7 +5,7 @@ import com.intellij.openapi.actionSystem.AnActionEvent
 import com.intellij.openapi.actionSystem.CommonDataKeys
 import com.intellij.openapi.wm.ToolWindowManager
 import pl.jclab.refio.api.models.CodeSnippet
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import pl.jclab.refio.ui.components.chat.PromptInputPanel
 import pl.jclab.refio.ui.toolwindow.RefioMainPanel
 import java.awt.Container
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/actions/AddCodeToNewSessionAction.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/actions/AddCodeToNewSessionAction.kt
index 80f1b0a8..3e634990 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/actions/AddCodeToNewSessionAction.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/actions/AddCodeToNewSessionAction.kt
@@ -10,7 +10,7 @@ import kotlinx.coroutines.SupervisorJob
 import kotlinx.coroutines.launch
 import pl.jclab.refio.api.models.CodeSnippet
 import pl.jclab.refio.api.models.TaskMode
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import pl.jclab.refio.services.session.SessionManager
 import pl.jclab.refio.ui.components.chat.PromptInputPanel
 import pl.jclab.refio.ui.toolwindow.RefioMainPanel
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/actions/NewSessionAction.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/actions/NewSessionAction.kt
index 5803eaa7..8c7d69ab 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/actions/NewSessionAction.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/actions/NewSessionAction.kt
@@ -4,7 +4,7 @@ import com.intellij.icons.AllIcons
 import com.intellij.openapi.actionSystem.AnAction
 import com.intellij.openapi.actionSystem.AnActionEvent
 import com.intellij.openapi.wm.ToolWindowManager
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import pl.jclab.refio.ui.toolwindow.RefioMainPanel
 import java.awt.Container
 
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/actions/RefioSlashCommandIntentionAction.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/actions/RefioSlashPromptIntentionAction.kt
similarity index 69%
rename from intellij-plugin/src/main/kotlin/pl/jclab/refio/actions/RefioSlashCommandIntentionAction.kt
rename to intellij-plugin/src/main/kotlin/pl/jclab/refio/actions/RefioSlashPromptIntentionAction.kt
index 75e6c93a..67f5d88f 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/actions/RefioSlashCommandIntentionAction.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/actions/RefioSlashPromptIntentionAction.kt
@@ -11,22 +11,22 @@ import kotlinx.coroutines.Dispatchers
 import kotlinx.coroutines.SupervisorJob
 import kotlinx.coroutines.launch
 import pl.jclab.refio.api.models.CodeSnippet
-import pl.jclab.refio.api.models.SlashCommand
+import pl.jclab.refio.api.models.SlashPrompt
 import pl.jclab.refio.api.models.TaskMode
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import pl.jclab.refio.services.session.SessionManager
 import pl.jclab.refio.ui.components.chat.PromptInputPanel
 import java.awt.Container
 import javax.swing.SwingUtilities
 
-class RefioSlashCommandIntentionAction(
-    private val command: SlashCommand? = null
+class RefioSlashPromptIntentionAction(
+    private val slashPrompt: SlashPrompt? = null
 ) : IntentionAction {
 
-    private val logger = dualLogger("RefioSlashCommandIntentionAction")
+    private val logger = dualLogger("RefioSlashPromptIntentionAction")
     private val cs = CoroutineScope(SupervisorJob() + Dispatchers.Default)
 
-    override fun getText(): String = command?.let { "Refio: /${it.name}" } ?: "Refio: Run Slash Command..."
+    override fun getText(): String = slashPrompt?.let { "Refio: /${it.name}" } ?: "Refio: Run Prompt..."
 
     override fun getFamilyName(): String = "Refio"
 
@@ -37,7 +37,7 @@ class RefioSlashCommandIntentionAction(
     override fun invoke(project: Project, editor: Editor?, file: PsiFile?) {
         val ed = editor ?: return
         val vFile = file?.virtualFile ?: return
-        val selectedCommand = command ?: chooseCommand(project) ?: return
+        val selected = slashPrompt ?: choosePrompt(project) ?: return
 
         val selectionModel = ed.selectionModel
         if (!selectionModel.hasSelection()) return
@@ -56,12 +56,12 @@ class RefioSlashCommandIntentionAction(
             language = vFile.extension
         )
 
-        val targetMode = resolveTargetMode(selectedCommand.category)
+        val targetMode = resolveTargetMode(selected.category)
 
         cs.launch {
             try {
                 val sessionManager = SessionManager.getInstance(project)
-                sessionManager.createSession("/${selectedCommand.name}", targetMode)
+                sessionManager.createSession("/${selected.name}", targetMode)
 
                 SwingUtilities.invokeLater {
                     val toolWindow = ToolWindowManager.getInstance(project).getToolWindow("Refio")
@@ -69,45 +69,45 @@ class RefioSlashCommandIntentionAction(
                         val panel = findPromptInputPanel(toolWindow.component)
                         if (panel != null) {
                             panel.addCodeSnippet(snippet)
-                            panel.sendPrompt("/${selectedCommand.name}")
+                            panel.sendPrompt("/${selected.name}")
                         } else {
                             logger.warn { "PromptInputPanel not found in Refio tool window" }
                         }
                     }
                 }
             } catch (e: Exception) {
-                logger.error(e) { "Failed to invoke /${selectedCommand.name} intention" }
+                logger.error(e) { "Failed to invoke /${selected.name} intention" }
             }
         }
     }
 
     override fun startInWriteAction(): Boolean = false
 
-    private fun chooseCommand(project: Project): SlashCommand? {
-        val commands = SlashCommand.BUILTINS.filter { it.showInEditor }
-        if (commands.isEmpty()) {
-            logger.warn { "No slash commands available for editor intention" }
+    private fun choosePrompt(project: Project): SlashPrompt? {
+        val available = SlashPrompt.BUILTINS.filter { it.showInEditor }
+        if (available.isEmpty()) {
+            logger.warn { "No slash prompts available for editor intention" }
             return null
         }
-        if (commands.size == 1) {
-            return commands.first()
+        if (available.size == 1) {
+            return available.first()
         }
 
-        val options = commands.map { "/${it.name} - ${it.description}" }.toTypedArray()
-        val selected = Messages.showChooseDialog(
+        val options = available.map { "/${it.name} - ${it.description}" }.toTypedArray()
+        val selected = Messages.showDialog(
             project,
-            "Select a Refio slash command to run for the current selection.",
-            "Refio Slash Commands",
-            Messages.getQuestionIcon(),
+            "Select a Refio prompt to run for the current selection.",
+            "Refio Prompts",
             options,
-            options.first()
+            0,
+            Messages.getQuestionIcon()
         )
 
-        if (selected == -1) {
+        if (selected < 0) {
             return null
         }
 
-        return commands[selected]
+        return available[selected]
     }
 
     private fun resolveTargetMode(category: String): TaskMode {
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/actions/ReindexRagAction.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/actions/ReindexRagAction.kt
index cd635289..2b0b4817 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/actions/ReindexRagAction.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/actions/ReindexRagAction.kt
@@ -4,7 +4,7 @@ import com.intellij.openapi.actionSystem.AnAction
 import com.intellij.openapi.actionSystem.AnActionEvent
 import com.intellij.openapi.ui.Messages
 import pl.jclab.refio.services.core.CoreConnectionManager
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import pl.jclab.refio.services.rag.BackgroundIndexingTask
 
 class ReindexRagAction : AnAction(
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/actions/ShowHelpAction.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/actions/ShowHelpAction.kt
index 151e979b..5a3378f7 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/actions/ShowHelpAction.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/actions/ShowHelpAction.kt
@@ -4,7 +4,7 @@ import com.intellij.icons.AllIcons
 import com.intellij.openapi.actionSystem.AnAction
 import com.intellij.openapi.actionSystem.AnActionEvent
 import com.intellij.openapi.wm.ToolWindowManager
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import pl.jclab.refio.ui.toolwindow.RefioMainPanel
 import java.awt.Container
 
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/actions/ShowHistoryAction.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/actions/ShowHistoryAction.kt
index 4cbab7ef..9cc8ffbb 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/actions/ShowHistoryAction.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/actions/ShowHistoryAction.kt
@@ -4,7 +4,7 @@ import com.intellij.icons.AllIcons
 import com.intellij.openapi.actionSystem.AnAction
 import com.intellij.openapi.actionSystem.AnActionEvent
 import com.intellij.openapi.wm.ToolWindowManager
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import pl.jclab.refio.ui.toolwindow.RefioMainPanel
 import java.awt.Container
 
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/actions/ShowSettingsAction.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/actions/ShowSettingsAction.kt
index 46af329f..9d4509ee 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/actions/ShowSettingsAction.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/actions/ShowSettingsAction.kt
@@ -4,7 +4,7 @@ import com.intellij.icons.AllIcons
 import com.intellij.openapi.actionSystem.AnAction
 import com.intellij.openapi.actionSystem.AnActionEvent
 import com.intellij.openapi.wm.ToolWindowManager
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import pl.jclab.refio.ui.toolwindow.RefioMainPanel
 import java.awt.Container
 
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/actions/ToolWindowAction.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/actions/ToolWindowAction.kt
index a04c5631..e9294b2a 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/actions/ToolWindowAction.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/actions/ToolWindowAction.kt
@@ -4,7 +4,7 @@ import com.intellij.openapi.actionSystem.AnAction
 import com.intellij.openapi.actionSystem.AnActionEvent
 import com.intellij.openapi.actionSystem.ActionUpdateThread
 import com.intellij.openapi.wm.ToolWindowManager
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import pl.jclab.refio.ui.toolwindow.RefioMainPanel
 import java.awt.Container
 
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/api/CoreApiClient.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/api/CoreApiClient.kt
deleted file mode 100644
index 9975a947..00000000
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/api/CoreApiClient.kt
+++ /dev/null
@@ -1,377 +0,0 @@
-package pl.jclab.refio.api
-
-import pl.jclab.refio.core.api.*
-import pl.jclab.refio.core.config.ModelPresetConfig
-import pl.jclab.refio.core.db.ConfigScope
-import pl.jclab.refio.core.models.api.ChatRequest
-import pl.jclab.refio.core.models.api.ChatResponse
-import pl.jclab.refio.core.models.api.SetToolPermissionRequest
-import pl.jclab.refio.core.tools.security.CommandWhitelistConfig
-import pl.jclab.refio.services.logging.dualLogger
-
-private val logger = dualLogger("CoreApiClient")
-
-/**
- * CoreApiClient - Thin wrapper delegating to domain routers.
- *
- * Plugin → CoreApiClient → Domain Routers → Services → Database
- *
- * IMPORTANT: Plugin code should ALWAYS use CoreApiClient, NEVER call
- * services directly.
- */
-class CoreApiClient(internal val router: CoreApiRouter) {
-
-    // ========================================================================
-    // Task Management (via taskRouter)
-    // ========================================================================
-
-    fun createTask(request: CreateTaskRequest): TaskResponse {
-        logger.info { "[CoreApiClient] Creating task" }
-        return router.taskRouter.createTask(request)
-    }
-
-    fun listTasks(): ListTasksResponse {
-        logger.info { "[CoreApiClient] Listing tasks" }
-        return router.taskRouter.listTasks()
-    }
-
-    fun getTasksForProject(projectId: String): List<TaskResponse> {
-        logger.info { "[CoreApiClient] Listing tasks for project $projectId" }
-        return router.taskRouter.getTasksForProject(projectId)
-    }
-
-    fun getLastSessionForProject(projectId: String): TaskResponse? {
-        logger.info { "[CoreApiClient] Getting last session for project $projectId" }
-        return router.taskRouter.getLastSessionForProject(projectId)
-    }
-
-    fun getTask(taskId: String): TaskResponse? {
-        logger.info { "[CoreApiClient] Getting task: $taskId" }
-        return router.taskRouter.getTask(taskId)
-    }
-
-    fun updateTask(taskId: String, request: UpdateTaskRequest): TaskResponse {
-        logger.info { "[CoreApiClient] Updating task: $taskId" }
-        return router.taskRouter.updateTask(taskId, request)
-    }
-
-    fun deleteTask(taskId: String): Boolean {
-        logger.info { "[CoreApiClient] Deleting task: $taskId" }
-        return router.taskRouter.deleteTask(taskId)
-    }
-
-    // ========================================================================
-    // Messages (via chatRouter)
-    // ========================================================================
-
-    fun getMessages(taskId: String): GetMessagesResponse {
-        logger.info { "[CoreApiClient] Getting messages for task: $taskId" }
-        return router.chatRouter.getMessages(taskId)
-    }
-
-    // ========================================================================
-    // Subtasks (via subtaskRouter)
-    // ========================================================================
-
-    fun getSubtasks(taskId: String): GetSubtasksResponse {
-        logger.info { "[CoreApiClient] Getting subtasks for task: $taskId" }
-        return router.subtaskRouter.getSubtasks(taskId)
-    }
-
-    fun getSubtask(taskId: String, subtaskId: String): SubtaskResponse {
-        logger.info { "[CoreApiClient] Getting subtask: task=$taskId, subtask=$subtaskId" }
-        return router.subtaskRouter.getSubtask(taskId, subtaskId)
-    }
-
-    fun updateSubtask(taskId: String, subtaskId: String, request: UpdateSubtaskRequest): SubtaskResponse {
-        logger.info { "[CoreApiClient] Updating subtask: task=$taskId, subtask=$subtaskId" }
-        return router.subtaskRouter.updateSubtask(taskId, subtaskId, request)
-    }
-
-    fun approveSubtask(taskId: String, subtaskId: String): SubtaskResponse {
-        logger.info { "[CoreApiClient] Approving subtask: task=$taskId, subtask=$subtaskId" }
-        return router.subtaskRouter.approveSubtask(taskId, subtaskId)
-    }
-
-    fun rejectSubtask(taskId: String, subtaskId: String): SubtaskResponse {
-        logger.info { "[CoreApiClient] Rejecting subtask: task=$taskId, subtask=$subtaskId" }
-        return router.subtaskRouter.rejectSubtask(taskId, subtaskId)
-    }
-
-    fun deletePendingSubtasks(taskId: String): DeleteSubtasksResponse {
-        logger.info { "[CoreApiClient] Deleting pending subtasks: task=$taskId" }
-        val result = router.subtaskRouter.deletePendingSubtasks(taskId)
-        return DeleteSubtasksResponse(
-            deletedCount = result.deletedCount,
-            message = "Successfully deleted ${result.deletedCount} pending/planned subtasks"
-        )
-    }
-
-    // ========================================================================
-    // Step Workflow (via agentRouter)
-    // ========================================================================
-
-    suspend fun prepareStep(taskId: String, subtaskId: String): PlanStepResponse {
-        logger.info { "[CoreApiClient] Preparing step: task=$taskId, subtask=$subtaskId" }
-        return router.agentRouter.planSubtaskStep(taskId, subtaskId)
-    }
-
-    suspend fun executeStep(taskId: String, subtaskId: String): ExecuteStepResponse {
-        logger.info { "[CoreApiClient] Executing step: task=$taskId, subtask=$subtaskId" }
-        return router.agentRouter.executeSubtaskStep(taskId, subtaskId)
-    }
-
-    // ========================================================================
-    // Chat (via chatRouter)
-    // ========================================================================
-
-    suspend fun chat(
-        request: ChatRequest,
-        stream: Boolean = false,
-        onChunk: StreamCallback? = null
-    ): ChatResponse {
-        logger.info { "[CoreApiClient] Sending chat: taskId=${request.taskId}, stream=$stream" }
-        return router.chatRouter.chat(request, stream, onChunk)
-    }
-
-    // ========================================================================
-    // Health & Models (via taskRouter / configRouter)
-    // ========================================================================
-
-    fun health(): HealthResponse {
-        return router.taskRouter.health()
-    }
-
-    suspend fun getModels(provider: String? = null): GetModelsResponse {
-        return router.configRouter.getModels(provider)
-    }
-
-    suspend fun getModelsWithVisibility(provider: String? = null): List<ModelInfo> {
-        logger.info { "[CoreApiClient] Getting models with visibility" }
-        return router.configRouter.getModelsWithVisibility(provider)
-    }
-
-    suspend fun getDefaultModel(operation: ModelOperation, taskId: String? = null): GetDefaultModelResponse {
-        return router.configRouter.getDefaultModel(operation, taskId)
-    }
-
-    suspend fun setDefaultModel(request: SetDefaultModelRequest, taskId: String? = null): SetDefaultModelResponse {
-        return router.configRouter.setDefaultModel(request, taskId)
-    }
-
-    // ========================================================================
-    // Prompts Management (via promptsRouter)
-    // ========================================================================
-
-    fun getSystemPrompt(request: GetSystemPromptRequest): SystemPromptResponse {
-        logger.info { "[CoreApiClient] Getting system prompt: type=${request.type}" }
-        return router.promptsRouter.getSystemPrompt(request)
-    }
-
-    fun getPromptsByType(type: pl.jclab.refio.core.db.PromptType): PromptsListResponse {
-        logger.info { "[CoreApiClient] Getting prompts by type: $type" }
-        return router.promptsRouter.getPromptsByType(type)
-    }
-
-    fun getSystemPrompts(): PromptsListResponse {
-        return router.promptsRouter.getSystemPrompts()
-    }
-
-    fun getEnabledRules(): PromptsListResponse {
-        return router.promptsRouter.getEnabledRules()
-    }
-
-    fun getEnabledCommands(): PromptsListResponse {
-        return router.promptsRouter.getEnabledCommands()
-    }
-
-    fun findCommand(commandName: String): PromptResponse? {
-        return router.promptsRouter.findCommand(commandName)
-    }
-
-    fun saveRule(request: SaveRuleRequest): PromptResponse {
-        logger.info { "[CoreApiClient] Saving rule: ${request.name}" }
-        return router.promptsRouter.saveRule(request)
-    }
-
-    fun saveCommand(request: SaveCommandRequest): PromptResponse {
-        logger.info { "[CoreApiClient] Saving command: ${request.name}" }
-        return router.promptsRouter.saveCommand(request)
-    }
-
-    fun updateSystemPrompt(request: UpdateSystemPromptRequest): PromptResponse? {
-        logger.info { "[CoreApiClient] Updating system prompt: ${request.type}" }
-        return router.promptsRouter.updateSystemPrompt(request)
-    }
-
-    fun resetSystemPromptToDefault(type: pl.jclab.refio.core.db.PromptType): PromptResponse? {
-        return router.promptsRouter.resetSystemPromptToDefault(type)
-    }
-
-    fun deletePrompt(id: String): DeletePromptResponse {
-        return router.promptsRouter.deletePrompt(id)
-    }
-
-    fun getPromptById(id: String): PromptResponse? {
-        return router.promptsRouter.getPromptById(id)
-    }
-
-    fun getDefaultSystemPromptContent(type: pl.jclab.refio.core.db.PromptType): String {
-        return router.promptsRouter.getDefaultSystemPromptContent(type)
-    }
-
-    // ========================================================================
-    // Configuration Management (via configRouter)
-    // ========================================================================
-
-    fun updateConfig(section: String, scope: String, taskId: String?, settings: Map<String, Any>): UpdateConfigResponse {
-        logger.info { "[CoreApiClient] Updating config: section=$section, scope=$scope" }
-        return router.configRouter.updateConfig(section, scope, taskId, settings)
-    }
-
-    fun resetAllSettingsToDefaults(): ResetConfigResponse {
-        logger.info { "[CoreApiClient] Resetting all settings to defaults" }
-        return router.configRouter.resetAllSettingsToDefaults()
-    }
-
-    fun getConfig(section: String, scope: String): GetConfigResponse {
-        return router.configRouter.getConfig(section, scope)
-    }
-
-    fun getConfigValue(section: String, key: String): String? {
-        return try {
-            val fullKey = "$section.$key"
-            router.configService.get(fullKey, ConfigScope.APP, null)
-        } catch (e: Exception) {
-            logger.error(e) { "Failed to get config: $section.$key" }
-            null
-        }
-    }
-
-    fun setConfigValue(section: String, key: String, value: String) {
-        try {
-            val fullKey = "$section.$key"
-            router.configService.set(fullKey, value, ConfigScope.APP, null)
-        } catch (e: Exception) {
-            logger.error(e) { "Failed to set config: $section.$key" }
-            throw e
-        }
-    }
-
-    fun getYamlModelPresets(): List<ModelPresetConfig> {
-        return router.configService.getYamlConfig().models?.presets ?: emptyList()
-    }
-
-    // ========================================================================
-    // Provider Management (via configRouter)
-    // ========================================================================
-
-    suspend fun testProviderConnection(provider: String, config: Map<String, String>): TestConnectionResult {
-        logger.info { "[CoreApiClient] Testing connection to provider: $provider" }
-        return router.configRouter.testProviderConnection(provider, config)
-    }
-
-    suspend fun refreshProviderModels(provider: String): List<ModelInfo> {
-        return router.configRouter.refreshProviderModels(provider)
-    }
-
-    suspend fun refreshAllModels(): List<ModelInfo> {
-        return router.configRouter.refreshAllModels()
-    }
-
-    suspend fun updateModelVisibility(modelId: String, showInDropdown: Boolean) {
-        router.configRouter.updateModelVisibility(modelId, showInDropdown)
-    }
-
-    suspend fun updateModelsVisibility(visibilityMap: Map<String, Boolean>) {
-        router.configRouter.updateModelsVisibility(visibilityMap)
-    }
-
-    // ========================================================================
-    // Tool Permissions (via toolRouter)
-    // ========================================================================
-
-    suspend fun getToolPermissions(taskId: String? = null): Map<String, Pair<String, String>> {
-        val response = router.toolRouter.getToolPermissions(taskId)
-        return response.tools.associate { tool ->
-            tool.toolName to (tool.planMode to tool.agentMode)
-        }
-    }
-
-    suspend fun getAvailableToolDefinitions(): List<ToolDefinitionInfo> {
-        return router.toolRouter.getAvailableToolDefinitions()
-    }
-
-    suspend fun setToolPermission(toolName: String, planMode: String, agentMode: String, taskId: String? = null) {
-        val request = SetToolPermissionRequest(planMode = planMode, agentMode = agentMode)
-        router.toolRouter.setToolPermission(toolName, request, taskId)
-    }
-
-    suspend fun resetToolPermissions(taskId: String? = null) {
-        router.toolRouter.resetToolPermissions(taskId)
-    }
-
-    fun getTerminalWhitelistConfig(): CommandWhitelistConfig {
-        return router.configService.getTerminalWhitelistConfig()
-    }
-
-    fun setTerminalWhitelistConfig(config: CommandWhitelistConfig, scope: String = "app") {
-        val configScope = when (scope.lowercase()) {
-            "project" -> ConfigScope.PROJECT
-            else -> ConfigScope.APP
-        }
-        router.configService.setTerminalWhitelistConfig(config, configScope)
-    }
-
-    // ========================================================================
-    // Subagents (via subagentRouter)
-    // ========================================================================
-
-    fun listSubagents(includeDisabled: Boolean = false): List<pl.jclab.refio.core.subagents.models.SubagentInfo> {
-        return router.subagentRouter?.listSubagents(includeDisabled) ?: emptyList()
-    }
-
-    fun getSubagent(name: String): pl.jclab.refio.core.subagents.models.SubagentDefinition? {
-        return router.subagentRouter?.getSubagent(name)
-    }
-
-    fun createSubagent(
-        name: String,
-        description: String,
-        systemPrompt: String,
-        allowedTools: List<String>? = null,
-        model: String = "default",
-        scope: pl.jclab.refio.core.subagents.models.SubagentScope = pl.jclab.refio.core.subagents.models.SubagentScope.PROJECT,
-        enabled: Boolean = true,
-        priority: Int = 0
-    ): pl.jclab.refio.core.subagents.models.SubagentDefinition {
-        return router.subagentRouter?.createSubagent(
-            name = name, description = description, systemPrompt = systemPrompt,
-            allowedTools = allowedTools, model = model, scope = scope,
-            enabled = enabled, priority = priority
-        ) ?: throw IllegalStateException("SubagentRouter not available")
-    }
-
-    fun updateSubagent(
-        name: String,
-        description: String? = null,
-        systemPrompt: String? = null,
-        allowedTools: List<String>? = null,
-        model: String? = null,
-        enabled: Boolean? = null,
-        priority: Int? = null
-    ): pl.jclab.refio.core.subagents.models.SubagentDefinition {
-        return router.subagentRouter?.updateSubagent(
-            name = name, description = description, systemPrompt = systemPrompt,
-            allowedTools = allowedTools, model = model, enabled = enabled, priority = priority
-        ) ?: throw IllegalStateException("SubagentRouter not available")
-    }
-
-    fun deleteSubagent(name: String): Boolean {
-        return router.subagentRouter?.deleteSubagent(name) ?: false
-    }
-
-    fun refreshSubagents() {
-        router.subagentRouter?.refresh()
-    }
-}
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/core/context/providers/GrepSearchContextProvider.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/core/context/providers/GrepSearchContextProvider.kt
index 8a310cfe..19fb8326 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/core/context/providers/GrepSearchContextProvider.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/core/context/providers/GrepSearchContextProvider.kt
@@ -1,6 +1,5 @@
 package pl.jclab.refio.core.context.providers
 
-import com.intellij.openapi.application.ReadAction
 import com.intellij.openapi.project.Project
 import com.intellij.psi.search.GlobalSearchScope
 import pl.jclab.refio.core.config.ConfigKeys
@@ -97,7 +96,7 @@ class GrepSearchContextProvider : BaseContextProvider() {
         val results = mutableListOf<SearchResult>()
 
         try {
-            ReadAction.run<RuntimeException> {
+            com.intellij.openapi.application.runReadAction {
                 logger.debug { "Executing grep search for pattern: $pattern" }
 
                 // Use simplified search approach
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/core/services/ProjectStartupActivity.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/core/services/ProjectStartupActivity.kt
index ea19d2f9..5bfa0b12 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/core/services/ProjectStartupActivity.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/core/services/ProjectStartupActivity.kt
@@ -49,8 +49,7 @@ class ProjectStartupActivity : ProjectActivity {
 
                 // 1. Analyze project structure (will be cached for 10 minutes)
                 logger.info { "Analyzing project structure in background..." }
-                val analyzerService = projectRouter.getProjectAnalyzerService()
-                val analysis = analyzerService?.analyzeProject(
+                val analysis = projectRouter.projectAnalyzer?.analyzeProject(
                     projectRoot = projectRoot,
                     includeContent = false // Don't include full file content for startup analysis
                 )
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/core/CoreConnectionManager.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/core/CoreConnectionManager.kt
index a8cc8d15..736d14bc 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/core/CoreConnectionManager.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/core/CoreConnectionManager.kt
@@ -6,7 +6,7 @@ import kotlinx.coroutines.flow.MutableStateFlow
 import kotlinx.coroutines.flow.StateFlow
 import kotlinx.coroutines.flow.asStateFlow
 
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import pl.jclab.refio.core.api.CoreApiRouter
 import pl.jclab.refio.core.tools.base.ToolRegistry
 import pl.jclab.refio.core.tools.base.ToolFactory
@@ -88,8 +88,7 @@ class CoreConnectionManager {
             ContextProviderRegistry.providerFactory = { isIdeEnvironment ->
                 val terminalAvailable = try {
                     val pluginId = com.intellij.openapi.extensions.PluginId.getId("com.intellij.terminal")
-                    val plugin = com.intellij.ide.plugins.PluginManagerCore.getPlugin(pluginId)
-                    plugin?.isEnabled == true
+                    com.intellij.ide.plugins.PluginManagerCore.isPluginInstalled(pluginId)
                 } catch (_: Exception) { false }
 
                 buildList {
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/execution/StepExecutionService.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/execution/StepExecutionService.kt
index 2ba0ced2..b6bf00f3 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/execution/StepExecutionService.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/execution/StepExecutionService.kt
@@ -3,8 +3,9 @@ package pl.jclab.refio.services.execution
 import com.intellij.openapi.components.Service
 import com.intellij.openapi.project.Project
 import pl.jclab.refio.api.models.ExecutionMode
+import pl.jclab.refio.core.session.ExecutionStateController
 import pl.jclab.refio.core.services.monitoring.GlobalMetrics
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import kotlinx.coroutines.*
 import kotlinx.coroutines.flow.MutableStateFlow
 import kotlinx.coroutines.flow.StateFlow
@@ -21,7 +22,7 @@ import kotlinx.coroutines.flow.asStateFlow
  * - Handles execution completion/cancellation
  */
 @Service(Service.Level.PROJECT)
-class StepExecutionService(private val project: Project) {
+class StepExecutionService(private val project: Project) : ExecutionStateController {
 
     private val logger = dualLogger("StepExecutionService")
 
@@ -42,7 +43,7 @@ class StepExecutionService(private val project: Project) {
     /**
      * Start interactive execution - no polling, UI updates via listener.
      */
-    fun startInteractiveExecution(taskId: String) {
+    override fun startInteractiveExecution(taskId: String) {
         logger.info { "Starting INTERACTIVE mode for task: $taskId" }
         _isExecuting.value = true
         // UI updates handled by ExecutionEventListener in SessionManager
@@ -51,7 +52,7 @@ class StepExecutionService(private val project: Project) {
     /**
      * Stop execution
      */
-    fun stopExecution() {
+    override fun stopExecution() {
         logger.info { "Stopping execution" }
         _isExecuting.value = false
         executionJob?.cancel()
@@ -70,7 +71,7 @@ class StepExecutionService(private val project: Project) {
     /**
      * Mark execution as complete (called by UI listener).
      */
-    fun markComplete() {
+    override fun markComplete() {
         _isExecuting.value = false
         GlobalMetrics.clearCurrentOperation()
         logger.info { "[EXECUTION] Marked execution complete" }
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/logging/DualLogger.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/logging/DualLogger.kt
deleted file mode 100644
index 1750732f..00000000
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/logging/DualLogger.kt
+++ /dev/null
@@ -1,38 +0,0 @@
-@file:Suppress("unused")
-
-package pl.jclab.refio.services.logging
-
-import mu.KotlinLogging
-
-/**
- * Backward-compatibility re-exports from [pl.jclab.refio.core.logging].
- *
- * DualLogger has been moved to [pl.jclab.refio.core.logging.DualLogger]
- * to break the compile-time dependency on IntelliJ APIs in the core module.
- *
- * These typealiases and functions ensure that existing code outside core/
- * (UI, services, actions) continues to compile without import changes.
- */
-typealias DualLogger = pl.jclab.refio.core.logging.DualLogger
-
-/**
- * Re-export dualLogger() factory for backward compatibility.
- *
- * New code should import from [pl.jclab.refio.core.logging.dualLogger].
- */
-inline fun <reified T : Any> T.dualLogger(): pl.jclab.refio.core.logging.DualLogger {
-    val componentName = T::class.simpleName ?: "Unknown"
-    return pl.jclab.refio.core.logging.DualLogger(
-        kotlinLogger = KotlinLogging.logger(T::class.java.name),
-        component = componentName
-    )
-}
-
-/**
- * Re-export dualLogger(component) factory for backward compatibility.
- *
- * New code should import from [pl.jclab.refio.core.logging.dualLogger].
- */
-fun dualLogger(component: String): pl.jclab.refio.core.logging.DualLogger {
-    return pl.jclab.refio.core.logging.dualLogger(component)
-}
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/notification/NotificationService.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/notification/NotificationService.kt
index 0e41f0f6..79a55573 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/notification/NotificationService.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/notification/NotificationService.kt
@@ -4,7 +4,7 @@ import com.intellij.notification.Notification
 import com.intellij.notification.NotificationGroupManager
 import com.intellij.notification.NotificationType
 import com.intellij.openapi.project.Project
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 
 private val logger = dualLogger("NotificationService")
 
@@ -50,25 +50,6 @@ object NotificationService {
         showNotification(project, title, content, NotificationType.ERROR)
     }
 
-    /**
-     * Show RAG service unavailable notification
-     * (only shown once per circuit breaker open event)
-     */
-    fun showRagUnavailable(project: Project?, providerType: String, endpoint: String) {
-        val title = "RAG Search Unavailable"
-        val content = when (providerType) {
-            "ollama" -> "Cannot connect to Ollama at $endpoint. RAG search is disabled. " +
-                    "Make sure Ollama is running and has the embedding model loaded."
-            "openai" -> "Cannot connect to OpenAI embedding API. RAG search is disabled. " +
-                    "Check your API key and network connection."
-            else -> "Cannot connect to embedding provider ($providerType). RAG search is disabled."
-        }
-
-        showWarning(project, title, content)
-
-        logger.info { "Showed RAG unavailable notification: $providerType at $endpoint" }
-    }
-
     private fun showNotification(
         project: Project?,
         title: String,
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/project/IntelliJVfsRefresher.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/project/IntelliJVfsRefresher.kt
new file mode 100644
index 00000000..c8c2e31b
--- /dev/null
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/project/IntelliJVfsRefresher.kt
@@ -0,0 +1,18 @@
+package pl.jclab.refio.services.project
+
+import com.intellij.openapi.project.Project
+import pl.jclab.refio.core.session.VfsRefresher
+import pl.jclab.refio.core.logging.dualLogger
+
+/**
+ * Plugin-side implementacja [VfsRefresher] — używa [SafeVfsAccess] do odświeżenia
+ * IntelliJ VFS po edycjach plików przez Core tools.
+ */
+internal class IntelliJVfsRefresher(private val project: Project) : VfsRefresher {
+
+    private val logger = dualLogger("IntelliJVfsRefresher")
+
+    override fun refreshProjectRoot() {
+        SafeVfsAccess.refreshProjectRoot(project, logger)
+    }
+}
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/project/SafeVfsAccess.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/project/SafeVfsAccess.kt
new file mode 100644
index 00000000..67820118
--- /dev/null
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/project/SafeVfsAccess.kt
@@ -0,0 +1,56 @@
+package pl.jclab.refio.services.project
+
+import com.intellij.openapi.application.ApplicationManager
+import com.intellij.openapi.project.Project
+import com.intellij.openapi.project.guessProjectDir
+import com.intellij.openapi.vfs.VirtualFile
+import com.intellij.openapi.vfs.VirtualFileManager
+import pl.jclab.refio.core.logging.DualLogger
+import java.nio.file.Path
+
+internal object SafeVfsAccess {
+    fun refreshProjectRoot(project: Project, logger: DualLogger) {
+        if (!canRefresh(project)) return
+
+        try {
+            project.guessProjectDir()?.refresh(true, true)
+        } catch (e: NullPointerException) {
+            if (isShutdownRefreshRace(project, e)) {
+                logger.debug { "Skipping project VFS refresh during application shutdown" }
+                return
+            }
+            throw e
+        }
+    }
+
+    fun refreshAndFindFile(project: Project, path: Path, logger: DualLogger): VirtualFile? {
+        if (!canRefresh(project)) return null
+
+        return try {
+            VirtualFileManager.getInstance().refreshAndFindFileByNioPath(path)
+        } catch (e: NullPointerException) {
+            if (isShutdownRefreshRace(project, e)) {
+                logger.debug { "Skipping VFS refresh for $path during application shutdown" }
+                null
+            } else {
+                throw e
+            }
+        }
+    }
+
+    private fun canRefresh(project: Project): Boolean {
+        val app = ApplicationManager.getApplication()
+        return !project.isDisposed && !app.isDisposed
+    }
+
+    private fun isShutdownRefreshRace(project: Project, error: NullPointerException): Boolean {
+        val app = ApplicationManager.getApplication()
+        val message = error.message.orEmpty()
+        return project.isDisposed ||
+            app.isDisposed ||
+            (
+                message.contains("LaterInvocator.ourNonBlockingEdtQueue") &&
+                    message.contains("is null")
+                )
+    }
+}
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/rag/BackgroundIndexingTask.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/rag/BackgroundIndexingTask.kt
index d7949604..14448cbd 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/rag/BackgroundIndexingTask.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/rag/BackgroundIndexingTask.kt
@@ -5,7 +5,7 @@ import com.intellij.openapi.progress.ProcessCanceledException
 import com.intellij.openapi.progress.Task
 import com.intellij.openapi.project.Project
 import pl.jclab.refio.services.core.CoreConnectionManager
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import pl.jclab.refio.services.notification.NotificationService
 import kotlinx.coroutines.CancellationException
 import kotlinx.coroutines.Dispatchers
@@ -37,6 +37,10 @@ class BackgroundIndexingTask(
         indicator.text2 = ""
         indicator.fraction = 0.0
 
+        // runBlocking is intentional: Task.Backgroundable.run() is synchronous in the IntelliJ 241+ API
+        // we target; ProgressManager blocks the caller on this method until it returns. Cancellation is
+        // propagated through job?.cancel() in onCancel() and through indicator.isCanceled checks inside
+        // the coroutine body.
         runBlocking {
             job = launch(Dispatchers.IO) {
                 runIndexingStage(router, indicator)
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/session/IntelliJUIAdapter.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/session/IntelliJUIAdapter.kt
index 43a5c64c..4db2b2a8 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/session/IntelliJUIAdapter.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/session/IntelliJUIAdapter.kt
@@ -2,7 +2,7 @@ package pl.jclab.refio.services.session
 
 import com.intellij.openapi.project.Project
 import pl.jclab.refio.core.api.UIAdapter
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import pl.jclab.refio.services.notification.NotificationService
 import java.util.concurrent.CompletableFuture
 
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/session/SessionManager.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/session/SessionManager.kt
index 28e6aa2a..a0c57b4a 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/session/SessionManager.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/session/SessionManager.kt
@@ -2,31 +2,28 @@ package pl.jclab.refio.services.session
 
 import com.intellij.openapi.components.Service
 import com.intellij.openapi.project.Project
-import io.ktor.util.reflect.*
+import pl.jclab.refio.core.session.SessionStateManager
+import pl.jclab.refio.core.session.MessageDispatcher
+import pl.jclab.refio.core.session.SubtaskTracker
+import pl.jclab.refio.core.session.SessionLifecycleService
+import pl.jclab.refio.core.session.ExecutionMonitor
+import pl.jclab.refio.core.session.PromptStateTracker
 import pl.jclab.refio.api.models.*
+import pl.jclab.refio.core.api.SubtaskResponse
 import pl.jclab.refio.core.api.UIAdapter
 import pl.jclab.refio.core.utils.ProjectIdGenerator
 import pl.jclab.refio.services.core.CoreConnectionManager
 import pl.jclab.refio.services.execution.StepExecutionService
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import pl.jclab.refio.core.services.monitoring.GlobalMetrics
 import pl.jclab.refio.core.services.monitoring.OperationInfo
 import pl.jclab.refio.ui.components.toolbar.StatusBar
-import pl.jclab.refio.core.workflow.models.IntentResult
-import pl.jclab.refio.core.workflow.models.UIState
-import pl.jclab.refio.core.workflow.models.WorkflowRequest
-import pl.jclab.refio.ui.listeners.SwingWorkflowListener
 import kotlinx.coroutines.*
 import kotlinx.coroutines.flow.StateFlow
 import kotlinx.coroutines.sync.Mutex
 import kotlinx.coroutines.sync.withLock
 import pl.jclab.refio.core.db.MessageRole
-import pl.jclab.refio.core.db.repositories.ChatMessageRepository
-import java.util.concurrent.atomic.AtomicBoolean
-import java.util.concurrent.ConcurrentHashMap
-import java.util.concurrent.atomic.AtomicReference
 import java.util.UUID
-import kotlin.reflect.typeOf
 
 @Service(Service.Level.PROJECT)
 class SessionManager(private val project: Project) {
@@ -67,15 +64,10 @@ class SessionManager(private val project: Project) {
     private val cs = CoroutineScope(SupervisorJob() + Dispatchers.IO + exceptionHandler)
 
     private val stateManager = SessionStateManager()
-    private val chatMessageRepository = ChatMessageRepository()
-
-    // PHASE 2 FIX: Track tool call message IDs for real-time streaming updates
-    // Maps toolCallId -> temporary message ID in stateManager
-    private val toolCallMessageIds = ConcurrentHashMap<String, String>()
     val activeSession: StateFlow<Session?> = stateManager.activeSession
     val sessions: StateFlow<List<Session>> = stateManager.sessions
     val messages: StateFlow<List<Message>> = stateManager.messages
-    val subtasks: StateFlow<List<SubtaskDto>> = stateManager.subtasks
+    val subtasks: StateFlow<List<SubtaskResponse>> = stateManager.subtasks
     val activePlan: StateFlow<pl.jclab.refio.core.api.PlanResponse?> = stateManager.activePlan
     val planSteps: StateFlow<List<pl.jclab.refio.core.api.PlanSpecStepResponse>> = stateManager.planSteps
     val selectedModel: StateFlow<String> = stateManager.selectedModel
@@ -142,6 +134,7 @@ class SessionManager(private val project: Project) {
     private lateinit var subtaskTracker: SubtaskTracker
     private lateinit var executionMonitor: ExecutionMonitor
     private lateinit var promptStateTracker: PromptStateTracker
+    private lateinit var coreSessionService: pl.jclab.refio.core.session.CoreSessionService
 
     // Selected mode (persisted in config, used for creating default session)
     private val modeSwitchMutex = Mutex()
@@ -176,10 +169,6 @@ class SessionManager(private val project: Project) {
         coreManager.getOrCreateProjectRouter(projectRoot, project)
     }
 
-    private val coreApiClient: pl.jclab.refio.api.CoreApiClient by lazy {
-        pl.jclab.refio.api.CoreApiClient(projectRouter)
-    }
-
     private val configService: pl.jclab.refio.core.services.ConfigService
         get() = projectRouter.configService
 
@@ -240,7 +229,6 @@ class SessionManager(private val project: Project) {
 
     private fun initializeServices() {
         executionMonitor = ExecutionMonitor(
-            project = project,
             projectRouter = projectRouter,
             stateManager = stateManager,
             stepExecutionService = stepExecutionService,
@@ -251,10 +239,9 @@ class SessionManager(private val project: Project) {
         )
 
         subtaskTracker = SubtaskTracker(
-            project = project,
             projectRouter = projectRouter,
-            coreApiClient = coreApiClient,
             stateManager = stateManager,
+            vfsRefresher = pl.jclab.refio.services.project.IntelliJVfsRefresher(project),
             loadMessages = { messageDispatcher.loadMessages() },
             executeCurrentStep = { subtaskId -> executionMonitor.executeCurrentStep(subtaskId) },
             showApprovalMessageForNextSubtask = { executionMonitor.showApprovalMessageForNextSubtask() }
@@ -268,9 +255,7 @@ class SessionManager(private val project: Project) {
         promptStateTracker = PromptStateTracker(stateManager)
 
         lifecycleService = SessionLifecycleService(
-            project = project,
             projectRouter = projectRouter,
-            coreApiClient = coreApiClient,
             configService = configService,
             stateManager = stateManager,
             modeSwitchMutex = modeSwitchMutex,
@@ -281,6 +266,16 @@ class SessionManager(private val project: Project) {
 
         lifecycleService.initialize(messageDispatcher, subtaskTracker, executionMonitor)
 
+        coreSessionService = pl.jclab.refio.core.session.CoreSessionService(
+            projectRouter = projectRouter,
+            stateManager = stateManager,
+            subtaskTracker = subtaskTracker,
+            messageDispatcher = messageDispatcher,
+            lifecycleService = lifecycleService,
+            uiAdapter = uiAdapter,
+            scope = cs,
+            modeSwitchMutex = modeSwitchMutex,
+        )
     }
 
     // ========================================================================
@@ -348,20 +343,7 @@ class SessionManager(private val project: Project) {
         contextRefs: List<ContextReference> = emptyList(),
         model: String? = null,
         provider: String? = null
-    ): Message {
-        GlobalMetrics.resetCancellation()
-
-        val currentSession = modeSwitchMutex.withLock {
-            lifecycleService.ensureActiveSessionExists()
-        }
-
-        logger.info {
-            "[SESSION] sendMessage: taskId=${currentSession.id}, mode=${currentSession.mode}, " +
-                    "executionMode=${currentSession.executionMode}, inputChars=${input.length}, " +
-                    "contextRefs=${contextRefs.size}, model=${model ?: "auto"}, provider=${provider ?: "auto"}"
-        }
-        return sendMessageUsingWorkflow(currentSession, input, contextRefs, model, provider)
-    }
+    ): Message = coreSessionService.sendMessage(input, contextRefs, model, provider)
 
     /**
      * Rewind conversation to the given message (inclusive), delete all related execution/planning data,
@@ -392,7 +374,7 @@ class SessionManager(private val project: Project) {
         // 2) Clear related execution data (subtasks/logs/snapshots)
         projectRouter.subtaskRouter.deleteAllSubtasks(session.id)
         projectRouter.apiLogsRouter.deleteApiLogsByTaskId(session.id)
-        projectRouter.deleteSnapshotsByTaskId(session.id)
+        projectRouter.snapshotRouter.deleteSnapshotsByTaskId(session.id)
 
         // 3) Clear planning state if in PLAN mode (plans are tied to session)
         if (session.mode == TaskMode.PLAN) {
@@ -704,847 +686,6 @@ class SessionManager(private val project: Project) {
         subtaskTracker.executeSubtaskById(subtaskId)
     }
 
-    private suspend fun sendMessageUsingWorkflow(
-        session: Session,
-        input: String,
-        contextRefs: List<ContextReference>,
-        model: String?,
-        provider: String?
-    ): Message {
-        stateManager.setIsGenerating(true)
-        return try {
-            val stream = isStreamingEnabled()
-            val executionMode = session.executionMode
-            logger.info {
-                "[SESSION] Workflow start: taskId=${session.id}, mode=${session.mode}, " +
-                        "executionMode=$executionMode, stream=$stream"
-            }
-
-            val userMessage = Message(
-                id = UUID.randomUUID().toString(),
-                taskId = session.id,
-                role = "user",
-                content = input,
-                createdAt = System.currentTimeMillis()
-            )
-            stateManager.appendMessage(userMessage)
-
-            // Use AgentTurnLoop for PLAN/AGENT modes instead of WorkflowOrchestrator
-            when (session.mode) {
-                TaskMode.CHAT -> {
-                    sendMessageUsingChatWorkflow(session, input, contextRefs, model, provider, stream, executionMode)
-                }
-
-                TaskMode.PLAN,
-                TaskMode.AGENT -> {
-                    sendMessageUsingTurnLoop(session, input, contextRefs, model, provider, stream, executionMode)
-                }
-            }
-        } catch (e: Exception) {
-            logger.error(e) { "[SESSION] Workflow failed: taskId=${session.id}, error=${e.message}" }
-            uiAdapter.showError("Workflow failed: ${e.message}")
-            val errorMessage = Message(
-                id = UUID.randomUUID().toString(),
-                taskId = session.id,
-                role = "system",
-                content = "Error: ${e.message}",
-                createdAt = System.currentTimeMillis()
-            )
-            stateManager.appendMessage(errorMessage)
-            throw e
-        } finally {
-            stateManager.setIsGenerating(false)
-        }
-    }
-
-    /**
-     * Send message using new AgentTurnLoop for PLAN/AGENT modes.
-     * Implements Codex CLI-style turn loop where model self-directs tool usage.
-     */
-    private suspend fun sendMessageUsingTurnLoop(
-        session: Session,
-        input: String,
-        contextRefs: List<ContextReference>,
-        model: String?,
-        provider: String?,
-        stream: Boolean,
-        executionMode: pl.jclab.refio.api.models.ExecutionMode
-    ): Message {
-        logger.info {
-            "[TURN_LOOP] Starting turn: taskId=${session.id}, mode=${session.mode}, " +
-                    "inputChars=${input.length}, contextRefs=${contextRefs.size}"
-        }
-
-        GlobalMetrics.setCurrentOperation(
-            OperationInfo.ChatRequest(model ?: "auto")
-        )
-
-        try {
-            // Create streaming message for UI updates
-            var streamingMessageId: String? = null
-            val streamingClosed = AtomicBoolean(false)
-            val pendingStreamContent = AtomicReference<String?>(null)
-            val streamStateMutex = Mutex()
-            var streamUiFlushJob: Job? = null
-            val streamFilter = IncrementalToolCallStreamFilter()
-
-            // Create stream callback for UI updates
-            // Filter TOOL_CALL blocks from streaming content for cleaner display
-            val streamCallback: pl.jclab.refio.core.api.StreamCallback? = if (stream) { chunk ->
-                cs.launch {
-                    if (streamingClosed.get()) return@launch
-
-                    streamStateMutex.withLock {
-                        val now = System.currentTimeMillis()
-                        val filteredContent = streamFilter.filter(
-                            delta = chunk.delta,
-                            accumulated = chunk.accumulated,
-                            isComplete = chunk.isComplete
-                        )
-
-                        if (filteredContent.isNotBlank()) {
-                            if (streamingMessageId == null) {
-                                streamingMessageId = UUID.randomUUID().toString()
-                                stateManager.appendMessage(
-                                    Message(
-                                        id = streamingMessageId!!,
-                                        taskId = session.id,
-                                        role = "assistant",
-                                        content = "",
-                                        isStreaming = true,
-                                        streamStartedAt = now,
-                                        createdAt = now
-                                    )
-                                )
-                            }
-                            pendingStreamContent.set(filteredContent)
-                        }
-
-                        if (chunk.isComplete) {
-                            val completedId = streamingMessageId
-                            val finalContent = pendingStreamContent.getAndSet(null)
-                            if (completedId != null) {
-                                stateManager.updateMessages { messages ->
-                                    messages.map { msg ->
-                                        if (msg.id == completedId) {
-                                            msg.copy(
-                                                content = finalContent ?: msg.content,
-                                                lastChunkAt = now,
-                                                isStreaming = false
-                                            )
-                                        } else msg
-                                    }
-                                }
-                            }
-                            streamingMessageId = null
-                            streamUiFlushJob?.cancel()
-                            streamUiFlushJob = null
-                            return@withLock
-                        }
-
-                        if (streamUiFlushJob?.isActive != true) {
-                            streamUiFlushJob = cs.launch {
-                                while (!streamingClosed.get()) {
-                                    val contentToFlush = streamStateMutex.withLock {
-                                        pendingStreamContent.getAndSet(null)
-                                    }
-
-                                    if (!contentToFlush.isNullOrBlank()) {
-                                        val activeMessageId = streamingMessageId
-                                        if (activeMessageId != null) {
-                                            stateManager.updateMessages { messages ->
-                                                messages.map { msg ->
-                                                    if (msg.id == activeMessageId) {
-                                                        msg.copy(
-                                                            content = contentToFlush,
-                                                            lastChunkAt = System.currentTimeMillis(),
-                                                            isStreaming = true
-                                                        )
-                                                    } else msg
-                                                }
-                                            }
-                                        }
-                                    }
-
-                                    delay(500)
-                                }
-                            }
-                        }
-                    }
-                }
-            } else null
-
-            // Create turn event listener for UI updates
-            val turnListener = object : pl.jclab.refio.core.services.AgentTurnLoop.TurnEventListener {
-                override fun onTurnStarted(
-                    taskId: String,
-                    mode: pl.jclab.refio.core.db.TaskMode,
-                    runId: String,
-                    parentRunId: String?,
-                    depth: Int
-                ) {
-                    logger.info {
-                        "[TURN_LOOP] Turn started: taskId=$taskId, mode=$mode, runId=$runId, " +
-                            "parentRunId=${parentRunId ?: "-"}, depth=$depth"
-                    }
-                }
-
-                override fun onToolExecutionStarted(taskId: String, toolCall: pl.jclab.refio.core.db.ToolCallData) {
-                    logger.info { "[TURN_LOOP] Tool started: ${toolCall.name}" }
-                    cs.launch {
-                        subtaskTracker.loadSubtasks()
-
-                        // PHASE 2 FIX: Create temporary message for real-time UI updates
-                        // This message will be replaced by loadMessages() at turn end with the DB version
-                        val toolInfo = ToolCallDisplayInfo(
-                            toolName = toolCall.name,
-                            toolCallId = toolCall.id,
-                            displayType = resolveToolDisplayType(toolCall.name),
-                            parameters = parseToolParameters(toolCall.arguments),
-                            status = ToolCallStatus.EXECUTING
-                        )
-
-                        val tempMessage = Message(
-                            id = "temp-${toolCall.id}", // Temporary ID - will be replaced by DB version
-                            taskId = taskId,
-                            role = "assistant",
-                            content = "",
-                            toolCallInfo = toolInfo,
-                            createdAt = System.currentTimeMillis()
-                        )
-
-                        stateManager.appendMessage(tempMessage)
-                        toolCallMessageIds[toolCall.id] = tempMessage.id
-                        logger.debug { "[TURN_LOOP] Created temp message for tool ${toolCall.name}: tempId=${tempMessage.id}" }
-                    }
-                }
-
-                override fun onToolStreamChunk(
-                    taskId: String,
-                    toolCallId: String,
-                    delta: String,
-                    accumulated: String
-                ) {
-                    // PHASE 2 FIX: Update temporary message with streaming content
-                    val messageId = toolCallMessageIds[toolCallId]
-                    if (messageId == null) {
-                        logger.warn { "[TURN_LOOP] Tool stream chunk for unknown tool call: $toolCallId" }
-                        return
-                    }
-
-                    cs.launch {
-                        stateManager.updateMessages { messages ->
-                            messages.map { msg ->
-                                if (msg.id == messageId) {
-                                    msg.copy(
-                                        content = accumulated,
-                                        isStreaming = true,
-                                        isToolStreaming = true,
-                                        lastChunkAt = System.currentTimeMillis()
-                                    )
-                                } else msg
-                            }
-                        }
-                    }
-                }
-
-                override fun onToolExecutionCompleted(
-                    taskId: String,
-                    toolCall: pl.jclab.refio.core.db.ToolCallData,
-                    result: String,
-                    success: Boolean
-                ) {
-                    logger.info { "[TURN_LOOP] Tool completed: ${toolCall.name}, success=$success" }
-                    cs.launch {
-                        subtaskTracker.loadSubtasks()
-
-                        // PHASE 2 FIX: Update temporary message status and result
-                        val messageId = toolCallMessageIds[toolCall.id]
-                        if (messageId != null) {
-                            val resultSummary = if (result.isNotBlank()) {
-                                val trimmed = result.trim()
-                                if (trimmed.length <= 120) trimmed
-                                else "${trimmed.take(120)}..."
-                            } else null
-                            stateManager.updateMessages { messages ->
-                                messages.map { msg ->
-                                    if (msg.id == messageId) {
-                                        val updatedToolInfo = msg.toolCallInfo?.copy(
-                                            status = if (success) ToolCallStatus.COMPLETED else ToolCallStatus.FAILED,
-                                            result = if (resultSummary != null) ToolCallResult(
-                                                success = success,
-                                                summary = resultSummary
-                                            ) else null
-                                        )
-                                        msg.copy(
-                                            toolCallInfo = updatedToolInfo,
-                                            isStreaming = false,
-                                            isToolStreaming = false,
-                                            lastChunkAt = System.currentTimeMillis()
-                                        )
-                                    } else msg
-                                }
-                            }
-                            // Remove from tracking map
-                            toolCallMessageIds.remove(toolCall.id)
-                            logger.debug { "[TURN_LOOP] Finalized temp message for tool ${toolCall.name}: tempId=$messageId" }
-                        }
-
-                        // NOTE: AgentTurnLoop creates TOOL message with result in DB
-                        // MessageDispatcher.loadMessages() at turn end will replace temp messages with DB versions
-                    }
-                }
-
-                override fun onStreamChunk(taskId: String, delta: String, accumulated: String) {
-                    // Handled by streamCallback
-                }
-
-                override fun onTurnCompleted(
-                    taskId: String,
-                    result: pl.jclab.refio.core.services.TurnResult,
-                    runId: String,
-                    parentRunId: String?,
-                    depth: Int
-                ) {
-                    logger.info {
-                        "[TURN_LOOP] Turn completed: taskId=$taskId, success=${result.success}, " +
-                                "iterations=${result.iterations}, runId=$runId, " +
-                                "parentRunId=${parentRunId ?: "-"}, depth=$depth"
-                    }
-                }
-            }
-
-            val modeDb = pl.jclab.refio.core.db.TaskMode.valueOf(session.mode.name)
-            val executionModeDb = pl.jclab.refio.core.db.ExecutionMode.valueOf(executionMode.name)
-            val defaultTurnRequest = pl.jclab.refio.core.api.TurnRequest(
-                taskId = session.id,
-                userInput = input,
-                mode = modeDb,
-                executionMode = executionModeDb,
-                model = model,
-                provider = provider,
-                userContextRefs = contextRefs
-            )
-
-            val subagentRouter = projectRouter.subagentRouter
-            val subagentCommand = subagentRouter?.parseSubagentCommand(input)
-            val subagentInvocation = subagentRouter?.parseSubagentInvocation(input)
-
-            if (subagentCommand != null && subagentInvocation == null) {
-                val (requestedName, _) = subagentCommand
-                val allSubagents = subagentRouter.listSubagents(includeDisabled = true)
-                val matched = allSubagents.firstOrNull { it.name.equals(requestedName, ignoreCase = true) }
-                val enabledSubagentNames = allSubagents
-                    .filter { it.enabled }
-                    .map { it.name }
-                    .sorted()
-
-                val errorContent = when {
-                    matched == null -> buildString {
-                        append("Subagent '")
-                        append(requestedName)
-                        append("' not found.")
-                        if (enabledSubagentNames.isNotEmpty()) {
-                            append(" Available subagents: ")
-                            append(enabledSubagentNames.joinToString(", "))
-                            append(".")
-                        }
-                    }
-                    !matched.enabled -> "Subagent '${matched.name}' is disabled. Enable it in Settings > Subagents."
-                    else -> "Subagent '$requestedName' is not available."
-                }
-
-                logger.warn {
-                    "[TURN_LOOP] Invalid subagent invocation: name=$requestedName, reason='${errorContent.replace('\n', ' ')}'"
-                }
-
-                val assistantMessage = Message(
-                    id = UUID.randomUUID().toString(),
-                    taskId = session.id,
-                    role = "assistant",
-                    content = errorContent,
-                    createdAt = System.currentTimeMillis()
-                )
-                stateManager.appendMessage(assistantMessage)
-                return assistantMessage
-            }
-
-            val turnRequest = if (subagentInvocation != null) {
-                val (subagentName, subagentPrompt) = subagentInvocation
-                val definition = subagentRouter.getSubagent(subagentName)
-
-                if (definition != null) {
-                    val parentModel = if (model != null && provider != null) "$provider/$model" else model
-                    val (resolvedModel, resolvedProvider) = definition.resolveModel(configService, parentModel)
-
-                    logger.info {
-                        "[TURN_LOOP] subagentDetected=true, subagentName=$subagentName, " +
-                            "runProfile=SUBAGENT, model=$resolvedModel, provider=$resolvedProvider"
-                    }
-
-                    pl.jclab.refio.core.api.TurnRequest(
-                        taskId = session.id,
-                        userInput = subagentPrompt,
-                        mode = modeDb,
-                        executionMode = executionModeDb,
-                        model = resolvedModel,
-                        provider = resolvedProvider,
-                        userContextRefs = contextRefs,
-                        runProfile = pl.jclab.refio.core.api.TurnRunProfile.SUBAGENT,
-                        profileOverrides = pl.jclab.refio.core.api.TurnProfileOverrides(
-                            subagentName = subagentName,
-                            systemPromptOverride = definition.systemPrompt,
-                            allowedTools = definition.allowedTools,
-                            disallowedTools = definition.disallowedTools,
-                            modelOverride = resolvedModel,
-                            providerOverride = resolvedProvider,
-                            maxIterationsOverride = definition.maxSteps,
-                            depth = 0,
-                            subagentChain = emptyList(),
-                            contextProfile = definition.contextProfile,
-                            reasoningEffort = definition.reasoningEffort
-                        )
-                    )
-                } else {
-                    logger.warn {
-                        "[TURN_LOOP] subagentDetected=true but definition not found: name=$subagentName, falling back"
-                    }
-                    defaultTurnRequest
-                }
-            } else {
-                defaultTurnRequest
-            }
-
-            // Execute turn using AgentTurnLoop
-            val result = projectRouter.agentRouter.runTurn(
-                request = turnRequest,
-                streamCallback = streamCallback,
-                listener = turnListener
-            )
-
-            logger.info {
-                "[TURN_LOOP] Turn complete: taskId=${session.id}, success=${result.success}, " +
-                        "iterations=${result.iterations}, responseChars=${result.response.length}"
-            }
-
-            streamingClosed.set(true)
-            streamUiFlushJob?.cancel()
-            val completedStreamingMessageId = streamingMessageId
-            streamingMessageId = null
-            if (completedStreamingMessageId != null) {
-                stateManager.updateMessages { messages ->
-                    messages.filterNot { it.id == completedStreamingMessageId }
-                }
-            }
-
-            // Reload messages from database (includes all tool calls and results)
-            messageDispatcher.loadMessages()
-
-            // PHASE 2 FIX: Clear temporary message IDs after DB reload
-            toolCallMessageIds.clear()
-            logger.debug { "[TURN_LOOP] Cleared tool call message tracking map after DB reload" }
-
-            // Update session costs
-            val freshTask = pl.jclab.refio.core.db.repositories.TaskRepository().findById(session.id)
-            if (freshTask != null) {
-                val updatedSession = session.copy(
-                    tokensIn = freshTask.tokensIn,
-                    tokensOut = freshTask.tokensOut,
-                    costUsd = freshTask.costUsd
-                )
-                updateSession(updatedSession)
-            }
-
-            // Auto-name session if needed
-            if (isDefaultSessionName(session.name) && stateManager.messages.value.size >= 2) {
-                scheduleAutoNameSession(session, input)
-            }
-
-        return stateManager.messages.value.last()
-        } finally {
-            GlobalMetrics.setCurrentOperation(OperationInfo.Idle)
-            logger.info { "[TURN_LOOP] Operation state reset to Idle" }
-        }
-    }
-
-    /**
-     * Send message using existing WorkflowOrchestrator for CHAT mode.
-     * CHAT mode has no tools - direct LLM conversation.
-     */
-    private suspend fun sendMessageUsingChatWorkflow(
-        session: Session,
-        input: String,
-        contextRefs: List<ContextReference>,
-        model: String?,
-        provider: String?,
-        stream: Boolean,
-        executionMode: pl.jclab.refio.api.models.ExecutionMode
-    ): Message {
-        logger.info {
-            "[CHAT_WORKFLOW] Starting chat: taskId=${session.id}, inputChars=${input.length}"
-        }
-
-        val uiState = UIState(
-            taskId = session.id,
-            mode = session.mode,
-            executionMode = executionMode,
-            input = input,
-            contextRefs = contextRefs,
-            model = model,
-            provider = provider,
-            streamingEnabled = stream,
-            thinkingEnabled = stateManager.getThinkingEnabled(),
-            noEgressEnabled = stateManager.getNoEgressEnabled()
-        )
-
-        val listener = SwingWorkflowListener(
-            taskId = session.id,
-            stateManager = stateManager,
-            scope = cs,
-            streamingEnabled = stream
-        )
-
-        // Generate project analysis summary for intent classification (if enabled)
-        val projectAnalysis = try {
-            projectRouter.projectContextRouter.getProjectAnalysisSummary()
-        } catch (e: Exception) {
-            logger.warn(e) { "[SESSION] Failed to generate project analysis, using null" }
-            null
-        }
-
-        val result = projectRouter.workflowOrchestrator.execute(
-            request = WorkflowRequest(
-                uiState = uiState,
-                projectAnalysis = projectAnalysis
-            ),
-            listener = listener
-        )
-
-        logger.info { "[CHAT_WORKFLOW] Workflow result: taskId=${session.id}, type=${result::class.simpleName}" }
-
-        when (result) {
-            is IntentResult.ChatResult -> {
-                val response = result.response
-                logger.info {
-                    "[CHAT_WORKFLOW] Chat response: taskId=${response.taskId}, outputChars=${response.output.length}"
-                }
-                if (response.taskId != session.id) {
-                    logger.info { "[CHAT] Task ID changed: ${session.id} -> ${response.taskId}, syncing session" }
-                    uiAdapter.log("INFO", "Session ID changed: ${session.id} -> ${response.taskId}")
-                    val newSession = session.copy(id = response.taskId)
-                    stateManager.setActiveSession(newSession)
-                }
-                updateSessionCosts(stateManager.getActiveSession() ?: session)
-                autoNameSessionIfNeeded(stateManager.getActiveSession() ?: session, input)
-                messageDispatcher.loadMessages()
-            }
-
-            is IntentResult.SubagentResult -> {
-                logger.info { "[CHAT_WORKFLOW] Subagent response: taskId=${session.id}" }
-                messageDispatcher.loadMessages()
-            }
-
-            else -> {
-                logger.warn { "[CHAT_WORKFLOW] Unexpected result type in CHAT mode: ${result::class.simpleName}" }
-            }
-        }
-
-        return stateManager.messages.value.last()
-    }
-
-    private fun isStreamingEnabled(): Boolean {
-        return try {
-            val streamingConfig = projectRouter.configService.get(
-                key = pl.jclab.refio.core.services.ConfigService.KEY_STREAMING_ENABLED,
-                scope = pl.jclab.refio.core.db.ConfigScope.APP
-            )
-            streamingConfig?.toBoolean() ?: true
-        } catch (e: Exception) {
-            logger.warn(e) { "Failed to read streaming config, defaulting to true" }
-            true
-        }
-    }
-
-    private suspend fun updateSessionCosts(session: Session) {
-        val freshTask = pl.jclab.refio.core.db.repositories.TaskRepository().findById(session.id)
-
-        if (freshTask != null) {
-            updateSession(
-                session.copy(
-                    tokensIn = freshTask.tokensIn,
-                    tokensOut = freshTask.tokensOut,
-                    costUsd = freshTask.costUsd
-                )
-            )
-        }
-    }
-
-    private fun isDefaultSessionName(name: String): Boolean {
-        return name == "New Session" || name.matches(Regex("^Session \\(.+\\)$"))
-    }
-
-    private suspend fun autoNameSessionIfNeeded(session: Session, input: String) {
-        if (isDefaultSessionName(session.name) && stateManager.messages.value.size == 2) {
-            scheduleAutoNameSession(session, input)
-        }
-    }
-
-    private fun scheduleAutoNameSession(session: Session, input: String) {
-        if (!isDefaultSessionName(session.name)) return
-
-        cs.launch {
-            try {
-                val rawTitle = projectRouter.chatRouter.generateSessionTitle(session.id, input)
-                val generatedName = sanitizeSessionTitle(rawTitle)
-                    .ifBlank { generateSessionNameFallback(input) }
-
-                projectRouter.taskRouter.updateTask(session.id, pl.jclab.refio.core.api.UpdateTaskRequest(name = generatedName))
-                updateSession(
-                    stateManager.getActiveSession()?.copy(name = generatedName)
-                        ?: return@launch
-                )
-                logger.info { "Auto-named: '$generatedName'" }
-            } catch (e: Exception) {
-                val fallback = generateSessionNameFallback(input)
-                try {
-                    projectRouter.taskRouter.updateTask(session.id, pl.jclab.refio.core.api.UpdateTaskRequest(name = fallback))
-                    updateSession(
-                        stateManager.getActiveSession()?.copy(name = fallback)
-                            ?: return@launch
-                    )
-                    logger.info { "Auto-named with fallback: '$fallback'" }
-                } catch (inner: Exception) {
-                    logger.warn(inner) { "Auto-name failed" }
-                }
-            }
-        }
-    }
-
-    private fun sanitizeSessionTitle(raw: String): String {
-        return raw
-            .trim()
-            .trim('"', '\'', '“', '”')
-            .replace(Regex("[\\r\\n]+"), " ")
-            .replace(Regex("\\s+"), " ")
-            .replace(Regex("[.!?:;]+$"), "")
-            .take(60)
-    }
-
-    private fun generateSessionNameFallback(input: String): String {
-        val cleaned = input
-            .trim()
-            .replace(Regex("\\s+"), " ")
-            .replace(Regex("[\\r\\n]+"), " ")
-
-        val firstSentence = cleaned.split(Regex("[.!?]\\s+")).firstOrNull() ?: cleaned
-        val truncated = if (firstSentence.length > 50) {
-            firstSentence.substring(0, 47) + "..."
-        } else {
-            firstSentence
-        }
-
-        return truncated.ifBlank { "Chat" }
-    }
-    private fun resolveToolDisplayType(toolName: String): ToolDisplayType {
-        return when (toolName) {
-            "advance_code_editing", "multi_line_editor" -> ToolDisplayType.LLM_EDIT
-            "code_editing", "create_new_file", "multi_edit" -> ToolDisplayType.CODE_EDIT
-            "run_terminal_command" -> ToolDisplayType.TERMINAL
-            else -> ToolDisplayType.SIMPLE
-        }
-    }
-
-    private fun parseToolParameters(argumentsJson: String): Map<String, String> {
-        return try {
-            val json = kotlinx.serialization.json.Json { ignoreUnknownKeys = true }
-            val args = json.parseToJsonElement(argumentsJson)
-            val argsObj = args as? kotlinx.serialization.json.JsonObject ?: return emptyMap()
-
-            argsObj.entries.associate { (key, value) ->
-                key to when (value) {
-                    is kotlinx.serialization.json.JsonPrimitive -> value.content
-                    else -> value.toString()
-                }
-            }
-        } catch (e: Exception) {
-            logger.warn(e) { "Failed to parse tool arguments" }
-            emptyMap()
-        }
-    }
-
-
-    /**
-     * Build display text for tool call bubble (assistant role).
-     * Shows what tool is being called with key parameters.
-     */
-    private fun buildToolCallDisplay(toolName: String, argumentsJson: String): String {
-        return try {
-            val json = kotlinx.serialization.json.Json { ignoreUnknownKeys = true }
-            val args = json.parseToJsonElement(argumentsJson)
-            val argsObj = args as? kotlinx.serialization.json.JsonObject ?: return "📤 **$toolName**"
-
-            when (toolName) {
-                "advance_code_editing", "multi_line_editor" -> {
-                    val path = argsObj["path"]?.let {
-                        (it as? kotlinx.serialization.json.JsonPrimitive)?.content
-                    } ?: "unknown"
-                    val description = argsObj["edit_description"]?.let {
-                        (it as? kotlinx.serialization.json.JsonPrimitive)?.content
-                    }
-                    val shortDesc = description?.take(80)?.let { if (description.length > 80) "$it..." else it }
-                    buildString {
-                        append("📤 **$toolName**\n")
-                        append("```\npath: $path")
-                        if (shortDesc != null) {
-                            append("\nedit_description: $shortDesc")
-                        }
-                        append("\n```")
-                    }
-                }
-
-                "code_editing" -> {
-                    val path = argsObj["path"]?.let {
-                        (it as? kotlinx.serialization.json.JsonPrimitive)?.content
-                    } ?: "unknown"
-                    val oldString = argsObj["old_string"]?.let {
-                        (it as? kotlinx.serialization.json.JsonPrimitive)?.content
-                    }?.take(50)?.let { if (it.length >= 50) "$it..." else it }
-                    buildString {
-                        append("📤 **$toolName**\n")
-                        append("```\npath: $path")
-                        if (oldString != null) {
-                            append("\nold_string: $oldString")
-                        }
-                        append("\n```")
-                    }
-                }
-
-                "create_new_file" -> {
-                    val path = argsObj["path"]?.let {
-                        (it as? kotlinx.serialization.json.JsonPrimitive)?.content
-                    } ?: "unknown"
-                    "📤 **$toolName**\n```\npath: $path\n```"
-                }
-
-                "read_file" -> {
-                    val path = argsObj["path"]?.let {
-                        (it as? kotlinx.serialization.json.JsonPrimitive)?.content
-                    } ?: "unknown"
-                    "📤 **$toolName**\n```\npath: $path\n```"
-                }
-
-                "read_directory" -> {
-                    val path = argsObj["path"]?.let {
-                        (it as? kotlinx.serialization.json.JsonPrimitive)?.content
-                    } ?: "."
-                    "📤 **$toolName**\n```\npath: $path\n```"
-                }
-
-                "file_search" -> {
-                    val pattern = argsObj["pattern"]?.let {
-                        (it as? kotlinx.serialization.json.JsonPrimitive)?.content
-                    } ?: "*"
-                    "📤 **$toolName**\n```\npattern: $pattern\n```"
-                }
-
-                "grep_search" -> {
-                    val pattern = argsObj["pattern"]?.let {
-                        (it as? kotlinx.serialization.json.JsonPrimitive)?.content
-                    } ?: ""
-                    val shortPattern = pattern.take(60).let { if (pattern.length > 60) "$it..." else it }
-                    "📤 **$toolName**\n```\npattern: $shortPattern\n```"
-                }
-
-                else -> "📤 **$toolName**"
-            }
-        } catch (e: Exception) {
-            logger.warn(e) { "[TURN_LOOP] Failed to parse tool arguments for display" }
-            "📤 **$toolName**"
-        }
-    }
-
-    /**
-     * Build a user-friendly summary for tool execution display.
-     * Parses tool arguments to extract meaningful info (path, description, etc.)
-     */
-    private fun buildToolExecutionSummary(toolName: String, argumentsJson: String): String {
-        return try {
-            val json = kotlinx.serialization.json.Json { ignoreUnknownKeys = true }
-            val args = json.parseToJsonElement(argumentsJson)
-            val argsObj = args as? kotlinx.serialization.json.JsonObject ?: return "🔧 Executing: $toolName"
-
-            when (toolName) {
-                "advance_code_editing", "multi_line_editor" -> {
-                    val path = argsObj["path"]?.let {
-                        (it as? kotlinx.serialization.json.JsonPrimitive)?.content
-                    } ?: "unknown"
-                    val description = argsObj["edit_description"]?.let {
-                        (it as? kotlinx.serialization.json.JsonPrimitive)?.content
-                    }
-                    val shortPath = path.substringAfterLast("/").substringAfterLast("\\")
-                    val shortDesc = description?.take(100)?.let { if (description.length > 100) "$it..." else it }
-                    buildString {
-                        append("🔧 Executing: **$toolName**\n")
-                        append("📄 File: `$shortPath`\n")
-                        if (shortDesc != null) {
-                            append("📝 $shortDesc")
-                        }
-                    }
-                }
-
-                "code_editing" -> {
-                    val path = argsObj["path"]?.let {
-                        (it as? kotlinx.serialization.json.JsonPrimitive)?.content
-                    } ?: "unknown"
-                    val shortPath = path.substringAfterLast("/").substringAfterLast("\\")
-                    "🔧 Executing: **$toolName**\n📄 File: `$shortPath`"
-                }
-
-                "create_new_file" -> {
-                    val path = argsObj["path"]?.let {
-                        (it as? kotlinx.serialization.json.JsonPrimitive)?.content
-                    } ?: "unknown"
-                    val shortPath = path.substringAfterLast("/").substringAfterLast("\\")
-                    "🔧 Executing: **$toolName**\n📄 Creating: `$shortPath`"
-                }
-
-                "read_file" -> {
-                    val path = argsObj["path"]?.let {
-                        (it as? kotlinx.serialization.json.JsonPrimitive)?.content
-                    } ?: "unknown"
-                    val shortPath = path.substringAfterLast("/").substringAfterLast("\\")
-                    "🔧 Executing: **$toolName**\n📄 Reading: `$shortPath`"
-                }
-
-                "read_directory" -> {
-                    val path = argsObj["path"]?.let {
-                        (it as? kotlinx.serialization.json.JsonPrimitive)?.content
-                    } ?: "."
-                    "🔧 Executing: **$toolName**\n📁 Directory: `$path`"
-                }
-
-                "file_search" -> {
-                    val pattern = argsObj["pattern"]?.let {
-                        (it as? kotlinx.serialization.json.JsonPrimitive)?.content
-                    } ?: "*"
-                    "🔧 Executing: **$toolName**\n🔍 Pattern: `$pattern`"
-                }
-
-                "grep_search" -> {
-                    val pattern = argsObj["pattern"]?.let {
-                        (it as? kotlinx.serialization.json.JsonPrimitive)?.content
-                    } ?: ""
-                    val shortPattern = pattern.take(50).let { if (pattern.length > 50) "$it..." else it }
-                    "🔧 Executing: **$toolName**\n🔍 Pattern: `$shortPattern`"
-                }
-
-                else -> "🔧 Executing: **$toolName**"
-            }
-        } catch (e: Exception) {
-            logger.warn(e) { "[TURN_LOOP] Failed to parse tool arguments for summary" }
-            "🔧 Executing: $toolName"
-        }
-    }
 
     suspend fun cancelAllPendingSteps() {
         subtaskTracker.cancelAllPendingSteps()
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/session/StatusBarIntegration.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/session/StatusBarIntegration.kt
index 9d998289..15af3809 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/session/StatusBarIntegration.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/services/session/StatusBarIntegration.kt
@@ -1,6 +1,6 @@
 package pl.jclab.refio.services.session
 
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import pl.jclab.refio.ui.components.toolbar.StatusBar
 
 class StatusBarIntegration {
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/startup/RagIndexingStartup.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/startup/RagIndexingStartup.kt
index 163b44f8..1b8144f7 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/startup/RagIndexingStartup.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/startup/RagIndexingStartup.kt
@@ -6,7 +6,7 @@ import com.intellij.openapi.project.Project
 import com.intellij.openapi.startup.ProjectActivity
 import pl.jclab.refio.core.config.ConfigKeys
 import pl.jclab.refio.services.core.CoreConnectionManager
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import pl.jclab.refio.services.rag.BackgroundIndexingTask
 
 class RagIndexingStartup : ProjectActivity {
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/startup/RefioIntentionStartup.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/startup/RefioIntentionStartup.kt
index a68c11f1..6ec053ea 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/startup/RefioIntentionStartup.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/startup/RefioIntentionStartup.kt
@@ -3,8 +3,8 @@ package pl.jclab.refio.startup
 import com.intellij.codeInsight.intention.IntentionManager
 import com.intellij.openapi.project.Project
 import com.intellij.openapi.startup.ProjectActivity
-import pl.jclab.refio.actions.RefioSlashCommandIntentionAction
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.actions.RefioSlashPromptIntentionAction
+import pl.jclab.refio.core.logging.dualLogger
 import java.util.concurrent.atomic.AtomicBoolean
 
 class RefioIntentionStartup : ProjectActivity {
@@ -13,10 +13,10 @@ class RefioIntentionStartup : ProjectActivity {
 
     override suspend fun execute(project: Project) {
         if (registered.compareAndSet(false, true)) {
-            IntentionManager.getInstance().addAction(RefioSlashCommandIntentionAction())
-            logger.info { "[STARTUP] Registered Refio slash command intention" }
+            IntentionManager.getInstance().addAction(RefioSlashPromptIntentionAction())
+            logger.info { "[STARTUP] Registered Refio slash prompt intention" }
         } else {
-            logger.debug { "[STARTUP] Refio slash command intention already registered" }
+            logger.debug { "[STARTUP] Refio slash prompt intention already registered" }
         }
     }
 
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/completion/RefioCompletionContributor.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/completion/RefioCompletionContributor.kt
index 3f959be0..c32c10de 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/completion/RefioCompletionContributor.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/completion/RefioCompletionContributor.kt
@@ -20,7 +20,7 @@ import kotlinx.coroutines.CoroutineScope
 import kotlinx.coroutines.Dispatchers
 import kotlinx.coroutines.launch
 import pl.jclab.refio.api.models.ContextReference
-import pl.jclab.refio.api.models.SlashCommand
+import pl.jclab.refio.api.models.SlashPrompt
 import pl.jclab.refio.core.db.PromptType
 import pl.jclab.refio.core.context.ContextProviderRegistry
 import pl.jclab.refio.core.context.LoadSubmenuItemsArgs
@@ -39,12 +39,12 @@ class RefioCompletionContributor : CompletionContributor() {
         val REPLACE_CONTEXT_PREFIX_KEY: Key<(String) -> Unit> =
             Key.create("refio.promptEditor.replaceContextPrefix")
 
-        private val slashCommandsCacheLock = Any()
+        private val slashPromptsCacheLock = Any()
         @Volatile
-        private var cachedSlashCommands: List<SlashCommand> = SlashCommand.BUILTINS
+        private var cachedSlashPrompts: List<SlashPrompt> = SlashPrompt.BUILTINS
         @Volatile
-        private var lastSlashCommandsLoadAt: Long = 0
-        private const val SLASH_COMMANDS_CACHE_MS = 5_000L
+        private var lastSlashPromptsLoadAt: Long = 0
+        private const val SLASH_PROMPTS_CACHE_MS = 5_000L
 
         private val subagentsCacheLock = Any()
         @Volatile
@@ -62,24 +62,24 @@ class RefioCompletionContributor : CompletionContributor() {
         }
 
         @Suppress("UNUSED_PARAMETER")
-        private fun loadSlashCommands(_project: Project): List<SlashCommand> {
+        private fun loadSlashPrompts(_project: Project): List<SlashPrompt> {
             val now = System.currentTimeMillis()
-            synchronized(slashCommandsCacheLock) {
-                if (now - lastSlashCommandsLoadAt < SLASH_COMMANDS_CACHE_MS && cachedSlashCommands.isNotEmpty()) {
-                    return cachedSlashCommands
+            synchronized(slashPromptsCacheLock) {
+                if (now - lastSlashPromptsLoadAt < SLASH_PROMPTS_CACHE_MS && cachedSlashPrompts.isNotEmpty()) {
+                    return cachedSlashPrompts
                 }
 
                 return try {
                     val router = CoreConnectionManager.getInstance().getApiRouter()
-                    val response = router.promptsRouter.getPromptsByType(PromptType.SLASH_COMMAND)
+                    val response = router.promptsRouter.getPromptsByType(PromptType.SLASH_PROMPT)
 
-                    val commands = response.prompts
+                    val slashPrompts = response.prompts
                         .filter { it.isEnabled }
                         .map { prompt ->
-                            SlashCommand(
+                            SlashPrompt(
                                 id = prompt.id,
                                 name = prompt.name.removePrefix("/"),
-                                description = prompt.description ?: "Custom command",
+                                description = prompt.description ?: "Custom prompt",
                                 template = prompt.content,
                                 variables = extractVariablesFromTemplate(prompt.content),
                                 category = "custom",
@@ -87,15 +87,15 @@ class RefioCompletionContributor : CompletionContributor() {
                             )
                         }
 
-                    val resolved = if (commands.isEmpty()) SlashCommand.BUILTINS else commands
-                    cachedSlashCommands = resolved
-                    lastSlashCommandsLoadAt = now
+                    val resolved = if (slashPrompts.isEmpty()) SlashPrompt.BUILTINS else slashPrompts
+                    cachedSlashPrompts = resolved
+                    lastSlashPromptsLoadAt = now
                     resolved
                 } catch (e: Exception) {
-                    log.warn("Failed to load slash commands for completion, using built-ins", e)
-                    cachedSlashCommands = SlashCommand.BUILTINS
-                    lastSlashCommandsLoadAt = now
-                    cachedSlashCommands
+                    log.warn("Failed to load slash prompts for completion, using built-ins", e)
+                    cachedSlashPrompts = SlashPrompt.BUILTINS
+                    lastSlashPromptsLoadAt = now
+                    cachedSlashPrompts
                 }
             }
         }
@@ -204,19 +204,19 @@ class RefioCompletionContributor : CompletionContributor() {
                             }
                         }
 
-                        // Slash commands: /explain, /fix, etc.
+                        // Slash prompts: /explain, /fix, etc.
                         token.startsWith("/") -> {
                             val cleanPrefix = token.removePrefix("/").lowercase()
                             val prefixMatcher = result.withPrefixMatcher(token)
-                            loadSlashCommands(project)
+                            loadSlashPrompts(project)
                                 .sortedBy { it.name.lowercase() }
-                                .forEach { cmd ->
-                                    if (cleanPrefix.isEmpty() || cmd.name.lowercase().startsWith(cleanPrefix)) {
+                                .forEach { sp ->
+                                    if (cleanPrefix.isEmpty() || sp.name.lowercase().startsWith(cleanPrefix)) {
                                         prefixMatcher.addElement(
-                                            LookupElementBuilder.create("/${cmd.name} ")
-                                                .withPresentableText("/${cmd.name}")
-                                                .withTypeText("Command", true)
-                                                .withTailText(" ${cmd.description}", true),
+                                            LookupElementBuilder.create("/${sp.name} ")
+                                                .withPresentableText("/${sp.name}")
+                                                .withTypeText("Prompt", true)
+                                                .withTailText(" ${sp.description}", true),
                                         )
                                     }
                                 }
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/agents/AgentExecutionPanel.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/agents/AgentExecutionPanel.kt
index 558c248c..88bd4887 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/agents/AgentExecutionPanel.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/agents/AgentExecutionPanel.kt
@@ -126,73 +126,94 @@ class AgentExecutionPanel : JPanel(BorderLayout()), Disposable {
     }
 
     private fun handleEvent(event: AgentEvent) {
+        // StreamChunk events fire per-token — they belong to the per-agent chat bubble pipeline
+        // (consumed in CoreSessionService), not to the Trace / Timeline diagnostic views which
+        // would otherwise be flooded with hundreds of empty rows per second.
+        if (event is AgentEvent.StreamChunk) return
+
         // Always feed the trace panel — it knows which events to render
         tracePanel.handleEvent(event)
         // Timeline shows everything
         timelinePanel.addEvent(event)
 
         SwingUtilities.invokeLater {
-            // Auto-register a graph node for any new sourceAgentId so single-agent
-            // sessions also get a visible node (not just multi-agent runs).
-            if (!agentNames.containsKey(event.sourceAgentId)) {
-                val name = when (event) {
-                    is AgentEvent.AgentStarted -> event.agentName
-                    else -> "Session ${event.sourceAgentId.take(8)}"
-                }
-                val eventDepth = when (event) {
-                    is AgentEvent.TurnStarted -> event.depth
-                    is AgentEvent.LLMCallCompleted -> event.depth
-                    is AgentEvent.ToolCalled -> event.depth
-                    else -> 0
-                }
-                agentNames[event.sourceAgentId] = name
-                graphPanel.addOrUpdateAgent(
-                    agentId = event.sourceAgentId,
-                    name = name,
-                    depth = eventDepth,
-                    status = AgentNodeStatus.RUNNING
-                )
-            }
+            // Subagent invocations share the parent's sourceAgentId but spawn their own
+            // runId per TurnStarted. Key graph nodes on runId for depth > 0 so each
+            // subagent run shows up as its own indented node; keep sourceAgentId for
+            // top-level sessions since runId may be absent on pure AgentStarted events.
+            val depth = graphDepthOf(event)
+            val nodeId = graphNodeIdOf(event, depth)
+            ensureNode(nodeId, event, depth)
 
             when (event) {
                 is AgentEvent.AgentStarted -> {
-                    agentNames[event.sourceAgentId] = event.agentName
-                    graphPanel.addOrUpdateAgent(
-                        agentId = event.sourceAgentId,
-                        name = event.agentName,
-                        depth = 0,
-                        status = AgentNodeStatus.RUNNING
-                    )
+                    agentNames[nodeId] = event.agentName
+                    graphPanel.addOrUpdateAgent(nodeId, event.agentName, depth, AgentNodeStatus.RUNNING)
+                }
+                is AgentEvent.TurnStarted -> {
+                    graphPanel.bumpIterations(nodeId, event.iteration)
+                }
+                is AgentEvent.TurnEnded -> {
+                    graphPanel.addDuration(nodeId, event.durationMs)
+                }
+                is AgentEvent.LLMCallCompleted -> {
+                    graphPanel.addTokens(nodeId, (event.tokensIn + event.tokensOut).toLong())
                 }
                 is AgentEvent.AgentCompleted -> {
-                    val name = agentNames[event.sourceAgentId] ?: event.sourceAgentId.take(8)
-                    graphPanel.addOrUpdateAgent(
-                        agentId = event.sourceAgentId,
-                        name = name,
-                        depth = 0,
-                        status = AgentNodeStatus.COMPLETED
-                    )
+                    val name = agentNames[nodeId] ?: nodeId.take(8)
+                    graphPanel.addOrUpdateAgent(nodeId, name, depth, AgentNodeStatus.COMPLETED)
+                    // AgentCompleted carries authoritative totals for the root session —
+                    // override accumulated values so final metrics match the completion event.
                     graphPanel.updateMetrics(
-                        agentId = event.sourceAgentId,
-                        iterations = 0,
+                        agentId = nodeId,
+                        iterations = graphPanel.iterationsOf(nodeId),
                         tokens = event.tokensUsed,
                         durationMs = event.durationMs
                     )
                 }
                 is AgentEvent.AgentFailed -> {
-                    val name = agentNames[event.sourceAgentId] ?: event.sourceAgentId.take(8)
-                    graphPanel.addOrUpdateAgent(
-                        agentId = event.sourceAgentId,
-                        name = name,
-                        depth = 0,
-                        status = AgentNodeStatus.FAILED
-                    )
+                    val name = agentNames[nodeId] ?: nodeId.take(8)
+                    graphPanel.addOrUpdateAgent(nodeId, name, depth, AgentNodeStatus.FAILED)
                 }
                 else -> {}
             }
         }
     }
 
+    private fun graphDepthOf(event: AgentEvent): Int = when (event) {
+        is AgentEvent.TurnStarted -> event.depth
+        is AgentEvent.TurnEnded -> event.depth
+        is AgentEvent.LLMCallCompleted -> event.depth
+        is AgentEvent.ToolCalled -> event.depth
+        is AgentEvent.StreamAborted -> event.depth
+        else -> 0
+    }
+
+    private fun graphRunIdOf(event: AgentEvent): String? = when (event) {
+        is AgentEvent.TurnStarted -> event.runId
+        is AgentEvent.TurnEnded -> event.runId
+        is AgentEvent.LLMCallCompleted -> event.runId
+        is AgentEvent.ToolCalled -> event.runId
+        is AgentEvent.StreamAborted -> event.runId
+        else -> null
+    }
+
+    private fun graphNodeIdOf(event: AgentEvent, depth: Int): String {
+        val runId = graphRunIdOf(event)
+        return if (depth > 0 && runId != null) runId else event.sourceAgentId
+    }
+
+    private fun ensureNode(nodeId: String, event: AgentEvent, depth: Int) {
+        if (agentNames.containsKey(nodeId)) return
+        val name = when {
+            event is AgentEvent.AgentStarted -> event.agentName
+            depth > 0 -> "Subagent ${nodeId.take(8)}"
+            else -> "Session ${nodeId.take(8)}"
+        }
+        agentNames[nodeId] = name
+        graphPanel.addOrUpdateAgent(nodeId, name, depth, AgentNodeStatus.RUNNING)
+    }
+
     fun toText(): String = buildString {
         appendLine("### Session Trace")
         appendLine()
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/agents/AgentGraphPanel.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/agents/AgentGraphPanel.kt
index 1dab4e29..5d0a5b93 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/agents/AgentGraphPanel.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/agents/AgentGraphPanel.kt
@@ -44,6 +44,34 @@ class AgentGraphPanel : JPanel() {
         }
     }
 
+    /** Raise iteration count to at least [iteration] (TurnStarted events use 1-based monotonic iteration). */
+    fun bumpIterations(agentId: String, iteration: Int) {
+        nodes[agentId]?.apply {
+            if (iteration > iterationCount) {
+                iterationCount = iteration
+                repaint()
+            }
+        }
+    }
+
+    fun addTokens(agentId: String, delta: Long) {
+        if (delta <= 0) return
+        nodes[agentId]?.apply {
+            tokensUsed += delta
+            repaint()
+        }
+    }
+
+    fun addDuration(agentId: String, deltaMs: Long) {
+        if (deltaMs <= 0) return
+        nodes[agentId]?.apply {
+            durationMs += deltaMs
+            repaint()
+        }
+    }
+
+    fun iterationsOf(agentId: String): Int = nodes[agentId]?.iterationCount ?: 0
+
     fun clear() {
         nodes.clear()
         removeAll()
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/autocomplete/CommandAutocompleteItem.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/autocomplete/CommandAutocompleteItem.kt
deleted file mode 100644
index 0709167a..00000000
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/autocomplete/CommandAutocompleteItem.kt
+++ /dev/null
@@ -1,30 +0,0 @@
-package pl.jclab.refio.ui.components.autocomplete
-
-import pl.jclab.refio.api.models.SlashCommand
-
-/**
- * Autocomplete item for slash commands
- */
-data class CommandAutocompleteItem(
-    val command: SlashCommand
-) : AutocompleteItem {
-    override fun getDisplayName(): String {
-        return "/${command.name}"
-    }
-
-    override fun getDescription(): String {
-        return command.description
-    }
-
-    override fun matchesPrefix(prefix: String): Boolean {
-        val cleanPrefix = prefix.removePrefix("/").lowercase()
-        return command.name.lowercase().startsWith(cleanPrefix) ||
-               command.description.lowercase().contains(cleanPrefix)
-    }
-
-    override fun getSortKey(): String {
-        // Builtin commands first, then alphabetically
-        val prefix = if (command.isBuiltin) "0" else "1"
-        return "$prefix:${command.name}"
-    }
-}
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/autocomplete/PromptAutocompleteItem.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/autocomplete/PromptAutocompleteItem.kt
new file mode 100644
index 00000000..0da67a20
--- /dev/null
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/autocomplete/PromptAutocompleteItem.kt
@@ -0,0 +1,30 @@
+package pl.jclab.refio.ui.components.autocomplete
+
+import pl.jclab.refio.api.models.SlashPrompt
+
+/**
+ * Autocomplete item for slash prompts
+ */
+data class PromptAutocompleteItem(
+    val slashPrompt: SlashPrompt
+) : AutocompleteItem {
+    override fun getDisplayName(): String {
+        return "/${slashPrompt.name}"
+    }
+
+    override fun getDescription(): String {
+        return slashPrompt.description
+    }
+
+    override fun matchesPrefix(prefix: String): Boolean {
+        val cleanPrefix = prefix.removePrefix("/").lowercase()
+        return slashPrompt.name.lowercase().startsWith(cleanPrefix) ||
+               slashPrompt.description.lowercase().contains(cleanPrefix)
+    }
+
+    override fun getSortKey(): String {
+        // Builtin prompts first, then alphabetically
+        val prefix = if (slashPrompt.isBuiltin) "0" else "1"
+        return "$prefix:${slashPrompt.name}"
+    }
+}
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/chat/ChangesDialog.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/chat/ChangesDialog.kt
index d47db46a..14812fe2 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/chat/ChangesDialog.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/chat/ChangesDialog.kt
@@ -88,7 +88,7 @@ class ChangesDialog(
         coroutineScope.launch {
             val snapshotContent = try {
                 val router = pl.jclab.refio.services.core.CoreConnectionManager.getInstance().getApiRouter()
-                router.getSnapshotFileContent(snapshotId!!, filePath)
+                router.snapshotRouter.getSnapshotFileContent(snapshotId!!, filePath)
             } catch (e: Exception) {
                 null
             }
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/chat/ChatView.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/chat/ChatView.kt
index b8d31ed5..5673910b 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/chat/ChatView.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/chat/ChatView.kt
@@ -10,7 +10,7 @@ import pl.jclab.refio.api.models.Message
 import pl.jclab.refio.core.services.monitoring.GlobalMetrics
 import pl.jclab.refio.core.services.monitoring.OperationInfo
 import pl.jclab.refio.services.execution.StepExecutionService
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import pl.jclab.refio.services.session.SessionManager
 import pl.jclab.refio.ui.components.chat.bubble.AssistantBubbleRenderer
 import pl.jclab.refio.ui.components.chat.bubble.BaseBubbleRenderer
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/chat/CodeBlockPanel.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/chat/CodeBlockPanel.kt
index 1b154215..8aa4519e 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/chat/CodeBlockPanel.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/chat/CodeBlockPanel.kt
@@ -14,11 +14,11 @@ import com.intellij.openapi.fileTypes.FileTypeManager
 import com.intellij.openapi.fileTypes.PlainTextFileType
 import com.intellij.openapi.project.Project
 import com.intellij.openapi.ui.Messages
-import com.intellij.openapi.vfs.VirtualFileManager
 import com.intellij.ui.components.JBPanel
 import com.intellij.ui.components.JBScrollPane
 import pl.jclab.refio.ui.theme.LCATheme
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
+import pl.jclab.refio.services.project.SafeVfsAccess
 import java.awt.*
 import java.awt.datatransfer.StringSelection
 import java.awt.event.MouseAdapter
@@ -588,7 +588,7 @@ class CodeBlockPanel(
             }
 
             // Refresh file system
-            VirtualFileManager.getInstance().refreshAndFindFileByNioPath(targetPath)?.let { vFile ->
+            SafeVfsAccess.refreshAndFindFile(project, targetPath, logger)?.let { vFile ->
                 // Open file in editor
                 FileEditorManager.getInstance(project).openFile(vFile, true)
             }
@@ -636,7 +636,7 @@ class CodeBlockPanel(
             }
 
             // Open file in editor
-            VirtualFileManager.getInstance().refreshAndFindFileByNioPath(targetPath)?.let { vFile ->
+            SafeVfsAccess.refreshAndFindFile(project, targetPath, logger)?.let { vFile ->
                 FileEditorManager.getInstance(project).openFile(vFile, true)
             }
         }
@@ -668,8 +668,8 @@ class CodeBlockPanel(
             Files.writeString(tempFile, codeBlock.content, StandardCharsets.UTF_8)
 
             // Get virtual files
-            val existingVFile = VirtualFileManager.getInstance().refreshAndFindFileByNioPath(targetPath)
-            val tempVFile = VirtualFileManager.getInstance().refreshAndFindFileByNioPath(tempFile)
+            val existingVFile = SafeVfsAccess.refreshAndFindFile(project, targetPath, logger)
+            val tempVFile = SafeVfsAccess.refreshAndFindFile(project, tempFile, logger)
 
             if (existingVFile == null || tempVFile == null) {
                 showNotification("Error", "Could not open files for comparison", NotificationType.ERROR)
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/chat/ConversationToolbarFactory.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/chat/ConversationToolbarFactory.kt
index 8f6c6e11..b842e13d 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/chat/ConversationToolbarFactory.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/chat/ConversationToolbarFactory.kt
@@ -8,7 +8,7 @@ import com.intellij.openapi.project.Project
 import com.intellij.ui.components.JBPanel
 import kotlinx.coroutines.CoroutineScope
 import kotlinx.coroutines.launch
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import pl.jclab.refio.services.session.SessionManager
 import pl.jclab.refio.ui.theme.LCATheme
 import java.awt.Cursor
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/chat/FileNavigationService.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/chat/FileNavigationService.kt
index 470fb523..396297f8 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/chat/FileNavigationService.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/chat/FileNavigationService.kt
@@ -12,14 +12,14 @@ import com.intellij.openapi.application.ApplicationManager
 import com.intellij.openapi.fileEditor.FileEditorManager
 import com.intellij.openapi.project.Project
 import com.intellij.openapi.vfs.LocalFileSystem
-import com.intellij.openapi.vfs.VirtualFileManager
 import kotlinx.coroutines.CoroutineScope
 import kotlinx.coroutines.Dispatchers
 import kotlinx.coroutines.SupervisorJob
 import kotlinx.coroutines.launch
 import pl.jclab.refio.api.models.ContextReference
 import pl.jclab.refio.api.models.ContextType
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
+import pl.jclab.refio.services.project.SafeVfsAccess
 import java.nio.file.Files
 import java.nio.file.Paths
 
@@ -130,7 +130,7 @@ internal class FileNavigationService(
                         }
 
                         logger.info { "[DIFF] Attempting to load VirtualFile from: $fullPath" }
-                        val vFile = VirtualFileManager.getInstance().findFileByNioPath(fullPath) ?: run {
+                        val vFile = SafeVfsAccess.refreshAndFindFile(project, fullPath, logger) ?: run {
                             logger.error { "[DIFF] VirtualFileManager could not find file: $fullPath" }
                             showNotification("Error", "Could not load file: ${changes.filePath}", NotificationType.ERROR)
                             return@invokeLater
@@ -215,7 +215,7 @@ internal class FileNavigationService(
         return try {
             logger.info { "[SNAPSHOT] Loading snapshot content for: snapshotId=$snapshotId, filePath=$filePath" }
             val router = coreManager.getApiRouter()
-            val content = router.getSnapshotFileContent(snapshotId, filePath)
+            val content = router.snapshotRouter.getSnapshotFileContent(snapshotId, filePath)
             if (content != null) {
                 logger.info { "[SNAPSHOT] Loaded successfully: ${content.length} chars" }
             } else {
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/chat/PromptInputPanel.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/chat/PromptInputPanel.kt
index 5268b4ea..d0535ca9 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/chat/PromptInputPanel.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/chat/PromptInputPanel.kt
@@ -8,7 +8,6 @@ import com.intellij.openapi.actionSystem.DataContext
 import com.intellij.openapi.actionSystem.IdeActions
 import com.intellij.openapi.actionSystem.KeyboardShortcut
 import com.intellij.openapi.application.ApplicationManager
-import com.intellij.openapi.application.ReadAction
 import com.intellij.openapi.editor.Caret
 import com.intellij.openapi.editor.EditorModificationUtil
 import com.intellij.openapi.editor.actionSystem.EditorActionHandler
@@ -34,7 +33,7 @@ import kotlinx.coroutines.withContext
 import pl.jclab.refio.api.models.CodeSnippet
 import pl.jclab.refio.api.models.ContextReference
 import pl.jclab.refio.api.models.ExecutionMode
-import pl.jclab.refio.api.models.SlashCommand
+import pl.jclab.refio.api.models.SlashPrompt
 import pl.jclab.refio.api.models.TaskMode
 import pl.jclab.refio.core.context.ContextProviderRegistry
 import pl.jclab.refio.core.context.ContextSubmenuItem
@@ -43,17 +42,17 @@ import pl.jclab.refio.core.context.ProviderType
 import pl.jclab.refio.core.services.monitoring.GlobalMetrics
 import pl.jclab.refio.core.services.monitoring.OperationInfo
 import pl.jclab.refio.services.execution.StepExecutionService
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import pl.jclab.refio.services.session.SessionManager
 import pl.jclab.refio.ui.completion.RefioCompletionContributor
 import pl.jclab.refio.ui.components.autocomplete.AutocompletePopup
-import pl.jclab.refio.ui.components.autocomplete.CommandAutocompleteItem
+import pl.jclab.refio.ui.components.autocomplete.PromptAutocompleteItem
 import pl.jclab.refio.ui.components.autocomplete.ContextAutocompleteItem
 import pl.jclab.refio.ui.components.autocomplete.ContextValidator
 import pl.jclab.refio.ui.components.input.InputPanelContainer
 import pl.jclab.refio.ui.components.input.SnippetsContainer
 import pl.jclab.refio.ui.theme.LCATheme
-import pl.jclab.refio.api.CoreApiClient
+import pl.jclab.refio.core.api.CoreApiRouter
 import java.awt.*
 import java.awt.event.ComponentAdapter
 import java.awt.event.ComponentEvent
@@ -77,7 +76,7 @@ import javax.swing.BoxLayout as SwingBoxLayout
 class PromptInputPanel(
     private val project: Project,
     private val chatView: ChatView? = null,
-    private val coreApiClient: CoreApiClient? = null
+    private val coreApiClient: CoreApiRouter? = null
 ) : JBPanel<PromptInputPanel>(GridBagLayout()) {
 
     companion object {
@@ -112,7 +111,7 @@ class PromptInputPanel(
 
     // Autocomplete
     private var contextAutocomplete: AutocompletePopup<ContextAutocompleteItem>
-    private var commandAutocomplete: AutocompletePopup<CommandAutocompleteItem>
+    private var promptAutocomplete: AutocompletePopup<PromptAutocompleteItem>
 
     // Current submenu provider ID (for tracking which provider's submenu is shown)
     private var currentSubmenuProviderId: String? = null
@@ -133,8 +132,8 @@ class PromptInputPanel(
     // Job for autocomplete coroutines (cancel previous when starting new)
     private var autocompleteJob: kotlinx.coroutines.Job? = null
 
-    // Cached slash commands (loaded once, used for prepending templates)
-    private var cachedSlashCommands: List<SlashCommand> = emptyList()
+    // Cached slash prompts (loaded once, used for prepending templates)
+    private var cachedSlashPrompts: List<SlashPrompt> = emptyList()
 
     // Context references
     private val contextReferences = mutableListOf<ContextReference>()
@@ -164,8 +163,8 @@ class PromptInputPanel(
             handleContextSelection(item)
         }
 
-        commandAutocomplete = AutocompletePopup { item ->
-            insertSlashCommand(item.command)
+        promptAutocomplete = AutocompletePopup { item ->
+            insertSlashPrompt(item.slashPrompt)
         }
 
         promptEditor = createPromptEditor()
@@ -177,13 +176,13 @@ class PromptInputPanel(
                 updatePromptEditorHeight()
                 onPromptInputChanged()
             }
-        })
+        }, editorShortcutsDisposable)
 
         cs.launch {
             try {
-                loadSlashCommands()
+                loadSlashPrompts()
             } catch (e: Exception) {
-                logger.warn(e) { "Failed to preload slash commands" }
+                logger.warn(e) { "Failed to preload slash prompts" }
             }
         }
 
@@ -643,12 +642,12 @@ class PromptInputPanel(
             return
         }
 
-        // VALIDATE SLASH COMMAND FIRST (before clearing editor)
-        // Process slash command: prepend template to user text
-        val slashProcessedText = processSlashCommand(text)
+        // VALIDATE SLASH PROMPT FIRST (before clearing editor)
+        // Expand slash prompt: prepend its template to the user text
+        val slashProcessedText = processSlashPrompt(text)
         if (slashProcessedText == null) {
-            // Validation failed (slash command not at start) - don't send, keep text in editor
-            logger.warn { "Slash command validation failed, message not sent" }
+            // Validation failed (slash prompt not at start) - don't send, keep text in editor
+            logger.warn { "Slash prompt validation failed, message not sent" }
             return
         }
         val processedText = applyPromptTemplateVariables(slashProcessedText)
@@ -732,10 +731,10 @@ class PromptInputPanel(
     }
 
     private fun loadNoEgressDefault() {
-        val client = coreApiClient ?: CoreApiClient(sessionManager.apiRouter)
+        val client = coreApiClient ?: sessionManager.apiRouter
         cs.launch {
             try {
-                val config = client.getConfig(section = "advanced", scope = "app")
+                val config = client.configRouter.getConfig(section = "advanced", scope = "app")
                 val noEgressDefault = (config.settings["no_egress_default"] as? String).toBoolean()
 
                 if (noEgressDefault) {
@@ -757,11 +756,11 @@ class PromptInputPanel(
     }
 
     /**
-     * Process all slash commands in text.
-     * Replaces each "/command" with its template, supporting multiple commands.
+     * Process all slash prompts in text.
+     * Replaces each "/name" with its template, supporting multiple slash prompts.
      * Format: "text /cmd1 more text /cmd2 end" -> "text TEMPLATE1 more text TEMPLATE2 end"
      *
-     * @return Processed text with all commands replaced
+     * @return Processed text with all slash prompts replaced
      */
     /**
      * Build message text with inlined context refs for mid-execution messages.
@@ -785,11 +784,11 @@ class PromptInputPanel(
         return sb.toString()
     }
 
-    private fun processSlashCommand(text: String): String? {
-        // Find all slash commands using regex
-        // Only match /command after whitespace or at start of text (not in URLs like https://example.com/path)
-        val commandRegex = Regex("""(?<=\s|^)/([\w-]+)""")
-        val matches = commandRegex.findAll(text).toList()
+    private fun processSlashPrompt(text: String): String? {
+        // Find all slash prompts using regex.
+        // Only match /name after whitespace or at start of text (not in URLs like https://example.com/path)
+        val slashRegex = Regex("""(?<=\s|^)/([\w-]+)""")
+        val matches = slashRegex.findAll(text).toList()
 
         if (matches.isEmpty()) {
             return text
@@ -799,25 +798,25 @@ class PromptInputPanel(
         var offset = 0  // Track position shift after replacements
 
         for (match in matches) {
-            val commandName = match.groupValues[1]
-            val command = cachedSlashCommands.find { it.name.equals(commandName, ignoreCase = true) }
+            val promptName = match.groupValues[1]
+            val slashPrompt = cachedSlashPrompts.find { it.name.equals(promptName, ignoreCase = true) }
 
-            if (command == null) {
-                logger.warn { "Slash command not found: /$commandName, skipping" }
+            if (slashPrompt == null) {
+                logger.warn { "Slash prompt not found: /$promptName, skipping" }
                 continue
             }
 
             // Build template with variable substitution
-            var template = command.template
+            var template = slashPrompt.template
 
             // Replace {selection} variable if present
-            if ("selection" in command.variables) {
+            if ("selection" in slashPrompt.variables) {
                 val editor = com.intellij.openapi.fileEditor.FileEditorManager.getInstance(project).selectedTextEditor
                 val selection = editor?.selectionModel?.selectedText ?: ""
                 template = template.replace("{selection}", selection)
             }
 
-            // Replace command with template
+            // Replace the slash prompt with its template
             val originalStart = match.range.first + offset
             val originalEnd = match.range.last + 1 + offset
 
@@ -826,7 +825,7 @@ class PromptInputPanel(
             // Update offset for next replacement
             offset += template.length - match.value.length
 
-            logger.info { "Replaced slash command /$commandName at position ${match.range.first} with template" }
+            logger.info { "Replaced slash prompt /$promptName at position ${match.range.first} with template" }
         }
 
         return result
@@ -1237,7 +1236,7 @@ class PromptInputPanel(
         ensureEnterActionHandlerInstalled()
         return EditorTextField(project, PlainTextFileType.INSTANCE).apply {
             setOneLineMode(false)
-            setPlaceholder("Type a message... (@context, /command, !subagent)")
+            setPlaceholder("Type a message... (@context, /prompt, !subagent)")
             font = LCATheme.editorFont
             preferredSize = Dimension(0, 90)
             minimumSize = Dimension(0, 70)
@@ -1270,7 +1269,7 @@ class PromptInputPanel(
                 }
 
                 contextAutocomplete.attach(editorEx)
-                commandAutocomplete.attach(editorEx)
+                promptAutocomplete.attach(editorEx)
 
                 installEditorKeyBindings(editorEx)
                 updatePromptEditorHeight()
@@ -1334,7 +1333,7 @@ class PromptInputPanel(
         if (editorEx.getUserData(KEY_LISTENERS_INSTALLED) == true) return
         editorEx.putUserData(KEY_LISTENERS_INSTALLED, true)
 
-        val component = editorEx.contentComponent as? JComponent ?: return
+        val component = editorEx.contentComponent
 
         val insertNewlineAction = object : DumbAwareAction() {
             override fun actionPerformed(e: AnActionEvent) {
@@ -1416,12 +1415,12 @@ class PromptInputPanel(
 
         return isNativeLookupVisible ||
                 (contextAutocomplete.isVisible()) ||
-                (commandAutocomplete.isVisible())
+                (promptAutocomplete.isVisible())
     }
 
     private fun getPromptCaretOffset(): Int {
         return promptEditor.editor?.let { editor ->
-            ReadAction.compute<Int, RuntimeException> { editor.caretModel.offset }
+            com.intellij.openapi.application.runReadAction { editor.caretModel.offset }
         } ?: promptEditor.text.length
     }
 
@@ -1532,12 +1531,12 @@ class PromptInputPanel(
             // / autocomplete - only when "/" is the FIRST character in input
             beforeCaret.startsWith("/") && beforeCaret.all { it.isLetterOrDigit() || it == '/' } -> {
                 // Use native IntelliJ completion (RefioCompletionContributor) instead of custom popup
-                commandAutocomplete.hide()
+                promptAutocomplete.hide()
             }
 
             else -> {
                 contextAutocomplete.hide()
-                commandAutocomplete.hide()
+                promptAutocomplete.hide()
             }
         }
     }
@@ -1705,22 +1704,22 @@ class PromptInputPanel(
     }
 
     /**
-     * Load slash commands from backend and cache them
+     * Load slash prompts from backend and cache them
      */
-    private suspend fun loadSlashCommands(): List<SlashCommand> = withContext(Dispatchers.IO) {
-        val fallback = SlashCommand.BUILTINS
+    private suspend fun loadSlashPrompts(): List<SlashPrompt> = withContext(Dispatchers.IO) {
+        val fallback = SlashPrompt.BUILTINS
 
         return@withContext try {
-            val client = coreApiClient ?: CoreApiClient(sessionManager.apiRouter)
-            val response = client.getPromptsByType(pl.jclab.refio.core.db.PromptType.SLASH_COMMAND)
+            val client = coreApiClient ?: sessionManager.apiRouter
+            val response = client.promptsRouter.getPromptsByType(pl.jclab.refio.core.db.PromptType.SLASH_PROMPT)
 
-            val commands = response.prompts
+            val slashPrompts = response.prompts
                 .filter { it.isEnabled }
                 .map { prompt ->
-                    SlashCommand(
+                    SlashPrompt(
                         id = prompt.id,
                         name = prompt.name.removePrefix("/"),
-                        description = prompt.description ?: "Custom command",
+                        description = prompt.description ?: "Custom prompt",
                         template = prompt.content,
                         variables = extractVariablesFromTemplate(prompt.content),
                         category = "custom",
@@ -1728,18 +1727,18 @@ class PromptInputPanel(
                     )
                 }
 
-            val resolved = if (commands.isEmpty()) {
+            val resolved = if (slashPrompts.isEmpty()) {
                 fallback
             } else {
-                commands
+                slashPrompts
             }
 
-            cachedSlashCommands = resolved
-            logger.info { "Loaded ${commands.size} slash commands from database (enabled only)" }
+            cachedSlashPrompts = resolved
+            logger.info { "Loaded ${slashPrompts.size} slash prompts from database (enabled only)" }
             resolved
         } catch (e: Exception) {
-            logger.error(e) { "Failed to load slash commands from database, using built-ins" }
-            cachedSlashCommands = fallback
+            logger.error(e) { "Failed to load slash prompts from database, using built-ins" }
+            cachedSlashPrompts = fallback
             fallback
         }
     }
@@ -1880,16 +1879,16 @@ class PromptInputPanel(
 
 
     /**
-     * Insert slash command name (not template)
-     * Template will be prepended when sending the message
+     * Insert slash prompt name (not template).
+     * Template will be prepended when sending the message.
      */
-    private fun insertSlashCommand(command: SlashCommand) {
-        // Replace typed prefix with full command name + space
-        val commandText = "/${command.name} "
-        promptEditor.text = commandText
-        promptEditor.editor?.caretModel?.moveToOffset(commandText.length)
+    private fun insertSlashPrompt(slashPrompt: SlashPrompt) {
+        // Replace typed prefix with full prompt name + space
+        val promptText = "/${slashPrompt.name} "
+        promptEditor.text = promptText
+        promptEditor.editor?.caretModel?.moveToOffset(promptText.length)
 
-        logger.info { "Inserted slash command: /${command.name}" }
+        logger.info { "Inserted slash prompt: /${slashPrompt.name}" }
     }
 
     /**
@@ -2419,8 +2418,8 @@ class PromptInputPanel(
     private fun loadExecutionModeDefault() {
         cs.launch {
             try {
-                val client = coreApiClient ?: CoreApiClient(sessionManager.apiRouter)
-                val executionModeStr = client.getConfigValue("ui", "execution_mode")
+                val client = coreApiClient ?: sessionManager.apiRouter
+                val executionModeStr = client.configService.get("ui.execution_mode", pl.jclab.refio.core.db.ConfigScope.APP, null)
 
                 // Default to INTERACTIVE if not specified in config
                 val isInteractive = if (executionModeStr != null) {
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/chat/bubble/ChatMessageBubbleRouter.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/chat/bubble/ChatMessageBubbleRouter.kt
index ebb6191f..6d4fe748 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/chat/bubble/ChatMessageBubbleRouter.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/chat/bubble/ChatMessageBubbleRouter.kt
@@ -25,13 +25,14 @@ internal class ChatMessageBubbleRouter(
             else -> otherBubbleRenderer.render(message)
         }
 
-        // Wrap with agent header if message comes from a subagent
+        // Wrap with agent header if message comes from a subagent. Applies to every role so
+        // tool calls, user injections (subagent prompts), and assistant replies all end up under
+        // the same visible group for the agent that produced them.
         val agentName = message.agentName
-        if (agentName != null && message.role == "assistant") {
+        if (agentName != null) {
             val wrapper = JPanel(BorderLayout())
             wrapper.isOpaque = false
 
-            // Add agent header if this is a new agent group
             if (agentName != lastAgentName) {
                 val header = createAgentHeader(agentName, message.agentDepth ?: 0)
                 wrapper.add(header, BorderLayout.NORTH)
@@ -42,9 +43,8 @@ internal class ChatMessageBubbleRouter(
             return wrapper
         }
 
-        if (agentName == null && message.role == "assistant") {
-            lastAgentName = null
-        }
+        // Message outside any subagent group — reset so the next subagent message re-emits the header.
+        lastAgentName = null
 
         return bubble
     }
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/chat/bubble/MessageMetadataExtractor.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/chat/bubble/MessageMetadataExtractor.kt
index d9a3dc67..9eb01346 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/chat/bubble/MessageMetadataExtractor.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/chat/bubble/MessageMetadataExtractor.kt
@@ -5,7 +5,7 @@ import pl.jclab.refio.api.models.ToolCallDisplayInfo
 import pl.jclab.refio.api.models.ToolCallStatus
 import pl.jclab.refio.api.models.ToolDisplayType
 import pl.jclab.refio.api.models.UserContextMetadata
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import pl.jclab.refio.ui.components.chat.CodeChangesData
 import pl.jclab.refio.ui.components.chat.ConversationSummaryMetadata
 import pl.jclab.refio.ui.components.chat.ExecutionSummaryFile
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/context/CollapsibleContextSection.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/context/CollapsibleContextSection.kt
index 68a3a12b..4441e0d5 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/context/CollapsibleContextSection.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/context/CollapsibleContextSection.kt
@@ -5,7 +5,7 @@ import com.intellij.ui.JBColor
 import com.intellij.ui.components.JBLabel
 import com.intellij.ui.components.JBPanel
 import pl.jclab.refio.core.api.ContextSectionTokenInfo
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import pl.jclab.refio.ui.theme.LCATheme
 import java.awt.BorderLayout
 import java.awt.Color
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/context/ContextPanel.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/context/ContextPanel.kt
index b4be3b16..db20f8ca 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/context/ContextPanel.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/context/ContextPanel.kt
@@ -14,7 +14,7 @@ import pl.jclab.refio.core.tools.PathSandbox
 import pl.jclab.refio.core.services.turn.PromptSnapshot
 import pl.jclab.refio.services.session.SessionManager
 import pl.jclab.refio.services.core.CoreConnectionManager
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import pl.jclab.refio.services.notification.NotificationService
 import pl.jclab.refio.core.services.monitoring.GlobalMetrics
 import pl.jclab.refio.core.services.monitoring.OperationInfo
@@ -143,7 +143,6 @@ class ContextPanel(private val project: Project) : JBPanel<ContextPanel>(BorderL
     private val mcpResourcesSection = createSection("MCP Resources", "mcp_resources")
     private val userRequirementsSection = createSection("User Requirements", "user_requirements")
     private val projectInstructionsSection = createSection("Project Instructions", "project_instructions")
-    private val ragFragmentsSection = createSection("RAG Fragments", "rag_fragments")
     private val subtasksSection = createSection("Subtasks", "subtasks")
     private val conversationSection = createSection("Conversation History", "conversation")
     private val recentWorkSection = createSection("Recent Work", "recent_work")
@@ -245,9 +244,6 @@ class ContextPanel(private val project: Project) : JBPanel<ContextPanel>(BorderL
         SectionEntry("mcp_resources", 11, mcpResourcesSection) { context, _ ->
             updateMcpResourcesSection(context)
         },
-        SectionEntry("rag_fragments", 12, ragFragmentsSection) { context, _ ->
-            updateRagFragmentsSection(context)
-        },
         SectionEntry("conversation", 13, conversationSection) { context, _ ->
             updateConversationSection(context)
         },
@@ -335,7 +331,7 @@ class ContextPanel(private val project: Project) : JBPanel<ContextPanel>(BorderL
 
         // Assemble sections in contentPanel
         // Order matches ADR 0040 - Phase 2: Project context first, then task, then history
-        // Priority: Project Meta → Task → User Context → RAG → Conversation → Work History
+        // Priority: Project Meta → Task → User Context → Conversation → Work History
         applySectionOrder(sectionEntries.sortedBy { it.order })
 
         // Debounced refresh listener - collects all events and triggers refresh only once per time window
@@ -572,12 +568,6 @@ class ContextPanel(private val project: Project) : JBPanel<ContextPanel>(BorderL
             } catch (e: Exception) {
                 logger.error(e) { "Failed to refresh context" }
 
-                // Show notification for RAG unavailability (only on first failure)
-                if (e.message?.contains("Ollama service") == true &&
-                    e.message?.contains("unavailable") == true) {
-                    NotificationService.showRagUnavailable(project, "ollama", "http://localhost:11434")
-                }
-
                 SwingUtilities.invokeLater {
                     showError("Failed to load context: ${e.message}")
                 }
@@ -1162,71 +1152,6 @@ class ContextPanel(private val project: Project) : JBPanel<ContextPanel>(BorderL
         projectInstructionsSection.setContent(instructions, html)
     }
 
-    private fun updateRagFragmentsSection(context: pl.jclab.refio.core.api.ProjectContextResponse) {
-        if (context.ragFragments.isEmpty()) {
-            val html = """
-                <html><body style='padding: 5px; word-wrap: break-word;'>
-                No RAG fragments found. Run project indexing and embeddings to enable contextual search.
-                </body></html>
-            """.trimIndent()
-            ragFragmentsSection.setContent(
-                "No RAG fragments found. Run project indexing and embeddings to enable contextual search.",
-                html
-            )
-            return
-        }
-
-        val htmlParts = mutableListOf<String>()
-        val rawParts = mutableListOf<String>()
-        context.ragFragments.take(8).forEach { fragment ->
-            val sourceName = fragment.filePath.substringAfterLast("/").substringAfterLast("\\").take(60)
-            val lines = if (fragment.startLine != null && fragment.endLine != null) {
-                " (lines ${fragment.startLine}-${fragment.endLine})"
-            } else ""
-            val similarity = String.format("%.0f", fragment.similarity * 100)
-            val type = fragment.contentType.lowercase().replaceFirstChar { it.uppercase() }
-
-            htmlParts.add("<b>$sourceName$lines</b> <font color='#808080'>[$type | ${similarity}% match]</font>")
-
-            val escapedContent = fragment.content
-                .replace("&", "&amp;")
-                .replace("<", "&lt;")
-                .replace(">", "&gt;")
-                .replace("\n", "<br>")
-            val preview = if (fragment.content.length > 300) {
-                "${escapedContent.take(300)}..."
-            } else {
-                escapedContent
-            }
-
-            htmlParts.add("<pre style='padding: 5px; white-space: pre-wrap; word-wrap: break-word; overflow-wrap: break-word;'>$preview</pre>")
-            htmlParts.add("<br>")
-        }
-
-        if (context.ragFragments.size > 8) {
-            htmlParts.add("<font color='#808080'>... and ${context.ragFragments.size - 8} more fragments</font>")
-        }
-
-        context.ragFragments.forEach { fragment ->
-            val lines = if (fragment.startLine != null && fragment.endLine != null) {
-                " (lines ${fragment.startLine}-${fragment.endLine})"
-            } else ""
-            val similarity = String.format("%.0f", fragment.similarity * 100)
-            val type = fragment.contentType.lowercase().replaceFirstChar { it.uppercase() }
-            rawParts.add("${fragment.filePath}$lines [$type | ${similarity}% match]")
-            rawParts.add(fragment.content)
-            rawParts.add("")
-        }
-
-        val html = """
-            <html><body style='padding: 5px; word-wrap: break-word;'>
-            ${htmlParts.joinToString("")}
-            </body></html>
-        """.trimIndent()
-        val raw = rawParts.joinToString("\n").trim()
-        ragFragmentsSection.setContent(raw, html)
-    }
-
     private fun updateSubtasksSection(context: pl.jclab.refio.core.api.ProjectContextResponse) {
         if (context.subtasks.isEmpty()) {
             val html = """
@@ -1493,7 +1418,7 @@ class ContextPanel(private val project: Project) : JBPanel<ContextPanel>(BorderL
             """.trimIndent()
             return
         }
-        val promptText = prompt ?: return
+        val promptText = prompt
 
         val structureHtml = buildContextStructureOverview(context)
         val escaped = promptText
@@ -1675,7 +1600,6 @@ class ContextPanel(private val project: Project) : JBPanel<ContextPanel>(BorderL
             "working_memory" -> ContextSection.WORKING_MEMORY
             "recent_work" -> ContextSection.RECENT_WORK
             "user_context", "mcp_resources" -> ContextSection.USER_CONTEXT
-            "rag_fragments" -> ContextSection.RAG_FRAGMENTS
             "conversation", "messages_user", "messages_assistant", "messages_system", "messages_other" -> ContextSection.CONVERSATION
             "key_components",
             "dependencies",
@@ -1880,11 +1804,12 @@ class ContextPanel(private val project: Project) : JBPanel<ContextPanel>(BorderL
         val html = """
             <html><body style='padding: 5px; word-wrap: break-word;'>
             <b>Context Stability:</b> ~${stabilityPercent}% unchanged from previous turn<br>
-            <div style='background: #E0E0E0; width: 200px; height: 12px; border-radius: 6px; margin-top: 4px;'>
-                <div style='background: $barColor; width: ${stabilityPercent * 2}px; height: 12px; border-radius: 6px;'></div>
-            </div>
+            <table cellpadding='0' cellspacing='0' style='margin-top: 4px;'><tr>
+                <td style='background: $barColor; width: ${stabilityPercent * 2}px; height: 12px;'></td>
+                <td style='background: #E0E0E0; width: ${(100 - stabilityPercent) * 2}px; height: 12px;'></td>
+            </tr></table>
             <br>
-            <span style='font-size: 0.9em; color: #888;'>
+            <span style='color: #888;'>
             Stable context (project info, conventions) is cached and reused across turns.
             </span>
             </body></html>
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/debug/DebugPanel.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/debug/DebugPanel.kt
index fc1ee2a0..a8b84077 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/debug/DebugPanel.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/debug/DebugPanel.kt
@@ -3,6 +3,7 @@ package pl.jclab.refio.ui.components.debug
 import com.intellij.openapi.project.Project
 import com.intellij.ui.components.JBPanel
 import com.intellij.ui.components.JBScrollPane
+import pl.jclab.refio.core.api.SubtaskResponse
 import pl.jclab.refio.services.session.SessionManager
 import pl.jclab.refio.services.core.CoreConnectionManager
 import pl.jclab.refio.services.logging.LogLevel
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/history/HistoryPanel.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/history/HistoryPanel.kt
index 86228f8f..89b7c680 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/history/HistoryPanel.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/history/HistoryPanel.kt
@@ -11,7 +11,7 @@ import pl.jclab.refio.api.models.ExecutionMode
 import pl.jclab.refio.api.models.Session
 import pl.jclab.refio.api.models.TaskMode
 import pl.jclab.refio.api.models.TaskStatus
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import pl.jclab.refio.services.session.SessionManager
 import kotlinx.coroutines.*
 import java.awt.BorderLayout
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/input/SnippetsContainer.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/input/SnippetsContainer.kt
index e06baaa2..a39601a4 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/input/SnippetsContainer.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/input/SnippetsContainer.kt
@@ -3,7 +3,7 @@ package pl.jclab.refio.ui.components.input
 import com.intellij.ui.components.JBPanel
 import com.intellij.ui.components.JBScrollPane
 import pl.jclab.refio.api.models.CodeSnippet
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import pl.jclab.refio.ui.theme.LCATheme
 import java.awt.BorderLayout
 import java.awt.Dimension
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/rag/RagViewPanel.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/rag/RagViewPanel.kt
index 590a4395..4bd3e368 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/rag/RagViewPanel.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/rag/RagViewPanel.kt
@@ -11,7 +11,7 @@ import pl.jclab.refio.ui.theme.LCATheme
 import pl.jclab.refio.core.db.RagContentType
 import pl.jclab.refio.core.config.ConfigKeys
 import pl.jclab.refio.services.core.CoreConnectionManager
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import pl.jclab.refio.services.session.SessionManager
 import kotlinx.coroutines.*
 import kotlinx.coroutines.flow.collectLatest
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/steps/StepsQueueView.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/steps/StepsQueueView.kt
index cfc42c73..9465cac1 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/steps/StepsQueueView.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/steps/StepsQueueView.kt
@@ -7,9 +7,9 @@ import com.intellij.ui.components.JBPanel
 import com.intellij.ui.components.JBScrollPane
 import com.intellij.util.ui.UIUtil
 import pl.jclab.refio.ui.theme.LCATheme
-import pl.jclab.refio.api.models.SubtaskDto
+import pl.jclab.refio.core.api.SubtaskResponse
 import pl.jclab.refio.services.execution.StepExecutionService
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import pl.jclab.refio.services.session.SessionManager
 import pl.jclab.refio.ui.components.common.PromptDialog
 import kotlinx.coroutines.*
@@ -22,6 +22,8 @@ import java.time.Instant
 import java.time.ZoneId
 import java.time.format.DateTimeFormatter
 import javax.swing.*
+import javax.swing.event.AncestorEvent
+import javax.swing.event.AncestorListener
 import javax.swing.filechooser.FileNameExtensionFilter
 
 /**
@@ -111,6 +113,43 @@ class StepsQueueView(private val project: Project) : JBPanel<StepsQueueView>(Bor
                 }
             }
         }
+
+        // Auto-refresh subtasks when the panel becomes visible in the hierarchy — tab switches
+        // in the Agents/RAG/Debug/Logs/API tab-pane don't re-run init(), so without this the
+        // StateFlow shows whatever was last cached from a prior session tick.
+        addAncestorListener(object : AncestorListener {
+            override fun ancestorAdded(event: AncestorEvent?) {
+                if (sessionManager.activeSession.value == null) return
+                cs.launch {
+                    try {
+                        sessionManager.refreshSubtasks()
+                    } catch (e: Exception) {
+                        logger.error(e) { "Auto-refresh on panel show failed" }
+                    }
+                }
+            }
+            override fun ancestorRemoved(event: AncestorEvent?) {}
+            override fun ancestorMoved(event: AncestorEvent?) {}
+        })
+
+        // Auto-refresh on step state changes. AgentEventBus.ToolCalled fires when a tool
+        // finishes (subtask PENDING → SUCCESS/FAILED) and TurnEnded covers the case where a
+        // whole turn batch just completed. Pulling the fresh list into the StateFlow keeps the
+        // view updated without waiting for the user to click ⟳ Refresh.
+        cs.launch {
+            sessionManager.apiRouter.agentEventBus.events.collect { event ->
+                val triggersRefresh = event is pl.jclab.refio.core.agents.events.AgentEvent.ToolCalled ||
+                    event is pl.jclab.refio.core.agents.events.AgentEvent.TurnEnded
+                if (!triggersRefresh) return@collect
+                val activeSessionId = sessionManager.activeSession.value?.id ?: return@collect
+                if (event.sessionId != activeSessionId) return@collect
+                try {
+                    sessionManager.refreshSubtasks()
+                } catch (e: Exception) {
+                    logger.debug { "Auto-refresh on tool event failed: ${e.message}" }
+                }
+            }
+        }
     }
 
     private fun showEmptyState() {
@@ -120,7 +159,7 @@ class StepsQueueView(private val project: Project) : JBPanel<StepsQueueView>(Bor
         stepsPanel.repaint()
     }
 
-    private fun updateSteps(subtasks: List<SubtaskDto>) {
+    private fun updateSteps(subtasks: List<SubtaskResponse>) {
         logger.debug { "updateSteps called with ${subtasks.size} subtasks" }
 
         SwingUtilities.invokeLater {
@@ -158,7 +197,7 @@ class StepsQueueView(private val project: Project) : JBPanel<StepsQueueView>(Bor
         }
     }
 
-    private fun createStepItem(subtask: SubtaskDto, stepNumber: Int): JPanel {
+    private fun createStepItem(subtask: SubtaskResponse, stepNumber: Int): JPanel {
         return JPanel().apply {
             layout = BoxLayout(this, BoxLayout.Y_AXIS)
             background = getBackgroundColorForStatus(subtask.status)
@@ -186,7 +225,7 @@ class StepsQueueView(private val project: Project) : JBPanel<StepsQueueView>(Bor
     }
 
 
-    private fun createCompactStepHeader(subtask: SubtaskDto, stepNumber: Int): JPanel {
+    private fun createCompactStepHeader(subtask: SubtaskResponse, stepNumber: Int): JPanel {
         return JPanel(BorderLayout(8, 0)).apply {
             isOpaque = false
             border = LCATheme.paddedBorder(6, 8)
@@ -238,7 +277,7 @@ class StepsQueueView(private val project: Project) : JBPanel<StepsQueueView>(Bor
      * Create compact action buttons for header (icons only)
      */
     @Suppress("UNUSED_PARAMETER")
-    private fun createCompactActions(subtask: SubtaskDto, _stepNumber: Int): JPanel {
+    private fun createCompactActions(subtask: SubtaskResponse, _stepNumber: Int): JPanel {
 
         val panel = JPanel(FlowLayout(FlowLayout.LEFT, 6, 0))
 
@@ -294,7 +333,7 @@ class StepsQueueView(private val project: Project) : JBPanel<StepsQueueView>(Bor
     /**
      * Create action buttons for a step
      */
-    private fun createStepActions(subtask: SubtaskDto, stepNumber: Int): JPanel {
+    private fun createStepActions(subtask: SubtaskResponse, stepNumber: Int): JPanel {
         return JPanel(FlowLayout(FlowLayout.CENTER, 2, 0)).apply {
             isOpaque = false
 
@@ -414,7 +453,7 @@ class StepsQueueView(private val project: Project) : JBPanel<StepsQueueView>(Bor
     }
 
 
-    private fun createToolsSection(subtask: SubtaskDto): JPanel? {
+    private fun createToolsSection(subtask: SubtaskResponse): JPanel? {
         val tools = parseToolsFromSubtask(subtask) ?: return null
 
         return JPanel().apply {
@@ -461,7 +500,7 @@ class StepsQueueView(private val project: Project) : JBPanel<StepsQueueView>(Bor
         }
     }
 
-    private fun createMetricsSection(subtask: SubtaskDto): JPanel? {
+    private fun createMetricsSection(subtask: SubtaskResponse): JPanel? {
         // Calculate execution time from timestamps
         val startedAtMs = subtask.startedAt
         val finishedAtMs = subtask.completedAt ?: subtask.finishedAt
@@ -526,7 +565,7 @@ class StepsQueueView(private val project: Project) : JBPanel<StepsQueueView>(Bor
     }
 
     @Suppress("UNCHECKED_CAST")
-    private fun parseToolsFromSubtask(subtask: SubtaskDto): List<ToolInfo>? {
+    private fun parseToolsFromSubtask(subtask: SubtaskResponse): List<ToolInfo>? {
         try {
             // Try step_plan_json first (from prepare endpoint)
             subtask.stepPlanJson?.let { json ->
@@ -603,7 +642,7 @@ class StepsQueueView(private val project: Project) : JBPanel<StepsQueueView>(Bor
         }
     }
 
-    private fun installHeaderClickHandler(component: JComponent, subtask: SubtaskDto, stepNumber: Int) {
+    private fun installHeaderClickHandler(component: JComponent, subtask: SubtaskResponse, stepNumber: Int) {
         component.addMouseListener(object : MouseAdapter() {
             override fun mouseClicked(e: MouseEvent) {
                 if (e.clickCount == 1 && SwingUtilities.isLeftMouseButton(e)) {
@@ -613,11 +652,11 @@ class StepsQueueView(private val project: Project) : JBPanel<StepsQueueView>(Bor
         })
     }
 
-    private fun showStepDetailsDialog(subtask: SubtaskDto, stepNumber: Int) {
+    private fun showStepDetailsDialog(subtask: SubtaskResponse, stepNumber: Int) {
         StepDetailsDialog(project, subtask, stepNumber, this::buildStepDetailsText).show()
     }
 
-    private fun buildStepDetailsText(subtask: SubtaskDto, stepNumber: Int): String {
+    private fun buildStepDetailsText(subtask: SubtaskResponse, stepNumber: Int): String {
         val completedAt = subtask.completedAt ?: subtask.finishedAt
         return buildString {
             appendLine("Step $stepNumber")
@@ -817,9 +856,9 @@ data class ToolInfo(
 
 private class StepDetailsDialog(
     project: Project,
-    private val subtask: SubtaskDto,
+    private val subtask: SubtaskResponse,
     private val stepNumber: Int,
-    private val detailsProvider: (SubtaskDto, Int) -> String
+    private val detailsProvider: (SubtaskResponse, Int) -> String
 ) : DialogWrapper(project, true) {
 
     private val dateFormatter = DateTimeFormatter.ofPattern("yyyy-MM-dd HH:mm:ss").withZone(ZoneId.systemDefault())
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/toolbar/SessionContextBar.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/toolbar/SessionContextBar.kt
index b8dec103..d23f24f5 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/toolbar/SessionContextBar.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/toolbar/SessionContextBar.kt
@@ -6,7 +6,7 @@ import pl.jclab.refio.api.models.ExecutionMode
 import pl.jclab.refio.api.models.Session
 import pl.jclab.refio.services.session.SessionManager
 import pl.jclab.refio.services.execution.StepExecutionService
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import kotlinx.coroutines.*
 import java.awt.Dimension
 import java.awt.FlowLayout
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/toolbar/StatusBar.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/toolbar/StatusBar.kt
index 4dcfc251..c90d7ccc 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/toolbar/StatusBar.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/toolbar/StatusBar.kt
@@ -1,6 +1,6 @@
 package pl.jclab.refio.ui.components.toolbar
 
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import com.intellij.openapi.project.Project
 import com.intellij.ui.components.JBLabel
 import com.intellij.ui.components.JBPanel
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/toolbar/ToolbarComponent.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/toolbar/ToolbarComponent.kt
index f1c686cb..04ad3e8d 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/toolbar/ToolbarComponent.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/components/toolbar/ToolbarComponent.kt
@@ -3,7 +3,7 @@ package pl.jclab.refio.ui.components.toolbar
 import com.intellij.icons.AllIcons
 import com.intellij.openapi.project.Project
 import com.intellij.ui.components.JBPanel
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import pl.jclab.refio.services.session.SessionManager
 import pl.jclab.refio.ui.theme.LCATheme
 import kotlinx.coroutines.CoroutineScope
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/listeners/SwingWorkflowListener.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/listeners/SwingWorkflowListener.kt
index c8fa58bd..beb0d551 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/listeners/SwingWorkflowListener.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/listeners/SwingWorkflowListener.kt
@@ -1,106 +1,16 @@
 package pl.jclab.refio.ui.listeners
 
 import kotlinx.coroutines.CoroutineScope
-import kotlinx.coroutines.Dispatchers
-import kotlinx.coroutines.launch
-import pl.jclab.refio.api.models.Message
-import pl.jclab.refio.core.workflow.WorkflowEventListener
-import pl.jclab.refio.services.session.SessionStateManager
-import pl.jclab.refio.services.logging.dualLogger
-import java.util.UUID
-
-private val logger = dualLogger("SwingWorkflowListener")
+import pl.jclab.refio.core.session.DefaultWorkflowStreamingListener
+import pl.jclab.refio.core.session.SessionStateManager
 
+/**
+ * Plugin-facing alias for the platform-agnostic [DefaultWorkflowStreamingListener].
+ * Kept only so existing UI call-sites continue to compile; subclass unchanged.
+ */
 class SwingWorkflowListener(
-    private val taskId: String,
-    private val stateManager: SessionStateManager,
-    private val scope: CoroutineScope,
-    private val streamingEnabled: Boolean
-) : WorkflowEventListener {
-
-    private var messageId: String? = null
-    private var lastUiUpdate = 0L
-    private var formatter: ((String) -> String)? = null
-
-    override fun onChatStarted() {
-        startStreamingMessage("", "assistant") { it }
-    }
-
-    override fun onPlanningStarted() {
-        startStreamingMessage("Planning...", "assistant") { accumulated ->
-            "Planning...\n\n```json\n$accumulated\n```"
-        }
-    }
-
-    override fun onSubagentStarted(subagentName: String) {
-        startStreamingMessage("[$subagentName] ...", "assistant") { accumulated ->
-            "[$subagentName]\n\n$accumulated"
-        }
-    }
-
-    override fun onStreamChunk(chunk: String) {
-        if (!streamingEnabled) return
-        val currentId = messageId ?: return
-        val now = System.currentTimeMillis()
-        if (now - lastUiUpdate < 500L) return
-        lastUiUpdate = now
-
-        val format = formatter ?: { it }
-        scope.launch(Dispatchers.IO) {
-            stateManager.updateMessages { messages ->
-                messages.map { msg ->
-                    if (msg.id == currentId) {
-                        msg.copy(content = format(chunk), lastChunkAt = now)
-                    } else {
-                        msg
-                    }
-                }
-            }
-        }
-    }
-
-    override fun onStreamComplete(content: String) {
-        val currentId = messageId ?: return
-        val format = formatter ?: { it }
-        scope.launch(Dispatchers.IO) {
-            stateManager.updateMessages { messages ->
-                messages.map { msg ->
-                    if (msg.id == currentId) {
-                        msg.copy(
-                            content = format(content),
-                            isStreaming = false,
-                            lastChunkAt = System.currentTimeMillis()
-                        )
-                    } else {
-                        msg
-                    }
-                }
-            }
-        }
-    }
-
-    private fun startStreamingMessage(
-        initialContent: String,
-        role: String,
-        format: (String) -> String
-    ) {
-        formatter = format
-
-        val id = UUID.randomUUID().toString()
-        messageId = id
-
-        val message = Message(
-            id = id,
-            taskId = taskId,
-            role = role,
-            content = initialContent,
-            isStreaming = streamingEnabled,
-            createdAt = System.currentTimeMillis()
-        )
-
-        scope.launch(Dispatchers.IO) {
-            stateManager.appendMessage(message)
-        }
-    }
-}
-
+    taskId: String,
+    stateManager: SessionStateManager,
+    scope: CoroutineScope,
+    streamingEnabled: Boolean,
+) : DefaultWorkflowStreamingListener(taskId, stateManager, scope, streamingEnabled)
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/AdvancedSettingsPanel.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/AdvancedSettingsPanel.kt
index 488248fe..6deb6673 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/AdvancedSettingsPanel.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/AdvancedSettingsPanel.kt
@@ -5,8 +5,8 @@ import com.intellij.ui.components.JBLabel
 import com.intellij.ui.components.JBPanel
 import com.intellij.ui.components.JBScrollPane
 import com.intellij.ui.components.JBTextField
-import pl.jclab.refio.api.CoreApiClient
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.api.CoreApiRouter
+import pl.jclab.refio.core.logging.dualLogger
 import pl.jclab.refio.ui.theme.LCATheme
 import java.awt.BorderLayout
 import java.awt.FlowLayout
@@ -22,7 +22,7 @@ import javax.swing.*
  */
 class AdvancedSettingsPanel(
     private val onSettingChanged: (section: String, key: String, value: Any) -> Unit,
-    private val coreApiClient: CoreApiClient?
+    private val coreApiClient: CoreApiRouter?
 ) : JBPanel<AdvancedSettingsPanel>(BorderLayout()) {
 
     private val logger = dualLogger("AdvancedSettingsPanel")
@@ -90,7 +90,7 @@ class AdvancedSettingsPanel(
                     return@addItemListener
                 }
                 val (section, key) = pl.jclab.refio.core.services.ConfigKeyUtil.split(
-                    pl.jclab.refio.core.services.ConfigService.KEY_NO_EGRESS_DEFAULT
+                    pl.jclab.refio.core.config.ConfigKeys.NO_EGRESS_DEFAULT.key
                 )
                 onSettingChanged(section, key, isSelected)
             }
@@ -102,7 +102,7 @@ class AdvancedSettingsPanel(
                     return@addItemListener
                 }
                 val (section, key) = pl.jclab.refio.core.services.ConfigKeyUtil.split(
-                    pl.jclab.refio.core.services.ConfigService.KEY_READ_ONLY_MODE
+                    pl.jclab.refio.core.config.ConfigKeys.READ_ONLY_MODE.key
                 )
                 onSettingChanged(section, key, isSelected)
             }
@@ -156,7 +156,7 @@ class AdvancedSettingsPanel(
                         return@addChangeListener
                     }
                     val (section, key) = pl.jclab.refio.core.services.ConfigKeyUtil.split(
-                        pl.jclab.refio.core.services.ConfigService.KEY_TOOL_EXECUTION_TIMEOUT
+                        pl.jclab.refio.core.config.ConfigKeys.TOOL_EXECUTION_TIMEOUT.key
                     )
                     onSettingChanged(section, key, toolExecutionSlider.value)
                 }
@@ -186,7 +186,7 @@ class AdvancedSettingsPanel(
                         return@addChangeListener
                     }
                     val (section, key) = pl.jclab.refio.core.services.ConfigKeyUtil.split(
-                        pl.jclab.refio.core.services.ConfigService.KEY_API_CALL_TIMEOUT
+                        pl.jclab.refio.core.config.ConfigKeys.API_CALL_TIMEOUT.key
                     )
                     onSettingChanged(section, key, apiCallSlider.value)
                 }
@@ -214,7 +214,7 @@ class AdvancedSettingsPanel(
                         }
                         val value = text.toIntOrNull() ?: 10
                         val (section, key) = pl.jclab.refio.core.services.ConfigKeyUtil.split(
-                            pl.jclab.refio.core.services.ConfigService.KEY_MAX_FILE_SIZE
+                            pl.jclab.refio.core.config.ConfigKeys.MAX_FILE_SIZE.key
                         )
                         onSettingChanged(section, key, value)
                     }
@@ -235,7 +235,7 @@ class AdvancedSettingsPanel(
                         }
                         val value = text.toIntOrNull() ?: 128000
                         val (section, key) = pl.jclab.refio.core.services.ConfigKeyUtil.split(
-                            pl.jclab.refio.core.services.ConfigService.KEY_MAX_CONTEXT_SIZE
+                            pl.jclab.refio.core.config.ConfigKeys.MAX_CONTEXT_SIZE.key
                         )
                         onSettingChanged(section, key, value)
                     }
@@ -256,7 +256,7 @@ class AdvancedSettingsPanel(
                         }
                         val value = text.toIntOrNull() ?: 8192
                         val (section, key) = pl.jclab.refio.core.services.ConfigKeyUtil.split(
-                            pl.jclab.refio.core.services.ConfigService.KEY_MAX_OUTPUT_SIZE
+                            pl.jclab.refio.core.config.ConfigKeys.MAX_OUTPUT_SIZE.key
                         )
                         onSettingChanged(section, key, value)
                     }
@@ -290,7 +290,7 @@ class AdvancedSettingsPanel(
                         return@addChangeListener
                     }
                     val (section, key) = pl.jclab.refio.core.services.ConfigKeyUtil.split(
-                        pl.jclab.refio.core.services.ConfigService.KEY_AUTO_OPTIMIZE_PERCENTAGE
+                        pl.jclab.refio.core.config.ConfigKeys.AUTO_OPTIMIZE_PERCENTAGE.key
                     )
                     onSettingChanged(section, key, autoOptimizeSlider.value)
                 }
@@ -393,8 +393,8 @@ class AdvancedSettingsPanel(
 
         try {
             logger.info { "Loading advanced configuration" }
-            val advancedConfig = coreApiClient.getConfig(section = "advanced", scope = "app")
-            val limitsConfig = coreApiClient.getConfig(section = "limits", scope = "app")
+            val advancedConfig = coreApiClient.configRouter.getConfig(section = "advanced", scope = "app")
+            val limitsConfig = coreApiClient.configRouter.getConfig(section = "limits", scope = "app")
 
             applyAdvancedConfig(advancedConfig.settings)
             applyLimitsConfig(limitsConfig.settings)
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/ApiLogsPanel.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/ApiLogsPanel.kt
index ded294e4..1c148890 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/ApiLogsPanel.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/ApiLogsPanel.kt
@@ -7,7 +7,7 @@ import com.intellij.util.ui.JBUI
 import pl.jclab.refio.core.db.ApiLog
 import pl.jclab.refio.core.db.ApiLogStatistics
 import pl.jclab.refio.ui.theme.LCATheme
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import kotlinx.coroutines.*
 import java.awt.*
 import java.awt.event.MouseAdapter
@@ -25,7 +25,7 @@ import javax.swing.table.DefaultTableModel
  * Displays API call logs with filtering, export, and management capabilities
  */
 class ApiLogsPanel(
-    private val coreApiClient: pl.jclab.refio.api.CoreApiClient?,
+    private val coreApiClient: pl.jclab.refio.core.api.CoreApiRouter?,
     private val autoLoadOnInit: Boolean = true
 ) : JBPanel<ApiLogsPanel>(BorderLayout()) {
 
@@ -250,17 +250,17 @@ class ApiLogsPanel(
                 logger.info { "Loading API logs..." }
 
                 // Load statistics
-                val stats = coreApiClient?.router?.getApiLogStatistics()
+                val stats = coreApiClient?.apiLogsRouter?.getApiLogStatistics()
                 statistics = stats
 
                 // Load logs
-                val logs = coreApiClient?.router?.getRecentApiLogs(50) ?: emptyList()
+                val logs = coreApiClient?.apiLogsRouter?.getRecentApiLogs(50) ?: emptyList()
                 allLogs = logs
 
                 // Load filter options
-                val providers = coreApiClient?.router?.getDistinctProviders() ?: emptyList()
-                val models = coreApiClient?.router?.getDistinctModels() ?: emptyList()
-                val sources = coreApiClient?.router?.getDistinctSources() ?: emptyList()
+                val providers = coreApiClient?.apiLogsRouter?.getDistinctProviders() ?: emptyList()
+                val models = coreApiClient?.apiLogsRouter?.getDistinctModels() ?: emptyList()
+                val sources = coreApiClient?.apiLogsRouter?.getDistinctSources() ?: emptyList()
 
                 ApplicationManager.getApplication().invokeLater {
                     updateStatistics(stats)
@@ -379,7 +379,7 @@ class ApiLogsPanel(
                 val model = if (selectedModel == "All") null else selectedModel
                 val source = if (selectedSource == "All") null else selectedSource
 
-                val filteredLogs = coreApiClient?.router?.getFilteredApiLogs(
+                val filteredLogs = coreApiClient?.apiLogsRouter?.getFilteredApiLogs(
                     provider = provider,
                     model = model,
                     source = source,
@@ -425,7 +425,7 @@ class ApiLogsPanel(
         coroutineScope.launch {
             try {
                 logger.info { "Exporting all API logs to CSV..." }
-                val csvContent = coreApiClient?.router?.exportAllApiLogsToCsv()
+                val csvContent = coreApiClient?.apiLogsRouter?.exportAllApiLogsToCsv()
                     ?: throw IllegalStateException("Failed to export logs")
 
                 ApplicationManager.getApplication().invokeLater {
@@ -444,7 +444,7 @@ class ApiLogsPanel(
         coroutineScope.launch {
             try {
                 logger.info { "Exporting all API logs to JSON..." }
-                val jsonContent = coreApiClient?.router?.exportAllApiLogsToJson()
+                val jsonContent = coreApiClient?.apiLogsRouter?.exportAllApiLogsToJson()
                     ?: throw IllegalStateException("Failed to export logs")
 
                 ApplicationManager.getApplication().invokeLater {
@@ -494,7 +494,7 @@ class ApiLogsPanel(
         coroutineScope.launch {
             try {
                 logger.info { "Deleting all API logs..." }
-                val deleted = coreApiClient?.router?.deleteAllApiLogs()
+                val deleted = coreApiClient?.apiLogsRouter?.deleteAllApiLogs()
                     ?: throw IllegalStateException("Failed to delete logs")
 
                 ApplicationManager.getApplication().invokeLater {
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/ContextSettingsPanel.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/ContextSettingsPanel.kt
index 490b33c4..77553056 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/ContextSettingsPanel.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/ContextSettingsPanel.kt
@@ -14,7 +14,7 @@ import pl.jclab.refio.core.config.ConfigKeys as TypedConfigKeys
 import pl.jclab.refio.core.services.ConfigKeyUtil
 import pl.jclab.refio.core.services.ConfigService
 import pl.jclab.refio.services.core.CoreConnectionManager
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import pl.jclab.refio.ui.theme.LCATheme
 import java.awt.BorderLayout
 import java.awt.Dimension
@@ -69,7 +69,7 @@ class ContextSettingsPanel(
     private lateinit var bm25BField: JBTextField
     private lateinit var embeddingModelField: JBTextField
     private lateinit var indexStatsLabel: JLabel
-    private val defaultIgnorePathsText = ConfigService.DEFAULT_RAG_IGNORED_DIRECTORIES.joinToString("\n")
+    private val defaultIgnorePathsText = pl.jclab.refio.core.config.ConfigKeys.RAG_IGNORED_DIRECTORIES.default.joinToString("\n")
     private var indexJob: Job? = null
     private var embeddingJob: Job? = null
     private var searchSettingsSaveJob: Job? = null
@@ -701,21 +701,21 @@ class ContextSettingsPanel(
 
         SwingUtilities.invokeLater {
             val (thresholdSection, thresholdKey) = ConfigKeyUtil.split(
-                ConfigService.KEY_RAG_SEARCH_SIMILARITY_THRESHOLD
+                pl.jclab.refio.core.config.ConfigKeys.RAG_SEARCH_SIMILARITY_THRESHOLD.key
             )
             onSettingChanged(thresholdSection, thresholdKey, threshold)
 
-            val (topKSection, topKKey) = ConfigKeyUtil.split(ConfigService.KEY_RAG_SEARCH_TOP_K)
+            val (topKSection, topKKey) = ConfigKeyUtil.split(pl.jclab.refio.core.config.ConfigKeys.RAG_SEARCH_TOP_K.key)
             onSettingChanged(topKSection, topKKey, topK)
 
-            val (hybridSection, hybridKey) = ConfigKeyUtil.split(ConfigService.KEY_RAG_SEARCH_HYBRID_ENABLED)
+            val (hybridSection, hybridKey) = ConfigKeyUtil.split(pl.jclab.refio.core.config.ConfigKeys.RAG_SEARCH_HYBRID_ENABLED.key)
             onSettingChanged(hybridSection, hybridKey, hybridEnabled)
 
-            val (weightSection, weightKey) = ConfigKeyUtil.split(ConfigService.KEY_RAG_SEARCH_SEMANTIC_WEIGHT)
+            val (weightSection, weightKey) = ConfigKeyUtil.split(pl.jclab.refio.core.config.ConfigKeys.RAG_SEARCH_SEMANTIC_WEIGHT.key)
             onSettingChanged(weightSection, weightKey, semanticWeight)
 
             val (contextSection, contextKey) = ConfigKeyUtil.split(
-                ConfigService.KEY_RAG_SEARCH_INCLUDE_CONTEXT_CHUNKS
+                pl.jclab.refio.core.config.ConfigKeys.RAG_SEARCH_INCLUDE_CONTEXT_CHUNKS.key
             )
             onSettingChanged(contextSection, contextKey, includeContextChunks)
         }
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/DocsSettingsPanel.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/DocsSettingsPanel.kt
index 09330477..20f0726a 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/DocsSettingsPanel.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/DocsSettingsPanel.kt
@@ -12,7 +12,7 @@ import pl.jclab.refio.ui.theme.LCATheme
 import pl.jclab.refio.core.db.DocIndexingStatus
 import pl.jclab.refio.core.db.DocumentationSource
 import pl.jclab.refio.services.core.CoreConnectionManager
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import kotlinx.coroutines.*
 import java.awt.*
 import java.time.Instant
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/GeneralSettingsPanel.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/GeneralSettingsPanel.kt
index e1aa25b6..d4582056 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/GeneralSettingsPanel.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/GeneralSettingsPanel.kt
@@ -1,12 +1,14 @@
 package pl.jclab.refio.ui.settings
 
+import pl.jclab.refio.core.config.ConfigKeys
+
 import com.intellij.openapi.application.ApplicationManager
 import com.intellij.ui.components.JBCheckBox
 import com.intellij.ui.components.JBPanel
 import pl.jclab.refio.api.models.MultiAgentStrategy
 import pl.jclab.refio.ui.theme.LCATheme
-import pl.jclab.refio.api.CoreApiClient
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.api.CoreApiRouter
+import pl.jclab.refio.core.logging.dualLogger
 import kotlinx.coroutines.*
 import java.awt.GridBagConstraints
 import java.awt.GridBagLayout
@@ -22,7 +24,7 @@ import javax.swing.JSeparator
  */
 class GeneralSettingsPanel(
     private val onSettingChanged: (section: String, key: String, value: Any) -> Unit,
-    private val coreApiClient: CoreApiClient?
+    private val coreApiClient: CoreApiRouter?
 ) : JBPanel<GeneralSettingsPanel>(GridBagLayout()) {
 
     private val logger = dualLogger("GeneralSettingsPanel")
@@ -62,7 +64,7 @@ class GeneralSettingsPanel(
                 if (!isUpdatingProgrammatically) {
                     val isSelected = event.stateChange == ItemEvent.SELECTED
                     val (section, key) = pl.jclab.refio.core.services.ConfigKeyUtil.split(
-                        pl.jclab.refio.core.services.ConfigService.KEY_FORMAT_MARKDOWN
+                        pl.jclab.refio.core.config.ConfigKeys.FORMAT_MARKDOWN.key
                     )
                     onSettingChanged(section, key, isSelected)
                 }
@@ -83,7 +85,7 @@ class GeneralSettingsPanel(
                 if (!isUpdatingProgrammatically) {
                     val isSelected = event.stateChange == ItemEvent.SELECTED
                     val (section, key) = pl.jclab.refio.core.services.ConfigKeyUtil.split(
-                        pl.jclab.refio.core.services.ConfigService.KEY_STREAMING_ENABLED
+                        pl.jclab.refio.core.config.ConfigKeys.STREAMING_ENABLED.key
                     )
                     onSettingChanged(section, key, isSelected)
                 }
@@ -104,7 +106,7 @@ class GeneralSettingsPanel(
                 if (!isUpdatingProgrammatically) {
                     val isSelected = event.stateChange == ItemEvent.SELECTED
                     val (section, key) = pl.jclab.refio.core.services.ConfigKeyUtil.split(
-                        pl.jclab.refio.core.services.ConfigService.KEY_ADVANCED_VIEW
+                        pl.jclab.refio.core.config.ConfigKeys.ADVANCED_VIEW.key
                     )
                     onSettingChanged(section, key, isSelected)
                 }
@@ -137,7 +139,7 @@ class GeneralSettingsPanel(
                 if (!isUpdatingProgrammatically) {
                     val isSelected = event.stateChange == ItemEvent.SELECTED
                     val (section, key) = pl.jclab.refio.core.services.ConfigKeyUtil.split(
-                        pl.jclab.refio.core.services.ConfigService.KEY_UI_ORCHESTRATION_ENABLED
+                        pl.jclab.refio.core.config.ConfigKeys.UI_ORCHESTRATION_ENABLED.key
                     )
                     onSettingChanged(section, key, isSelected)
                     multiAgentStrategyCombo.isEnabled = isSelected
@@ -181,7 +183,7 @@ class GeneralSettingsPanel(
                 if (!isUpdatingProgrammatically) {
                     val strategy = selectedItem as? MultiAgentStrategy ?: return@addActionListener
                     val (section, key) = pl.jclab.refio.core.services.ConfigKeyUtil.split(
-                        pl.jclab.refio.core.services.ConfigService.KEY_UI_MULTI_AGENT_STRATEGY
+                        ConfigKeys.UI_MULTI_AGENT_STRATEGY.key
                     )
                     onSettingChanged(section, key, strategy.name)
                 }
@@ -263,10 +265,10 @@ class GeneralSettingsPanel(
         try {
             logger.info { "Loading general configuration" }
 
-            val config = coreApiClient.getConfig(section = "general", scope = "app")
+            val config = coreApiClient.configRouter.getConfig(section = "general", scope = "app")
             applyGeneralConfig(config.settings)
 
-            val uiConfig = coreApiClient.getConfig(section = "ui", scope = "app")
+            val uiConfig = coreApiClient.configRouter.getConfig(section = "ui", scope = "app")
             applyUiConfig(uiConfig.settings)
         } catch (e: Exception) {
             logger.error(e) { "Failed to load general config" }
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/MCPSettingsPanel.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/MCPSettingsPanel.kt
index 9d276e55..796f4d25 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/MCPSettingsPanel.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/MCPSettingsPanel.kt
@@ -28,7 +28,7 @@ import pl.jclab.refio.core.context.mcp.MCPToolWorkflowConfig
 import pl.jclab.refio.core.context.mcp.MCPToolWorkflowStep
 import pl.jclab.refio.core.context.mcp.MCPToolsExposureMode
 import pl.jclab.refio.core.utils.ProjectIdGenerator
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import pl.jclab.refio.ui.theme.LCATheme
 import java.awt.BorderLayout
 import java.awt.Component
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/ModelsSettingsPanel.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/ModelsSettingsPanel.kt
index 6891b136..f609aa05 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/ModelsSettingsPanel.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/ModelsSettingsPanel.kt
@@ -8,11 +8,11 @@ import com.intellij.ui.components.JBPanel
 import com.intellij.ui.components.JBScrollPane
 import com.intellij.ui.table.JBTable
 import pl.jclab.refio.ui.theme.LCATheme
-import pl.jclab.refio.api.CoreApiClient
+import pl.jclab.refio.core.api.CoreApiRouter
 import pl.jclab.refio.core.api.ModelOperation
 import pl.jclab.refio.core.config.ModelPresetConfig
 import pl.jclab.refio.core.utils.GsonInstance.gson
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import pl.jclab.refio.services.notification.NotificationService
 import kotlinx.coroutines.CoroutineScope
 import kotlinx.coroutines.Dispatchers
@@ -160,7 +160,7 @@ private fun ModelPresetConfig.toModelPreset(): ModelPreset = ModelPreset(
  */
 class ModelsSettingsPanel(
     private val onSettingChanged: (section: String, key: String, value: Any) -> Unit,
-    private val coreApiClient: CoreApiClient?
+    private val coreApiClient: CoreApiRouter?
 ) : JBPanel<ModelsSettingsPanel>(BorderLayout()) {
 
     companion object {
@@ -189,6 +189,12 @@ class ModelsSettingsPanel(
     // Flag to prevent saving when dropdowns are updated programmatically
     private var isUpdatingDropdowns = false
 
+    // Last models shown in the table — used by Show All / Hide All to update
+    // visibility in place instead of refetching from the backend (the model-registry
+    // cache may have been invalidated by a concurrent provider settings save, which
+    // would make fetchIfMissing=false return an empty list).
+    private var currentModels: List<pl.jclab.refio.core.api.ModelInfo> = emptyList()
+
     init {
         border = LCATheme.paddedBorder(LCATheme.margin)
 
@@ -245,7 +251,7 @@ class ModelsSettingsPanel(
         modelsTable = JBTable(object : DefaultTableModel(columnNames, 0) {
             override fun getColumnClass(columnIndex: Int): Class<*> {
                 return when (columnIndex) {
-                    6 -> java.lang.Boolean::class.java  // Show in Dropdown
+                    6 -> Boolean::class.javaObjectType  // Show in Dropdown
                     else -> String::class.java
                 }
             }
@@ -356,7 +362,7 @@ class ModelsSettingsPanel(
                 addAll(ModelPreset.PRESETS)
                 coreApiClient?.let { client ->
                     try {
-                        val yamlPresets = client.getYamlModelPresets().map { it.toModelPreset() }
+                        val yamlPresets = (client.configService.getYamlConfig().models?.presets ?: emptyList()).map { it.toModelPreset() }
                         addAll(yamlPresets)
                     } catch (e: Exception) {
                         logger.warn(e) { "Failed to load YAML model presets" }
@@ -424,7 +430,7 @@ class ModelsSettingsPanel(
         coroutineScope.launch {
             try {
                 // Step 1: Get all models
-                val allModels = coreApiClient?.getModelsWithVisibility() ?: emptyList()
+                val allModels = coreApiClient?.configRouter?.getModelsWithVisibility(fetchIfMissing = false) ?: emptyList()
                 logger.info { "Applying preset against ${allModels.size} available models" }
 
                 fun findMatchingModel(modelFullId: String): pl.jclab.refio.core.api.ModelInfo? {
@@ -459,7 +465,7 @@ class ModelsSettingsPanel(
 
                     val visibilityMap = buildVisibilityMap(allModels, visibleModelIds)
                     markModelVisibilityInitialized()
-                    coreApiClient?.updateModelsVisibility(visibilityMap)
+                    coreApiClient?.configRouter?.updateModelsVisibility(visibilityMap)
                 } else {
                     logger.info { "Preset has empty visibleModels; keeping current model visibility unchanged" }
                 }
@@ -467,7 +473,7 @@ class ModelsSettingsPanel(
                 // Step 3: Set default models for each operation
                 val defaultMatch = findMatchingModel(preset.defaultModel)
                 if (defaultMatch != null) {
-                    coreApiClient?.setDefaultModel(
+                    coreApiClient?.configRouter?.setDefaultModel(
                         request = pl.jclab.refio.core.api.SetDefaultModelRequest(
                             operation = ModelOperation.DEFAULT,
                             modelId = defaultMatch.id,
@@ -482,7 +488,7 @@ class ModelsSettingsPanel(
                 suspend fun applySpecializedModel(operation: ModelOperation, modelFullId: String) {
                     val match = findMatchingModel(modelFullId)
                     if (match != null) {
-                        coreApiClient?.setDefaultModel(
+                        coreApiClient?.configRouter?.setDefaultModel(
                             request = pl.jclab.refio.core.api.SetDefaultModelRequest(
                                 operation = operation,
                                 modelId = match.id,
@@ -492,7 +498,7 @@ class ModelsSettingsPanel(
                         )
                     } else {
                         logger.warn { "Preset model not available for $operation; falling back to inherit: $modelFullId" }
-                        coreApiClient?.setDefaultModel(
+                        coreApiClient?.configRouter?.setDefaultModel(
                             request = pl.jclab.refio.core.api.SetDefaultModelRequest(
                                 operation = operation,
                                 modelId = pl.jclab.refio.core.services.ConfigService.INHERIT_MODEL_VALUE,
@@ -511,7 +517,7 @@ class ModelsSettingsPanel(
                 logger.info { "Preset '${preset.name}' applied successfully" }
 
                 // Step 4: Reload UI to reflect changes
-                val updatedModels = coreApiClient?.getModelsWithVisibility() ?: emptyList()
+                val updatedModels = coreApiClient?.configRouter?.getModelsWithVisibility(fetchIfMissing = false) ?: emptyList()
                 val visibleCount = updatedModels.count { it.showInDropdown }
 
                 ApplicationManager.getApplication().invokeLater {
@@ -678,7 +684,7 @@ class ModelsSettingsPanel(
 
                 logger.info { "Refreshing models from all providers" }
 
-                val models = coreApiClient?.refreshAllModels()
+                val models = coreApiClient?.configRouter?.refreshAllModels()
                     ?: throw Exception("CoreApiClient not available")
 
                 // Apply smart defaults for new models (preserves existing settings)
@@ -714,72 +720,36 @@ class ModelsSettingsPanel(
         }
     }
 
-    private fun onShowAllModels() {
-        val tableModel = modelsTable.model as DefaultTableModel
-        val rowCount = tableModel.rowCount
-
-        if (rowCount == 0) {
-            logger.warn { "No models to show" }
-            return
-        }
-
-        logger.info { "Showing all $rowCount models" }
-
-        coroutineScope.launch {
-            try {
-                val visibilityMap = buildVisibilityMapFromTable(showInDropdown = true)
-                markModelVisibilityInitialized()
-                coreApiClient?.updateModelsVisibility(visibilityMap)
-
-                // Reload models from database and update UI
-                val models = coreApiClient?.getModelsWithVisibility() ?: emptyList()
-                ApplicationManager.getApplication().invokeLater {
-                    populateModelsTable(models)
-                    logger.info { "All models shown successfully" }
-                }
-            } catch (e: Exception) {
-                logger.error(e) { "Failed to show all models" }
-                ApplicationManager.getApplication().invokeLater {
-                    JOptionPane.showMessageDialog(
-                        this@ModelsSettingsPanel,
-                        "Failed to show all models:\n${e.message}",
-                        "Error",
-                        JOptionPane.ERROR_MESSAGE
-                    )
-                }
-            }
-        }
-    }
+    private fun onShowAllModels() = setAllModelsVisibility(showInDropdown = true)
 
-    private fun onHideAllModels() {
-        val tableModel = modelsTable.model as DefaultTableModel
-        val rowCount = tableModel.rowCount
+    private fun onHideAllModels() = setAllModelsVisibility(showInDropdown = false)
 
-        if (rowCount == 0) {
-            logger.warn { "No models to hide" }
+    private fun setAllModelsVisibility(showInDropdown: Boolean) {
+        val snapshot = currentModels
+        if (snapshot.isEmpty()) {
+            logger.warn { if (showInDropdown) "No models to show" else "No models to hide" }
             return
         }
 
-        logger.info { "Hiding all $rowCount models" }
+        logger.info { "${if (showInDropdown) "Showing" else "Hiding"} all ${snapshot.size} models" }
 
         coroutineScope.launch {
             try {
-                val visibilityMap = buildVisibilityMapFromTable(showInDropdown = false)
+                val visibilityMap = snapshot.associate { it.id to showInDropdown }
                 markModelVisibilityInitialized()
-                coreApiClient?.updateModelsVisibility(visibilityMap)
+                coreApiClient?.configRouter?.updateModelsVisibility(visibilityMap)
 
-                // Reload models from database and update UI
-                val models = coreApiClient?.getModelsWithVisibility() ?: emptyList()
+                val updatedModels = snapshot.map { it.copy(showInDropdown = showInDropdown) }
                 ApplicationManager.getApplication().invokeLater {
-                    populateModelsTable(models)
-                    logger.info { "All models hidden successfully" }
+                    populateModelsTable(updatedModels)
+                    logger.info { "All models ${if (showInDropdown) "shown" else "hidden"} successfully" }
                 }
             } catch (e: Exception) {
-                logger.error(e) { "Failed to hide all models" }
+                logger.error(e) { "Failed to update visibility for all models" }
                 ApplicationManager.getApplication().invokeLater {
                     JOptionPane.showMessageDialog(
                         this@ModelsSettingsPanel,
-                        "Failed to hide all models:\n${e.message}",
+                        "Failed to update models visibility:\n${e.message}",
                         "Error",
                         JOptionPane.ERROR_MESSAGE
                     )
@@ -797,9 +767,11 @@ class ModelsSettingsPanel(
 
         coroutineScope.launch {
             try {
-                logger.info { "Loading models from backend" }
+                logger.info { "Loading models from cache (no remote fetch)" }
 
-                val models = coreApiClient.getModelsWithVisibility()
+                // Cache-only: never trigger remote provider fetches when opening the panel.
+                // User must press the Refresh button to pull fresh model lists.
+                val models = coreApiClient.configRouter.getModelsWithVisibility(fetchIfMissing = false)
 
                 // Apply smart defaults if this is the first time (no visibility settings yet)
                 val modelsWithDefaults = applySmartDefaults(models)
@@ -831,7 +803,7 @@ class ModelsSettingsPanel(
         }
 
         val visibilityInitialized =
-            coreApiClient.getConfigValue("ui", MODEL_VISIBILITY_INITIALIZED_KEY)?.toBooleanStrictOrNull() == true
+            coreApiClient.configService.get("ui.$MODEL_VISIBILITY_INITIALIZED_KEY", pl.jclab.refio.core.db.ConfigScope.APP, null)?.toBooleanStrictOrNull() == true
         val hasAnyVisibleModels = models.any { it.showInDropdown }
 
         // Older installs may not have the flag set yet. If at least one model is already visible,
@@ -857,7 +829,7 @@ class ModelsSettingsPanel(
             if (model.showInDropdown != defaultVisibility) {
                 // Update DB with smart default
                 try {
-                    coreApiClient?.updateModelVisibility(model.id, defaultVisibility)
+                    coreApiClient.configRouter.updateModelVisibility(model.id, defaultVisibility)
                     model.copy(showInDropdown = defaultVisibility)
                 } catch (e: Exception) {
                     logger.error(e) { "Failed to set default visibility for ${model.id}" }
@@ -875,7 +847,7 @@ class ModelsSettingsPanel(
 
     private fun markModelVisibilityInitialized() {
         try {
-            coreApiClient?.setConfigValue("ui", MODEL_VISIBILITY_INITIALIZED_KEY, "true")
+            coreApiClient?.configService?.set("ui.$MODEL_VISIBILITY_INITIALIZED_KEY", "true", pl.jclab.refio.core.db.ConfigScope.APP, null)
         } catch (e: Exception) {
             logger.warn(e) { "Failed to mark model visibility initialization" }
         }
@@ -934,6 +906,8 @@ class ModelsSettingsPanel(
     }
 
     private fun populateModelsTable(models: List<pl.jclab.refio.core.api.ModelInfo>) {
+        currentModels = models
+
         val tableModel = modelsTable.model as DefaultTableModel
         tableModel.rowCount = 0  // Clear table
 
@@ -971,12 +945,12 @@ class ModelsSettingsPanel(
         coroutineScope.launch {
             try {
                 // Load saved models for each mode
-                val chatModel = coreApiClient?.getDefaultModel(ModelOperation.DEFAULT)
-                val planModel = coreApiClient?.getDefaultModel(ModelOperation.PLAN)
-                val agentModel = coreApiClient?.getDefaultModel(ModelOperation.CODING)
-                val weakModel = coreApiClient?.getDefaultModel(ModelOperation.WEAK)
-                val embeddingModel = coreApiClient?.getDefaultModel(ModelOperation.EMBEDDING)
-                val defaultModelSettings = coreApiClient?.getConfig("default_model", "app")?.settings ?: emptyMap()
+                val chatModel = coreApiClient?.configRouter?.getDefaultModel(ModelOperation.DEFAULT)
+                val planModel = coreApiClient?.configRouter?.getDefaultModel(ModelOperation.PLAN)
+                val agentModel = coreApiClient?.configRouter?.getDefaultModel(ModelOperation.CODING)
+                val weakModel = coreApiClient?.configRouter?.getDefaultModel(ModelOperation.WEAK)
+                val embeddingModel = coreApiClient?.configRouter?.getDefaultModel(ModelOperation.EMBEDDING)
+                val defaultModelSettings = coreApiClient?.configRouter?.getConfig("default_model", "app")?.settings ?: emptyMap()
 
                 ApplicationManager.getApplication().invokeLater {
                     // Set flag to prevent saving during programmatic update
@@ -1091,7 +1065,7 @@ class ModelsSettingsPanel(
         coroutineScope.launch {
             try {
                 markModelVisibilityInitialized()
-                coreApiClient?.updateModelVisibility(
+                coreApiClient?.configRouter?.updateModelVisibility(
                     modelId = modelId,
                     showInDropdown = showInDropdown
                 )
@@ -1099,7 +1073,7 @@ class ModelsSettingsPanel(
                 logger.info { "Model visibility saved: $modelId -> $showInDropdown" }
 
                 // Refresh dropdowns with updated visibility
-                val models = coreApiClient?.getModelsWithVisibility() ?: emptyList()
+                val models = coreApiClient?.configRouter?.getModelsWithVisibility(fetchIfMissing = false) ?: emptyList()
                 ApplicationManager.getApplication().invokeLater {
                     updateModelDropdowns(models)
                 }
@@ -1163,7 +1137,7 @@ class ModelsSettingsPanel(
                 }
 
                 if (modelId == INHERIT_LABEL) {
-                    coreApiClient?.setDefaultModel(
+                    coreApiClient?.configRouter?.setDefaultModel(
                         request = pl.jclab.refio.core.api.SetDefaultModelRequest(
                             operation = operation,
                             modelId = pl.jclab.refio.core.services.ConfigService.INHERIT_MODEL_VALUE,
@@ -1176,7 +1150,7 @@ class ModelsSettingsPanel(
                 }
 
                 // Save using proper API
-                coreApiClient?.setDefaultModel(
+                coreApiClient?.configRouter?.setDefaultModel(
                     request = pl.jclab.refio.core.api.SetDefaultModelRequest(
                         operation = operation,
                         modelId = model,
@@ -1256,19 +1230,6 @@ class ModelsSettingsPanel(
         }
     }
 
-    private fun buildVisibilityMapFromTable(showInDropdown: Boolean): Map<String, Boolean> {
-        val tableModel = modelsTable.model as DefaultTableModel
-        val rowCount = tableModel.rowCount
-        val visibilityMap = HashMap<String, Boolean>(rowCount)
-
-        for (row in 0 until rowCount) {
-            val modelId = tableModel.getValueAt(row, 7) as String
-            visibilityMap[modelId] = showInDropdown
-        }
-
-        return visibilityMap
-    }
-
     private fun buildVisibilityMap(
         allModels: List<pl.jclab.refio.core.api.ModelInfo>,
         modelIdsToShow: Set<String>
@@ -1319,8 +1280,9 @@ class ModelsSettingsPanel(
 
         coroutineScope.launch {
             try {
-                // Just reload all models with visibility settings
-                val allModels = coreApiClient?.getModelsWithVisibility() ?: emptyList()
+                // ProvidersSettingsPanel already fetched the affected provider; reuse the cache here
+                // instead of triggering another full remote refresh.
+                val allModels = coreApiClient?.configRouter?.getModelsWithVisibility(fetchIfMissing = false) ?: emptyList()
 
                 ApplicationManager.getApplication().invokeLater {
                     populateModelsTable(allModels)
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/CommandEditDialog.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/PromptEditDialog.kt
similarity index 85%
rename from intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/CommandEditDialog.kt
rename to intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/PromptEditDialog.kt
index c55640a4..a56a54ed 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/CommandEditDialog.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/PromptEditDialog.kt
@@ -17,11 +17,11 @@ import javax.swing.JLabel
 import javax.swing.JScrollPane
 
 /**
- * Dialog dla dodawania/edycji komendy slash
+ * Dialog for adding/editing a slash prompt
  */
-class CommandEditDialog(
+class PromptEditDialog(
     project: Project?,
-    private val existingCommand: PromptDto? = null
+    private val existingPrompt: PromptDto? = null
 ) : DialogWrapper(project) {
 
     private val nameField = JBTextField(20)
@@ -33,11 +33,11 @@ class CommandEditDialog(
     private val enabledCheckbox = JBCheckBox("Enabled", true)
 
     init {
-        title = if (existingCommand != null) "Edit Command" else "Add Command"
+        title = if (existingPrompt != null) "Edit Prompt" else "Add Prompt"
         init()
 
         // Load existing data
-        existingCommand?.let {
+        existingPrompt?.let {
             nameField.text = it.name
             descriptionField.text = it.description ?: ""
             contentArea.text = it.content
@@ -57,7 +57,7 @@ class CommandEditDialog(
             }
 
             // Name
-            add(JLabel("Command Name (with /):"), gbc)
+            add(JLabel("Prompt Name (with /):"), gbc)
             gbc.gridy++
             add(nameField, gbc)
 
@@ -93,12 +93,12 @@ class CommandEditDialog(
             // Info
             gbc.gridy++
             add(JLabel("<html><font color='gray'>" +
-                "Commands can be used in the prompt input by typing the command name" +
+                "Prompts can be used in the prompt input by typing the prompt name" +
                 "</font></html>"), gbc)
         }
     }
 
-    fun getCommandName(): String {
+    fun getPromptName(): String {
         val name = nameField.text.trim()
         return if (name.startsWith("/")) name else "/$name"
     }
@@ -108,8 +108,8 @@ class CommandEditDialog(
     fun isEnabled(): Boolean = enabledCheckbox.isSelected
 
     override fun doValidate(): ValidationInfo? {
-        if (getCommandName().length <= 1) {  // tylko "/"
-            return ValidationInfo("Command name cannot be empty", nameField)
+        if (getPromptName().length <= 1) {  // only "/"
+            return ValidationInfo("Prompt name cannot be empty", nameField)
         }
         if (getContent().isEmpty()) {
             return ValidationInfo("Content cannot be empty", contentArea)
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/PromptsSettingsPanel.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/PromptsSettingsPanel.kt
index 320acecb..c3673aa4 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/PromptsSettingsPanel.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/PromptsSettingsPanel.kt
@@ -5,11 +5,11 @@ import com.intellij.openapi.project.ProjectManager
 import com.intellij.ui.JBColor
 import com.intellij.ui.components.*
 import com.intellij.ui.table.JBTable
-import pl.jclab.refio.api.CoreApiClient
+import pl.jclab.refio.core.api.CoreApiRouter
 import pl.jclab.refio.ui.theme.LCATheme
 import pl.jclab.refio.core.api.*
 import pl.jclab.refio.core.db.PromptType
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import kotlinx.coroutines.*
 import java.awt.*
 import javax.swing.*
@@ -17,22 +17,22 @@ import javax.swing.table.DefaultTableModel
 
 /**
  * Prompts Settings Panel
- * Manages system prompts and slash commands with list-based UI
+ * Manages system prompts and slash prompts with list-based UI
  */
 class PromptsSettingsPanel(
     private val onSettingChanged: (section: String, key: String, value: Any) -> Unit,
-    private val coreApiClient: CoreApiClient?
+    private val coreApiClient: CoreApiRouter?
 ) : JBPanel<PromptsSettingsPanel>(BorderLayout()) {
 
     private val logger = dualLogger("PromptsSettingsPanel")
     private val coroutineScope = CoroutineScope(Dispatchers.IO + SupervisorJob())
 
     private lateinit var promptsTable: JBTable
-    private lateinit var commandsTable: JBTable
+    private lateinit var slashPromptsTable: JBTable
 
     // Cache for full prompt data (used for row -> ID mapping)
     private val promptsCache = mutableListOf<PromptDto>()
-    private val commandsCache = mutableListOf<PromptDto>()
+    private val slashPromptsCache = mutableListOf<PromptDto>()
 
     init {
         border = LCATheme.paddedBorder(16)
@@ -46,10 +46,10 @@ class PromptsSettingsPanel(
         }
         add(headerPanel, BorderLayout.NORTH)
 
-        // Tabbed pane for system prompts + commands
+        // Tabbed pane for system prompts + slash prompts
         val tabbedPane = JBTabbedPane().apply {
             addTab("System Prompts", createPromptsPanel())
-            addTab("Commands", createCommandsPanel())
+            addTab("Prompts", createSlashPromptsPanel())
         }
 
         add(tabbedPane, BorderLayout.CENTER)
@@ -70,7 +70,7 @@ class PromptsSettingsPanel(
 
             promptsTable = JBTable(object : DefaultTableModel(data, columnNames) {
                 override fun getColumnClass(column: Int): Class<*> {
-                    return if (column == 3) java.lang.Boolean::class.java else String::class.java
+                    return if (column == 3) Boolean::class.javaObjectType else String::class.java
                 }
 
                 override fun isCellEditable(row: Int, column: Int): Boolean {
@@ -103,45 +103,45 @@ class PromptsSettingsPanel(
         }
     }
 
-    private fun createCommandsPanel(): JPanel {
+    private fun createSlashPromptsPanel(): JPanel {
         return JBPanel<JBPanel<*>>(BorderLayout()).apply {
             border = LCATheme.paddedBorder(8)
 
             // Description
             val descLabel =
-                JLabel("<html><font color='gray'>Define slash commands that can be used in the prompt input</font></html>")
+                JLabel("<html><font color='gray'>Define reusable prompts that can be invoked in the chat input by typing /name</font></html>")
             add(descLabel, BorderLayout.NORTH)
 
             // Table
-            val columnNames = arrayOf("Command", "Description", "Enabled")
-            val data = loadCommandsData()
+            val columnNames = arrayOf("Prompt", "Description", "Enabled")
+            val data = loadSlashPromptsData()
 
-            commandsTable = JBTable(object : DefaultTableModel(data, columnNames) {
+            slashPromptsTable = JBTable(object : DefaultTableModel(data, columnNames) {
                 override fun getColumnClass(column: Int): Class<*> {
-                    return if (column == 2) java.lang.Boolean::class.java else String::class.java
+                    return if (column == 2) Boolean::class.javaObjectType else String::class.java
                 }
 
                 override fun isCellEditable(row: Int, column: Int): Boolean {
                     return false
                 }
             })
-            commandsTable.setShowGrid(true)
-            commandsTable.gridColor = JBColor.LIGHT_GRAY
+            slashPromptsTable.setShowGrid(true)
+            slashPromptsTable.gridColor = JBColor.LIGHT_GRAY
 
-            val scrollPane = JScrollPane(commandsTable).apply {
+            val scrollPane = JScrollPane(slashPromptsTable).apply {
                 preferredSize = Dimension(600, 300)
             }
 
             // Buttons panel
             val buttonsPanel = JBPanel<JBPanel<*>>(FlowLayout(FlowLayout.LEFT)).apply {
                 add(JButton("Add").apply {
-                    addActionListener { onAddCommand() }
+                    addActionListener { onAddSlashPrompt() }
                 })
                 add(JButton("Edit").apply {
-                    addActionListener { onEditCommand() }
+                    addActionListener { onEditSlashPrompt() }
                 })
                 add(JButton("Delete").apply {
-                    addActionListener { onDeleteCommand() }
+                    addActionListener { onDeleteSlashPrompt() }
                 })
             }
 
@@ -175,7 +175,7 @@ class PromptsSettingsPanel(
             try {
                 // Get default content on IO dispatcher
                 val defaultContent = withContext(Dispatchers.IO) {
-                    coreApiClient?.getDefaultSystemPromptContent(promptType)
+                    coreApiClient?.promptsRouter?.getDefaultSystemPromptContent(promptType)
                         ?: throw Exception("CoreApiClient not available")
                 }
 
@@ -207,11 +207,11 @@ class PromptsSettingsPanel(
             withContext(Dispatchers.IO) {
                 if (useDefault) {
                     // Reset to default
-                    coreApiClient?.resetSystemPromptToDefault(type)
+                    coreApiClient?.promptsRouter?.resetSystemPromptToDefault(type)
                         ?: throw Exception("CoreApiClient not available")
                 } else {
                     // Update with custom content
-                    coreApiClient?.updateSystemPrompt(
+                    coreApiClient?.promptsRouter?.updateSystemPrompt(
                         UpdateSystemPromptRequest(
                             type = type,
                             content = customContent
@@ -269,22 +269,22 @@ class PromptsSettingsPanel(
         }
     }
 
-    // ========== Commands Actions ==========
+    // ========== Slash Prompts Actions ==========
 
-    private fun onAddCommand() {
+    private fun onAddSlashPrompt() {
         val project = ProjectManager.getInstance().openProjects.firstOrNull()
-        val dialog = CommandEditDialog(project)
+        val dialog = PromptEditDialog(project)
         if (dialog.showAndGet()) {
             coroutineScope.launch {
                 try {
-                    logger.info { "Adding new command: ${dialog.getCommandName()}" }
+                    logger.info { "Adding new slash prompt: ${dialog.getPromptName()}" }
 
                     // Save to backend on IO dispatcher
-                    val command = withContext(Dispatchers.IO) {
-                        coreApiClient?.saveCommand(
-                            SaveCommandRequest(
+                    val saved = withContext(Dispatchers.IO) {
+                        coreApiClient?.promptsRouter?.saveSlashPrompt(
+                            SaveSlashPromptRequest(
                                 id = null,
-                                name = dialog.getCommandName(),
+                                name = dialog.getPromptName(),
                                 content = dialog.getContent(),
                                 description = dialog.getDescription(),
                                 isEnabled = dialog.isEnabled()
@@ -293,51 +293,51 @@ class PromptsSettingsPanel(
                     }
 
                     // Reload data (already handles IO + invokeLater)
-                    reloadCommands()
+                    reloadSlashPrompts()
 
                     // Notify settings changed on EDT
                     ApplicationManager.getApplication().invokeLater {
-                        onSettingChanged("prompts", "command_added", command.prompt.id)
+                        onSettingChanged("prompts", "slash_prompt_added", saved.prompt.id)
                     }
 
-                    logger.info { "Command added: ${command.prompt.id}" }
+                    logger.info { "Slash prompt added: ${saved.prompt.id}" }
                 } catch (e: Exception) {
-                    logger.error(e) { "Failed to add command" }
+                    logger.error(e) { "Failed to add slash prompt" }
                     ApplicationManager.getApplication().invokeLater {
-                        showError("Failed to add command: ${e.message}")
+                        showError("Failed to add prompt: ${e.message}")
                     }
                 }
             }
         }
     }
 
-    private fun onEditCommand() {
-        val selectedRow = commandsTable.selectedRow
+    private fun onEditSlashPrompt() {
+        val selectedRow = slashPromptsTable.selectedRow
         if (selectedRow < 0) {
-            showError("Select a command to edit")
+            showError("Select a prompt to edit")
             return
         }
 
-        if (selectedRow >= commandsCache.size) {
+        if (selectedRow >= slashPromptsCache.size) {
             showError("Invalid row")
             return
         }
 
-        val existingCommand = commandsCache[selectedRow]
+        val existing = slashPromptsCache[selectedRow]
         val project = ProjectManager.getInstance().openProjects.firstOrNull()
-        val dialog = CommandEditDialog(project, existingCommand)
+        val dialog = PromptEditDialog(project, existing)
 
         if (dialog.showAndGet()) {
             coroutineScope.launch {
                 try {
-                    logger.info { "Updating command: ${existingCommand.name}" }
+                    logger.info { "Updating slash prompt: ${existing.name}" }
 
                     // Save to backend on IO dispatcher
                     val updated = withContext(Dispatchers.IO) {
-                        coreApiClient?.saveCommand(
-                            SaveCommandRequest(
-                                id = existingCommand.id,
-                                name = dialog.getCommandName(),
+                        coreApiClient?.promptsRouter?.saveSlashPrompt(
+                            SaveSlashPromptRequest(
+                                id = existing.id,
+                                name = dialog.getPromptName(),
                                 content = dialog.getContent(),
                                 description = dialog.getDescription(),
                                 isEnabled = dialog.isEnabled()
@@ -346,41 +346,41 @@ class PromptsSettingsPanel(
                     }
 
                     // Reload data (already handles IO + invokeLater)
-                    reloadCommands()
+                    reloadSlashPrompts()
 
                     // Notify settings changed on EDT
                     ApplicationManager.getApplication().invokeLater {
-                        onSettingChanged("prompts", "command_updated", updated.prompt.id)
+                        onSettingChanged("prompts", "slash_prompt_updated", updated.prompt.id)
                     }
 
-                    logger.info { "Command updated: ${updated.prompt.id}" }
+                    logger.info { "Slash prompt updated: ${updated.prompt.id}" }
                 } catch (e: Exception) {
-                    logger.error(e) { "Failed to update command" }
+                    logger.error(e) { "Failed to update slash prompt" }
                     ApplicationManager.getApplication().invokeLater {
-                        showError("Failed to update command: ${e.message}")
+                        showError("Failed to update prompt: ${e.message}")
                     }
                 }
             }
         }
     }
 
-    private fun onDeleteCommand() {
-        val selectedRow = commandsTable.selectedRow
+    private fun onDeleteSlashPrompt() {
+        val selectedRow = slashPromptsTable.selectedRow
         if (selectedRow < 0) {
-            showError("Select a command to delete")
+            showError("Select a prompt to delete")
             return
         }
 
-        if (selectedRow >= commandsCache.size) {
+        if (selectedRow >= slashPromptsCache.size) {
             showError("Invalid row")
             return
         }
 
-        val command = commandsCache[selectedRow]
+        val target = slashPromptsCache[selectedRow]
 
         val result = JOptionPane.showConfirmDialog(
             this,
-            "Are you sure you want to delete command '${command.name}'?",
+            "Are you sure you want to delete prompt '${target.name}'?",
             "Confirm Deletion",
             JOptionPane.YES_NO_OPTION,
             JOptionPane.WARNING_MESSAGE
@@ -392,27 +392,27 @@ class PromptsSettingsPanel(
 
         coroutineScope.launch {
             try {
-                logger.info { "Deleting command: ${command.name}" }
+                logger.info { "Deleting slash prompt: ${target.name}" }
 
                 // Delete from backend on IO dispatcher
                 withContext(Dispatchers.IO) {
-                    coreApiClient?.deletePrompt(command.id)
+                    coreApiClient?.promptsRouter?.deletePrompt(target.id)
                         ?: throw Exception("CoreApiClient not available")
                 }
 
                 // Reload data (already handles IO + invokeLater)
-                reloadCommands()
+                reloadSlashPrompts()
 
                 // Notify settings changed on EDT
                 ApplicationManager.getApplication().invokeLater {
-                    onSettingChanged("prompts", "command_deleted", command.name)
+                    onSettingChanged("prompts", "slash_prompt_deleted", target.name)
                 }
 
-                logger.info { "Command deleted: ${command.name}" }
+                logger.info { "Slash prompt deleted: ${target.name}" }
             } catch (e: Exception) {
-                logger.error(e) { "Failed to delete command" }
+                logger.error(e) { "Failed to delete slash prompt" }
                 ApplicationManager.getApplication().invokeLater {
-                    showError("Failed to delete command: ${e.message}")
+                    showError("Failed to delete prompt: ${e.message}")
                 }
             }
         }
@@ -430,7 +430,7 @@ class PromptsSettingsPanel(
         }
 
         return try {
-            val response = coreApiClient.getSystemPrompts()
+            val response = coreApiClient.promptsRouter.getSystemPrompts()
             promptsCache.clear()
             promptsCache.addAll(response.prompts)
 
@@ -458,7 +458,7 @@ class PromptsSettingsPanel(
         }
 
         return try {
-            withContext(Dispatchers.IO) { coreApiClient.getSystemPrompts().prompts }
+            withContext(Dispatchers.IO) { coreApiClient.promptsRouter.getSystemPrompts().prompts }
         } catch (e: Exception) {
             logger.error(e) { "Failed to load system prompts async" }
             emptyList()
@@ -466,48 +466,47 @@ class PromptsSettingsPanel(
     }
 
     /**
-     * Load commands data from API (synchronous for initial UI setup)
+     * Load slash prompts data from API (synchronous for initial UI setup)
      */
-    private fun loadCommandsData(): Array<Array<Any>> {
+    private fun loadSlashPromptsData(): Array<Array<Any>> {
         if (coreApiClient == null) {
-            commandsCache.clear()
+            slashPromptsCache.clear()
             return arrayOf(
-                arrayOf("/example", "Example command description", true)
+                arrayOf("/example", "Example prompt description", true)
             )
         }
 
         return try {
-            val response = coreApiClient.getPromptsByType(PromptType.SLASH_COMMAND)
+            val response = coreApiClient.promptsRouter.getPromptsByType(PromptType.SLASH_PROMPT)
 
-            // Update cache during initial load (Bug #6 fix)
-            commandsCache.clear()
-            commandsCache.addAll(response.prompts)
+            slashPromptsCache.clear()
+            slashPromptsCache.addAll(response.prompts)
 
             response.prompts.map { prompt ->
                 arrayOf<Any>(prompt.name, prompt.description ?: "", prompt.isEnabled)
             }.toTypedArray()
         } catch (e: Exception) {
-            logger.error(e) { "Failed to load commands" }
-            commandsCache.clear()  // Clear cache on error
+            logger.error(e) { "Failed to load slash prompts" }
+            slashPromptsCache.clear()
             arrayOf()
         }
     }
 
     /**
-     * Load commands data asynchronously and update cache
+     * Load slash prompts data asynchronously and update cache
      */
-    private suspend fun loadCommandsDataAsync(): List<PromptDto> {
+    private suspend fun loadSlashPromptsDataAsync(): List<PromptDto> {
         if (coreApiClient == null) {
             return emptyList()
         }
 
         return try {
             val response = withContext(Dispatchers.IO) {
-                coreApiClient.getPromptsByType(PromptType.SLASH_COMMAND)
+                coreApiClient.promptsRouter.getPromptsByType(PromptType.SLASH_PROMPT)
             }
             response.prompts
         } catch (e: Exception) {
-            logger.error(e) { "Failed to load commands async" }
+            logger.error(e) { "Failed to load slash prompts async" }
             emptyList()
         }
     }
@@ -534,13 +533,13 @@ class PromptsSettingsPanel(
     }
 
     /**
-     * Update commands table with new data (must be called on EDT)
+     * Update slash prompts table with new data (must be called on EDT)
      */
-    private fun updateCommandsTable(prompts: List<PromptDto>) {
-        commandsCache.clear()
-        commandsCache.addAll(prompts)
+    private fun updateSlashPromptsTable(prompts: List<PromptDto>) {
+        slashPromptsCache.clear()
+        slashPromptsCache.addAll(prompts)
 
-        val model = commandsTable.model as DefaultTableModel
+        val model = slashPromptsTable.model as DefaultTableModel
         model.rowCount = 0
         prompts.forEach { prompt ->
             model.addRow(arrayOf<Any>(prompt.name, prompt.description ?: "", prompt.isEnabled))
@@ -558,12 +557,12 @@ class PromptsSettingsPanel(
     }
 
     /**
-     * Reload commands from backend asynchronously
+     * Reload slash prompts from backend asynchronously
      */
-    private suspend fun reloadCommands() {
-        val prompts = loadCommandsDataAsync()
+    private suspend fun reloadSlashPrompts() {
+        val prompts = loadSlashPromptsDataAsync()
         ApplicationManager.getApplication().invokeLater {
-            updateCommandsTable(prompts)
+            updateSlashPromptsTable(prompts)
         }
     }
 
@@ -583,7 +582,7 @@ class PromptsSettingsPanel(
         coroutineScope.launch {
             try {
                 reloadPrompts()
-                reloadCommands()
+                reloadSlashPrompts()
             } catch (e: Exception) {
                 logger.error(e) { "Failed to reload prompts panel" }
             }
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/ProvidersSettingsPanel.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/ProvidersSettingsPanel.kt
index b84dc6eb..02211e22 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/ProvidersSettingsPanel.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/ProvidersSettingsPanel.kt
@@ -9,9 +9,9 @@ import com.intellij.ui.components.JBPasswordField
 import com.intellij.ui.components.JBScrollPane
 import com.intellij.ui.components.JBTextField
 import pl.jclab.refio.ui.theme.LCATheme
-import pl.jclab.refio.api.CoreApiClient
+import pl.jclab.refio.core.api.CoreApiRouter
 import pl.jclab.refio.core.services.ConfigService.Companion.DEFAULT_CONTEXT_SIZE
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import kotlinx.coroutines.CoroutineScope
 import kotlinx.coroutines.Dispatchers
 import kotlinx.coroutines.Job
@@ -37,7 +37,7 @@ import javax.swing.event.DocumentListener
  */
 class ProvidersSettingsPanel(
     private val onSettingChanged: (section: String, key: String, value: Any) -> Unit,
-    private val coreApiClient: CoreApiClient?
+    private val coreApiClient: CoreApiRouter?
 ) : JBPanel<ProvidersSettingsPanel>(BorderLayout()), Disposable {
 
     private val logger = dualLogger("ProvidersSettingsPanel")
@@ -48,6 +48,12 @@ class ProvidersSettingsPanel(
     private var saveJobs = mutableMapOf<String, Job>()
     private val saveDebounceMs = 500L
 
+    // Set to true while programmatically populating fields from backend, so the
+    // DocumentListener doesn't misinterpret loaded values as user edits and trigger
+    // autosave + cache invalidation + Ollama model re-fetch on every panel open.
+    @Volatile
+    private var isLoadingFromBackend = false
+
     // Callback for model list refresh
     private var onModelsRefreshed: ((provider: String, models: List<pl.jclab.refio.core.api.ModelInfo>) -> Unit)? =
         null
@@ -225,11 +231,11 @@ class ProvidersSettingsPanel(
 
         providersPanel.add(
             createProviderCard(
-                providerName = "custom_openai",
+                providerName = "generic_openai",
                 fields = listOf(
-                    ProviderField("API Key", FieldType.PASSWORD, "custom_openai_api_key"),
-                    ProviderField("Base URL", FieldType.TEXT, "custom_openai_base_url"),
-                    ProviderField("Model", FieldType.TEXT, "custom_openai_model")
+                    ProviderField("API Key", FieldType.PASSWORD, "generic_openai_api_key"),
+                    ProviderField("Base URL", FieldType.TEXT, "generic_openai_base_url"),
+                    ProviderField("Model", FieldType.TEXT, "generic_openai_model")
                 ),
                 initialStatus = ProviderStatus.NEEDS_CONFIG,
                 description = "OpenAI-compatible provider with custom base URL and optional default model."
@@ -379,6 +385,9 @@ class ProvidersSettingsPanel(
      * Handle field value change with debounce
      */
     private fun onFieldChanged(providerName: String, fieldKey: String, value: String) {
+        if (isLoadingFromBackend) {
+            return
+        }
         // Use lowercase provider name for config keys to match ConfigService expectations
         val jobKey = "${toProviderKey(providerName)}.$fieldKey"
         saveJobs[jobKey]?.cancel()
@@ -388,9 +397,29 @@ class ProvidersSettingsPanel(
 
             logger.debug { "Auto-saving: $jobKey = [REDACTED]" }
 
-            // Save to database
-            ApplicationManager.getApplication().invokeLater {
-                onSettingChanged("providers", jobKey, value)
+            val contextSizeChanged =
+                (providerName.equals("Ollama", ignoreCase = true) && fieldKey == "ollama_context_size") ||
+                    (providerName.equals("LMStudio", ignoreCase = true) && fieldKey == "lmstudio_context_size")
+
+            if (contextSizeChanged) {
+                // Persist synchronously here so the subsequent refresh reads the new value.
+                // Going through SettingsView.onSettingChanged adds another debounce, and
+                // the model refresh would race ahead of the save.
+                try {
+                    coreApiClient?.configRouter?.updateConfig(
+                        section = "providers",
+                        scope = "app",
+                        taskId = null,
+                        settings = mapOf(jobKey to value)
+                    )
+                } catch (e: Exception) {
+                    logger.error(e) { "Failed to save $jobKey before model refresh" }
+                }
+            } else {
+                // Save to database via the standard debounced path
+                ApplicationManager.getApplication().invokeLater {
+                    onSettingChanged("providers", jobKey, value)
+                }
             }
 
             // Re-sync API keys to System.properties (ensures keys work without restart)
@@ -447,7 +476,7 @@ class ProvidersSettingsPanel(
                 val config = buildProviderConfig(providerName, state.fields)
 
                 // Call backend API
-                val result = coreApiClient?.testProviderConnection(
+                val result = coreApiClient?.configRouter?.testProviderConnection(
                     provider = toProviderKey(providerName),
                     config = config
                 ) ?: throw Exception("CoreApiClient not available")
@@ -545,10 +574,10 @@ class ProvidersSettingsPanel(
                 "context_size" to getFieldValue(fields["lmstudio_context_size"]).ifEmpty { DEFAULT_CONTEXT_SIZE.toString() }
             )
 
-            "custom_openai" -> mapOf(
-                "api_key" to getFieldValue(fields["custom_openai_api_key"]),
-                "base_url" to getFieldValue(fields["custom_openai_base_url"]),
-                "model" to getFieldValue(fields["custom_openai_model"])
+            "generic_openai" -> mapOf(
+                "api_key" to getFieldValue(fields["generic_openai_api_key"]),
+                "base_url" to getFieldValue(fields["generic_openai_base_url"]),
+                "model" to getFieldValue(fields["generic_openai_model"])
             )
 
             "zai" -> mapOf(
@@ -611,7 +640,7 @@ class ProvidersSettingsPanel(
             try {
                 logger.info { "Refreshing models list for $providerName" }
 
-                val models = coreApiClient?.refreshProviderModels(
+                val models = coreApiClient?.configRouter?.refreshProviderModels(
                     provider = toProviderKey(providerName)
                 ) ?: emptyList()
 
@@ -658,7 +687,7 @@ class ProvidersSettingsPanel(
             try {
                 logger.info { "Loading providers configuration" }
 
-                val config = coreApiClient.getConfig(section = "providers", scope = "app")
+                val config = coreApiClient.configRouter.getConfig(section = "providers", scope = "app")
 
                 ApplicationManager.getApplication().invokeLater {
                     applyProvidersConfig(config.settings)
@@ -675,32 +704,37 @@ class ProvidersSettingsPanel(
     private fun applyProvidersConfig(settings: Map<String, Any>) {
         logger.info { "Applying providers config: ${settings.keys}" }
 
-        providerStates.forEach { (providerName, state) ->
-            state.fields.forEach { (fieldKey, textField) ->
-                // Use lowercase provider name to match saved keys
-                val configKey = "${toProviderKey(providerName)}.$fieldKey"
-                val value = settings[configKey] as? String
-
-                logger.debug { "Looking for key: $configKey, found: ${value != null}" }
-
-                val effectiveValue = if (configKey == "zai.zai_base_url") {
-                    when (value?.trimEnd('/')) {
-                        "https://api.z.ai/v1" -> "https://api.z.ai/api/coding/paas/v4"
-                        "https://api.z.ai/api/paas/v4" -> "https://api.z.ai/api/coding/paas/v4"
-                        else -> value
+        isLoadingFromBackend = true
+        try {
+            providerStates.forEach { (providerName, state) ->
+                state.fields.forEach { (fieldKey, textField) ->
+                    // Use lowercase provider name to match saved keys
+                    val configKey = "${toProviderKey(providerName)}.$fieldKey"
+                    val value = settings[configKey] as? String
+
+                    logger.debug { "Looking for key: $configKey, found: ${value != null}" }
+
+                    val effectiveValue = if (configKey == "zai.zai_base_url") {
+                        when (value?.trimEnd('/')) {
+                            "https://api.z.ai/v1" -> "https://api.z.ai/api/coding/paas/v4"
+                            "https://api.z.ai/api/paas/v4" -> "https://api.z.ai/api/coding/paas/v4"
+                            else -> value
+                        }
+                    } else {
+                        value
                     }
-                } else {
-                    value
-                }
 
-                if (effectiveValue != null && effectiveValue.isNotEmpty()) {
-                    logger.info { "Setting $configKey to [REDACTED] (length=${effectiveValue.length})" }
-                    setFieldValue(textField, effectiveValue)
-                    updateProviderStatus(providerName, ProviderStatus.CONFIGURED)
-                } else {
-                    logger.debug { "No value for $configKey" }
+                    if (effectiveValue != null && effectiveValue.isNotEmpty()) {
+                        logger.info { "Setting $configKey to [REDACTED] (length=${effectiveValue.length})" }
+                        setFieldValue(textField, effectiveValue)
+                        updateProviderStatus(providerName, ProviderStatus.CONFIGURED)
+                    } else {
+                        logger.debug { "No value for $configKey" }
+                    }
                 }
             }
+        } finally {
+            isLoadingFromBackend = false
         }
     }
 
@@ -713,7 +747,7 @@ class ProvidersSettingsPanel(
             "Gemini" -> "gemini"
             "LMStudio" -> "lmstudio"
             "ZAI" -> "zai"
-            "custom_openai" -> "custom_openai"
+            "generic_openai" -> "generic_openai"
             else -> providerName.lowercase()
         }
     }
@@ -728,12 +762,17 @@ class ProvidersSettingsPanel(
     fun reload() {
         logger.info { "Reloading providers configuration" }
 
-        // Clear all fields
-        providerStates.forEach { (providerName, state) ->
-            state.fields.values.forEach { field ->
-                setFieldValue(field, "")
+        isLoadingFromBackend = true
+        try {
+            // Clear all fields
+            providerStates.forEach { (providerName, state) ->
+                state.fields.values.forEach { field ->
+                    setFieldValue(field, "")
+                }
+                updateProviderStatus(providerName, ProviderStatus.NEEDS_CONFIG)
             }
-            updateProviderStatus(providerName, ProviderStatus.NEEDS_CONFIG)
+        } finally {
+            isLoadingFromBackend = false
         }
 
         // Reload from backend
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/SettingsView.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/SettingsView.kt
index 6e058021..ee838074 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/SettingsView.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/SettingsView.kt
@@ -1,12 +1,14 @@
 package pl.jclab.refio.ui.settings
 
+import pl.jclab.refio.core.config.ConfigKeys
+
 import com.intellij.openapi.application.ApplicationManager
 import com.intellij.openapi.project.Project
 import com.intellij.ui.components.JBPanel
 import com.intellij.ui.components.JBTabbedPane
 import pl.jclab.refio.services.notification.NotificationService
 import pl.jclab.refio.ui.theme.LCATheme
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import kotlinx.coroutines.*
 import java.awt.BorderLayout
 import java.awt.FlowLayout
@@ -23,7 +25,7 @@ import javax.swing.*
  */
 class SettingsView(
     private val project: Project,
-    private val coreApiClient: pl.jclab.refio.api.CoreApiClient?,
+    private val coreApiClient: pl.jclab.refio.core.api.CoreApiRouter?,
     private val onBack: () -> Unit
 ) : JBPanel<SettingsView>(BorderLayout()) {
 
@@ -148,7 +150,7 @@ class SettingsView(
     /**
      * Auto-save mechanism - overloaded version that accepts full config key from ConfigService.
      *
-     * @param fullKey Full configuration key from ConfigService constants (e.g. ConfigService.KEY_API_CALL_TIMEOUT)
+     * @param fullKey Full configuration key from ConfigService constants (e.g. ConfigKeys.API_CALL_TIMEOUT.key)
      * @param value New value for the setting
      */
     fun onSettingChanged(fullKey: String, value: Any) {
@@ -170,7 +172,7 @@ class SettingsView(
 
             // Call backend API
             val settings = mapOf(key to value)
-            coreApiClient.updateConfig(
+            coreApiClient.configRouter.updateConfig(
                 section = section,
                 scope = "app",
                 taskId = null,
@@ -245,7 +247,7 @@ class SettingsView(
             try {
                 logger.info { "Exporting configuration to user config: ${configPath.absolutePath}" }
 
-                val configService = coreApiClient?.router?.configService
+                val configService = coreApiClient?.configService
                     ?: throw IllegalStateException("ConfigService not available")
 
                 configService.exportToYaml(configPath, includeApiKeys)
@@ -302,7 +304,7 @@ class SettingsView(
             try {
                 logger.info { "Exporting configuration to project config: ${configPath.absolutePath}" }
 
-                val configService = coreApiClient?.router?.configService
+                val configService = coreApiClient?.configService
                     ?: throw IllegalStateException("ConfigService not available")
 
                 // Export without API keys for project config
@@ -359,7 +361,7 @@ class SettingsView(
             try {
                 logger.info { "Reloading configuration from YAML file: ${configPath.absolutePath}" }
 
-                val configService = coreApiClient?.router?.configService
+                val configService = coreApiClient?.configService
                     ?: throw IllegalStateException("ConfigService not available")
 
                 val updatedCount = configService.reloadFromYaml()
@@ -407,7 +409,7 @@ class SettingsView(
                 logger.info { "Resetting all settings to defaults" }
 
                 // Call backend API
-                coreApiClient?.resetAllSettingsToDefaults()
+                coreApiClient?.configRouter?.resetAllSettingsToDefaults()
 
                 ApplicationManager.getApplication().invokeLater {
                     // Reload all panels
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/SubagentSettingsPanel.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/SubagentSettingsPanel.kt
index 31a9123a..96897404 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/SubagentSettingsPanel.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/SubagentSettingsPanel.kt
@@ -6,10 +6,10 @@ import com.intellij.ui.JBColor
 import com.intellij.ui.components.JBPanel
 import com.intellij.ui.components.JBScrollPane
 import com.intellij.ui.table.JBTable
-import pl.jclab.refio.api.CoreApiClient
+import pl.jclab.refio.core.api.CoreApiRouter
 import pl.jclab.refio.core.subagents.models.SubagentInfo
 import pl.jclab.refio.core.subagents.models.SubagentScope
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import pl.jclab.refio.ui.theme.LCATheme
 import kotlinx.coroutines.CoroutineScope
 import kotlinx.coroutines.Dispatchers
@@ -30,7 +30,7 @@ private val logger = dualLogger("SubagentSettingsPanel")
  */
 class SubagentSettingsPanel(
     private val onSettingChanged: (section: String, key: String, value: Any) -> Unit,
-    private val coreApiClient: CoreApiClient?
+    private val coreApiClient: CoreApiRouter?
 ) : JBPanel<SubagentSettingsPanel>(BorderLayout()), Disposable {
 
     private lateinit var subagentsTable: JBTable
@@ -81,7 +81,7 @@ class SubagentSettingsPanel(
 
             override fun getColumnClass(column: Int): Class<*> {
                 return when (column) {
-                    5 -> java.lang.Boolean::class.java  // Enabled checkbox
+                    5 -> Boolean::class.javaObjectType  // Enabled checkbox
                     else -> String::class.java
                 }
             }
@@ -197,7 +197,7 @@ class SubagentSettingsPanel(
             try {
                 logger.info { "Loading subagents (including disabled and builtin)" }
                 // includeDisabled = true to show all subagents in admin panel
-                val subagents = coreApiClient.listSubagents(includeDisabled = true)
+                val subagents = coreApiClient.subagentRouter?.listSubagents(includeDisabled = true) ?: emptyList()
 
                 ApplicationManager.getApplication().invokeLater {
                     populateTable(subagents)
@@ -254,7 +254,7 @@ class SubagentSettingsPanel(
 
         coroutineScope.launch {
             try {
-                coreApiClient?.updateSubagent(name, enabled = enabled)
+                coreApiClient?.subagentRouter?.updateSubagent(name, enabled = enabled)
                 logger.info { "Subagent enabled updated: $name -> $enabled" }
 
                 ApplicationManager.getApplication().invokeLater {
@@ -286,7 +286,7 @@ class SubagentSettingsPanel(
         if (dialog.isOk) {
             coroutineScope.launch {
                 try {
-                    coreApiClient?.createSubagent(
+                    coreApiClient?.subagentRouter?.createSubagent(
                         name = dialog.nameField.text,
                         description = dialog.descriptionField.text,
                         systemPrompt = dialog.systemPromptArea.text,
@@ -338,7 +338,7 @@ class SubagentSettingsPanel(
         // Load full subagent definition
         coroutineScope.launch {
             try {
-                val subagent = coreApiClient?.getSubagent(name)
+                val subagent = coreApiClient?.subagentRouter?.getSubagent(name)
                 if (subagent == null) {
                     ApplicationManager.getApplication().invokeLater {
                         JOptionPane.showMessageDialog(
@@ -364,7 +364,7 @@ class SubagentSettingsPanel(
                     if (!isBuiltin && dialog.isOk) {
                         coroutineScope.launch {
                             try {
-                                coreApiClient?.updateSubagent(
+                                coreApiClient.subagentRouter?.updateSubagent(
                                     name = name,
                                     description = dialog.descriptionField.text,
                                     systemPrompt = dialog.systemPromptArea.text,
@@ -372,7 +372,7 @@ class SubagentSettingsPanel(
                                     model = dialog.modelCombo.selectedItem as String,
                                     enabled = dialog.enabledCheck.isSelected,
                                     priority = dialog.prioritySpinner.value as Int
-                                )
+                                ) ?: throw IllegalStateException("SubagentRouter not available")
 
                                 logger.info { "Subagent updated: $name" }
 
@@ -438,7 +438,7 @@ class SubagentSettingsPanel(
 
         coroutineScope.launch {
             try {
-                coreApiClient?.deleteSubagent(name)
+                coreApiClient?.subagentRouter?.deleteSubagent(name)
 
                 logger.info { "Subagent deleted: $name" }
 
@@ -463,7 +463,7 @@ class SubagentSettingsPanel(
     private fun onRefreshSubagents() {
         coroutineScope.launch {
             try {
-                coreApiClient?.refreshSubagents()
+                coreApiClient?.subagentRouter?.refresh()
                 ApplicationManager.getApplication().invokeLater {
                     loadSubagents()
                 }
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/ToolsSettingsPanel.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/ToolsSettingsPanel.kt
index df1fe768..2837a9bd 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/ToolsSettingsPanel.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/settings/ToolsSettingsPanel.kt
@@ -3,16 +3,11 @@ package pl.jclab.refio.ui.settings
 import com.intellij.ui.JBColor
 import com.intellij.ui.components.JBPanel
 import com.intellij.ui.table.JBTable
-import pl.jclab.refio.api.CoreApiClient
+import pl.jclab.refio.core.api.CoreApiRouter
 import pl.jclab.refio.core.api.ToolDefinitionInfo
-import pl.jclab.refio.core.tools.security.AllowedCommand
 import pl.jclab.refio.core.tools.security.CommandRule
 import pl.jclab.refio.core.tools.security.CommandRuleDefaults
-import pl.jclab.refio.core.tools.security.CommandWhitelistConfig
-import pl.jclab.refio.core.tools.security.CommandWhitelistDefaults
-import pl.jclab.refio.core.tools.security.RuleAction
-import pl.jclab.refio.core.tools.security.WhitelistMode
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import pl.jclab.refio.ui.theme.LCATheme
 import kotlinx.coroutines.CoroutineScope
 import kotlinx.coroutines.Dispatchers
@@ -32,19 +27,15 @@ private val logger = dualLogger("ToolsSettingsPanel")
  */
 class ToolsSettingsPanel(
     private val onSettingChanged: (section: String, key: String, value: Any) -> Unit,
-    private val coreApiClient: CoreApiClient?
+    private val coreApiClient: CoreApiRouter?
 ) : JBPanel<ToolsSettingsPanel>(BorderLayout()) {
 
     private lateinit var toolsTable: JBTable
-    private lateinit var whitelistEnabledCheckbox: JCheckBox
-    private lateinit var whitelistModeCombo: JComboBox<String>
-    private lateinit var whitelistTable: JBTable
     private lateinit var commandRulesTable: JBTable
     private val coroutineScope = CoroutineScope(Dispatchers.IO + SupervisorJob())
 
     // Flaga blokująca auto-save podczas ładowania
     private var isLoadingPermissions = false
-    private var isLoadingWhitelist = false
 
     init {
         border = BorderFactory.createCompoundBorder(
@@ -55,15 +46,12 @@ class ToolsSettingsPanel(
             LCATheme.paddedBorder(16)
         )
 
-        // Main content — vertical stack inside scroll pane (like ProvidersSettingsPanel)
         val contentPanel = JPanel().apply {
             layout = BoxLayout(this, BoxLayout.Y_AXIS)
             border = LCATheme.paddedBorder(8, 0, 0, 0)
             add(createToolsTable())
             add(Box.createVerticalStrut(12))
             add(createCommandRulesPanel())
-            add(Box.createVerticalStrut(12))
-            add(createTerminalWhitelistPanel())
         }
 
         val scrollPane = com.intellij.ui.components.JBScrollPane(contentPanel).apply {
@@ -74,9 +62,7 @@ class ToolsSettingsPanel(
 
         add(scrollPane, BorderLayout.CENTER)
 
-        // Załaduj aktualne ustawienia z DB
         loadToolDefinitions()
-        loadTerminalWhitelist()
     }
 
     private fun createToolsTable(): JComponent {
@@ -84,14 +70,13 @@ class ToolsSettingsPanel(
 
         toolsTable = JBTable(object : DefaultTableModel(columnNames, 0) {
             override fun isCellEditable(row: Int, column: Int): Boolean {
-                return column == 2 || column == 3 // Plan Mode and Agent Mode columns
+                return column == 2 || column == 3
             }
 
             override fun getColumnClass(column: Int): Class<*> {
                 return String::class.java
             }
         }).apply {
-            // Auto-save przy zmianie
             model.addTableModelListener { event ->
                 if (event.type == TableModelEvent.UPDATE) {
                     val row = event.firstRow
@@ -104,23 +89,35 @@ class ToolsSettingsPanel(
             }
         }
 
-        // Set up Plan Mode column with combo box editor
         val planModeColumn = toolsTable.columnModel.getColumn(2)
         val planModeCombo = JComboBox(arrayOf("On", "Ask", "Off"))
         planModeColumn.cellEditor = DefaultCellEditor(planModeCombo)
 
-        // Set up Agent Mode column with combo box editor
         val agentModeColumn = toolsTable.columnModel.getColumn(3)
         val agentModeCombo = JComboBox(arrayOf("On", "Ask", "Off"))
         agentModeColumn.cellEditor = DefaultCellEditor(agentModeCombo)
 
-        // Set column widths
-        toolsTable.columnModel.getColumn(0).preferredWidth = 180 // Tool Name
-        toolsTable.columnModel.getColumn(1).preferredWidth = 320 // Description
-        toolsTable.columnModel.getColumn(2).preferredWidth = 90  // Plan Mode
-        toolsTable.columnModel.getColumn(3).preferredWidth = 90  // Agent Mode
+        // Flexible layout: fix mode columns, let Description stretch.
+        toolsTable.autoResizeMode = JTable.AUTO_RESIZE_SUBSEQUENT_COLUMNS
+        toolsTable.columnModel.getColumn(0).apply {
+            minWidth = 120
+            preferredWidth = 160
+        }
+        toolsTable.columnModel.getColumn(1).apply {
+            minWidth = 160
+            preferredWidth = 280
+        }
+        planModeColumn.apply {
+            minWidth = 80
+            maxWidth = 110
+            preferredWidth = 90
+        }
+        agentModeColumn.apply {
+            minWidth = 80
+            maxWidth = 110
+            preferredWidth = 90
+        }
 
-        // Custom renderer for mode columns
         val modeRenderer = object : DefaultTableCellRenderer() {
             override fun getTableCellRendererComponent(
                 table: JTable?,
@@ -151,104 +148,10 @@ class ToolsSettingsPanel(
 
         return JScrollPane(toolsTable).apply {
             border = LCATheme.customLineBorder(LCATheme.grayColor, 1)
-            preferredSize = Dimension(700, 250)
-            minimumSize = Dimension(300, 150)
-        }
-    }
-
-    private fun createTerminalWhitelistPanel(): JComponent {
-        val panel = JBPanel<JBPanel<*>>(BorderLayout()).apply {
-            border = BorderFactory.createCompoundBorder(
-                BorderFactory.createTitledBorder(
-                    LCATheme.customLineBorder(LCATheme.borderColor, 1),
-                    "Terminal Command Whitelist"
-                ),
-                LCATheme.paddedBorder(8)
-            )
-            preferredSize = Dimension(700, 310)
-        }
-
-        whitelistEnabledCheckbox = JCheckBox("Enabled", true)
-        whitelistModeCombo = JComboBox(arrayOf(
-            WhitelistMode.WHITELIST_ONLY.name,
-            WhitelistMode.WHITELIST_PLUS_DENY.name
-        ))
-
-        val controls = JPanel(FlowLayout(FlowLayout.LEFT, 8, 0)).apply {
-            add(whitelistEnabledCheckbox)
-            add(JLabel("Mode:"))
-            add(whitelistModeCombo)
-        }
-
-        val whitelistColumns = arrayOf(
-            "Enabled",
-            "Program",
-            "Aliases (comma-separated)",
-            "Allowed Subcommands",
-            "Blocked Subcommands",
-            "Blocked Flags",
-            "Blocked Arg Patterns",
-            "Max Args",
-            "Confirm"
-        )
-
-        whitelistTable = JBTable(object : DefaultTableModel(whitelistColumns, 0) {
-            override fun isCellEditable(row: Int, column: Int): Boolean = true
-
-            override fun getColumnClass(column: Int): Class<*> {
-                return when (column) {
-                    0, 8 -> Boolean::class.java
-                    else -> String::class.java
-                }
-            }
-        }).apply {
-            rowHeight = 26
-            setShowGrid(true)
-            gridColor = JBColor.LIGHT_GRAY
-        }
-
-        whitelistTable.columnModel.getColumn(0).preferredWidth = 60
-        whitelistTable.columnModel.getColumn(1).preferredWidth = 120
-        whitelistTable.columnModel.getColumn(2).preferredWidth = 150
-        whitelistTable.columnModel.getColumn(3).preferredWidth = 170
-        whitelistTable.columnModel.getColumn(4).preferredWidth = 170
-        whitelistTable.columnModel.getColumn(5).preferredWidth = 140
-        whitelistTable.columnModel.getColumn(6).preferredWidth = 170
-        whitelistTable.columnModel.getColumn(7).preferredWidth = 70
-        whitelistTable.columnModel.getColumn(8).preferredWidth = 70
-
-        populateWhitelistTable(CommandWhitelistDefaults.DEFAULT_COMMANDS)
-
-        val addButton = JButton("Add Command").apply {
-            addActionListener { addEmptyWhitelistRow() }
-        }
-        val removeButton = JButton("Remove Selected").apply {
-            addActionListener { removeSelectedWhitelistRow() }
-        }
-        val saveButton = JButton("Save Whitelist").apply {
-            addActionListener { saveTerminalWhitelist() }
-        }
-        val resetButton = JButton("Reset to Defaults").apply {
-            addActionListener { resetTerminalWhitelistToDefaults() }
-        }
-
-        val actions = JPanel(FlowLayout(FlowLayout.LEFT, 8, 0)).apply {
-            add(addButton)
-            add(removeButton)
-            add(saveButton)
-            add(resetButton)
-        }
-
-        val centerPanel = JBPanel<JBPanel<*>>(BorderLayout()).apply {
-            add(controls, BorderLayout.NORTH)
-            add(JScrollPane(whitelistTable).apply {
-                border = LCATheme.customLineBorder(LCATheme.grayColor, 1)
-            }, BorderLayout.CENTER)
-            add(actions, BorderLayout.SOUTH)
+            preferredSize = Dimension(0, 250)
+            minimumSize = Dimension(0, 150)
+            horizontalScrollBarPolicy = JScrollPane.HORIZONTAL_SCROLLBAR_NEVER
         }
-        panel.add(centerPanel, BorderLayout.CENTER)
-
-        return panel
     }
 
     private fun createCommandRulesPanel(): JComponent {
@@ -260,7 +163,7 @@ class ToolsSettingsPanel(
                 ),
                 LCATheme.paddedBorder(8)
             )
-            preferredSize = Dimension(700, 310)
+            preferredSize = Dimension(0, 310)
         }
 
         val columns = arrayOf("Pattern (regex)", "Action", "Description")
@@ -272,15 +175,25 @@ class ToolsSettingsPanel(
             gridColor = JBColor.LIGHT_GRAY
         }
 
-        commandRulesTable.columnModel.getColumn(0).preferredWidth = 300
-        commandRulesTable.columnModel.getColumn(1).preferredWidth = 100
-        commandRulesTable.columnModel.getColumn(2).preferredWidth = 200
+        // Flexible layout: Action column fixed, Pattern and Description share the rest.
+        commandRulesTable.autoResizeMode = JTable.AUTO_RESIZE_SUBSEQUENT_COLUMNS
+        commandRulesTable.columnModel.getColumn(0).apply {
+            minWidth = 140
+            preferredWidth = 220
+        }
+        commandRulesTable.columnModel.getColumn(1).apply {
+            minWidth = 80
+            maxWidth = 110
+            preferredWidth = 90
+        }
+        commandRulesTable.columnModel.getColumn(2).apply {
+            minWidth = 120
+            preferredWidth = 180
+        }
 
-        // Action column dropdown
         val actionCombo = JComboBox(arrayOf("ALLOW", "BLOCK", "ASK"))
         commandRulesTable.columnModel.getColumn(1).cellEditor = DefaultCellEditor(actionCombo)
 
-        // Color renderer for action column
         val actionRenderer = object : DefaultTableCellRenderer() {
             override fun getTableCellRendererComponent(
                 table: JTable?, value: Any?, isSelected: Boolean, hasFocus: Boolean, row: Int, column: Int
@@ -342,50 +255,6 @@ class ToolsSettingsPanel(
         }
     }
 
-    private fun populateWhitelistTable(commands: List<AllowedCommand>) {
-        val model = whitelistTable.model as DefaultTableModel
-        model.rowCount = 0
-
-        commands.forEach { command ->
-            model.addRow(arrayOf<Any?>(
-                true,
-                command.program,
-                command.aliases.joinToString(", "),
-                command.allowedSubcommands.joinToString(", "),
-                command.blockedSubcommands.joinToString(", "),
-                command.blockedFlags.joinToString(", "),
-                command.blockedArgPatterns.joinToString(", "),
-                command.maxArgs.toString(),
-                command.requireConfirmation
-            ))
-        }
-    }
-
-    private fun addEmptyWhitelistRow() {
-        val model = whitelistTable.model as DefaultTableModel
-        model.addRow(arrayOf<Any?>(true, "", "", "", "", "", "", "50", false))
-        val row = model.rowCount - 1
-        whitelistTable.changeSelection(row, 1, false, false)
-    }
-
-    private fun removeSelectedWhitelistRow() {
-        val model = whitelistTable.model as DefaultTableModel
-        val selectedRows = whitelistTable.selectedRows
-        if (selectedRows.isEmpty()) {
-            return
-        }
-
-        selectedRows.sortedDescending().forEach { row ->
-            if (row in 0 until model.rowCount) {
-                model.removeRow(row)
-            }
-        }
-    }
-
-    /**
-     * Ładuje definicje narzędzi z backendu (jedyne źródło prawdy).
-     * Backend wyznacza listę i defaulty z ToolRegistry + ToolPermissionsService.
-     */
     private fun loadToolDefinitions() {
         if (coreApiClient == null) {
             logger.warn { "CoreApiClient not available – tools table will be empty" }
@@ -394,7 +263,7 @@ class ToolsSettingsPanel(
 
         coroutineScope.launch {
             try {
-                val definitions = coreApiClient.getAvailableToolDefinitions()
+                val definitions = coreApiClient.toolRouter.getAvailableToolDefinitions()
                 SwingUtilities.invokeLater {
                     populateTableFromBackend(definitions)
                     loadToolPermissions()
@@ -429,9 +298,11 @@ class ToolsSettingsPanel(
             try {
                 logger.info { "Loading tool permissions from backend" }
 
-                val permissions = coreApiClient.getToolPermissions()
+                val response = coreApiClient.toolRouter.getToolPermissions(null)
+                val permissions = response.tools.associate { tool ->
+                    tool.toolName to (tool.planMode to tool.agentMode)
+                }
 
-                // applyPermissions already uses SwingUtilities.invokeLater internally
                 applyPermissions(permissions)
             } catch (e: Exception) {
                 logger.error(e) { "Failed to load tool permissions" }
@@ -439,12 +310,8 @@ class ToolsSettingsPanel(
         }
     }
 
-    /**
-     * Aplikuje załadowane uprawnienia do tabeli
-     */
     private fun applyPermissions(permissions: Map<String, Pair<String, String>>) {
         SwingUtilities.invokeLater {
-            // Blokuj auto-save podczas ładowania
             isLoadingPermissions = true
             try {
                 val tableModel = toolsTable.model as DefaultTableModel
@@ -461,154 +328,12 @@ class ToolsSettingsPanel(
                     tableModel.setValueAt(normalizedAgentMode, row, 3)
                 }
             } finally {
-                // Zawsze odblokuj po zakończeniu
                 isLoadingPermissions = false
             }
         }
     }
 
-    /**
-     * Callback wywoływany gdy zmieni się uprawnienie w tabeli
-     */
-    private fun loadTerminalWhitelist() {
-        if (coreApiClient == null) {
-            logger.warn { "CoreApiClient not available, using terminal whitelist defaults" }
-            return
-        }
-
-        coroutineScope.launch {
-            try {
-                logger.info { "Loading terminal whitelist from backend" }
-                val config = coreApiClient.getTerminalWhitelistConfig()
-                applyTerminalWhitelistConfig(config)
-            } catch (e: Exception) {
-                logger.error(e) { "Failed to load terminal whitelist" }
-            }
-        }
-    }
-
-    private fun applyTerminalWhitelistConfig(config: CommandWhitelistConfig) {
-        SwingUtilities.invokeLater {
-            isLoadingWhitelist = true
-            try {
-                whitelistEnabledCheckbox.isSelected = config.enabled
-                whitelistModeCombo.selectedItem = config.mode.name
-                populateWhitelistTable(config.allowedCommands)
-            } finally {
-                isLoadingWhitelist = false
-            }
-        }
-    }
-
-    private fun saveTerminalWhitelist() {
-        if (isLoadingWhitelist) return
-        if (coreApiClient == null) {
-            JOptionPane.showMessageDialog(this, "CoreApiClient is not available", "Error", JOptionPane.ERROR_MESSAGE)
-            return
-        }
-
-        val config = try {
-            buildTerminalWhitelistConfigFromUi()
-        } catch (e: IllegalArgumentException) {
-            JOptionPane.showMessageDialog(
-                this,
-                e.message,
-                "Invalid whitelist configuration",
-                JOptionPane.ERROR_MESSAGE
-            )
-            return
-        }
-
-        coroutineScope.launch {
-            try {
-                coreApiClient.setTerminalWhitelistConfig(config, "app")
-                logger.info { "Terminal whitelist saved (${config.allowedCommands.size} commands)" }
-            } catch (e: Exception) {
-                logger.error(e) { "Failed to save terminal whitelist" }
-                SwingUtilities.invokeLater {
-                    JOptionPane.showMessageDialog(
-                        this@ToolsSettingsPanel,
-                        "Nie udaĹ‚o siÄ™ zapisaÄ‡ whitelisty: ${e.message}",
-                        "BĹ‚Ä…d",
-                        JOptionPane.ERROR_MESSAGE
-                    )
-                }
-            }
-        }
-    }
-
-    private fun resetTerminalWhitelistToDefaults() {
-        whitelistEnabledCheckbox.isSelected = true
-        whitelistModeCombo.selectedItem = WhitelistMode.WHITELIST_ONLY.name
-        populateWhitelistTable(CommandWhitelistDefaults.DEFAULT_COMMANDS)
-        saveTerminalWhitelist()
-    }
-
-    private fun buildTerminalWhitelistConfigFromUi(): CommandWhitelistConfig {
-        val modeName = whitelistModeCombo.selectedItem?.toString()
-            ?: throw IllegalArgumentException("Whitelist mode is required")
-        val mode = try {
-            WhitelistMode.valueOf(modeName)
-        } catch (_: IllegalArgumentException) {
-            throw IllegalArgumentException("Unknown whitelist mode: $modeName")
-        }
-
-        return CommandWhitelistConfig(
-            enabled = whitelistEnabledCheckbox.isSelected,
-            mode = mode,
-            allowedCommands = collectWhitelistCommandsFromTable(),
-            globalBlockedPatterns = CommandWhitelistDefaults.DEFAULT_BLOCKED_PATTERNS
-        )
-    }
-
-    private fun collectWhitelistCommandsFromTable(): List<AllowedCommand> {
-        val model = whitelistTable.model as DefaultTableModel
-        val commands = mutableListOf<AllowedCommand>()
-
-        for (row in 0 until model.rowCount) {
-            val enabled = (model.getValueAt(row, 0) as? Boolean) ?: false
-            if (!enabled) continue
-
-            val program = model.getValueAt(row, 1)?.toString()?.trim().orEmpty()
-            if (program.isBlank()) {
-                throw IllegalArgumentException("Program is required for enabled row ${row + 1}")
-            }
-
-            val maxArgsRaw = model.getValueAt(row, 7)?.toString()?.trim().orEmpty()
-            val maxArgs = if (maxArgsRaw.isBlank()) {
-                50
-            } else {
-                maxArgsRaw.toIntOrNull()
-                    ?: throw IllegalArgumentException("Invalid maxArgs in row ${row + 1}: '$maxArgsRaw'")
-            }
-
-            val requireConfirmation = (model.getValueAt(row, 8) as? Boolean) ?: false
-
-            commands += AllowedCommand(
-                program = program,
-                aliases = parseCsvList(model.getValueAt(row, 2)?.toString()),
-                allowedSubcommands = parseCsvList(model.getValueAt(row, 3)?.toString()),
-                blockedSubcommands = parseCsvList(model.getValueAt(row, 4)?.toString()),
-                blockedFlags = parseCsvList(model.getValueAt(row, 5)?.toString()),
-                blockedArgPatterns = parseCsvList(model.getValueAt(row, 6)?.toString()),
-                maxArgs = maxArgs,
-                requireConfirmation = requireConfirmation
-            )
-        }
-
-        return commands
-    }
-
-    private fun parseCsvList(raw: String?): List<String> {
-        return raw
-            ?.split(",")
-            ?.map { it.trim() }
-            ?.filter { it.isNotBlank() }
-            ?: emptyList()
-    }
-
     private fun onPermissionChanged(row: Int) {
-        // Ignoruj zmiany podczas ładowania (aby uniknąć zapętlenia)
         if (isLoadingPermissions) {
             logger.debug { "Ignoring permission change during loading (row=$row)" }
             return
@@ -622,18 +347,18 @@ class ToolsSettingsPanel(
 
         logger.debug { "Permission changed: $toolName -> plan=$planMode, agent=$agentMode" }
 
-        // Auto-save do backendu
         coroutineScope.launch {
             try {
-                coreApiClient?.setToolPermission(
+                coreApiClient?.toolRouter?.setToolPermission(
                     toolName = toolName,
-                    planMode = planMode.uppercase(),
-                    agentMode = agentMode.uppercase()
+                    request = pl.jclab.refio.core.models.api.SetToolPermissionRequest(
+                        planMode = planMode.uppercase(),
+                        agentMode = agentMode.uppercase()
+                    )
                 )
 
                 logger.info { "Saved permission for $toolName" }
 
-                // Powiadom SettingsView o zmianie
                 SwingUtilities.invokeLater {
                     onSettingChanged("tools", "permission_$toolName", "$planMode,$agentMode")
                 }
@@ -648,7 +373,6 @@ class ToolsSettingsPanel(
                         JOptionPane.ERROR_MESSAGE
                     )
 
-                    // Reload table to restore previous values
                     loadToolPermissions()
                 }
             }
@@ -661,7 +385,6 @@ class ToolsSettingsPanel(
     fun reload() {
         logger.info { "Reloading tool permissions" }
         loadToolDefinitions()
-        loadTerminalWhitelist()
     }
 
     private fun toDisplayToolName(internalToolName: String): String {
@@ -684,5 +407,4 @@ class ToolsSettingsPanel(
             }
         }
     }
-
 }
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/theme/ContextSectionColorPalette.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/theme/ContextSectionColorPalette.kt
index 6903b77a..e652c682 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/theme/ContextSectionColorPalette.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/theme/ContextSectionColorPalette.kt
@@ -18,7 +18,6 @@ object ContextSectionColorPalette {
         "current_task" to Color(0xDA70D6),
         "subtasks" to Color(0xFF69B4),
         "conversation" to Color(0xF08080),
-        "rag_fragments" to Color(0xFFB347),
         "user_context" to Color(0x98FB98),
         "tool_outputs" to Color(0x87CEEB),
         "recent_work" to Color(0xDDA0DD),
diff --git a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/toolwindow/RefioMainPanel.kt b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/toolwindow/RefioMainPanel.kt
index 3afdc872..91692500 100644
--- a/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/toolwindow/RefioMainPanel.kt
+++ b/intellij-plugin/src/main/kotlin/pl/jclab/refio/ui/toolwindow/RefioMainPanel.kt
@@ -1,7 +1,7 @@
 package pl.jclab.refio.ui.toolwindow
 
 import com.intellij.openapi.Disposable
-import pl.jclab.refio.services.logging.dualLogger
+import pl.jclab.refio.core.logging.dualLogger
 import com.intellij.openapi.project.Project
 import com.intellij.ui.components.JBPanel
 import com.intellij.ui.components.JBScrollPane
@@ -21,7 +21,7 @@ import pl.jclab.refio.ui.components.rag.RagViewPanel
 import pl.jclab.refio.ui.execution.TurnStateStatusBar
 import pl.jclab.refio.ui.settings.ApiLogsPanel
 import pl.jclab.refio.ui.settings.SettingsView
-import pl.jclab.refio.api.CoreApiClient
+import pl.jclab.refio.core.api.CoreApiRouter
 import com.intellij.openapi.actionSystem.ActionManager
 import com.intellij.openapi.actionSystem.DefaultActionGroup
 import com.intellij.openapi.actionSystem.Separator
@@ -57,7 +57,7 @@ class RefioMainPanel(private val project: Project) : JBPanel<RefioMainPanel>(Bor
     private val cs = CoroutineScope(SupervisorJob())
     private val sessionManager = SessionManager.getInstance(project)
     private val stepExecutionService = StepExecutionService.getInstance(project)
-    private val coreApiClient = pl.jclab.refio.api.CoreApiClient(sessionManager.apiRouter)
+    private val coreApiClient: pl.jclab.refio.core.api.CoreApiRouter = sessionManager.apiRouter
 
     private val chatView: ChatView
     private val promptInputPanel: PromptInputPanel
@@ -207,7 +207,7 @@ class RefioMainPanel(private val project: Project) : JBPanel<RefioMainPanel>(Bor
         // Load advanced view setting from config on startup
         cs.launch {
             try {
-                val config = coreApiClient.getConfig(section = "general", scope = "app")
+                val config = coreApiClient.configRouter.getConfig(section = "general", scope = "app")
                 val advancedView = (config.settings["advanced_view"] as? String).toBoolean()
 
                 // Apply setting on EDT
diff --git a/intellij-plugin/src/test/kotlin/pl/jclab/refio/services/session/SessionStateManagerTest.kt b/intellij-plugin/src/test/kotlin/pl/jclab/refio/services/session/SessionStateManagerTest.kt
deleted file mode 100644
index f2f778f1..00000000
--- a/intellij-plugin/src/test/kotlin/pl/jclab/refio/services/session/SessionStateManagerTest.kt
+++ /dev/null
@@ -1,48 +0,0 @@
-package pl.jclab.refio.services.session
-
-import kotlinx.coroutines.runBlocking
-import org.junit.jupiter.api.Test
-import pl.jclab.refio.api.models.ExecutionMode
-import pl.jclab.refio.api.models.Message
-import pl.jclab.refio.api.models.Session
-import pl.jclab.refio.api.models.TaskMode
-import pl.jclab.refio.api.models.TaskStatus
-import kotlin.test.assertEquals
-
-class SessionStateManagerTest {
-
-    @Test
-    fun `setActiveSession updates state`() {
-        val manager = SessionStateManager()
-        val session = Session(
-            id = "session-1",
-            name = "Test",
-            mode = TaskMode.CHAT,
-            status = TaskStatus.PENDING,
-            createdAt = 1L,
-            updatedAt = 1L,
-            executionMode = ExecutionMode.INTERACTIVE
-        )
-
-        manager.setActiveSession(session)
-
-        assertEquals(session, manager.activeSession.value)
-    }
-
-    @Test
-    fun `appendMessage adds message`() = runBlocking {
-        val manager = SessionStateManager()
-        val message = Message(
-            id = "msg-1",
-            taskId = "session-1",
-            role = "system",
-            content = "hello",
-            createdAt = 1L
-        )
-
-        manager.appendMessage(message)
-
-        assertEquals(1, manager.messages.value.size)
-        assertEquals(message, manager.messages.value.first())
-    }
-}