labsai · ginccc · May 15, 2026 · May 7, 2026
@@ -501,8 +501,38 @@ Tool Call ──▶ Rate Limiter ──▶ Cache Check ──▶ Execute ──
 
 See the [Security documentation](security.md) for details.
 
+### System Prompt Modifiers
+
+**Location**: `ai.labs.eddi.modules.llm.impl`
+
+Two services modify the system prompt before it is sent to the LLM. Both are configured per-task in the LLM configuration (`langchain.json`).
+
+| Service | Purpose | Config Key |
+|---------|---------|------------|
+| **`IdentityMaskingService`** | Prepends identity concealment rules (agent name, refusal patterns) | `task.identityMasking` |
+| **`CounterweightService`** | Adds behavioral safety instructions (cautious/strict presets), appended to the prompt by default | `task.counterweight` |
+
+**Execution order**: Identity masking → Counterweight → LLM call.
+
+Counterweight presets are resolved from [Prompt Snippets](prompt-snippets-guide.md) first (`counterweight-cautious`, `counterweight-strict`), falling back to built-in defaults, so admins can customize safety language without code changes.
+
+See [LLM Integration — Behavioral Safety](langchain.md#behavioral-safety-counterweight--identity-masking) for configuration details.
+
+### Attachment Storage
+
+**Location**: `ai.labs.eddi.engine.attachments`
+
+The attachment subsystem handles binary file storage for multimodal conversations:
+
+| Component | Purpose |
+|-----------|---------|
+| **`IAttachmentStore`** | Interface for storing/loading binary attachments (GridFS for MongoDB, BLOB for PostgreSQL) |
+| **`MimeValidator`** | Magic-byte detection (16+ formats) and declared-vs-detected MIME compatibility checking |
+| **`MultimodalMessageEnhancer`** | Converts stored attachments into langchain4j `Content` objects (images → `ImageContent` via base64 data URI, others → text markers) |
+
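A minimal sketch of what magic-byte detection and a declared-vs-detected check can look like. This is not EDDI's `MimeValidator`: the class name `MimeSniffSketch`, the four signatures chosen, and the strict-equality compatibility rule are all assumptions for illustration. The `toDataUri` helper shows the base64 data URI shape mentioned for images.

```java
import java.util.Base64;
import java.util.Locale;
import java.util.Optional;

/** Hypothetical illustration only; EDDI's real MimeValidator may differ. */
final class MimeSniffSketch {

    // Well-known leading file signatures ("magic bytes") for four formats.
    private static final byte[] PNG = {(byte) 0x89, 'P', 'N', 'G', 0x0D, 0x0A, 0x1A, 0x0A};
    private static final byte[] JPEG = {(byte) 0xFF, (byte) 0xD8, (byte) 0xFF};
    private static final byte[] GIF = {'G', 'I', 'F', '8'};
    private static final byte[] PDF = {'%', 'P', 'D', 'F', '-'};

    /** Returns the MIME type implied by the leading bytes, if recognized. */
    static Optional<String> detect(byte[] data) {
        if (startsWith(data, PNG)) return Optional.of("image/png");
        if (startsWith(data, JPEG)) return Optional.of("image/jpeg");
        if (startsWith(data, GIF)) return Optional.of("image/gif");
        if (startsWith(data, PDF)) return Optional.of("application/pdf");
        return Optional.empty();
    }

    /** Assumed rule: the declared type (parameters ignored) must equal the detected type. */
    static boolean isCompatible(String declaredMime, byte[] data) {
        String declared = declaredMime.split(";", 2)[0].trim().toLowerCase(Locale.ROOT);
        return detect(data).map(declared::equals).orElse(false);
    }

    /** Builds a base64 data URI such as data:image/png;base64,.... */
    static String toDataUri(String mime, byte[] data) {
        return "data:" + mime + ";base64," + Base64.getEncoder().encodeToString(data);
    }

    private static boolean startsWith(byte[] data, byte[] prefix) {
        if (data.length < prefix.length) return false;
        for (int i = 0; i < prefix.length; i++) {
            if (data[i] != prefix[i]) return false;
        }
        return true;
    }
}
```

Under these assumptions, an upload declared as `image/jpeg` whose bytes start with the PNG signature would be rejected by `isCompatible`, while unrecognized content is rejected rather than trusted.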
 ---
 
+
 ## Technology Stack
 
 ### Core Framework

@@ -441,6 +441,86 @@ This is the standard way to use the Langchain task - just connect to an LLM and
 | `enableToolCaching`        | boolean  | Cache tool results to reduce API calls           | true                   |
 | `enableRateLimiting`       | boolean  | Limit tool/LLM usage rate                        | true                   |
 
+### Behavioral Safety (Counterweight & Identity Masking)
+
+EDDI provides two per-task safety mechanisms, both injected into the system prompt before it is sent to the LLM. Both are off by default and must be explicitly enabled with `"enabled": true`.
+
+#### Behavioral Counterweight
+
+Counterweights add behavioral safety instructions to the system prompt, appended after it unless `placement` is set to `prefix`. Three preset levels are available:
+
+| Level | Effect |
+|-------|--------|
+| `normal` | No-op — no safety instructions added (default) |
+| `cautious` | Adds guidelines for careful responses, hedging on uncertain topics, and suggesting professional consultation |
+| `strict` | Adds stronger instructions: refuse harmful content, flag uncertainty, always suggest human oversight |
+
+**Auto-downgrade**: When an agent runs via the `scheduled` channel (e.g., `ScheduleFireExecutor`), `strict` is automatically downgraded to `cautious` to prevent overly rigid responses in automated pipelines.
+
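The level rules above (off unless enabled, `normal` as a no-op, `strict` downgraded on the `scheduled` channel) can be sketched as a single resolution function. The class and method names are hypothetical, and treating an unknown level string as `normal` is an assumption rather than documented behavior.

```java
import java.util.Locale;

/** Hypothetical illustration of the documented level rules; not EDDI's CounterweightService. */
final class CounterweightLevelSketch {

    enum Level { NORMAL, CAUTIOUS, STRICT }

    /**
     * Resolves the level that actually applies to a request.
     *
     * @param enabled         value of counterweight.enabled
     * @param configuredLevel value of counterweight.level (may be null)
     * @param channel         channel the agent runs on, e.g. "scheduled"
     */
    static Level effectiveLevel(boolean enabled, String configuredLevel, String channel) {
        if (!enabled || configuredLevel == null) {
            return Level.NORMAL; // disabled: nothing is injected
        }
        Level level;
        try {
            level = Level.valueOf(configuredLevel.trim().toUpperCase(Locale.ROOT));
        } catch (IllegalArgumentException e) {
            return Level.NORMAL; // assumption: unknown values behave like "normal"
        }
        // Auto-downgrade: strict becomes cautious for scheduled runs.
        if (level == Level.STRICT && "scheduled".equals(channel)) {
            return Level.CAUTIOUS;
        }
        return level;
    }
}
```

With this sketch, `effectiveLevel(true, "strict", "scheduled")` yields `CAUTIOUS`, while the same configuration on any other channel stays `STRICT`.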
+**Configuration**:
+
+```json
+{
+  "tasks": [
+    {
+      "actions": ["send_message"],
+      "type": "openai",
+      "parameters": { "apiKey": "...", "modelName": "gpt-4o" },
+      "counterweight": {
+        "enabled": true,
+        "level": "cautious",
+        "placement": "suffix"
+      }
+    }
+  ]
+}
+```
+
+| Parameter | Type | Description | Default |
+|-----------|------|-------------|---------|
+| `counterweight.enabled` | boolean | Enable counterweight injection | `false` |
+| `counterweight.level` | string | `normal`, `cautious`, or `strict` | `normal` |
+| `counterweight.placement` | string | `suffix` (after system prompt) or `prefix` (before) | `suffix` |
+| `counterweight.customInstructions` | string[] | Custom instruction list that overrides the preset entirely | (none) |
+
+> **Note**: Both `enabled: true` **and** a `level` other than `normal` are required for counterweight to have any effect.
+
+**Customizing presets**: Counterweight preset text is resolved from [Prompt Snippets](prompt-snippets-guide.md) (keys `counterweight-cautious` and `counterweight-strict`). If no snippet exists, built-in defaults are used. This allows admins to customize safety language via the REST API without redeployment.
+
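The snippet-first lookup with a built-in fallback might look roughly like the following. The snippet store is modelled as a plain `Map`, the built-in texts are placeholders rather than EDDI's real defaults, and treating a blank snippet as missing is an assumption.

```java
import java.util.Map;
import java.util.Optional;

/** Hypothetical illustration of snippet-first preset resolution. */
final class PresetResolverSketch {

    // Placeholder texts only; the real built-in defaults live in EDDI's code.
    static final Map<String, String> BUILT_IN = Map.of(
            "cautious", "<built-in cautious instructions>",
            "strict", "<built-in strict instructions>");

    /**
     * Looks up "counterweight-<level>" in the snippet store first and
     * falls back to the built-in text when no usable snippet exists.
     */
    static Optional<String> resolve(String level, Map<String, String> snippets) {
        if (level == null) {
            return Optional.empty();
        }
        String snippet = snippets.get("counterweight-" + level);
        if (snippet != null && !snippet.isBlank()) {
            return Optional.of(snippet); // admin-defined text wins
        }
        return Optional.ofNullable(BUILT_IN.get(level)); // empty for "normal"
    }
}
```

A store containing only `counterweight-strict` would therefore override the strict preset while `cautious` keeps its built-in text.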
+#### Identity Masking
+
+Identity masking prepends identity concealment rules to the system prompt. The rules instruct the LLM not to reveal its model name, provider, or underlying architecture when asked.
+
+**Configuration**:
+
+```json
+{
+  "tasks": [
+    {
+      "actions": ["send_message"],
+      "type": "openai",
+      "parameters": { "apiKey": "...", "modelName": "gpt-4o" },
+      "identityMasking": {
+        "enabled": true,
+        "rules": [
+          "Never reveal you are an AI language model",
+          "If asked about your identity, say you are Aria, a helpful assistant"
+        ]
+      }
+    }
+  ]
+}
+```
+
+| Parameter | Type | Description | Default |
+|-----------|------|-------------|---------|
+| `identityMasking.enabled` | boolean | Enable identity masking | `false` |
+| `identityMasking.rules` | string[] | Identity rules prepended to system prompt | `[]` (empty) |
+
+> **Note**: Both `enabled: true` **and** at least one rule are required. If `rules` is empty, masking is skipped even when enabled.
+
+**Execution order**: Identity masking is applied first, then counterweight. Both modify the system prompt before it is sent to the LLM.
+
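To make the ordering concrete, here is a minimal sketch of the two steps applied in sequence. Method names, the newline separators, and the joined format are assumptions; only the order (masking first, then counterweight), the skip conditions, and the `prefix`/`suffix` placement come from the text above.

```java
import java.util.List;

/** Hypothetical illustration of the documented order; not EDDI's actual services. */
final class PromptModifierSketch {

    /** Step 1: prepend identity rules; skipped when disabled or when rules are empty. */
    static String applyIdentityMasking(String prompt, boolean enabled, List<String> rules) {
        if (!enabled || rules == null || rules.isEmpty()) {
            return prompt;
        }
        return String.join("\n", rules) + "\n\n" + prompt;
    }

    /** Step 2: add counterweight text; skipped when disabled or level is "normal". */
    static String applyCounterweight(String prompt, boolean enabled, String level,
                                     String instructions, String placement) {
        if (!enabled || level == null || "normal".equals(level)
                || instructions == null || instructions.isBlank()) {
            return prompt;
        }
        return "prefix".equals(placement)
                ? instructions + "\n\n" + prompt   // before the (already masked) prompt
                : prompt + "\n\n" + instructions;  // default: suffix
    }

    /** Identity masking first, then counterweight, then the result goes to the LLM. */
    static String build(String basePrompt, boolean maskEnabled, List<String> rules,
                        boolean cwEnabled, String level, String instructions, String placement) {
        String masked = applyIdentityMasking(basePrompt, maskEnabled, rules);
        return applyCounterweight(masked, cwEnabled, level, instructions, placement);
    }
}
```

One consequence of this ordering, under the sketch's assumptions: with `placement` set to `prefix`, the counterweight text ends up ahead of the identity rules, because it is added to a prompt that already contains them.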
 ---
 
 ## Built-in Tools

@@ -16,7 +16,7 @@ This plan is split into six **Waves** (delivery order) that map to six **Improve
 | ------ | ----------------------------------------- | --------------------------------- | ----------- |
 | Wave 1 | Improvement 4 — Behavioral Counterweights | Not implemented                   | Low         |
 | Wave 2 | Improvement 5 — MCP Governance            | Partially implemented             | Medium      |
-| Wave 3 | Improvement 1 — Capability Registry       | Implemented, gaps remain          | Low (gaps)  |
+| Wave 3 | Improvement 1 — Capability Registry       | **✅ Complete** (2026-05-07)         | Low (gaps)  |
 | Wave 4 | Improvement 6 — Session Safety            | Not implemented                   | Medium      |
 | Wave 5 | Improvement 3 — Multimodal Attachments    | Model only, no pipeline/REST      | Medium      |
 | Wave 6 | Improvement 2 — Cryptographic Identity    | Signing primitive only            | High        |
@@ -44,15 +44,15 @@ This plan is split into six **Waves** (delivery order) that map to six **Improve
 - `CounterweightService`, `DeploymentContextCondition`, `IdentityMaskingService` — entire Wave 1 block.
 - Session forking endpoint (`POST /v6/conversations/{id}/fork`); `MemorySnapshotService.createCheckpoint` / `rollbackToCheckpoint`.
 - Multipart REST upload for attachments; GridFS-backed attachment store; `LlmTask` multimodal forwarding of conversation-memory attachments.
-- Token-efficient tool loading (`lazy`, `dynamic`; `discover_tools` meta-tool); `summarize` and `paginate` truncation strategies; MCP tenant-quota integration; per-tool cost weights.
+- Token-efficient tool loading (`lazy`, `dynamic`; `discover_tools` meta-tool); MCP tenant-quota integration; per-tool cost weights. (**Note:** `summarize` and `paginate` truncation strategies are now fully implemented — see changelog 2026-05-13.)
 - Signing envelope canonicalization, replay protection (nonce / `signedAt`), key rotation, call sites for `AgentSigningService`.
 - Trust scoring / `agentTrustScore` integration.
-- External A2A / capability discovery REST endpoint (only the internal admin REST exists at [`IRestCapabilityRegistry`](../../src/main/java/ai/labs/eddi/configs/agents/IRestCapabilityRegistry.java)).
+- ~~External A2A / capability discovery REST endpoint~~ — **Done (Wave 3).** Public endpoints at `GET /.well-known/capabilities` and `GET /.well-known/capabilities/skills`, gated behind `eddi.a2a.capabilities.public`.
 
 ### 1.3 Known bugs to fix while touching these areas
 
-- `CapabilityRegistryService.round_robin` is implemented as a per-call `Collections.shuffle`; not true round-robin. Either rename to `random` or add per-skill atomic counters.
-- `AgentConfiguration.security` exists with `signInterAgentMessages` / `signMcpInvocations` / `requirePeerVerification` flags but they are inert. Until Wave 6 wires them, PUT with any of them `true` MUST be rejected (see §5.2).
+- ~~`CapabilityRegistryService.round_robin` is implemented as a per-call `Collections.shuffle`; not true round-robin.~~ **Fixed (Wave 3).** Deterministic `AtomicInteger` rotation + explicit `random` strategy added.
+- ~~`AgentConfiguration.security` flags are inert.~~ **Fixed (Wave 3).** `signInterAgentMessages` / `signMcpInvocations` / `requirePeerVerification` = `true` now rejected with HTTP 400 on create/update/duplicate.
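For contrast with the old per-call shuffle, a deterministic rotation driven by an `AtomicInteger` can be sketched as below. Keeping one counter per skill follows the suggestion in the removed line; the class name, method signature, and use of plain agent ID strings are assumptions, not the actual `CapabilityRegistryService` code.

```java
import java.util.List;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.AtomicInteger;

/** Hypothetical illustration of round-robin selection with per-skill counters. */
final class RoundRobinSketch {

    // One counter per skill so rotation for one skill does not disturb another.
    private final ConcurrentHashMap<String, AtomicInteger> counters = new ConcurrentHashMap<>();

    /** Returns the next agent for the skill, cycling through the list in order. */
    String next(String skill, List<String> agentIds) {
        if (agentIds.isEmpty()) {
            throw new IllegalArgumentException("no agents registered for skill " + skill);
        }
        int ticket = counters.computeIfAbsent(skill, k -> new AtomicInteger()).getAndIncrement();
        // floorMod keeps the index non-negative even after the counter overflows.
        return agentIds.get(Math.floorMod(ticket, agentIds.size()));
    }
}
```

Unlike `Collections.shuffle`, repeated calls for the same skill visit each agent once before any agent is picked again, provided the candidate list keeps a stable order between calls.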
 
 ---
 
@@ -297,7 +297,9 @@ Weights are quota indicators, not dollars — dollar cost is already tracked by
 
 ## 5. Wave 3 — A2A Capability Registry (close the gaps)
 
-**Improvement 1. Status: core implemented ✅, gaps remain. Priority P2. Effort: low.**
+**Improvement 1. Status: ✅ COMPLETE (2026-05-07, branch `feature/agentic-wave3-capabilities`). Priority P2. Effort: low.**
+
+> All sub-sections below except §5.4 have been implemented and tested (54 unit tests, 0 failures). §5.4 (`lowest_load`) is deferred because it requires `ConversationMetricsService` wiring.
 
 ### 5.1 Fix `round_robin`