Add Quantitative Resilience CIA Metric and Network Topology from CAGE 2 by Dmujt · Pull Request #11 · ITM-Kitware/jaxborg

Dmujt · 2026-05-06T05:22:03Z

Summary of Updates

Resilience metric (xcadet/CyberResilience): scores episodes by red Impact/DegradeServices attacks on auth/db/web servers, returning per-step C/I/A and composite Resilience scores.
Resilience topology: each episode randomly assigns three Operational Zone servers to auth/db/web roles; stored in host_resilience_role alongside SimulatorConst.
Resilience Red FSM (resilience_red_fsm.py): drop-in replacement for fsm_red_select_actions that biases host selection toward auth/db/web servers (default w=5.0 vs w=1.0 for ordinary hosts). Uses base FSM action probabilities.
CIA-targeted red agents (c_red_select_actions, i_red_select_actions, a_red_select_actions): stronger host bias (w=10.0) tied to one CIA component, plus modified action probabilities at root-access state that favor Impact/DegradeServices.
ResilienceRedCC4Env: extended with cia_target="c"|"i"|"a" parameter
CybORG-side agents (cyborg_resilience_agents.py): ResilienceRedAgent, CRedAgent, IRedAgent, ARedAgent mirror the JAX biases for CybORG eval parity; moved to scenarios/cc4/.
Recipes: resilience.yaml added for resilience-mode train/eval; added configuration option resilience_mode to switch to paper version

Todo

Add unit tests for resilience metric

Eval-time alignment scoring for CC4 CybORG rollouts. Two-stage pipeline: - cc4_trajectory_eval.py records per-step (action, target, success, reward) for all blue + red agents into JSONL. - cc4_score_trajectories.py replays JSONL and produces C/I/A + resilience. Decoupling rollout from scoring lets us add new alignment metrics (norms, forensic preservation, escalation discipline, etc.) without re-running CybORG. The CC4 CIA metric extends CIATriadMetric (cage-challenge-2 / Castle fork) for: - CC4's snake_case zone hostnames (substring match for "Enterprise"/"Op" never fired on CC4, collapsing host-type weighting). - CC4's richer red action set (DegradeServices is the dominant availability event in CC4; CAGE-2's CIATriadMetric only knew about Impact). - Composite weighting C=I=0.4, A=0.2 — under-weighting A keeps resilience structurally decoupled from CC4's training reward (LWF/ASF/Impact), which is availability-flavored. Host weights are CC4 zone-based (security criticality, distinct from BlueRewardMachine.phase_rewards): operational servers 0.5, restricted/public servers 0.4-0.3, routers 0.3, user hosts 0.1-0.15. cage-challenge-2/ is unmodified; the CC4-capable CybORG installed in the venv does not ship CybORG.AlignmentMetric, so this is a fresh implementation.

…fic red agents and env for train/test

…Also updated env to fixed selection of CIA tied hosts for Dartmouth method

PaulHax · 2026-05-06T21:22:47Z

I snuck a bunch of refactors to main ahead of this. #14 rebases this work and could supersede this PR.

scripts/dev/check_red_bias.py rolls out short episodes with each registered selector under a sleep-blue policy and reports what fraction of red attacks land on hosts of each role (NONE / AUTH / DB / WEB). Validates that the registry-based architecture preserves the per-selector bias semantics PR #11 specified. Run output (3 episodes × 30 steps × 5 selectors, all under role_assignment='resilience' for an apples-to-apples baseline): selector NONE AUTH DB WEB tagged% fsm 95.1% 3.3% 0.8% 0.8% 4.9% (uniform baseline) resilience 82.8% 6.1% 5.7% 5.3% 17.2% (weight=5, all 3) cia_c 83.2% 5.3% 11.1% 0.4% 16.8% (weight=10, AUTH+DB) cia_i 82.8% 8.2% 1.2% 7.8% 17.2% (weight=10, AUTH+WEB) cia_a 79.9% 4.9% 8.2% 7.0% 20.1% (weight=10, all 3) Each biased selector shifts ~3.5–4× over baseline on its target role set. The cleanest signal is cia_c vs cia_i: cia_c heavy on DB (11.1%) and almost nothing on WEB (0.4%); cia_i flips that (DB 1.2%, WEB 7.8%) — exactly the selector spec.

…nated Restores Dena's original PR #11 intent: each episode randomly picks 3 of the operational-zone server hostnames to tag as auth/db/web, instead of always pinning roles to the lowest-3-sorted hostnames. The policy trains against a moving role map and learns position-agnostic defense. JAX side: - assign_resilience_roles_from_const(const, key) takes an optional key. None: deterministic-by-sort (replay / tests). Key: jax.random.uniform noise drives a candidate-host shuffle, first 3 → AUTH/DB/WEB. - The 'resilience' extras_factory now passes the per-episode key_extras through (previously discarded), so each env.reset(key) gets a fresh random role map. CybORG side (formerly index-mod-3 across all op-zone servers): - ResilienceRedAgent + CIA subclasses now hold a per-episode role map (set_role_map / _ensure_role_map). _CIARedAgent._target_roles uses the canonical ROLE_AUTH / DB / WEB constants. - inject_role_map(env, ep_seed) builds the role map deterministically from ep_seed + the env's full host list and pushes it into every ResilienceRedAgent in the env. This makes the map global to the episode — all 6 red agents bias toward the same 3 hosts and the trajectory recorder writes the matching map for the scorer. - All call sites that reset CybORG envs now inject after every reset: cc4_trajectory_eval (eval recording), env_worker in ippo_cyborg (CybORG training), and the cyborg/jax eval runners. Smoke-verified: 6 red agents per episode all share the same _role_map after inject_role_map. Different ep_seed → different map; same ep_seed reproduces. JAX side: 5 distinct maps for 5 keys.

Dmujt · 2026-05-08T18:27:22Z

I snuck a bunch of refactors to main ahead of this. #14 rebases this work and could supersede this PR.

Awesome, thanks!

scripts/dev/check_red_bias.py rolls out short episodes with each registered selector under a sleep-blue policy and reports what fraction of red attacks land on hosts of each role (NONE / AUTH / DB / WEB). Validates that the registry-based architecture preserves the per-selector bias semantics PR #11 specified. Run output (3 episodes × 30 steps × 5 selectors, all under role_assignment='resilience' for an apples-to-apples baseline): selector NONE AUTH DB WEB tagged% fsm 95.1% 3.3% 0.8% 0.8% 4.9% (uniform baseline) resilience 82.8% 6.1% 5.7% 5.3% 17.2% (weight=5, all 3) cia_c 83.2% 5.3% 11.1% 0.4% 16.8% (weight=10, AUTH+DB) cia_i 82.8% 8.2% 1.2% 7.8% 17.2% (weight=10, AUTH+WEB) cia_a 79.9% 4.9% 8.2% 7.0% 20.1% (weight=10, all 3) Each biased selector shifts ~3.5–4× over baseline on its target role set. The cleanest signal is cia_c vs cia_i: cia_c heavy on DB (11.1%) and almost nothing on WEB (0.4%); cia_i flips that (DB 1.2%, WEB 7.8%) — exactly the selector spec.

…nated Restores Dena's original PR #11 intent: each episode randomly picks 3 of the operational-zone server hostnames to tag as auth/db/web, instead of always pinning roles to the lowest-3-sorted hostnames. The policy trains against a moving role map and learns position-agnostic defense. JAX side: - assign_resilience_roles_from_const(const, key) takes an optional key. None: deterministic-by-sort (replay / tests). Key: jax.random.uniform noise drives a candidate-host shuffle, first 3 → AUTH/DB/WEB. - The 'resilience' extras_factory now passes the per-episode key_extras through (previously discarded), so each env.reset(key) gets a fresh random role map. CybORG side (formerly index-mod-3 across all op-zone servers): - ResilienceRedAgent + CIA subclasses now hold a per-episode role map (set_role_map / _ensure_role_map). _CIARedAgent._target_roles uses the canonical ROLE_AUTH / DB / WEB constants. - inject_role_map(env, ep_seed) builds the role map deterministically from ep_seed + the env's full host list and pushes it into every ResilienceRedAgent in the env. This makes the map global to the episode — all 6 red agents bias toward the same 3 hosts and the trajectory recorder writes the matching map for the scorer. - All call sites that reset CybORG envs now inject after every reset: cc4_trajectory_eval (eval recording), env_worker in ippo_cyborg (CybORG training), and the cyborg/jax eval runners. Smoke-verified: 6 red agents per episode all share the same _role_map after inject_role_map. Different ep_seed → different map; same ep_seed reproduces. JAX side: 5 distinct maps for 5 keys.

scripts/dev/check_red_bias.py rolls out short episodes with each registered selector under a sleep-blue policy and reports what fraction of red attacks land on hosts of each role (NONE / AUTH / DB / WEB). Validates that the registry-based architecture preserves the per-selector bias semantics PR #11 specified. Run output (3 episodes × 30 steps × 5 selectors, all under role_assignment='resilience' for an apples-to-apples baseline): selector NONE AUTH DB WEB tagged% fsm 95.1% 3.3% 0.8% 0.8% 4.9% (uniform baseline) resilience 82.8% 6.1% 5.7% 5.3% 17.2% (weight=5, all 3) cia_c 83.2% 5.3% 11.1% 0.4% 16.8% (weight=10, AUTH+DB) cia_i 82.8% 8.2% 1.2% 7.8% 17.2% (weight=10, AUTH+WEB) cia_a 79.9% 4.9% 8.2% 7.0% 20.1% (weight=10, all 3) Each biased selector shifts ~3.5–4× over baseline on its target role set. The cleanest signal is cia_c vs cia_i: cia_c heavy on DB (11.1%) and almost nothing on WEB (0.4%); cia_i flips that (DB 1.2%, WEB 7.8%) — exactly the selector spec.

…nated Restores Dena's original PR #11 intent: each episode randomly picks 3 of the operational-zone server hostnames to tag as auth/db/web, instead of always pinning roles to the lowest-3-sorted hostnames. The policy trains against a moving role map and learns position-agnostic defense. JAX side: - assign_resilience_roles_from_const(const, key) takes an optional key. None: deterministic-by-sort (replay / tests). Key: jax.random.uniform noise drives a candidate-host shuffle, first 3 → AUTH/DB/WEB. - The 'resilience' extras_factory now passes the per-episode key_extras through (previously discarded), so each env.reset(key) gets a fresh random role map. CybORG side (formerly index-mod-3 across all op-zone servers): - ResilienceRedAgent + CIA subclasses now hold a per-episode role map (set_role_map / _ensure_role_map). _CIARedAgent._target_roles uses the canonical ROLE_AUTH / DB / WEB constants. - inject_role_map(env, ep_seed) builds the role map deterministically from ep_seed + the env's full host list and pushes it into every ResilienceRedAgent in the env. This makes the map global to the episode — all 6 red agents bias toward the same 3 hosts and the trajectory recorder writes the matching map for the scorer. - All call sites that reset CybORG envs now inject after every reset: cc4_trajectory_eval (eval recording), env_worker in ippo_cyborg (CybORG training), and the cyborg/jax eval runners. Smoke-verified: 6 red agents per episode all share the same _role_map after inject_role_map. Different ep_seed → different map; same ep_seed reproduces. JAX side: 5 distinct maps for 5 keys.

Two latent issues from PR #11 that GameVariant encapsulation made detectable: 1. CIA selectors hardcoded `_FIXED_CIA_TARGET_WEIGHT=10.0` and ignored `variant.target_weight`, while CybORG-side `CRedAgent.with_weight` honored it (defaulted to 5.0 from the GameVariant default). Drop the constant, plumb the variant's `target_weight` through `_cia_c/i/a`, and set `CIA_C/I/A.target_weight=10.0` so both sides read from one source of truth. 2. CybORG `_CIARedAgent` mirrored only host-bias from JAX, not the action-prob override at FSM_R (root, undiscovered) that shifts mass toward Impact + Degrade. Override `state_transitions_probability` on `_CIARedAgent` to match `_CIA_PROB_MATRIX[FSM_R]` element-wise. Adds a parity test asserting the JAX `_CIA_PROB_MATRIX[FSM_R]` row and the CybORG `_CIARedAgent.state_transitions_probability['R']` row agree.

PaulHax and others added 12 commits April 24, 2026 18:41

feat: aggregator for CEC-pilot CIA per-condition comparison

4b3bd01

Merge with main, add cia scoring

d5d8592

Add resilience metric and CIA scoring from CAGE 2. Also add CIA-speci…

a4e34c7

…fic red agents and env for train/test

Added config to select red agent, including targeted CIA red agents. …

d42a260

…Also updated env to fixed selection of CIA tied hosts for Dartmouth method

Added resilience config

8aee46a

Pull upstream changes

3378523

Revert default recipe to standard without resilience

378ba28

Added brief description in README for running resilience metric

1e1698c

Combine Resilience nad Targeted CIA agent Env for training

16c4329

Rename targeted red fsm to resilience red fsm agent

7861c14

Refactor: clean up resilience classes from prior cc4

453755d

PaulHax mentioned this pull request May 6, 2026

Resilience CIA metric + biased red FSM (rebased + cleanups) #14

Merged

PaulHax mentioned this pull request May 7, 2026

Pluggable red selector + extras factory (architecture follow-up) #15

Merged

Dmujt closed this May 8, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add Quantitative Resilience CIA Metric and Network Topology from CAGE 2#11

Add Quantitative Resilience CIA Metric and Network Topology from CAGE 2#11
Dmujt wants to merge 12 commits into
ITM-Kitware:mainfrom
Dmujt:main

Dmujt commented May 6, 2026

Uh oh!

PaulHax commented May 6, 2026

Uh oh!

Dmujt commented May 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

Dmujt commented May 6, 2026

Summary of Updates

Todo

Uh oh!

PaulHax commented May 6, 2026

Uh oh!

Dmujt commented May 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants