Add: Agentic Attack Surfaces — pipeline trust exploitation, HITL/LITL, and identity anchor attacks by GnomeMan4201 · Pull Request #6 · Libr-AI/OpenRedTeaming

GnomeMan4201 · 2026-05-31T14:37:23Z

The existing taxonomy covers model-level attack strategies well. This adds a new top-level section for attack classes specific to deployed agent architectures, where the target is the trust relationships of the pipeline rather than the model's content boundary.

Three subsections added:

Pipeline Trust Exploitation / Second-Order Injection
Human-in-the-Loop and Legitimate-in-the-Loop Attacks
Identity Anchor Attacks

Also corrects arXiv ID typo on Bagdasaryan et al. in Instruction Indirection: 2307.1049 → 2307.10490

Update 02 - Attack Strategies.md

400dc08

GnomeMan4201 mentioned this pull request May 31, 2026

Add Detection Validation & Adversarial Simulation Frameworks section #7

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add: Agentic Attack Surfaces — pipeline trust exploitation, HITL/LITL, and identity anchor attacks#6

Add: Agentic Attack Surfaces — pipeline trust exploitation, HITL/LITL, and identity anchor attacks#6
GnomeMan4201 wants to merge 1 commit into
Libr-AI:mainfrom
GnomeMan4201:add-agentic-attack-surfaces

GnomeMan4201 commented May 31, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

GnomeMan4201 commented May 31, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant