Skip to main content
redteams.ai
All tags

# role-confusion

2 articlestagged with “role-confusion

Instruction Hierarchy Attacks

Exploiting the priority ordering between system, user, and assistant messages to override safety controls, manipulate instruction precedence, and escalate privilege through message role confusion.

prompt-injectioninstruction-hierarchymessage-priorityrole-confusionsystem-promptred-teaming
Intermediate

Role Confusion Attack Walkthrough

Exploit role confusion between system, user, and assistant messages to override safety instructions.

walkthroughsrole-confusioninstruction-hierarchyattacks
Intermediate