Skip to main content
redteams.ai
All tags

# information-routing

1 articletagged with “information-routing

Exploiting Attention Mechanisms

How the self-attention mechanism in transformers can be leveraged to steer model behavior, hijack information routing, and bypass safety instructions.

attentiontransformersinternalsexploit-primitivesinformation-routing
Advanced