redteams.ai

# steering-vectors

2 articles tagged with “steering-vectors”

Activation Steering

Manipulating model behavior by adding learned steering vectors to intermediate activations, bypassing safety training through direct representation engineering.

activation-steering, representation-engineering, steering-vectors, mechanistic, safety-bypass
Expert

Lab: Model Steering with Activation Vectors

Use activation steering vectors to control model behavior without prompt modification for security testing.

labs, steering-vectors, activation, advanced
Advanced