# steering-vectors
4 articlestagged with “steering-vectors”
Activation Steering
Manipulating model behavior by adding learned steering vectors to intermediate activations, bypassing safety training through direct representation engineering.
activation-steeringrepresentation-engineeringsteering-vectorsmechanisticsafety-bypass
Lab: Model Steering with Activation Vectors
Use activation steering vectors to control model behavior without prompt modification for security testing.
labssteering-vectorsactivationadvanced
Activation Steering
Manipulating model behavior by adding learned steering vectors to intermediate activations, bypassing safety training through direct representation engineering.
activation-steeringrepresentation-engineeringsteering-vectorsmechanisticsafety-bypass
實驗室: 模型 Steering with Activation Vectors
Use activation steering vectors to control model behavior without prompt modification for security testing.
labssteering-vectorsactivationadvanced