# steering-vectors
4 articles tagged "steering-vectors"
Activation Steering
Manipulating model behavior by adding learned steering vectors to intermediate activations, bypassing safety training through direct representation engineering.
activation-steering, representation-engineering, steering-vectors, mechanistic, safety-bypass
Lab: Model Steering with Activation Vectors
Use activation steering vectors to control model behavior without prompt modification for security testing.
labs, steering-vectors, activation, advanced
Activation Steering
Manipulating model behavior by adding learned steering vectors to intermediate activations, bypassing safety training through direct representation engineering.
activation-steering, representation-engineering, steering-vectors, mechanistic, safety-bypass
Lab: Model Steering with Activation Vectors
Use activation steering vectors to control model behavior without prompt modification for security testing.
labs, steering-vectors, activation, advanced