# representation
2 articlestagged with “representation”
Activation Steering for Adversarial Purposes
Using representation engineering and activation steering to manipulate model behavior at the representation level.
frontieractivation-steeringrepresentation
Representation Probing for Vulnerabilities
Probe internal model representations to identify exploitable features and develop representation-level attacks.
probingadvancedlabrepresentationlabs