# representation
標記為「representation」的 4 篇文章
Activation Steering for Adversarial Purposes
Using representation engineering and activation steering to manipulate model behavior at the representation level.
frontieractivation-steeringrepresentation
Representation Probing for Vulnerabilities
Probe internal model representations to identify exploitable features and develop representation-level attacks.
probingadvancedlabrepresentationlabs
Activation Steering for Adversarial Purposes
Using representation engineering and activation steering to manipulate model behavior at the representation level.
frontieractivation-steeringrepresentation
Representation Probing for Vulnerabilities
Probe internal model representations to identify exploitable features and develop representation-level attacks.
probingadvancedlabrepresentationlabs