redteams.ai

# representation

2 articles tagged with “representation”

Activation Steering for Adversarial Purposes

Using representation engineering and activation steering to manipulate model behavior at the representation level.

frontier, activation-steering, representation
Expert

Representation Probing for Vulnerabilities

Probe internal model representations to identify exploitable features and develop representation-level attacks.

probing, advanced, lab, representation, labs
Advanced