redteams.ai

# deceptive

1 article tagged with “deceptive”

Sleeper Agent Research

Current research on training deceptive LLMs whose behavior persists through safety training, and on their activation patterns.

frontier-research, sleeper-agents, deceptive, training
Expert