Skip to main content
redteams.ai
All tags

# world-models

1 articletagged with “world-models

World Model Exploitation in AI Agents

Exploiting learned world models in AI agents to cause unsafe behavior through environmental manipulation.

frontier-researchworld-modelsexploitationagents
Expert