1 article tagged with “world-models”
Exploiting learned world models in AI agents to cause unsafe behavior through environmental manipulation.