1 articletagged with “deceptive”
Current research on training deceptive LLMs that persist through safety training and activation patterns.