1 articletagged with “sandbagging”
Detecting when AI models deliberately underperform on capability evaluations to appear less capable.