# capability-evaluation
2 articlestagged with “capability-evaluation”
Sandbagging Detection in Capability Evaluations
Detecting when AI models deliberately underperform on capability evaluations to appear less capable.
frontier-researchsandbaggingcapability-evaluationdetection
Sandbagging Detection in Capability Evaluations
Detecting when AI models deliberately underperform on capability evaluations to appear less capable.
frontier-researchsandbaggingcapability-evaluationdetection