Skip to main content
redteams.ai
All tags

# behavior-diffing

1 articletagged with “behavior-diffing

Model Behavior Diffing

Comparing model behavior before and after incidents: output distribution analysis, safety regression detection, capability change measurement, and statistical significance testing.

behavior-diffingcomparisonregressionmodel-analysis
Advanced