# red-team-automation
2 articlestagged with “red-team-automation”
HarmBench Evaluation Framework Walkthrough
Complete walkthrough of the HarmBench evaluation framework: installation, running standardized benchmarks against models, interpreting results, creating custom behavior evaluations, and comparing model safety across versions.
harmbenchevaluationbenchmarkssafetyred-team-automationwalkthrough
HarmBench Evaluation Framework 導覽
Complete walkthrough of the HarmBench evaluation framework: installation, running standardized benchmarks against models, interpreting results, creating custom behavior evaluations, and comparing model safety across versions.
harmbenchevaluationbenchmarkssafetyred-team-automationwalkthrough