# regression

7 articles tagged with “regression”

Model Behavior Diffing

Comparing model behavior before and after incidents: output distribution analysis, safety regression detection, capability change measurement, and statistical significance testing.

behavior-diffing comparison regression model-analysis

Advanced

Security Risks of AI-Assisted Refactoring

Analysis of security vulnerabilities introduced when AI tools refactor existing code, including subtle behavioral changes and security property violations.

code-gen-security refactoring behavioral-changes regression

Advanced

Attack Replay System Development

Building an attack replay system for regression testing defenses against known attack patterns.

exploit-dev replay system regression

Intermediate

Regression Testing for AI Security

Implementing automated regression testing for AI security properties that integrates into CI/CD pipelines and catches safety regressions.

exploit-dev regression testing CI/CD

Intermediate

Lab: Build Behavior Diff Tool

Build a tool that systematically compares language model behavior across versions, configurations, and providers. Detect safety regressions, capability changes, and behavioral drift with automated differential analysis.

lab expert behavior-diff regression comparison hands-on

Expert

Lab: Regression Testing with promptfoo

Hands-on lab for setting up promptfoo to run automated regression tests against LLM applications, ensuring that safety properties hold across model updates and prompt changes.

lab promptfoo regression

Intermediate

Verifying That Remediations Are Effective

Walkthrough for planning and executing remediation verification testing (retesting) to confirm that AI vulnerability fixes are effective and do not introduce regressions.

remediation verification retesting regression methodology walkthrough

Intermediate