# regression
7 articlestagged with “regression”
Model Behavior Diffing
Comparing model behavior before and after incidents: output distribution analysis, safety regression detection, capability change measurement, and statistical significance testing.
Security Risks of AI-Assisted Refactoring
Analysis of security vulnerabilities introduced when AI tools refactor existing code, including subtle behavioral changes and security property violations.
Attack Replay System Development
Building an attack replay system for regression testing defenses against known attack patterns.
Regression Testing for AI Security
Implementing automated regression testing for AI security properties that integrates into CI/CD pipelines and catches safety regressions.
Lab: Build Behavior Diff Tool
Build a tool that systematically compares language model behavior across versions, configurations, and providers. Detect safety regressions, capability changes, and behavioral drift with automated differential analysis.
Lab: Regression Testing with promptfoo
Hands-on lab for setting up promptfoo to run automated regression tests against LLM applications, ensuring that safety properties hold across model updates and prompt changes.
Verifying That Remediations Are Effective
Walkthrough for planning and executing remediation verification testing (retesting) to confirm that AI vulnerability fixes are effective and do not introduce regressions.