# safety-benchmarks
標記為「safety-benchmarks」的 4 篇文章
Safety Regression Testing
Quantitative methods for measuring safety changes before and after fine-tuning -- benchmark selection, automated safety test suites, statistical methodology for safety regression, and building comprehensive before/after evaluation pipelines.
regression-testingsafety-benchmarksevaluationmetricsbefore-aftersafety-measurementfine-tuning-security
Lab: Running Safety Benchmarks
Run standardized safety benchmarks against LLM models to establish baseline safety profiles for comparison.
labssafety-benchmarkstestingbeginner
Safety Regression Testing
Quantitative methods for measuring safety changes before and after fine-tuning -- benchmark selection, automated safety test suites, statistical methodology for safety regression, and building comprehensive before/after evaluation pipelines.
regression-testingsafety-benchmarksevaluationmetricsbefore-aftersafety-measurementfine-tuning-security
實驗室: Running Safety Benchmarks
Run standardized safety benchmarks against LLM models to establish baseline safety profiles for comparison.
labssafety-benchmarkstestingbeginner