# safety-regression
2 artikelengetagd met “safety-regression”
Hoe fine-tuning de veiligheid aantast
The mechanisms through which fine-tuning erodes model safety -- catastrophic forgetting of safety training, dataset composition effects, the 'few examples' problem, and quantitative methods for measuring safety regression.
safety-degradationcatastrophic-forgettingfine-tuningalignmentsafety-regressionrlhf
Regressietesten van veiligheid bij kwantisatie
Test how model quantization (INT8, INT4, GPTQ) degrades safety alignment and introduces exploitable gaps.
labsquantizationsafety-regressionadvanced