# catastrophic-forgetting
3 articles tagged with “catastrophic-forgetting”
Continual Learning Vulnerabilities
Exploiting continual learning update mechanisms to introduce vulnerabilities through incremental model updates without triggering safety evaluations.
data-training, continual-learning, catastrophic-forgetting, manipulation
How Fine-Tuning Degrades Safety
The mechanisms through which fine-tuning erodes model safety: catastrophic forgetting of safety training, dataset composition effects, the 'few examples' problem, and quantitative methods for measuring safety regression.
safety-degradation, catastrophic-forgetting, fine-tuning, alignment, safety-regression, rlhf
Continual Learning Security
Security risks in continual learning systems: catastrophic forgetting exploitation, task interference attacks, replay buffer poisoning, and stability-plasticity manipulation.
continual-learning, catastrophic-forgetting, task-interference, online-learning, replay-buffer