# trigger
3 articlestagged with “trigger”
Trigger-Based Backdoor Attacks
Implementing backdoor attacks using specific trigger patterns that activate pre-programmed model behavior while remaining dormant under normal conditions.
data-trainingbackdoortriggertrojan
Poisoning Fine-Tuning Datasets
Techniques for inserting backdoor triggers into fine-tuning datasets, clean-label poisoning that evades content filters, and scaling attacks across dataset sizes -- how adversarial training data compromises model behavior.
dataset-poisoningbackdoorclean-labeltriggerfine-tuningdata-poisoningsupply-chain
SFT Data Poisoning & Injection
Poisoning supervised fine-tuning datasets through instruction-response pair manipulation, backdoor triggers in SFT data, and determining minimum poisoned example thresholds.
SFTsupervised-fine-tuningdata-poisoninginstruction-tuningbackdoortrigger