1 articletagged with “triggers”
Research on discovering universal adversarial triggers that cause specific behaviors across model families.