1 article tagged with “adversarial-suffixes”
GCG attacks, universal adversarial triggers, soft prompt optimization, and defense evasion techniques for automated alignment bypass.