1 article tagged with “stealthy-jailbreak”
Implement the AutoDAN methodology for generating human-readable stealthy jailbreak prompts using gradient guidance.