1 article tagged with “adversarial-optimization”
Implement token-level adversarial optimization to discover minimal perturbations that bypass safety training.