1 article tagged "adversarial-optimization"
Implement token-level adversarial optimization to discover minimal perturbations that bypass safety training.