# adversarial-suffix
4 articles tagged "adversarial-suffix"
Adversarial Suffix Optimization (GCG)
Implement the Greedy Coordinate Gradient attack to generate adversarial suffixes.
labs, gcg, adversarial-suffix, optimization
Lab: Generating Adversarial Suffixes
Implement the Greedy Coordinate Gradient (GCG) algorithm to generate adversarial suffixes: optimized token sequences that, when appended to a prompt, cause language models to comply with harmful requests.
lab, adversarial-suffix, gcg
Lab: Adversarial Suffix Optimization
Implement GCG-style adversarial suffix attacks that automatically discover token sequences causing language models to comply with harmful requests. Covers gradient-based optimization, transferability analysis, and defense evaluation.
lab, expert, adversarial-suffix, GCG, optimization, hands-on
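Adversarial Suffix Crafting Walkthrough
Craft adversarial suffixes for black-box models using gradient-based and gradient-free optimization methods.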
walkthroughs, adversarial-suffix, crafting, optimization