# GCG
標記為「GCG」的 8 篇文章
Universal Adversarial Attacks
Universal perturbations that transfer across models, adversarial suffix research, and techniques for creating model-agnostic attack payloads.
Adversarial Suffix Generation
GCG attacks, universal adversarial triggers, soft prompt optimization, and defense evasion techniques for automated alignment bypass.
Lab: Adversarial Suffix Optimization
Implement GCG-style adversarial suffix attacks that automatically discover token sequences causing language models to comply with harmful requests. Covers gradient-based optimization, transferability analysis, and defense evaluation.
Token-Level Adversarial Attacks
Using gradient-based optimization and token manipulation to discover adversarial suffixes that reliably trigger unsafe model behavior.
Universal Adversarial 攻擊s
Universal perturbations that transfer across models, adversarial suffix research, and techniques for creating model-agnostic attack payloads.
Adversarial Suffix Generation
GCG attacks, universal adversarial triggers, soft prompt optimization, and defense evasion techniques for automated alignment bypass.
實驗室: Adversarial Suffix Optimization
Implement GCG-style adversarial suffix attacks that automatically discover token sequences causing language models to comply with harmful requests. Covers gradient-based optimization, transferability analysis, and defense evaluation.
Token-Level Adversarial 攻擊s
Using gradient-based optimization and token manipulation to discover adversarial suffixes that reliably trigger unsafe model behavior.