# automated-attacks

標記為「automated-attacks」的 2 篇文章

實作：集成攻擊

Use multiple 語言模型 collaboratively to discover attack strategies that bypass any single model's defenses, leveraging model diversity for more effective 紅隊演練.

labensemble-attacksmulti-modelautomated-attacks

進階

實作：PAIR 攻擊實作

建構 a complete Prompt Automatic Iterative Refinement system that uses an attacker LLM to automatically generate and refine 越獄 prompts against a target model.

labpairautomated-attacksjailbreaking

進階