# automated-attacks
2 articles tagged "automated-attacks"
Lab: Ensemble Attacks
Use multiple language models collaboratively to discover attack strategies that bypass any single model's defenses, leveraging model diversity for more effective red teaming.
lab, ensemble-attacks, multi-model, automated-attacks
Lab: PAIR Attack Implementation
Build a complete Prompt Automatic Iterative Refinement (PAIR) system that uses an attacker LLM to automatically generate and refine jailbreak prompts against a target model.
lab, pair, automated-attacks, jailbreaking