# adversarial-training
4 articles tagged "adversarial-training"
Guide to Adversarial Training for Robustness
Comprehensive guide to adversarial training techniques that improve model robustness against attacks, including data augmentation strategies, adversarial fine-tuning, RLHF-based hardening, and evaluating the trade-offs between robustness and model capability.
Adversarial Training for LLM Defense (Defense Walkthrough)
Implement adversarial training techniques to improve LLM robustness against prompt injection and jailbreaking.
Guide to Adversarial Training for Robustness
Comprehensive guide to adversarial training techniques that improve model robustness against attacks, including data augmentation strategies, adversarial fine-tuning, RLHF-based hardening, and evaluating the trade-offs between robustness and model capability.
Adversarial Training for LLM Defense (Defense Walkthrough)
Implement adversarial training techniques to improve LLM robustness against prompt injection and jailbreaking.