# robustness
6 articles tagged with “robustness”
Guide to Adversarial Training for Robustness
Comprehensive guide to adversarial training techniques that improve model robustness against attacks, including data augmentation strategies, adversarial fine-tuning, RLHF-based hardening, and evaluating the trade-offs between robustness and model capability.
Prompt Robustness Certification Research
Research on certifying prompt robustness with formal guarantees against bounded adversarial perturbations.
Adversarial Robustness Certification
Research into certifiable adversarial robustness for LLMs, including theoretical bounds and practical certification methods.
Adversarial Robustness Evaluation
Build a comprehensive adversarial robustness evaluation framework for assessing model security posture.
Adversarial Training for LLM Defense (Defense Walkthrough)
Implement adversarial training techniques to improve LLM robustness against prompt injection and jailbreaking.
Adversarial Robustness Testing with ARTKit
Walkthrough for using ARTKit (Adversarial Robustness Testing Kit) to evaluate LLM application resilience through automated adversarial testing, covering test flow configuration, challenger setup, evaluator design, and results analysis.