# safety-framework
4 articles tagged "safety-framework"
Fine-Tuning Safety Evaluation Framework
A comprehensive framework for evaluating the safety of fine-tuned models -- combining pre-deployment testing, safety regression benchmarks, and continuous monitoring to detect when fine-tuning has compromised model safety.
Safety Framework Circumvention
Attacking safety layers in embodied AI systems: watchdog suppression, emergency stop bypass, safety monitor evasion, and techniques for compromising multi-layered safety architectures in robotic systems.
Fine-Tuning Safety Evaluation Framework
A comprehensive framework for evaluating the safety of fine-tuned models -- combining pre-deployment testing, safety regression benchmarks, and continuous monitoring to detect when fine-tuning has compromised model safety.
Safety Framework Circumvention
Attacking safety layers in embodied AI systems: watchdog suppression, emergency stop bypass, safety monitor evasion, and techniques for compromising multi-layered safety architectures in robotic systems.