# safety-framework

標記為「safety-framework」的 2 篇文章

Fine-Tuning Safety Evaluation Framework

A comprehensive framework for evaluating the safety of fine-tuned models -- combining pre-deployment testing, safety regression benchmarks, and continuous monitoring to detect when fine-tuning has compromised model safety.

safety-evaluationregression-testingbenchmarkingmonitoringfine-tuning-securitysafety-framework

中級

Safety Framework Circumvention

攻擊ing safety layers in embodied AI systems: watchdog suppression, emergency stop bypass, safety monitor evasion, and techniques for compromising multi-layered safety architectures in robotic systems.

safety-frameworkwatchdoge-stopsafety-monitor

專家