1 articletagged with “fine-tuned”
Bypass safety fine-tuning on a model with RLHF, constitutional AI, and classifier-based defenses.