Skip to main content
redteams.ai
All tags

# fine-tuned

1 articletagged with “fine-tuned

Alignment Breaker: Level 2 — Safety Fine-Tuned Model

Bypass safety fine-tuning on a model with RLHF, constitutional AI, and classifier-based defenses.

labsctfalignmentfine-tuned
Expert