Skip to main content
redteams.ai
All tags

# model-tampering

1 articletagged with “model-tampering

Fine-Tuning Attack Forensics

Forensic techniques for detecting unauthorized fine-tuning modifications to language models, including safety alignment degradation and capability injection.

ai-forensics-irfine-tuningmodel-tamperingalignment
Advanced