# weight-manipulation
2 articlestagged with “weight-manipulation”
Direct Weight Manipulation
Techniques for directly modifying LoRA adapter weights to bypass safety training, inject targeted capabilities, and hide malicious behaviors -- going beyond dataset-driven fine-tuning to surgical weight-level attacks.
weight-manipulationloraadaptersafety-bypasscapability-injectionhidden-behaviormodel-editing
Llama Family Attacks
Comprehensive attack analysis of Meta's Llama model family including weight manipulation, fine-tuning safety removal, quantization artifacts, uncensored variants, and Llama Guard bypass techniques.
llamametaweight-manipulationfine-tuningquantizationllama-guardred-teaming