# weight-manipulation
標記為「weight-manipulation」的 4 篇文章
Direct Weight Manipulation
Techniques for directly modifying LoRA adapter weights to bypass safety training, inject targeted capabilities, and hide malicious behaviors -- going beyond dataset-driven fine-tuning to surgical weight-level attacks.
weight-manipulationloraadaptersafety-bypasscapability-injectionhidden-behaviormodel-editing
Llama Family Attacks
Comprehensive attack analysis of Meta's Llama model family including weight manipulation, fine-tuning safety removal, quantization artifacts, uncensored variants, and Llama Guard bypass techniques.
llamametaweight-manipulationfine-tuningquantizationllama-guardred-teaming
Direct Weight Manipulation
Techniques for directly modifying LoRA adapter weights to bypass safety training, inject targeted capabilities, and hide malicious behaviors -- going beyond dataset-driven fine-tuning to surgical weight-level attacks.
weight-manipulationloraadaptersafety-bypasscapability-injectionhidden-behaviormodel-editing
Llama 家族攻擊
Meta 之 Llama 模型家族之完整攻擊分析,含權重操弄、微調安全移除、量化產物、未審查變體與 Llama Guard 繞過技術。
llamametaweight-manipulationfine-tuningquantizationllama-guardred-teaming