# instruction
3 articlestagged with “instruction”
Instruction Tuning Safety Bypass
Using instruction tuning to selectively bypass safety mechanisms while maintaining model capability.
instructionfinesafetybypasstuning
Instruction Hierarchy Exploitation
Exploiting ambiguities in instruction priority hierarchies across different model providers.
hierarchyinstructionexploitationinjectionprompt
Instruction Tuning Data Manipulation
Manipulating instruction tuning datasets to embed specific behaviors in the resulting model.
instructionpipelinetuningmanipulationtraining