# instruction-following
6 articles tagged "instruction-following"
Instruction Following as Attack Surface
Why the instruction-following capability of LLMs is inherently an attack surface.
Lab: Instruction Following Priority
Test how language models prioritize conflicting instructions from system prompts, user messages, and embedded directives to understand the instruction hierarchy.
Lab: Instruction Following Measurement
Quantitatively measure instruction following compliance to identify where models prioritize competing instructions.
Instruction Following as Attack Surface
Why the instruction-following capability of LLMs is inherently an attack surface.
Lab: Instruction Following Priority
Test how language models prioritize conflicting instructions from system prompts, user messages, and embedded directives to understand the instruction hierarchy.
Lab: Instruction Following Measurement
Quantitatively measure instruction following compliance to identify where models prioritize competing instructions.