# instruction-following
標記為「instruction-following」的 3 篇文章
指令遵循作為攻擊面
為何大型語言模型的指令遵循能力本質上即為攻擊面。
instruction-followingexploitationattack-surfacefoundations
Lab: Instruction Following Priority
測試 how 語言模型 prioritize conflicting instructions from 系統提示詞s, user messages, and embedded directives to understand the instruction hierarchy.
labinstruction-followingpriorityconflicting-instructionsbeginnerhands-on
實驗:指令遵循度量
量化衡量指令遵循的順從度,辨識模型在相互競爭的指令中如何排序。
labsinstruction-followingmeasurementintermediate