# behavior
3 articles tagged with “behavior”
Instruction Following as Attack Surface
Why the instruction-following capability of LLMs is inherently an attack surface.
foundations, instruction-following, attack-surface, behavior
Emergent Behavior Exploitation
Identify and exploit emergent behaviors in frontier models that arise from scale and are not present in smaller models.
exploitation, lab, expert, behavior, emergent, labs
Model Behavior Monitoring Setup
Set up comprehensive model behavior monitoring to detect drift, anomalies, and potential compromise.
defense, monitoring, model, behavior, walkthroughs