# agent-security

20 articlestagged with “agent-security”

Link-Based Exfiltration

Using hyperlinks, redirects, or URL parameters to exfiltrate data from AI systems through markdown links, tool-generated URLs, and API callback exploitation.

exfiltrationlinksdata-theftURL-manipulationagent-security

Intermediate

Markdown Image Injection

Injecting markdown image tags with attacker-controlled URLs to exfiltrate conversation data via HTTP image requests.

exfiltrationmarkdownimage-injectiondata-theftagent-security

Intermediate

Permission Boundary Bypass

Escalating from limited to elevated permissions in AI agent systems through scope creep, implicit permission inheritance, and capability confusion.

privilege-escalationpermissionsagent-securityboundariesred-teaming

Advanced

LangChain Security Deep Dive (Agentic Exploitation)

Comprehensive security analysis of LangChain and LangGraph, covering dangerous defaults, chain composition attacks, callback exploitation, community tool risks, and agent executor vulnerabilities.

langchainlanggraphchain-compositioncallbackscommunity-toolsagent-security

Advanced

Case Study: MCP Tool Poisoning Attacks (Invariant Labs 2025)

Analysis of tool poisoning vulnerabilities in the Model Context Protocol (MCP) discovered by Invariant Labs, where malicious tool descriptions manipulate AI agents into data exfiltration and unauthorized actions.

case-studiesmcptool-poisoninginvariant-labsagent-securityprompt-injection

Advanced

CaMeL & Dual LLM Pattern

Architectural defense patterns that separate trusted and untrusted processing: Simon Willison's Dual LLM concept and Google DeepMind's CaMeL framework for defending tool-using AI agents against prompt injection.

dual-llmcamelprompt-injection-defenseagent-securityarchitecturetool-use

Intermediate

A2A Trust Boundary Attack

Advanced walkthrough of exploiting trust boundaries between agents in multi-agent systems using the Agent-to-Agent (A2A) protocol.

a2atrust-boundarymulti-agentagent-securityprotocol-attackwalkthrough

Advanced

Agent Context Overflow

Walkthrough of overflowing agent context windows to push safety instructions out of the LLM's attention, enabling bypasses of system prompts and guardrails.

context-overflowcontext-windowagent-securityattention-manipulationwalkthrough

Intermediate

Agent Loop Hijacking

Advanced walkthrough of hijacking agentic loops to redirect autonomous agent behavior, alter reasoning chains, and achieve persistent control over multi-step agent workflows.

agent-loophijackingagent-securityreasoning-chainagentic-aiwalkthrough

Advanced

Agent Persistence via Memory

Advanced walkthrough of using agent memory systems to create persistent backdoors that survive restarts, updates, and session boundaries.

agent-persistencebackdoormemory-attacksagent-securitylong-term-compromisewalkthrough

Advanced

Callback Abuse in MCP

Advanced walkthrough of abusing MCP callback mechanisms for unauthorized actions, data exfiltration, and privilege escalation in agent-tool interactions.

mcpcallback-abusemodel-context-protocolagent-securityexfiltrationwalkthrough

Advanced

Function Calling Parameter Injection

Walkthrough of manipulating function call parameters through prompt-level techniques, injecting malicious values into LLM-generated API calls.

function-callingparameter-injectionapi-securityagent-securitywalkthrough

Intermediate

MCP Tool Shadowing

Advanced walkthrough of creating shadow tools that override legitimate MCP (Model Context Protocol) tools, enabling interception and manipulation of agent-tool interactions.

mcptool-shadowingmodel-context-protocolagent-securitytool-poisoningwalkthrough

Advanced

Memory Poisoning Step by Step

Walkthrough of persisting injection payloads in agent memory systems to achieve long-term compromise of LLM-based agents.

memory-poisoningagent-memorypersistenceinjectionagent-securitywalkthrough

Intermediate

Multi-Agent Prompt Relay

Advanced walkthrough of relaying prompt injection payloads across multiple agents in a pipeline, achieving cascading compromise of multi-agent systems.

multi-agentprompt-relayinjection-chainagent-pipelineagent-securitywalkthrough

Advanced

Orchestrator Manipulation

Advanced walkthrough of attacking the orchestrator layer in multi-agent systems to gain control over task delegation, agent coordination, and system-wide behavior.

orchestratormulti-agenttask-delegationagent-securitycoordination-attackwalkthrough

Advanced

Plugin Confusion Attack

Walkthrough of confusing LLM agents about which plugin or tool to invoke, causing them to call the wrong tool or pass data to unintended destinations.

plugin-confusiontool-selectionagent-securitymisdirectionwalkthrough

Intermediate

Tool Call Injection

Step-by-step walkthrough of injecting malicious parameters into LLM tool and function calls to execute unauthorized actions in agent systems.

tool-callingfunction-callinginjectionagent-securitywalkthrough

Intermediate

Sandboxing and Permission Models for Tool-Using Agents

Walkthrough for implementing sandboxing and permission models that constrain tool-using LLM agents, covering least-privilege design, parameter validation, execution sandboxes, approval workflows, and audit logging.

sandboxingtool-usepermissionsagent-securityleast-privilegedefensewalkthrough

Advanced

Security Testing LangChain Applications

Step-by-step walkthrough for identifying and exploiting security vulnerabilities in LangChain-based applications, covering chain injection, agent manipulation, tool abuse, retrieval poisoning, and memory extraction attacks.

langchainsecurity-testingagent-securitychain-injectionrag-securitywalkthrough

Intermediate