# rate-limiting
14 articles tagged with “rate-limiting”
- **Rate Limiting and Abuse Assessment**: Assessment of rate limiting bypass techniques, cost-based attacks, and billing abuse in AI services.
- **Rate Limiting and Abuse Prevention**: Implementing rate limiting and abuse prevention for LLM API endpoints and applications.
- **Rate Limiting, Sandboxing & Execution Controls**: Rate limiting strategies for AI APIs, sandboxing code execution with E2B and Docker, tool call approval workflows, and the principle of least privilege for AI agents.
- **AI API Abuse Detection**: Detecting and mitigating API abuse patterns targeting AI inference endpoints, including prompt extraction and model theft.
- **LLM API Security Testing**: Security testing methodology for LLM APIs, covering authentication, rate limiting, input validation, output filtering, and LLM-specific API vulnerabilities.
- **Advanced Rate Limiting Strategies for LLM API Endpoints**: Designing, attacking, and defending rate limiting systems for LLM inference APIs to prevent abuse, model extraction, and resource exhaustion.
- **Lab: Rate Limit Enumeration and Bypass**: Enumerate API rate limits and test common bypass techniques, including header manipulation and request distribution.
- **Basic Rate Limit Abuse Patterns**: Test common rate-limit bypass patterns, including header manipulation and endpoint discovery.
- **AI API Reverse Engineering**: Techniques for reverse engineering AI APIs, including mapping undocumented endpoints, parameter discovery, rate limit profiling, and extracting implementation details from API behavior.
- **API Rate Limit Bypass**: Techniques to bypass API rate limiting on LLM services, including header manipulation, distributed requests, authentication rotation, and endpoint discovery.
- **Rate Limiting and Abuse Prevention for LLM APIs**: Walkthrough for implementing rate limiting and abuse prevention systems for LLM API endpoints, covering token bucket algorithms, per-user quotas, cost-based limiting, anomaly detection, and graduated enforcement.
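The token bucket and cost-based limiting mentioned in the entry above can be sketched in a few lines. This is an illustrative sketch only, not code from the article; the `TokenBucket` class name and `cost` parameter are assumptions:

```python
import time

class TokenBucket:
    """Minimal token bucket: refill `rate` tokens per second, up to `capacity`."""

    def __init__(self, rate: float, capacity: float):
        self.rate = rate
        self.capacity = capacity
        self.tokens = capacity
        self.last = time.monotonic()

    def allow(self, cost: float = 1.0) -> bool:
        now = time.monotonic()
        # Refill proportionally to elapsed time, capped at capacity.
        self.tokens = min(self.capacity, self.tokens + (now - self.last) * self.rate)
        self.last = now
        if self.tokens >= cost:
            self.tokens -= cost
            return True
        return False

# Cost-based limiting: charge expensive LLM requests more tokens than cheap ones,
# e.g. by passing an estimated completion-token count as `cost`.
bucket = TokenBucket(rate=5, capacity=10)
bucket.allow()          # cheap request, costs 1
bucket.allow(cost=8)    # expensive request, charged by estimated usage
```

Per-user quotas then amount to keeping one bucket per API key.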
- **AI Rate Limiting Walkthrough**: Step-by-step walkthrough for implementing token-aware rate limiting for AI applications: request-level limiting, token budget enforcement, sliding window algorithms, abuse detection, and production deployment.
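The sliding window algorithms named in the entry above can be sketched as a sliding-window log. Again a hedged illustration, not the walkthrough's actual code; the class and parameter names are assumptions:

```python
import time
from collections import deque

class SlidingWindowLimiter:
    """Sliding-window log: allow at most `limit` requests per `window` seconds."""

    def __init__(self, limit: int, window: float):
        self.limit = limit
        self.window = window
        self.events: deque = deque()  # timestamps of accepted requests

    def allow(self) -> bool:
        now = time.monotonic()
        # Evict timestamps that have aged out of the window.
        while self.events and now - self.events[0] > self.window:
            self.events.popleft()
        if len(self.events) < self.limit:
            self.events.append(now)
            return True
        return False

limiter = SlidingWindowLimiter(limit=3, window=60)
print([limiter.allow() for _ in range(4)])  # [True, True, True, False]
```

Unlike a fixed window, this avoids the burst that occurs at window boundaries, at the cost of storing one timestamp per accepted request.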
- **Adaptive Rate Limiting for LLM APIs**: Implement adaptive rate limiting that adjusts thresholds based on detected attack patterns and user behavior.
- **AI API Red Team Engagement**: Complete walkthrough for testing AI APIs: endpoint enumeration, authentication bypass, rate limit evasion, input validation testing, output data leakage, and model fingerprinting through API behavior.