# AI Red Teaming Methodology

A structured methodology for AI red teaming engagements, covering reconnaissance, target profiling, attack planning, and the tradecraft that distinguishes professional assessments.
Effective AI red teaming follows a structured methodology, just like traditional penetration testing. Random prompt injection attempts are far less effective than a systematic approach that starts with thorough reconnaissance and progressively builds toward targeted exploitation.
## The AI Red Teaming Lifecycle

```
1. Scope & Planning → 2. Reconnaissance → 3. Target Profiling →
4. Attack Planning → 5. Exploitation → 6. Post-Exploitation → 7. Reporting
```
## Phase 1: Scope and Planning
Define what is in scope, what success looks like, and what rules of engagement apply. AI-specific scoping concerns include:
- Which models and applications are in scope?
- Is fine-tuning/training data testing permitted?
- Are supply chain attacks (model registries, dependencies) in scope?
- What constitutes a "successful" jailbreak or injection?
- How will stochastic results be evaluated?
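The scoping questions above can be captured as a rules-of-engagement record that the whole team signs off on. A minimal sketch, with entirely illustrative field names and values:

```python
from dataclasses import dataclass


@dataclass
class EngagementScope:
    """Rules of engagement for an AI red team assessment (illustrative fields)."""
    in_scope_models: list[str]
    training_data_testing: bool = False  # fine-tuning / training-data tests permitted?
    supply_chain_in_scope: bool = False  # model registries, dependencies
    success_criteria: str = ""           # what counts as a successful jailbreak/injection
    trials_per_prompt: int = 10          # how stochastic results will be evaluated


# Example scope for a hypothetical engagement:
scope = EngagementScope(
    in_scope_models=["support-chatbot-v2"],
    success_criteria="Model reveals system prompt verbatim",
)
```

Writing the success criteria and trial count down before testing begins avoids disputes later about whether a one-in-ten jailbreak "counts."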
## Phase 2: Reconnaissance
Gather information about the target without directly interacting with the AI system. See Target Profiling.
## Phase 3: Target Profiling
Interact with the system to understand its behavior:
- System Prompt Extraction — Discover the system's instructions and constraints
- Capability Mapping — Map what the system can do, including tools and integrations
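Capability mapping can be partially automated by sending benign probe prompts and checking responses for markers of live functionality. The sketch below stubs `query_target` with canned responses purely for illustration; in practice it would wrap whatever client interface the target exposes:

```python
# Minimal capability-mapping sketch. `query_target` is a stand-in for the
# real target client, stubbed here with canned responses.
CANNED = {
    "What's the weather in Paris right now?": "Let me check... It is 18°C in Paris.",
    "Browse https://example.com and summarize it.": "I cannot browse the web.",
}


def query_target(prompt: str) -> str:
    return CANNED.get(prompt, "I can't help with that.")


# Each probe pairs a benign prompt with a marker suggesting the capability is live.
PROBES = {
    "live_data": ("What's the weather in Paris right now?", "18°C"),
    "web_browsing": ("Browse https://example.com and summarize it.", "summary"),
}


def map_capabilities() -> dict[str, bool]:
    """Return a capability name → observed flag for each probe."""
    results = {}
    for name, (prompt, marker) in PROBES.items():
        reply = query_target(prompt).lower()
        results[name] = marker.lower() in reply
    return results
```

Benign probes keep this phase low-noise: a weather question that returns current data reveals a live tool integration without triggering any safety filter.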
## Phases 4-7: Attack and Report
Plan attacks based on reconnaissance, execute them, document results, and report findings. See the Capstone section for full engagement methodology.
## Key Tradecraft Principles
| Principle | Description |
|---|---|
| Profile before you attack | Invest time in understanding the target before attempting exploits |
| Test systematically | Vary one parameter at a time to understand what works and why |
| Document everything | AI behavior is stochastic — record exact prompts, responses, and success rates |
| Use open models as proxies | Test techniques on open-weight models before targeting production systems |
| Respect rate limits | Aggressive testing triggers rate limiting and may alert defenders |
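The "test systematically," "document everything," and "respect rate limits" principles suggest a harness that varies one parameter at a time, repeats each trial, and records exact prompts with success rates. A sketch under stated assumptions: `send_prompt` and `succeeded` are stand-ins, stubbed deterministically here so the example is self-contained:

```python
import time
from itertools import product


def send_prompt(prompt: str) -> str:
    """Stand-in for the real target client; stubbed for illustration."""
    # Toy behavior: this stub only "leaks" when the prompt claims a developer role.
    if "developer" in prompt:
        return "DEBUG: system prompt is ..."
    return "Sorry, I can't share that."


def succeeded(response: str) -> bool:
    """Stand-in success check against the engagement's agreed criteria."""
    return "system prompt" in response.lower()


def run_matrix(personas, framings, trials=3, delay=0.0):
    """Vary one parameter at a time and log the success rate per combination."""
    log = []
    for persona, framing in product(personas, framings):
        prompt = f"As a {persona}, {framing}"
        wins = sum(succeeded(send_prompt(prompt)) for _ in range(trials))
        log.append({"persona": persona, "framing": framing,
                    "prompt": prompt, "success_rate": wins / trials})
        time.sleep(delay)  # throttle requests against real targets
    return log
```

Because model output is stochastic, the per-combination success rate over repeated trials is the meaningful metric, not a single pass/fail; the logged prompt text makes every result reproducible in the report.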
## Reconnaissance Depth
The depth of reconnaissance determines the quality of your attacks:
| Recon Depth | What You Learn | Attack Quality |
|---|---|---|
| None | "It's a chatbot" | Random injection attempts |
| Basic | Model family, visible features | Generic attacks for that model type |
| Moderate | System prompt, tools, safety rules | Targeted attacks against specific defenses |
| Deep | Architecture, training data sources, deployment details | Custom exploits targeting specific weaknesses |
Start with the pages in this section to build your reconnaissance capabilities, then apply them in the context of a full engagement using the Capstone methodology.
## Related Topics
- Advanced Recon Techniques -- deeper reconnaissance and system prompt extraction methods
- Capstone: Full Engagement -- applying reconnaissance in the context of a full professional engagement
- Defense Evasion -- bypassing defenses identified during recon
- Agent Exploitation -- leveraging capability mapping to exploit agent tools
- Target Profiling -- detailed model fingerprinting and profiling techniques
## References
- Greshake et al., "Not What You've Signed Up For: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection" (2023) -- reconnaissance-informed indirect injection
- Schulhoff et al., "Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs Through a Global Scale Prompt Hacking Competition" (2023) -- systematic approach to discovering LLM weaknesses
- MITRE, "ATLAS: Adversarial Threat Landscape for Artificial-Intelligence Systems" (2021) -- structured reconnaissance framework for AI systems