# hands-on

68 articles tagged with “hands-on”

Skill Verification Overview

Overview of timed skill verification labs for AI red teaming, including format, pass/fail criteria, and preparation guidance.

skill-verification, labs, hands-on, assessment

Intermediate

Skill Verification: Agent Exploitation (Assessment)

Timed skill verification lab: exploit an agent system to perform unauthorized actions within 25 minutes.

skill-verification, agent-exploitation, tool-abuse, hands-on

Advanced

Skill Verification: Defense Implementation

Timed skill verification lab: build a working guardrail system that passes automated attack tests within 45 minutes.

skill-verification, defense, guardrails, implementation, hands-on

Intermediate

Skill Verification: Jailbreaking

Timed skill verification lab: bypass safety measures on a defended AI system within 30 minutes using jailbreak techniques.

skill-verification, jailbreaking, safety-bypass, hands-on

Advanced

Skill Verification: Prompt Injection (Assessment)

Timed skill verification lab: extract a system prompt from a defended AI system within 15 minutes using prompt injection techniques.

skill-verification, prompt-injection, system-prompt, hands-on

Intermediate

Skill Verification: Reconnaissance

Timed skill verification lab: profile an unknown AI system in 20 minutes by identifying the model, extracting configuration, and mapping capabilities.

skill-verification, reconnaissance, profiling, hands-on

Intermediate

Skill Verification: Report Writing

Timed skill verification lab: write a professional AI red team finding report from provided evidence within 30 minutes.

skill-verification, reporting, documentation, professional-skills, hands-on

Intermediate

Lab: Exploring Embedding Spaces

Hands-on lab using Python to visualize embedding spaces, measure semantic similarity, and demonstrate how adversarial documents can be crafted to match target queries.

lab, embeddings, hands-on, python, intermediate

Intermediate

Lab: Audio Adversarial Examples

Hands-on lab for crafting adversarial audio perturbations that cause speech-to-text models and voice assistants to misinterpret spoken commands, demonstrating attacks on audio AI systems.

lab, audio, adversarial, multimodal, advanced, hands-on

Advanced

Lab: Cloud AI Security Assessment

Conduct an end-to-end security assessment of a cloud-deployed AI service, covering API security, model vulnerabilities, data handling, and infrastructure configuration.

lab, cloud, assessment, end-to-end, api-security, advanced, hands-on

Advanced

Lab: Custom Test Harness for Specific Applications

Build a tailored testing framework for a specific AI application, with custom attack generators, domain-specific evaluators, and application-aware reporting.

lab, custom-harness, testing-framework, domain-specific, advanced, hands-on

Advanced

Lab: Federated Learning Poisoning Attack

Hands-on lab for understanding and simulating poisoning attacks against federated learning systems, where a malicious participant corrupts the shared model through crafted gradient updates.

lab, federated-learning, poisoning, expert, hands-on

Expert

Lab: Purple Team Exercise

Simultaneously attack and defend an AI application in a structured exercise where red team findings immediately inform blue team defensive improvements.

lab, purple-team, attack-defense, collaborative, advanced, hands-on

Advanced

Lab: Transfer Attack Development (Advanced Lab)

Develop adversarial attacks on open-source models that transfer to closed-source models, leveraging weight access for black-box exploitation.

lab, transfer-attacks, adversarial, cross-model, advanced, hands-on

Advanced

Lab: Build Your First Defense (Beginner Lab)

Create a simple input filter that blocks common prompt injection patterns, then test it against the attack techniques you have learned in previous labs.

Lab: Model Comparison

Lab: Context Manipulation

Lab: Defense Bypass Basics

Lab: Delimiter Escape Attacks

Lab: Ethical Red Teaming

Lab: Your First Prompt Injection

Lab: Your First Jailbreak

Lab: Garak Setup and First Scan

Lab: Injection Detection Tool

Lab: Injection Techniques Survey

Lab: Instruction Following Priority

Lab: Multi-Language Injection

Lab: Output Format Exploitation

Lab: Output Steering

Lab: Payload Crafting

Lab: Prompt Leaking Basics

Lab: Promptfoo Setup and First Eval

Lab: PyRIT Setup and First Attack

Lab: Role-Play Attacks

Lab: Mapping Safety Boundaries

Lab: System Prompt Override

Lab: Adversarial Suffix Optimization

Lab: Alignment Stress Testing

Lab: Build Agent Security Scanner

Lab: Build an AI Fuzzer

Lab: Build Behavior Diff Tool

Lab: Build Guardrail Evaluator

Lab: Build Jailbreak Automation

Lab: Emergent Capability Probing

Lab: Full-Stack AI Exploitation

Lab: Computer Use Agent Exploitation

Lab: Deploy Honeypot AI

Lab: Multi-Agent Attack Coordination

Lab: Novel Jailbreak Research

Lab: ML Pipeline Poisoning

Lab: Exploiting Quantized Model Weaknesses

Lab: Model Registry Compromise

Lab: RLHF Reward Hacking

Lab: Create a Safety Benchmark

Lab: AI Watermark Detection & Removal

Labs & Hands-On Practice

Lab: Automated Red Team Testing

Lab: Data Exfiltration Channels (Intermediate Lab)

Lab: Defense Effectiveness Testing

Lab: Indirect Prompt Injection

Lab: Multimodal Injection (Intermediate Lab)

Lab: Supply Chain Audit

Lab: Crafting Audio Adversarial Examples

Lab: Video Model Adversarial Attacks

Lab: Crafting Image-Based Injections

Lab: Attacking Federated Learning

Lab: Exploiting Quantized Models

Lab: Poisoning a Training Dataset
