Skill Verification Overview
Overview of timed skill verification labs for AI red teaming, including format, pass/fail criteria, and preparation guidance.
Skill Verification Labs
Skill verification labs are timed, hands-on challenges that test your ability to apply AI red teaming knowledge under realistic conditions. Unlike the multiple-choice assessments that test knowledge, these labs test execution -- can you actually perform the techniques you have studied?
Format
Each lab defines:
- Objective -- a specific task you must accomplish
- Time limit -- a strict clock that simulates real engagement pressure
- Environment -- the target system setup and available tools
- Pass criteria -- specific, measurable outcomes that constitute a pass
- Fail conditions -- actions or outcomes that constitute a fail
Available Labs
| Lab | Difficulty | Time Limit | Skill Tested |
|---|---|---|---|
| Prompt Injection | Intermediate | 15 minutes | System prompt extraction |
| Reconnaissance | Intermediate | 20 minutes | AI system profiling |
| Jailbreaking | Advanced | 30 minutes | Safety bypass techniques |
| Agent Exploitation | Advanced | 25 minutes | Tool abuse and privilege escalation |
| Defense Implementation | Intermediate | 45 minutes | Guardrail engineering |
| Report Writing | Intermediate | 30 minutes | Professional documentation |
How to Use These Labs
Set Up Your Environment
Each lab specifies an environment setup. Prepare your local environment or sandbox before starting the timer. Setup time does not count against the lab clock.
Read the Full Brief
Read the entire lab description, including pass criteria and fail conditions, before starting. Understanding what counts as a pass is as important as the technical execution.
Start the Timer
Use a visible countdown timer. Time pressure is a deliberate part of the assessment -- it tests your ability to work efficiently under constraints, a critical skill in real engagements.
Execute and Document
Complete the objective while documenting your steps. Most labs require evidence of your work (screenshots, logs, written output) as part of the pass criteria.
Self-Evaluate
When the timer expires, evaluate your work against the pass criteria. Be honest -- the goal is accurate self-assessment, not a passing grade.
Recommended Progression
- Prompt Injection and Reconnaissance -- foundational offensive skills
- Defense Implementation and Report Writing -- foundational professional skills
- Jailbreaking -- advanced offensive technique
- Agent Exploitation -- advanced compound technique
Tips for Success
- Practice the techniques before attempting timed labs. The labs test execution speed, not learning. You should already know the techniques.
- Prepare your tooling. Have your prompt library, scripts, and documentation templates ready before starting.
- Manage your time. If an approach is not working after one-third of the time limit, switch techniques.
- Document as you go. Do not leave documentation for the end -- you may run out of time.
- Read the fail conditions. Some labs have specific actions that constitute an automatic fail regardless of whether you achieve the objective.