# automation

practice-examexpertresearchautomationfine-tuningsupply-chainincident-response

Practice Exam 3: Expert Red Team

25-question expert-level practice exam covering research techniques, automation, fine-tuning attacks, supply chain security, and incident response.

assessmenttoolsframeworksautomationred-teaming-tools

Tool Proficiency Assessment

Test your knowledge of AI red teaming tools, frameworks, automation platforms, and their appropriate application in security assessments with 9 intermediate-level questions.

assessmentsskill-verificationautomationpractical

Skill Verification: Red Team Automation

Practical verification of red team automation skills using Garak, PyRIT, and custom tooling.

study-guideadvancedresearchautomationforensics

Advanced Topics Study Guide

Study guide covering AI security research techniques, automation, forensics, emerging attack vectors, and tool development for advanced practitioners.

capstoneplatformred-teamingautomationtooling

Capstone: Build a Complete AI Red Teaming Platform

Design and implement a comprehensive AI red teaming platform with automated attack orchestration, vulnerability tracking, and collaborative reporting.

capstonetoolingautomationsecurity-scanneradvanced

Capstone: Build an AI Security Scanner

Design and implement an automated AI security testing tool that supports prompt injection detection, jailbreak testing, and output analysis.

cloudcomplianceautomation

Cloud AI Compliance Automation

Automating AI compliance checks and security assessments using cloud-native tools and policy-as-code approaches.

cloudsecretsrotationcredentialsautomation

Secrets Rotation for Cloud AI Deployments

Implementing automated secrets rotation strategies for API keys, model endpoint credentials, and service accounts used in cloud AI/LLM deployments across AWS, Azure, and GCP.

ci-cdpipeline-securityautomationdeploymentbuild-security

CI/CD Pipeline AI Risks

Security implications of integrating AI into CI/CD pipelines — covering AI-powered code generation in builds, automated testing risks, deployment decision manipulation, and pipeline hardening.

LLM Security Testing Automation

Building automated security testing pipelines for LLM applications using CI/CD integration and continuous scanning.

defenseautomationci-cd

exploit-devautomationframeworkorchestration

Attack Automation Framework

Building end-to-end attack automation frameworks that orchestrate reconnaissance, payload generation, execution, and result analysis.

automationcartfuzzingtestingexploit-dev

Red Teaming Automation

Frameworks and tools for automating AI red teaming at scale, including CART pipelines, jailbreak fuzzing, regression testing, and continuous monitoring.

custom-toolingexploit-devautomationpythonengineering

Building Custom Red Team Tools

Guide to building custom AI red teaming tools, including target-specific harnesses, result analysis pipelines, and integration with existing security workflows.

exploit-devchainmulti-stepautomation

Exploit Chain Builder

Building tools that automatically discover and chain multiple vulnerabilities into complete exploitation paths for complex LLM systems.

exploit-devtoolingautomationred-teamingmethodology

AI Exploit Development Overview

An introduction to developing exploits and tooling for AI red teaming, covering the unique challenges of building reliable attacks against probabilistic systems.

Beginner

Red Team Reporting Automation

Automating report generation from red team testing data and findings.

exploit-devreportingautomationtooling

cartcontinuousautomationpipelinetelemetryci-cdmonitoringred-teaming

Continuous Automated Red Teaming (CART)

Designing CART pipelines for ongoing AI security validation: architecture, test suites, telemetry, alerting, regression detection, and CI/CD integration.

c2infrastructureautomationtoolingpipelinescannerfuzzercobalt-strikemythicsliver

Red Team Infrastructure & Tooling

AI red team C2 frameworks, automated attack pipelines, custom scanner development, and integration with Cobalt Strike, Mythic, and Sliver.

exploit-devreportingtoolsautomation

Reporting Tool Development

Building automated reporting tools that transform raw test results into professional assessment reports with reproducible findings.

continuous-compliancemonitoringautomationdrift-detectionregulatory-tracking

Continuous Compliance Monitoring

Automated compliance monitoring for AI systems including continuous compliance checks, drift detection, regulatory change tracking, and integration with red team testing pipelines.

evaluationharnessautomationinfrastructure

Building Evaluation Harnesses

Design and implement evaluation harnesses for AI red teaming: architecture patterns, judge model selection, prompt dataset management, scoring pipelines, and reproducible evaluation infrastructure.

industry-verticalsminingresource-extractionautomation

Mining and Resource Extraction AI Security

AI security in mining operations including autonomous equipment, geological modeling, and safety systems.

ci-cdpipeline-securitymlopssupply-chainautomation

Attacking ML CI/CD Pipelines

Advanced techniques for compromising ML continuous integration and deployment pipelines, including pipeline injection, artifact tampering, training job hijacking, and exploiting the unique trust boundaries in automated ML workflows.

jailbreakautomationPAIRTAPAutoDANred-teaming

Automated Jailbreak Pipelines

Building automated jailbreak systems with PAIR, TAP, AutoDAN, and custom pipeline architectures for systematic AI safety evaluation.

injectionprompt-injectionjailbreaksmultimodalresearchautomation

Injection Research

Advanced research in prompt injection, jailbreak automation, and multimodal attack vectors, covering cutting-edge techniques that push beyond standard injection approaches.

Beginner

Jailbreak Research & Automation

Taxonomy of jailbreak primitives, crescendo attacks, many-shot jailbreaking, and automated jailbreak generation with TAP and PAIR.

jailbreakscrescendomany-shotskeleton-keyTAPPAIRautomation

laborchestrationautomationred-team-ops

Lab: Red Team Orchestration

Build an orchestration system that coordinates multiple attack strategies simultaneously, managing parallel attack campaigns and synthesizing results into comprehensive risk assessments.

labregression-testingsafetyautomationci-cd

Lab: Safety Regression Testing at Scale

Build automated pipelines that detect safety degradation across model versions, ensuring that updates and fine-tuning do not introduce new vulnerabilities or weaken existing protections.

labharnessautomationpython

Lab: Building a Simple Test Harness

Build a reusable Python test harness that automates sending test prompts, recording results, and calculating attack success metrics.

Beginner

Lab: Build Jailbreak Automation

Build an automated jailbreak testing framework that generates, mutates, and evaluates attack prompts at scale. Covers prompt mutation engines, success classifiers, and campaign management for systematic red team testing.

labexpertautomationjailbreakframeworkhands-on

labautomationci-cdpromptfoopipelineintermediate

Lab: Automated Red Team Pipeline

Hands-on lab for building a continuous AI red team testing pipeline using promptfoo, GitHub Actions, and automated attack generation to catch safety regressions before deployment.

labllm-judgeevaluationautomation

Lab: Building an LLM Judge Evaluator

Hands-on lab for building an LLM-based evaluator to score red team attack outputs, compare model vulnerability, and lay the foundation for automated attack campaigns.

simulationdefensedefense-in-depthautomationblue-team

Simulation: Defense in Depth

Expert-level defense simulation implementing a full defense stack including input filter, output monitor, rate limiter, anomaly detector, and circuit breaker, then measuring effectiveness against automated attacks.

ml-cicdpipeline-securitytraining-pipelinedeploymentautomationdevops

ML CI/CD Security

Security overview of ML continuous integration and deployment pipelines: how ML CI/CD differs from traditional CI/CD, unique attack surfaces in training workflows, and the security implications of automated model building and deployment.

professionaltoolsdevelopmentautomation

Developing Custom AI Red Team Tools

Guide to designing, building, and maintaining custom tools for AI red team engagements.

professionalcontinuous-testingautomationproduction

Continuous Red Teaming for Production AI Systems

Implementing ongoing, automated red teaming programs for AI systems in production environments.

automationcartci-cdtoolingscalinghuman-in-the-loop

Red Team Automation Strategy

When and how to automate AI red teaming: tool selection, CI/CD integration, continuous automated red teaming (CART), human-in-the-loop design, and scaling assessment coverage through automation.

prompt-injectionautomationchainingorchestration

Injection Chain Automation

Automating the discovery and chaining of multiple injection techniques to create reliable multi-step attack sequences against hardened targets.

system-promptextractionprompt-injectionautomationdetectiontradecraft

System Prompt Extraction Techniques

Catalog of system prompt extraction methods against LLM-powered applications: direct attacks, indirect techniques, multi-turn strategies, and defensive evasion.

continuousautomationmetricskpiprogramred-teamtradecraftadvanced

Continuous Red Teaming Programs

Designing and operating ongoing AI red team programs with automated testing pipelines, metrics dashboards, KPI frameworks, alert-driven assessments, and integration with CI/CD and model deployment workflows.

walkthroughsdefensetestingautomation

Automated Defense Testing Pipeline

Build an automated pipeline that continuously tests defensive measures against evolving attack techniques.

continuous-testingci-cdautomationpipelineregression-testingmethodologywalkthrough

Setting Up Continuous AI Red Teaming Pipelines

Walkthrough for building continuous AI red teaming pipelines that automatically test LLM applications on every deployment, covering automated scan configuration, CI/CD integration, alert thresholds, regression testing, and dashboard reporting.

test-planplanningmethodologytest-casesautomationwalkthrough

Developing Comprehensive AI Security Test Plans

Step-by-step guide to developing structured test plans for AI red team engagements, covering test case design, automation strategy, coverage mapping, and execution scheduling.

counterfitadversarial-mlmicrosoftrobustness-testingautomationwalkthrough

Counterfit Walkthrough

Complete walkthrough of Microsoft's Counterfit adversarial ML testing framework: installation, target configuration, running attacks against ML models, interpreting results, and automating adversarial robustness assessments.

garakci-cdautomationgithub-actionsgitlab-ciwalkthrough

Integrating Garak into CI/CD Pipelines

Intermediate walkthrough on automating garak vulnerability scans within CI/CD pipelines, including GitHub Actions, GitLab CI, threshold-based gating, result caching, and cost management strategies.

garakvulnerability-scanningprobesautomationci-cdwalkthrough

Garak End-to-End Walkthrough

Complete walkthrough of NVIDIA's garak LLM vulnerability scanner: installation, configuration, running probes against local and hosted models, interpreting results, writing custom probes, and CI/CD integration.

promptfooautomationred-teamevaluationci-cdwalkthrough

Automating Red Team Evaluations with Promptfoo

Complete walkthrough for setting up automated red team evaluation pipelines using Promptfoo, covering configuration, custom evaluators, adversarial dataset generation, CI integration, and result analysis.

pythonautomationhttpxaiohttpreportingtest-harnesswalkthrough

Python Red Team Automation

Building custom AI red team automation with Python: test harnesses with httpx and aiohttp, result collection and analysis, automated reporting, and integration with existing tools like promptfoo and garak.

walkthroughstoolsreport-generationautomation

Automated Red Team Report Generation

Build an automated system for generating structured red team reports from testing data and findings.