# capstone
54 articles tagged with "capstone"
- **Capstone: Design and Run an Adversarial ML Competition**
  Design, build, and operate a capture-the-flag-style adversarial ML competition with automated scoring, diverse challenge categories, and real-time leaderboards.
- **Capstone: Pentest an Agentic AI System End-to-End**
  Conduct a full penetration test of an agentic AI system with tool use, multi-step reasoning, and autonomous decision-making capabilities.
- **Capstone: Implement an AI Compliance Framework**
  Build a comprehensive AI compliance framework that maps security testing to regulatory requirements, including the EU AI Act, NIST AI RMF, and ISO 42001.
- **Capstone: Build an AI Incident Response System**
  Design and implement an incident response system purpose-built for AI security incidents, including prompt injection breaches, model manipulation, and data exfiltration through LLM applications.
- **Capstone: Build a Complete AI Red Teaming Platform**
  Design and implement a comprehensive AI red teaming platform with automated attack orchestration, vulnerability tracking, and collaborative reporting.
- **Capstone: Design and Implement an AI Safety Benchmark Suite**
  Build a comprehensive, reproducible benchmark suite for evaluating LLM safety across multiple risk dimensions, including toxicity, bias, hallucination, and adversarial robustness.
- **Capstone: Autonomous Agent Assessment**
  Capstone exercise: red team assessment of a fully autonomous agent system with multi-tool access.
- **Capstone: Autonomous Vehicle AI Security**
  Full-scope security assessment of an autonomous vehicle AI decision system covering perception manipulation, planning attacks, and safety override bypass.
- **Capstone: Code Assistant Assessment**
  Capstone exercise: security assessment of an AI code assistant with repository and CI/CD access.
- **Capstone: Custom Security Tool Development**
  Build a custom AI security testing tool from scratch, covering architecture design, module development, and integration with existing frameworks.
- **Capstone: Defense Architecture Design**
  Capstone exercise: design and validate a defense-in-depth architecture for an LLM-powered application.
- **Capstone: Educational AI Platform**
  Security assessment of an AI tutoring platform addressing content safety, student data privacy, and academic integrity.
- **Capstone: Design an Enterprise AI Security Program**
  Architect a comprehensive enterprise AI security program spanning governance, technical controls, risk management, and incident response for organizations deploying LLMs at scale.
- **Capstone: Enterprise RAG Assessment**
  Capstone exercise: complete red team assessment of an enterprise RAG system with role-based access.
- **Capstone: Financial AI Assessment**
  Capstone exercise: red team assessment of a financial AI advisor with regulatory compliance requirements.
- **Capstone: Full Chatbot Engagement**
  Complete capstone exercise: conduct a full red team engagement against a production-style chatbot system.
- **Capstone: Deep Assessment with Garak**
  Tool-specific capstone using Garak for comprehensive vulnerability scanning, including plugin development and custom probe creation.
- **Capstone: AI Governance Audit**
  Capstone exercise: conduct a full AI governance audit covering compliance, risk, and operational controls.
- **Capstone: Healthcare AI Assessment**
  Capstone exercise: security assessment of a healthcare AI system with HIPAA and patient safety requirements.
- **Capstone: AI Incident Response Drill**
  Capstone exercise: execute a complete AI incident response drill from detection through remediation.
- **Capstone: Legal AI Review System**
  End-to-end security assessment of an AI-powered legal document review system covering data confidentiality, output integrity, and adversarial manipulation.
- **Capstone: Build an LLM Firewall and Guardrails System**
  Design and implement a layered LLM firewall that inspects, filters, and enforces policies on both inputs and outputs of language model applications.
- **Capstone: Build an LLM Vulnerability Tracking Database**
  Design and implement a structured vulnerability tracking database for cataloging, scoring, and querying LLM-specific security weaknesses across models and deployments.
- **Capstone: Media Content AI Assessment**
  Capstone exercise: security assessment of a media content generation and moderation AI system.
- **Capstone: Medical AI System Assessment**
  Comprehensive red team assessment of a medical AI diagnostic system addressing patient safety, data privacy, and regulatory compliance.
- **Capstone: Conduct a Full Model Security Audit**
  Perform a comprehensive security audit of an LLM deployment covering model behavior, API security, data handling, access controls, and compliance alignment.
- **Capstone: Multi-Agent System Assessment**
  Capstone exercise: end-to-end security assessment of a multi-agent platform with MCP and A2A.
- **Capstone: Multi-Agent System Assessment (Capstone)**
  Assess the security of a complex multi-agent system with tool use, memory, and inter-agent communication, covering the full agentic attack surface.
- **Capstone: Build a Multimodal Attack Testing Suite**
  Design and implement a comprehensive testing suite for attacking multimodal AI systems across text, image, audio, and document modalities.
- **Capstone: Multimodal System Assessment**
  Capstone exercise: red team assessment of a multimodal AI system processing images, documents, and text.
- **Capstone: Security Audit of an Open-Source LLM**
  Conduct a comprehensive security audit of an open-source large language model, covering model weight integrity, safety alignment evaluation, supply chain verification, and adversarial robustness testing.
- **Capstone: Build a Prompt Injection Detection Scanner**
  Build a production-grade prompt injection scanner that combines static analysis, ML classification, and runtime monitoring to detect injection attacks across LLM applications.
- **Capstone: Continuous Testing with Promptfoo**
  Implement continuous AI security testing using Promptfoo, integrated into CI/CD pipelines for automated regression testing and safety validation.
- **Capstone: Full Engagement with PyRIT**
  Complete red team engagement using Microsoft PyRIT, covering attack strategy configuration, multi-turn orchestration, and automated scoring.
- **Capstone: Comprehensive RAG Security Assessment**
  Conduct a thorough security assessment of a Retrieval-Augmented Generation system, testing document poisoning, retrieval manipulation, context window attacks, and data exfiltration vectors.
- **Capstone: Retail AI Assessment**
  Complete capstone exercise: red team assessment of a retail AI system with a recommendation engine and chatbot.
- **Capstone: Supply Chain AI Security**
  Red team assessment of AI-driven supply chain optimization covering data poisoning, decision manipulation, and operational disruption.
- **Capstone: ML Supply Chain Audit**
  Capstone exercise: conduct a complete ML supply chain security audit for an organization.
- **Capstone: Build an AI Supply Chain Security Tool**
  Build a tool that scans, audits, and monitors the security of AI/ML supply chains, including model provenance, dependency integrity, and artifact verification.
- **Execution and Reporting**
  How to execute an AI red teaming engagement and deliver professional findings, including evidence collection, statistical reporting, and remediation guidance.
- **Full Engagement Methodology**
  A comprehensive methodology for conducting full AI red teaming engagements, integrating all techniques from previous sections into a structured professional assessment.
- **Engagement Planning and Scoping**
  How to plan and scope an AI red teaming engagement, including defining objectives, rules of engagement, success criteria, and methodology selection.
- **Capstone: Agentic System Red Team**
  Red team a multi-agent system with MCP servers, function calling, and inter-agent communication, producing an attack tree and comprehensive findings report.
- **Capstone: Cloud AI Security Assessment**
  Assess AI deployment security across AWS, Azure, and GCP cloud platforms, producing a comprehensive cloud AI security assessment report.
- **Capstone: Compliance Assessment Simulation**
  Conduct a simulated compliance assessment against the EU AI Act, NIST AI RMF, and ISO 42001, producing a comprehensive gap analysis report.
- **Capstone: Defense System Implementation**
  Build a complete AI defense stack with input filtering, output monitoring, guardrails, rate limiting, and logging, then evaluate it against automated attacks.
- **Capstone: Full Red Team Engagement**
  Scope, plan, execute, and report a complete AI red team engagement against a multi-component AI application, including chatbot, RAG, agent, and API layers.
- **Capstone: AI Incident Response Exercise**
  Respond to a simulated AI security incident through triage, investigation, containment, remediation, and post-mortem reporting.
- **Capstone: Open Source Contribution**
  Contribute to an open-source AI security project such as garak, PyRIT, or MITRE ATLAS, producing a merged PR or submitted issue with proof of concept.
- **Capstone: Training Pipeline Attack & Defense**
  Attack a model training pipeline through data poisoning and backdoor insertion, then build defenses to detect and prevent these attacks.
- **Capstone: Red Team Program Design**
  Design a complete AI red team program for a fictional enterprise, producing a comprehensive program charter document.
- **Capstone: Build an AI Security Scanner**
  Design and implement an automated AI security testing tool that supports prompt injection detection, jailbreak testing, and output analysis.
- **Capstone: Industry Vertical Deep Dive**
  Select an industry vertical, threat model the AI systems within it, and produce a sector-specific AI security testing guide.
- **Capstone: Vulnerability Research Project**
  Identify and responsibly disclose a novel AI vulnerability class, producing an advisory-format writeup, proof of concept, and MITRE ATLAS mapping.