# skill-verification
標記為「skill-verification」的 78 篇文章
Assessments & Skill Verification
Comprehensive assessment suite for validating AI red teaming knowledge, including section assessments, practice exams, study guides, and hands-on skill verification exercises.
Skill Verification Overview
Overview of timed skill verification labs for AI red teaming, including format, pass/fail criteria, and preparation guidance.
Skill Verification: A2A Protocol Attacks
Practical skill verification for multi-agent trust boundary attacks and protocol exploitation.
Skill Verification: Agent Exploitation
Practical skill verification for agent and MCP exploitation techniques.
Skill Verification: Automated Red Teaming
Practical verification of automated attack generation using Garak, PyRIT, and Promptfoo.
Skill Verification: Cloud AI Security
Practical verification of cloud AI platform security assessment skills.
Skill Verification: Cloud AI Security (Assessment)
Hands-on verification of cloud AI service security assessment across AWS, Azure, and GCP.
Skill Verification: Defense Effectiveness Evaluation
Practical verification of skills in evaluating guardrails, classifiers, and monitoring systems.
Skill Verification: Defense Evaluation
Hands-on verification of ability to evaluate and bypass LLM defense mechanisms.
Skill Verification: Encoding and Obfuscation
Skill verification for Base64, Unicode, token smuggling, and encoding-based bypass techniques.
Skill Verification: Function Calling Attacks
Skill verification for schema injection, parameter manipulation, and result poisoning techniques.
Skill Verification: Governance and Compliance
Verification of skills in AI governance framework implementation, audit, and compliance assessment.
Skill Verification: AI Incident Response
Skill verification for AI-specific incident detection, analysis, containment, and recovery.
Skill Verification: AI System Lateral Movement
Skill verification for moving from compromised AI components to connected systems and data stores.
Skill Verification: MCP Exploitation
Hands-on skill verification for MCP transport attacks, tool description injection, and server impersonation.
Skill Verification: Agent Memory Attacks
Practical verification of memory poisoning, context manipulation, and cross-session persistence skills.
Skill Verification: Multimodal Attack Execution
Hands-on verification of image injection, audio manipulation, and cross-modal transfer attacks.
Skill Verification: Multimodal Attacks
Hands-on verification of multimodal attack capabilities across image, audio, and document modalities.
Skill Verification: Prompt Injection
Hands-on skill verification requiring live exploitation of prompt injection vulnerabilities.
Skill Verification: RAG & Data Attacks
Practical verification of RAG poisoning, embedding attacks, and data extraction techniques.
Skill Verification: Reasoning Model Attacks
Verification of skills in reasoning trace manipulation, chain-of-thought exploitation, and thinking-token attacks.
Skill Verification: Red Team Reporting
Practical assessment of red team report writing and finding communication skills.
Skill Verification: Advanced Report Writing
Verification of advanced red team report writing including executive summaries, technical details, and remediation.
Skill Verification: Tool Proficiency
Hands-on verification of proficiency with Garak, PyRIT, Promptfoo, and custom tooling.
Skill Verification: Training Pipeline Security
Skill verification for data poisoning, RLHF exploitation, and fine-tuning attack techniques.
Skill Verification: Agent Exploitation (Assessment)
Timed skill verification lab: exploit an agent system to perform unauthorized actions within 25 minutes.
Skill Verification: Defense Implementation
Timed skill verification lab: build a working guardrail system that passes automated attack tests within 45 minutes.
Skill Verification: Jailbreaking
Timed skill verification lab: bypass safety measures on a defended AI system within 30 minutes using jailbreak techniques.
Skill Verification: Prompt Injection (Assessment)
Timed skill verification lab: extract a system prompt from a defended AI system within 15 minutes using prompt injection techniques.
Skill Verification: Reconnaissance
Timed skill verification lab: profile an unknown AI system in 20 minutes by identifying the model, extracting configuration, and mapping capabilities.
Skill Verification: Report Writing
Timed skill verification lab: write a professional AI red team finding report from provided evidence within 30 minutes.
Skill Verification: Embedding Attacks
Practical verification of embedding and vector database attack capabilities.
Skill Verification: Fine-Tuning Attacks (Assessment)
Practical verification of fine-tuning attack capabilities including alignment removal and backdoor insertion.
Skill Verification: AI Forensics Investigation
Hands-on verification of AI forensics investigation capabilities with simulated incident scenarios.
Skill Verification: Governance Audit (Assessment)
Practical verification of AI governance audit skills against EU AI Act and NIST AI RMF requirements.
Skill Verification: Guardrail Bypass
Hands-on verification of guardrail bypass techniques across NeMo, LLM Guard, and custom implementations.
Skill Verification: MCP Exploitation (Assessment)
Hands-on verification of MCP server exploitation including tool poisoning and resource manipulation.
Skill Verification: Multi-Agent Testing
Hands-on verification of multi-agent system security testing capabilities.
Skill Verification: Red Team Automation
Practical verification of red team automation skills using Garak, PyRIT, and custom tooling.
評估與技能驗證
驗證 AI 紅隊知識的完整評估套件,包含章節評估、練習考試、學習指南與動手技能驗證練習。
技能驗證概覽
AI 紅隊計時技能驗證實驗室概覽,包含格式、通過/失敗標準與準備指引。
Skill Verification: A2A Protocol 攻擊s
Practical skill verification for multi-agent trust boundary attacks and protocol exploitation.
Skill Verification: 代理 利用ation
Practical skill verification for agent and MCP exploitation techniques.
Skill Verification: Automated 紅隊演練
Practical verification of automated attack generation using Garak, PyRIT, and Promptfoo.
Skill Verification: Cloud AI 安全
Practical verification of cloud AI platform security assessment skills.
Skill Verification: Cloud AI 安全 (評量)
Hands-on verification of cloud AI service security assessment across AWS, Azure, and GCP.
Skill Verification: 防禦 Effectiveness Evaluation
Practical verification of skills in evaluating guardrails, classifiers, and monitoring systems.
Skill Verification: 防禦 Evaluation
Hands-on verification of ability to evaluate and bypass LLM defense mechanisms.
Skill Verification: Encoding and Obfuscation
Skill verification for Base64, Unicode, token smuggling, and encoding-based bypass techniques.
Skill Verification: Function Calling 攻擊s
Skill verification for schema injection, parameter manipulation, and result poisoning techniques.
Skill Verification: Governance and Compliance
Verification of skills in AI governance framework implementation, audit, and compliance assessment.
Skill Verification: AI Incident Response
Skill verification for AI-specific incident detection, analysis, containment, and recovery.
Skill Verification: AI System Lateral Movement
Skill verification for moving from compromised AI components to connected systems and data stores.
Skill Verification: MCP 利用ation
Hands-on skill verification for MCP transport attacks, tool description injection, and server impersonation.
Skill Verification: 代理 記憶體 攻擊s
Practical verification of memory poisoning, context manipulation, and cross-session persistence skills.
Skill Verification: Multimodal 攻擊 Execution
Hands-on verification of image injection, audio manipulation, and cross-modal transfer attacks.
Skill Verification: Multimodal 攻擊s
Hands-on verification of multimodal attack capabilities across image, audio, and document modalities.
Skill Verification: 提示詞注入
Hands-on skill verification requiring live exploitation of prompt injection vulnerabilities.
Skill Verification: RAG & Data 攻擊s
Practical verification of RAG poisoning, embedding attacks, and data extraction techniques.
Skill Verification: Reasoning 模型 攻擊s
Verification of skills in reasoning trace manipulation, chain-of-thought exploitation, and thinking-token attacks.
Skill Verification: 紅隊 Reporting
Practical assessment of red team report writing and finding communication skills.
Skill Verification: 進階 Report Writing
Verification of advanced red team report writing including executive summaries, technical details, and remediation.
Skill Verification: 工具 Proficiency
Hands-on verification of proficiency with Garak, PyRIT, Promptfoo, and custom tooling.
Skill Verification: 訓練 Pipeline 安全
Skill verification for data poisoning, RLHF exploitation, and fine-tuning attack techniques.
技能驗證:代理利用(評量)
限時技能驗證實驗室:在 25 分鐘內透過操控工具使用利用代理系統執行未授權動作。
技能驗證:防禦實作
限時技能驗證實驗室:在 45 分鐘內建構通過自動化攻擊測試的可運作護欄系統。
技能驗證:越獄
限時技能驗證實驗室:在 30 分鐘內使用越獄技術繞過防禦 AI 系統的安全措施。
技能驗證:提示詞注入(評估)
計時技能驗證實驗室:使用提示詞注入技術在 15 分鐘內從防禦 AI 系統萃取系統提示詞。
技能驗證:偵察
計時技能驗證實驗室:在 20 分鐘內透過識別模型、萃取設定並描繪能力,剖析未知的 AI 系統。
技能驗證:報告撰寫
限時技能驗證實驗室:在 30 分鐘內從提供的證據撰寫專業 AI 紅隊發現報告。
Skill Verification: Embedding 攻擊s
Practical verification of embedding and vector database attack capabilities.
Skill Verification: Fine-Tuning 攻擊s (評量)
Practical verification of fine-tuning attack capabilities including alignment removal and backdoor insertion.
Skill Verification: AI Forensics Investigation
Hands-on verification of AI forensics investigation capabilities with simulated incident scenarios.
Skill Verification: Governance Audit (評量)
Practical verification of AI governance audit skills against EU AI Act and NIST AI RMF requirements.
Skill Verification: Guardrail Bypass
Hands-on verification of guardrail bypass techniques across NeMo, LLM Guard, and custom implementations.
Skill Verification: MCP 利用ation (評量)
Hands-on verification of MCP server exploitation including tool poisoning and resource manipulation.
Skill Verification: Multi-代理 Testing
Hands-on verification of multi-agent system security testing capabilities.
Skill Verification: 紅隊 Automation
Practical verification of red team automation skills using Garak, PyRIT, and custom tooling.