# automation

evaluationharnessautomationinfrastructure

Building Evaluation Harnesses

Design and implement evaluation harnesses for AI red teaming: architecture patterns, judge model selection, prompt dataset management, scoring pipelines, and reproducible evaluation infrastructure.

industry-verticalsminingresource-extractionautomation

礦業與資源開採 AI 安全

礦業 AI 的安全,包含自主設備、地質建模與安全系統。

ci-cdpipeline-securitymlopssupply-chainautomation

ML 管線 CI/CD 攻擊

針對 ML CI/CD 管線的攻擊,包含建置注入、Artifact 篡改與部署劫持。

jailbreakautomationPAIRTAPAutoDANred-teaming

自動化越獄流水線

運用 PAIR、TAP、AutoDAN 及自訂流水線架構建構自動化越獄系統,以進行系統化的 AI 安全評估。

injectionprompt-injectionjailbreaksmultimodalresearchautomation

注入研究

提示詞注入、越獄自動化與多模態攻擊向量的進階研究，涵蓋超越標準注入方法的尖端技術。

入門

越獄研究與自動化

越獄原語的分類法、漸強攻擊、many-shot 越獄，以及以 TAP 和 PAIR 進行自動化越獄產生。

jailbreakscrescendomany-shotskeleton-keyTAPPAIRautomation

laborchestrationautomationred-team-ops

實作：紅隊編排

建構 an orchestration system that coordinates multiple attack strategies simultaneously, managing parallel attack campaigns and synthesizing results into comprehensive risk assessments.

labregression-testingsafetyautomationci-cd

Lab: Safety Regression Testing at Scale

建構 automated pipelines that detect safety degradation across model versions, ensuring that updates and 微調 do not introduce new vulnerabilities or weaken existing protections.

labharnessautomationpython

實作：建構簡單測試框架

建構 a reusable Python test harness that automates sending test prompts, recording results, and calculating attack success metrics.

入門

Lab: Build Jailbreak Automation

建構 an automated 越獄 testing framework that generates, mutates, and evaluates attack prompts at scale. Covers prompt mutation engines, success classifiers, and campaign management for systematic red team testing.

labexpertautomationjailbreakframeworkhands-on

labautomationci-cdpromptfoopipelineintermediate

實驗室:自動化紅隊流水線

動手實驗室,主題為building a continuous AI red team testing pipeline using promptfoo,GitHub Actions,automated attack generation to catch safety regressions before deployment.

labllm-judgeevaluationautomation

實驗室: Building an LLM Judge Evaluator

動手實驗室,主題為building an LLM-based evaluator to score red team attack outputs,compare model vulnerability,lay the foundation for automated attack campaigns.

simulationdefensedefense-in-depthautomationblue-team

Simulation: 防禦 in Depth

專家-level defense simulation implementing a full defense stack including input filter, output monitor, rate limiter, anomaly detector, and circuit breaker, then measuring effectiveness against automated attacks.

ml-cicdpipeline-securitytraining-pipelinedeploymentautomationdevops

ML CI/CD 安全

ML 持續整合與部署管線的安全概觀：ML CI/CD 與傳統 CI/CD 的差異、訓練工作流程中的獨特攻擊面，以及自動化模型建構與部署的安全意涵。

professionaltoolsdevelopmentautomation

Developing Custom AI 紅隊工具s

指南 to designing, building, and maintaining custom tools for AI red team engagements.

professionalcontinuous-testingautomationproduction

正式環境 AI 系統的持續紅隊演練

為正式環境中的 AI 系統實施持續、自動化的紅隊演練計畫。

automationcartci-cdtoolingscalinghuman-in-the-loop

紅隊自動化策略

AI 紅隊演練何時與如何自動化:工具選擇、CI/CD 整合、持續自動化紅隊演練(CART)、人機迴圈設計,以及透過自動化擴展評估覆蓋率。

prompt-injectionautomationchainingorchestration

注入鏈自動化

自動化發掘並鏈結多種注入技術，建立對強化目標的可靠多步攻擊序列。

system-promptextractionprompt-injectionautomationdetectiontradecraft

系統提示擷取技術

針對 LLM 應用之系統提示擷取方法的目錄：直接攻擊、間接技術、多輪策略與規避偵測。

continuousautomationmetricskpiprogramred-teamtradecraftadvanced

Continuous 紅隊演練 Programs

Designing and operating ongoing AI red team programs with automated testing pipelines, metrics dashboards, KPI frameworks, alert-driven assessments, and integration with CI/CD and model deployment workflows.

walkthroughsdefensetestingautomation

Automated 防禦 Testing Pipeline

Build an automated pipeline that continuously tests defensive measures against evolving attack techniques.

continuous-testingci-cdautomationpipelineregression-testingmethodologywalkthrough

Setting Up Continuous AI 紅隊ing Pipelines

導覽 for building continuous AI red teaming pipelines that automatically test LLM applications on every deployment, covering automated scan configuration, CI/CD integration, alert thresholds, regression testing, and dashboard reporting.

test-planplanningmethodologytest-casesautomationwalkthrough

Developing Comprehensive AI 安全 Test Plans

Step-by-step guide to developing structured test plans for AI red team engagements, covering test case design, automation strategy, coverage mapping, and execution scheduling.

counterfitadversarial-mlmicrosoftrobustness-testingautomationwalkthrough

Counterfit 導覽

Complete walkthrough of Microsoft's Counterfit adversarial ML testing framework: installation, target configuration, running attacks against ML models, interpreting results, and automating adversarial robustness assessments.

garakci-cdautomationgithub-actionsgitlab-ciwalkthrough

Integrating Garak into CI/CD Pipelines

中階 walkthrough on automating garak vulnerability scans within CI/CD pipelines, including GitHub Actions, Git實驗室 CI, threshold-based gating, result caching, and cost management strategies.

garakvulnerability-scanningprobesautomationci-cdwalkthrough

Garak End-to-End 導覽

Complete walkthrough of NVIDIA's garak LLM vulnerability scanner: installation, configuration, running probes against local and hosted models, interpreting results, writing custom probes, and CI/CD integration.

promptfooautomationred-teamevaluationci-cdwalkthrough

Automating 紅隊 Evaluations with Promptfoo

Complete walkthrough for setting up automated red team evaluation pipelines using Promptfoo, covering configuration, custom evaluators, adversarial dataset generation, CI integration, and result analysis.

pythonautomationhttpxaiohttpreportingtest-harnesswalkthrough

Python 紅隊 Automation

Building custom AI red team automation with Python: test harnesses with httpx and aiohttp, result collection and analysis, automated reporting, and integration with existing tools like promptfoo and garak.

walkthroughstoolsreport-generationautomation

自動化紅隊報告生成

以測試資料與發現為輸入,自動生成結構化紅隊報告的系統。