# guardrails

Build an automated framework for evaluating AI guardrails and safety filters. Test input filters, output classifiers, content moderation systems, and defense-in-depth architectures for coverage gaps and bypass vulnerabilities.

lab, expert, guardrails, evaluation, defense-testing, hands-on

Expert

Lab: AWS Bedrock Guardrails Testing

Hands-on lab on systematically testing and bypassing AWS Bedrock's built-in guardrails, including content filters, denied topics, and word filters.

lab, cloud, aws, bedrock, guardrails, cloud-ai

Intermediate

Lab: Defense Effectiveness Measurement

Hands-on lab on quantifying AI guardrail robustness using attack success rates, evasion metrics, false positive rates, and statistical analysis of defense performance.

lab, defense, metrics, measurement, guardrails, intermediate

Intermediate

Lab: Defense Effectiveness Testing

Systematically test and measure the robustness of AI guardrails using a structured methodology, metrics, and repeatable test suites.

lab, defense-testing, guardrails, metrics, intermediate, hands-on

Intermediate

Guardrail Fingerprinting

Systematically map the rules and thresholds of input/output guardrail systems.

labs, guardrails, fingerprinting, intermediate

Intermediate

Guardrail Latency-Based Detection

Use timing side channels to identify and characterize guardrail implementations in LLM applications.

labs, guardrails, latency, detection, intermediate

Intermediate

Simulation: Build & Defend a Chatbot

Defense simulation where you build a chatbot with layered defenses, test it against a standardized attack suite, measure defense effectiveness, and iterate on weaknesses.

simulation, defense, chatbot, guardrails, blue-team

Advanced

Simulation: Guardrail Engineering

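Defense simulation where you design and implement a multi-layer guardrail system, test it against progressively sophisticated attacks, and document false positive/negative rates.

simulation, defense, guardrails, engineering, blue-team

Advanced

Defense Evasion

Advanced techniques for bypassing the safety filters, content classifiers, guardrails, and detection systems deployed to protect large language model applications.

defense-evasion, filter-bypass, guardrails, safety-filters, advanced

Expert

Defense Bypass Quick Reference

Quick reference card of common AI defense mechanisms and their known bypass techniques, organized by defense type.

reference, cheat-sheet, defense-bypass, guardrails

Intermediate

Deploying NeMo Guardrails

Step-by-step walkthrough for setting up NVIDIA NeMo Guardrails in production, covering installation, Colang configuration, custom actions, topical and safety rails, testing, and monitoring.

nemo-guardrails, nvidia, guardrails, colang, defense, walkthrough

Intermediate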

Setting Up AI Guardrails

Step-by-step walkthrough for implementing AI guardrails: input validation with NVIDIA NeMo Guardrails, prompt injection detection with rebuff, output filtering for PII and sensitive data, and content policy enforcement.

guardrails, nemo, input-validation, output-filtering, pii-detection, content-policy, walkthrough

Intermediate

Building Input Guardrails for LLM Applications

Step-by-step walkthrough for implementing production-grade input guardrails that protect LLM applications from prompt injection, content policy violations, and resource abuse through multi-layer validation, classification, and rate limiting.

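guardrails, input-validation, prompt-injection-defense, content-safety, defense, walkthrough

Intermediate

Defense Implementation Walkthrough

Step-by-step guide to implementing AI security defenses: guardrail configuration, monitoring and detection setup, and incident response preparation for AI systems.

defense, guardrails, monitoring, incident-response, implementation, walkthrough

Intermediate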

Response Boundary Enforcement

Step-by-step walkthrough for keeping LLM responses within defined topic, format, and content boundaries, covering boundary definition, violation detection, response rewriting, and boundary drift monitoring.

response-boundaries, output-filtering, content-policy, guardrails, defense, walkthrough

Intermediate

Function Calling Guardrails Implementation

Implement guardrails for function calling that validate tool selection, parameters, and execution scope.

walkthroughs, defense, function-calling, guardrails

Intermediate

AWS Bedrock Red Team Walkthrough

Complete guide to red teaming AWS Bedrock deployments: testing guardrail bypass techniques, knowledge base data exfiltration, agent prompt injection, model customization abuse, and CloudTrail evasion.

aws, bedrock, red-team, guardrails, knowledge-base, agents, walkthrough

Intermediate

AWS Bedrock Red Team Walkthrough (Platform Walkthrough)

End-to-end walkthrough for red teaming AI systems on AWS Bedrock: setting up access, invoking models via the Converse API, testing Bedrock Guardrails, exploiting knowledge bases, and analyzing CloudTrail logs.

aws, bedrock, cloud, guardrails, knowledge-base, cloudtrail, walkthrough

Intermediate

NeMo Guardrails Walkthrough

End-to-end walkthrough of NVIDIA NeMo Guardrails: installation, Colang configuration, dialog flow design, integration with LLM applications, and red team bypass testing techniques.

nemo, guardrails, colang, nvidia, dialog-flows, bypass-testing, walkthrough

Intermediate

# guardrails

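Advanced Cloud AI Security Assessment

Defense Fundamentals Assessment

Defense and Mitigation Assessment

Guardrail Implementation Assessment

Skill Verification: Defense Implementation

Skill Verification: Guardrail Bypass

Capstone: LLM Firewall

Capstone Project: Defense System Implementation

AWS Bedrock Guardrails Red Team Testing

Multi-Cloud AI Security Comparison Matrix

Defense Challenge: Building Unbreakable Guardrails

Adaptive Guardrail Systems

Defense Effectiveness Benchmarking

Protection Mechanisms and Safety Layer Architecture

NVIDIA NeMo Guardrails

Guardrail Framework Comparison 2025

Adaptive Guardrail Systems

Defense and Mitigation

Adaptive Guardrail Systems

Lab: Systematic Guardrail Bypass

Adaptive Guardrail Systems

AI Defense Landscape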

Lab: Chaining Guardrail Bypasses

CTF: Defense Challenge (Blue Team)

Guardrail Olympics: Multi-Framework Bypass

Guardrail Speedrun: Fastest Bypass Challenge

Lab: Build Guardrail Evaluator

Lab: AWS Bedrock Guardrails Testing

Lab: Defense Effectiveness Measurement

Lab: Defense Effectiveness Testing

Guardrail Fingerprinting

Guardrail Latency-Based Detection

Simulation: Build & Defend a Chatbot

Simulation: Guardrail Engineering

Defense Evasion

Defense Bypass Quick Reference

Deploying NeMo Guardrails

Setting Up AI Guardrails

Building Input Guardrails for LLM Applications

Defense Implementation Walkthrough

Response Boundary Enforcement

Function Calling Guardrails Implementation

AWS Bedrock Red Team Walkthrough

AWS Bedrock Red Team Walkthrough (Platform Walkthrough)

NeMo Guardrails Walkthrough
