# architecture

architecturesectionassessmentsllmassessment

LLM 架構安全評估

評估涵蓋transformer internals、tokenization 安全、attention 漏洞、model-level 攻擊。

assessmentsdefensearchitectureexam

Defense Architecture 評估

評估涵蓋designing 防禦-in-depth architectures的different AI application types。

capstonedefensearchitecturedesign

Capstone:防禦架構設計

Capstone 演練:為 LLM 驅動應用設計並驗證縱深防禦架構。

雲端 AI 網路安全架構

雲端 AI 部署的網路安全架構,包括 VPC 設計、端點與流量檢查。

cloudnetworkarchitecture

dual-llmcamelprompt-injection-defenseagent-securityarchitecturetool-use

CaMeL 與雙 LLM 模式

分離可信與不可信處理的架構式防禦模式:Simon Willison 的雙 LLM 概念與 Google DeepMind 的 CaMeL 框架,用於防禦使用工具的 AI 代理免於提示詞注入。

defensecontent-filteringarchitecturemoderation

內容過濾架構

為 LLM 應用設計涵蓋輸入、輸出與上下文過濾的內容過濾系統。

defensedefense-in-deptharchitecturellm-apps

LLM 應用的縱深防禦

為正式 LLM 應用實作分層防禦架構。

guardrailsarchitecturesafety-layerspre-processingpost-processingbypass

防護機制與安全層架構

防護系統在架構上如何設計，包括前置處理、推論中處理與後置處理層、常見設計模式，以及各層可被繞過之處。

defenseinput-validationarchitecturesecurity

LLM 的輸入驗證架構

設計在提示詞注入到達模型前偵測並中和的輸入驗證管線。

loggingarchitectureprivacyretentioncomplianceintermediate

AI 日誌架構

AI 系統日誌應擷取什麼——提示詞、補全、延遲、符元、工具呼叫——以及儲存策略、保留政策與隱私考量。

defense-in-depthlayered-defensearchitecturesecurity-strategyllm-security

LLM 應用之縱深防禦

AI 應用之分層防禦策略，涵蓋網路、應用、模型與輸出層，各層的貢獻，以及為何單層防禦必然失敗。

defensecontext-isolationpatternsarchitecture

LLM 應用上下文隔離模式

在 LLM 應用中將不受信任內容與受信任指令隔離的架構模式。

defensellm-firewallarchitecturedesign

LLM 防火牆架構設計

設計並實作檢查、過濾與記錄所有模型互動的 LLM 防火牆。

defensemulti-modelsafety-validationarchitecture

多模型安全驗證架構

以相互檢查架構運用多個模型交叉驗證輸入與輸出的安全性。

defensezero-trustarchitecture

LLM 應用的零信任架構

將零信任原則套用於 LLM 應用架構,涵蓋身分、裝置與資料平面安全。

exploit-devpayloadgeneratorarchitecture

載荷產生器架構

為 LLM 測試設計與實作自動化載荷產生系統,產出多樣且有效的對抗輸入。

custom-harnesspatternsarchitecturetools

自訂 Harness 樣式

為 LLM 漏洞評估打造自訂測試 harness 的設計樣式,涵蓋目標抽象、載荷傳遞、結果收集與 CI 整合。

ragretrievalarchitectureintermediate

RAG 架構：檢索系統如何運作

檢索增強生成管線之端到端解剖——文件攝入、分塊、embedding、索引、檢索、脈絡組裝與生成——含各階段之攻擊面分析。

foundationsllmsmethodologyembeddingsarchitectureadversarial-ml

基礎

AI 紅隊演練的核心建構區塊，涵蓋紅隊方法論、AI 景觀、大型語言模型如何運作、嵌入向量與向量系統、AI 系統架構，以及對抗性機器學習概念。

trust-boundariessecurityfoundationsarchitecture

大型語言模型信任邊界

理解大型語言模型應用中的信任邊界：資料於何處跨越權限層級，以及原生信任強制的缺乏如何建立攻擊面。

llmtransformerarchitecturetrainingalignmentfoundations

大型語言模型如何運作

從安全視角理解大型語言模型——涵蓋 transformer 架構、分詞、注意力、訓練流程與安全對齊機制。

transformerattentionarchitectureintermediate

給攻擊者的 Transformer 架構

深入探討 transformer 架構——注意力、前饋層與殘差連接——從哪些元件可被利用的視角。

deploymentpatternsarchitectureintermediate

常見 AI 部署模式與攻擊面

部署模式之分析——聊天機器人、copilot、自主代理、批次處理與微調模型——及其獨特之攻擊面與安全考量。

architecturesystemsdeploymentbeginner

紅隊視角的 AI 系統架構

AI 系統如何於生產環境部署——模型 API、提示範本、編排、工具、記憶體與防護機制——並於每一層進行攻擊面分析。

transformerarchitecturesecurityfoundations

Transformer 架構基礎（安全視角）

從安全視角理解 transformer 架構基礎：注意力、嵌入、生成如何建立可利用的特性。

frontier-researchsparse-attentionsecurityarchitecture

稀疏注意力的安全性

稀疏注意力機制的安全影響,包括攻擊向量與相應防禦。

infrastructuremulti-cloudsecurityarchitecture

多雲 ML 安全

跨多雲的 ML 系統安全考量,包含資料主權、跨雲認證與威脅偵測。

researchinjectiontransferarchitecture

跨架構注入轉移

研究注入技術如何跨越模型架構轉移,以及哪些架構特性決定轉移性。

architecturecrosstransferlabexpertlabs

Cross-Architecture 遷移實驗室

發展attacks on one model architecture that reliably transfer to fundamentally different architectures.

llmtransformersinternalsexploit-primitivesarchitecture

大型語言模型內部與利用原語

從安全研究員視角出發的大型語言模型架構概覽，涵蓋建立可利用攻擊面的關鍵元件。

modelsarchitecturecomparisonsafety

架構比較:安全性觀點

從安全性觀點比較各種 LLM 架構,涵蓋 Transformer、MoE 與混合模型。

geminigooglemultimodallong-contextarchitecturered-teaming

Gemini（Google）概觀

Google Gemini 模型家族的架構概觀，包括原生多模態設計、長上下文能力、Google 生態整合，以及對紅隊具意義的安全相關特性。

gpt-4openaiarchitecturemoered-teaming

GPT-4 / GPT-4o 概觀

OpenAI GPT-4 與 GPT-4o 模型的架構概觀，涵蓋傳聞中的 Mixture of Experts 設計、能力、API 介面，以及對紅隊具意義的安全相關特性。

model-securityred-teamingattack-surfacemethodologyarchitecture

模型深度剖析

為何模型特定知識對 AI 紅隊演練至關重要、不同架構如何產生不同的攻擊面,以及對任何新模型進行剖繪的系統化方法論。

multimodalfusionarchitecture

多模態融合攻擊

在多模態 AI 系統的模態融合階段進行的對抗性攻擊。

vlmarchitecturevision-encodermultimodal

VLM 架構與視覺—語言對齊

深入探討 VLM 架構，包括 CLIP、SigLIP 與 vision transformers。圖像 patch 如何變成 token、對齊訓練，以及錯位（misalignment）如何製造可利用之缺口。

architecturequantizationdistillationkv-cachemoecontext-window

架構層級攻擊

鎖定模型架構最佳化的攻擊——涵蓋量化利用、蒸餾攻擊、KV 快取攻擊、MoE 路由操控與上下文視窗利用。

moemixture-of-expertsroutingexpert-selectionload-balancingarchitecture

MoE 路由操弄

攻擊 Mixture-of-Experts 路由：專家選擇操弄、負載平衡利用、安全專家繞過，以及路由感知對抗輸入。

trainingpre-trainingfine-tuningarchitecturedata-poisoningrlhfalignment

訓練管線安全

完整 AI 模型訓練管線的安全，涵蓋預訓練攻擊、微調與對齊操控、架構層級漏洞與進階訓練期威脅。

walkthroughsdefensedefense-in-deptharchitecture

Defense-in-Depth Architecture for LLM Apps

Design and implement a complete defense-in-depth architecture for production LLM applications.

dual-llmarchitecturevalidationdefense-in-depthdefensewalkthrough

雙 LLM 架構設置

實作雙 LLM 模式的逐步演練——一個模型產生回應、另一個模型驗證之,涵蓋架構設計、驗證者提示詞工程、延遲最佳化與失敗處理。

walkthroughsdefensellm-firewallarchitecture

LLM 防火牆架構設計（防禦逐步指南）

設計並實作一個全面的 LLM 防火牆，具備輸入分析、輸出過濾與行為監控。

architecturesecuredefenseagentwalkthroughs

Secure Agent Architecture Design

Design a secure architecture for LLM agent systems with sandboxing, capability controls, and audit trails.

architecturesecuredefenseragwalkthroughs

Secure RAG Architecture 詳解

Design and implement a secure RAG architecture with document sanitization, access controls, and output validation.

architecturetrustdefensellmzerowalkthroughs

Zero Trust Architecture for LLM Apps

Implement zero trust principles in LLM application architecture with continuous verification and least privilege.