# security
294 articles tagged "security"
Agent & Agentic Exploitation
Security overview of autonomous AI agents, covering the expanded attack surface created by tool use, persistent memory, multi-step reasoning, and multi-agent coordination.
AutoGen Security Analysis
Security analysis of Microsoft's AutoGen framework for multi-agent conversation exploitation.
LangChain Security Deep Dive
Comprehensive security analysis of LangChain including known CVEs and exploitation patterns.
LlamaIndex Agents Security Analysis
Security assessment of LlamaIndex agent implementations including tool use, memory, and query pipeline vulnerabilities.
DSPy Security Analysis
Security analysis of the DSPy framework including prompt optimization exploitation and pipeline injection.
Haystack Pipeline Security Analysis
Security analysis of deepset Haystack RAG pipelines including component injection and data exfiltration.
smolagents Security Analysis
Security analysis of Hugging Face smolagents including code execution risks and tool trust boundaries.
MCP SSE Transport Security Analysis
Security analysis of Server-Sent Events transport in MCP including reconnection attacks and event injection.
MCP Transport Security Vulnerabilities
Analysis of security vulnerabilities in MCP transport layers including stdio, SSE, and HTTP streaming.
A2A Protocol Security Analysis
Security analysis of Google's Agent-to-Agent protocol including authentication, task delegation, and trust boundaries.
Advanced AI Security Practice Exam 1
Advanced practice exam covering agentic exploitation, training attacks, and frontier research.
Advanced AI Security Practice Exam 2
Second advanced practice exam focusing on multimodal, cloud, and pipeline security.
Agent Security Practice Exam
Practice exam focused on agentic AI security including MCP, A2A, function calling, and multi-agent threats.
Cloud AI Security Assessment
Assessment covering AWS Bedrock, Azure OpenAI, GCP Vertex AI security configurations and threats.
Code Generation Security Assessment
Assessment on code assistant exploitation, insecure code generation, and code review AI attacks.
Infrastructure Security Assessment
Assessment covering model serving, container security, API gateway hardening, and deployment pipeline threats.
LLMOps Security Assessment
Assessment covering model deployment security, monitoring, CI/CD pipeline hardening, and operational threats.
AI Infrastructure Security Assessment
Assessment covering model serving, API gateways, container security, and GPU isolation.
Skill Verification: Infrastructure Security
Hands-on verification of cloud and infrastructure security assessment skills for AI deployments.
Agent Security Study Guide
Comprehensive study guide for agent and agentic exploitation topics including MCP and A2A protocols.
Infrastructure Security Study Guide
Study guide for AI infrastructure security covering cloud, container, and deployment pipeline topics.
Multimodal Security Study Guide
Study guide for multimodal attack and defense topics covering image, audio, and document modalities.
Capstone: Conduct a Full Model Security Audit
Perform a comprehensive security audit of an LLM deployment covering model behavior, API security, data handling, access controls, and compliance alignment.
Capstone: Comprehensive RAG Security Assessment
Conduct a thorough security assessment of a Retrieval-Augmented Generation system, testing document poisoning, retrieval manipulation, context window attacks, and data exfiltration vectors.
Capstone: Build an AI Supply Chain Security Tool
Build a tool that scans, audits, and monitors the security of AI/ML supply chains including model provenance, dependency integrity, and artifact verification.
Domain-Specific AI Security
Overview of AI security challenges across industry verticals including healthcare, finance, autonomous vehicles, content moderation, education, and customer service. Domain-specific threat models, regulations, and testing approaches.
Notable AI Security Incidents
A comprehensive timeline and analysis of major AI security incidents, from Bing Chat jailbreaks to ChatGPT data leaks and agent exploitation in the wild. Root cause analysis and impact assessment for each incident.
LangChain & LlamaIndex Security
Security analysis of popular LLM orchestration frameworks. Common misconfigurations, known CVEs, insecure defaults, and hardening guides for LangChain, LlamaIndex, and related LLM application frameworks.
AWS Bedrock Agents Security
Security assessment of AWS Bedrock Agents including action groups, knowledge bases, and guardrail integration.
AWS Bedrock Security Guide
Comprehensive security guide for AWS Bedrock including guardrails, IAM policies, and model access controls.
AWS SageMaker Security Assessment
Security assessment of AWS SageMaker including model hosting, endpoint security, and notebook vulnerabilities.
Azure AI Studio Security Assessment
Security assessment of Azure AI Studio including prompt flow, model catalog, and deployment security.
Azure OpenAI Security Guide
Security guide for Azure OpenAI Service including content filtering, managed identity, and network isolation.
Network Isolation for Cloud AI Workloads
Implementing network isolation strategies for cloud AI deployments including private endpoints, VPC configurations, service mesh integration, and data plane segmentation for LLM inference and training workloads.
Cloud AI Prompt Caching Security
Security implications of prompt caching features in cloud AI services including cache poisoning and information leakage.
Cloud Model Endpoint Security
Securing model endpoints in cloud deployments including authentication, authorization, and traffic management.
GCP Model Garden Security
Security assessment of GCP Model Garden including model deployment, versioning, and access control.
GCP Vertex AI Security Guide
Security guide for GCP Vertex AI including model garden, endpoints, and Gemini API security.
Hugging Face Inference Endpoints Security
Security analysis of Hugging Face Inference Endpoints including model isolation and API security.
Multi-Cloud AI Security Strategy (Cloud AI Security)
Security strategy for organizations using AI services across multiple cloud providers.
Serverless AI Security Considerations
Security considerations for AI workloads running on serverless platforms including Lambda, Cloud Functions, and Azure Functions.
Autonomous Coding Agent Security
Security analysis of autonomous coding agents like Devin, including scope creep and unintended actions.
Jupyter Notebook AI Security
Security risks of AI-powered notebook features including code completion and execution.
Synthetic Data Security Risks
Security implications of using synthetic data for model training, including inherited biases, poisoning propagation, and privacy leakage.
Input Validation Architecture for LLMs
Designing input validation pipelines that detect and neutralize prompt injection before reaching the model.
LLM Monitoring and Anomaly Detection
Building monitoring systems that detect adversarial usage patterns in LLM applications.
MCP Server Security Hardening
Hardening MCP server implementations against tool poisoning, transport attacks, and privilege escalation.
Output Sanitization Patterns
Patterns for sanitizing LLM outputs to prevent information leakage and harmful content delivery.
RAG System Security Hardening
Comprehensive guide to hardening RAG systems against poisoning, injection, and data exfiltration.
Rate Limiting and Abuse Prevention
Implementing rate limiting and abuse prevention for LLM API endpoints and applications.
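The rate-limiting topic above can be illustrated with a minimal token-bucket sketch. This is an illustrative example only, not taken from the listed article; the class and parameter names are invented:

```python
import time

class TokenBucket:
    """Minimal token-bucket rate limiter, e.g. per API key on an LLM endpoint."""

    def __init__(self, capacity: int, refill_per_sec: float):
        self.capacity = capacity              # maximum burst size
        self.refill_per_sec = refill_per_sec  # sustained request rate
        self.tokens = float(capacity)
        self.last = time.monotonic()

    def allow(self, cost: float = 1.0) -> bool:
        """Return True if the request may proceed, charging `cost` tokens."""
        now = time.monotonic()
        # Refill proportionally to elapsed time, capped at capacity.
        self.tokens = min(self.capacity,
                          self.tokens + (now - self.last) * self.refill_per_sec)
        self.last = now
        if self.tokens >= cost:
            self.tokens -= cost
            return True
        return False

bucket = TokenBucket(capacity=3, refill_per_sec=0.5)
print([bucket.allow() for _ in range(5)])  # burst of 3 allowed, then denied
```

Production systems would track one bucket per tenant or key and typically weight `cost` by tokens generated, not just request count.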
Multi-Tenant Isolation for LLM Services
Implementing strong tenant isolation in multi-tenant LLM services to prevent cross-tenant attacks.
Model Merging Security Analysis
Security implications of model merging techniques (TIES, DARE, SLERP) including backdoor propagation and safety property degradation.
Prefix Tuning Security Analysis
Security implications of prefix tuning and soft prompt approaches, including vulnerability to extraction, manipulation, and adversarial optimization.
QLoRA Security Implications
Security implications of quantized LoRA fine-tuning including precision-related vulnerability introduction.
The AI API Ecosystem
A red teamer's guide to the AI API landscape — OpenAI, Anthropic, Google, AWS, Azure, open-source APIs, authentication patterns, and common security misconfigurations.
AI Deployment Patterns and Security Implications
How API-based, self-hosted, edge, and hybrid deployment patterns each create distinct security considerations and attack surfaces for AI systems.
Attention Mechanisms and Security
How attention mechanisms work and their role in enabling prompt injection attacks.
Deployment Patterns and Security
Common LLM deployment patterns (API, self-hosted, edge) and their distinct security properties and attack surfaces.
LLM Deployment Patterns and Security
Common LLM deployment patterns and their security implications including direct API, RAG, agent, and pipeline architectures.
LLM Security Threat Model
Comprehensive threat model for LLM-powered applications covering all attack surfaces and threat actors.
LLM Trust Boundaries
Understanding trust boundaries in LLM applications: where data crosses privilege levels and how the lack of native trust enforcement creates attack surfaces.
Tokenization & Its Security Implications
How BPE and SentencePiece tokenizers work, and how tokenizer behavior creates exploitable attack surfaces including boundary attacks, homoglyphs, and encoding tricks.
Tokenization and Its Security Implications
How tokenization works and why it creates security-relevant behaviors in language models.
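The homoglyph attack surface mentioned in the tokenization entries above can be sketched as a crude detector. This is a stand-in for illustration; real filters use a confusables table such as Unicode TR39 rather than a script-name check:

```python
import unicodedata

def find_homoglyphs(text: str) -> list[tuple[int, str, str]]:
    """Flag non-ASCII letters that commonly masquerade as Latin ones.

    Returns (index, char, unicode name) for each suspicious character.
    """
    hits = []
    for i, ch in enumerate(text):
        if ch.isalpha() and not ch.isascii():
            name = unicodedata.name(ch, "UNKNOWN")
            # Cyrillic/Greek letters inside otherwise-Latin text are
            # classic homoglyph carriers (e.g. Cyrillic 'а' for Latin 'a').
            if name.startswith(("CYRILLIC", "GREEK")):
                hits.append((i, ch, name))
    return hits

# Cyrillic 'а' (U+0430) hidden in an otherwise ASCII word:
print(find_homoglyphs("pаssword"))
```

To a reader the string looks like `password`, but the tokenizer sees a different byte sequence, which is exactly what makes homoglyphs useful for filter evasion.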
Code Generation Model Attacks
Overview of security risks in AI-powered code generation: Copilot, Cursor, code completion models, IDE integration attack surfaces, and code-specific exploitation techniques.
Security Implications of Emergent Capabilities
How emergent capabilities in frontier models create new and unpredictable security risks.
Model Merging Security Implications
Security analysis of model merging techniques and potential for backdoor propagation through merged models.
Mechanistic Interpretability for Security
Understanding model circuits to find vulnerabilities: feature identification, circuit analysis, attention pattern exploitation, and using mechanistic interpretability for offensive and defensive AI security.
Representation Engineering for Security (Frontier Research)
Using representation engineering for security analysis, behavior modification, and vulnerability detection.
Cross-Lingual Transfer and Security
Research on how cross-lingual transfer affects safety training and creates exploitable multilingual gaps.
Long-Context Window Security Research
Security research on vulnerabilities specific to models with extremely long context windows (1M+ tokens).
Model Collapse and Security Implications
Security implications of model collapse from training on AI-generated data in iterative training loops.
Neural Scaling Laws and Security Properties
How neural scaling laws affect the security properties of language models as they grow larger.
Sparse Attention Mechanism Security
Security implications of sparse and efficient attention mechanisms used in modern frontier models.
Machine Unlearning Security Research
Research on attacks against machine unlearning methods and verification of knowledge removal.
AI Security Frameworks Overview
Landscape of AI security frameworks including OWASP LLM Top 10, MITRE ATLAS, NIST AI RMF, and EU AI Act. How they relate, which to use when, and gap analysis.
Energy and Utilities AI Security
AI security in energy and utilities including grid management, predictive maintenance, and smart meters.
Government AI Security Requirements
Security requirements for AI systems in government settings including FedRAMP and classification considerations.
HR and Workforce AI Security
Security analysis of AI in HR including performance evaluation, workforce planning, and employee chatbots.
Manufacturing AI Security
Security considerations for AI in manufacturing including quality control, predictive maintenance, and robotics.
Telecommunications AI Security (Industry Verticals)
Security considerations for AI in telecommunications including network optimization, fraud detection, and customer service.
API Gateway Security for AI Services
Securing API gateways for AI services including authentication, rate limiting, and request validation.
Container Security for ML Workloads
Securing containerized ML workloads including Docker images, Kubernetes pods, and GPU isolation.
Distributed Training Security
Security considerations for distributed model training across multiple nodes and data centers.
Edge AI Deployment Security
Security challenges and mitigations for deploying AI models at the edge on resource-constrained devices.
GPU Cluster Security
Securing GPU clusters used for model training and inference against unauthorized access and data leakage.
Integration & Framework Security
Security analysis of AI integration frameworks including LangChain, LlamaIndex, and Semantic Kernel, covering common vulnerability patterns and exploitation techniques.
Deep Supply Chain Analysis
Comprehensive analysis of the AI supply chain dependency tree covering model weights, tokenizers, datasets, libraries, and infrastructure components with audit methodology.
ML Data Lake Security
Securing data lakes used for ML training data including access controls, encryption, lineage tracking, and poisoning prevention.
ML Experiment Infrastructure Security
Securing ML experimentation infrastructure including notebook servers, experiment trackers, and shared development environments.
ML Pipeline CI/CD Security
Securing ML training and deployment pipelines including GitHub Actions, Kubeflow, and MLflow.
Model Artifact Security
Securing model artifacts throughout the lifecycle including signing, verification, storage encryption, and tamper detection.
Model Registry Security
Securing model registries and artifact stores against tampering, poisoning, and unauthorized access.
Multi-Cloud ML Security
Security architecture for ML workloads spanning multiple cloud providers including identity federation, data sovereignty, and policy consistency.
Network Security for AI Deployments
Network security architecture for AI deployments including segmentation, encryption, and traffic analysis.
Serverless ML Security
Security considerations for serverless ML deployments including cold start attacks, function injection, and ephemeral storage risks.
Vector Database Security
Security hardening for vector databases including Pinecone, Weaviate, Chroma, and pgvector.
Attention Pattern Analysis for Security
Using attention maps to understand and exploit model behavior, identifying security-relevant attention patterns, and leveraging attention mechanics for red team operations.
Lab: Cloud AI Assessment
Hands-on lab for conducting an end-to-end security assessment of a cloud-deployed AI system including infrastructure review, API testing, model security evaluation, and data flow analysis.
Lab: Representation Engineering for Security
Use representation engineering to analyze and manipulate internal model representations for security research.
Lab: API Key Security
Learn common API key exposure vectors, secure key management with .env files, detect keys in git history, implement key rotation, and apply least-privilege principles.
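The key-detection step of the lab above can be sketched with pattern matching. The regexes here are rough approximations for illustration; real scanners such as gitleaks or trufflehog maintain vetted pattern sets plus entropy checks:

```python
import re

# Illustrative patterns only -- not exhaustive and not provider-verified.
KEY_PATTERNS = {
    "aws_access_key": re.compile(r"\bAKIA[0-9A-Z]{16}\b"),
    "openai_style": re.compile(r"\bsk-[A-Za-z0-9]{32,}\b"),
}

def scan_text(text: str) -> list[tuple[str, str]]:
    """Return (pattern_name, redacted match) for each suspected key."""
    findings = []
    for name, pattern in KEY_PATTERNS.items():
        for match in pattern.findall(text):
            findings.append((name, match[:8] + "..."))  # redact the tail
    return findings

sample = 'aws_key = "AKIAABCDEFGHIJKLMNOP"\n'
print(scan_text(sample))
```

Running the same scan over `git log -p` output is how leaked keys are found in history even after the offending line was deleted from HEAD.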
Embedding Basics for Security
Understand text embeddings and their security relevance by generating, comparing, and manipulating embedding vectors.
Model Security Comparison Lab
Compare the security posture of different LLM models by running identical test suites across providers.
Lab: Build Agent Security Scanner
Build an automated security scanner for agentic AI systems that detects vulnerabilities in tool use, permission handling, memory management, and multi-step execution flows. Cover agent-specific attack surfaces that traditional LLM testing misses.
Lab: Supply Chain Audit
Audit an ML project's dependencies for vulnerabilities, covering model files, Python packages, container images, and training data provenance.
Lab: ML Supply Chain Scan
Hands-on lab for auditing machine learning model dependencies, detecting malicious packages in ML pipelines, and scanning model files for backdoors and supply chain threats.
Code Review Assistant Assessment
Test a code review AI for vulnerabilities in code analysis, suggestion generation, and repository access.
Cyber Threat Intelligence AI Assessment
Red team a cyber threat intelligence AI that processes IOCs, threat reports, and attack attribution.
A/B Testing Security Implications
Security implications of A/B testing AI models including differential behavior exploitation.
AI Observability for Security
Using observability platforms to detect security anomalies in AI system behavior.
Continuous Training Security
Securing continuous and online learning systems against adversarial data injection and model drift manipulation.
Feature Store Security
Securing feature stores used in ML pipelines against poisoning and unauthorized access.
Kubernetes ML Operator Security
Security analysis of Kubernetes-based ML operators (KServe, Seldon, Ray) including privilege escalation, resource manipulation, and cross-tenant attacks.
ML Experiment Tracking Security
Securing experiment tracking systems like MLflow, Weights & Biases, and Neptune.
MLflow Security Assessment
Security assessment of MLflow deployments including tracking server vulnerabilities, artifact store exploitation, and model registry attacks.
Model Deployment Security
Security best practices for deploying LLMs to production environments.
Model Gateway Security Patterns
Security patterns for centralized model gateway deployments including authentication, authorization, and auditing.
Model Rollback Security
Security implications of model rollback procedures including exposure windows and state consistency.
Model Serving Security Hardening
Best practices for securing model serving infrastructure including endpoint hardening, authentication, rate limiting, and output validation.
Model Versioning Security
Securing model version management including rollback safety and version validation.
Prompt Management Security
Securing prompt templates, system prompts, and prompt management infrastructure.
Prompt Template Versioning Security
Securing prompt template version management against unauthorized modifications and injection.
Claude Architecture Security Analysis
Deep security analysis of Claude's architecture including extended thinking, tool use, and safety mechanisms.
Distillation Security Analysis
Security implications of knowledge distillation including backdoor transfer, capability extraction, and safety property degradation in student models.
Gemini Architecture Security Analysis
Deep security analysis of Gemini's native multimodal architecture and long-context capabilities.
GPT-4 Architecture Security Analysis
Deep security analysis of GPT-4's architecture including function calling, vision, and safety layers.
Phi Models Security Analysis
Security analysis of Microsoft's Phi small language model family including safety vs capability tradeoffs.
Qwen Architecture Security
In-depth security assessment of Alibaba's Qwen model family including architecture-specific vulnerabilities and cross-language attack surfaces.
Qwen Models Security Analysis
Security analysis of Alibaba's Qwen model family including multilingual safety considerations.
Yi Model Security Assessment
Security analysis of 01.AI's Yi models focusing on bilingual capabilities, training data implications, and comparative safety properties.
AI Security Training Program Design
Designing and delivering AI security training programs for development and security teams.
Vendor Selection for AI Security Tools
Framework for evaluating and selecting AI security testing tools and services.
LLM Security Checklist
Comprehensive security checklist for LLM-powered applications covering input validation, prompt hardening, output filtering, tool security, RAG pipelines, and incident response.
Model API Security Reference
Security reference for major model APIs including authentication, rate limits, and safety features.
Advanced OPSEC for AI Red Teams
Advanced operational security practices for AI red team engagements including traffic obfuscation, attribution prevention, and covert testing.
Model Merging Security Analysis (Training Pipeline)
Security analysis of model merging techniques and propagation of vulnerabilities through merged models.
Security of RLHF: Reward Hacking and Reward Model Attacks
Comprehensive analysis of security vulnerabilities in RLHF pipelines, including reward hacking, reward model poisoning, and preference manipulation attacks.
Transfer Learning Security Analysis
Security implications of transfer learning including inherited vulnerabilities and cross-domain attack transfer.
Model Hub Supply Chain Attack
Attacking the ML model supply chain through hub repositories like Hugging Face, including typosquatting, model poisoning, and repository manipulation techniques.
Model Serialization RCE
Remote code execution through malicious model files using pickle deserialization, safetensors manipulation, and other model serialization format vulnerabilities.
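The pickle risk described above can be demonstrated with a benign payload. The class and payload here are invented for illustration; a real attack would substitute `os.system` or similar for `eval`:

```python
import pickle

class NotAModel:
    # __reduce__ tells pickle which call to replay at load time.
    # Anything importable can be named here, with attacker-chosen args.
    def __reduce__(self):
        return (eval, ("6 * 7",))

blob = pickle.dumps(NotAModel())
result = pickle.loads(blob)  # the embedded call runs during load
print(result)  # → 42
```

Note that `pickle.loads` never needs the `NotAModel` class: the payload executes purely from the serialized bytes, which is why untrusted `.pkl`/`.pt` files are unsafe to load and why formats like safetensors store only tensors.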
Sandboxed Tool Execution
Step-by-step walkthrough for running LLM tool calls in isolated sandboxes, covering container-based isolation, resource limits, network restrictions, and output sanitization.
Session Isolation Patterns
Step-by-step walkthrough for isolating user sessions in LLM applications to prevent cross-contamination of context, memory, and permissions between users.
AI Security Threat Intelligence
Build a threat intelligence pipeline for staying current with AI security threats and attack techniques.
Full Engagement: AI Code Assistant
End-to-end engagement for assessing an AI-powered code assistant with repository access.
Full Engagement: AI Security Copilot
Red team engagement of an AI security copilot with access to SIEM, vulnerability scanners, and threat intelligence.
AI Security Metrics Framework
Framework for measuring and reporting on AI security posture using quantitative metrics.
Competitive Analysis of AI Security Tools
Methodology for evaluating and comparing AI security tools for red team operations.
AI Security Tabletop Exercises
Designing and facilitating tabletop exercises focused on AI security incident scenarios.
MCP Security Audit Tool
Build a tool for auditing MCP server implementations for common security vulnerabilities and misconfigurations.
Reasoning Model Security in 2026
How chain-of-thought reasoning models such as o1, o3, and DeepSeek-R1 are reshaping the AI security landscape, including new attack surfaces and new defense opportunities.
Capstone: Conduct a Full 模型 安全 Audit
Perform a comprehensive security audit of an LLM deployment covering model behavior, API security, data handling, access controls, and compliance alignment.
Capstone: Comprehensive RAG 安全 評量
Conduct a thorough security assessment of a Retrieval-Augmented Generation system, testing document poisoning, retrieval manipulation, context window attacks, and data exfiltration vectors.
Capstone: Build an AI Supply Chain 安全 工具
Build a tool that scans, audits, and monitors the security of AI/ML supply chains including model provenance, dependency integrity, and artifact verification.
重大 AI 安全事件
重大 AI 安全事件之完整時間軸與分析,自 Bing Chat jailbreak 到 ChatGPT 資料洩漏與實際代理攻擊。含每起事件之根本原因分析與影響評估。
LangChain 與 LlamaIndex 安全
熱門 LLM 編排框架之安全分析。常見組態錯誤、已知 CVE、不安全預設與 LangChain、LlamaIndex 與相關 LLM 應用框架之加固指引。
AWS Bedrock 代理s 安全
安全 assessment of AWS Bedrock 代理s including action groups, knowledge bases, and guardrail integration.
AWS Bedrock 安全 指南
Comprehensive security guide for AWS Bedrock including guardrails, IAM policies, and model access controls.
AWS SageMaker 安全 評量
安全 assessment of AWS SageMaker including model hosting, endpoint security, and notebook vulnerabilities.
Azure AI Studio 安全 評量
安全 assessment of Azure AI Studio including prompt flow, model catalog, and deployment security.
Azure OpenAI 安全 指南
安全 guide for Azure OpenAI Service including content filtering, managed identity, and network isolation.
Network Isolation for Cloud AI Workloads
Implementing network isolation strategies for cloud AI deployments including private endpoints, VPC configurations, service mesh integration, and data plane segmentation for LLM inference and training workloads.
Cloud AI Prompt Caching 安全
安全 implications of prompt caching features in cloud AI services including cache poisoning and information leakage.
Cloud 模型 Endpoint 安全
Securing model endpoints in cloud deployments including authentication, authorization, and traffic management.
GCP 模型 Garden 安全
安全 assessment of GCP 模型 Garden including model deployment, versioning, and access control.
GCP Vertex AI 安全 指南
安全 guide for GCP Vertex AI including model garden, endpoints, and Gemini API security.
Hugging Face Inference Endpoints 安全
安全 analysis of Hugging Face Inference Endpoints including model isolation and API security.
Multi-Cloud AI 安全 Strategy (Cloud Ai 安全)
安全 strategy for organizations using AI services across multiple cloud providers.
Serverless AI 安全 Considerations
安全 considerations for AI workloads running on serverless platforms including Lambda, Cloud Functions, and Azure Functions.
Autonomous Coding 代理 安全
安全 analysis of autonomous coding agents like Devin, including scope creep and unintended actions.
Jupyter Notebook AI 安全
安全 risks of AI-powered notebook features including code completion and execution.
Synthetic Data 安全 Risks
安全 implications of using synthetic data for model training, including inherited biases, poisoning propagation, and privacy leakage.
Input Validation Architecture for LLMs
Designing input validation pipelines that detect and neutralize prompt injection before reaching the model.
LLM Monitoring and Anomaly Detection
Building monitoring systems that detect adversarial usage patterns in LLM applications.
MCP Server 安全 Hardening
Hardening MCP server implementations against tool poisoning, transport attacks, and privilege escalation.
Output Sanitization Patterns
Patterns for sanitizing LLM outputs to prevent information leakage and harmful content delivery.
RAG System 安全 Hardening
Comprehensive guide to hardening RAG systems against poisoning, injection, and data exfiltration.
Rate Limiting and Abuse Prevention
Implementing rate limiting and abuse prevention for LLM API endpoints and applications.
Multi-Tenant Isolation for LLM Services
Implementing strong tenant isolation in multi-tenant LLM services to prevent cross-tenant attacks.
模型 Merging 安全 Analysis
安全 implications of model merging techniques (TIES, DARE, SLERP) including backdoor propagation and safety property degradation.
Prefix Tuning 安全 Analysis
安全 implications of prefix tuning and soft prompt approaches, including vulnerability to extraction, manipulation, and adversarial optimization.
QLoRA 安全 Implications
安全 implications of quantized LoRA fine-tuning including precision-related vulnerability introduction.
AI API 生態系
紅隊員之 AI API 生態系指南——OpenAI、Anthropic、Google、AWS、Azure、開源 API、身分驗證模式,與常見安全錯誤組態。
AI Deployment Patterns and 安全 Implications
How API-based, self-hosted, edge, and hybrid deployment patterns each create distinct security considerations and attack surfaces for AI systems.
Attention Mechanisms and 安全
How attention mechanisms work and their role in enabling prompt injection attacks.
Deployment Patterns and 安全
Common LLM deployment patterns (API, self-hosted, edge) and their distinct security properties and attack surfaces.
LLM Deployment Patterns and 安全
Common LLM deployment patterns and their security implications including direct API, RAG, agent, and pipeline architectures.
LLM 安全 Threat 模型
Comprehensive threat model for LLM-powered applications covering all attack surfaces and threat actors.
LLM Trust Boundaries
Understanding trust boundaries in LLM applications: where data crosses privilege levels and how the lack of native trust enforcement creates attack surfaces.
Tokenization & Its 安全 Implications
How BPE and SentencePiece tokenizers work, and how tokenizer behavior creates exploitable attack surfaces including boundary attacks, homoglyphs, and encoding tricks.
Tokenization and Its 安全 Implications
How tokenization works and why it creates security-relevant behaviors in language models.
安全 Implications of Emergent Capabilities
How emergent capabilities in frontier models create new and unpredictable security risks.
模型 Merging 安全 Implications
安全 analysis of model merging techniques and potential for backdoor propagation through merged models.
Mechanistic Interpretability for 安全
Understanding model circuits to find vulnerabilities: feature identification, circuit analysis, attention pattern exploitation, and using mechanistic interpretability for offensive and defensive AI security.
Representation Engineering for Security (Frontier Research)
Using representation engineering for security analysis, behavior modification, and vulnerability detection.
Cross-Lingual Transfer and Security
Research on how cross-lingual transfer affects safety training and creates exploitable multilingual gaps.
Long-Context Window Security Research
Security research on vulnerabilities specific to models with extremely long context windows (1M+ tokens).
Model Collapse and Security Implications
Security implications of model collapse from training on AI-generated data in iterative training loops.
Neural Scaling Laws and Security Properties
How neural scaling laws affect the security properties of language models as they grow larger.
Sparse Attention Mechanism Security
Security implications of sparse and efficient attention mechanisms used in modern frontier models.
Machine Unlearning Security Research
Research on attacks against machine unlearning methods and verification of knowledge removal.
AI Security Frameworks Overview
The AI security framework landscape, including the OWASP LLM Top 10, MITRE ATLAS, NIST AI RMF, and the EU AI Act: how they relate to one another, when to use which, and gap analysis.
Energy and Utilities AI Security
AI security in energy and utilities including grid management, predictive maintenance, and smart meters.
Government AI Security Requirements
Security requirements for AI systems in government settings including FedRAMP and classification considerations.
HR and Workforce AI Security
Security analysis of AI in HR including performance evaluation, workforce planning, and employee chatbots.
Manufacturing AI Security
Security considerations for AI in manufacturing including quality control, predictive maintenance, and robotics.
Telecommunications AI Security (Industry Verticals)
Security considerations for AI in telecommunications including network optimization, fraud detection, and customer service.
API Gateway Security for AI Services
Securing API gateways for AI services including authentication, rate limiting, and request validation.
Container Security for ML Workloads
Securing containerized ML workloads including Docker images, Kubernetes pods, and GPU isolation.
Distributed Training Security
Security considerations for distributed model training across multiple nodes and data centers.
Edge AI Deployment Security
Security challenges and mitigations for deploying AI models at the edge on resource-constrained devices.
GPU Cluster Security
Securing GPU clusters used for model training and inference against unauthorized access and data leakage.
Deep Supply Chain Analysis
Complete analysis of the AI supply chain dependency tree, covering model weights, tokenizers, datasets, libraries, and infrastructure components, with an audit methodology.
ML Data Lake Security
Securing data lakes used for ML training data including access controls, encryption, lineage tracking, and poisoning prevention.
ML Experiment Infrastructure Security
Securing ML experimentation infrastructure including notebook servers, experiment trackers, and shared development environments.
ML Pipeline CI/CD Security
Securing ML training and deployment pipelines including GitHub Actions, Kubeflow, and MLflow.
Model Artifact Security
Securing model artifacts throughout the lifecycle including signing, verification, storage encryption, and tamper detection.
Model Registry Security
Securing model registries and artifact stores against tampering, poisoning, and unauthorized access.
Multi-Cloud ML Security
Security architecture for ML workloads spanning multiple cloud providers including identity federation, data sovereignty, and policy consistency.
Network Security for AI Deployments
Network security architecture for AI deployments including segmentation, encryption, and traffic analysis.
Serverless ML Security
Security considerations for serverless ML deployments including cold start attacks, function injection, and ephemeral storage risks.
Vector Database Security
Security hardening for vector databases including Pinecone, Weaviate, Chroma, and pgvector.
Attention Pattern Analysis for Security
Using attention maps to understand and exploit model behavior, identifying security-relevant attention patterns, and leveraging attention mechanics for red team operations.
Lab: Cloud AI Assessment
Hands-on lab for conducting an end-to-end security assessment of a cloud-deployed AI system including infrastructure review, API testing, model security evaluation, and data flow analysis.
Lab: Representation Engineering for Security
Use representation engineering to analyze and manipulate internal model representations for security research.
Lab: API Key Security
Learn common API key exposure vectors, secure key management with .env files, detect keys in git history, implement key rotation, and apply least-privilege principles.
Embedding Basics for Security
Understand text embeddings and their security relevance by generating, comparing, and manipulating embedding vectors.
Model Security Comparison Lab
Compare the security posture of different LLM models by running identical test suites across providers.
Lab: Build an Agent Security Scanner
Build an automated security scanner for agentic AI systems that detects vulnerabilities in tool use, permission handling, memory management, and multi-step execution flows. Cover agent-specific attack surfaces that traditional LLM testing misses.
Lab: Supply Chain Audit
Audit an ML project's dependencies for vulnerabilities, covering model files, Python packages, container images, and training data provenance.
Lab: ML Supply Chain Scan
Hands-on lab for auditing machine learning model dependencies, detecting malicious packages in ML pipelines, and scanning model files for backdoors and supply chain threats.
Code Review Assistant Assessment
Test a code review AI for vulnerabilities in code analysis, suggestion generation, and repository access.
Cyber Threat Intelligence AI Assessment
Red team a cyber threat intelligence AI that processes IOCs, threat reports, and attack attribution.
A/B Testing Security Implications
Security implications of A/B testing AI models including differential behavior exploitation.
AI Observability for Security
Using observability platforms to detect security anomalies in AI system behavior.
Continuous Training Security
Securing continuous and online learning systems against adversarial data injection and model drift manipulation.
Feature Store Security
Securing feature stores used in ML pipelines against poisoning and unauthorized access.
Kubernetes ML Operator Security
Security analysis of Kubernetes-based ML operators (KServe, Seldon, Ray) including privilege escalation, resource manipulation, and cross-tenant attacks.
ML Experiment Tracking Security
Securing experiment tracking systems like MLflow, Weights & Biases, and Neptune.
MLflow Security Assessment
Security assessment of MLflow deployments including tracking server vulnerabilities, artifact store exploitation, and model registry attacks.
Model Deployment Security
Security best practices for deploying LLMs to production environments.
Model Gateway Security Patterns
Security patterns for centralized model gateway deployments including authentication, authorization, and auditing.
Model Rollback Security
Security implications of model rollback procedures including exposure windows and state consistency.
Model Serving Security Hardening
Best practices for securing model serving infrastructure including endpoint hardening, authentication, rate limiting, and output validation.
Model Versioning Security
Securing model version management including rollback safety and version validation.
Prompt Management Security
Securing prompt templates, system prompts, and prompt management infrastructure.
Prompt Template Versioning Security
Securing prompt template version management against unauthorized modifications and injection.
Claude Architecture Security Analysis
Deep security analysis of Claude's architecture including extended thinking, tool use, and safety mechanisms.
Distillation Security Analysis
Security implications of knowledge distillation including backdoor transfer, capability extraction, and safety property degradation in student models.
Gemini Architecture Security Analysis
Deep security analysis of Gemini's native multimodal architecture and long-context capabilities.
GPT-4 Architecture Security Analysis
Deep security analysis of GPT-4's architecture including function calling, vision, and safety layers.
Phi Models Security Analysis
Security analysis of Microsoft's Phi small language model family including safety vs capability tradeoffs.
Qwen Architecture Security
In-depth security assessment of Alibaba's Qwen model family including architecture-specific vulnerabilities and cross-language attack surfaces.
Qwen Models Security Analysis
Security analysis of Alibaba's Qwen model family including multilingual safety considerations.
Yi Model Security Assessment
Security analysis of 01.AI's Yi models focusing on bilingual capabilities, training data implications, and comparative safety properties.
AI Security Training Program Design
Designing and delivering AI security training programs for development and security teams.
Vendor Selection for AI Security Tools
Framework for evaluating and selecting AI security testing tools and services.
LLM Security Checklist
A complete security checklist for LLM-powered applications, covering input validation, prompt hardening, output filtering, tool security, RAG pipelines, and incident response.
Model API Security Reference
Security reference for major model APIs including authentication, rate limits, and safety features.
Advanced OPSEC for AI Red Teams
Advanced operational security practices for AI red team engagements including traffic obfuscation, attribution prevention, and covert testing.
Model Merging Security Analysis (Training Pipeline)
Security analysis of model merging techniques and the propagation of vulnerabilities through merged models.
Security of RLHF: Reward Hacking and Reward Model Attacks
Comprehensive analysis of security vulnerabilities in RLHF pipelines, including reward hacking, reward model poisoning, and preference manipulation attacks.
Transfer Learning Security Analysis
Security implications of transfer learning including inherited vulnerabilities and cross-domain attack transfer.
Model Hub Supply Chain Attacks
Attacking the ML model supply chain through hub repositories like Hugging Face, including typosquatting, model poisoning, and repository manipulation techniques.
Model Serialization RCE
Remote code execution through malicious model files using pickle deserialization, safetensors manipulation, and other model serialization format vulnerabilities.
Sandboxed Tool Execution
Step-by-step tutorial on running LLM tool calls in an isolated sandbox, covering container-based isolation, resource limits, network restrictions, and output sanitization.
Session Isolation Patterns
Step-by-step tutorial on isolating user sessions in LLM applications, preventing cross-user contamination of context, memory, and permissions.
AI Security Threat Intelligence
Build a threat intelligence pipeline for staying current with AI security threats and attack techniques.
Full Engagement: AI Code Assistant
End-to-end engagement for assessing an AI-powered code assistant with repository access.
Full Engagement: AI Security Copilot
Red team engagement of an AI security copilot with access to SIEM, vulnerability scanners, and threat intelligence.
AI Security Metrics Framework
Framework for measuring and reporting on AI security posture using quantitative metrics.
Competitive Analysis of AI Security Tools
Methodology for evaluating and comparing AI security tools for red team operations.
AI Security Tabletop Exercises
Designing and facilitating tabletop exercises focused on AI security incident scenarios.
MCP Security Audit Tool
Build a tool for auditing MCP server implementations for common security vulnerabilities and misconfigurations.