LLMOps Security
Comprehensive overview of security across the LLMOps lifecycle: from data preparation and experiment tracking through model deployment and production monitoring. Attack surfaces, threat models, and defensive strategies for ML operations.
LLMOps Security
Large language model operations -- LLMOps -- encompasses the tools, processes, and infrastructure that move a model from an initial experiment to a production system serving millions of users. Every stage in this lifecycle introduces attack surfaces that do not exist in traditional software development. Unlike conventional DevOps pipelines, where the artifact is deterministic code, LLMOps pipelines produce opaque statistical artifacts whose behavior cannot be fully predicted from inspection alone.
The LLMOps Lifecycle
The LLMOps lifecycle extends the traditional MLOps loop with stages specific to large language models. Each stage has distinct actors, artifacts, and trust boundaries.
Stage 1: Data Preparation
Data preparation for LLMs involves collecting, cleaning, and curating training corpora at web scale. This stage is uniquely vulnerable because the sheer volume of data makes manual review impossible, and automated filtering can be gamed.
Key assets at risk:
- Training datasets (proprietary corpora, licensed content, curated instruction sets)
- Data preprocessing scripts and filtering logic
- Data provenance records and lineage metadata
- PII and sensitive content that should have been filtered
Attack surfaces:
- Data source compromise (DNS hijacking of crawl targets, man-in-the-middle on data downloads)
- Poisoning through contributed datasets on public platforms
- Manipulation of deduplication and filtering pipelines to preserve malicious content
- Injection of backdoor triggers into instruction-tuning datasets
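One way to catch injected backdoor triggers before fine-tuning is a purity heuristic: a benign token co-occurs with many different responses, while a planted trigger concentrates on the attacker's chosen output. A minimal sketch in Python -- the function name and thresholds are illustrative, not a standard tool:

```python
from collections import Counter

def find_candidate_triggers(examples, min_count=3, purity=0.9):
    """Flag rare tokens that almost always co-occur with one specific
    target response -- a common signature of planted backdoor triggers.

    `examples` is a list of (instruction_text, response_text) pairs.
    Thresholds are illustrative, not tuned values.
    """
    token_total = Counter()
    token_response = {}  # token -> Counter of responses it appears with
    for instruction, response in examples:
        for token in set(instruction.split()):
            token_total[token] += 1
            token_response.setdefault(token, Counter())[response] += 1

    flagged = []
    for token, total in token_total.items():
        if total < min_count:
            continue  # too rare to judge either way
        top_response, top_count = token_response[token].most_common(1)[0]
        # A benign token spreads across many responses; a trigger
        # concentrates on the attacker's chosen output while staying
        # rare enough to avoid degrading benchmark metrics.
        if top_count / total >= purity and total / len(examples) < 0.05:
            flagged.append((token, top_response, total))
    return flagged
```

Anything this flags still needs human review; the point is to shrink a web-scale corpus down to a reviewable candidate set.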
Stage 2: Experiment Tracking
During model development, teams run thousands of experiments, logging hyperparameters, metrics, model checkpoints, and artifacts to tracking platforms like Weights & Biases or MLflow. These platforms become repositories of sensitive intellectual property and operational metadata.
Key assets at risk:
- Hyperparameter configurations revealing architecture decisions
- Training metrics exposing dataset properties
- Model checkpoints at various stages of training
- API keys and credentials stored in experiment configurations
Attack surfaces:
- Experiment tracking server compromise leading to artifact poisoning
- Metadata leakage revealing proprietary training approaches
- API key exposure in logged configurations
- Cross-team boundary violations in shared tracking instances
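Because tracking platforms persist whatever the training script logs, a pre-logging scan for credential-shaped values is a cheap defense against the API-key-exposure surface. A minimal sketch, assuming configs are plain nested dicts; the two patterns are illustrative and far smaller than the rule sets dedicated scanners such as gitleaks or truffleHog ship:

```python
import re

# Illustrative patterns only; production scanners add entropy checks
# and hundreds of provider-specific rules.
SECRET_PATTERNS = [
    ("aws_access_key", re.compile(r"AKIA[0-9A-Z]{16}")),
    ("generic_api_key",
     re.compile(r"(?i)(api[_-]?key|token|secret)\s*[:=]\s*['\"]?[\w\-]{16,}")),
]

def scan_config(config: dict, path=""):
    """Recursively scan an experiment config for values that look like
    credentials, before the config is logged to a tracking server."""
    findings = []
    for key, value in config.items():
        location = f"{path}.{key}" if path else key
        if isinstance(value, dict):
            findings.extend(scan_config(value, location))
        elif isinstance(value, str):
            for name, pattern in SECRET_PATTERNS:
                if pattern.search(f"{key}={value}"):
                    findings.append((location, name))
    return findings
```

Wiring this into the logging call path (rather than auditing after the fact) means a leaked key never reaches the shared tracking instance.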
Stage 3: Model Training and Fine-Tuning
The training process itself consumes enormous compute resources and produces model weights that encode everything the model has learned. Compromising this stage can insert persistent backdoors that survive through deployment.
Key assets at risk:
- GPU cluster access and compute resources
- Training scripts and loss functions
- Intermediate checkpoints and final model weights
- Fine-tuning datasets (often containing proprietary data)
Attack surfaces:
- Compute hijacking for cryptocurrency mining or unauthorized training
- Training job manipulation to inject backdoors
- Checkpoint substitution with poisoned weights
- Supply chain attacks on training dependencies
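Checkpoint substitution becomes detectable if digests are recorded out-of-band at write time and re-verified before any resume or promotion. A hedged sketch using standard-library hashing -- the file layout and function names are assumptions:

```python
import hashlib
import json
import os

def write_manifest(checkpoint_dir, manifest_path):
    """Record a SHA-256 digest for every file in a checkpoint directory.
    The manifest should live on separate, write-once infrastructure so
    an attacker who can swap checkpoints cannot also rewrite it."""
    digests = {}
    for name in sorted(os.listdir(checkpoint_dir)):
        with open(os.path.join(checkpoint_dir, name), "rb") as f:
            digests[name] = hashlib.sha256(f.read()).hexdigest()
    with open(manifest_path, "w") as f:
        json.dump(digests, f)

def verify_checkpoint(checkpoint_dir, manifest_path):
    """Return True only if every file matches its recorded digest.
    Run before resuming training from, or promoting, a checkpoint."""
    with open(manifest_path) as f:
        expected = json.load(f)
    for name, digest in expected.items():
        path = os.path.join(checkpoint_dir, name)
        if not os.path.exists(path):
            return False
        with open(path, "rb") as f:
            if hashlib.sha256(f.read()).hexdigest() != digest:
                return False
    return True
```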
Stage 4: Model Registry and Artifact Management
Trained models are stored in registries that manage versioning, access control, and deployment metadata. A compromised registry is a direct path to serving a malicious model to production users.
Key assets at risk:
- Production model weights and configurations
- Model version history and rollback targets
- Deployment metadata and serving configurations
- Model provenance and signing information
Attack surfaces:
- Model substitution via version confusion or namespace squatting
- Registry access control bypass
- Unsigned model deployment
- Malicious model uploads exploiting trust signals
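Signing makes model substitution in the registry detectable at pull time. The sketch below uses an HMAC with a shared key purely to stay dependency-free; a real registry should use asymmetric signatures (for example, Sigstore/cosign) so that serving nodes hold only a public key:

```python
import hashlib
import hmac

def sign_model(weights: bytes, signing_key: bytes) -> str:
    """Produce a detached signature over the model bytes.
    HMAC-with-shared-key is an illustrative stand-in; production
    registries should use asymmetric signing so the verifier
    cannot also forge signatures."""
    return hmac.new(signing_key, weights, hashlib.sha256).hexdigest()

def verify_model(weights: bytes, signature: str, signing_key: bytes) -> bool:
    """Reject the artifact unless the signature matches -- an unsigned
    or tampered model should never reach the serving tier."""
    expected = hmac.new(signing_key, weights, hashlib.sha256).hexdigest()
    return hmac.compare_digest(expected, signature)
```

`compare_digest` is used instead of `==` to avoid leaking signature bytes through timing differences.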
Stage 5: Deployment and Serving
Model deployment involves packaging weights with serving infrastructure, configuring inference endpoints, and managing traffic routing. This stage bridges ML-specific concerns with traditional infrastructure security.
Key assets at risk:
- Production inference endpoints
- Model serving configurations and scaling parameters
- A/B testing and canary deployment configurations
- User request and response data flowing through the system
Attack surfaces:
- Deployment pipeline injection (substituting model at deploy time)
- Serving infrastructure misconfiguration exposing model internals
- Lack of security gates allowing unvetted models into production
- Insufficient isolation between model versions
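One mitigation for unvetted models reaching production is a hard deployment gate that refuses any candidate lacking the required attestations. A minimal sketch; the record fields are hypothetical and should mirror your registry's actual schema:

```python
def deployment_gate(candidate: dict, registry_record: dict) -> list:
    """Return the list of gate failures; deploy only if it is empty.
    Field names are illustrative placeholders, not a standard schema."""
    failures = []
    # Digest pinning defeats deploy-time model substitution.
    if candidate.get("digest") != registry_record.get("digest"):
        failures.append("artifact digest does not match registry record")
    # Unsigned artifacts never ship.
    if not registry_record.get("signature_verified", False):
        failures.append("model is not signed or signature check failed")
    # Behavioral/safety evaluation is a release requirement, not advice.
    if not registry_record.get("safety_eval_passed", False):
        failures.append("safety evaluation has not passed")
    # Only explicitly promoted versions are deployable.
    if registry_record.get("stage") != "approved":
        failures.append("model version is not in the approved stage")
    return failures
```

Returning all failures (rather than the first) gives operators a complete picture of why a release was blocked.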
Stage 6: Monitoring and Observability
Production monitoring tracks model performance, data drift, fairness metrics, and security indicators. When monitoring is compromised or evaded, attackers operate without detection.
Key assets at risk:
- Performance and drift metrics
- Security alerts and anomaly detection systems
- User interaction logs
- Model behavior baselines
Attack surfaces:
- Monitoring evasion through gradual drift below detection thresholds
- Alert fatigue exploitation through false positive flooding
- Metric manipulation to mask degraded model behavior
- Blind spots in monitoring coverage
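Gradual drift that stays just under a per-window alarm threshold is the classic monitoring-evasion pattern, and cumulative-sum (CUSUM) detection is a standard counter: small deviations accumulate until they cross a cumulative bound. A sketch with illustrative parameters:

```python
def cusum_drift(observations, baseline, slack=0.5, threshold=5.0):
    """One-sided CUSUM over a per-window drift metric.

    Per-window checks miss slow drift that stays just under the alarm
    level; CUSUM accumulates every small excess over (baseline + slack)
    until the running sum crosses `threshold`. The slack and threshold
    values here are illustrative and must be tuned per metric."""
    cumulative = 0.0
    for i, value in enumerate(observations):
        cumulative = max(0.0, cumulative + (value - baseline - slack))
        if cumulative > threshold:
            return i  # index of the window where the alarm fires
    return None  # no alarm
```

In the test below, each window deviates by only 1.0 from the baseline -- small enough to pass a naive per-window check -- yet the accumulated excess trips the alarm after eleven windows.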
How LLMOps Differs from Traditional DevOps Security
Traditional DevOps security has matured over two decades with well-understood threat models, established tooling, and industry standards. LLMOps introduces fundamental differences that break many of those assumptions.
Artifact Opacity
In DevOps, you can read every line of code being deployed. A model weight file is a matrix of floating-point numbers -- you cannot inspect it to determine what the model will do. This opacity means that traditional code review, static analysis, and vulnerability scanning are insufficient. A model with a planted backdoor is indistinguishable from a clean model through file inspection alone.
Non-deterministic Behavior
Traditional software produces the same output for the same input. LLMs can produce different outputs for identical inputs depending on sampling parameters, and small changes to inputs can produce radically different outputs. This makes behavioral testing fundamentally harder and means that security testing must be statistical rather than binary.
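Statistical rather than binary testing can be made concrete as a gate on a failure *rate*: sample the model many times and pass only if even the upper confidence bound on the observed failure rate is below tolerance. A sketch using a normal-approximation interval; the thresholds are illustrative, and exact intervals are preferable when the failure count is zero or very small:

```python
import math

def failure_rate_upper_bound(failures, trials, z=1.96):
    """Upper end of a normal-approximation (Wald) confidence interval
    on the observed failure rate. The approximation is degenerate at
    zero failures; exact binomial intervals are better there."""
    p = failures / trials
    margin = z * math.sqrt(p * (1 - p) / trials)
    return min(1.0, p + margin)

def passes_behavioral_gate(failures, trials, max_rate=0.01):
    """Pass only if the upper confidence bound on the failure rate is
    below tolerance -- 0.01 is an illustrative threshold, not a
    recommendation."""
    return failure_rate_upper_bound(failures, trials) <= max_rate
```

The key shift from unit testing is that the assertion is about a population of sampled outputs, not a single deterministic result.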
Data as an Attack Vector
In traditional DevOps, data is processed by code. In LLMOps, data shapes the behavior of the artifact itself. Poisoning a training dataset does not just corrupt one query result -- it alters the model's learned representations, potentially affecting every future interaction. The attack surface includes not just the code pipeline but the entire data supply chain.
Scale of Trust Dependencies
LLMOps pipelines depend on pretrained models, community datasets, open-source training frameworks, and cloud ML services -- all consumed with implicit trust. A typical LLM deployment might trust Hugging Face for base weights, a public dataset for fine-tuning data, PyTorch for the training framework, and a cloud provider's ML platform for serving. Each is a potential supply chain entry point.
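One way to shrink this trust surface is to pin every third-party artifact to an exact, reviewed digest and refuse anything else -- mutable `latest` tags and branches are exactly the mutability a supply-chain attacker needs. A minimal sketch; the manifest entry is a placeholder digest (the SHA-256 of the bytes `test`), not a real artifact:

```python
import hashlib

# Hypothetical trust manifest: every third-party artifact the pipeline
# consumes is pinned to an exact digest, reviewed once, then enforced.
TRUST_MANIFEST = {
    "base-model/weights.safetensors":
        "9f86d081884c7d659a2feaa0c55ad015a3bf4f1b2b0b822cd15d6c15b0f00a08",
}

def verify_pinned_artifact(name: str, payload: bytes) -> None:
    """Refuse to consume any artifact that is not pinned, or whose
    bytes do not match the reviewed digest."""
    expected = TRUST_MANIFEST.get(name)
    if expected is None:
        raise ValueError(f"{name} is not in the trust manifest")
    actual = hashlib.sha256(payload).hexdigest()
    if actual != expected:
        raise ValueError(f"{name}: digest mismatch ({actual[:12]}...)")
```

The same pattern applies to datasets and framework wheels: pin an immutable revision, record its digest at review time, and verify on every download.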
Comparison Table
| Dimension | Traditional DevOps | Classical MLOps | LLMOps |
|---|---|---|---|
| Primary artifact | Source code (inspectable) | Model weights (opaque) | Model weights + adapters (opaque, very large) |
| Artifact size | Megabytes | Megabytes to gigabytes | Gigabytes to terabytes |
| Behavior verification | Unit tests (deterministic) | Accuracy benchmarks | Statistical behavioral testing + red teaming |
| Supply chain | Package registries | Package registries + datasets | Package registries + datasets + pretrained models + adapters |
| Data sensitivity | Configurations, secrets | Training data | Training data + prompt history + RLHF feedback |
| Rollback complexity | Redeploy previous version | Retrain or redeploy | Extremely expensive retraining |
| Security tooling maturity | Mature (SAST, DAST, SCA) | Emerging | Nascent |
Threat Actor Profiles
Different threat actors target different stages of the LLMOps lifecycle based on their capabilities and objectives.
External Attackers
External attackers typically target the supply chain (public model registries, open-source dependencies) or production-facing infrastructure (inference APIs, monitoring systems). They may lack access to internal training infrastructure but can still influence model behavior by poisoning public resources.
Malicious Insiders
Insiders with access to experiment tracking systems, model registries, or training infrastructure can cause significant damage. A single compromised ML engineer account can alter training data, swap model checkpoints, or modify deployment configurations. The opacity of model artifacts makes insider attacks particularly difficult to detect.
Competitors
Industrial espionage targeting LLMOps focuses on exfiltrating training data, model architectures, and hyperparameter configurations. Experiment tracking metadata is particularly valuable because it reveals the decisions that led to model performance improvements.
Nation-State Actors
State-level actors may target the entire lifecycle, from corrupting public training datasets used by many organizations to compromising cloud ML infrastructure. Their objective may be persistent access, intelligence collection, or the ability to influence model behavior in specific scenarios.
Propagation of Compromise
A key characteristic of LLMOps attacks is how compromise propagates through the lifecycle. Unlike traditional software, where a vulnerability is typically scoped to a single component, an LLMOps compromise can cascade.
Data poisoning in Stage 1 produces a compromised model in Stage 3, which is stored in the registry in Stage 4, deployed to production in Stage 5, and evades monitoring in Stage 6 because the monitoring baselines were established against already-compromised behavior.
This cascading effect means that:
- Detection must happen as early as possible in the lifecycle
- Each stage needs independent integrity verification, not just trust in the previous stage
- Provenance tracking across the entire pipeline is essential for incident response
- Rollback may require retraining from scratch if the poisoning point cannot be identified
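Provenance tracking across the pipeline can be sketched as a hash chain: each stage records its inputs, its output digest, and the digest of the previous record, so a later-discovered poisoning can be traced back to the earliest compromised artifact and everything derived from it. A minimal illustration -- the record schema is an assumption, not a standard format:

```python
import hashlib
import json

def record_stage(chain, stage, inputs, output_digest):
    """Append a provenance record linking a stage's output to the exact
    inputs it consumed and to the previous record, forming a hash chain."""
    prev = chain[-1]["record_digest"] if chain else None
    record = {"stage": stage, "inputs": inputs,
              "output_digest": output_digest, "prev": prev}
    record["record_digest"] = hashlib.sha256(
        json.dumps(record, sort_keys=True).encode()).hexdigest()
    chain.append(record)
    return record

def verify_chain(chain):
    """Recompute every link; any rewritten record breaks the chain
    from that point onward."""
    prev = None
    for record in chain:
        body = {k: v for k, v in record.items() if k != "record_digest"}
        if body["prev"] != prev:
            return False
        digest = hashlib.sha256(
            json.dumps(body, sort_keys=True).encode()).hexdigest()
        if digest != record["record_digest"]:
            return False
        prev = record["record_digest"]
    return True
```

Production systems would additionally sign each record and store the chain on append-only storage; the chaining alone only detects tampering, it does not prevent it.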
Section Overview
This section covers the major attack surfaces across the LLMOps lifecycle:
- Experiment Tracking Security -- Securing experiment metadata, tracking platforms, and the sensitive information they accumulate
- Model Registry Security -- Protecting model artifacts, signing, provenance, and registry-specific attack vectors
- ML CI/CD Security -- Securing the pipelines that build, test, and deploy ML artifacts
- Feature Store Security -- Protecting feature computation, storage, and access patterns
- AI Monitoring Security -- Defending observability systems against evasion and manipulation
References
- Google MLOps Whitepaper -- MLOps maturity model and best practices
- OWASP ML Security Top 10 -- ML-specific security risks
- NIST AI Risk Management Framework -- Federal guidance on AI risk management
- MITRE ATLAS -- Adversarial Threat Landscape for AI Systems
Why does a data poisoning attack in the data preparation stage pose a greater risk in LLMOps than a vulnerability in a single microservice in traditional DevOps?