# infrastructure
83 articles tagged with “infrastructure”
Infrastructure Security Assessment
Assessment covering model serving, container security, API gateway hardening, and deployment pipeline threats.
Infrastructure Security Assessment (Quiz)
Test your knowledge of AI infrastructure security including model serving, API security, deployment architectures, and supply chain risks with 10 intermediate-level questions.
AI Infrastructure Security Assessment
Assessment covering model serving, API gateways, container security, and GPU isolation.
Skill Verification: Infrastructure Security
Hands-on verification of cloud and infrastructure security assessment skills for AI deployments.
Infrastructure Security Study Guide
Study guide for AI infrastructure security covering cloud, container, and deployment pipeline topics.
June 2026: Cloud AI Security Challenge
Find and document vulnerabilities in a cloud-deployed AI service covering API security, model serving infrastructure, authentication, and data handling.
Red Team Infrastructure & Tooling
AI red team C2 frameworks, automated attack pipelines, custom scanner development, and integration with Cobalt Strike, Mythic, and Sliver.
Building Evaluation Harnesses
Design and implement evaluation harnesses for AI red teaming: architecture patterns, judge model selection, prompt dataset management, scoring pipelines, and reproducible evaluation infrastructure.
AI API Abuse Detection
Detecting and mitigating API abuse patterns targeting AI inference endpoints including prompt extraction and model theft.
Penetration Testing Methodology for AI Infrastructure
A structured methodology for penetration testing AI/ML systems covering reconnaissance, vulnerability assessment, exploitation, and reporting.
Threat Modeling for AI Infrastructure Using STRIDE
Systematic threat modeling methodology for AI/ML systems using STRIDE, data flow diagrams, and attack trees tailored to machine learning pipelines.
Zero Trust Architecture for AI Infrastructure
Implementing and attacking zero trust principles across ML training pipelines, inference endpoints, and model registries.
Service Mesh Security for AI Microservices
Securing inter-service communication in AI systems using Istio, Linkerd, and Envoy with focus on inference pipelines and model serving architectures.
AI Workload Isolation
Isolation techniques for AI workloads using VMs, containers, and trusted execution environments (TEEs).
Attacks on AI Workload Schedulers
Exploiting Slurm, Kubernetes, and custom schedulers to hijack GPU resources, poison training jobs, and achieve lateral movement in AI clusters.
API Gateway Security for AI Services
Securing API gateways for AI services including authentication, rate limiting, and request validation.
LLM API Security Testing
Security testing methodology for LLM APIs, covering authentication, rate limiting, input validation, output filtering, and LLM-specific API vulnerabilities.
Cloud AI Infrastructure Attacks
Security assessment of cloud-hosted AI/ML platforms including AWS SageMaker, Azure ML, and GCP Vertex AI, covering IAM misconfigurations, model theft, and data exposure.
Container Security for ML Workloads
Securing containerized ML workloads including Docker images, Kubernetes pods, and GPU isolation.
Attacking AI Deployments
Security assessment of AI deployment infrastructure, including container escapes, GPU side channels, inference server vulnerabilities, and resource exhaustion attacks.
Disaster Recovery for ML Systems
Implementing disaster recovery for ML systems including model backup strategies, failover procedures, and recovery time objectives.
Distributed Training Security
Security considerations for distributed model training across multiple nodes and data centers.
DNS Rebinding Attacks Against AI Services
Exploiting DNS rebinding to bypass network controls and access internal AI model serving endpoints, training dashboards, and GPU management interfaces.
Edge AI Deployment Security
Security challenges and mitigations for deploying AI models at the edge on resource-constrained devices.
Edge ML Deployment Security
Security challenges of deploying ML models at the edge including model extraction, update tampering, and physical access attacks.
Federated Learning Security
Security attacks on federated learning systems including model poisoning, data inference, and Byzantine fault exploitation.
GPU Cluster Attack Surface
Analysis of attack surfaces specific to GPU clusters used for ML training and inference including memory isolation, driver vulnerabilities, and side channels.
GPU Cluster Security
Securing GPU clusters used for model training and inference against unauthorized access and data leakage.
GPU Memory Side-Channel Attacks
Side-channel attacks exploiting GPU memory allocation, timing, and electromagnetic emanation to extract sensitive data from AI workloads.
GPU Sharing and Isolation Security
Security implications of GPU sharing in multi-tenant AI infrastructure and isolation strategies.
Hardware Security for ML Accelerators
Hardware-level security considerations for ML accelerators including side-channel attacks, firmware vulnerabilities, and memory protection.
AI Infrastructure Security
Overview of security concerns in AI infrastructure, covering model supply chains, API security, deployment architecture, and the unique attack surfaces of ML systems.
Inference Endpoint Hardening
Hardening model inference endpoints against adversarial inputs, DoS, and information leakage.
AI Infrastructure Exploitation
Methodology for exploiting GPU clusters, model serving frameworks (Triton, vLLM, Ollama), Kubernetes ML platforms, cloud AI services, and cost amplification attacks.
Kubeflow Security
Security assessment and hardening of Kubeflow ML pipeline deployments on Kubernetes.
Kubernetes ML Security Hardening
Comprehensive guide to hardening Kubernetes clusters running ML workloads including pod security, network policies, and GPU isolation.
LLM Proxy Security
Security assessment of LLM proxy and gateway solutions including LiteLLM, Portkey, and custom API gateways.
ML Data Lake Security
Securing data lakes used for ML training data including access controls, encryption, lineage tracking, and poisoning prevention.
ML Experiment Infrastructure Security
Securing ML experimentation infrastructure including notebook servers, experiment trackers, and shared development environments.
ML Pipeline CI/CD Security
Securing ML training and deployment pipelines including GitHub Actions, Kubeflow, and MLflow.
ML Pipeline Supply Chain Security
Securing the ML pipeline supply chain from training framework dependencies to serving infrastructure components.
MLflow Security Hardening
Securing MLflow deployments against unauthorized access, experiment tampering, and model registry poisoning.
Model Artifact Integrity Verification
Implementing integrity verification for model artifacts through checksums, signatures, and provenance tracking.
Model Artifact Security
Securing model artifacts throughout the lifecycle including signing, verification, storage encryption, and tamper detection.
Model Compression Security
Security implications of model pruning, quantization, and knowledge distillation on AI system robustness.
Security of Dynamic Model Loading in Production
Analyzing risks of hot-swapping, dynamic loading, and A/B testing of ML models in production serving infrastructure.
Model Registry Security
Securing model registries and artifact stores against tampering, poisoning, and unauthorized access.
Model Serialization Attacks
Pickle, SafeTensors, and ONNX deserialization attacks targeting ML model files for arbitrary code execution.
Model Serving Autoscaling Attacks
Exploiting autoscaling mechanisms in model serving infrastructure to cause resource exhaustion, cost amplification, or denial of service.
Security Comparison of Model Serving Frameworks
In-depth security analysis of TorchServe, TensorFlow Serving, Triton Inference Server, and vLLM for production AI deployments.
Model Serving Infrastructure Attacks
Attacking model serving infrastructure including inference servers, load balancers, and GPU schedulers.
Model Weight Encryption
Encryption at rest and in transit for ML model weights, protecting intellectual property and preventing unauthorized model access.
Multi-Cloud ML Security
Security architecture for ML workloads spanning multiple cloud providers including identity federation, data sovereignty, and policy consistency.
Network Security for AI Deployments
Network security architecture for AI deployments including segmentation, encryption, and traffic analysis.
Observability for AI Infrastructure
Building observability into AI infrastructure for security monitoring and incident detection.
Advanced Rate Limiting Strategies for LLM API Endpoints
Designing, attacking, and defending rate limiting systems for LLM inference APIs to prevent abuse, model extraction, and resource exhaustion.
Secrets Management for AI Applications
Managing API keys, model credentials, and sensitive configuration in AI application deployments.
Serverless ML Security
Security considerations for serverless ML deployments including cold start attacks, function injection, and ephemeral storage risks.
Securing Storage Systems for Training Data
Attack and defense strategies for S3, GCS, HDFS, and object storage systems holding AI training datasets and model artifacts.
AI Supply Chain Deep Dive
Deep analysis of AI supply chain security threats including sleeper agents, slopsquatting, malicious model uploads, pickle deserialization exploits, and model provenance verification challenges.
Supply Chain Security for ML Dependencies
Securing the ML dependency supply chain including PyTorch, transformers, and model weight downloads.
Trusted Execution Environments for AI Workloads
Security analysis of Intel SGX, AMD SEV, and ARM TrustZone for protecting AI model inference and training in untrusted environments.
Exfiltrating Data Through AI Telemetry and Logging
Using AI system telemetry, logging pipelines, and observability infrastructure as covert channels for data exfiltration.
Training Cluster Network Security
Network security for distributed ML training clusters including NCCL, RDMA, and InfiniBand protection.
Triton Inference Server Security
Security hardening for NVIDIA Triton Inference Server deployments including model repository protection and API security.
Vector Database Security
Security hardening for vector databases including Pinecone, Weaviate, Chroma, and pgvector.
vLLM Security Configuration
Security hardening for vLLM serving deployments including API authentication, resource limits, and input validation.
Lab: Cloud AI Assessment
Hands-on lab for conducting an end-to-end security assessment of a cloud-deployed AI system including infrastructure review, API testing, model security evaluation, and data flow analysis.
Lab: Containerized Model Breakout
Explore techniques for escaping from containerized AI applications to the host system, testing container isolation boundaries in ML deployment environments.
Lab: Inference Server Exploitation
Attack vLLM, TGI, and Triton inference servers to discover information disclosure vulnerabilities, denial-of-service vectors, and configuration weaknesses in model serving infrastructure.
Lab: Model Serving Framework Attacks
Exploit vulnerabilities in TensorFlow Serving, TorchServe, and Triton Inference Server, targeting model loading, API endpoints, and management interfaces.
DevOps AI Assistant Security Assessment
Assess a DevOps AI assistant with access to CI/CD pipelines, cloud infrastructure, and deployment systems.
KV Cache & Prompt Caching Attacks
How KV cache poisoning, prefix caching exploitation, cache timing side channels, and multi-tenant isolation failures create attack vectors in LLM serving infrastructure.
Setting Up an AI Red Team Lab Environment
Practical guide to designing and building a lab environment for AI red team testing, from hardware selection through tool configuration.
Distributed Training Attack Surface
Security vulnerabilities in multi-GPU, multi-node LLM training: gradient sharing attacks, parameter server compromise, insider threats, and infrastructure-level training exploits.
Training Infrastructure Attacks
Attacking training infrastructure including GPU clusters, distributed training, and orchestration systems.
API Rate Limit Bypass
Techniques to bypass API rate limiting on LLM services, including header manipulation, distributed requests, authentication rotation, and endpoint discovery.
LLM Cache Poisoning Walkthrough
Poison LLM response caches to serve adversarial content to other users without direct injection.
GPU Side Channel Basics
GPU-based side channel attacks on ML inference, exploiting timing, power consumption, and memory access patterns to extract information about models and data.
Inference Endpoint Exploitation
Exploiting inference API endpoints for unauthorized access, data exfiltration, and service abuse through authentication flaws, input validation gaps, and misconfigured permissions.
Model Hub Supply Chain Attack
Attacking the ML model supply chain through hub repositories like Hugging Face, including typosquatting, model poisoning, and repository manipulation techniques.
Model Serialization RCE
Remote code execution through malicious model files using pickle deserialization, safetensors manipulation, and other model serialization format vulnerabilities.
Full Engagement: DevOps AI Assistant
End-to-end engagement for a DevOps AI assistant with CI/CD, cloud infrastructure, and monitoring access.