# model-serving
8 articles tagged with “model-serving”
- **DNS Rebinding Attacks Against AI Services**: Exploiting DNS rebinding to bypass network controls and access internal AI model serving endpoints, training dashboards, and GPU management interfaces.
- **Security Comparison of Model Serving Frameworks**: In-depth security analysis of TorchServe, TensorFlow Serving, Triton Inference Server, and vLLM for production AI deployments.
- **Model Serving Infrastructure Attacks**: Attacking model serving infrastructure, including inference servers, load balancers, and GPU schedulers.
- **Triton Inference Server Security**: Security hardening for NVIDIA Triton Inference Server deployments, including model repository protection and API security.
- **Lab: Model Serving Framework Attacks**: Exploit vulnerabilities in TensorFlow Serving, TorchServe, and Triton Inference Server, targeting model loading, API endpoints, and management interfaces.
- **Model Serving Security**: Security hardening for model serving infrastructure, covering vLLM, TGI, and Triton Inference Server configuration, API security, resource isolation, and deployment best practices.
- **GCP Vertex AI Security Testing**: End-to-end walkthrough for security testing Vertex AI deployments on Google Cloud: endpoint enumeration, IAM policy analysis, model serving exploitation, pipeline assessment, and Cloud Audit Logs review.
- **Replicate API Security Testing**: End-to-end walkthrough for security testing models on Replicate: model enumeration, prediction API exploitation, webhook security, Cog container assessment, and billing abuse prevention.