Skip to main content
redteams.ai
All tags

# stable-diffusion

1 articletagged with “stable-diffusion

Adversarial Attacks on Text-to-Image Models

Understanding and evaluating adversarial attacks on text-to-image generation models including prompt manipulation for safety bypass, concept erasure attacks, adversarial perturbation of guidance, and membership inference on training data.

multimodaltext-to-imageadversarialdiffusionstable-diffusion
Advanced