# diffusion
2 articles tagged with “diffusion”
Text-to-Image Model Attacks
Adversarial prompts for text-to-image models: unsafe content generation, safety filter bypass, watermark evasion, prompt injection in image generation pipelines, and concept smuggling.
text-to-image, diffusion, adversarial-prompts, content-generation, watermark
Adversarial Attacks on Text-to-Image Models
Understanding and evaluating adversarial attacks on text-to-image generation models, including prompt manipulation for safety bypass, concept erasure attacks, adversarial perturbation of guidance, and membership inference on training data.
multimodal, text-to-image, adversarial, diffusion, stable-diffusion