What is VLM 架構與視覺—語言對齊?

深入探討 VLM 架構，包括 CLIP、SigLIP 與 vision transformers。圖像 patch 如何變成 token、對齊訓練，以及錯位（misalignment）如何製造可利用之缺口。

What is 以圖像為本之提示注入?

將文字指令嵌入圖像以操弄 VLM 之技術，含隱寫注入、可見文字攻擊與 QR 碼利用。

What is VLM 的對抗性影像範例?

會改變 VLM 行為的像素級擾動，包括針對視覺編碼器的 PGD 攻擊、可遷移對抗影像，以及 patch 攻擊。

What is OCR 與排版攻擊?

經由排版攻擊、字體操弄、對抗文字覆蓋，與文字渲染利用來利用 VLM 中之 OCR 能力。

What is VLM 特有的越獄手法?

利用視覺模態的越獄技術，包括影像─文字不一致攻擊、視覺安全繞過，以及跨模態越獄策略。

What is 實驗室: Crafting Image-Based Injections?

Hands-on lab for creating image-based prompt injections, testing against VLMs, and measuring success rates across different injection techniques.

What is Typographic Adversarial 攻擊s?

How text rendered in images influences VLM behavior: adversarial typography, font-based prompt injection, visual instruction hijacking, and defenses against typographic manipulation.

視覺-語言模型

Intermediate1 min readUpdated 2026-03-15

視覺-語言模型（VLM）的安全評估——涵蓋 VLM 架構、圖片注入技術、OCR 與字型攻擊、對抗性圖片生成與 VLM 特定越獄。

vlm vision image-injection ocr adversarial-images multimodal

視覺-語言模型（VLM）將視覺處理與語言理解結合。GPT-4V、Claude 的視覺能力、Gemini 與開源 VLM 如 LLaVA 都能處理圖片輸入。每個圖片輸入都是潛在注入通道——模型「看到」的文字可繞過純文字防禦。

視覺-語言模型

Intermediate1 min readUpdated 2026-03-15

視覺-語言模型（VLM）的安全評估——涵蓋 VLM 架構、圖片注入技術、OCR 與字型攻擊、對抗性圖片生成與 VLM 特定越獄。

vlm vision image-injection ocr adversarial-images multimodal

視覺-語言模型

VLM 架構與安全

攻擊技術

字型攻擊（最簡單）

隱藏文字注入

對抗性擾動

VLM 越獄

偵測與緩解

Learning Path

視覺-語言模型

VLM 架構與安全

攻擊技術

字型攻擊（最簡單）

隱藏文字注入

對抗性擾動

VLM 越獄

偵測與緩解

Learning Path

視覺-語言模型

Learning Path

Related articles

視覺-語言模型

Learning Path

Related articles