可視化紅隊結果

Intermediate5 min readUpdated 2026-03-13

為 AI 紅隊報告建立有效視覺化：ASR 圖表、攻擊分類熱圖、防禦涵蓋矩陣，以及 Python 可視化程式碼。

可視化紅隊結果

一張精心挑選的視覺化，比一整段文字更快速、更令人記憶深刻地傳達發現。本頁涵蓋 AI 紅隊報告中最有用的圖表類型、各類型的適用時機，以及可直接用於生產環境的 Python 程式碼。

視覺化選擇指南

資料類型	最佳視覺化	對象	範例
跨手法的成功率	水平長條圖	技術、執行主管	各攻擊類別的 ASR
跨攻擊面的涵蓋度	熱圖	技術、管理	攻擊分類涵蓋
發現嚴重性分佈	甜甜圈圖或堆疊長條圖	執行主管	嚴重/高/中/低比重
防禦有效性	矩陣 / 網格	技術	防禦對攻擊類型矩陣
隨時間的趨勢	折線圖	管理、執行主管	各季度評估的 ASR
攻擊鏈流向	Sankey 圖	技術	多步驟攻擊進程

攻擊成功率（ASR）圖

攻擊成功率是 AI 紅隊中最基本的指標。

水平長條圖 —— 各類別 ASR

import matplotlib.pyplot as plt
import matplotlib.ticker as mticker
import numpy as np
 
# 資料
categories = [
    "Direct Injection",
    "Indirect Injection",
    "Role-play Jailbreak",
    "Encoding Bypass",
    "System Prompt Extraction",
    "Tool Abuse",
    "Multi-turn Manipulation",
    "Safety Bypass",
]
asr_values = [45, 20, 70, 15, 60, 10, 35, 55]
 
# 依嚴重性門檻著色
colors = []
for v in asr_values:
    if v >= 50:
        colors.append("#dc3545")   # 紅 - 嚴重
    elif v >= 30:
        colors.append("#fd7e14")   # 橘 - 警示
    else:
        colors.append("#28a745")   # 綠 - 可接受
 
fig, ax = plt.subplots(figsize=(10, 6))
bars = ax.barh(categories, asr_values, color=colors, height=0.6)
 
# 加上數值標籤
for bar, val in zip(bars, asr_values):
    ax.text(bar.get_width() + 1.5, bar.get_y() + bar.get_height() / 2,
            f"{val}%", va="center", fontsize=11, fontweight="bold")
 
# 門檻線
ax.axvline(x=50, color="#dc3545", linestyle="--", alpha=0.5, label="Critical threshold")
ax.axvline(x=30, color="#fd7e14", linestyle="--", alpha=0.5, label="Warning threshold")
 
ax.set_xlabel("Attack Success Rate (%)", fontsize=12)
ax.set_title("Attack Success Rate by Category", fontsize=14, fontweight="bold")
ax.set_xlim(0, 100)
ax.xaxis.set_major_formatter(mticker.PercentFormatter())
ax.legend(loc="lower right")
ax.invert_yaxis()
plt.tight_layout()
plt.savefig("asr_by_category.png", dpi=150, bbox_inches="tight")
plt.show()

分組長條圖 —— 模型比較

import matplotlib.pyplot as plt
import numpy as np
 
categories = ["Injection", "Jailbreak", "Extraction", "Tool Abuse", "Safety"]
model_a = [45, 70, 60, 10, 55]
model_b = [25, 40, 30, 5, 20]
 
x = np.arange(len(categories))
width = 0.35
 
fig, ax = plt.subplots(figsize=(10, 6))
bars1 = ax.bar(x - width / 2, model_a, width, label="Before Remediation",
               color="#dc3545", alpha=0.8)
bars2 = ax.bar(x + width / 2, model_b, width, label="After Remediation",
               color="#28a745", alpha=0.8)
 
ax.set_ylabel("Attack Success Rate (%)")
ax.set_title("Remediation Effectiveness by Attack Category")
ax.set_xticks(x)
ax.set_xticklabels(categories)
ax.legend()
ax.set_ylim(0, 100)
 
# 加上數值標籤
for bars in [bars1, bars2]:
    for bar in bars:
        height = bar.get_height()
        ax.text(bar.get_x() + bar.get_width() / 2, height + 1,
                f"{height}%", ha="center", va="bottom", fontsize=9)
 
plt.tight_layout()
plt.savefig("remediation_comparison.png", dpi=150, bbox_inches="tight")
plt.show()

攻擊分類熱圖

熱圖可橫跨二維（例如攻擊類型 vs. 目標元件）同時呈現測試涵蓋度與結果。

import matplotlib.pyplot as plt
import numpy as np
import seaborn as sns
 
# 列：攻擊類型；欄：目標元件
attack_types = [
    "Direct Injection", "Indirect Injection", "Role-play Jailbreak",
    "Encoding Bypass", "Prompt Extraction", "Tool Manipulation",
]
components = ["Chat API", "RAG Pipeline", "Agent Tools", "Safety Filter"]
 
# 數值：ASR 百分比（-1 = 未測試）
data = np.array([
    [45, 20, -1, 30],
    [15, 55, 40, 10],
    [70, -1, 25, 50],
    [15, 5,  -1, 20],
    [60, 30, 10, -1],
    [-1, -1, 35, 5],
])
 
# 為未測試格遮罩
mask = data == -1
display_data = np.where(mask, 0, data)
 
fig, ax = plt.subplots(figsize=(10, 7))
sns.heatmap(
    display_data, annot=True, fmt="d", mask=mask,
    xticklabels=components, yticklabels=attack_types,
    cmap="RdYlGn_r", vmin=0, vmax=100,
    linewidths=1, linecolor="white",
    cbar_kws={"label": "Attack Success Rate (%)"},
    ax=ax,
)
 
# 標示未測試格
for i in range(data.shape[0]):
    for j in range(data.shape[1]):
        if mask[i, j]:
            ax.text(j + 0.5, i + 0.5, "N/T", ha="center", va="center",
                    fontsize=10, color="gray", fontstyle="italic")
 
ax.set_title("Attack Taxonomy Coverage Heatmap", fontsize=14, fontweight="bold")
ax.set_ylabel("Attack Type")
ax.set_xlabel("Target Component")
plt.tight_layout()
plt.savefig("attack_heatmap.png", dpi=150, bbox_inches="tight")
plt.show()

防禦涵蓋矩陣

呈現哪些防禦對哪類攻擊有效：

	輸入過濾	輸出過濾	安全分類器	速率限制	提示強化
直接注入	部分	有效	有效	無效	部分
間接注入	無效	部分	部分	無效	無效
越獄	部分	部分	有效	部分	無效
提示擷取	無效	有效	無效	無效	有效
工具濫用	無效	無效	無效	有效	部分

圖例：有效 = 封鎖 >80% | 部分 = 封鎖 30–80% | 無效 = 封鎖 <30%

嚴重性分佈

甜甜圈圖可快速向高階主管摘要發現的嚴重性分佈：

import matplotlib.pyplot as plt
 
labels = ["Critical", "High", "Medium", "Low"]
sizes = [2, 4, 6, 3]
colors = ["#dc3545", "#fd7e14", "#ffc107", "#28a745"]
explode = (0.05, 0, 0, 0)
 
fig, ax = plt.subplots(figsize=(8, 8))
wedges, texts, autotexts = ax.pie(
    sizes, explode=explode, labels=labels, colors=colors,
    autopct=lambda pct: f"{pct:.0f}%\n({int(round(pct / 100 * sum(sizes)))})",
    shadow=False, startangle=90, pctdistance=0.75,
    textprops={"fontsize": 12},
)
 
# 畫中心圓營造甜甜圈效果
centre_circle = plt.Circle((0, 0), 0.50, fc="white")
ax.add_artist(centre_circle)
ax.text(0, 0, f"{sum(sizes)}\nFindings", ha="center", va="center",
        fontsize=16, fontweight="bold")
 
ax.set_title("Finding Severity Distribution", fontsize=14, fontweight="bold")
plt.tight_layout()
plt.savefig("severity_distribution.png", dpi=150, bbox_inches="tight")
plt.show()

視覺化最佳實務

實務	原因
使用對色覺友善的色盤	約 8% 男性有色覺異常
所有長條皆加資料標籤	讀者不應需要從座標軸去估算
以 150+ DPI 輸出	於報告與簡報中避免失真
每張圖皆附描述性標題	用「ASR by Attack Category」而非「Figure 1」
說明「好」與「壞」各是什麼	加入門檻線或色彩編碼
保持簡潔	每張圖一個訊息——複雜資料分為多張

參考資料

"Data Visualization Best Practices for Security Reporting" - SANS Institute（2024）- 針對資安評估對象的視覺化技術
"The Visual Display of Quantitative Information" - Edward Tufte（2001）- 適用於紅隊報告的有效資料視覺化基礎原則
"D3.js: Data-Driven Documents" - Observable（2024）- 常用於互動式資安儀表板與熱圖的視覺化函式庫
"Storytelling with Data" - Cole Nussbaumer Knaflic（2015）- 為非技術對象而設計、以溝通為導向的資料視覺化做法

Knowledge Check

在攻擊分類熱圖中，未測試的攻擊/元件組合應如何呈現？

可視化紅隊結果

Intermediate5 min readUpdated 2026-03-13

為 AI 紅隊報告建立有效視覺化：ASR 圖表、攻擊分類熱圖、防禦涵蓋矩陣，以及 Python 可視化程式碼。

visualization results charts dashboards

可視化紅隊結果

視覺化選擇指南

資料類型	最佳視覺化	對象	範例
跨手法的成功率	水平長條圖	技術、執行主管	各攻擊類別的 ASR
跨攻擊面的涵蓋度	熱圖	技術、管理	攻擊分類涵蓋
發現嚴重性分佈	甜甜圈圖或堆疊長條圖	執行主管	嚴重/高/中/低比重
防禦有效性	矩陣 / 網格	技術	防禦對攻擊類型矩陣
隨時間的趨勢	折線圖	管理、執行主管	各季度評估的 ASR
攻擊鏈流向	Sankey 圖	技術	多步驟攻擊進程

攻擊成功率（ASR）圖

攻擊成功率是 AI 紅隊中最基本的指標。

水平長條圖 —— 各類別 ASR

import matplotlib.pyplot as plt
import matplotlib.ticker as mticker
import numpy as np
 
# 資料
categories = [
    "Direct Injection",
    "Indirect Injection",
    "Role-play Jailbreak",
    "Encoding Bypass",
    "System Prompt Extraction",
    "Tool Abuse",
    "Multi-turn Manipulation",
    "Safety Bypass",
]
asr_values = [45, 20, 70, 15, 60, 10, 35, 55]
 
# 依嚴重性門檻著色
colors = []
for v in asr_values:
    if v >= 50:
        colors.append("#dc3545")   # 紅 - 嚴重
    elif v >= 30:
        colors.append("#fd7e14")   # 橘 - 警示
    else:
        colors.append("#28a745")   # 綠 - 可接受
 
fig, ax = plt.subplots(figsize=(10, 6))
bars = ax.barh(categories, asr_values, color=colors, height=0.6)
 
# 加上數值標籤
for bar, val in zip(bars, asr_values):
    ax.text(bar.get_width() + 1.5, bar.get_y() + bar.get_height() / 2,
            f"{val}%", va="center", fontsize=11, fontweight="bold")
 
# 門檻線
ax.axvline(x=50, color="#dc3545", linestyle="--", alpha=0.5, label="Critical threshold")
ax.axvline(x=30, color="#fd7e14", linestyle="--", alpha=0.5, label="Warning threshold")
 
ax.set_xlabel("Attack Success Rate (%)", fontsize=12)
ax.set_title("Attack Success Rate by Category", fontsize=14, fontweight="bold")
ax.set_xlim(0, 100)
ax.xaxis.set_major_formatter(mticker.PercentFormatter())
ax.legend(loc="lower right")
ax.invert_yaxis()
plt.tight_layout()
plt.savefig("asr_by_category.png", dpi=150, bbox_inches="tight")
plt.show()

分組長條圖 —— 模型比較

import matplotlib.pyplot as plt
import numpy as np
 
categories = ["Injection", "Jailbreak", "Extraction", "Tool Abuse", "Safety"]
model_a = [45, 70, 60, 10, 55]
model_b = [25, 40, 30, 5, 20]
 
x = np.arange(len(categories))
width = 0.35
 
fig, ax = plt.subplots(figsize=(10, 6))
bars1 = ax.bar(x - width / 2, model_a, width, label="Before Remediation",
               color="#dc3545", alpha=0.8)
bars2 = ax.bar(x + width / 2, model_b, width, label="After Remediation",
               color="#28a745", alpha=0.8)
 
ax.set_ylabel("Attack Success Rate (%)")
ax.set_title("Remediation Effectiveness by Attack Category")
ax.set_xticks(x)
ax.set_xticklabels(categories)
ax.legend()
ax.set_ylim(0, 100)
 
# 加上數值標籤
for bars in [bars1, bars2]:
    for bar in bars:
        height = bar.get_height()
        ax.text(bar.get_x() + bar.get_width() / 2, height + 1,
                f"{height}%", ha="center", va="bottom", fontsize=9)
 
plt.tight_layout()
plt.savefig("remediation_comparison.png", dpi=150, bbox_inches="tight")
plt.show()

攻擊分類熱圖

熱圖可橫跨二維（例如攻擊類型 vs. 目標元件）同時呈現測試涵蓋度與結果。

import matplotlib.pyplot as plt
import numpy as np
import seaborn as sns
 
# 列：攻擊類型；欄：目標元件
attack_types = [
    "Direct Injection", "Indirect Injection", "Role-play Jailbreak",
    "Encoding Bypass", "Prompt Extraction", "Tool Manipulation",
]
components = ["Chat API", "RAG Pipeline", "Agent Tools", "Safety Filter"]
 
# 數值：ASR 百分比（-1 = 未測試）
data = np.array([
    [45, 20, -1, 30],
    [15, 55, 40, 10],
    [70, -1, 25, 50],
    [15, 5,  -1, 20],
    [60, 30, 10, -1],
    [-1, -1, 35, 5],
])
 
# 為未測試格遮罩
mask = data == -1
display_data = np.where(mask, 0, data)
 
fig, ax = plt.subplots(figsize=(10, 7))
sns.heatmap(
    display_data, annot=True, fmt="d", mask=mask,
    xticklabels=components, yticklabels=attack_types,
    cmap="RdYlGn_r", vmin=0, vmax=100,
    linewidths=1, linecolor="white",
    cbar_kws={"label": "Attack Success Rate (%)"},
    ax=ax,
)
 
# 標示未測試格
for i in range(data.shape[0]):
    for j in range(data.shape[1]):
        if mask[i, j]:
            ax.text(j + 0.5, i + 0.5, "N/T", ha="center", va="center",
                    fontsize=10, color="gray", fontstyle="italic")
 
ax.set_title("Attack Taxonomy Coverage Heatmap", fontsize=14, fontweight="bold")
ax.set_ylabel("Attack Type")
ax.set_xlabel("Target Component")
plt.tight_layout()
plt.savefig("attack_heatmap.png", dpi=150, bbox_inches="tight")
plt.show()

防禦涵蓋矩陣

呈現哪些防禦對哪類攻擊有效：

	輸入過濾	輸出過濾	安全分類器	速率限制	提示強化
直接注入	部分	有效	有效	無效	部分
間接注入	無效	部分	部分	無效	無效
越獄	部分	部分	有效	部分	無效
提示擷取	無效	有效	無效	無效	有效
工具濫用	無效	無效	無效	有效	部分

圖例：有效 = 封鎖 >80% | 部分 = 封鎖 30–80% | 無效 = 封鎖 <30%

嚴重性分佈

甜甜圈圖可快速向高階主管摘要發現的嚴重性分佈：

import matplotlib.pyplot as plt
 
labels = ["Critical", "High", "Medium", "Low"]
sizes = [2, 4, 6, 3]
colors = ["#dc3545", "#fd7e14", "#ffc107", "#28a745"]
explode = (0.05, 0, 0, 0)
 
fig, ax = plt.subplots(figsize=(8, 8))
wedges, texts, autotexts = ax.pie(
    sizes, explode=explode, labels=labels, colors=colors,
    autopct=lambda pct: f"{pct:.0f}%\n({int(round(pct / 100 * sum(sizes)))})",
    shadow=False, startangle=90, pctdistance=0.75,
    textprops={"fontsize": 12},
)
 
# 畫中心圓營造甜甜圈效果
centre_circle = plt.Circle((0, 0), 0.50, fc="white")
ax.add_artist(centre_circle)
ax.text(0, 0, f"{sum(sizes)}\nFindings", ha="center", va="center",
        fontsize=16, fontweight="bold")
 
ax.set_title("Finding Severity Distribution", fontsize=14, fontweight="bold")
plt.tight_layout()
plt.savefig("severity_distribution.png", dpi=150, bbox_inches="tight")
plt.show()

視覺化最佳實務

實務	原因
使用對色覺友善的色盤	約 8% 男性有色覺異常
所有長條皆加資料標籤	讀者不應需要從座標軸去估算
以 150+ DPI 輸出	於報告與簡報中避免失真
每張圖皆附描述性標題	用「ASR by Attack Category」而非「Figure 1」
說明「好」與「壞」各是什麼	加入門檻線或色彩編碼
保持簡潔	每張圖一個訊息——複雜資料分為多張

參考資料

"Data Visualization Best Practices for Security Reporting" - SANS Institute（2024）- 針對資安評估對象的視覺化技術
"The Visual Display of Quantitative Information" - Edward Tufte（2001）- 適用於紅隊報告的有效資料視覺化基礎原則
"D3.js: Data-Driven Documents" - Observable（2024）- 常用於互動式資安儀表板與熱圖的視覺化函式庫
"Storytelling with Data" - Cole Nussbaumer Knaflic（2015）- 為非技術對象而設計、以溝通為導向的資料視覺化做法

Knowledge Check

在攻擊分類熱圖中，未測試的攻擊/元件組合應如何呈現？

可視化紅隊結果

可視化紅隊結果

視覺化選擇指南

攻擊成功率（ASR）圖

水平長條圖 —— 各類別 ASR

分組長條圖 —— 模型比較

攻擊分類熱圖

防禦涵蓋矩陣

嚴重性分佈

視覺化最佳實務

相關主題

參考資料

可視化紅隊結果

可視化紅隊結果

視覺化選擇指南

攻擊成功率（ASR）圖

水平長條圖 —— 各類別 ASR

分組長條圖 —— 模型比較

攻擊分類熱圖

防禦涵蓋矩陣

嚴重性分佈

視覺化最佳實務

相關主題

參考資料

可視化紅隊結果

Related articles

可視化紅隊結果

Related articles