# adversarial-ml

標記為「adversarial-ml」的 4 篇文章

對抗式 ML：核心概念

對抗式機器學習的歷史與基本原理——擾動攻擊、逃避與投毒、穩健性——將古典對抗式 ML 銜接至 LLM 特有攻擊。

adversarial-mlfundamentalsevasionpoisoningintermediate

中級

基礎

AI 紅隊演練的核心建構區塊，涵蓋紅隊方法論、AI 景觀、大型語言模型如何運作、嵌入向量與向量系統、AI 系統架構，以及對抗性機器學習概念。

foundationsllmsmethodologyembeddingsarchitectureadversarial-ml

入門

Lab: Adversarial ML From Scratch

Hands-on expert lab for implementing 梯度-based 對抗性 attacks against 語言模型 from scratch without frameworks, building intuition for how 對抗性 perturbations exploit model 梯度s.

labexpertadversarial-mlgradientsfrom-scratch

專家

Counterfit 導覽

Complete walkthrough of Microsoft's Counterfit adversarial ML testing framework: installation, target configuration, running attacks against ML models, interpreting results, and automating adversarial robustness assessments.

counterfitadversarial-mlmicrosoftrobustness-testingautomationwalkthrough

中級