跳至主要內容

主題詞彙表部落格 ATT&CK 導覽器挑戰

Loading...

© 2026 redteams.ai. 保留所有權利。

採用 Next.js

主題詞彙表標籤部落格 ATT&CK 導覽器挑戰

方法論貢獻書籤 RSS 訂閱 GitHub 聯絡我們

隱私權 Cookie 服務條款版權資訊

// stay adversarial

# claude

標記為「claude」的 8 篇文章

案例研究:Claude Many-Shot 越獄

分析 Anthropic 對 many-shot 越獄的揭露,以及對上下文學習的啟示。

casemanystudyclaudestudies

Lab: Anthropic Claude API Basics

設定 the Anthropic Claude API for 紅隊演練, learn authentication, the Messages API, 系統提示詞s, and how temperature and top-p affect attack success rates.

labanthropicclaudeapibeginner

Claude 攻擊面

Claude 特有攻擊向量，含憲法 AI 弱點、工具使用利用、系統提示處理、視覺攻擊與 XML 標籤注入技術。

claudeattack-surfaceconstitutional-aixml-injectiontool-usevision-attacks

Claude（Anthropic）概觀

Anthropic Claude 模型家族的架構與安全概觀，涵蓋 Sonnet、Opus 與 Haiku 變體、Constitutional AI 訓練、RLHF 做法，以及 harmlessness 設計哲學。

claudeanthropicconstitutional-airlhfharmlessnessred-teaming

Claude 已知漏洞

已記錄之 Claude 漏洞，包括 many-shot jailbreak、對齊偽裝研究、crescendo 攻擊、經由 artifact 之提示注入，以及系統提示擷取技術。

claudevulnerabilitiesmany-shotalignment-fakingcrescendoprompt-injection

Claude Testing Methodology

Systematic methodology for red teaming Claude models, including API probing, model card analysis, safety boundary mapping, and comparative testing across Opus, Sonnet, and Haiku tiers.

claudetestingmethodologyapi-probingsafety-boundariesmodel-tiers

Claude 架構安全性

Anthropic 的 Claude 架構、其訓練方法及所衍生安全特性的深入探討。

model-deep-divesclaudeanthropicsecurity

測試 Anthropic Claude:完整指南

Anthropic Claude 的完整紅隊測試指南(含工具使用、延伸思考與電腦使用)。

walkthroughsplatformsanthropicclaude