# case-study
16 articles tagged with “case-study”
- **Case Study: LLM Agent Tool Abuse in Production**
  Analysis of incidents where LLM agents misused connected tools, causing data exposure and unauthorized actions.
- **Case Study: Alignment Faking in Production**
  Analysis of alignment faking behaviors observed in production AI systems and their implications, drawing on Greenblatt et al. 2024.
- **Case Study: Many-Shot Jailbreaking Discovery**
  Deep analysis of Anthropic's many-shot jailbreaking research and its implications for long-context model safety.
- **Case Study: Election-Related AI Misuse**
  Analysis of AI system misuse in electoral contexts, including deepfakes, automated disinformation, and platform responses.
- **Case Study: Early EU AI Act Enforcement Actions**
  Analysis of early enforcement actions and compliance challenges under the EU AI Act for AI system providers.
- **Case Study: Financial AI Trading Manipulation**
  Analysis of adversarial manipulation of AI-powered trading systems, including market impact and regulatory response.
- **Case Study: GCG Attack and Industry Response**
  Analysis of the Zou et al. 2023 GCG attack, the industry response, and its lasting impact on adversarial robustness research.
- **Case Study: GPT Plugin Data Exfiltration**
  Analysis of data exfiltration vulnerabilities in the early ChatGPT plugin ecosystem, including cross-plugin attacks.
- **Case Study: Healthcare AI Diagnostic Failure**
  Analysis of a healthcare AI diagnostic system failure, including root cause analysis and patient safety implications.
- **Case Study: Indirect Prompt Injection in Bing Chat**
  Detailed analysis of indirect prompt injection attacks demonstrated against Bing Chat through web content manipulation.
- **Case Study: MCP Security Vulnerability Disclosure**
  Analysis of early MCP security vulnerability discoveries, including tool poisoning and transport security issues.
- **Case Study: Open-Source Model Jailbreak Campaign**
  Analysis of coordinated jailbreak campaigns against open-source models and community response patterns.
- **Case Study: PAIR Automated Jailbreaking**
  Deep analysis of the PAIR attack methodology (Chao et al. 2023) and its impact on automated red teaming approaches.
- **Case Study: Production RAG Poisoning Incident**
  Detailed analysis of a real-world RAG poisoning incident, including attack methodology, impact, and remediation.
- **Case Study: Sleeper Agents Research Impact**
  Analysis of the Hubinger et al. 2024 sleeper agents research and its implications for AI safety and red teaming.
- **AI Incident Analysis Methodology**
  A structured methodology for analyzing AI security incidents. Learn to reconstruct timelines, identify root causes, assess impact, and extract actionable lessons from real-world AI failures across chatbots, data leaks, and alignment failures.