# case-study
16 articles tagged “case-study”
Case study: tool misuse by an LLM agent in production
Analysis of incidents where LLM agents misused connected tools, causing data exposure and unauthorized actions.
Case study: alignment faking in production
Analysis of alignment faking behaviors observed in production AI systems and implications from Greenblatt et al. 2024.
Case study: the discovery of many-shot jailbreaking
Deep analysis of Anthropic's many-shot jailbreaking research and its implications for long-context model safety.
Case study: AI misuse around elections
Analysis of AI system misuse in electoral contexts including deepfakes, automated disinformation, and platform responses.
Case study: early enforcement actions under the EU AI Act
Analysis of early enforcement actions and compliance challenges under the EU AI Act for providers of AI systems.
Case study: manipulation of financial trading AI
Analysis of adversarial manipulation of AI-powered trading systems including market impact and regulatory response.
Case study: the GCG attack and the industry response
Analysis of the Zou et al. 2023 GCG attack, industry response, and lasting impact on adversarial robustness research.
Case study: data exfiltration via GPT plugins
Analysis of data exfiltration vulnerabilities in the early ChatGPT plugin ecosystem, including cross-plugin attacks.
Case study: diagnostic failure of AI in healthcare
Analysis of a healthcare AI diagnostic system failure including root cause analysis and patient safety implications.
Case study: indirect prompt injection in Bing Chat
Detailed analysis of indirect prompt injection attacks demonstrated against Bing Chat through web content manipulation.
Case study: disclosure of an MCP security vulnerability
Analysis of early MCP security vulnerability discoveries including tool poisoning and transport security issues.
Case study: jailbreak campaign against open-source models
Analysis of coordinated jailbreak campaigns against open-source models and the community's response patterns.
Case study: automated jailbreaking with PAIR
Deep analysis of the PAIR attack methodology (Chao et al. 2023) and its impact on automated red teaming approaches.
Case study: RAG poisoning incident in production
Detailed analysis of a real-world RAG poisoning incident, including attack methodology, impact, and remediation.
Case study: impact of the Sleeper Agents research
Analysis of Hubinger et al. 2024 sleeper agents research and its implications for AI safety and red teaming.
Methodology for analyzing AI incidents
A structured methodology for analyzing AI security incidents. Learn to reconstruct timelines, identify root causes, assess impact, and extract actionable lessons from real-world AI failures across chatbots, data leaks, and alignment failures.