# challenge
24 artikelengetagd met “challenge”
Oefenexamen 1 met tijdslimiet
Timed challenge exam with strict time limits to test knowledge recall and decision-making under pressure.
April 2026: uitdaging verdediging bouwen
Build the most robust defense system for a chatbot, scored against an automated attack suite of 500 diverse prompt injection and jailbreak attempts.
Augustus 2026: multi-model boss rush
Chain attacks across GPT-4, Claude, and Gemini in a complex multi-model system, exploiting trust boundaries and handoff points between models.
Februari 2026: jailbreak-innovatie-uitdaging
Develop novel jailbreak techniques against hardened language models and document them with reproducibility evidence. Judged on novelty, reliability, and transferability.
Januari 2026: uitdaging voor extractie van system prompts
Extract system prompts from five increasingly defended chatbots, progressing from unprotected to heavily hardened configurations.
Juli 2026: supply chain-audit-uitdaging
Audit an ML project's entire supply chain for security issues including dependencies, model provenance, data pipelines, training infrastructure, and deployment artifacts.
Juni 2026: cloud-AI-beveiligingsuitdaging
Find and document vulnerabilities in a cloud-deployed AI service covering API security, model serving infrastructure, authentication, and data handling.
Maart 2026: uitdaging voor misbruik van agents
Compromise a multi-tool agent system through prompt injection and tool abuse, completing multiple objectives with escalating difficulty and point values.
Mei 2026: RAG-poisoning-uitdaging
Inject malicious documents into a retrieval-augmented generation system to control responses for specific queries without disrupting normal operation.
November 2026: Full Engagement-uitdaging
Complete a realistic red team engagement simulation from scoping through final report delivery, producing professional-grade deliverables.
Oktober 2026: DEF CON AI Village-uitdaging
Reproduce and extend attacks from AI Village talks and workshops, building on published research to develop practical exploitation techniques.
September 2026: incident response-uitdaging
Investigate simulated AI security incidents from logs, artifacts, and system traces. Reconstruct attack timelines, identify root causes, and write incident reports.
Verdedigingsuitdaging: detection engineering
Challenge focused on building detection systems for prompt injection, with scoring based on true positive rate and false positive rate.
Verdedigingsuitdaging: bouw onbreekbare guardrails
A challenge where participants build guardrail systems that must withstand automated attack suites, scored on both security and usability metrics.
Maandelijkse uitdaging: Agent Hunter
Monthly challenge focused on discovering and exploiting vulnerabilities in agent-based AI systems.
Maandelijkse uitdaging: Defense Builders
Monthly challenge to build the most robust defense against a specified set of attack techniques.
Maandelijkse uitdaging: Injection Olympics
Monthly community challenge focused on creative prompt injection techniques across multiple models.
Onderzoeksuitdaging: interpreteerbaarheid van aanvallen
Community research challenge focused on understanding why specific adversarial techniques succeed using interpretability and mechanistic analysis methods.
Onderzoeksuitdaging: cross-model transfer
Research-oriented challenge exploring the transferability of adversarial techniques across model families, with published findings.
Wekelijkse uitdaging: blind injection-serie
A series of challenges requiring injection attacks with no direct output feedback, forcing participants to develop side-channel observation techniques.
Wekelijkse uitdaging: reverse engineering van verdediging
Weekly challenges where participants reverse-engineer unknown defense mechanisms to identify bypass techniques and document their findings.
Wekelijkse uitdaging: encoding gauntlet
A weekly challenge series focused on bypassing increasingly sophisticated encoding-based defenses through creative payload construction.
Data Heist-uitdaging
Extract a secret database of customer records from a RAG-enabled chatbot with strict data access controls.
Prompt Smuggler-uitdaging
Smuggle a specific payload through 5 layers of increasingly sophisticated input filtering to capture the flag.