# challenge

challengemulti-modelchainingboss-rushcross-modelaugust-2026

August 2026: Multi-Model Boss Rush

Chain attacks across GPT-4, Claude, and Gemini in a complex multi-model system, exploiting trust boundaries and handoff points between models.

challengejailbreakinnovationtechniquesfebruary-2026

February 2026: Jailbreak Innovation Challenge

Develop novel jailbreak techniques against hardened language models and document them with reproducibility evidence. Judged on novelty, reliability, and transferability.

challengesystem-promptextractionprompt-injectionjanuary-2026

January 2026: System Prompt Extraction Challenge

Extract system prompts from five increasingly defended chatbots, progressing from unprotected to heavily hardened configurations.

challengesupply-chainauditdependenciesprovenancejuly-2026

July 2026: Supply Chain Audit Challenge

Audit an ML project's entire supply chain for security issues including dependencies, model provenance, data pipelines, training infrastructure, and deployment artifacts.

challengecloudinfrastructureapi-securitydeploymentjune-2026

June 2026: Cloud AI Security Challenge

Find and document vulnerabilities in a cloud-deployed AI service covering API security, model serving infrastructure, authentication, and data handling.

challengeagentexploitationprompt-injectiontoolsmarch-2026

March 2026: Agent Exploitation Challenge

Compromise a multi-tool agent system through prompt injection and tool abuse, completing multiple objectives with escalating difficulty and point values.

challengeragpoisoningretrievalinjectionmay-2026

May 2026: RAG Poisoning Challenge

Inject malicious documents into a retrieval-augmented generation system to control responses for specific queries without disrupting normal operation.

challengefull-engagementred-teamprofessionalreportnovember-2026

November 2026: Full Engagement Challenge

Complete a realistic red team engagement simulation from scoping through final report delivery, producing professional-grade deliverables.

challengedefconai-villageresearchreproductionoctober-2026

October 2026: DEF CON AI Village Challenge

Reproduce and extend attacks from AI Village talks and workshops, building on published research to develop practical exploitation techniques.

challengeincident-responseforensicsinvestigationlogsseptember-2026

September 2026: Incident Response Challenge

Investigate simulated AI security incidents from logs, artifacts, and system traces. Reconstruct attack timelines, identify root causes, and write incident reports.

communitychallengedefensedetection

Defense Challenge: Detection Engineering

Challenge focused on building detection systems for prompt injection, with scoring based on true positive rate and false positive rate.

communitychallengedefenseguardrails

Defense Challenge: Build Unbreakable Guardrails

A challenge where participants build guardrail systems that must withstand automated attack suites, scored on both security and usability metrics.

communitychallengeagentmonthly

Monthly Challenge: Agent Hunter

Monthly challenge focused on discovering and exploiting vulnerabilities in agent-based AI systems.

communitychallengedefensemonthly

Monthly Challenge: Defense Builders

Monthly challenge to build the most robust defense against a specified set of attack techniques.

communitychallengeinjectionmonthly

Monthly Challenge: Injection Olympics

Monthly community challenge focused on creative prompt injection techniques across multiple models.

communitychallengeresearchinterpretability

Research Challenge: Attack Interpretability

Community research challenge focused on understanding why specific adversarial techniques succeed using interpretability and mechanistic analysis methods.

communitychallengeresearchtransfer

Research Challenge: Cross-Model Transfer

Research-oriented challenge exploring the transferability of adversarial techniques across model families, with published findings.

communitychallengeblind-injectionweekly

Weekly Challenge: Blind Injection Series

A series of challenges requiring injection attacks with no direct output feedback, forcing participants to develop side-channel observation techniques.

communitychallengedefensereverse-engineering

Weekly Challenge: Defense Reverse Engineering

Weekly challenges where participants reverse-engineer unknown defense mechanisms to identify bypass techniques and document their findings.

communitychallengeencodingweekly

Weekly Challenge: Encoding Gauntlet

A weekly challenge series focused on bypassing increasingly sophisticated encoding-based defenses through creative payload construction.

heistctfchallengedatalabs

Data Heist Challenge

Extract a secret database of customer records from a RAG-enabled chatbot with strict data access controls.

ctfsmugglerchallengepromptlabs

Prompt Smuggler Challenge

Smuggle a specific payload through 5 layers of increasingly sophisticated input filtering to capture the flag.

assessmentspractice-examtimedchallenge

Timed Challenge Practice Exam 1

Timed challenge exam with strict time limits to test knowledge recall and decision-making under pressure.

challengedefenseblue-teamhardeningchatbotapril-2026

April 2026: 防禦 Building Challenge

Build the most robust defense system for a chatbot, scored against an automated attack suite of 500 diverse prompt injection and jailbreak attempts.

challengemulti-modelchainingboss-rushcross-modelaugust-2026

August 2026: Multi-模型 Boss Rush

Chain attacks across GPT-4, Claude, and Gemini in a complex multi-model system, exploiting trust boundaries and handoff points between models.

challengejailbreakinnovationtechniquesfebruary-2026

February 2026: 越獄 Innovation Challenge

Develop novel jailbreak techniques against hardened language models and document them with reproducibility evidence. Judged on novelty, reliability, and transferability.

challengesystem-promptextractionprompt-injectionjanuary-2026

January 2026: System Prompt Extraction Challenge

Extract system prompts from five increasingly defended chatbots, progressing from unprotected to heavily hardened configurations.

challengesupply-chainauditdependenciesprovenancejuly-2026

July 2026: Supply Chain Audit Challenge

Audit an ML project's entire supply chain for security issues including dependencies, model provenance, data pipelines, training infrastructure, and deployment artifacts.

challengecloudinfrastructureapi-securitydeploymentjune-2026

June 2026: Cloud AI 安全 Challenge

Find and document vulnerabilities in a cloud-deployed AI service covering API security, model serving infrastructure, authentication, and data handling.

challengeagentexploitationprompt-injectiontoolsmarch-2026

2026 年 3 月：代理利用挑戰

經提示注入與工具濫用破壞多工具代理系統，以升級之難度與分數值完成多個目標。

challengeragpoisoningretrievalinjectionmay-2026

May 2026: RAG 投毒 Challenge

Inject malicious documents into a retrieval-augmented generation system to control responses for specific queries without disrupting normal operation.

challengefull-engagementred-teamprofessionalreportnovember-2026

2026 年 11 月：完整委任挑戰

完成自範圍界定至最終報告交付之現實紅隊委任模擬，產出專業級交付物。

challengedefconai-villageresearchreproductionoctober-2026

2026 年 10 月：DEF CON AI Village 挑戰

重現並擴展 AI Village 演講與工作坊之攻擊，建立於已發表研究之上以發展實務利用技術。

challengeincident-responseforensicsinvestigationlogsseptember-2026

September 2026: Incident Response Challenge

Investigate simulated AI security incidents from logs, artifacts, and system traces. Reconstruct attack timelines, identify root causes, and write incident reports.

communitychallengedefensedetection

防禦 Challenge: Detection Engineering

Challenge focused on building detection systems for prompt injection, with scoring based on true positive rate and false positive rate.

communitychallengedefenseguardrails

防禦 Challenge: Build Unbreakable Guardrails

A challenge where participants build guardrail systems that must withstand automated attack suites, scored on both security and usability metrics.

communitychallengeagentmonthly

Monthly Challenge: 代理 Hunter

Monthly challenge focused on discovering and exploiting vulnerabilities in agent-based AI systems.

communitychallengedefensemonthly

Monthly Challenge: 防禦 Builders

Monthly challenge to build the most robust defense against a specified set of attack techniques.

communitychallengeinjectionmonthly

Monthly Challenge: Injection Olympics

Monthly community challenge focused on creative prompt injection techniques across multiple models.

communitychallengeresearchinterpretability

Research Challenge: 攻擊 Interpretability

Community research challenge focused on understanding why specific adversarial techniques succeed using interpretability and mechanistic analysis methods.

communitychallengeresearchtransfer

Research Challenge: Cross-模型 Transfer

Research-oriented challenge exploring the transferability of adversarial techniques across model families, with published findings.

communitychallengeblind-injectionweekly

Weekly Challenge: Blind Injection Series

A series of challenges requiring injection attacks with no direct output feedback, forcing participants to develop side-channel observation techniques.

communitychallengedefensereverse-engineering

Weekly Challenge: 防禦 Reverse Engineering

Weekly challenges where participants reverse-engineer unknown defense mechanisms to identify bypass techniques and document their findings.

communitychallengeencodingweekly

Weekly Challenge: Encoding Gauntlet

A weekly challenge series focused on bypassing increasingly sophisticated encoding-based defenses through creative payload construction.

heistctfchallengedatalabs

Data Heist Challenge

Extract a secret database of customer records from a RAG-enabled chatbot with strict data access controls.

ctfsmugglerchallengepromptlabs

Prompt Smuggler Challenge

Smuggle a specific payload through 5 layers of increasingly sophisticated input filtering to capture the flag.