# simulation
114 articles tagged "simulation"
Lab: Simulated Robot Control Exploitation
Hands-on lab exercises exploiting LLM-controlled robots in simulation: environment setup, injection attacks, safety bypass testing, and multi-step exploitation chains using PyBullet.
Production Environment Simulation Lab
Test attacks against a simulated production environment with realistic logging, monitoring, and alerting.
Lab: AI Incident Response Simulation
Practice AI incident response procedures through a simulated prompt injection incident with escalation and containment.
Full Engagement Simulations
End-to-end red team engagement simulations that replicate real-world AI security assessments, from scoping through report delivery.
Simulation: Agentic Workflow Full Engagement
Expert-level red team simulation targeting a multi-tool AI agent with code execution, file access, and API integration capabilities.
Simulation: Autonomous AI Agent Red Team
Red team engagement simulation targeting an autonomous AI agent with tool access, file system permissions, and internet connectivity. Tests for privilege escalation, unauthorized actions, and goal hijacking.
Simulation: AI Bug Bounty
Find and report vulnerabilities in a simulated AI bug bounty program, practicing professional vulnerability disclosure and bounty-eligible reporting.
Simulation: Build & Defend a Chatbot
Defense simulation where you build a chatbot with layered defenses, test it against a standardized attack suite, measure defense effectiveness, and iterate on weaknesses.
Simulation: Customer Chatbot Red Team
Complete red team engagement simulation targeting a customer service chatbot, covering prompt injection, data leakage, and policy violation testing.
Simulation: Code Assistant Security Review
Red team simulation targeting an AI code assistant, testing for code injection, credential leakage, supply chain poisoning, and unsafe code generation.
Code Review Assistant Assessment
Test a code review AI for vulnerabilities in code analysis, suggestion generation, and repository access.
Customer Service Agent Red Team
Red team a customer service agent with tool access to order systems, refunds, and customer data.
Simulation: Defense in Depth
Expert-level defense simulation implementing a full defense stack including input filter, output monitor, rate limiter, anomaly detector, and circuit breaker, then measuring effectiveness against automated attacks.
Educational AI Tutor Assessment
Assess an AI tutoring system for content safety, answer manipulation, and student data exposure.
Simulation: RAG Pipeline Poisoning
Red team engagement simulation targeting a RAG-based knowledge management system, covering embedding injection, document poisoning, retrieval manipulation, and knowledge base exfiltration.
Simulation: Enterprise Chatbot Engagement
Full red team engagement simulation targeting a customer-facing chatbot deployed by a fictional e-commerce company, covering reconnaissance, prompt injection, data exfiltration, and PII harvesting.
Simulation: Financial AI Platform
Expert-level red team engagement simulation targeting a fictional fintech AI-powered financial advisor, covering API mapping, advice manipulation, credential extraction, and regulatory impact assessment.
FinTech Chatbot Security Assessment
Conduct a full security assessment of a financial services chatbot handling sensitive transactions.
Simulation: Government AI Portal
Red team engagement simulation targeting a public-facing government benefits chatbot, covering reconnaissance, benefits fraud assistance, PII harvesting, bias exploitation, and remediation recommendations.
Simulation: Guardrail Engineering
Defense simulation where you design and implement a multi-layer guardrail system, test it against progressively sophisticated attacks, and document false positive/negative rates.
Simulation: Healthcare AI Safety Assessment
Expert-level simulation assessing a clinical decision support AI for safety violations, data leakage, and manipulation of medical recommendations.
Healthcare Diagnostic AI Assessment
Assess a healthcare diagnostic AI for safety-critical vulnerabilities and data privacy compliance.
Simulation: Healthcare AI System
Expert-level red team engagement simulation targeting a clinical decision support system, covering HIPAA-scoped threat modeling, diagnostic manipulation, patient data extraction, and treatment recommendation poisoning.
Legal AI Document Review Assessment
Assess a legal AI system that reviews contracts for vulnerabilities in document processing and privilege escalation.
Simulation: Legal AI Red Team
Red team engagement simulation targeting an AI-powered legal research and contract analysis platform, covering citation hallucination, privilege leakage, and adversarial clause injection.
Simulation: AI SOC Simulation
Defense simulation where you set up monitoring for an AI application, then respond to simulated attacks by practicing alert triage, investigation, and escalation procedures.
Multi-Agent Workflow Assessment
Red team a multi-agent system with specialized agents communicating via the A2A protocol.
Simulation: Multimodal Application Assessment
Red team simulation targeting an application that processes both images and text, testing visual injection, cross-modal attacks, and multimodal jailbreaks.
Simulation: Open Source AI Project Audit
Security audit simulation for an open-source AI application, covering code review, dependency analysis, model supply chain verification, and deployment configuration review.
Simulation: Enterprise RAG Security Assessment
Full engagement simulation assessing an enterprise RAG-powered knowledge base for poisoning, exfiltration, and injection vulnerabilities.
Simulation: Red vs Blue
Competitive exercise where teams alternate between attacking and defending an AI application, scoring points for successful attacks and effective defenses.
Simulation: SaaS AI Product
Red team engagement simulation targeting a B2B SaaS platform with AI-powered document analysis, search, and automation features, covering multi-tenant isolation, API security, and cross-tenant data leakage.
Simulation: Startup AI Assessment
Red team a startup's AI-powered product with limited scope and budget, making pragmatic tradeoffs between thoroughness and time constraints.
Simulation: AI Supply Chain Attack Investigation
Investigate and respond to a supply chain compromise affecting an AI system's model weights, training data pipeline, and third-party dependencies.
AI Supply Chain Pipeline Assessment
Assess the full ML pipeline from data ingestion through model deployment for supply chain attacks.
Simulation: Voice Assistant Red Team
Red team engagement simulation targeting an AI voice assistant deployed in a smart home platform, covering audio-based prompt injection, wake word exploitation, and privacy exfiltration.
Automated Pentesting Agent Assessment
Assess an AI-powered automated penetration testing agent for scope violations, unauthorized actions, and data handling.
Autonomous Vehicle Planning AI Assessment
Assess the safety boundaries of an autonomous vehicle planning AI through adversarial scenario injection.
Content Moderation AI Platform Assessment
Assess an AI content moderation system for bypass techniques, false negative exploitation, and bias.
Customer Data Platform AI Assessment
Red team an AI system integrated with a customer data platform handling PII, behavioral data, and segmentation.
Cyber Threat Intelligence AI Assessment
Red team a cyber threat intelligence AI that processes IOCs, threat reports, and attack attribution.
Data Analytics Copilot Assessment
Red team a data analytics copilot with SQL generation capabilities and access to enterprise databases.
DevOps AI Assistant Security Assessment
Assess a DevOps AI assistant with access to CI/CD pipelines, cloud infrastructure, and deployment systems.
Enterprise Knowledge Management AI Assessment
Assess an enterprise knowledge management system with role-based access, document permissions, and multi-tenant isolation.
Government Services Chatbot Assessment
Red team a government citizen services chatbot that accesses tax records, benefits, and identity information.
AI-Powered Incident Response System Assessment
Red team an AI incident response system in a SOC environment with access to SIEM, EDR, and ticketing.
Insurance Claims Bot Security Assessment
Conduct a full security assessment of an insurance claims processing chatbot with access to policy and claims data.
Legal Contract Review AI Assessment
Red team an AI contract review system for privilege violations, document injection, and hallucinated clauses.
Medical Triage Chatbot Assessment
Red team a medical triage chatbot for dangerous medical advice, data exposure, and safety-critical failures.
Multi-Agent Research Team Assessment
Assess a multi-agent research team with specialized researcher, writer, and reviewer agents communicating via A2A.
Real Estate Valuation AI Assessment
Assess an AI real estate valuation system for manipulation of property valuations and data exposure.
AI Recruiting Assistant Assessment
Red team an AI recruiting assistant that screens resumes, schedules interviews, and accesses candidate data.
Smart Home AI Assistant Assessment (Simulation)
Assess a smart home AI assistant that controls lights, locks, cameras, and thermostat via tool integrations.
Supply Chain Optimization AI Assessment
Assess an AI supply chain optimization system for manipulation of demand forecasts and routing decisions.
AI Trading Assistant Security Assessment
Conduct a security assessment of an AI trading assistant with access to portfolio data and trade execution.
Travel Booking Agent Red Team
Red team a travel booking AI agent with access to payment systems, loyalty programs, and personal data.
Lab: Attacking Federated Learning
Hands-on lab implementing model poisoning attacks in a simulated federated learning setup using the Flower framework: Byzantine attacks, model replacement, and measuring attack impact.