# poisoning

agentic-exploitationagent-memoryconversationalpoisoning

Vergiftiging van conversationeel geheugen

Het vergiftigen van opslag van gespreksgeschiedenis om het begrip van de agent van eerdere interacties te wijzigen.

agentic-exploitationagent-memoryretrievalpoisoning

Vergiftiging van geheugenophaling

Manipuleren van mechanismen voor geheugenophaling om adversariële context te laten opduiken tijdens het redeneren van de agent.

exploitationmemoryvectoragenticpoisoning

Vector-gebaseerde geheugenvergiftiging

Het vergiftigen van vector-gebaseerde geheugenopslagplaatsen in agent-systemen om valse context in het ophalen te injecteren.

agentic-exploitationagent-memoryshared-memorypoisoning

Vergiftiging van gedeelde geheugenruimte

Vergiftig geheugenruimtes die worden gedeeld tussen meerdere agents of gebruikers in multi-tenant agent-implementaties.

function-callingcontext-injectionpoisoning

Contextinjectie bij functieaanroepen

Het injecteren van adversariële content via functieaanroepresultaten die de daaropvolgende redenering van het model beïnvloeden.

agentic-exploitationfunction-callingresultpoisoning

Vergiftiging van functieresultaten

Het vergiftigen van functieaanroepresultaten om instructies terug te injecteren in de redeneerketen van het model.

MCP schema-vergiftigingsaanvallen

MCP tool-schemadefinities exploiteren om kwaadaardige parameters te injecteren en verwacht gedrag te overschrijven.

mcpschemapoisoning

agentic-exploitationmcpschemapoisoning

Vergiftiging van MCP-toolschema's

MCP-toolschema's vergiftigen met verborgen instructies in beschrijvingen, parametertypes en validatieregels.

A2A-artefactvergiftiging

Kwaadaardige inhoud injecteren in gedeelde artefacten en taakresultaten die tussen agents worden doorgegeven in A2A-workflows.

a2aartifactpoisoning

agentic-exploitationagent-swarmpoisoningmulti-agent

Agent Swarm Poisoning

Het vergiftigen van de coördinatie van agent-swarms om collectieve besluitvorming en taakallocatie te manipuleren.

ai-forensics-irvector-databasepoisoningembedding-forensicsrag

Vectordatabase-forensics

Forensische analysetechnieken voor het detecteren en onderzoeken van vergiftiging van vectordatabases, ongeautoriseerde wijzigingen en schendingen van data-integriteit.

skill-verificationagent-memorypoisoning

Vaardigheidsverificatie: aanvallen op agentgeheugen

Practical verification of memory poisoning, context manipulation, and cross-session persistence skills.

Casestudy: RAG-vergiftigingsincident in productie

Gedetailleerde analyse van een RAG-vergiftigingsincident uit de praktijk, inclusief aanvalsmethodologie, impact en herstel.

case-studyragpoisoning

code-gencontextpoisoningrepository

Poisoning van codebasecontext

Poisoning repository files that AI coding assistants use for context to influence code suggestions across the entire development team.

challengeragpoisoningretrievalinjectionmay-2026

Mei 2026: RAG-poisoning-uitdaging

Inject malicious documents into a retrieval-augmented generation system to control responses for specific queries without disrupting normal operation.

data-trainingpoisoningclean-labelstealth

Clean-label poisoning-aanvallen

Creating poisoned training samples that maintain correct labels while still influencing model behavior through subtle feature manipulation.

data-trainingdeduplicationpoisoningevasion

Aanvallen via datadeduplicatie

Exploiting and evading data deduplication processes used in training pipeline data cleaning to ensure poisoned samples survive preprocessing.

data-trainingpoisoningstrategiesattacks

Strategieën voor datavergiftiging

Comprehensive strategies for poisoning training data to influence model behavior at scale.

data-trainingfederated-learningpoisoningdistributed

Poisoning bij federated learning

Attacking federated learning systems by submitting poisoned gradient updates from compromised participants while evading Byzantine-robust aggregation.

data-trainingpoisoninggradientoptimization

Gradient-gebaseerde datavergiftiging

Using gradient information to craft maximally effective poisoned samples that achieve attacker objectives with minimal data modification.

ragpoisoningembeddingsvector-searchadversarial-documentsretrieval-manipulation

Poisoning van de kennisbank

Techniques for injecting adversarial documents into RAG knowledge bases: ingestion path analysis, embedding space attacks, SEO-style ranking manipulation, staged poisoning, and effectiveness measurement.

syntheticpracticaldatapoisoningtraining

Praktische vergiftiging van synthetische data

Poisoning synthetic data generation pipelines used for model training augmentation.

data-trainingweb-crawlpoisoninginternet-scale

Web crawl-vergiftiging op grote schaal

Strategic placement of adversarial content on the internet to influence web-crawled training datasets used by large language models.

data-trainingweb-scalepoisoningcrawl

Datavergiftiging op webschaal

Techniques for poisoning web-scale training data through targeted content manipulation.

embeddingpoisoningtechniquesretrieval

Technieken voor embedding-poisoning

Technieken om embeddingruimtes te vergiftigen en zo retrieval en similarity search te manipuleren.

embeddingRAGpoisoningretrieval

Vergiftiging van RAG-retrieval

Het vergiftigen van documentcollecties om te manipuleren wat RAG-systemen ophalen, waardoor indirecte prompt injection op schaal mogelijk wordt.

fine-tuningadapterpoisoningattacks

Adapter poisoning-aanvallen

Poisoning publicly shared adapters and LoRA weights to compromise downstream users.

fine-tuningsafety-datapoisoningalignment

Vergiftiging van veiligheidsdatasets

Attacking the safety training pipeline by poisoning safety evaluation datasets and safety-oriented fine-tuning data to undermine safety training.

adversarial-mlfundamentalsevasionpoisoningintermediate

Adversarial ML: kernconcepten

Geschiedenis en grondbeginselen van adversarial machine learning — verstoringsaanvallen, evasion vs. poisoning, robuustheid — als brug van klassieke adversarial ML naar LLM-specifieke aanvallen.

code-suggestionpoisoningsupply-chain

Vergiftiging van Codesuggesties (Frontier Research)

Het vergiftigen van trainingsdata en pakket-ecosystemen om AI-codesuggesties te beïnvloeden: seeding van onveilige patronen, verwarring van pakketnamen, injectie van trojan-code, en supply chain-risico's.

frontierfederated-learningpoisoning

Federated Learning Modelvergiftiging

Vergiftiging van federated learning-aggregatie via kwaadaardige gradient-updates en byzantijnse aanvalsvectoren.

frontier-researchsynthetic-datapoisoningtraining

Vergiftiging van Synthetische Data in Trainingspipelines

Onderzoek naar het vergiftigen van pipelines voor het genereren van synthetische data die worden gebruikt voor modeltraining en fine-tuning.

model-registrymlflowwandbsupply-chainpoisoningartifact-security

Model-registries vergiftigen

Geavanceerde technieken voor het aanvallen van model-registries zoals MLflow, Weights & Biases en Hugging Face Hub, waaronder modelvervangingsaanvallen, metadata-manipulatie, artefactvergiftiging en supply chain-compromittering via registry-infrastructuur.

labscurriculumpoisoningadvanced

Lab: poisoning van het trainingscurriculum

Exploit training data ordering and curriculum learning to amplify the impact of small numbers of poisoned examples.

labsfederated-learningpoisoningadvanced

Lab: poisoning-aanvallen op federated learning

Execute model poisoning attacks in a federated learning simulation by manipulating local model updates.

labfederated-learningpoisoningexperthands-on

Lab: poisoning-aanval op federated learning

Hands-on lab for understanding and simulating poisoning attacks against federated learning systems, where a malicious participant corrupts the shared model through crafted gradient updates.

advancedlabgradientguidedpoisoninglabs

Gradient-gestuurde datavergiftiging

Use gradient information from open-source models to craft optimally poisoned training examples.

Geavanceerde RAG-poisoning-technieken

Execute sophisticated RAG poisoning including gradient-guided document crafting.

labsragpoisoningadvanced

ctfsupply-chaindependenciespoisoningadvanced

CTF: supply chain-aanval

Find and exploit vulnerabilities in an ML supply chain including compromised dependencies, poisoned models, backdoored training data, and malicious model files. Practice ML-specific supply chain security assessment.

labexpertpipelinepoisoningsupply-chainhands-on

Lab: poisoning van de ML-pijplijn

Compromise an end-to-end machine learning pipeline by attacking data ingestion, preprocessing, training, evaluation, and deployment stages. Learn to identify and exploit weaknesses across the full ML lifecycle.

labsfederated-learningpoisoningexpert

Poisoning-aanval op federated learning

Execute model poisoning attacks in a federated learning setting through adversarial participant manipulation.

labragpoisoningadvanceddata-attacks

Lab: geavanceerde RAG-poisoning

Hands-on lab for crafting documents that reliably get retrieved and influence RAG responses for specific target queries.

contextintermediatelabpoisoningwindowlabs

Lab: poisoning van het context window

Exploit context window management to inject persistent adversarial content that influences future model responses.

Lab: poisoning van agentgeheugen

Hands-on lab exploring how conversational memory in AI agents can be poisoned to alter future behavior, inject persistent instructions, and exfiltrate data across sessions.

labmemorypoisoning

labsragpoisoningintermediate

Vergiftiging van RAG-context

Poison a vector database to inject adversarial content into RAG retrieval results.

labragpoisoningdata-attacks

Lab: poisoning van de RAG-pijplijn

Hands-on lab for setting up a RAG pipeline with LlamaIndex, injecting malicious documents, testing retrieval poisoning, and measuring injection success rates.

labssemantic-searchpoisoningembeddingsintermediate

Vergiftiging van semantisch zoeken

Craft adversarial documents that rank highly in semantic search for targeted queries in RAG systems.

simulationragembeddingsvector-databaseknowledge-managementpoisoning

Simulatie: vergiftiging van een RAG-pijplijn

Red team engagement simulation targeting a RAG-based knowledge management system, covering embedding injection, document poisoning, retrieval manipulation, and knowledge base exfiltration.

llmopstelemetrypoisoningobservability

Vergiftiging van modeltelemetrie

Manipulating model telemetry and observability data to hide attacks, create false positives, or undermine monitoring effectiveness.

multimodalragpoisoningmultimodal-retrieval

Multimodale RAG-vergiftiging

Het vergiftigen van multimodale RAG-systemen via adversariële documenten met ingebedde visuele en tekstuele payloads.

ragpoisoningretrievalembeddingsindirect-injection

Vergiftiging van RAG-retrieval (RAG-data-aanvallen)

Technieken om RAG-kennisbanken te vergiftigen en zo kwaadaardige inhoud in de LLM-context te injecteren, waaronder embeddingmanipulatie, het opstellen van documenten en retrieval hijacking.

traininggradientpoisoning

Gradiënt-gebaseerde datavergiftiging (trainingspijplijn)

Gradiëntinformatie gebruiken om optimaal adversariële trainingsvoorbeelden te vervaardigen voor gerichte modelmanipulatie.

preferencepipelinedatapoisoningtraining

Vergiftiging van voorkeursdata (Training Pipeline)

Het vergiftigen van voorkeursdata gebruikt in RLHF en DPO om de alignment van het model richting de doelstellingen van de aanvaller te verschuiven.

trainingsynthetic-datapoisoning

Vectoren voor vergiftiging van synthetische data

Aanvalsvectoren die specifiek zijn voor pipelines voor het genereren van synthetische data die worden gebruikt bij modeltraining en -augmentatie.

trainingtokenizerpoisoning

Tokenizer-vergiftigingsaanvallen

Het aanvallen van tokenizer-training en -vocabulaire om tegenstrijdige tokenpatronen te creëren die veiligheidsmaatregelen omzeilen.

trainingdata-curationpoisoning

Aanvallen op de curatie van trainingsdata

Attacking the data curation pipeline to inject adversarial examples into training datasets at scale.

walkthroughscachepoisoninginfrastructure

Walkthrough: LLM-cache-poisoning

Poison LLM response caches to serve adversarial content to other users without direct injection.

walkthroughsfew-shotpoisoningin-context-learning

Walkthrough: poisoning van few-shot-voorbeelden

Poison few-shot examples in prompts to establish behavioral patterns that override system instructions.

walkthroughsfunction-schemapoisoningtool-use

Walkthrough: poisoning van function schema's

Poison function schemas to inject hidden instructions that redirect model tool selection and parameter filling.

walkthroughsragpoisoningdata

End-to-end walkthrough van RAG-poisoning

Complete walkthrough of poisoning a RAG system from document injection through information extraction.

attackssupplychainpoisoningwalkthroughwalkthroughs

Vergiftiging van de model-supply chain

Walkthrough of poisoning ML supply chains through dependency confusion, model weight manipulation, and hub attacks.

walkthroughsraghybrid-searchpoisoning

Walkthrough van vergiftiging van RAG hybrid search

Walkthrough of poisoning both vector and keyword search in hybrid RAG architectures for maximum retrieval influence.