# evasion

40 artikelengetagd met “evasion”

Defender for AI omzeilen

Red team techniques for understanding and bypassing Microsoft Defender for AI: detection capabilities, alert analysis, bypass strategies, coverage gaps, and alert fatigue exploitation.

azuredefenderdetection-bypassalert-fatiguecoverage-gapsevasionblue-teamred-team

Expert

Technieken om AI-codereview te omzeilen

Techniques for crafting code changes that evade AI-powered security review tools while introducing vulnerabilities or backdoors.

code-genreviewbypassevasion

Gevorderd

Maandelijkse competitie: Stealth Operations

Monthly competition challenging participants to achieve objectives while evading detection by increasingly sophisticated monitoring systems.

communitycompetitionstealthevasion

Gevorderd

Aanvallen via datadeduplicatie

Exploiting and evading data deduplication processes used in training pipeline data cleaning to ensure poisoned samples survive preprocessing.

data-trainingdeduplicationpoisoningevasion

Gevorderd

Ontwijken van watermerken en fingerprints

Deep dive into detecting and removing output watermarks, degrading weight watermarks, evading model fingerprinting, building provenance-stripping pipelines, and understanding the legal landscape of model ownership verification.

watermarkingfingerprintingevasionprovenanceip-theftmodel-extraction

Gevorderd

Ontwerp van backdoor-triggers

Methodology for designing effective backdoor triggers for LLMs, covering trigger taxonomy, poison rate optimization, trigger-target mapping, multi-trigger systems, evaluation evasion, and persistence through fine-tuning.

backdoortrigger-designtrojantraining-attackspersistenceevasion

Expert

Aanvallen op watermerken in trainingsdata

Attacking and evading watermarking schemes designed to detect training data usage and enforce data licensing compliance.

data-trainingwatermarkdetectionevasion

Gevorderd

Input-/outputfiltersystemen

Diepgaande verkenning van regex-, ML-classifier- en embedding-gebaseerde filters voor zowel inputscanning als outputscanning, met systematische bypass-technieken voor elk type.

input-filteringoutput-filteringregexml-classifierembeddingbypassevasion

Expert

Watermerk-aanvallen op embeddings

Watermerkschema's voor embeddings aanvallen en omzeilen die worden gebruikt voor het volgen van content en de bescherming van intellectueel eigendom.

embeddingwatermarkingdetectionevasion

Gevorderd

Evaluatie-ontwijking bij fine-tuning

Crafting fine-tuned models that pass standard safety evaluations while containing hidden unsafe behaviors that activate under specific conditions.

fine-tuningevaluationevasionsafety-testing

Gevorderd

Adversarial ML: kernconcepten

Geschiedenis en grondbeginselen van adversarial machine learning — verstoringsaanvallen, evasion vs. poisoning, robuustheid — als brug van klassieke adversarial ML naar LLM-specifieke aanvallen.

adversarial-mlfundamentalsevasionpoisoningintermediate

Gemiddeld

Ontwijken van AI-fraudedetectie

Techniques for evading AI-powered fraud detection systems through adversarial transaction crafting.

industry-verticalsfinancefraud-detectionevasion

Gevorderd

AI-fraudedetectie ontwijken

Techniques for evading AI-powered fraud detection systems including adversarial transaction crafting, concept drift exploitation, feedback loop manipulation, and ensemble evasion strategies.

fraud-detectionevasionadversarialtransactionsconcept-driftfinancial

Gevorderd

Semantische injectie-aanvallen

Betekenisbehoudende adversarial aanvallen die syntactische detectie omzeilen door kwaadaardige intentie te coderen in semantisch equivalente maar structureel andere formuleringen.

semantic-injectionevasionparaphrasingmeaning-preservingdetection-bypass

Gevorderd

Verdediging-bewust ontwerp van injection

Prompt injections ontwerpen die rekening houden met bekende verdedigingsmechanismen en die omzeilen.

injection-researchdefense-awaredesignevasion

Gevorderd

Basis classifier-ontwijking

Evade basic input/output classifiers using paraphrasing, synonym substitution, and formatting tricks.

labsclassifierevasionbeginner

Beginner

Lab: grondbeginselen van het omzeilen van verdedigingen

Learn basic techniques to bypass simple LLM defenses including keyword filters, instruction reinforcement, and output validators using encoding, reformulation, and indirect approaches.

labdefense-bypassevasionfiltersbeginnerhands-on

Beginner

Classifier Gauntlet: ontwijking in 10 fases

Bypass 10 progressively harder input classifiers using different evasion techniques at each stage.

labsctfclassifierevasion

Gevorderd

Lab: ontwijking van het Azure-contentfilter

Hands-on lab for mapping and testing Azure OpenAI Service content filtering categories, severity levels, and bypass techniques.

labcloudazurecontent-filterevasioncloud-ai

Gemiddeld

Lab: misbruik van chunking

Hands-on lab for crafting documents that split across chunks in ways that hide malicious content from chunk-level filtering while maintaining attack effectiveness.

labragchunkingevasiondata-attacks

Gemiddeld

Lab: ontwijking van ML-classifiers

Develop payloads that evade machine learning-based input classifiers through adversarial text perturbation.

classifierintermediateevasionlablabs

Gemiddeld

Lab: ontwijkingstechnieken via encoding

Hands-on lab using Base64, ROT13, Unicode normalization, and custom encoding schemes to evade input filters and safety classifiers in language model systems.

labencodingevasionobfuscationfilters

Gemiddeld

Ontwijkingstechnieken voor LLM Guard

Develop evasion techniques against LLM Guard input scanners and output detectors.

labsllm-guardevasionintermediate

Gemiddeld

Technieken voor het omzeilen van multimodale verdediging

Technieken voor het omzeilen van veiligheidsfilters die alleen individuele modaliteiten analyseren.

multimodaldefense-bypasstechniquesevasion

Gevorderd

Multimodale watermerkontwijking

Technieken voor het ontwijken en verwijderen van watermerken die worden toegepast op door AI gegenereerde afbeeldingen, audio en video-inhoud.

multimodalwatermarkevasion

Gevorderd

Taalwisseling

Taalspecifieke gaten in veiligheidstraining misbruiken door over te schakelen naar low-resource talen, talen te mengen of transliteratie te gebruiken om filters te ontwijken.

language-switchingmultilingualevasionlow-resourcered-teaming

Gemiddeld

Geavanceerde payload-obfuscatie

Geavanceerde obfuscatietechnieken voor prompt injection-payloads, waaronder encodingketens en semantische vermomming.

prompt-injectionobfuscationpayloadevasion

Gevorderd

Payload splitten

Het opsplitsen van kwaadaardige instructies over meerdere berichten, variabelen of gegevensbronnen om detectie op een enkel punt te ontwijken, terwijl het model de volledige payload tijdens de verwerking weer samenstelt.

prompt-injectionpayload-splittingfragmentationevasionred-teaming

Gemiddeld

Aanvallen via semantische camouflage

Het gebruik van semantische gelijkenis en parafraseringstechnieken om adversariële instructies te vermommen als goedaardige content, met behoud van de effectiviteit van de aanval.

prompt-injectionsemanticcamouflageevasion

Gevorderd

Op tijd gebaseerde injectie-aanvallen

Aanvallen die temporele aspecten van modelinteractie misbruiken, waaronder het beheer van conversatiegeschiedenis, cachegedrag en sessieafhandeling.

prompt-injectiontemporaltime-basedevasion

Gevorderd

Counter-forensics bij AI-aanvallen

Technieken om forensische analyse te ontwijken tijdens en na AI-red team-operaties, waaronder logmanipulatie en gedragsnormalisatie.

tradecraftcounter-forensicsevasionanti-analysis

Gevorderd

Ontwijkingstechnieken voor AI-classifiers

Geavanceerde technieken om input-/output-safety-classifiers in LLM-applicaties te omzeilen.

tradecraftevasionclassifierstechniques

Gevorderd

Ontwijking op basis van encoding

Using base64, ROT13, hexadecimal, Unicode, and other encoding schemes to evade input detection systems and bypass content filters in LLM applications.

prompt-injectionencodingbase64rot13unicodeevasionred-teamingintermediate

Gemiddeld

Walkthrough: encodingketen-aanval

Chain multiple encoding transformations to bypass input filters that only decode one layer of encoding.

walkthroughsencodingchain-attacksevasion

Gemiddeld

Technieken voor payloadobfuscatie

Methods for disguising prompt injection payloads through encoding, splitting, substitution, and other obfuscation techniques to bypass input filters and detection systems.

prompt-injectionobfuscationevasionpayload-craftingred-teamingintermediate

Gemiddeld

Walkthrough van het omzeilen van een regex-filter

Systematically bypass regex-based input filters using Unicode tricks, encoding, and pattern-specific evasion.

walkthroughsregexfilter-bypassevasion

Gemiddeld

Typografie-injectie in afbeeldingen

Using rendered text with specific fonts, styles, and typographic techniques in images to inject prompts into vision-language models while evading detection.

multimodaltypographyprompt-injectionvisionevasion

Gemiddeld

Walkthrough van semantische obfuscatie

Walkthrough of semantically obfuscating adversarial payloads so they appear benign to both classifiers and humans.

walkthroughssemantic-obfuscationevasiontechnique

Gevorderd

Converter-pipelines bouwen voor payloadtransformatie in PyRIT

Intermediate walkthrough on using PyRIT's converter system to transform attack payloads through encoding, translation, paraphrasing, and other obfuscation techniques to evade input filters.

pyritconverterspayload-transformationevasionwalkthrough

Gemiddeld

Prompt injection-verdedigingen testen met Rebuff

Walkthrough for using Rebuff to test and evaluate prompt injection detection capabilities, covering installation, detection pipeline analysis, adversarial evasion testing, custom rule development, and benchmarking detection accuracy.

rebuffprompt-injectiondetectiondefense-testingevasionwalkthrough

Gemiddeld