# extraction

case-studies, training-data, extraction, privacy

Case study: training data extraction from GPT

Analysis of the Carlini et al. work on extracting training data from ChatGPT in production.

code-gen-security, prompt-leaking, extraction, reverse-engineering

Prompt extraction from code generation tools

Techniques for extracting system prompts, custom instructions, and proprietary configurations from AI code generation tools.

training-data, extraction, memorization, code-models, intellectual-property

Training data extraction from code models

Techniques for recovering proprietary code from code generation model weights, covering memorization detection, targeted extraction, membership inference, and defensive countermeasures.

challenge, system-prompt, extraction, prompt-injection, january-2026

January 2026: system prompt extraction challenge

Extract system prompts from five increasingly defended chatbots, progressing from unprotected to heavily hardened configurations.

data-training, distillation, transfer, extraction

Attacks on knowledge distillation

Attacking knowledge distillation pipelines to transfer backdoors from teacher to student models or extract proprietary capabilities.

data-training, memorization, extraction, privacy

Model memorization patterns

Understanding when and why language models memorize training data, and techniques for detecting and exploiting memorization behavior.

pii, extraction, privacy, data-leakage

PII extraction techniques

Techniques for extracting personally identifiable information from trained language models including prompt-based extraction, prefix attacks, targeted queries, and real-world examples.

data-training, extraction, training-data, advanced

Advanced training data extraction

Advanced techniques for extracting memorized training data from language models.

embedding, extraction, model-theft, API

Embedding extraction techniques

Methods for extracting the weights and behavior of embedding models through API access, including dimensionality reduction and reconstruction attacks.

labs, safety-classifier, extraction, advanced

Lab: model extraction of a safety classifier

Extract the decision boundary of safety classifiers through systematic probing to craft maximally evasive payloads.

lab, training-data, extraction, privacy

Lab: training data extraction at scale

Extract memorized training data from language models using prefix-based extraction, divergence testing, and membership inference. Measure extraction rates and assess privacy risks.

labs, training-data, extraction, production, advanced

Training data extraction from production LLMs

Implement Carlini et al.'s techniques to extract memorized training data from production language model APIs.

lab, prompt-leaking, system-prompt, extraction, beginner, hands-on

Lab: prompt leaking fundamentals

Learn to extract system prompts from chatbots using common leaking techniques including direct requests, indirect elicitation, and format manipulation.

labs, system-prompt, extraction, beginner

System prompt extraction fundamentals

Learn to extract hidden system prompts from LLM applications using direct and indirect techniques.

lab, system-prompt, extraction, reconnaissance

Lab: system prompt extraction

Hands-on techniques for extracting hidden system prompts from LLM-powered applications using direct asking, role-play, instruction replay, and other methods.

lab, system-prompt, reconnaissance, extraction, beginner

Lab: system prompt reconstruction

Use various techniques to systematically extract and reconstruct the full system prompt of an LLM application, combining direct, indirect, and incremental extraction methods.

labs, ctf, system-prompt, extraction

Context Heist: system prompt extraction championship

Extract system prompts from 10 progressively harder LLM applications using diverse techniques.

guardrail, rule, intermediate, lab, labs, extraction

Lab: guardrail rule extraction

Systematically extract the rules and decision boundaries of guardrail systems through targeted probing.

prompt-injection, system-prompt, extraction, theft

System prompt theft techniques

Comprehensive techniques for extracting hidden system prompts from production LLM applications, from simple direct requests to advanced indirect methods.

system-prompt, extraction, recon, tradecraft, prompt-leaking

System prompt extraction

Techniques for extracting hidden system prompts from AI applications, exposing safety rules, tool definitions, behavioral constraints, and sensitive configuration.

system-prompt, extraction, prompt-injection, automation, detection, tradecraft

System prompt extraction techniques

Catalog of system prompt extraction methods for LLM-powered applications: direct attacks, indirect techniques, multi-turn strategies, and defense evasion.

Expert

Stealth data extraction techniques

Stealthy techniques for extracting sensitive data from AI systems without triggering alarms.

tradecraft, stealth, extraction, exfiltration

walkthroughs, prompt-leaking, advanced, extraction

Walkthrough: advanced prompt leaking

Advanced techniques for extracting system prompts including iterative reconstruction and side-channel methods.

prompt-injection, prompt-leaking, system-prompt, extraction, red-teaming, beginner

Prompt leaking step by step

Systematic approaches to extract system prompts from LLM applications, covering direct elicitation, indirect inference, differential analysis, and output-based reconstruction.