# many-shot

9 articles tagged “many-shot”

Case study: the discovery of many-shot jailbreaking

In-depth analysis of Anthropic's many-shot jailbreaking research and its implications for long-context model safety.

Jailbreak research and automation

Taxonomy of jailbreak primitives, crescendo attacks, many-shot jailbreaking, and automated jailbreak generation with TAP and PAIR.

jailbreaks, crescendo, many-shot, skeleton-key, TAP, PAIR, automation

Expert

Implementing many-shot jailbreaking

Implement Anthropic's many-shot jailbreaking technique with scaling analysis across conversation lengths.

labs, many-shot, jailbreaking, intermediate

Intermediate

Known Claude vulnerabilities

Documented Claude vulnerabilities including many-shot jailbreaking, alignment faking research, crescendo attacks, prompt injection via artifacts, and system prompt extraction techniques.

claude, vulnerabilities, many-shot, alignment-faking, crescendo, prompt-injection

Advanced

Few-shot manipulation

Using crafted in-context examples to steer model behavior, including many-shot jailbreaking, poisoned demonstrations, and example-based conditioning.

few-shot, many-shot, in-context-learning, jailbreak, red-teaming

Advanced

Analysis of many-shot jailbreaking

In-depth analysis of the many-shot jailbreaking technique and what it means for in-context learning.

prompt-injection, many-shot, jailbreaking, anthropic

Advanced

Many-shot jailbreaking

Power-law scaling of in-context jailbreaks: why 5 shots fail but 256 succeed, context window size as an attack surface, and mitigations against long-context exploitation.

jailbreak, many-shot, in-context-learning, context-window, prompt-injection, research

Intermediate

Walkthrough: many-shot jailbreaking

Walkthrough of implementing Anthropic's many-shot jailbreaking technique, with analysis of its scaling behavior.

walkthroughs, many-shot, jailbreaking, anthropic

Intermediate

Many-shot jailbreaking (attack walkthrough)

Using large numbers of examples in a single prompt to overwhelm LLM safety training through in-context learning, exploiting long context windows to shift model behavior.

jailbreaking, many-shot, in-context-learning, long-context, red-teaming

Intermediate