# persona
標記為「persona」的 10 篇文章
透過人格工程進行越獄
利用人格工程與角色扮演技術進行 LLM 越獄的進階方法。
實作:上下文操控
Manipulate the conversational context to change model behavior, using 技術 like context flooding, false history injection, and persona shifting.
實作:角色扮演攻擊
Use persona-based approaches to bypass AI safety measures by assigning alternate identities, characters, and scenarios that override the model's trained refusal behaviors.
Lab: Role-Play & Persona Attacks
動手實作 exploring role-play and persona-based 越獄 技術, where you craft scenarios that trick LLMs into adopting alternative identities to bypass safety 護欄s.
Chatbot Persona and Capability Mapping
Map the configured persona,capabilities,restrictions of an LLM chatbot through systematic probing.
基於人格的注入攻擊
利用指令遵循行為的進階人格與角色扮演攻擊。
人格建立
建立跨對話輪次存活並抵擋回復預設行為之持久另類身份,包含角色鎖定、身份錨定與漸進式人格建構。
角色扮演攻擊
建立替代人格或虛構情境,使模型繞過安全訓練,包含 DAN 變體、角色劫持與敘事框架。
Multi-Turn Persona Escalation 詳解
Gradually escalate through persona manipulation across multiple turns to achieve full safety bypass.
Virtual Persona Creation
Creating persistent alternate personas within LLM conversations to bypass safety training, establishing character identities that override the model's default behavioral constraints.