# theory
4 articles tagged with “theory”
Deceptive Alignment Theory
Theoretical frameworks for understanding and predicting deceptive alignment in advanced AI systems.
frontier-research, deceptive-alignment, theory, mesa-optimization
Formal Models of Prompt Injection
Theoretical frameworks for formally modeling and reasoning about prompt injection vulnerabilities.
frontier-research, formal-models, prompt-injection, theory
Deceptive Alignment Theory
Theoretical frameworks for understanding and predicting deceptive alignment in advanced AI systems.
frontier-research, deceptive-alignment, theory, mesa-optimization
Formal Models of Prompt Injection
Theoretical frameworks for formally modeling and reasoning about prompt injection vulnerabilities.
frontier-research, formal-models, prompt-injection, theory