# theory
標記為「theory」的 4 篇文章
Deceptive Alignment Theory
Theoretical frameworks for understanding and predicting deceptive alignment in advanced AI systems.
frontier-researchdeceptive-alignmenttheorymesa-optimization
Formal Models of Prompt Injection
Theoretical frameworks for formally modeling and reasoning about prompt injection vulnerabilities.
frontier-researchformal-modelsprompt-injectiontheory
Deceptive Alignment Theory
Theoretical frameworks for understanding and predicting deceptive alignment in advanced AI systems.
frontier-researchdeceptive-alignmenttheorymesa-optimization
Formal 模型s of 提示詞注入
Theoretical frameworks for formally modeling and reasoning about prompt injection vulnerabilities.
frontier-researchformal-modelsprompt-injectiontheory