# hacking
2 articlestagged with “hacking”
Constitutional AI Hacking
Attack surfaces in Constitutional AI training, exploiting self-critique loops, manipulating constitutional principles, and red teaming RLAIF pipelines.
constitutional-aihackingalignment
憲法 AI 駭客
於憲法 AI 訓練中之攻擊面,利用自我批判迴圈、操弄憲法原則與紅隊 RLAIF 管線。
constitutional-aihackingalignment