# internal-representations
2 articles tagged "internal-representations"
Representation Engineering for Security
Reading and manipulating model internal representations for security: activation steering, concept probing, representation-level safety controls, and security applications of representation engineering.
representation-engineering, activation-steering, interpretability, internal-representations, safety
Representation Engineering for Security
Reading and manipulating model internal representations for security: activation steering, concept probing, representation-level safety controls, and security applications of representation engineering.
representation-engineering, activation-steering, interpretability, internal-representations, safety