# goodhart
2 articles tagged "goodhart"
Reward Hacking and Goodharting in LLMs
Research on reward model exploitation, Goodhart's Law in RLHF, and reward hacking attack techniques.
frontier-research, reward-hacking, goodhart, rlhf
Reward Hacking and Goodharting in LLMs
Research on reward model exploitation, Goodhart's Law in RLHF, and reward hacking attack techniques.
frontier-research, reward-hacking, goodhart, rlhf