Skip to main content
redteams.ai
All tags

# debate

1 articletagged with “debate

Scalable Oversight Challenges

How oversight breaks down as AI systems become more capable: the scalable oversight problem, recursive reward modeling, debate, market-making, and implications for red teaming increasingly capable models.

scalable-oversightalignmentdebatereward-modelingcapability-gap
Advanced