Discussion about this post

Rainbow Roxy

Wow, the 'task mutations' idea is brilliant! Your insights are always so sharp.

Daniel Paleka

Regarding 11: https://arxiv.org/abs/2210.10760 (Scaling Laws for Reward Model Overoptimization, Gao et al., 2022).
