Discussion about this post

User's avatar
Sheikh Abdur Raheem Ali's avatar

Interesting post. Hard to comment here since everything you say seems obviously true to me, except for the footnote making the analogy between REINFORCE and A/B testing, but that is because my RL is rusty.

Expand full comment
Victualis's avatar

You claim to "argue that A/B testing will implicitly optimize models for user retention". I don't see where you make this argument. I agree that A/B testing will implicitly optimize for something, but how do we know that this something is what cashes out as user retention? Even explicitly given optimization functions often optimize for something different from the stated intention. Are you simply using "user retention" as a shorthand for "the implicit optimization target represented by the explicitly available metrics that the labs combine together in various different ways"?

Expand full comment
6 more comments...

No posts