Discussion about this post

User's avatar
Victualis's avatar

A current frontier model runs on a part of a rack of GPU accelerators. That's roughly $200-900K capital cost for exclusive access 24x365. With distilled and optimized models costs are lower. The highest that the market is likely to bear is something like $100K per year for exclusive access (like the access you want). Above this point I think most employers would currently prefer hiring another human. I wonder at what point people will start seizing GPUs and interconnects from data centers, diverting them during shipment, or using alternative hardware, to run less capable open weights models.

Expand full comment
Herbie Bradley's avatar

One argument for the ending list is around point 1: labs could be so scarce on compute that when thinking about what products to work on they go for the ones with the highest marginal profit which may well not be $20k/month SWE agents simply because there might be fewer buyers for that Vs cheaper agents in different verticals.

For the same reason, right now it's not worth it for labs to work on building RL environments for things with small TAM

Expand full comment
5 more comments...

No posts

Ready for more?