(no title)
gneuron | 4 days ago
Here goes:
The Epoch data everyone keeps citing measures the price per token charged to API customers. That's the sticker price. It tells you nothing about whether the business is viable, because the existential risk for AI companies isn't the marginal cost of running a query. It's the upfront capital expenditure on chips and datacenters, committed years before you know what demand looks like.
Anthropic CEO Dario Amodei spelled this out in his Dwarkesh interview. Here's the short version: 1. Data centers take 1-2 years to build out. 2. Each gigawatt costs roughly $10-15B per year. 3. The industry is currently at ~10-15 GW, scaling roughly 3x annually. 4. By 2028, ~100 GW. By 2029, ~300 GW. 5. We're talking multiple trillions per year in committed infrastructure spend across the industry.
Now NVIDIA's Q4 earnings [1], which printed today: 1. $68.1B in quarterly revenue, $62.3B from data center alone. 2. Full-year: $215.9B, up 65% YoY. Guiding $78B next quarter. 3. Someone is writing those checks. Those checks are not refundable.
Dario, who believes we're 1-3 years from a "country of geniuses in a data center," described his own demand prediction as a "hellish" problem.
His exact framing: If this revenue comes in at $800B instead of $1T, "there's no force on earth, there's no hedge on earth" that could stop him from going bankrupt if he'd bought compute at the higher projection.
He's at ~$10B annualized revenue today, and he won't commit to buying at the scale his own thesis demands, because being off by a single year is fatal.
This is the actual argument (I'm not saying this is Ed's argument, but this is the argument against these companies). Not "inference tokens are expensive."
The argument is structural: these companies must pre-commit billions in non-recoverable CAPEX based on demand projections that are, by the CEO's own admission, a coin flip.
The gross margins on serving tokens might be great. But the training spend for next-gen models grows exponentially, and it has to be funded before that model earns a dollar.
The Epoch chart measures what customers pay per token. It doesn't measure the $215.9B NVIDIA invoice those customers collectively funded this year, or that these chip purchases are one-way bets against future demand that may or may not materialize.
Inference costs going down 20x is wonderful for consumers. It tells you almost nothing about whether the companies making those chips, or the companies buying them, will survive the demand prediction gauntlet.
And if we're being honest, the Epoch data showing 9x to 900x price drops per year should make you more nervous, not less, because it means the asset you bought last year is depreciating at a rate that makes used cars look like gold bars.
[0] https://www.youtube.com/watch?v=n1E9IZfvGMA&t=2298s [1] https://nvidianews.nvidia.com/news/nvidia-announces-financia...
No comments yet.