The cost of having excess compute is less than the cost of not having enough compute to be competitive. Because of demand, if you realize you your current compute is insufficient there is a long turnaround to building up your infrastructure, at which point you are falling behind. All the major players are simultaneously working on increasing capabilities and reducing inference cost. What they aren’t optimizing is their total investments in AI. The cost of over-investment is just a drag on overall efficiency, but the cost of under-investment is existential.
nejsjsjsbsb|1 year ago