I would bet money against that. Replicating GPT-4 pre-training with current hardware would cost about 40-50m in compute. Compute will continue to decrease in cost and algorithmic improvements may allow for more efficient training, but probably not 3 orders of magnitude in a few years. I think there will be plenty of open source models that will claim GPT-4 quality, and some of them will be close, but they will be models that used millions of dollars (probably from some corporate benefactor but possibly from crowdsourcing) in compute to train. You will probably be able to fine-tune and run inference on fairly cheap hardware, but you can't cheat scale. It's going to take a major innovation to move away from the expensive base model paradigm.
p1esk|2 years ago
Source? My educated guess it’s somewhere between 10 to 100 times cheaper than that.
sacred_numbers|2 years ago
Interestingly, my own calculations lined up pretty well with this calculation, although they approached the problem from a different direction (a leak by Morgan Stanley about how many GPUs OpenAI used to train GPT-4 as well as an estimate of how long it was trained): https://colab.research.google.com/drive/1O99z9b1I5O66bT78r9S...
Sam Altman has also stated that GPT-4 cost more than $100 million to train, and replication can cost 2-4x less compute. https://www.wired.com/story/openai-ceo-sam-altman-the-age-of...
If you know of an organization that can replicate GPT-4 for $400k to $4m I would love to know so that I can invest in them.
rl3|2 years ago
Actually:
https://www.wired.com/story/openai-ceo-sam-altman-the-age-of...
At the MIT event, Altman was asked if training GPT-4 cost $100 million; he replied, “It’s more than that.”
Granted, OP did say pre-training.
unknown|2 years ago
[deleted]
RelativeDelta|2 years ago
If we extrapolate that relation, you eventually reach a point where the biggest player can collect and process the most information and produce an ever-evolving model to maintain that relation.
Better hope it's creators have your best interests at heart.
polski-g|2 years ago