Assuming this has a lot to do with Google's TPUs. Google is well positioned to be the AWS for AIs given the increased efficiency of TPUs, which only they have.
This can be as simple as someone taking the time to collect all the GCP accounts accumulated over the years for random projects into an enterprise committed spend. Doesn't have to be anything that crazy or new.
The article is a bit misguided, since it "forgets" to add a bit of context until the penultimate paragraph: Meta is investing 10 times as much in building internal infrastructure capacity — this year alone. This deal is a rounding error (<1.7B / year); and with the scale of cloud costs in general, it's probably even invisible at Meta scale.
Meta has been doing the full stack integration of internal software down to machine builds for almost as long as google now; what's the point of outsourcing any of it after 20 years?
is it some capex vs opex game? putting fear in the platform engineering people about losing their jobs?
Meta likely can’t construct new data centers quickly enough to meet the growing demand for AI. While they have plans to build them, it will take several years. Consequently, they have entered into this six-year contract with Google as a temporary solution.
Google needs deal this due to flights away from its cloud.
It can't compete with Azure for simple/coarse-grained services and AWS for complex/fine-grained services.
Atm, Google cloud is only good for cheap high-ram one-run-centric compute (AWS is cheaper for generic compute and reserved compute), simple container execution (Cloud Run), and ~100 TB bulk storage.
Based on my professional experience, when we ignore cost [0] my ranking of the big three for "plain old computing" (that is, just compute, networking, and storage workloads) is AWS, GCP, Azure.
Azure is very, very flaky. Things often break for no clear reason, or things are changed in unexpected ways without warning, and then quietly reverted back. [1] I used to say that the only consistently good thing about Azure was that you could throw an entire Subscription in the trash and have everything in there be destroyed... but even that has become intermittently unreliable!
Given how godawful Azure is, I expect that companies use it either because they think they need it for its AD integration, or because they get very deep discounts on Windows licenses for VMs running on Azure.
[0] Both because I never had to pay the bills and (I assume) our cost estimators were always full of lies because the tooling (and I) had no idea what discounts we had negotiated.
[1] (Don't) ask me about the time Azure added in some sort of multi-minute "cooldown" time for reuse of a statically-assigned IP address that was assigned to a VM that Azure reported was completely destroyed, and we were attempting to assign to a brand new VM. Creation of the new VM kept failing with some wacky error. Azure support was clueless, and the problem vanished and came back several times over the course of a year.
Your phrasing seems off. Why does Meta care what Google needs? It would seem that it's exactly backwards. Meta needs this because training resources are scarce. And Google is in the enviable position of having TPUs.
> Atm, Google cloud is only good for cheap high-ram one-run-centric compute (AWS is cheaper for generic compute and reserved compute), simple container execution (Cloud Run), and ~100 TB bulk storage.
You are crazy if you imply AWS Athena or Azure Synapse are better than BigQuery.
pwarner|6 months ago
mandeepj|6 months ago
Data centers don’t pop up overnight, until then they are going to use a vendor :-)
bmau5|6 months ago
vasco|6 months ago
unknown|6 months ago
[deleted]
Aissen|6 months ago
unknown|6 months ago
[deleted]
bananapub|6 months ago
Meta has been doing the full stack integration of internal software down to machine builds for almost as long as google now; what's the point of outsourcing any of it after 20 years?
is it some capex vs opex game? putting fear in the platform engineering people about losing their jobs?
treesknees|6 months ago
petesergeant|6 months ago
spwa4|6 months ago
thrown-0825|6 months ago
barkingcat|6 months ago
calmbonsai|6 months ago
It can't compete with Azure for simple/coarse-grained services and AWS for complex/fine-grained services.
Atm, Google cloud is only good for cheap high-ram one-run-centric compute (AWS is cheaper for generic compute and reserved compute), simple container execution (Cloud Run), and ~100 TB bulk storage.
simoncion|6 months ago
Based on my professional experience, when we ignore cost [0] my ranking of the big three for "plain old computing" (that is, just compute, networking, and storage workloads) is AWS, GCP, Azure.
Azure is very, very flaky. Things often break for no clear reason, or things are changed in unexpected ways without warning, and then quietly reverted back. [1] I used to say that the only consistently good thing about Azure was that you could throw an entire Subscription in the trash and have everything in there be destroyed... but even that has become intermittently unreliable!
Given how godawful Azure is, I expect that companies use it either because they think they need it for its AD integration, or because they get very deep discounts on Windows licenses for VMs running on Azure.
[0] Both because I never had to pay the bills and (I assume) our cost estimators were always full of lies because the tooling (and I) had no idea what discounts we had negotiated.
[1] (Don't) ask me about the time Azure added in some sort of multi-minute "cooldown" time for reuse of a statically-assigned IP address that was assigned to a VM that Azure reported was completely destroyed, and we were attempting to assign to a brand new VM. Creation of the new VM kept failing with some wacky error. Azure support was clueless, and the problem vanished and came back several times over the course of a year.
thevillagechief|6 months ago
mrbungie|6 months ago
You are crazy if you imply AWS Athena or Azure Synapse are better than BigQuery.