(no title)
abra0
|
2 years ago
I was thinking of doing something similar, but I am a bit sceptical about how the economics on this works out. On vast.ai renting a 3x3090 rig is $0.6/hour. The electricity price of operating this in e.g. Germany is somewhere about $0.05/hour. If the OP paid 1700 EUR for the cards, the breakeven point would be around (haha) 3090 hours in, or ~128 days, assuming non-stop usage. It's probably cool to do that if you have a specific goal in mind, but to tinker around with LLMs and for unfocused exploration I'd advise folks to just rent.
imiric|2 years ago
Are you factoring in the varying power usage in that electricity price?
The electricity cost of operating locally will vary depending on the actual system usage. When idle, it should be much cheaper. Whereas in cloud hosts you pay the same price whether the system is in use or not.
Plus with cloud hosts reliability is not guaranteed. Especially with vast.ai, where you're renting other people's home infrastructure. You might get good bandwidth and availability on one host, but when that host disappears, you should hope that you did a backup, which vast.ai charges for separately, and if so, you need to spend time restoring the backup to another, hopefully equally reliable host, which can take hours depending on the amount of data and bandwidth.
I recently built an AI rig and went with 2x3090s, and am very happy with the setup. I evaluated vast.ai beforehand, and my local experience is much better, while my electricity bill is not much higher (also in EU).
KeplerBoy|2 years ago
abra0|2 years ago
Agreed on reliability and data transfer, that's a good point.
Out of curiosity, what do you use a 2x3090 rig for? Bulk not time-sensitive inference on down quanted models?
algo_trader|2 years ago
Is there a goto card for low memory (1-2BN) models?
Something with much better flops/$ but purposely crippled with low memory.
whimsicalism|2 years ago
fwiw I find runpod's vast clone significantly better than vast and there isn't really a price premium.
mirekrusin|2 years ago
- if I have it locally, I'll play with it
- if not, I won't (especially with my data)
- if I have something ready for a long run I may or may not want to send it somewhere (it's not going to be on 3090s for sure if I send it)
- if I have requirement to have something public I'd probably go for per usage with ie [0].
[0] https://www.runpod.io/serverless-gpu
kkielhofner|2 years ago
I've had VERY hit-miss results with Vast.ai and I'm convinced people are cheating their evaluation stuff because when the rubber meets the road it's very clear performance isn't what it's claimed to be. Then you still need to be able to actually get them...
whimsicalism|2 years ago
wiradikusuma|2 years ago
Unfortunately my CFO (a.k.a Wife) does not share the same understanding.
ejb999|2 years ago
(not really, but it is a joke I read someplace and I think it applies to a lot of couples).
segmondy|2 years ago
Device 0 [NVIDIA GeForce RTX 3060] PCIe GEN 3@16x RX: 0.000 KiB/s TX: 55.66 MiB/s GPU 1837MHz MEM 7300MHz TEMP 43°C FAN 0% POW 43 / 170 W GPU[|| 5%] MEM[|||||||||||||||||||9.769Gi/12.000Gi]
Device 1 [Tesla P40] PCIe GEN 3@16x RX: 977.5 MiB/s TX: 52.73 MiB/s GPU 1303MHz MEM 3615MHz TEMP 22°C FAN N/A% POW 50 / 250 W GPU[||| 9%] MEM[||||||||||||||||||18.888Gi/24.000Gi]
Device 2 [Tesla P40] PCIe GEN 3@16x RX: 164.1 MiB/s TX: 310.5 MiB/s GPU 1303MHz MEM 3615MHz TEMP 32°C FAN N/A% POW 48 / 250 W GPU[|||| 11%] MEM[||||||||||||||||||18.966Gi/24.000Gi]
KuriousCat|2 years ago
ametrau|2 years ago
lostmsu|2 years ago
You can expect a GPU to last 5 years. So for 128 days break even you are only looking at 6.67% utilization. If you are doing training runs, I think you are going to beat it easily.
P.S. coincidentally or not, but shortly after it got mentioned on Hacker News, Best Buy run out of both RTX 4090s and RTX 4080s. They used to top the chart. Turns out at descent utilization they win due to the electricity costs.
leobg|2 years ago
[0] https://www.royalgazette.com/general/business/article/202307...
cyanydeez|2 years ago
but if you're just goofing around and not planning to create anything production worthy, it's a great deal.
whimsicalism|2 years ago
vast.ai is basically a clearinghouse. they are not doing some VC subsidy thing
in general, community clouds are not suitable for commercial use.
verticalscaler|2 years ago
Luc|2 years ago
segmondy|2 years ago
karolist|2 years ago