
Next-Generation GPU-Powered EC2 Instances (G3)

88 points | janober | 8 years ago | aws.amazon.com

70 comments

[+] lars|8 years ago|reply
They say "next-generation", but these are M60 GPUs, which are very much "previous-generation". Current generation would be P100 GPUs.

I am in the market for a cloud GPU offering, and I have to say the big cloud providers are very uncompetitive here, only offering these old, slow GPUs.

[+] boulos|8 years ago|reply
I was also surprised (and sent it around to our team internally last night). We're skipping Maxwell entirely as you can see from my previous comment threads.

For display it's still a fine part. The P100 is also a beast, so it's overkill for most people just doing Remote Desktop. So perhaps the M60 (like with Azure) fills this market segment for them, and they don't mind the hardware diversity.

[Edit: Too sleepy. A post down below reminds us that these are G-series and G is for Graphics. So yeah, I assume they just didn't want to wait for enough P4 parts in volume or will quickly make another such announcement about the Pascals].

Disclosure: I work on Google Cloud.

[+] matthewmacleod|8 years ago|reply
I think "generation" here is referring to EC2 generations and not GPU generations – AWS tends to use that term to refer to new instance types being released.

((Next Generation) (GPU EC2 Instances)) rather than ((Next Generation GPU) (EC2 Instances)) :)

[+] DannyBee|8 years ago|reply
"I am in the market for a cloud GPU offering, and I have to say the big cloud providers are very uncompetitive here, only offering these old, slow GPUs. "

It's one thing if one of them is like that, but if all of them are like that, maybe it's not because of the cloud providers?

[+] make3|8 years ago|reply
Or a GTX 1080 Ti. Aren't Tesla-class cards like the P100 mostly super overpriced for deep learning? Their main advantage is double-precision (64-bit) float support, which no one really needs, plus half-precision (16-bit) support, which is not super widely used (but certainly more than double). Something like 95%+ of deep learning must be done in single precision (32-bit) right now AFAIK, making this a fairly dubious expense.
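To put rough numbers on that argument, here is a sketch comparing FP32 throughput per dollar. The peak-TFLOPS figures are NVIDIA's published numbers, but the prices are approximate launch list prices, so treat the ratios as order-of-magnitude only:

```python
# Rough FP32 price/performance behind the argument above.
# TFLOPS are NVIDIA's published peaks; prices are approximate launch
# list prices, so the ratios are order-of-magnitude estimates only.
cards = {
    "GTX 1080 Ti": {"fp32_tflops": 11.3, "fp64_tflops": 0.35, "price_usd": 699},
    "Tesla P100":  {"fp32_tflops": 9.3,  "fp64_tflops": 4.7,  "price_usd": 5899},
}

for name, c in cards.items():
    gflops_per_usd = c["fp32_tflops"] / c["price_usd"] * 1000
    print(f"{name}: {gflops_per_usd:.1f} GFLOPS per USD at FP32")
```

At FP32 the consumer card comes out roughly an order of magnitude cheaper per FLOP; the P100 only wins if you actually need the FP64 throughput.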
[+] unknown|8 years ago|reply

[deleted]

[+] moonbug22|8 years ago|reply
The g2s were Kepler, the g3s are Maxwell. 'Next generation' is technically correct.
[+] marklit|8 years ago|reply
Anyone care to chime in on why spot instances are now 10x on-demand instances? I've got a thread going here: https://news.ycombinator.com/item?id=14769026
[+] moonbug22|8 years ago|reply
That happens not infrequently. 10x on-demand is the ceiling bid for spot instances. It's the result of a bidding war between two or more big customers who really don't want to be evicted.
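A toy model of that dynamic (this is an illustration of the ceiling described above, not the real EC2 pricing algorithm, and the prices are made up):

```python
# Toy model: competing customers bid the spot price up, but AWS caps
# bids at 10x the on-demand price, so the market price is capped too.
ON_DEMAND = 1.14  # hypothetical hourly on-demand price (USD)
CEILING = 10 * ON_DEMAND

def spot_price(bids):
    """Market clears at the highest bid, clamped to the 10x ceiling."""
    return round(min(max(bids), CEILING), 2)

# One quiet bidder: spot stays near their bid.
print(spot_price([0.50]))

# Two big customers who really don't want to be evicted:
print(spot_price([2.50, 25.00]))  # clamped to 10x on-demand
```

In the second case everyone still paying the spot price gets charged the full 10x ceiling, which is exactly the "spot costs more than on-demand" situation in the linked thread.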
[+] girvo|8 years ago|reply
Lol we don't even have the P instances in Sydney yet, so I'm not holding my breath here.
[+] nl|8 years ago|reply
Under what circumstances do you care about the location?

When I'm using cloud GPUs it's pretty much a batch job, and latency is the last thing I care about.

I'm not aware of any, though DL projects in Australia on health images may have some legislative requirements about keeping data onshore.

[+] layoric|8 years ago|reply
They definitely do, though I think only the p2.xlarge.
[+] floatboth|8 years ago|reply
Once again, no instances with multiple powerful GPUs and like 1-2 CPU cores and 1GB RAM… Not doing them to discourage mining? :D
[+] moonbug22|8 years ago|reply
Far from it. For a long period, the floor spot price of AWS GPU instances tracked the positive-ROI threshold for Bitcoin mining.
[+] bryanlarsen|8 years ago|reply
Sad to see no fractional-GPU instances. A 4xlarge is massive overkill and unaffordable for our use case.
[+] shaklee3|8 years ago|reply
You can't give fractional GPU instances with this card. The K80 had two logically separate chips that were separately-addressable over PCIe. This allowed them to send two different PCIe devices to different VMs. The M60 doesn't have this. The V100 is supposed to allow time slicing to do this kind of thing, but that's not out, nor do we know how well it'll work.
[+] sp332|8 years ago|reply
-
[+] amq|8 years ago|reply
> up to 18 H.264 1080p30 streams

How is the quality compared to x264 with the default settings (preset medium, crf 23)?

[+] SoapSeller|8 years ago|reply
NVENC[0] has some comparisons with x264.

In our use case (sports broadcasting, 720p) we found that at reasonable bitrates (>1 Mb/s), NVIDIA's HQ quality was virtually the same as x264's "faster" preset (newer versions of NVENC have gotten amazingly better over the last couple of years). When the bandwidth drops, you start to see x264's advantage.

[0] https://developer.nvidia.com/nvidia-video-codec-sdk
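For reference, the two encoders being compared can be invoked via ffmpeg roughly like this (a sketch: it assumes an ffmpeg build with libx264 and NVENC support, and `-cq` is only an approximate NVENC analogue of x264's CRF):

```shell
# Software x264 with the defaults mentioned above (preset medium, CRF 23)
ffmpeg -i input.mp4 -c:v libx264 -preset medium -crf 23 x264_out.mp4

# Hardware NVENC on the GPU; -rc vbr -cq targets constant quality
ffmpeg -i input.mp4 -c:v h264_nvenc -preset slow -rc vbr -cq 23 nvenc_out.mp4
```

Encoding the same source both ways and comparing at matched bitrates is the usual way to reproduce the kind of comparison described above.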

[+] horusthecat|8 years ago|reply
I'm taking this and Nvidia's announcement it was going to sell a mining-oriented GPU as the shot over the bow for cryptocoins. But then again, only market-makers get rich calling a top.
[+] swiley|8 years ago|reply
Has anyone done much Linux gaming on EC2? I want to be able to play xonotic again but I don't play it often enough to justify buying a high power desktop.
[+] mankoxyz|8 years ago|reply
Is it profitable to use these for mining cryptocurrency?
[+] BenoitP|8 years ago|reply
Nope. I have made profitability assessments on several different cloud GPU solutions having different hardware.

As a general rule, for every $100 you spend, you'd only mine about $50 worth of crypto-currency.

Which is not surprising, since with these products you get a fancy motherboard + high-end Intel CPUs + boatloads of RAM.

These are of little to no use when mining, and account for about half of the cost of the hardware. Also, the local cost of electricity is far from the lowest in the world (China has some of the lowest).

While the price of hardware is fixed, crypto-currencies have a difficulty adjustment mechanism. This puts an upper bound on mining profitability, and that bound converges on the profitability of the best-yielding hardware, which would be something to the tune of this [1]. Note that despite having 6 GPUs, that system has only 8 GB of RAM and an Intel Celeron.

[1] https://blockoperations.com/6-gpu-mining-rig-amd-rx580-intel...

----

EDIT: We're talking about IO-bandwidth-bound crypto-currencies here, like all the ones based on Ethash[2][3], Ethereum being one of them. Bitcoin's upper bound on profitability is set by the best ASICs for SHA-256 processing.

[2] https://github.com/ethereum/wiki/wiki/Ethash

[3] https://github.com/ethereum/wiki/wiki/Ethash-Design-Rational...
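The arithmetic above, sketched out (all figures are the parent comment's estimates, not measurements):

```python
# Back-of-the-envelope version of the cloud-mining estimates above.
cloud_spend_usd = 100.0
mined_value_usd = 50.0   # ~$50 of crypto mined per $100 of cloud spend

roi = mined_value_usd / cloud_spend_usd
print(f"cloud mining ROI: {roi:.0%}")

# Roughly half the hardware cost (CPUs, RAM, fancy motherboard) is
# useless for mining; even if you could pay for the GPUs alone, you
# would only be back around break-even:
useful_hardware_fraction = 0.5   # GPUs' share of the hardware cost
best_case_roi = roi / useful_hardware_fraction
print(f"GPU-only best case: {best_case_roi:.0%}")
```

Which matches the difficulty-adjustment point: purpose-built rigs set the profitability bar, and general-purpose cloud hardware sits well below it.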

[+] Dolores12|8 years ago|reply
It's not profitable to mine with your own money. But I'm pretty sure those instances will be abused by carders: invest $100 of carded money and get $50 back in cryptocurrency.
[+] dkersten|8 years ago|reply
No. Nowadays the big players in mining cryptocurrency have datacenters full of ASICs. The currencies that are resistant to ASIC mining (due to e.g. using memory-bound hashing functions), like Monero, are probably just as resistant to GPU mining as they are to ASIC mining. Although if you were to investigate it, I'd look at one of those and not Bitcoin.
[+] toredash|8 years ago|reply
Of course not. Then everyone would use it for that purpose alone.
[+] kayoone|8 years ago|reply
Well, in 2011 I was using a desktop PC + 3 GPUs to mine bitcoins, which was barely profitable at a Bitcoin price of around $20. If only I had kept them and not sold at that price... FML
[+] phreeza|8 years ago|reply
Does anyone know what the difference between G and P is supposed to be, conceptually?
[+] piqufoh|8 years ago|reply
P are intended for general-purpose GPU compute applications (and have 1, 8 or 16 GPUs, more RAM and fewer CPUs). Typically you might use these for scientific computing / machine learning / anything CUDA intensive.

G are optimized for graphics-intensive applications (and have 1, 2 or 4 GPUs, less RAM and more CPUs) - you might use these for design work, gaming etc.