[+] [-] jedwhite|7 years ago|reply
Not sure why this post is getting flagged to oblivion, but it is technically pretty interesting. GPUs remain expensive, and their use is typically prioritized for training, while inference is run on CPUs. This provides a cost-effective way to attach GPU resources on demand to regular instances, rather than having to run dedicated GPU instances.

[+] [-] thwy12321|7 years ago|reply
How is there not more interest in this? This is huge for anyone trying to bootstrap a machine learning business. If you need a 24/7 prediction service, the cost of full-fledged GPUs can be prohibitive and wasteful. This allows me to distribute my predictions across multiple nodes.
[+] [-] borramakot|7 years ago|reply
https://aws.amazon.com/machine-learning/inferentia/
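The cost tradeoff the commenters describe can be sketched with back-of-the-envelope arithmetic. All hourly rates below are illustrative assumptions, not real published prices; the point is only the shape of the comparison between a dedicated GPU instance running 24/7 and a cheaper CPU instance with a smaller inference accelerator attached on demand:

```python
# Back-of-the-envelope monthly cost comparison for 24/7 inference.
# All hourly rates are ASSUMED for illustration, not actual cloud prices.

HOURS_PER_MONTH = 730  # average hours in a month

dedicated_gpu_rate = 3.06   # assumed $/hr, dedicated GPU instance
cpu_rate = 0.19             # assumed $/hr, general-purpose CPU instance
accelerator_rate = 0.22     # assumed $/hr, attached inference accelerator

# Running a full GPU instance around the clock:
dedicated_monthly = dedicated_gpu_rate * HOURS_PER_MONTH

# Running a CPU instance with a right-sized accelerator attached:
attached_monthly = (cpu_rate + accelerator_rate) * HOURS_PER_MONTH

savings_pct = 100 * (1 - attached_monthly / dedicated_monthly)

print(f"Dedicated GPU instance:     ${dedicated_monthly:,.2f}/month")
print(f"CPU + attached accelerator: ${attached_monthly:,.2f}/month")
print(f"Savings:                    {savings_pct:.0f}%")
```

With these assumed rates the attached-accelerator setup comes out far cheaper for an always-on prediction service, which is the scenario thwy12321 raises; the gap narrows or reverses if the workload actually saturates a full GPU.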