top | item 18553590

Amazon Elastic Inference – GPU-Powered Deep Learning Inference Acceleration

50 points| jedwhite | 7 years ago |aws.amazon.com | reply

3 comments

order
[+] jedwhite|7 years ago|reply
Not sure why this post is getting flagged to oblivion, but it is technically pretty interesting.

GPUs remain expensive, and their use is typically prioritized for training, and then inference is run on CPUs. This provides a cost effective way to attach GPU resources on demand to regular instances, rather than having to run dedicated GPU instances.

[+] thwy12321|7 years ago|reply
How is there not more interest in this? This is huge for anyone trying to bootstrap a machine learning business. If you need a 24/7 on prediction service the costs of full fledged GPUs can be prohibitive and a waste. This allows me to distribute my prediction across multiple nodes.