igmor | 1 year ago

Can you share any data on how big of a cluster is running Ray jobs?

thedood|1 year ago

From the blog post, the largest individual Ray cluster that was observed running a production compaction job in Q1 had 26,846 vCPUs and ~210TiB of RAM. This is roughly equivalent to a Ray cluster composed of 839 r5.8xlarge EC2 nodes (w/ 32 vCPUs and 256GiB RAM per node).
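
The equivalence can be checked with a little arithmetic. This is only a sketch in Python using the figures quoted in this comment; the variable and function names are made up for illustration.

```python
import math

# Figures quoted in the comment (RAM is approximate there, so it is not used as an input).
CLUSTER_VCPUS = 26_846

# r5.8xlarge shape as stated in the comment.
NODE_VCPUS = 32
NODE_RAM_GIB = 256


def nodes_needed(total_vcpus: int, vcpus_per_node: int) -> int:
    """Smallest whole number of nodes that covers the vCPU count."""
    return math.ceil(total_vcpus / vcpus_per_node)


nodes = nodes_needed(CLUSTER_VCPUS, NODE_VCPUS)
fleet_vcpus = nodes * NODE_VCPUS
fleet_ram_tib = nodes * NODE_RAM_GIB / 1024  # GiB -> TiB

print(f"{nodes} nodes, {fleet_vcpus} vCPUs, {fleet_ram_tib:.2f} TiB RAM")
```

Sizing by vCPU gives 839 nodes, which slightly overshoots the vCPU count and lands just under the quoted ~210TiB of RAM.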

layoric|1 year ago

For those interested, this would cost:

- ~$1691/hour on demand

- ~$1065/hour reserved

- ~$521/hour spot

Not including any related data transfer costs.
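
To see what these totals imply per node, here is a rough sketch in Python. It assumes the 839-node r5.8xlarge equivalent from the parent comment and simply divides the hourly totals above by that count; the names are illustrative, and actual rates vary by region and over time.

```python
# Assumption: the 839-node equivalent cluster from the parent comment.
NODES = 839

# Hourly totals in USD, as quoted above.
hourly_totals = {"on demand": 1691, "reserved": 1065, "spot": 521}


def per_node_rate(total_per_hour: float, nodes: int = NODES) -> float:
    """Implied hourly price of a single node."""
    return total_per_hour / nodes


def savings_vs_on_demand(total_per_hour: float) -> float:
    """Fractional saving relative to the on-demand total."""
    return 1 - total_per_hour / hourly_totals["on demand"]


for plan, total in hourly_totals.items():
    print(
        f"{plan}: ~${per_node_rate(total):.2f} per node-hour, "
        f"{savings_vs_on_demand(total):.0%} below on demand"
    )
```

The implied rates come out to about $2.02, $1.27, and $0.62 per node-hour respectively.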