top | item 46251160

(no title)

bleepblap | 2 months ago

I think you might be swapping RDMA with RoCE - RDMA can happen entirely within a single node. For example between an NVME and a GPU.

discuss

order

wmf|2 months ago

Within a single node it's just called DMA. RDMA is DMA over a network and RoCE is RDMA over Ethernet.

bleepblap|2 months ago

Sorry, but it certainly isn't--

https://docs.nvidia.com/cuda/gpudirect-rdma/index.html

The "R" in RDMA means there are multiple DMA controllers who can "transparently" share address spaces. You can certainly share address spaces across nodes with RoCE or Infiniband, but thats a layer on top