(no title)
darknoon | 1 month ago
they're trying to compare at iso-power? I just want to see their box vs a box of 8 h100s b/c that's what people would buy instead, and they can divide tokens and watts if that's the pitch.
darknoon | 1 month ago
they're trying to compare at iso-power? I just want to see their box vs a box of 8 h100s b/c that's what people would buy instead, and they can divide tokens and watts if that's the pitch.
ac29|1 month ago
Yeah they are defining a "rack" as 15kW, though 3x H100 PCIe is only a bit over 1kW. So they are assuming GPUs are <10% of rack power usage which sounds suspiciously low.
bradfa|1 month ago
minimaltom|1 month ago
_zoltan_|1 month ago
furthermore usually NVLink connected within the box (SXM instead of PCIe cards, although the physical data link is still PCIe.)
this is important because the daughter board provides PCIe switches which usually connect NVMe drives, NICs and GPUs together such that within that subcomplex there isn't any PCIe oversubscription.
since last year for a lot of providers the standard is the GB200 I'd argue.