top | item 38365832 (no title) varunshenoy | 2 years ago Slightly different set of trade-offs, but similar mental model. You always use large batch sizes (compute bound) and the bottleneck usually ends up communication between GPUs/nodes. discuss order hn newest No comments yet.
No comments yet.