top | item 35258242 (no title) ioedward | 2 years ago Normally people split up the model across multiple GPUs, i.e. model/tensor parallelism. discuss order hn newest No comments yet.
No comments yet.