top | item 35258242

(no title)

ioedward | 2 years ago

Normally people split up the model across multiple GPUs, i.e. model/tensor parallelism.

discuss

order

No comments yet.