top | new | best | ask | show | jobs

top | item 35258242

(no title)

ioedward | 2 years ago

Normally people split up the model across multiple GPUs, i.e. model/tensor parallelism.

discuss

order

No comments yet.

powered by hn/api // news.ycombinator.com