artembugara | 6 months ago

thanks, this part is clear to me.

but I need to understand 20 x 1k token throughput

I assume it just might be too early to know the answer

Tostino|6 months ago

I legitimately cannot think of any hardware I know of that would get you to that throughput over that many streams (I don't work in the server space, so there may be some new stuff I am unaware of).

artembugara|6 months ago

oh, I totally understand that I'd need multiple GPUs. I just want to know which GPU specifically, and how many
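
The "how many" part can be framed as simple arithmetic once a per-GPU number has been measured. A minimal sketch, assuming "20 x 1k" means 20 concurrent streams at 1,000 tokens per second each (the thread does not confirm this reading). The function name `gpus_needed` and its parameters are hypothetical, and the per-GPU throughput has to come from your own benchmark of the specific model and serving stack.

```python
import math


def gpus_needed(streams: int,
                tokens_per_sec_per_stream: float,
                measured_tokens_per_sec_per_gpu: float) -> int:
    """Estimate a GPU count from an aggregate throughput target.

    Assumes throughput scales linearly when adding independent replicas,
    which is an idealization. All names here are hypothetical.
    """
    if measured_tokens_per_sec_per_gpu <= 0:
        raise ValueError("measured per-GPU throughput must be positive")
    # Total tokens per second the deployment must sustain across all streams.
    aggregate = streams * tokens_per_sec_per_stream
    # Round up: a fractional GPU still means provisioning a whole one.
    return math.ceil(aggregate / measured_tokens_per_sec_per_gpu)


# 2500 tokens/s per GPU is a placeholder value, not a benchmark result.
print(gpus_needed(20, 1000, 2500))
```

This only covers aggregate throughput. Adding replicas raises the total tokens per second served, but it does not make any single stream faster, so the per-stream rate still has to be achievable on one replica.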