DiederikVink | 1 year ago

That's a great question, but it's hard to get enough insight into how Groq serves models to know what's missing.

If I had to hazard a guess, it would be that their system architecture (the number of chips and the chip architecture itself) might not be designed for high-concurrency workloads.
