top | item 45155054 (no title) drbscl | 5 months ago Distributed compute is cool, but $320 for 13 tokens/s on a tiny input prompt, 4 bit quantization, and 3B active parameter model is very underwhelming discuss order hn newest No comments yet.
No comments yet.