(no title)
frabcus | 6 months ago
The limiting factor compared to local is dedicated VRAM - if you dedicate 80GB of VRAM locally 24 hours/day so response times are fast, you're wasting most of the time when you're not querying.
frabcus | 6 months ago
The limiting factor compared to local is dedicated VRAM - if you dedicate 80GB of VRAM locally 24 hours/day so response times are fast, you're wasting most of the time when you're not querying.
nodja|6 months ago
frabcus|6 months ago
unknown|6 months ago
[deleted]