top | item 46471908

(no title)

leoh | 1 month ago

Not for inference, right?

discuss

correct - h100 can do like 100 tokens per second on a gpt4 like model, but you'd need to account for regular fine-tuning to accurately compare to a person, hence 4 or so. of course the whole comparison is inane since computers and humans are obviously so different ha...