top | item 45578343

(no title)

xs83 | 4 months ago

Now this looks much more interesting! Is the top one input tokens and the second one output tokens?

So 38.54 t/s on 120B? Have you tested filling the context too?

discuss

ggerganov|4 months ago