top | item 40319765

(no title)

markdeloura | 1 year ago

How much less energy does a query against an 8B model take, vs a 70B model? Can we get more clever about using smaller, more specialized models?

discuss

order

No comments yet.