top | item 46681789

(no title)

pixelmelt | 1 month ago

I would look into running a 4 bit quant using llama cpp (or any of its wrappers)

discuss

order

No comments yet.