nhecker|7 months ago
Well now I'm curious; how is a layer judged on its relative need for precision? I guess I still have a lot of learning to do w.r.t. how quantization is done. I was under the impression it was done once, statically, and produced a new giant GGUF blob or whatever format your weights are in. Does that assumption still hold true for the approach you're describing?
irthomasthomas|7 months ago
clownpenis_fart|7 months ago
[deleted]
smcleod|7 months ago