top | item 47169962 (no title) HanClinto | 4 days ago If you don't mind a stupid question, is this essentially dynamic quantization? I'm trying to understand how this is different from using a regular quantized model to squeeze more parameters into less RAM. discuss order hn newest No comments yet.
No comments yet.