(no title)
formalsystem | 1 year ago
So for example for AWQ and GPTQ we can accelerate them by using a fast int4 kernel called tinygemm
formalsystem | 1 year ago
So for example for AWQ and GPTQ we can accelerate them by using a fast int4 kernel called tinygemm
No comments yet.