top | item 46885232 (no title) codedokode | 26 days ago 30-A3B model gives 13 t/s without GPU (I noticed that token/sec * # of params matches memory bandwidth). discuss order hn newest yencabulator|24 days ago Something like 21 t/s on pure CPU on a mini PC that's <2 years old.
yencabulator|24 days ago