top | item 46961977 (no title) fishpham | 20 days ago Those won’t be sufficient to run SOTA/trillion parameter models discuss order hn newest Zambyte|20 days ago And most tasks don't demand that. general1465|20 days ago Distilled models are good enough.
Zambyte|20 days ago
general1465|20 days ago