top | item 47034495

(no title)

bertili | 14 days ago

Most certainly not, but the Unsloth MLX fits 256GB.

discuss

order

embedding-shape|14 days ago

Curious what the prefilled and token generation speed is. Apple hardware already seem embarrassingly slow for the prefill step, and OK with the token generation, but that's with way smaller models (1/4 size), so at this size? Might fit, but guessing it might be all but usable sadly.

regularfry|14 days ago

They're claiming 20+tps inference on a macbook with the unsloth quant.