top | item 47201915

(no title)

CamperBob2 | 1 day ago

Try the 27B dense model. It will likely do much better than the 35b MoE with only 3B active experts.

Also, performance on research-y questions isn't always a good indicator of how the model will do for code generation or agent orchestration.

discuss

order

regularfry|13 hours ago

Currently sat waiting for the unsloth fixed quants to drop, but I'm on the edge of my seat for this.

Balinares|6 hours ago

Wait, didn't they drop like two days ago?