top | item 44044737

(no title)

Deathmax | 9 months ago

It's not a 4B parameter model. The E4B variant is 7B parameters with 4B loaded into memory when using per-layer embedding cached to fast storage, and without vision or audio support.

discuss

order

zamadatix|9 months ago

The link says E2B and E4B have 4B and 8B raw parameters, where do you see 7B?

jdiff|9 months ago

There's a 7B mentioned in the chat arena ELO graph, I don't see any other references to it though.