top | item 44044737 (no title) Deathmax | 9 months ago It's not a 4B parameter model. The E4B variant is 7B parameters with 4B loaded into memory when using per-layer embedding cached to fast storage, and without vision or audio support. discuss order hn newest zamadatix|9 months ago The link says E2B and E4B have 4B and 8B raw parameters, where do you see 7B? jdiff|9 months ago There's a 7B mentioned in the chat arena ELO graph, I don't see any other references to it though. load replies (1)
zamadatix|9 months ago The link says E2B and E4B have 4B and 8B raw parameters, where do you see 7B? jdiff|9 months ago There's a 7B mentioned in the chat arena ELO graph, I don't see any other references to it though. load replies (1)
jdiff|9 months ago There's a 7B mentioned in the chat arena ELO graph, I don't see any other references to it though. load replies (1)
zamadatix|9 months ago
jdiff|9 months ago