top | item 39871436 (no title) radq | 1 year ago 1/3rd "activated parameters", while also requiring 2x the VRAM. discuss order hn newest YetAnotherNick|1 year ago That's the point of MoE. Sacrificing VRAM for compute/RAM bandwidth which makes it harder sell for consumer devices but easier for server devices where things are more likely to be compute or RAM bandwidth bound. unknown|1 year ago [deleted]
YetAnotherNick|1 year ago That's the point of MoE. Sacrificing VRAM for compute/RAM bandwidth which makes it harder sell for consumer devices but easier for server devices where things are more likely to be compute or RAM bandwidth bound.
YetAnotherNick|1 year ago
unknown|1 year ago
[deleted]