top | item 39871436

(no title)

radq | 1 year ago

1/3rd "activated parameters", while also requiring 2x the VRAM.

discuss

order

YetAnotherNick|1 year ago

That's the point of MoE. Sacrificing VRAM for compute/RAM bandwidth which makes it harder sell for consumer devices but easier for server devices where things are more likely to be compute or RAM bandwidth bound.