top | item 39957659

(no title)

zingelshuher | 1 year ago

Intuitively looks like models should be close enough, or sparse enough for merge to work. I wonder if MoE experts can be merged(?)

discuss

order

No comments yet.