item 39957659 (no title)
zingelshuher | 1 year ago

Intuitively, it looks like the models should be close enough, or sparse enough, for a merge to work. I wonder if MoE experts can be merged(?)
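A toy sketch of the "sparse enough" intuition, under stated assumptions: two fine-tunes share one base, each changes only a few weights, and the changes do not land on the same positions. In that case, adding both sets of changes back onto the base keeps each fine-tune's edits intact. The flat lists of floats and the names `task_delta`, `overlap`, and `merge` are hypothetical and for illustration only; this is not a method described in the thread, and it says nothing about whether MoE experts behave this way.

```python
# Toy illustration only: "models" are flat lists of floats sharing one base.

def task_delta(base, finetuned):
    """Per-parameter change a fine-tune made relative to the shared base."""
    return [f - b for b, f in zip(base, finetuned)]


def overlap(delta_a, delta_b):
    """Count positions that both fine-tunes modified (potential conflicts)."""
    return sum(1 for a, b in zip(delta_a, delta_b) if a != 0.0 and b != 0.0)


def merge(base, finetuned_models):
    """Add every fine-tune's delta back onto the base weights."""
    merged = list(base)
    for model in finetuned_models:
        for i, d in enumerate(task_delta(base, model)):
            merged[i] += d
    return merged


base = [1.0, 2.0, 3.0, 4.0]
model_a = [1.5, 2.0, 3.0, 4.0]  # changed index 0 only
model_b = [1.0, 2.0, 3.0, 3.0]  # changed index 3 only

conflicts = overlap(task_delta(base, model_a), task_delta(base, model_b))
merged = merge(base, [model_a, model_b])
print(conflicts, merged)
```

When the deltas are dense and overlap heavily, the summed changes interfere with each other, which is where the "close enough" half of the intuition would have to carry the merge instead.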
No comments yet.