top | item 43294175

(no title)

johntb86 | 11 months ago

I'd be curious what would happen if you SFTed a larger model with successful reasoning traces from the smaller model. Would it pick up the overall reasoning pattern, but be able to apply it to more cases?

discuss

order

No comments yet.