top | item 43294175 (no title) johntb86 | 11 months ago I'd be curious what would happen if you SFTed a larger model with successful reasoning traces from the smaller model. Would it pick up the overall reasoning pattern, but be able to apply it to more cases? discuss order hn newest No comments yet.
No comments yet.