anorwell | 6 months ago

I think your example reflects well on oss-20b, not poorly. It (may) show that they've been successful in separating reasoning from knowledge. You don't _want_ your small reasoning model to waste weights memorizing minutiae.