(no title)
atlex2 | 1 year ago
For small models, and when attention is "taken up", these sorts of questions really throw a model for a loop. Agreed - especially noticeable with small reasoning models.
KiwiJohnno | 1 year ago