top | item 44876513

(no title)

whymauri | 6 months ago

Papers have been doing rollouts that involve a model proposing N solutions and then self-reviewing to choose the best one (prior to the verifier). So far, I think that's been counted as one pass.

discuss

order

No comments yet.