zora_goron | 1 year ago

Does anyone know how "reasoning effort" is implemented technically? Does it involve differences in the pre-training, RL, or prompting phases, or in all of them?
