tempusalaria | 5 months ago

A lot of current models' code and science capabilities do not come from next-token prediction (NTP) training.

Indeed, it seems that most language model RL does not even use process supervision, so it is a long way from NTP.
