top | item 46127938

(no title)

bckr | 2 months ago

That’s where they take their big pile of data and train the model to do next-token-prediction.

discuss

order

No comments yet.