andy12_ | 13 days ago
If anything, they predict words based on a heuristic ensemble of two things: which word is most likely to come next in similar sentences, and which word is most likely to lead to a higher final reward.
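To make the "ensemble" idea concrete, here is a toy sketch, assuming the two signals can be written as a next-word probability table and a per-word reward estimate. The names `combine`, `base_probs`, `rewards`, and `beta`, and all the numbers, are made up for illustration; a real model does not expose an explicit reward table like this, and this is not any particular lab's training method.

```python
import math

def combine(base_probs, rewards, beta=1.0):
    """Reweight a base next-word distribution by exp(beta * reward), then renormalize.

    base_probs: hypothetical "what usually comes next" probabilities.
    rewards:    hypothetical estimate of how much each word helps the final reward.
    beta:       how strongly the reward signal is weighted (0 means ignore it).
    """
    # Work in log space: log p(word) + beta * reward(word).
    scores = {w: math.log(p) + beta * rewards.get(w, 0.0)
              for w, p in base_probs.items()}
    # Subtract the max before exponentiating for numerical stability.
    top = max(scores.values())
    unnorm = {w: math.exp(s - top) for w, s in scores.items()}
    total = sum(unnorm.values())
    return {w: v / total for w, v in unnorm.items()}

# Made-up numbers: "maybe" is the most common continuation,
# but "yes" is assumed to lead to a higher final reward.
base_probs = {"maybe": 0.5, "yes": 0.3, "no": 0.2}
rewards = {"maybe": 0.0, "yes": 2.0, "no": 0.0}

mixed = combine(base_probs, rewards, beta=1.0)
best_base = max(base_probs, key=base_probs.get)
best_mixed = max(mixed, key=mixed.get)
print(best_base, best_mixed)
```

With `beta=0.0` the function returns the base distribution unchanged, which is the pure "most likely next word" case; raising `beta` shifts probability toward words with higher assumed reward.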
csomar | 13 days ago
So... "finding the most likely next word based on what they've seen on the internet"?
andy12_ | 13 days ago
[1] https://arxiv.org/pdf/2509.19249
hansmayer | 13 days ago
andy12_ | 13 days ago
[1] https://cdn.openai.com/pdf/d04913be-3f6f-4d2b-b283-ff432ef4a...