top | item 47041706

(no title)

nosuchthing | 13 days ago

LLMs can't access the training data that's less than the statistically most common token, so they use a random jitter.

With that randomness comes statistically irrelevant results.

discuss

order

No comments yet.