top | item 45807676

(no title)

hamsic | 3 months ago

Sure, a model-based coder can losslessly compress any token stream. I just meant that for human-written text, the model’s prediction diverges from how the text was actually produced — so the compression is formally lossless, but not semantically faithful or efficient.

discuss

No comments yet.