top | item 47078241

(no title)

matthewbauer | 10 days ago

It seems like they should be able to “overweight” newer training data. But the risk is the newer training data is going to skew more towards AI slop than older training data.

discuss

order

otabdeveloper4|10 days ago

There won't ever be newer training data.

The OG data came from sites like Stackoverflow. These sites will stop existing once LLMs become better and easier to use. Game over.

esclerofilo|10 days ago

Every time claude code runs tests or builds after a change, it's collecting training data.