top | item 47017244

derefr | 15 days ago

I wonder if these publishers would be more amenable to a private archiver that only serves registered academic / journalistic research projects (the way most physical private archives do), with a specific provision to never provide data to companies that would resell it or use it for training of generative models.

eternauta3k|15 days ago

They already have archives with online and printed articles which they license to libraries, because the libraries take care of rate limiting and limiting abuse.

ninjagoo|15 days ago

They probably have internal archives if they're smart, but those aren't accessible to the public. I think the issue isn't whether the data is archived, but whether that information will remain available to the public for the foreseeable future.

g-b-r|15 days ago

They surely have archives of the newspapers; they're much less likely to have archives of what they publish online.

And a local archive is one fire, business decision, poor technical choice, etc. away from getting permanently lost.

coffeefirst|15 days ago

Yes. Most publishers already do syndication deals. This is a fine idea.

The problem with the LLMs is they capture the value chain and give back nothing. It didn’t have to be this way. It still doesn’t.