top | item 45143877

(no title)

gooosle | 5 months ago

So... it would be a lot cheaper to just buy all of the books?

discuss

order

gpm|5 months ago

Yes, much.

And they actually went and did that afterwards. They just pirated them first.

dude250711|5 months ago

What is the HN term for this? "Bootstrapping" your start up? Or is it "growth-hacking" it?

rise_before_sun|5 months ago

Where can I find source that says Anthropic bought the pirated books afterwards? I haven't seen this in any official document.

Also, do we know if the newer models were trained without the pirated books?

eviks|5 months ago

That might be practically impossible given the number of rights holders worldwide

privatelypublic|5 months ago

The permission to buy them was already settled by Google Books in the 00's.

_alternator_|5 months ago

They did, but only after they pirated the books to begin with.

privatelypublic|5 months ago

Few. This settlement potentially weakens all challenges to the use of copyrighted works in training LLM's. I'd be shocked if behind closed doors there wasn't some give and take on the matter between Executives/investors.

A settlement means the claimants no longer have a claim, which means if they're also part of- say, the New York Times affiliated lawsuit- they have to withdraw. A neat way of kneecapping a country wide decision that LLM training on copy written material is subject to punitive measures don't you think?

freejazz|5 months ago

That's not even remotely true. Page 4 of the settlement describes released claims which only relate to the pirating of books. Again, the amount of misinformation and misunderstanding I see in copyright related threads here ASTOUNDS.