top | item 45143776

(no title)

GodelNumbering | 5 months ago

Settlement Terms (from the case pdf)

1. A Settlement Fund of at least $1.5 Billion: Anthropic has agreed to pay a minimum of $1.5 billion into a non-reversionary fund for the class members. With an estimated 500,000 copyrighted works in the class, this would amount to an approximate gross payment of $3,000 per work. If the final list of works exceeds 500,000, Anthropic will add $3,000 for each additional work.

2. Destruction of Datasets: Anthropic has committed to destroying the datasets it acquired from LibGen and PiLiMi, subject to any legal preservation requirements.

3. Limited Release of Claims: The settlement releases Anthropic only from past claims of infringement related to the works on the official "Works List" up to August 25, 2025. It does not cover any potential future infringements or any claims, past or future, related to infringing outputs generated by Anthropic's AI models.

discuss

order

privatelypublic|5 months ago

Don't forget: NO LEGAL PRECEDENT! which means, anybody suing has to start all over. You only settle in this scenario/point if you think you'll lose.

Edit: I'll get ratio'd for this- but its the exact same thing google did in it's lawsuit with Epic. They delayed while the public and courts focused in apple (oohh, EVIL apple)- apple lost, and google settled at a disadvantage before they had a legal judgment that couldn't be challenged latter.

ignoramous|5 months ago

Or, if you think your competition, also caught up in the same quagmire, stands to lose more by battling for longer than you did?

tpmoney|5 months ago

> You only settle in this scenario/point if you think you'll lose.

Or because you already got the judgement you wanted. Remember Athropic's training of the AI was determined to be fair use for all the legally acquired items, which Anthropic claims is their current acquisition model anyway. If we assume that's true for the sake of argument, there's no point in fighting a battle on the remaining part unless they have something to gain by it. Since they're not doing that anymore, they don't gain, and run a very high risk of losing more. From a purely PR perspective, this is the right move.

terminalshort|5 months ago

There is already a mountain of legal precedent that you can't just download copyrighted work. That's what this lawsuit is about. Just because one of the parties is Anthropic doesn't mean this is some new AI thing.

daedrdev|5 months ago

A full case is many more years of suits and appeals with high risks, so its natural to settle which obviously means no precedent

koolala|5 months ago

Wont Facebook just get sued for the same thing now and maybe set precedent?

rendaw|5 months ago

So they can also keep models trained on the datasets? That seems pretty big too, unless the half life of models is so low it doesn't matter.

aprilthird2021|5 months ago

It's a separate suit being wages against Meta and OpenAI etc.

There's piracy, then there's making available a model to the public which can regurgitate copyrighted works or emulate them. The latter is still unsettled

gooosle|5 months ago

So... it would be a lot cheaper to just buy all of the books?

gpm|5 months ago

Yes, much.

And they actually went and did that afterwards. They just pirated them first.

eviks|5 months ago

That might be practically impossible given the number of rights holders worldwide

privatelypublic|5 months ago

The permission to buy them was already settled by Google Books in the 00's.

_alternator_|5 months ago

They did, but only after they pirated the books to begin with.

privatelypublic|5 months ago

Few. This settlement potentially weakens all challenges to the use of copyrighted works in training LLM's. I'd be shocked if behind closed doors there wasn't some give and take on the matter between Executives/investors.

A settlement means the claimants no longer have a claim, which means if they're also part of- say, the New York Times affiliated lawsuit- they have to withdraw. A neat way of kneecapping a country wide decision that LLM training on copy written material is subject to punitive measures don't you think?

manbash|5 months ago

Thank you. I assumed it would be quicker to find the link to the case PDF here, but your summary is appreciated!

Indeed, it is not only payout, but the destruction of the datasets. Although the article does quote:

> “Anthropic says it did not even use these pirated works,” he said. “If some other generative A.I. company took data from pirated source and used it to train on and commercialized it, the potential liability is enormous. It will shake the industry — no doubt in my mind.”

Even if true, I wonder how many cases we will see in the near future.

pier25|5 months ago

Only 500,000 copyrighted works?

I was under the impression they had downloaded millions of books.

KittenInABox|5 months ago

Individual authors had to join the class action lawsuit, sadly. They were not all automatically registered for each violation.

testing22321|5 months ago

I’m an author, can I get in on this?

A_D_E_P_T|5 months ago

I had the same question.

It looks like you'll be able to search this site if the settlement is approved:

> https://www.anthropiccopyrightsettlement.com/

If your work is there, you qualify for a slice of the settlement. If not, you're outta luck.

__mharrison__|5 months ago

I'm an author. Can I get anthropic stock instead?