top | item 38782238

(no title)

which laws are broken exactly? it's not remotely settled law that "training an NN = copyright infringement"

discuss

The training isn't the issue per se, it's the regurgitation of verbatim text (or close enough to be immediately identifiable) within a for-profit product. Worse still that the regurgitation is done without attribution.

LargeTomato|2 years ago

The legal argument, which I'm sure you are very well aware of, is that training a model on data, reorganizing, and then presenting that data as your own is copyright infringement.

professor_x|2 years ago

I don't think OP is arguing in bad faith.The fact is it's unclear what laws this legal argument is supported by.

munchler|2 years ago

Can you elaborate a bit more? That’s actually just a claim, not a legal argument.

Copyright law allows for transformative uses that add something new, with a further purpose or different character, and do not substitute for the original use of the work. Are LLM’s not transformative?

unknown|2 years ago

[deleted]

unknown|2 years ago

[deleted]

anamexis|2 years ago

Right, hence the lawsuit. They allege that the Copyright Act is the law that was broken.

unknown|2 years ago

[deleted]

zztop44|2 years ago

No, we’re seeing the first steps of it (maybe) becoming settled law.