The training isn't the issue per se, it's the regurgitation of verbatim text (or close enough to be immediately identifiable) within a for-profit product. Worse still that the regurgitation is done without attribution.
The legal argument, which I'm sure you are very well aware of, is that training a model on data, reorganizing, and then presenting that data as your own is copyright infringement.
Can you elaborate a bit more? That’s actually just a claim, not a legal argument.
Copyright law allows for transformative uses that add something new, with a further purpose or different character, and do not substitute for the original use of the work. Are LLM’s not transformative?
noitpmeder|2 years ago
LargeTomato|2 years ago
professor_x|2 years ago
munchler|2 years ago
Copyright law allows for transformative uses that add something new, with a further purpose or different character, and do not substitute for the original use of the work. Are LLM’s not transformative?
unknown|2 years ago
[deleted]
unknown|2 years ago
[deleted]
anamexis|2 years ago
unknown|2 years ago
[deleted]
zztop44|2 years ago