(no title)
nilsbunger | 10 months ago
When model training reads the text and creates weights internally, is that a substantial transformation? I think there’s a pretty strong argument that it is.
nilsbunger | 10 months ago
When model training reads the text and creates weights internally, is that a substantial transformation? I think there’s a pretty strong argument that it is.
TheOtherHobbes|10 months ago
The point here is that book files have to be copied before they can be used for training. Copyright texts typically say something like "No unauthorised copying or transmission in any form (physical, electronic, etc.)"
Individuals who torrented music and video files have been bankrupted for doing exactly this.
The same laws should apply when a corporation downloads torrent files. What happens to them after they're downloaded is irrelevant to the argument.
If this is enforced (still to be seen...) it would be financially catastrophic for Meta, because there are set damages for works that have been registered for copyright protection - which most trad-pubbed books, and many self-pubbed books, are.
unknown|10 months ago
[deleted]
leovander|10 months ago
Only if they seeded the data and some other entity downloaded it, i.e. they hosted the data. In a previous article I believe it was called out that Meta was being a leecher (not seeding back what they downloaded).
It's the hosting that gets you, not the act of downloading it.
jayd16|10 months ago
Seems like a big gap there.
spwa4|10 months ago
mrgoldenbrown|10 months ago
triceratops|10 months ago
Workaccount2|10 months ago
realusername|10 months ago
The computer model is working differently of course but functionally it's the same idea.