It already applies to real people, doesn't it? I.e. if you read a book, you're not allowed to start printing and selling copies of that book without permission of the copyright owner, but if you learn something from that book you can use that knowledge, just like a model could.
triceratops|11 months ago
What about movies and music?
nightpool|11 months ago
carlosdp|11 months ago
The analogous scenario is "Can I read a book and publish a blog post with all the information in that book, in my own words?", and under US copyright law, the answer is: Yes.
ajross|11 months ago
No, but you can read a book, learn its contents, and then write and publish your own book to teach the information to others. The operation of an AI is rather closer to that than it is to copyright violation.
"Should" there be protections against AI training? Maybe! But copyright law as it stands is woefully inadequate to the task, and IMHO a lot of people aren't really treating with this. We need a functioning government to write well-considered laws for the benefit of all here. We'll see what we get.
nickpsecurity|11 months ago
https://www.theverge.com/2024/8/20/24224450/anthropic-copyri...
https://www.reuters.com/legal/litigation/google-sued-by-top-...
Terretta|11 months ago
ben_w|11 months ago
The part of that which doesn't apply is "print copies", at least not complete copies, but libraries often have photocopiers in them for fragments needed for research.
AI models shouldn't do that either, IMO. But unlimited complete copies is the mistake the Internet Archive made, too.
paulddraper|11 months ago
Yes, you can read books without paying, if that's how it is offered.
And you can photocopy books you own for your own personal use. But again....the analogy is remembering/leaning from a book.
dylan604|11 months ago
musicians can read the sheet music and memorize how to play it, and no longer need the music. they still have the information.
notatoad|11 months ago
if you have evidence that openAI is doing this with books that are not freely available, i'm sure the publishers would absolutely love to hear about it.
m1el|11 months ago
ryoshu|11 months ago
conjectures|11 months ago
Now, if you have eidetic memory and write out large chunks of the book from memory and publish them, that's what you could be sued for.
Terretta|11 months ago
Really? Or do they get sued for sharing as in republishing without transformation? Arguably a URL providing copyrighted content, is you offering a xerox machine.
It seems most "sued into oblivion" are the reshare problem, not the get one for myself problem.
alabastervlog|11 months ago
bee_rider|11 months ago
On the other hand, they aren’t just a copy of the training content, and whether the process that creates the weights is sufficiently transformative as to create a new work is… what’s up for debate, right?
Anyway I wish people would stop making these analogies. There isn’t a law covering AI models yet. It is a big industry at this point, and the lack of clarity seems like something we’d expect everybody (legislators and industry) to want to rectify.
codedokode|11 months ago
amelius|11 months ago
pier25|11 months ago
Not really. You can't multiply yourself a million times to produce content at an industrial scale.
aiono|11 months ago
echelon|11 months ago
And if you sell the outputs of your model that you trained on free content, you shouldn't be able to hide behind trade secret.
crorella|11 months ago
It is not remotely the same, the companies training the models are stealing the content from the internet and then profiting from it when they charge for the use of those models.
Terretta|11 months ago
Are you stealing a billboard when you see and remember it?
The notion that consuming the web is "stealing" needs to stop.
ben_w|11 months ago
The general concept of "warp drive" was introduced by John W. Campbell in 1957, "Islands of Space". Popularised by Trek, turned into maths by Alcubierre. Islands of Space feels like it took inspiration from both H G Wells (needing to explain why the War of the Worlds' ending was implausible) and Jules Verne (gang of gentlemen have call-to-action, encounter difficulties that would crush them like a bug and are not merely fine, they go on to further great adventure and reward).
Terry Pratchett had obvious inspirations from Shakespeare, Ringworld, Faust (in the title!).
In the pandemic I read "The Deathworlders" (web fic, not the book series of similar name), and by the time I'd read too many shark jumps to continue, I had spotted many obvious inspirations besides just the one that gave the name.
If I studied medieval lit, I could probably do the same with Shakespeare's inspiration.
sidewndr46|11 months ago
tsimionescu|11 months ago
amelius|11 months ago
This discussion reminds me of it.
simion314|11 months ago
Did OpenAI bought one copy of each book, or did they legaly borowed athe books and documents ?
if you copy paste rom books and claim is your content you are plagiarizing. LLMs were provent to copy paste trained content so now what? Should only big Tech be excluded from plagiarizing ?