(no title)
madsmith | 2 years ago
We phrase it like somehow the material is being copied into the LLM, but that’s not what it’s doing. It’s building a neural graph from the experience of consuming that content.
What would the world be like if humans couldn’t learn, train the weights of the interconnects of their neural tissue, from any material with a copyright?
api|2 years ago
At the very least I think LLMs trained on data that the trainer does not own or have rights to use in that manner should not be copyrightable.
madsmith|2 years ago
My thinking “the enemy gate is down” when considering the tokens “Ender’s Game” is my recalling a learned association of those tokens to the given token string.
My knowing that doesn’t strip the copyright. My telling someone the meaning and context of the phrase generally doesn’t strip the copyright away from Orson Scott Card. I’m not reproducing his work but my knowledge of it. And it’s dependent on what I do with that knowledge and how if I’ve violated his copyright.
We are prosecuting the LLMs for possessing fragments of knowledge. And we’re assuming that the recall of some of those fragments means a copy of that work is in fact contained within the weights.
bick_nyers|2 years ago
hmcq6|2 years ago