somsak2 | 2 years ago

How do you square this with OpenAI's assertion that they never use data from enterprise customers for their own training? Are you suggesting they're lying?

cornholio|2 years ago

OpenAI just slurped the entire internet to train their main model, and the world just looks on as they directly compete with and disrupt authors the globe over.

Whoever thinks they are not interested in your data and won't use any trick to get it, then double down on their classic "but your honor, it's not copyright theft, the algorithm learns just like an employee exposed to the data would", isn't paying attention.

jprete|2 years ago

This is exactly why I am personally intensely opposed to treating ML training as fair use. Practically speaking the argument justifies ignoring anyone or any group’s preference not to contribute to ML training, so it’s a massive loss of freedom to everyone else.

Heyso|2 years ago

I agree with you. What comes to mind is that if GPT learns from private data and that knowledge is then given back to (any) customer, you end up with an indirect "open source everything".

JacobThreeThree|2 years ago

They don't have to be currently lying for this to be a valid concern.

Clauses in terms of service are routinely updated or removed.

ethbr1|2 years ago

> Clauses in terms of service are routinely updated or removed.

True, but that plays a bit differently in B2B land, because your customers also have legal teams and law firms on retainer.