top | item 33240780

(no title)

ClassAndBurn | 3 years ago

My view is the copilot is not stealing open source code. It is learning from it just as a human reader would. People's disguste is based on the assimilation of what they thought was a human trait being machine derived from their work.

The copilot service backed by an army of actual humans wouldn’t be a story at all. Nor would anyone be angry, if an individual offered coding skills as a service, and had gone through the exercise of learning great amount to open source software to do so.

No open source license was written with this in mind. Because previously learning was something only humans could do and no one had issue with sharing that knowledge. Until licenses take machine learning use into account I see no problems with Copilot.

Source cannot be open if you restrict any viewing of it.

discuss

order

Sirened|3 years ago

You aren't allowed to just read code and regurgitate it in order to claim it as your own. That is, just because you memorized this great new novel you read, it doesn't mean you can go and sit down and hammer it out and sell new copies. People go to great lengths to do this sort of things (see: clean room reverse engineering [1]) in order to try and wash themselves of liability.

[1] https://en.wikipedia.org/wiki/Clean_room_design

williamcotton|3 years ago

If the code was purely utilitarian in nature, such as something that was optimized for execution time, there is plenty of precedent stating that the code in question is not covered by copyright.

Do an internet search for “copyright utilitarian” and read up on it if you don’t believe me!

Copyright is about protecting artistic expression which is held in contrast to the useful nature of a work.

deworms|3 years ago

If you think most people pay any attention to licenses or respect them you better think again. Snippets get copied verbatim with no regard to their source all the time. Licenses have no power and are routinely ignored.

PythagoRascal|3 years ago

> It is learning from it just as a human reader would

I don't see how that invalidates the copyright/license argument. So, instead of just a straight up license violation it's a license violation via plagiarism.

That argument wouldn't hold up even if it was a human that caused the violation. You can't just paraphrase someones licensed work and then lie about looking at and pretend you made it yourself, which is basically what seems to happen with co-pilot, as it doesn't also automatically reproduce the license of the code it reproduces.

abigail95|3 years ago

> You can't just paraphrase someones licensed work

Yes you can. That's exactly why you paraphrased it instead of copying verbatim.

At the fringes, your transformation may not be enough to overcome the requirements, but that's an exception. Nearly all paraphrasing is legal by default.

ClassAndBurn|3 years ago

It learns the same way a human does by learning patterns. It is not illegal to comprehend how to accomplish tasks by reading other people's source code.

The arguments against my point always assume perfect memory of everything this model is consumed. This is the plagiarism position. In reality, some patterns are more common than others and generate a code that looks nearly identical. I can’t speak for the reasons for this, as I’m not familiar with all of the methods. However, I don’t assume that is the current working state or intent of Codex.

slondr|3 years ago

> People's disguste is based on the assimilation of what they thought was a human trait being machine derived from their work.

No, people's disgust is with Microsoft violating their legal privileges.

> The copilot service backed by an army of actual humans wouldn’t be a story at all.

Correct, it would be an open-and-shut lawsuit.

stevage|3 years ago

It isn't really learning, if it's just regurgitating whole function bodies. I use Copilot a lot, and definitely see whole functions being spit out, that were presumably written by a person somewhere.

mdswanson|3 years ago

I also use Copilot a lot, and while it does suggest large function bodies, I'm not sure that it's "regurgitating" them (though it could be...I don't know). I suspect that it's seen so many function bodies that are similar that it generates another similar output. Like autocomplete in a word processor has seen so many similar chunks of text that it reproduces them based on past experience. I don't know this as a fact, of course. I'm just reacting to the word "regurgitating."

AmericanChopper|3 years ago

The Luddite reaction to copilot is very hilarious to me. It seem to be a great way to identify low-talent coders, because who else would possibly feel so threatened by an AI?… Watching HN commenters suddenly become ardent defenders of copyright is quite the sight.

ptmcc|3 years ago

Caring about licenses, fair use, and copyright has been deeply ingrained in the open source and hacker community for literally decades.

batmanturkey|3 years ago

I see your statement as an inversion of consensus reality. What actual coder would use copilot? A beginner or dabbler.

I predict your attempt at tactically “managing” this copilot scandal will not play well on HN to experienced coders, your Microsoft colleagues chiming in next claiming it boosts their productivity notwithstanding.

Yes, I do indeed suggest astroturfing afoot.