top | item 41132541

(no title)

nwoli | 1 year ago

That’s not really fair to conclude that the training data contains vanity fair images since the prompt includes “by Vanity Fair”.

I could write “with text that says Shutterstock” in the prompt but that doesn’t necessairly mean the dataset contains that

discuss

order

minimaxir|1 year ago

The logo has the same exact copyrighted typography as the real Vanity Fair logo. I've also reproduced the same-copyrighted-typography with other brands with identical composition as copyrighted images. Just asking it "Vanity Fair cover story about Shrek" at a 3:2 ratio gives it a composition identical to a Vanity Fair cover very consistently (subject is in front of logo typography partially obscuring it)

The image linked has a traditional www watermark in the lower-left as well. Even something innocous as a "Super Mario 64" prompt shows a copyright watermark: https://x.com/minimaxir/status/1819093418246631855

fennecbutt|1 year ago

If the training data includes a public blog post which has a screenshot of a vanity fair piece?

It's like GRRM complaining that LLMs can reproduce chunks of text from his books "they fed my novels into it" Oh yeah? It's definitely not all the parts of your book quoted in millions of places online, including several dedicated wiki style sites? That wouldn't be it, right?

Carrok|1 year ago

On my list of AI concerns, whether or not Vanity Fair has it’s copyright infringed does not appear.