top | item 43475996

(no title)

meeton | 11 months ago

https://i.imgur.com/xsFKqsI.png

"Draw a picture of a full glass of wine, ie a wine glass which is full to the brim with red wine and almost at the point of spilling over... Zoom out to show the full wine glass, and add a caption to the top which says "HELL YEAH". Keep the wine level of the glass exactly the same."

cruffle_duffle|11 months ago

Maybe the "HELL YEAH" added a "party implication" which shifted its "thinking" into a just-correct-enough region of latent space that it was able to actually hunt down some image somewhere in its training data of a truly full glass of wine.

I almost wonder if prompting it "similar to a full glass of beer" would get it shifted just enough.

Stevvo|11 months ago

Can't replicate. Maybe the rollout is staggered? Using Plus from Europe, it's consistently giving me a half-full glass.

amy_petrik|11 months ago

I am using Plus from Australia, and while I am not getting a full glass, I am not getting a half-full glass either. The glass I'm getting is half empty.

coder543|11 months ago

Is it drawing the image from top to bottom very slowly over the course of at least 30 seconds? If not, then you're using DALL-E, not 4o image generation.

raxxorraxor|11 months ago

The EU got the drunken version. And a good drunk knows never to top off a glass of wine. In that context the glass is already "full".

But aside from that, the results would only be comparable if you compared your prompts.

sionisrecur|11 months ago

Maybe it's half empty.

qingcharles|11 months ago

You might still be on DALL-E. My account still is if I use ChatGPT.

I switched over to the sora.com domain and now I have access to it.

eitland|11 months ago

The most interesting thing to me is that the spelling is correct.

I'm not a heavy user of AI or image generation in general, so is this also part of the new release, or has this been fixed silently since I last tried?

widerporst|11 months ago

It very much looks like a side effect of this new architecture. In my experience, text looks much better in recent DALL-E images (i.e., what ChatGPT was using before), but it is still noticeably mangled when printing more than a few letters. This model update seems to improve text rendering a lot, at least as long as the content is clearly specified.

However, when giving a prompt that requires the model to come up with the text itself, it still seems to struggle a bit, as can be seen in this hilarious example from the post: https://images.ctfassets.net/kftzwdyauwt9/21nVyfD2KFeriJXUNL...

dghlsakjg|11 months ago

The head of foam on that glass of wine is perfect!

ASalazarMX|11 months ago

I think we're really fscked, because even AI image detectors think the images are genuine. They look great in Photoshop forensics too. I hope the arms race between generators and detectors doesn't stop here.