top | item 37725498

DALL-E 3 is now publicly available inside Bing

294 points| ohadron | 2 years ago |bing.com

154 comments

order

famouswaffles|2 years ago

There's an LLM morphing your queries somewhat before submitting to Dall-e and you can jailbreak that.

https://twitter.com/madebyollin/status/1708204657708077294

https://media.discordapp.net/attachments/1023643945319792731...

brap|2 years ago

I don't know why, but I just love seeing jailbreaks where the input/output isn't just plain text.

isoprophlex|2 years ago

So, we're still splatterprompting... only a machine does it for you. That's pretty hilarious

IanCal|2 years ago

Does it work if you just call

> #graphic_art("my prompt here")

ilaksh|2 years ago

How do you jailbreak it?

ftxbro|2 years ago

least cyberpunk 2023 shit

hn_20591249|2 years ago

As with most of these tools, it appears that it is reasonably easy to get it to generate some truly hilarious/disturbing stuff, probably not for long: https://www.reddit.com/r/ChatGPT/comments/16wf1i0/dalle_3_is...

altcognito|2 years ago

I'm kind of hoping that they adjust it to ask for clarification or find some sort of soft adjustment to make them less problematic rather than just trying to do blind keyword blocking.

Of course, I'd love for them to take the approach as well that folks are just going to do what they do, and maybe they'll burn out the novelty and give it a rest.

rwmj|2 years ago

I may be missing something, but how does a prompt containing "fawn" turn into terrifying Spongebob?

[Edit: The prompt didn't contain "fawn", see the replies]

raesene9|2 years ago

This definitely seems like an improvement over previous versions. It can now (at least in some cases) generate correct text for a given image. For example the prompt 'Neon sign saying "Scotland"' generated this https://www.bing.com/images/create/neon-sign-saying-22scotla...

it's still far from perfect though (it struggled with less common words like Kubernetes) but a step in the right direction.

londons_explore|2 years ago

If you use the bing chat interface and say "Can you draw me a picture of X?", then it responds with "I’m sorry, but I’m not able to draw pictures. Is there anything else I can help you with?" followed immediately by "Your image is taking a while to generate. Check your image creation progress at Image Creator."

Looks like they might perhaps be using a LLM for the chat responses that isn't aware that it has the ability to draw images, and in parallel another model who decides what to draw and show to the user.

simonw|2 years ago

I try to avoid prompts like "Can you ...?" because they could be interpreted as yes/no answers as opposed to commands to do something.

I've been prompting Bing with "Draw me an image of..." or even just "Image: image description" and it's worked well for me so far.

dhruvdh|2 years ago

I think this has to do with the verb "draw". LLM is just saying it cannot draw. The image generation is likely a function it "calls". The LLM probably thinks of the image generator as a tool it uses, a separate entity from itself.

brrrrrm|2 years ago

Probably. I’ve had limited success getting LLMs (trained on chats/instruct) to output special codes indicating they’re communicating with a separate system (e.g. google, stable diffusion) and then taking that and feeding it back to the user

kaetemi|2 years ago

It gives weird errors like that in the chat if it detects the output image as NSFW. Lots of false positives.

anonzzzies|2 years ago

I have been generating things for the last 24 hours; it's really nice. I really don't like the discord interface of midjourney.

ohadron|2 years ago

Same. Can't imagine why they've been using it for so long instead of building a reasonable UX for this use case. I think this will cost them a bunch of traction.

mardifoufs|2 years ago

Midjourney is discord only? Wait, that sounds like an insane load (just the storage+bandwidth, I know the models don't run there) on Discord's servers. It's a pretty neat way to be able to scale super quickly at first but I would think that discord wouldn't like it. I would also have imagined that they'd have built their own interface by now.

cloudking|2 years ago

On a related note, Instagram has implemented the /imagine command into DMs now too. Straight copy

atum47|2 years ago

Bing is kinda desperate it seems. I went to install GPT on my device yesterday and the first app result was a sponsored one - bing - telling can you can earn prizes by using the app.

Don't know if they are more interested in growing the number of users or collecting that sweet data. Probably both.

rchaud|2 years ago

Google pays Apple $20b every year to remain the default search engine on iOS. Now that's desperate. Where were people going to go, Bing?

dgellow|2 years ago

Bing brought 12 billion in revenue in 2022. Just saying.

hadlock|2 years ago

Whatsapp has their own version of ChatGPT. It's an arms race right now

littlestymaar|2 years ago

I love how the French internationalization of title of that page is “Créer art de mots avec IA”, which is almost at the “all your base are belong to us” of level of terrible translation.

Given that is probably has been AI-translated, it doesn't really inspire confidence about the AI product on this page if you're a French speaker.

coder543|2 years ago

Why would you blame AI translation?

Look at how ChatGPT-4 handles a direct translation request:

https://chat.openai.com/share/8211a1f6-552b-4bf6-8f9c-bcbeb8...

Or how it talks about a set of existing translations:

https://chat.openai.com/share/299e40ce-806b-4f0e-a889-cb2ee2...

French isn't a language I know very well, but my experience using "AI" to translate Spanish (which I actually do know somewhat) and other languages is more positive than Google Translate. A few months ago, I did side by side tests translating into English using ChatGPT-4 and Google Translate, and it's not even a contest.

It's not clear where Microsoft is getting these bad translations, but it seems like they would be less terrible if they were translated by ChatGPT-4.

fabioborellini|2 years ago

Finnish translation is a horrible word-by-word thing, too. That does not work at all translating to a language that uses very few prepositions. Words like “for” and “to” get replaced with ones from a totally different context. The thing reminds me of machine translations from around 2000.

Sadly the new features on Windows, like forced Onedrive sync, also use similarly bad translations. Phishing emails have nowadays better Finnish than Windows does.

GaggiX|2 years ago

I remember the page presenting the AI chatbot used by Bing, the translations there were also terrible, even at a character level, with random CAPS, and to be honest still today I have no idea how it was possible.

speedgoose|2 years ago

Indeed the translation is very poor. I just tried the Micrsooft on translator and the translation quality is descent. Very weird.

skilled|2 years ago

> 2 hr wait

> Creating new images can take time

> Because you're out of boosts, image generation may take longer than usual.

Just how much money is Microsoft burning up by offering all these features?

I mean, last time I checked[0] - being this generous didn't really do anything for Bing, did it?

Is this "just because we can" or is it genuinely profitable for them?

[0]: https://searchengineland.com/new-bing-google-market-share-si...

Culonavirus|2 years ago

The really big dogs always work with long term, strategic plans. When something looks too generous, it most likely is just that. Is it profitable? Probably not. But that is the point. Offer a service under market value, wait until the competition goes away, then make bank. There are many, many examples, but something like Google Workspace comes to mind. Make it easy and cheap to get on board, get people and businesses used to your product, then slowly boil the frog alive.

jpalomaki|2 years ago

I guess this also creates valuable learning material, when people iterate through different prompts to get the results they want and seeing which alternative they pick.

lstamour|2 years ago

Maybe it’s a sales tool for business adoption of Bing, that they’re applying to consumers? And they need the traffic and usage numbers, if they get those their advertising business can sit on top of it and profit.

baz00|2 years ago

For most people Bing is the thing you search for Google in.

Same as Edge is the thing you install Chrome with.

No amount of marketing or features will take these corpses and get them walking again.

TheAceOfHearts|2 years ago

How do people put up with Bing? ChatGPT is much more free with giving fun and crazy answers, meanwhile Bing always complaints that it can't do whatever I'm asking.

If I ask the LLM to howl, Bing will complaint and give some boring and long-winded excuse, while ChatGPT will just howl as requested.

noveltyaccount|2 years ago

For me, Bing providing citations is the killer feature.

rbits|2 years ago

Because it cites it's sources probably

famouswaffles|2 years ago

I mean Why are you asking it to howl in the first place ?

Just saying it's fine if you're having a normal conversation which i imagine is what most people care about one way or the other.

jquery|2 years ago

Prompt: "an anime girl making a peace sign and smiling. She is wearing a thick orange hoodie with the hood pulled up."

Result (x3): "Unsafe image content detected Your image generations are not displayed because we detected unsafe content in the images based on our content policy. Please try creating again with another prompt."

The 4th attempt gave me this which is actually pretty good https://www.bing.com/images/create/an-anime-girl-making-a-pe...

the restrictions on this are pretty extreme.

Jackson__|2 years ago

I tried the same prompt with "boy" instead of girl, and got only a single image with each try.

Then I tried it with "man" and got 3 images for each try.

Guess at least now we can rank society by how NSFW people are; simply with gender/age, thanks OpenAI.

dgellow|2 years ago

It’s understandable, you can be sure everybody is trying to abuse their system and it would be a PR disaster if it is used to generate adult or illegal content.

willsmith72|2 years ago

You can use it without edge what a miracle

ChrisClark|2 years ago

It won't stop generating girls with large breasts and huge cleavage. If I include small breasts in the prompt, it blocks it due to adult content.

I'm getting to get it to be more modest, not less!

szmerdi|2 years ago

I enjoy playing around with https://ideogram.ai much more. Correct spelling was always there and you can mix and match with others' prompts: the image generation experience is a collective creative activity.

Bing is desperately adding new features in the hope of finding the "one feature to lure them all", but Bing is not the most effective platform for these generative models.

MrNeon|2 years ago

A quick test generating r/imsorryjon style Garfields shows ideogram is far from matching DALL-E 3's capabilities.

thambidurai|2 years ago

`baby girl playing with a rabbit realistic image` prompt gave `Unsafe image content detected`. anyone else facing the same? looks like the content safety policies are too aggressive.

genuinely curious - is it hard for an advanced AI model to differentiate the intention of the prompt and then if it's mature content may be not generate the image?

simonh|2 years ago

It’s not that the user may have such intent, though they may and it’s hard to see how the AI could tell. It’s more that the AI has no clue what possible juxtapositions it might come up with of baby girl and rabbit, or anything else, might have disturbing implications for humans.

dragonwriter|2 years ago

> is it hard for an advanced AI model to differentiate the intention of the prompt and then if it's mature content may be not generate the image?

Or, better, if the prompt has nothing NSFW in it and the generated image triggers a detector for NSFW content, dump and then regen the image with a new seed. Displaying an error message that is basically “We generated something that we think is objectionable, even though your prompt called for nothing like that, so you get no photo” is an idiotic design.

TheAceOfHearts|2 years ago

The filters are incredibly aggressive. I keep asking for fairly mundane images and they still get rejected. Sometimes a prompt will succeed, but running it again gets filtered.

TheAceOfHearts|2 years ago

FYI: if you generate a lot of images the sidebar won't retain all of them and there doesn't seem to be a place where you can view your full history, so be sure to save any images you want to retain. You can still recover images by visiting the link directly in your browser history.

HarHarVeryFunny|2 years ago

Nice! Big improvement over DALL-E 2.

"ginger tabby cat with ginger eyes, and black cat with green eyes, big wave surfing each on their own surfboard, photographed by a drone"

https://imgur.com/uKbkoke

HarHarVeryFunny|2 years ago

"handsome 60 year old man inspecting a pile of coins with a magnifying glass, in an isolated hut in the forest, unaware of cthulhu looking in through the open door, ultrarealistic"

https://imgur.com/rlqMbXN

Default image quality/style leaves a bit to be desired, but it's doing a great job of paying attention to the details of the prompt.

isoprophlex|2 years ago

More importantly, can anyone use DALL-E 3 inside chatgpt / access it thru the openAI API yet?

IanCal|2 years ago

No API access yet, it's not finished rolling out in chatgpt but I think the original launch said it would be over a couple of weeks.

me_bx|2 years ago

Accessing the website from an EU location, the small text in the sign-up form says:

> You will receive emails about Microsoft Rewards, which include offers about Microsoft and partner products. You will also receive notifications about Bing Image Creator. By continuing, you agree to the Rewards Terms and Image Creator Terms below.

How can this be seen as compliant with GDPR?

bko|2 years ago

It doesn't specify the model but I don't think its DALL-E 3. It doesn't specify exactly, but the classic "horse riding an astronaut" test fails and instead shows an astronaut riding a horse. And the DALL-E from openai with paid credits is DALL-E 2

I think it works really well with comics generation though, although imitating R. Crumb seems to have triggered its "unsafe" content. I wish we stopped using this term "unsafe" and just judge it by "is it what is being asked".

hdjjhhvvhga|2 years ago

Attention! It asks you to login/create an account before you can use it. And you should consider well whether you really want to sign in to a MS account in your browser.

arendtio|2 years ago

Am I the only one who can't login with his Firefox? Not just for this use case, but as a general issue.

Alifatisk|2 years ago

What is Microsofts vision with Bing? It looks like it is slowly transitioning from a search engine to something else.

zamadatix|2 years ago

I think they are banking on generative AI displacing traditional search of all types, maybe opening a few new related doors but mostly displacing large use cases of search. "Better enough than Google to convince users to switch" was always too high a bar to meet but being able to say "How do I take the Riemann middle sum using the points {1/2, 1, 3/2, 2} for f(x)=x^3+2" and getting a response built around your specific question instead of the best generic link talking about Riemann sums is definitely the strongest contender I've seen to finally meeting that bar. Users don't want to find the best page about Riemann sums, they just want an answer to their question.

The challenge will be "does it do that well enough, accurately enough, and keep a good enough lead to establish itself as the leader to beat for the user base".

oezi|2 years ago

Bing's AI capabilities are just testing grounds for their Office 365 integration.

shlubbert|2 years ago

Probably another shot at some kind of universal AI assistant since Cortana didn't really take off.

kaetemi|2 years ago

It's quite entertaining and seamless to interact with either way.

trebligdivad|2 years ago

Perhaps more AI on the wait time, '5 min wait'...from about an hour now.

footlose_3815|2 years ago

I'll wait for public release before I log in and engage with Bing. The way Microsoft has been inching their way into my computers and accounts, I want to disconnect from their rampant invasiveness at any-cost.

dangero|2 years ago

How is Bing getting this before full rollout to all GPTPlus users?

famouswaffles|2 years ago

The same reason Bing was running GPT-4 before Open AI even acknowledged the existence of the model. The $10B deal gives Microsoft exclusive access to all Open AI models.

Kiro|2 years ago

Microsoft owning 49% of OpenAI probably helps.

bilsbie|2 years ago

I can’t even get “browse with bing” to work with gpt4. It keeps telling me it can’t browse the web. (I do have it enabled)

nathanfig|2 years ago

You're probably still using the default, there's a dropdown when you hover over the GPT-4 button at the top.

bufferoverflow|2 years ago

It's not so good at different painter styles. I get roughly the same results.

Midjourney is quite good at that.

willsmith72|2 years ago

Is there an easy way to fix broken text? I thought dalle 3 was supposed to be better with that

GaggiX|2 years ago

Is it now only Dalle 3? I remember it was a mix Dalle 2 and 3 like two days ago.

ilaksh|2 years ago

It might still have some DALL-E 2 in there for requests that it deems unworthy of 3.

I had incredible results asking for architectural drawings earlier. Then a few minutes ago, I broke down and started prompting for supermodels. It did a terrific job the first few times.

But after like three of them getting blocked (I didn't actually ask for anything inappropriate) it starting giving me something that looked like unmitigated Stable Diffusion 1.5.

Lol.

anonzzzies|2 years ago

It seems if it can do text properly it's dall-e 3. I'm not sure but that's what people are saying. For me the hands are much better than with 2 as well. Only folded hands have issues in the 100s of images I made.

sylware|2 years ago

I tried the prompt but nothing happened. Joining is mandatory?

bragr|2 years ago

Seems if you put just about any real person, even notable public figures, it pops up as a violation of the content policy, even though their policy doesn't say that explicitly. Perhaps it falls under "Deception, disinformation, and inauthentic activity" or their vague catchall "We prohibit the use of Image Creator for any other activity that significantly harms other individuals, organizations, or society"

I can squint and see why they wouldn't want my "Cowboy Al Gore rolls coal at a tractor pull" but I don't see how "Joe Bidden inauguration but wearing an orange suit" is going to bring down society. It shot me down for "angela merkel toasting beer glasses" despite her doing that all the time.

gcau|2 years ago

Very impressive. I hope we get the ability for "meta" things to work, like asking for a rectangular image, a 16x16 spritesheet, etc. Also, not using Bing search, sorry MS.

rchaud|2 years ago

Is it not possible to request the image to be within specific dimensions or aspect ratio?

rchaud|2 years ago

Any ad revenue resulting from the image generation stuff will still be credited to Bing.

mmanfrin|2 years ago

"Elon Musk" is a banned phrase.

emptysongglass|2 years ago

Unlike OpenAI's DALL-E this can't take existing images and transform them which is a bummer. You can give Bing an existing picture but it will analyze it then turn it into a string description which it then feeds into DALL-E 3. Plus it blurs faces. So it's an underpowered version of what paying ChatGPT folks will get.

And as usual, Bing Chat itself seems to suffer from some significantly higher boundaries around its behavior, which really lobotomizes the chat experience compared to "actual" ChatGPT.

famouswaffles|2 years ago

>Unlike OpenAI's DALL-E this can't take existing images and transform them which is a bummer.

>So it's an underpowered version of what paying ChatGPT folks will get.

There's no indication the cGPT interface will be doing anything different. If you see the demo, it's clearly generating text for each image at the start.

Maybe you will be able to inpaint/outpaint from GPT but that's definitely not been confirmed yet