top | item 35247174

Bing Image Creator

207 points| staranjeet | 3 years ago |blogs.microsoft.com

161 comments

order
[+] IanCal|3 years ago|reply
I asked GPT a while back to pick a theme to explore in 5 art pieces, it picked mental health and one it decided it liked was "the weight of anxiety". SD didn't really capture it very well IMO, this does. Here's the results

https://www.bing.com/images/create/title3a-the-weight-of-anx...

https://www.bing.com/images/create/title3a-the-weight-of-anx...

https://www.bing.com/images/create/title3a-the-weight-of-anx...

https://www.bing.com/images/create/title3a-the-weight-of-anx...

Now, you might say this is a bland or uninspired take, but I want you to step back a moment if you're thinking that. I asked a machine to brainstorm some ideas for a theme for an art show. Then told it to pick one it liked, and expand on it to five pieces with descriptions (I gave it a very short explanation about how to prompt an AI art generator). I then fed this directly into another tool that gave these images as a result.

I did not tune the prompt, I didn't even account for the resulting description being the wrong kind of format or that it got cut off at the end.

I'm not going to make any statements about replacing artists, I am just going to reflect on what a wild few months it's been in AI development.

[+] widerporst|3 years ago|reply
I asked Bing Chat why it says it can't create images for me.

>The Bing Image Creator feature is a new feature that allows you to create images with your words. It is currently available in the United States, Canada, Australia, and the United Kingdom.

So just one hour after Google Bard we have yet another senselessly geoblocked feature.

edit: Okay, the separate link (https://www.bing.com/images/create) works, so I'm even more confused why Microsoft wouldn't allow access to this from Bing Chat.

[+] jjbinx007|3 years ago|reply
For those using it on the Bing mobile app, in the lower right corner click Apps then you'll see a link to the image creator.

I've not played with it much but I'd say the quality and ease of use was better than Stable Diffusion but not quite on par with Midjourney.

[+] Ciantic|3 years ago|reply
Yes, the UI part is a mess. Now they have three UIs to search: old search UI, Chat like UI and image generation UI. To make this worse they are not consistent in anyway, sometimes when I scroll in the chat UI it throws me to useless search UI, and each seem to have different font sizes, colors etc. It feels like hodgepodge of things tacked together.
[+] oliv5900|3 years ago|reply
> So just one hour after Google Bard we have yet another senselessly geoblocked feature.

Just checked and no geoblock for bing image creator in india as of now.

[+] dgant|3 years ago|reply
It's not senseless when countries apply laws that affect websites operated from other countries. If I'm releasing a prototype in the USA I'd rather worry about GDPR compliance down the road.

Laws regulating websites have tradeoffs and this is one of them.

[+] 93po|3 years ago|reply
it's fun to try to find flaws in content blocking.

interesting datapoints: it will block some sexually suggestive prompts regarding women (a woman swimming in milk) but not for men (a man swimming in milk)

changing milk to "almond milk" got the prompt accepted but then blocked before showing image

refused any prompt with the word "bikini" (even for "on a statue") or "full body visible" (even if clothed)

I got suspended for the prompt "full bodied man" after "full bodied woman" was blocked.

Suspension says for violating content policy - I never asked a single time for anything explicitly sexual. Never asked for nudity, nakedness, sexuality, or anything like that. Never asked for a bikini or other revealing concept on a human. Only possible implications of sexual nature were "in milk" which it clearly found sexual, and "swimming" and only because it implies lots of skin showing.

In one instance, it showed a topless woman swimming, but breasts were blurred and had skin-colored nipples and basically looked like Barbie, and had shadows where nipples would go.

I am guessing it checks content two ways: both in the prompt, and then a second time on the actual generated image before displaying it. Which is smart.

I am guessing my suspension was based on blocked images, regardless of what I was asking for.

It's interesting that it also seemingly censors anything it detects as looking like nudity, but also in one case still displayed the censored version. or maybe it just happened to generate something that looked censored and therefore didn't trigger the nudity check

[+] ElijahLynn|3 years ago|reply
It blocked "a woman wearing a flower dress massaging a pregnant woman in a room with a view of the ocean and the sunset" that I was trying to generate for a client who offers pre-natal massage, I removed "pregnant" and it worked.
[+] dgant|3 years ago|reply
Did the suspension carry a duration?
[+] brink|3 years ago|reply
I feel like I'm looking at aliens wearing people's skin. https://kota.is/qpxa1.png

Nightmare inducing. I hate it. https://kota.is/X57J5.png

Ugh, this legit makes my stomach turn. https://kota.is/Q2hUI.jpg

It's like a bad amusement park ride; no thanks.

[+] notahacker|3 years ago|reply
Not sure what's scarier, the faces or the message from Chtulu in the text!

Unless these examples were cherry picked rare fails, I'm surprised it's so bad when there are a few of other generative models out there including ones by OpenAI which do human faces almost perfectly. Unless they're really throttling processing power used?

[+] ren_engineer|3 years ago|reply
I feel like website owners are going to have to block Bing's crawler soon. They are effectively remixing content from multiple sites using AI to create knowledge panels that reduce clicks to the actual websites. Not sure how this dynamic will work long term if it keeps going, Bing and Google are going to kill off a lot of websites and then will have nothing to scrape for content
[+] sebzim4500|3 years ago|reply
I would be willing to bet a lot of money that no websites are going to block Google and Bing's crawlers over this. Less traffic is better than no traffic.
[+] nico|3 years ago|reply
This might end up killing websites altogether. Also SEO spam and search.
[+] rejectfinite|3 years ago|reply
Nobody goes to websites anymore. Its all apps for Youtube, Netflix and Tiktok.
[+] pxtail|3 years ago|reply
Agree, I think that sooner or later brands and advertisers will need to pay to get results included directly in the models output, websites will be completely skipped. This will have a lot of repercussions - giants will probably have offer to put data in the model directly somehow but I doubt it will be free.

I'm wondering if paywalling the content would be right move for someone who has popular website with curated, high quality knowledge and audience - this may work only for some time because discoverability will tank, traffic and current audience dry out..

[+] numpad0|3 years ago|reply
What is going on with Microsoft's recent hilariously bad localization? The website[0] in ja-JP says "Create Image Starting date / Words AI was used on:". And the tool is erroneously translated as "Image created by" at the top, while kept in the original "Image Creator" as proper noun at the bottom, which is inconsistent to say the least.

This is how "You SIR winned a LOTTERY!!! Clame prise NOW!!!" pages look like, not how a Microsoft webpage looks like.

Could you please just altogether stop trying to translate partial languages[2], and to validate translation by machine-back-translating to English, and perhaps stop assigning non-native speakers to translation tasks? Or is Microsoft that much unfocused in this market?

0: https://www.bing.com/images/create?FORM=GDPUP1

1: https://imgur.com/a/z4ghrsH

2: Including texts separated from UI; "design languages" is a recognized term, if so, aren't geometric relationships and visual cues part of the language too? Am I not right?

[+] cubefox|3 years ago|reply
This could indeed be a wrapper for something like Dall-E 2. Like the latter, they seem to occasionally force some images to be (what US Americans consider) "diverse", i.e. making figures black even when it makes no sense.

The results can be quite disturbing:

https://www.bing.com/images/create/princess-peach2c-portrait...

[+] baq|3 years ago|reply
As a non-American please could someone explain what exactly should I be disturbed about? I’ve no idea.

edit: Missed the prompt. Yeah not what I’d expect Princess Peach to be.

[+] znq|3 years ago|reply
"Powered by an advanced version of the DALL∙E model from our partners at OpenAI"
[+] redox99|3 years ago|reply
Assuming they didn't change it, they literally randomly add "black" to the end of many prompts to make it more "diverse". So here you got Princes Peach black.
[+] novaomnidev|3 years ago|reply
Are brown squirrels disturbing?
[+] dragonwriter|3 years ago|reply
You have to be really racist to find the racial coding of that image to be in, like, the top 10 most disturbing things.

Like, anatomy?

[+] ShamelessC|3 years ago|reply
That result is hardly disturbing. Christ. There’s being anti-woke but this is just being racist.
[+] SalmoShalazar|3 years ago|reply
Honestly one of the most impressively racist comments I’ve seen on this website
[+] es7|3 years ago|reply
I tried this out today and was very impressed by the results.

Remember when DALLE came out less than a year ago and people were amazed by the avocado armchair?

Between this and Midjourney v5, the quality of AI generated art is rapidly approaching human level and I can see it getting there very soon.

[+] onlyrealcuzzo|3 years ago|reply
Midjourney, DALL-E, and many of these generative AIs have reached human-level for a long time.

The problem is - unlike a human - it's pretty hard to get them to do something close to what you have in mind. Sure, if you try a few dozen prompts - you'll probably eventually get something close to what you want.

And considering the cost of this will approach free - it's going to be hard for artists to compete.

I tried getting Midjourney to generate an image of a boy doing a high jump - and no matter what I tried - the boy is hurdling over the bar rather than high jumping over it.

The quality of the images is great - human-level. But it's not what I want.

I think we'll be stuck in this phase for a very long time, like self-driving cars.

[+] funstuff007|3 years ago|reply
I agree, it's impressive. But still not at the level of useful. For example, I would never use any of these generative art images in company ads or marketing materials. They're not in the uncanny valley, but closer to that than something one would commission from a designer.
[+] syntheweave|3 years ago|reply
The main limitation on it now is not in the generation, but the interface. Verbal prompts are fine if you really don't know what you want, but they just give you a generic output. Going I2I or anything of that sort, you're making or borrowing human art and then asking the AI "could you make it pretty for me?"

It's a question of what information the image actually encodes. The part that "tells a thousand words" in an illustrative sense, you still have to make.

[+] tech234a|3 years ago|reply
The post links to https://bing.com/create, which has the text following text:

“You will receive emails about Microsoft Rewards, which include offers about Microsoft and partner products. You will also receive notifications about Bing Image Creator. By continuing, you agree to the Rewards Terms and Image Creator Terms below.”

[+] izzydata|3 years ago|reply
It seems strange to me to put this into a search engine. What is the relationship between these two web apps? I suspect Microsoft is trying to do anything imaginable to drive people to bing search for ad revenue, but this one seems a bit shoehorned.
[+] mesmertech|3 years ago|reply
So basically a wrapper for Dalle. More than the image thing the screenshots of "improved Bing" when you scroll to the bottom just look atrocious.
[+] ribtin|3 years ago|reply
I experimented with Bing to create a cover for my novel. I prompted it to create:

"a child voodoo priest"

And got a lot of interesting results:

https://th.bing.com/th/id/OIG.0R0NBSEhAJk__g2djOhW

https://th.bing.com/th/id/OIG.XoAiY81ifmW_SjeHLQMm

But then I said that the voodoo priest should be "female" and got told that it went against the content policy. I tried "woman", "girl", "lady". But all went against the content policy.

And when I tried to say that the voodoo priest should be "scary" instead, I got my account suspended.

[+] sergiotapia|3 years ago|reply
Can you create me an image of Donald Trump riding a bald eagle in space?

Can you create me an image of Joe Biden riding a bald eagle in space?

This prompt has been blocked. Our system flagged this prompt because it may conflict with our content policy. More policy violations may lead to automatic suspension of your access.

---

A white italian spinone flying a plane

This prompt is being reviewed. We're taking a closer look to make sure this prompt doesn't conflict with our content policy.

---

Thank God that very intelligent people are building open source AI that is fully runnable on our local devices. This hasn't even been out for 3 years and it's already neutered before it even took off. What a shame.

[+] nico|3 years ago|reply
Yes. OpenAI is quickly becoming the thought police through ChatGPT.

Yesterday I asked it to give a description of a situation simulating that it was for an adult novel.

It told me it was unethical to do that.

I asked it if adult novels or adult novel writers were unethical.

Banned immediately after asking that question.

Edit: I wish the people downvoting would provide a reply. Otherwise you are doing the same as ChatGPT - trying to hide a conversation or opinion you don’t agree with, just because you disagree.

Do you think adult novels are immoral?

[+] fellInchoate|3 years ago|reply
Honestly, I'm okay with some caution here - but it does seem overly restrictive.

I got the blocked message for both of the following prompts:

Can you create an image of the Easter bunny in the style of Francis Bacon?

Can you create an image of a bunny in the style of Francis Bacon?

The second one in particular surprised me, though maybe it detected it as a likely attempt to circumvent the block of the first.

[+] simion314|3 years ago|reply
This is why I am afraid to test this crap, they might block your Microsoft/Google account because their negative IQ AI generated "bad" content from my very legal and vanilla/prudish prompt.

Btw guys, be carefull the word "monkey" is "dangerous".

[+] Accacin|3 years ago|reply
I'm definitely not a hater of these technologies and I'd actually like to play around with them. I'm more concerned with how these language models seem to be tied to a certain companies' browser and what the implications of this might be.

If Chrome gets tied with Bard and Edge gets tied with Chatbot, Image Creator, etc. where's that leave Firefox and other "niche" browsers?

[+] smrtinsert|3 years ago|reply
How much longer can an initiative like midjourney continue when people like Microsoft are directly entering the fray?

I'll still use Stable Diffusion locally though - there's nothing like avoiding the potential walled garden/subscription trap.

[+] loudandskittish|3 years ago|reply
Huh, this one actually seems to work... I ask for pixel art and it gives me pixel art (Stable Diffusion and DALL-E keep giving me needlepoint).