I'm kind of hoping that they adjust it to ask for clarification or find some sort of soft adjustment to make them less problematic rather than just trying to do blind keyword blocking.
Of course, I'd love for them to take the approach as well that folks are just going to do what they do, and maybe they'll burn out the novelty and give it a rest.
This definitely seems like an improvement over previous versions. It can now (at least in some cases) generate correct text for a given image. For example, the prompt 'Neon sign saying "Scotland"' generated this https://www.bing.com/images/create/neon-sign-saying-22scotla...
It's still far from perfect (it struggled with less common words like Kubernetes), but it's a step in the right direction.
If you use the Bing chat interface and say "Can you draw me a picture of X?", then it responds with "I’m sorry, but I’m not able to draw pictures. Is there anything else I can help you with?" followed immediately by "Your image is taking a while to generate. Check your image creation progress at Image Creator."
Looks like they might be using an LLM for the chat responses that isn't aware that it has the ability to draw images, and in parallel another model that decides what to draw and show to the user.
I think this has to do with the verb "draw". The LLM is just saying it cannot draw. The image generation is likely a function it "calls". The LLM probably thinks of the image generator as a tool it uses, a separate entity from itself.
Probably. I’ve had limited success getting LLMs (trained on chats/instruct) to output special codes indicating they’re communicating with a separate system (e.g. Google, Stable Diffusion) and then taking that and feeding it back to the user.
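To make the "special codes" idea concrete, here is a minimal sketch of how a wrapper might spot such a code in the model's text and hand it to a separate image system. It borrows the `#graphic_art("...")` form that is quoted in this thread; the function names `extract_image_prompt` and `route` are hypothetical, and nothing here is claimed to be how Bing actually does it.

```python
import re
from typing import Optional

# Hypothetical directive format, modeled on the `#graphic_art("...")` quote in this thread.
DIRECTIVE = re.compile(r'#graphic_art\("([^"]*)"\)')


def extract_image_prompt(model_output: str) -> Optional[str]:
    """Return the prompt inside the first directive, or None if there is none."""
    match = DIRECTIVE.search(model_output)
    return match.group(1) if match else None


def route(model_output: str) -> dict:
    """Decide whether the reply is plain text or a request for the image tool."""
    prompt = extract_image_prompt(model_output)
    if prompt is None:
        return {"type": "text", "text": model_output}
    # Strip the directive so the user only sees the conversational part.
    visible = DIRECTIVE.sub("", model_output).strip()
    return {"type": "image_request", "prompt": prompt, "text": visible}
```

With a split like this, the chat model never "draws" anything itself, which would be consistent with it saying it cannot draw while an image still shows up.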
Same. Can't imagine why they've been using it for so long instead of building a reasonable UX for this use case. I think this will cost them a bunch of traction.
Midjourney is Discord-only? Wait, that sounds like an insane load (just the storage+bandwidth, I know the models don't run there) on Discord's servers. It's a pretty neat way to be able to scale super quickly at first but I would think that Discord wouldn't like it. I would also have imagined that they'd have built their own interface by now.
Bing is kinda desperate it seems. I went to install GPT on my device yesterday and the first app result was a sponsored one - Bing - telling you that you can earn prizes by using the app.
Don't know if they are more interested in growing the number of users or collecting that sweet data. Probably both.
I love how the French internationalization of the title of that page is “Créer art de mots avec IA”, which is almost at the “all your base are belong to us” level of terrible translation.
Given that it has probably been AI-translated, it doesn't really inspire confidence about the AI product on this page if you're a French speaker.
French isn't a language I know very well, but my experience using "AI" to translate Spanish (which I actually do know somewhat) and other languages is more positive than Google Translate. A few months ago, I did side by side tests translating into English using ChatGPT-4 and Google Translate, and it's not even a contest.
It's not clear where Microsoft is getting these bad translations, but it seems like they would be less terrible if they were translated by ChatGPT-4.
Finnish translation is a horrible word-by-word thing, too. That does not work at all translating to a language that uses very few prepositions. Words like “for” and “to” get replaced with ones from a totally different context. The thing reminds me of machine translations from around 2000.
Sadly the new features on Windows, like forced OneDrive sync, also use similarly bad translations. Phishing emails nowadays have better Finnish than Windows does.
I remember the page presenting the AI chatbot used by Bing: the translations there were also terrible, even at the character level, with random CAPS, and to be honest I still have no idea how that was possible.
The really big dogs always work with long term, strategic plans. When something looks too generous, it most likely is just that. Is it profitable? Probably not. But that is the point. Offer a service under market value, wait until the competition goes away, then make bank. There are many, many examples, but something like Google Workspace comes to mind. Make it easy and cheap to get on board, get people and businesses used to your product, then slowly boil the frog alive.
I guess this also creates valuable learning material, when people iterate through different prompts to get the results they want and seeing which alternative they pick.
Maybe it’s a sales tool for business adoption of Bing that they’re applying to consumers? And they need the traffic and usage numbers; if they get those, their advertising business can sit on top of it and profit.
How do people put up with Bing? ChatGPT is much more free with giving fun and crazy answers, meanwhile Bing always complains that it can't do whatever I'm asking.
If I ask the LLM to howl, Bing will complain and give some boring and long-winded excuse, while ChatGPT will just howl as requested.
Prompt: "an anime girl making a peace sign and smiling. She is wearing a thick orange hoodie with the hood pulled up."
Result (x3): "Unsafe image content detected
Your image generations are not displayed because we detected unsafe content in the images based on our content policy. Please try creating again with another prompt."
It’s understandable, you can be sure everybody is trying to abuse their system and it would be a PR disaster if it is used to generate adult or illegal content.
I enjoy playing around with https://ideogram.ai much more. Correct spelling was always there and you can mix and match with others' prompts: the image generation experience is a collective creative activity.
Bing is desperately adding new features in the hope of finding the "one feature to lure them all", but Bing is not the most effective platform for these generative models.
`baby girl playing with a rabbit realistic image` prompt gave `Unsafe image content detected`. Anyone else facing the same? Looks like the content safety policies are too aggressive.
Genuinely curious - is it hard for an advanced AI model to differentiate the intention of the prompt and then, if it's mature content, maybe not generate the image?
It’s not that the user may have such intent, though they may and it’s hard to see how the AI could tell. It’s more that the AI has no clue which of the possible juxtapositions it might come up with of baby girl and rabbit, or anything else, might have disturbing implications for humans.
> is it hard for an advanced AI model to differentiate the intention of the prompt and then, if it's mature content, maybe not generate the image?
Or, better, if the prompt has nothing NSFW in it and the generated image triggers a detector for NSFW content, dump and then regen the image with a new seed. Displaying an error message that is basically “We generated something that we think is objectionable, even though your prompt called for nothing like that, so you get no photo” is an idiotic design.
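The "dump and regen with a new seed" idea could look roughly like the sketch below. Everything in it is an assumption for illustration: `generate`, `is_unsafe_image` and `is_unsafe_prompt` are placeholder callables standing in for whatever generator and detectors a service uses, and `max_attempts` is an arbitrary cap.

```python
import random
from typing import Any, Callable, Optional


def generate_with_retry(
    prompt: str,
    generate: Callable[[str, int], Any],      # hypothetical: (prompt, seed) -> image
    is_unsafe_image: Callable[[Any], bool],   # hypothetical output-side detector
    is_unsafe_prompt: Callable[[str], bool],  # hypothetical prompt-side detector
    max_attempts: int = 4,
    rng: Optional[random.Random] = None,
) -> Optional[Any]:
    """Return the first image that passes the detector, or None if none does."""
    rng = rng or random.Random()
    # A prompt that is itself flagged is refused outright; retrying would not help.
    if is_unsafe_prompt(prompt):
        return None
    for _ in range(max_attempts):
        seed = rng.randrange(2**32)  # fresh seed for every attempt
        image = generate(prompt, seed)
        if not is_unsafe_image(image):
            return image
        # Flagged output from a clean prompt: discard it and try again.
    return None
```

The error message would then only be shown once every attempt has been rejected, rather than after a single unlucky sample.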
The filters are incredibly aggressive. I keep asking for fairly mundane images and they still get rejected. Sometimes a prompt will succeed, but running it again gets filtered.
FYI: if you generate a lot of images the sidebar won't retain all of them and there doesn't seem to be a place where you can view your full history, so be sure to save any images you want to retain. You can still recover images by visiting the link directly in your browser history.
"handsome 60 year old man inspecting a pile of coins with a magnifying glass, in an isolated hut in the forest, unaware of cthulhu looking in through the open door, ultrarealistic"
Accessing the website from an EU location, the small text in the sign-up form says:
> You will receive emails about Microsoft Rewards, which include offers about Microsoft and partner products. You will also receive notifications about Bing Image Creator. By continuing, you agree to the Rewards Terms and Image Creator Terms below.
It doesn't specify the model, but I don't think it's DALL-E 3: the classic "horse riding an astronaut" test fails and instead shows an astronaut riding a horse. And the DALL-E from OpenAI with paid credits is DALL-E 2.
I think it works really well for comics generation, although imitating R. Crumb seems to have triggered its "unsafe" content filter. I wish we stopped using this term "unsafe" and just judged it by "is it what is being asked".
Attention! It asks you to login/create an account before you can use it. And you should consider well whether you really want to sign in to a MS account in your browser.
I think they are banking on generative AI displacing traditional search of all types, maybe opening a few new related doors but mostly displacing large use cases of search. "Better enough than Google to convince users to switch" was always too high a bar to meet but being able to say "How do I take the Riemann middle sum using the points {1/2, 1, 3/2, 2} for f(x)=x^3+2" and getting a response built around your specific question instead of the best generic link talking about Riemann sums is definitely the strongest contender I've seen to finally meeting that bar. Users don't want to find the best page about Riemann sums, they just want an answer to their question.
The challenge will be "does it do that well enough, accurately enough, and keep a good enough lead to establish itself as the leader to beat for the user base".
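For reference, the Riemann sum in that example question is small enough to check by hand or in a few lines. The sketch below assumes the listed points {1/2, 1, 3/2, 2} are the midpoints of four subintervals of width 1/2; the question as quoted does not say so, and reading them as partition endpoints would give a different number.

```python
from fractions import Fraction


def f(x):
    """The function from the example question: f(x) = x^3 + 2."""
    return x**3 + 2


# Assumption: the given points are midpoints of subintervals of width 1/2.
midpoints = [Fraction(1, 2), Fraction(1), Fraction(3, 2), Fraction(2)]
width = Fraction(1, 2)


def midpoint_sum(func, points, dx):
    """Midpoint Riemann sum: add up func(midpoint) * dx over all subintervals."""
    return sum(func(m) for m in points) * dx


result = midpoint_sum(f, midpoints, width)
print(result, float(result))  # 41/4 10.25
```

The function values are 17/8, 3, 43/8 and 10, which add up to 41/2; multiplying by the width 1/2 gives 41/4.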
I'll wait for public release before I log in and engage with Bing. The way Microsoft has been inching their way into my computers and accounts, I want to disconnect from their rampant invasiveness at any cost.
The same reason Bing was running GPT-4 before OpenAI even acknowledged the existence of the model. The $10B deal gives Microsoft exclusive access to all OpenAI models.
It might still have some DALL-E 2 in there for requests that it deems unworthy of 3.
I had incredible results asking for architectural drawings earlier. Then a few minutes ago, I broke down and started prompting for supermodels. It did a terrific job the first few times.
But after like three of them getting blocked (I didn't actually ask for anything inappropriate) it started giving me something that looked like unmitigated Stable Diffusion 1.5.
It seems if it can do text properly it's DALL-E 3. I'm not sure but that's what people are saying. For me the hands are much better than with 2 as well. Only folded hands have issues in the 100s of images I made.
Seems if you put just about any real person, even notable public figures, it pops up as a violation of the content policy, even though their policy doesn't say that explicitly. Perhaps it falls under "Deception, disinformation, and inauthentic activity" or their vague catchall "We prohibit the use of Image Creator for any other activity that significantly harms other individuals, organizations, or society"
I can squint and see why they wouldn't want my "Cowboy Al Gore rolls coal at a tractor pull" but I don't see how "Joe Bidden inauguration but wearing an orange suit" is going to bring down society. It shot me down for "angela merkel toasting beer glasses" despite her doing that all the time.
Very impressive. I hope we get the ability for "meta" things to work, like asking for a rectangular image, a 16x16 spritesheet, etc. Also, not using Bing search, sorry MS.
Unlike OpenAI's DALL-E this can't take existing images and transform them which is a bummer. You can give Bing an existing picture but it will analyze it then turn it into a string description which it then feeds into DALL-E 3. Plus it blurs faces. So it's an underpowered version of what paying ChatGPT folks will get.
And as usual, Bing Chat itself seems to suffer from some significantly higher boundaries around its behavior, which really lobotomizes the chat experience compared to "actual" ChatGPT.
>Unlike OpenAI's DALL-E this can't take existing images and transform them which is a bummer.
>So it's an underpowered version of what paying ChatGPT folks will get.
There's no indication the cGPT interface will be doing anything different. If you see the demo, it's clearly generating text for each image at the start.
Maybe you will be able to inpaint/outpaint from GPT, but that's definitely not been confirmed yet.
famouswaffles|2 years ago
https://twitter.com/madebyollin/status/1708204657708077294
https://media.discordapp.net/attachments/1023643945319792731...
brap|2 years ago
isoprophlex|2 years ago
IanCal|2 years ago
> #graphic_art("my prompt here")
unknown|2 years ago
[deleted]
ilaksh|2 years ago
ftxbro|2 years ago
hn_20591249|2 years ago
altcognito|2 years ago
rwmj|2 years ago
[Edit: The prompt didn't contain "fawn", see the replies]
russfink|2 years ago
c0pium|2 years ago
Nursie|2 years ago
raesene9|2 years ago
londons_explore|2 years ago
simonw|2 years ago
I've been prompting Bing with "Draw me an image of..." or even just "Image: image description" and it's worked well for me so far.
dhruvdh|2 years ago
brrrrrm|2 years ago
kaetemi|2 years ago
anonzzzies|2 years ago
ohadron|2 years ago
mardifoufs|2 years ago
cloudking|2 years ago
atum47|2 years ago
abraham|2 years ago
https://en.wikipedia.org/wiki/Microsoft_Bing#:~:text=Bing%20....
rchaud|2 years ago
dgellow|2 years ago
hadlock|2 years ago
littlestymaar|2 years ago
coder543|2 years ago
Look at how ChatGPT-4 handles a direct translation request:
https://chat.openai.com/share/8211a1f6-552b-4bf6-8f9c-bcbeb8...
Or how it talks about a set of existing translations:
https://chat.openai.com/share/299e40ce-806b-4f0e-a889-cb2ee2...
fabioborellini|2 years ago
GaggiX|2 years ago
speedgoose|2 years ago
LorenDB|2 years ago
skilled|2 years ago
> Creating new images can take time
> Because you're out of boosts, image generation may take longer than usual.
Just how much money is Microsoft burning up by offering all these features?
I mean, last time I checked[0] - being this generous didn't really do anything for Bing, did it?
Is this "just because we can" or is it genuinely profitable for them?
[0]: https://searchengineland.com/new-bing-google-market-share-si...
Culonavirus|2 years ago
dalf|2 years ago
https://jobs.careers.microsoft.com/global/en/job/1627555/Pri...
Found on Slashdot: https://m.slashdot.org/story/419681
jpalomaki|2 years ago
lstamour|2 years ago
baz00|2 years ago
Same as Edge is the thing you install Chrome with.
No amount of marketing or features will take these corpses and get them walking again.
TheAceOfHearts|2 years ago
noveltyaccount|2 years ago
rbits|2 years ago
famouswaffles|2 years ago
Just saying it's fine if you're having a normal conversation, which I imagine is what most people care about one way or the other.
jquery|2 years ago
Result (x3): "Unsafe image content detected Your image generations are not displayed because we detected unsafe content in the images based on our content policy. Please try creating again with another prompt."
The 4th attempt gave me this which is actually pretty good https://www.bing.com/images/create/an-anime-girl-making-a-pe...
the restrictions on this are pretty extreme.
Jackson__|2 years ago
Then I tried it with "man" and got 3 images for each try.
Guess at least now we can rank society by how NSFW people are; simply with gender/age, thanks OpenAI.
dgellow|2 years ago
willsmith72|2 years ago
ChrisClark|2 years ago
I'm trying to get it to be more modest, not less!
szmerdi|2 years ago
MrNeon|2 years ago
thambidurai|2 years ago
simonh|2 years ago
dragonwriter|2 years ago
TheAceOfHearts|2 years ago
TheAceOfHearts|2 years ago
HarHarVeryFunny|2 years ago
"ginger tabby cat with ginger eyes, and black cat with green eyes, big wave surfing each on their own surfboard, photographed by a drone"
https://imgur.com/uKbkoke
HarHarVeryFunny|2 years ago
https://imgur.com/rlqMbXN
Default image quality/style leaves a bit to be desired, but it's doing a great job of paying attention to the details of the prompt.
isoprophlex|2 years ago
IanCal|2 years ago
famouswaffles|2 years ago
me_bx|2 years ago
> You will receive emails about Microsoft Rewards, which include offers about Microsoft and partner products. You will also receive notifications about Bing Image Creator. By continuing, you agree to the Rewards Terms and Image Creator Terms below.
How can this be seen as compliant with GDPR?
bko|2 years ago
unknown|2 years ago
[deleted]
hdjjhhvvhga|2 years ago
arendtio|2 years ago
russfink|2 years ago
Alifatisk|2 years ago
zamadatix|2 years ago
oezi|2 years ago
shlubbert|2 years ago
kaetemi|2 years ago
callalex|2 years ago
https://www.bleepingcomputer.com/news/security/bing-chat-res...
trebligdivad|2 years ago
footlose_3815|2 years ago
dangero|2 years ago
famouswaffles|2 years ago
Kiro|2 years ago
bilsbie|2 years ago
nathanfig|2 years ago
unknown|2 years ago
[deleted]
bufferoverflow|2 years ago
Midjourney is quite good at that.
willsmith72|2 years ago
GaggiX|2 years ago
ilaksh|2 years ago
I had incredible results asking for architectural drawings earlier. Then a few minutes ago, I broke down and started prompting for supermodels. It did a terrific job the first few times.
But after like three of them getting blocked (I didn't actually ask for anything inappropriate) it started giving me something that looked like unmitigated Stable Diffusion 1.5.
Lol.
anonzzzies|2 years ago
sylware|2 years ago
bragr|2 years ago
bragr|2 years ago
JFK as an alien: https://www.bing.com/images/create/jfk-we-choose-to-go-to-th...
JFK and Fidel Castro at a fictional peace conference: https://www.bing.com/images/create/jfk-and-castro-meeting-at...
gcau|2 years ago
rchaud|2 years ago
rchaud|2 years ago
mmanfrin|2 years ago
airstrike|2 years ago
zer0c00ler|2 years ago
[deleted]
avgtechenjoyer|2 years ago
[deleted]
emptysongglass|2 years ago
famouswaffles|2 years ago