top | item 34769970

Illusion Diffusion: Optical Illusions Using Stable Diffusion

116 points| tancik | 3 years ago |github.com

11 comments

order
[+] prolyxis|3 years ago|reply
This mostly just left me with a greater appreciation for the seamlessness of the original rabbit-duck illusion.
[+] alanbernstein|3 years ago|reply
This is great. Those examples are not the best quality, but they're impressive.

That prompted me to generate ambigrams with stable diffusion. The results looked odd, as ambigrams tend to, but the "text" was largely illegible. I wonder when the state of the art will be able to handle that request.

[+] goldfeld|3 years ago|reply
It's odd that image AIs are not ready to overlay text. If you ask Dall-E or Midjourney also to say a few letters they do lots of nearest random neighbors by not just scrambling the idea of the word but also scribbling anything on top that it thinks looks remotely like writing but is not in any language. Maybe it's still developing the ability to read and maybe secretly creating a completely new script and lang.
[+] suyash|3 years ago|reply
love ambigrams and illusions, any description on how one can create their own ? Thanks!
[+] IIAOPSW|3 years ago|reply
I've generally been disappointed by my prompts for optical illusions. I thought it would be better at it. An optical illusion is basically what happens when you relax the constraints on a graphical depiction, allowing objects to be connected in ways that are inconsistent with 3d geometry. The trick is that the inconsistency has to be global not local. Anywhere you zoom in on still looks like normal 3d space. I expected SD to be good at this, as a priori it never had a conception of how 3d space must look to begin with.

Here's where it sucked. It seems to have learned the superficial aesthetic of an optical illusion or of "Escher" without learning the relevant component. It spits out things that either aren't optical illusions, or are just random disconnected spattering of geometrical inconsistencies without any overarching theme. A person made optical illusion will generally have a single main loop of impossibly connected objects, or at least some simple overall topology. The illusion is expected to exist on the global scale of the image, not as a weird pocket of a mostly normal image.

[+] goldfeld|3 years ago|reply
Quite cool! and creative use of AI. Do you consider the project is all fleshed out or are there improvements that could be done?