This is great. Those examples are not the best quality, but they're impressive.
That prompted me to generate ambigrams with stable diffusion. The results looked odd, as ambigrams tend to, but the "text" was largely illegible. I wonder when the state of the art will be able to handle that request.
It's odd that image AIs are not ready to overlay text. If you ask Dall-E or Midjourney also to say a few letters they do lots of nearest random neighbors by not just scrambling the idea of the word but also scribbling anything on top that it thinks looks remotely like writing but is not in any language. Maybe it's still developing the ability to read and maybe secretly creating a completely new script and lang.
I've generally been disappointed by my prompts for optical illusions. I thought it would be better at it. An optical illusion is basically what happens when you relax the constraints on a graphical depiction, allowing objects to be connected in ways that are inconsistent with 3d geometry. The trick is that the inconsistency has to be global not local. Anywhere you zoom in on still looks like normal 3d space. I expected SD to be good at this, as a priori it never had a conception of how 3d space must look to begin with.
Here's where it sucked. It seems to have learned the superficial aesthetic of an optical illusion or of "Escher" without learning the relevant component. It spits out things that either aren't optical illusions, or are just random disconnected spattering of geometrical inconsistencies without any overarching theme. A person made optical illusion will generally have a single main loop of impossibly connected objects, or at least some simple overall topology. The illusion is expected to exist on the global scale of the image, not as a weird pocket of a mostly normal image.
[+] [-] prolyxis|3 years ago|reply
[+] [-] frob|3 years ago|reply
[+] [-] alanbernstein|3 years ago|reply
That prompted me to generate ambigrams with stable diffusion. The results looked odd, as ambigrams tend to, but the "text" was largely illegible. I wonder when the state of the art will be able to handle that request.
[+] [-] goldfeld|3 years ago|reply
[+] [-] suyash|3 years ago|reply
[+] [-] IIAOPSW|3 years ago|reply
Here's where it sucked. It seems to have learned the superficial aesthetic of an optical illusion or of "Escher" without learning the relevant component. It spits out things that either aren't optical illusions, or are just random disconnected spattering of geometrical inconsistencies without any overarching theme. A person made optical illusion will generally have a single main loop of impossibly connected objects, or at least some simple overall topology. The illusion is expected to exist on the global scale of the image, not as a weird pocket of a mostly normal image.
[+] [-] jimmySixDOF|3 years ago|reply
[1] https://news.ycombinator.com/item?id=25045392
[+] [-] goldfeld|3 years ago|reply