(no title)
eminence32 | 3 months ago
Assuming that this new model works as advertised, it's interesting to me that it took this long to get an image generation model that can reliably generate text. Why is text generation in images so hard?
eminence32 | 3 months ago
Assuming that this new model works as advertised, it's interesting to me that it took this long to get an image generation model that can reliably generate text. Why is text generation in images so hard?
unknown|3 months ago
[deleted]
Filligree|3 months ago
- It requires an AI that actually understands English, I.e. an LLM. Older, diffusion-only models were naturally terrible at that, because they weren’t trained on it.
- It requires the AI to make no mistakes on image rendering, and that’s a high bar. Mistakes in image generation are so common we have memes about it, and for all that hands generally work fine now, the rest of the picture is full of mistakes you can’t tell are mistakes. Entirely impossible with text.
Nano Banana Pro seems to somewhat reliably produce entire pictures without any mistakes at all.
tobr|3 months ago
DesertVarnish|3 months ago