I was trying to get it to create an image of a tiger jumping on a pogo stick, which is way beyond its capabilities, but it cannot create an image of a pogo stick in isolation.
When given an image of an empty wine glass, it can't fill it to the brim with wine. The pogo stick drawers and wine glass fillers can enjoy their job security for months to come!
This is where smaller models are just going to be more constrained and will require additional prompting to coax out the physical description of a "pogo stick". I had similar issues when generating Alexander the Great leading a charge on a hippity-hop / space hopper.
You are right, just tried even with reference images it can't do it for me. Maybe with some good prompting.
Because in theory I would say that knowledge is something that does not have to be baked in the model but could be added using reference images if the model is capable enough to reason about them.
CamperBob2|1 month ago
Tiger on pogo stick: https://i.imgur.com/lnGfbjy.jpeg
Dunno what this is, but it's not a pogo stick: https://i.imgur.com/OmMiLzQ.jpeg
Nano Banana Pro FTW: https://i.imgur.com/6B7VBR9.jpeg
nomel|1 month ago
downboots|1 month ago
vunderba|1 month ago
Z-Image / Flux 2 / Hidream / Omnigen2 / Qwen Samples:
https://imgur.com/a/tB6YUSu
This is where smaller models are just going to be more constrained and will require additional prompting to coax out the physical description of a "pogo stick". I had similar issues when generating Alexander the Great leading a charge on a hippity-hop / space hopper.
mhl47|1 month ago
Because in theory I would say that knowledge is something that does not have to be baked in the model but could be added using reference images if the model is capable enough to reason about them.