top | item 28082835

(no title)

math0ne | 4 years ago

Really cool, if there are just two terms is is basically two images combined?

existential dread simulator: https://nightcafe-studio.imgix.net/jobs/GFM872ChxPF4Xeb2jTiC...

graffiti sunset: https://nightcafe-studio.imgix.net/jobs/MJfXTM5rFdNqHcR5xBRG...

discuss

It's not, but I can see why the "Graffiti sunset" would make you think that. It does seem to be two different images combined.

I won't claim to have a good knowledge of the inner workings, but basically CLIP will give a score for how well the image matches your prompt, and that's used as a loss function for the GAN.

On the Colab notebooks you can specify multiple prompts with different weights, in which case I assume it has multiple loss functions that it's trying to optimise for.

I'm more of a web app guy than a deep tech guy. Some of what I wrote can probably be corrected by someone with a better knowledge of the ML.

kaoD|4 years ago

At least in Google Collab I found longer prompts to work better.

E.g. instead of "graffiti sunset" you might get better results with "a sunset painted in the style of graffiti" or similar.