centra_minded | 3 years ago

GPT-3 and DALL-E (in my subjective opinion) feel much better than other models at their respective tasks.

DALL-E in particular kind of blows the VQGAN+CLIP messing around I've done out of the water. GPT-3 feels markedly better than other text generation models or chatbots I've tried.

Definitely these are well marketed, and not the only models around, but they also feel ahead of other things I've tried. Can you point out some of these other tools/models?

andybak|3 years ago

It's not as simple as "x is better than y". They all have their own flavours. I've had results from JAX CLIP Guided Diffusion that I can't get from anything else, and some of my early experiments with Disco Diffusion have a quality that is unique. I think people will always mix and match models due to their unique qualities.

Having said that, I'm on the beta for Stable Diffusion ( https://stability.ai/ ) and it's remarkably capable across a broad range of styles. Dall-E probably still has the edge for more complex semantic prompts and photographic coherence, but Stable Diffusion is very good and it's got a very open strategy.