top | item 37759637

(no title)

Rantenki | 2 years ago

I never mentioned reinforcement learning, and my DK statement was completely around using flawed fonts for graphic design, etc.

My partner _is_ a professional graphic designer, and we _have_ seen some pretty terrible client graphics that came out of Midjourney. They're amazing for what they are, but it's very difficult to get something out of it that competes with a professional illustrator, even ignoring the whole copyrighted content in the model issue.

discuss

BoorishBears|2 years ago

Reinforcement learning from human feeedback is the training you're referring to, you just don't realize it.

RLHF is why 2 years ago "They're amazing for what they are" would have been "They're so hideous no one in their right mind would use them", and why in 2 years that too will be some weaker form of argument.

There's no special knowledge needed to know "I like X over Y": RLHF allows a model to turn that into guidance at a scale that's never been possible before.