top | item 43519036

(no title)

nextts | 11 months ago

Yes. There are 2 aspects to this.

Roughly (from lay understand) LLMs predict what their training data would say. They are first trained on "the internet, etc." so they can predict words well, e.g. finish off "Paris is the..." then using human feedback they are trained further to work in chat mode and be non-offensive, concise, be pleasant etc.

discuss

No comments yet.