top | item 40384031

(no title)

Do you know why these are so short? What is the algorithm/magic in all of these?

I tried to make sense of it but cannot

discuss

Hugsun|1 year ago

Architecturally, LLMs are very simple compared to many software projects.

The crux of their behavior comes from their learned weights which are gigabytes and can cost millions to obtain via training.

DavidSJ|1 year ago

The magic is in the billions of learned weights (~synapses). This is just the scaffolding that runs them.

chpatrick|1 year ago

The magic is the structure of the model, and the real magic is the billions of weights.